Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Feb 11.
Published in final edited form as: Physica A. 2007 Mar 15;376:725–737. doi: 10.1016/j.physa.2006.10.067

Delay estimation in a two-node acyclic network

Radhakrishnan Nagarajan
PMCID: PMC2639718  NIHMSID: NIHMS90022  PMID: 19214240

Abstract

Linear measures such as cross-correlation have been used successfully to determine time delays from the given processes. Such an analysis often precedes identifying possible causal relationships between the observed processes. The present study investigates the impact of a positively correlated driver whose correlation function decreases monotonically with lag on the delay estimation in a two-node acyclic network with one and two-delays. It is shown that cross-correlation analysis of the given processes can result in spurious identification of multiple delays between the driver and the dependent processes. Subsequently, delay estimation of increment process as opposed to the original process under certain implicit constraints is explored. Short-range and long-range correlated driver processes along with those of their coarse-grained counterparts are considered.

1. Introduction

Estimating delays from the observed processes has been an area of great interest both from theoretical and experimental standpoints. Inferring delays from temporal processes is an inverse problem and can be also be useful in inferring causal relationships between them [1]. The present study investigates a primitive two-node acyclic network comprising of a driver and a dependent process with single and two delays, Fig. 1. We consider the class of drivers (x) whose auto-correlation functions Rx (k) = E(xnxn+k) are positive and decreases monotonically as a function of lag (k).

Figure 1.

Figure 1

Two-node acyclic networks with one and two-delays are shown in (a) and (b) respectively. The driver and the dependent processes are represented by (x) and (y).

Classical delay estimation techniques using linear measures such as cross-correlation function are useful when the driver process is uncorrelated. The procedure begins, by estimating the cross-correlation functions Rxy (k) = E(xnyn+k) between the driver (x) and the dependent processes (y) as function of lag (k), Fig. 1. A non-zero cross-correlation at a given lag is chosen as the desired delay between x and y. However, drivers need not necessarily be uncorrelated. A classic example is that of a genetic network where an up-stream gene (driver) with auto-regulatory feedback regulates a down-stream gene (dependent) through multiple pathways with distinct delays. In such cases, we show that direct estimation of the delay between x and y from their observed values using measures such as cross-correlation may not be sufficient. Subsequently, we explore delay estimation from the increment processes as opposed to that of the original processes. It is shown that such an approach is highly suitable for correlated drivers under certain constraints.

2. Methods and Results

A. Statistically significant delays

Only positive cross-correlation estimates between the driver and the dependent processes are assessed for statistical significance. The cross-correlation estimate at a given lag is deemed significant if its value is considerably higher than those obtained on the random shuffled counterparts. A brief description of the procedure is enclosed below.

Step 1 Estimate the cross-correlations as a function of the lags Rxy (k), τ = 1…T between the driver and dependent processes xn and yn.

Step 2 Generate random shuffled counterparts xi and yi,i=1ns of xn and yn by resampling without replacement [2]. Estimate the cross-correlation as a function of the delays on the ns shuffled counterparts Rxiyi(τ),τ=1τmax,i=1ns.

Step 3 Cross-correlation estimate at lag k is statistically significant if Rxy(k)>Rxiyi(k),i=1ns. This lag (k) is the desired delay between the driver and the dependent processes. Thus a one-side test is sufficient. The number of surrogates was fixed at ns = 99, this corresponds to a significance level of α+ = 1/(99+1) = 0.01 [3-4] for a one-sided test.

In order to estimate statistically significant delays from the increment processes δxn = xn+1 - xn and δyn = yn+1 - yn repeat Steps 1, 2 and 3 for the increment processes.

Prior to a detailed discussion we illustrate the motivation behind the choice of delay estimation on the increment process with a simple example.

Example: Consider a two-node acyclic network with a single delay

Driver(x):x1x2x3..........xnxn+1xn+2........................................xNDependent(yn=xnτ):y1y2y3..........ynyn+1yn+2.......yn+τyn+τ+1yn+τ+2.....yNWhite noise process(e):e1e2e3..........enen+1en+2..........................................eN

Case (i) Uncorrelated Driver

Delay estimation from the given processes

Consider the uncorrelated driver (x) sampled from a white-noise process e with zero-mean and variance Re (0) (i.e. xn = en,n = 1…N) and the dependent process (yn = xn). Cross-correlation estimates as a function of lag k yields

E(xn+kyn+τ)=E(xn+kxn+τ)=E(en+ken+τ)=Re(0),fork=0=0,fork0 (1)

A positive cross-correlation estimate exists only for k = 0, which corresponds to delay τ between xn and yn.

Delay estimation from the increment processes

Consider the increment processes δxn = xn+1 - xn and δyn = yn+1 - yn.

E(δxn+kδyn+τ)=E(δxn+kδxn+τ)=E(en+ken+τ)=2Re(0),fork=0=2Re(k)Re(k+1)Re(k1),fork0 (2)

Unlike (1) cross-correlation of the increment processes persist for delays k = -1, 0, 1. However, from (2) cross-correlation estimate is positive only for k = 0 and negative for k = -1 and 1. From our definition of statistical significance (Sec. A), cross-correlation estimate only at k = 0 is statistically significant.

Since cross-correlation is a linear measure it is possible to identify the delay even for nonlinearly correlated drivers (x) with fast linear de-correlation time comparable to that of white noise process. An example of such a driver is a chaotic logistic map given by the expression xn+1 = 4.xn. (1 - xn). Therefore, in the subsequent discussion the term correlated drivers implicitly refers to drivers whose linear de-correlation time is comparatively larger than those of white noise.

Case (ii) Correlated Driver

Consider a driver process generated as linear combination of samples from a white noise process i.e. xi = ei + ei+1, i = 1…n -1. Let the dependent process be yn = xn.

Delay Estimation from the given processes
E(xnyn+τ)=Rx(0)=2Re(0)>0E(xn+1yn+τ)=Re(0)>0 (3)

For the correlated driver, positive cross-correlation (3) persists for delays other than τ. Such correlations are an outcome of the correlated nature of the driver and shall be referred to as correlation leak in the subsequent sections. Correlation leak can be statistically significant (Sec. A), and may imply spurious existence of multiple delays between the driver and the dependent processes.

Delay Estimation from the increment processes
E(δxnδyn+τ)=2[Rx(0)Rx(1)]=2Re(0)>0E(δxn+1δyn+τ)=2.Re(k)Re(k+2)Re(k2)fork0 (4)

Cross-correlation analysis of the increment series (4) reveals statistically significant positive correlation (Sec. A), only at k = 0. While cross-correlation persists for lags k = -2, 2, these are negative. Unlike (3), cross-correlation analysis of the increment process can be useful in minimizing contributions due to correlation leak.

Inspired by the above example, cross-correlation analysis of increment processes in conjunction with those of the original process in delay estimation in a two-node acyclic network is explored. As noted earlier, the driver processes is implicitly assumed to be positively correlated with monotonic decreasing auto-correlation function. In this respect, we discuss the results for short-range correlated stationary first-order Gauss-Markov driver process and long-range correlated stationary fractional auto-regressive integrated moving average driver (FARIMA) process [5]. Instances of delay estimation on the coarse-grained counterparts of their increment series are also discussed.

B. Short-range correlated driver

Short-range correlated stationary first-order Gauss-Markov process is given by the expression

xt=αxt1+et (5)

where et sampled from a normally distributed white noise with zero-mean and unit variance. Since we consider positively correlated driver process with monotonic decreasing auto-correlation function we consider only processes (5) where 0 <α <1. The corresponding auto-correlation function Rx (k) for 0 <α <1

Rx(k)=E(xnxn+k)=akR(0)>0k (6)

B1. Short-range correlated driver and single-delay

An example of two-node acyclic network with single delay is shown in Fig. 1a. Consider the cases where the dependent node yno which lags the driver node by a delay τ given by

yno=βxnτsuch thatβ>0,τ>0 (7)

In (7), β contributes to the overall variance of the process, hence can be factored out to obtain the normal form yn,

yn=ynoβ=xnτ (8)
Delay estimation from the original process

Cross-correlation function between the driver xn (7) and the dependent yn (8) processes at lag k is given by

Rxy(τk)=E(xn+kyn+τ)=E(xn+kxn)=Rx(k) (9)

Substituting for the auto-correlation function Rx ()from (6) we get

Rxy(τk)=αkRx(0)>0k (10)

Since 0 <α <1, we have Rxy(τ-k) < Rxy(τ),∀k > 0. Thus irrespective of the choice of the process parameterα, the driver and the dependent nodes are maximally correlated at lag k = τ which corresponds to the delay between xn and yn. However, there is considerable positive correlation leak across lags (τ - k), k ≠ 0. whose magnitude Rxy (τ - k), k ≠ 0 increases as a function of the process parameterα. The correlation leak is especially significant in the limitα → 1. An instance of cross-correlation estimate as a function of lags for the driver and the dependent processes (5 and 8) with parameters (α = 0.9, τ = 10) is shown in Fig. 2a. Statistically significant cross-correlation (Sec. A) is observed at a number of lags in addition to k = τ. This is not a drawback of the estimation procedure but an inherent feature due to the correlated nature of the driver. As noted earlier (3), it is possible to infer spurious existence of multiple delays (directional paths) from the driver to the dependent process.

Figure 2.

Figure 2

Cross-correlation estimates as a function of delay (k) for the original Rxy (k) (a) and increment processes Rδxδy (k) (b) in a two-node acyclic network with a single delay (τ = 10) and Gauss-Markov driver (α = 0.9, N = 4000). Statistically significant delay estimates (ns = 99, Sec A) are shown by circles.

Delay estimation from the increment process

Consider the increment processed δyn+1 = yn+1 - yn and δxn+1 = xn+1 - xn corresponding to xn (5) and yn (8). The corresponding cross-correlation function at lag k is given by

Rδxδ(τk)=E(δxn+kδyn+τ)=2Rx(k)Rx(k+1)Rx(k1) (11)

Substituting for the auto-correlation function Rx ()from (6) we get

Rδxδy(τ)=2.(1α)Rx(0)>0 (12)
Rδxδy(τk)=α(k1)(1α2)<0 (13)

From (12 and 13) we note that Rδxδy (τ - k) < Rδxδy (τ). More importantly, we note that Rδxδy (τ) > 0 whereas Rδxδy (τ - k) < 0, ∀ k ≠ 0. An instance of cross-correlation estimates as a function of lags for the increment of the driver and the dependent processes with (α = 0.9, τ = 10) is shown in Fig. 2b. The cross-correlation estimate was statistically significant (Sec. A), only at lag k = τ which corresponds to the delay between the driver and the dependent processes. These results have to be contrasted with those of Fig. 2a, where the correlation leak Rxy (τ - k) resulted in identifying multiple delays between the driver and the dependent processes.

Summary I For a two-node acyclic network with a single delay and Gauss-Markov driver with parameter0 < α < 1, delay estimation using cross-correlation on the original process can result in significant positive correlation at several delays in addition to that of τ, attributed to inherent correlation leak. These in turn may indicate spurious existence of multiple delays (directional paths) between the driver and the dependent processes. However, analysis on the increment processes resulted in positive cross-correlation only at lag corresponding to the delay between the driver and the dependent processes.

B2. Short-range correlated driver and two-delays

An example of two-node acyclic network with two delays and a correlated driver (5) is shown in Fig. 1b. The dependent process is generated as a linear combination of the driver (5) with delays τ1 and τ2 as

yno=β1xnτ1+β2xnτ2such thatβ2>β1>0,τ2>τ1>0 (14)

In order to obtain the normal form yn of yno we follow the steps below

yno=β2(β1β2xnτ1+xnτ2)

Substituting, β=β1β2 such that 0 < β < 1 in the above expression, we get

yno=β2(βxnτ1+xnτ2)Iyn=ynoβ2=βxnτ1+xnτ2such thatτ2>τ1>0,0<β<1 (15)

In (15) β2 affects the overall variance of yno, hence can be factored out. In the subsequent discussion we shall only consider the normal form yn (15).

Delay estimation from the original process

From (15) we have

yn+τ1=βxn+xnτ,whereτ=τ2τ1>0 (16)
yn+τ2=βxn+τ+xn,whereτ=τ2τ1>0 (17)

Their corresponding cross-correlation functions with xn (5) is given by

Rxy(τ1)=E(xnyn+τ1)=β.Rx(0)+Rx(τ)>0,whereτ=τ2τ1 (18)
Rxy(τ2)=E(xnyn+τ2)=β.Rx(τ)+Rx(0)>0,whereτ=τ2τ1 (19)

From (18 and 19) it can be seen that the magnitude of the cross-correlation between the driver and the dependent process is proportional to parameter β.

Remark1Rxy(τ2)>Rxy(τ1) (20)

Subtracting (18) from (19) we get

Rxy(τ2)Rxy(τ1)=(1β)[Rx(0)Rx(τ)]

Since Rx (m) > Rx(n) for m < n and 0 < β < 1, Rxy2) > Rxy1).

In the case of uncorrelated driver, the following inequality holds Rxy2) > Rxy1) > Rxy (k) = 0 for k ≠ τ12. Thus ranking the cross-correlation function in descending order is useful in inferring the delays between the driver (5) and the dependent (15) processes. However, such a ranking need not necessarily hold in the case of correlated drivers. As correlation leak around delayτ2 can be significantly higher than that of Rxy1). This in turn implies that ranking the cross-correlation can result in spurious identification of delays between the driver and the dependent processes. In the following Remark, we derive a constraint on the process parameters (α and β) in order to preserve the ranking Rxy2) > Rxy1) > Rxy (k) for k ≠ τ12.

Remark 2 Constraint on the parameters α and β such that Rxy2) > Rxy1) > Rxy (k) for k ≠ τ12.

From (16), we have

E(xn+1yn+τ2)=Rxy(τ21)=E(xn+1yn+τ2)=β.Rxy(τ1)+Rx(1) (21)

In order for the ranking to be preserved we need

Rxy(τ1)>Rxy(τ21) (22)

Since Rxy2) > Rxy2 - 1) and Rxy2) > Rxy1) from (20) Substituting from (18) and (21) in (22) we get

β.Rx(0)+Rx(τ)>β.Rx(τ1)+Rx(1)β>Rx(1)Rx(τ)(Rx(0)Rx(τ1))

Substituting for the auto-correlation function Rx ()from (6) we get

β>αατ1ατ1=αi.e.β>α (23)

Thus the constraint on the parameters (α and β) so as to preserve the ranking Rxy2) > Rxy1) > Rxy (k) is β > α.

Cross-correlation estimates Rxy (k) between the driver and the dependent processes as a function of lag (k) for (β <α) with parameters (β = 0.5,α = 0.7,τ1 = 5,τ2 = 11) is shown in Fig. 3a. As expected (23), correlation leak around (τ2 = 5) results in statistically significant cross-correlation estimates at lags (k = 10 and 11) considerably larger than those at (τ1 = 5). This in turn disrupts the ranking Rxy2) > Rxy1) > Rxy (k). However, for β > α with (β = 0.8,α = 0.7) dominant cross-correlation estimates occur at delays (τ1 = 5,τ2 = 11) preserving the ranking Rxy2) > Rxy1) > Rxy (k), Fig. 3c. While the ranking is preserved for constraint β > α, cross-correlation estimates at lags other than (k = 5,11) corresponding to delays (τ1 and τ2) are rendered statistically significant. As seen earlier (Sec. B), these can indicate spurious existence of multiple delays between the driver and the dependent processes in addition to (τ1 and τ2). It is also important to note that the constraint β > α for preserving the rank turns out to be stringent especially in the limit α → 1 (5), i.e. the family of processes from which the delays can be inferred reduces dramatically as α → 1.

Figure 3.

Figure 3

Cross-correlation estimates as a function of delay (k) of the original Rxy (k) (a, c) and increment processes Rδxδy (k) (b, d) in a two-node acyclic network with a two delays (τ1 = 5,τ2 = 11) and Gauss-Markov driver (α = 0.7, N = 4000). Cross-correlation estimates Rxy (k) for original processes violating constraint (23) i.e. (β < α, β = 0.5,α = 0.7)is shown in Fig 3a. Those of its increment Rδxδy (k) are shown in Fig. 3b. Cross-correlation estimates of original processes Rxy (k) satisfying constraint (β > α, β = 0.8,α = 0.7) is shown in Figs. 3c. Those of its increment series Rδxδy (k) are shown in Fig. 3d. Statistically significant delay estimates (ns = 99, Sec. A) are shown by circles.

Delay estimation from the increment process

Cross-correlation between the increment series δxn+1 = xn+1 - xn and δyn+1 = yn+1 - yn delays τ1 and τ2 are given by

Rδxδy(τ1)=E(δxnδyn+τ1)=2β.[Rx(0)Rx(1)]+[2Rx(τ)Rx(τ+1)Rx(τ1)] (24)
Rδxδy(τ2)=E(δxnδyn+τ2)=2.[Rx(0)Rx(1)]+β[2Rx(τ)Rx(τ+1)Rx(τ1)] (25)

Substituting for the auto-correlation function Rx ()from (6) we get

Rδxδy(τ1)=[2β.(1α)ατ1(1α)2]Rx(0) (26)
Rδxδy(τ2)=[2.(1α)βατ1(1α)2]Rx(0) (27)

It is important to note that the expressions (26) and (27) need not necessarily be positively correlated for every choice of the parameters (α and β). As noted earlier (Sec. B), we are interested in identifying only delays whose cross-correlation functions are positive. Therefore, prior to checking rank preservationR2) > R1) > R(k), ∀k ≠ τ12 we impose the constraint for positive cross-correlations at delays τ1 and τ2.

Remark 3 Constraint on parameters (α and β) such that Rδxδy1) and Rδxδy (τ 2) are positively correlated.

Substituting for Rδxδy1) from (26) and imposing the constraint for positive correlation i.e. Rδxδy1) > 0 we get

β>(12)ατ1(1α) (28)

Subtracting (26) from (27) we get

Rδxδy(τ2)Rδxδy(τ1)=(1β).(1α)[2.+ατ1(1α)]Rx(0) (29)

From (5) and (15) we know 0 < α < 1 and 0 < β < 1, therefore

Rδxδy(τ2)Rδxδy(τ1)>0

From (28) and (29) we obtain

Rδxδy(τ2)>Rδxδy(τ1)>0forβ>(12)ατ1(1α) (30)

While the constraint on the original processes (23) is a function of the parameter (α), the constraint on the increment processes (30) is a function of the parameter (α) as well as the differential delay(τ = τ2 - τ1). It is important to note that the constraint on the increment process (30) is not as stringent as that on the original process (23) in general. For instance, cross-correlation analysis of the increment process, Figs. 3b and 3d, preserves the ranking Rxy2) > Rxy1) > Rxy (k) for both the instances (β > α) and (β < α) discussed earlier, Figs. 3a and 3c. However, for the special case where the differential delay (τ = τ2 - τ1 = 1), the constraint on β for the increment process (30), (β>1α2) can be considerably larger than those on the original process (23) (β > α) especially for (α < 1/3). Thus delay estimation on the original process as opposed to that of the increment process is preferred for (α < 1/3) and (τ = τ2 - τ1 = 1). An instance with parameters (α = 0.1, β = 0.3,τ1 = 10,τ2 = 11) is shown in Fig. 4a. For these choices of parameters constraint (23) is satisfied whereas constraint (30) is not. Therefore, the delays can be successfully estimated from the original processes, Fig. 4a and not from the increment processes, Fig. 4b. However, cross-correlation estimates of the original processes reveals delays (τ1 = 9 and τ2 = 12) as being statistically significant in addition to (τ1 = 10 and τ2 = 11), Fig. 4a. For τ > 1, constraint (30) is considerably less stringent than constraint (23) irrespective of the choice of α, encouraging estimation of the delay from increment process as opposed to the original process. An instance (β = 0.05,α = 0.7,τ1 = 10,τ1 = 12) where neither of the constraints (23 and 30) is satisfied is shown in Figs. 4c and 4d respectively. In such cases, it is not possible to estimate the delays using the techniques described in the present study.

Figure 4.

Figure 4

Cross-correlation estimates as a function of delay (k) of the original Rxy (k) (a) and increment processes Rδxδy (k) (b) in a two-node acyclic network with a two delays (τ1 = 10,τ2 = 11) and Gauss-Markov driver (α = 0.1, N = 4000). Cross-correlation estimates that satisfy constraints (23) and (30) i.e. (β = 0.3,α = 0.1,τ = 1)for the original and increment processes is shown in Figs. 4a and 4b respectively. Cross-correlation estimates for the original and increment processes with parameters (β = 0.05,α = 0.7,τ1 = 10,τ1 = 12) is shown in Figs. 4c and 4d respectively. Statistically significant delay estimates (ns = 99, Sec. A) are show by circles.

Finally, we show in the following remark that the ranking of the cross-correlation R2) > R1) > R(k)∀k ≠ τ12 between the driver and the dependent processes is implicitly preserved in the increment series unlike those of the original series (23). Cros-correlation estimates satisfy Rδxδy2) > Rδxδy1) > 0 under constraint (30). The only possibility that can disrupt the ranking Rδxδy2) > Rδxδy1) > Rδxδy (k), k ≠ τ1, τ2 is correlation leak around τ1 and τ2. In the following remark we show that correlation leak around τ1 and τ 2 are strictly negative. Therefore, the only positive cross-correlations estimates on the increment processes occur at delays τ1 and τ2. i.e. Rδxδy2) > Rδxδy1) > 0 whereas Rδxδy(k) < 0 for k ≠ τ12.

Remark 4 Exn+kδyn1) < 0 for any k > 0, τ > 0

E(δxn+kδyn+τ1)=β.[2.Rx(k)Rx(k+1)Rx(k1)]+[2Rx(k+τ)Rx(k+τ+1)Rx(k+τ1)]

Substituting for the auto-correlation function Rx ()from (5) we get

E(δxn+kδyn+τ1)=(1α)2(βαk1+αk+τ1)Rx(0)<0 (31)

Remark 5 Exn+kδyn+t2) < 0 for k > 0, τ > 0 Substituting for the auto-correlation function Rx ()from (5) we get For k > 0, 0 < t < k

E(δxn+kδyn+t2)=(1α)2αk1(ατβ+1)Rx(0)<0 (32)

For k > 0, t > k

E(δxn+kδyn+t2)=(1α)2(ατk1β+αk1)<0 (33)

Summary II For a two-node network with two delays and Gauss-Markov driver (0 < α < 1), delay estimation on the increment processes results in significant positive cross-correlation only at the respective delays τ1 and τ2 under constraint (30). This should be contrasted against delay estimation on the original processes where significant positive cross-correlations is observed at several delays in addition to that of τ1 andτ2. Thus it is possible to identify multiple delays in addition to τ1 and τ2 on cross-correlation analysis of the original processes. Constraint (23) imposed on the original processes for preserving the rank R2) > R1) > R(k), ∀k ≠ τ12 is in general more stringent than the constraint (30) on the increment processes.

C. Long-range correlated driver with single and two-delays

Gauss-Markov driver process (5) considered in the above discussion is a short-range correlated driver whose correlation function decays exponentially as a function of lag (6). Non-markovian or long-range correlations have been observed in a wide-range of experimental systems [5, 6] and accompanied by auto-correlation functions that decay as a power-law [5, 7] with lag. Identifying delays from the original and increment processes for a two-node acyclic network with a long-range correlated driver is briefly discussed below.

Power-law correlated driver

Auto-correlation function of classical long-range correlated noise exhibit power-law decay at large time scales (k) and follows the generic form [5, 7].

Rx(k)=kγ,where the Hurst exponentγline in the interval(0.5,1) (34)

The auto-correlation function (34) is positive and decays monotonically as function of the lag k.

C1. Long-range correlated driver and one delay

Consider the driver process (34) and the dependent process (Sec. B1)

yn=xnτ (35)
Delay estimation from the original process

Following procedure similar to (Sec. B1) we get

E(xn+kyn+τ)=Rx(k)>0k (36)

Also from (34)

Rx(k)<Rx(0) (37)

As in the case of Gauss-Markov process (9, 10, Sec. B1) positive cross-correlations persist for lags other than delay τ.

Delay estimation from the increment process

Following procedure similar to (Sec. B2) we get

E(δxnδyn+τ)=2[Rx(0)Rx(1)]>0 (38)
E(δxn+kδyn+τ)=2Rx(k)Rx(k+1)Rx(k1) (39)

Substituting for Rx (k) from (34) into (39) we get

E(δxn+kδyn+τ)=kγ[2(1+1k)γ(1+1k)γ] (40)

Binomial expansion of (40) under the assumptions in (34), i.e. k >> γ and 0 < β < 1 we get

E(δxn+kδyn+τ)=kγ[γ(γ+1)k2+.....]<0 (41)

The above expression (41) is negative for ∀k ≠ 0. As in the case of Gauss-Markov driver (12, 13, Sec. B1) Exnδyn) > 0 whereas Exn+kδyn) < 0 for ∀ k ≠ 0.

C2. Long-range correlated driver and two delays

Consider the case of two delays, where the driver process xn satisfies (34) and the dependent process (Sec. B2) satisfies

yn=β.xnτ1+xnτ2;0<β<1,τ2>τ1>0 (42)
Delay estimation from the original process

As in the case of the Gauss-Markov process (20) we obtain

Rxy(τ2)>Rxy(τ1)

Constraint on the parameter (β) (23) in order to preserve the ranking R2) > R1) > R (k), ∀k ≠ τ1, τ2 is

β>Rx(1)Rx(τ)(Rx(0)Rx(τ1)) (43)
Delay estimation from the increment process

Following procedure similar to (Sec. B2) and from the binomial expansion (41) it is possible to obtain a constraint for Rδxδy2) > Rδxδy1) > 0. Following procedure similar to Remarks 4 and 5 and using the binomial expansion (41) it can be shows that Exn+kδyn+τ1) < 0 and Exn+kδyn+t2) < 0.

Summary III As in the case of Gauss-Markov process (Summary I and Ii), delay estimation on the increment process of long-range correlated driver can significantly minimize the impact of spurious identification of delays between the driver and the dependent process.

An instance of delay estimation from two-node acyclic network with long-range correlated driver and with one and two-delays is shown in Figs. 5 and 6. Long-range correlated driver process was generated from stationary fractional auto-regressive integrated moving average process FARIMA (0, d, 0) with Gaussian innovations and parameter d = 0.3 [5]. This corresponds to Hurst exponent γ = d + 0.5 (34). Cross-correlation analysis Rxy (k) between FARIMA (0, d, 0) driver and the dependent process (yn = xn, τ = 10, N = 4000) along with those of their increment processes Rδxδy (k) is shown in Figs. 5c and 5d respectively. As seen earlier, delay estimation of the increment process minimizes spurious statistically significant delays. Cross-correlation estimates for long-range correlated driver and dependent process yn = β.xn1 + xn2, with parameters (β = 0.5, τ1 = 5, τ2 = 11, N = 4000) is shown in Figs. 6c and 6d respectively. The ranking Rδxδy2) > Rδxδy1) > Rδxδy (k), k ≠ τ1, τ2 is preserved on cross-correlation analysis of the increment process, Fig. 6d. This has to be contrasted to analysis of the original process where the ranking is not preserved, Fig. 6c. Analysis of the original process also reveals statistically significant cross-correlation estimates at several lags in addition to (τ1 = 5 and τ2 = 11).

Figure 5.

Figure 5

Cross-correlation estimates as a function of delay (k) for the original Rxy (k) (a) and increment processes Rδxδy (k) (b) in a two-node acyclic network with a single delay (τ = 10), with Gauss-Markov (α = 0.9, N = 4000) (a, b) and FARIMA (0, d, 0) driver (d = 0.3, N = 4000) (c, d). Cross-correlation estimates of the corresponding coarse-grained realizations of the original Rxcyc (k) and increment series Rδxcδyc (k) of the Gauss-Markov (e, f) and FARIMA (0, d, 0) (g, h) driver is shown right below them. Statistically significant delay estimates (ns = 99, Sec. A) are show by circles.

Figure 6.

Figure 6

Cross-correlation estimates as a function of delay (k) for the original Rxy (k) (a) and increment processes Rδxδy (k) (b) in a two-node acyclic network with two delays (τ1 = 5,τ2 = 11, β = 0.5) with Gauss-Markov (α = 0.9, N = 4000) (a, b) and FARIMA (0, d, 0) driver (d = 0.3, N = 4000) (c, d). Cross-correlation estimates of the corresponding coarse-grained realizations of the original Rxcyc (k) and Rδxcδyc (k) increment series corresponding to the Gauss-Markov (e, f) and FARIMA (0, d, 0) (g, h) driver is shown right below them. Statistically significant delay estimates (ns = 99, Sec. A) are show by circles.

D. Delay estimation from coarse-grained realizations

Coarse-grained realizations are simplified representations of the actual processes. An example is that of a one-dimensional ising spin model where each element is either an up (+1) or a down (-1) spin. In the present study, we generate coarse-grained realizations of the given process about their mean, E(x), given by

xci=+1ifxi>E(x)=1otherwise (44)

For stationary zero-mean normally distributed processes, an analytical expression can be derived relating the correlation of the original process Rx (k) to that of its coarse-grained counterpart Rxc (k) [8, 9], given by

Rxc(k)=2πarcsinRx(k)Rx(0) (45)

It is important to note that in Sec. B and C, the short-range (5) and the long-range (34) correlated driver were generated as linear combinations of normally distributed variables, hence normally distributed. This in turn implies that coarse-grained realizations about the mean of the driver processes (5 and 34) follow relation (45). Since the short-range and long-range driver processes Rx (k) considered have monotonic decreasing autocorrelation function, those of their coarse-grained counterpart Rxc (k) (45) are also monotonic decreasing. Dependent processes yn = xn and yn = β.xn1 + xn2 of the normally distributed driver xn are linear combinations of normal processes, hence implicitly normal. Thus coarse-grained representation of the driver process and the dependent processes about their means follows relation (45). It should also be noted that the corresponding increment series by definition is the difference of normally distributed processes, hence normal.

In the following discussion, coarse-grained realizations of the original and the increment driver xn and dependent yn processes shall be represented by xnc and ync respectively. The coarse-grained realizations of the increment processes (δxn and δyn) are represented by (δxnc and δync). The cross-correlation estimates on the coarse-grained original and increment processes are represented by Rxcyc (k) and Rδxcδyc (k).

We show instances where Rδxcδyc (k) is useful identifying delays whereas unlike Rxcyc (k). This is demonstrated on the two-node acyclic networks with short-range and long-range correlated drivers with one and two-delays. Cross-correlation estimates Rxcyc (k) for the coarse-grained realizations of the Gauss-Markov driver (5) with (α = 0.9, N = 4000) and the dependent process (yn = xn,τ = 10), along with those of the ir increment series Rδxcδyc (k) is shown in Figs. 5e and 5f respectively. Cross-correlation estimates Rxy (k) and Rδxδy (k) obtained on xn and yn is shown in Figs. 5a and 5b for qualitative comparison. It is important to note that the estimation on the increment series results is minimizing the effect of correlation leak as observed earlier (Summary I). Similar results were obtained in the case of FARIMA (0, d, 0) driver with (d = 0.3, N = 4000), Figs. 5g and 5h. These results conform to earlier observations (Summary I and III), where analysis of the increment processes minimize statistically significant false-positive correlation.

Cross-correlation estimates Rxcyc (k) for the coarse-grained realizations of the Gauss-Markov driver (5) with (α = 0.9, N = 4000) with dependent process (yn = β.xn1 + xn2, τ1 = 5, τ2 = 11, N = 4000) is shown in Fig. 6e. Cross-correlation analysis of coarse-grained counterparts of the corresponding increment series Rδxcδyc (k) is shown in Fig. 6f. A similar analysis of the FARIMA (0, d, 0) driver with (d = 0.3, N = 4000) is shown in Figs. 6g and 6h. These results conform to earlier observations (Summary II and III), where analysis of the increment processes minimizes statistically significant correlation leak and preserves the rank ordering R2) > R1) > R(k), k ≠ τ12. Therefore, analysis of the increment process can minimize statistically significant false-positive correlations even in the case of coarse-grained counterparts.

3. Discussion

The present study, investigated statistical estimation of delays between the driver and the dependent processes of a two-node acyclic network with one and two delays using linear measures such as cross-correlation function. While delay estimation is straightforward in the case of uncorrelated drivers, correlated drivers can result in significant correlation leak around the actual delay between the driver and dependent process. Such correlation leak can result in spurious identification of statistically significant delays and existence of multiple paths between the driver and dependent process. Cross-correlation analysis of the increment processes was shown to significantly minimize the effect of correlation leak under certain constraints. In the presence of two-delays between the driver and the dependent node, cross-correlation analysis preserved the ranking of the auto-correlation function in addition to identifying the delays. This was demonstrated on short-range correlated Gauss-Markov process whose auto-correlation function decays exponentially and long-range correlated FARIMA (0, d, 0) driver with power-law decaying autocorrelation function. Correlation properties of stationary normal processes are analytically related to correlation of their corresponding coarse-grained counterpart generated about their mean. An instance was shown where cross-correlation estimates on the coarse-grained realizations of the increment series significantly minimized the effect of correlation leak. Thus from the above results cross-correlation analysis of the increment processes can provide insight into the nature of delays not evident from the analysis of the original processes.

4. Acknowledgement

The present study is supported by funds from National Library of Medicine (1R03LM008853-1) and junior faculty grant from American Federation for Aging Research (AFAR).

Reference

  • 1.Govindan RB, Raethjen J, Arning K, Kopper F, Deuschl G. Time Delay and Partial Coherence Analyses to Identify Cortical Connectivities. Biological Cybernetics. 2006;94(4):262–275. doi: 10.1007/s00422-005-0045-5. [DOI] [PubMed] [Google Scholar]
  • 2.Theiler J, Eubank S, Longtin A, Galdrikian B, Farmer JD. Testing for nonlinearity in time series: the method of surrogate data. Physica D. 1992;58:77. [Google Scholar]
  • 3.Rapp PE, Albano AM, Zimmerman ID, Jimdnez-Montano MA. Phase randomized surrogates can produce spurious identifications of non-random structure. Phys. Lett. A. 1994;192:27–33. [Google Scholar]
  • 4.Schreiber T, Schmitz A. Improved surrogate data for nonlinearity tests. Phys. Rev. Lett. 1996;77:635. doi: 10.1103/PhysRevLett.77.635. [DOI] [PubMed] [Google Scholar]
  • 5.Beran J. Statistics for Long-Memory Processes. Chapman & Hall; New York: 1994. [Google Scholar]
  • 6.Willinger W, Paxson V, Riedi RH, Taqqu MS. Long-range dependence and data network traffic. In: Doukhan, Oppenheim, Taqqu, editors. Long range Dependence : Theory and Applications. 2001. [Google Scholar]
  • 7.Peng C-K, Buldyrev SV, Havlin S, Simons M, Stanley HE, Goldberger AL. Mosaic organization of DNA nucleotides. Phys. Rev. E. 1994;49:1685–1689. doi: 10.1103/physreve.49.1685. [DOI] [PubMed] [Google Scholar]
  • 8.Papoulis A, Pillai SU. Probability, Random Variables and Stochastic Processes. fourth ed. McGraw-Hill; New York: 2002. [Google Scholar]
  • 9.Lawson JL, Uhlenbeck GE. Threshold Signals. McGraw-Hill; NY: 1950. [Google Scholar]

RESOURCES