Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Nov 2.
Published in final edited form as: J Phys A Math Theor. 2018 Oct 3;51(44):445601. doi: 10.1088/1751-8121/aae0af

Fluid heterogeneity detection based on the asymptotic distribution of the time-averaged mean squared displacement in single particle tracking experiments

Kui Zhang 1, Katelyn P R Crizer 3, Mark H Schoenfisch 3, David B Hill 2, Gustavo Didier 1
PMCID: PMC6486181  NIHMSID: NIHMS1019339  PMID: 31037119

Abstract

A tracer particle is called anomalously diffusive if its mean squared displacement grows approximately as σ2tα as a function of time t for some constant σ2, where the diffusion exponent satisfies α ≠ 1. In this article, we use recent results on the asymptotic distribution of the time-averaged mean squared displacement [20] to construct statistical tests for detecting physical heterogeneity in viscoelastic fluid samples starting from one or multiple observed anomalously diffusive paths. The methods are asymptotically valid for the range 0 < α < 3/2 and involve a mathematical characterization of time-averaged mean squared displacement bias and the effect of correlated disturbance errors. The assumptions on particle motion cover a broad family of fractional Gaussian processes, including fractional Brownian motion and many fractional instances of the generalized Langevin equation framework. We apply the proposed methods in experimental data from treated P. aeruginosa biofilms generated by the collaboration of the Hill and Schoenfisch Labs at UNC-Chapel Hill.

Keywords: mean squared displacement, asymptotic distribution, anomalous diffusion, fluid heterogeneity

1. Introduction

In this paper, we start from the asymptotic distribution of the time-averaged mean squared displacement of nanometric tracer particles [20] to construct statistical protocols for detecting physical fluid heterogeneity. The assumptions on particle motion cover a broad family of fractional Gaussian processes, including fractional Brownian motion and many instances of the generalized Langevin equation framework. The testing protocols allowed providing more accurate quantitative analysis of experimental data from the Hill and Schoenfisch Labs (UNC-Chapel Hill), and the results reported in [73] were generally confirmed.

Improvements in light microscopy, fluorescence techniques, nanoparticle synthesis and high-speed video have ushered in a flurry of experimental activity [81]. Single particle tracking has become a common tool in many scientific areas, such as colloid physics [34], the microrheology of complex fluids [53, 83, 54, 45, 35] and the study of nanobiophysical systems, both in vivo and in vitro [2, 12]. This includes the diffusion of single molecules, e.g., proteins, on biopolymers such as DNA or microtubules, on surfaces or in lipid membranes, inside in vivo cells and in actin solutions [84, 26, 32, 89, 33, 61, 82, 95, 46, 67, 80, 4, 30], as well as two-dimensional biological membranes [58] and heterogeneuous tracer diffusion and first passage characteristics in two-dimensional crowded environments [23].

Of primary concern in the analysis of particle path data is the ensemble mean squared displacement (MSD), where X is the tracer particle’s position. A basic dynamic characterization of the latter is given by the relation

X2(t)=EX2(t)σ2tα,σ2,α>0,t0,ξ:=(logσ2,α). (1)

In (1), α is the diffusion exponent and σ2 = 2D, where D is the diffusivity constant. The parameter value α = 1 corresponds to classical diffusion. If α ≠ 1, the stochastic process X is said to be anomalously diffusive, more specifically sub- or superdiffusive depending on whether α < 1 or > 1, respectively. Anomalous diffusion may emerge, for example, as a consequence of binding-unbinding events, of geometrical constraints on the particle’s movement, or of fluid viscoelasticity [57, 77, 76, 49].

The dominant statistical technique in the biophysical literature for estimating the parameters σ2 and α is based on the so-named time-averaged mean squared displacement (TAMSD). Suppose that a single particle experiment generates a tracer bead sample path with observations X(j), j = 1, …, N. The pathwise statistic

MN(τ):=1Nτj=1Nτ{X(j+τ)X(j)}2 (2)

is the TAMSD at lag value τ, i.e., the statistical counterpart of the MSD 〈X2(τ)〉. One generates an estimator of ξ = (log σ2, α) by means of the linear regression

logMN(τk)=logσ2+αlogτk+εk,k=1,,m, (3)

possibly over several independent particle paths, where m is the number of lag values used and {εk}k=1,…,m is a random vector with an unspecified distribution and correlated entries (see (7) and (22)). Plots of TAMSD curves as a function of the lag value τ, often on a log-log scale, are widely reported as part of anomalous diffusion data analysis (e.g., [90, 50]). The choice of lag values τ1, …, τm reflects the analyst’s visual perception of the range where the slope of the TAMSD curves stabilize and thus indicate the true diffusive regime and power law.

The potential heterogeneity of fluid samples in fields such as microrheology implies that estimating ξ from single trajectories is of great interest [9, 11, 93, 51]. However, the experimental and statistical difficulties involved in estimating ξ based on the regression system (3) have been pointed out by many authors. A non-exhaustive list of issues includes limited fluorophore life-times, proteins diffusing out of the field of view, finite-resolution imaging and motion blurring due to camera integration times, measurement errors, the presence of drifts and intra-path correlation [71, 3, 60, 40, 93, 10, 56, 8, 36]. Such difficulties call for a deeper understanding of the stochastic behavior of the TAMSD and, accordingly, a wealth of literature on the subject has developed. Starting from an underlying fractional stochastic process, several properties of the TAMSD such as ergodicity were established [17, 59, 39, 13, 75, 6, 38]. In particular, finite sample exact characterizations and mathematically convenient approximations to the distribution of the TAMSD under Gaussianity are provided in [71, 27, 28, 6, 1, 65, 7, 29, 79, 22]. In [79], assuming an observed fractional Brownian motion (see Example 2.2), it is shown that the standard TAMSD-based estimator is consistent, with vanishing bias and variance.

We say that a cumulative distribution function (c.d.f.) F gives the asymptotic distribution of a sequence of random variables {WN}N∈ℕ if the c.d.f. FN(x) of WN converges to F(x) at every x ∈ ℝ where F is continuous. Results on convergence in distribution such as the classical central limit theorem (CLT; see Example 2.1) have a number of interesting statistical consequences. Typically, statements are robust, i.e., they hold for a multitude of models. Moreover, they naturally lead to useful data analysis protocols such as confidence intervals and hypothesis tests with error margins that are quantifiable and whose accuracy provably increases at an explicit rate (e.g., N for the CLT). In the probability literature, the study of the asymptotic distribution of sums of functions of Gaussian random variables has been carried out over many decades now (see [74, 85, 86, 21, 52, 24, 31, 25, 69] for just a few references). In the context of anomalous diffusion modeling, in turn, the related asymptotic distribution of the TAMSD was established in [20] for a broad class of Gaussian fractional stochastic processes. It was shown that the convergence in distribution of the TAMSD occurs at different rates, and that the limiting distribution may be Gaussian or non-Gaussian, all depending on the value of the diffusion exponent α. This made it possible, for example, to construct asymptotically valid confidence intervals for the anomalous diffusion parameters starting from a single observed particle path.

In this paper, we propose particle path-based statistical protocols for detecting fluid heterogeneity that builds upon the TAMSD’s asymptotic distribution. The protocols test fluid heterogeneity in two different experimental situations, namely,

  1. assuming local physical homogeneity, whether different regions of the fluid are heterogeneous;

  2. assuming global physical homogeneity of each fluid sample, whether two samples from each fluid are heterogeneous.

Hereinafter, these two senses are referred to as intra- and interfluid heterogeneity, respectively. The testing methodology is based on an improved single-path TAMSD-based estimation technique. To construct the latter, we tackle two of the main issues involved in TAMSD-based estimation, namely: (a) the presence of bias in log-TAMSD-based methods; and (b) the effect of correlated disturbances {εk}k=1,…,m in (3). Starting from a concentration inequality [5], we address these issues by providing mathematical characterizations of the bias and finite sample estimation variance which are by themselves of interest, as well as by introducing procedures for bias-correction and nearly optimal estimation under intra-path correlation. Motivated by applications in viscoelastic diffusion, the single-path estimation and heterogeneity testing protocols are mathematically established for 0 < α < 3/2, which covers all the subdiffusive range and part of the superdiffusive regime, and are asymptotically valid. For the sake of completeness, we also discuss and provide computational studies on the strong superdiffusivity range 3/2 ≤ α < 2 (see Remark 3.2 on the difficulties involved in dealing with the possibly non-Gaussian asymptotic distribution of the TAMSD). To guide experimental practice under common technical constraints such as limited camera recording time, we also apply the proposed tools in investigating the difference between observing longer particle paths and using a larger number of particle paths of given length. To illustrate the use of the protocols in physical practice, we make inferences on fluid viscoelasticity with data from the Hill and Schoenfisch Labs (UNC-Chapel Hill) on biofilm eradication, as first reported and described in [73].

The paper is organized as follows. In Section 2, we summarize the key mathematical results on the asymptotic distribution of the TAMSD. In Section 3, assuming a single observed path of realistic length, we characterize the bias and the variance in TAMSD-based estimation to construct the improved single-path estimator and compare it with the standard TAMSD-based estimator in terms of statistical performance. In Section 4, assuming multiple observed paths, we use the estimator developed in Section 3 to construct statistical testing protocols for intra- and interfluid heterogeneity detection. In Section 5, we model and test fluid heterogeneity through experimental data. For the reader’s convenience, Section A of the Appendix contains mathematically accurate statements of the results in Section 2 and [20]. Sections B, C and D contain all new mathematical results and their proofs. Newly designed Matlab routines containing the estimation and testing protocols will be made available on the authors’ websites at the time of publication.

2. Background

Before we revisit the results in [20] on the asymptotic behavior of the TAMSD, for the sake of exposition we consider some classical results from probability theory.

Example 2.1

Consider independent and identically distributed random variables X1, …, XN, each with mean 〈X1〉 = μ and finite variance VarX1:=X12X12=φ2>0. If X¯N=N1i=1NXi denotes the sample mean, then the celebrated central limit theorem states that, for large N, the distribution of the standardized sample mean approaches that of a standard normal, i.e.,

N(X¯Nμ)φdN(0,1),N. (4)

Apart from naturally leading to confidence intervals and hypothesis tests, the convergence (4) also implies that X¯N is a consistent estimator of μ, namely, it converges in probability to μ. This is so because

X¯Nμ=(φN)N(X¯Nμ)φP0,N. (5)

The zero limit in probability in (5) stems from the fact that the vanishing term φ/N0 multiplies a standardized sample mean that converges in distribution (4) (see [78]).

Apart from distinct assumptions on the observations, the claims in [20] on the asymptotic behavior of the TAMSD are reminiscent of the classical statements (4) and (5), with two differences: (i) the rate of convergence is not typically N in biophysical modeling; (ii) the asymptotic distribution of the TAMSD is not necessarily Gaussian.

So, consider the random vector

(MN(τ1),,MN(τm)), (6)

namely, a vector of TAMSD terms (2) at m different lag values, obtained from one path of a Gaussian, stationary increment process. Fitting (3) and (6) by means of ordinary least squares (OLS) regression is the most intuitive way of constructing an estimator of the diffusion parameter vector ξ = (log σ2, α). This corresponds to the common practice in the biophysical literature, both in experimental and methodological work (e.g., [90, 50, 11, 51] among many references). Throughout this paper,

Estand=(Lstand,Astand) (7)

denotes this standard estimator (see (22) for a precise expression). In this framework, we need to make the lag sizes τ1, …, τm themselves go to infinity, though no faster than the sample size N. This mathematically expresses the practical analysis of anomalous diffusion data: the lag size has to be

  • (L1) large enough for the TAMSD regime to become log-linear;

  • (L2) but, at the same time, not too large because of the increased variance of the TAMSD.

For a generic lag value τ, we can model this idea by writing

τN. (8)

The limit and inequality in (8) express (L1) and (L2), respectively (the accurate mathematical statements are given by condition (53); see also Figure 1).

Figure 1: Bias and inconsistency over small scales τ vs vanishing bias and consistency as τ → ∞.

Figure 1:

In general, taking the double limit τ,N → ∞ (see (8)) is necessary. Over fixed (“small”) lag values τ, TAMSD-based estimation is biased and, for most anomalous diffusion models other than fBm, inconsistent. As mathematically characterized by expansion (17), estimation bias is fundamentally a consequence of the fact that 〈log ·〉 ≠ log〈·〉 and of the presence of the small scale factor O(τδ), whereas, in turn, inconsistency generally appears as a consequence of this same factor. The left and right plots show, respectively, 20 independent ifOU paths (length 211, α = 0.6) and 20 particle paths (length 1800) from P. aeruginosa biofilm after COS2-NO treatment at concentration level 8 mg ml−1. The first and second red lines in each plot indicate, respectively, the fitted slope over small (τ = 1, 2) and large (τ = 8, 32) lag values. Based on the former lag values, A = 1.42 and 1.10 (evidence of superdiffusivity) for simulated and experimental data, respectively, whereas, by contrast, A = 0.70 and 0.69 (evidence of subdiffusivity) based on the latter. This illustrates the fact that bias and inconsistency vanish when τ (and N) becomes large.

The asymptotic distribution of the TAMSD random vector (6) after centering is briefly described in Table 1. This leads to the asymptotic behavior of the standard estimator (7), which is summarized in Table 2 in terms of convergence rate, asymptotic distribution and consistency. In both cases, the value of α determines the convergence rate and the nature of the asymptotic distribution. In particular, over almost the whole strong superdiffusivity range (i.e., over 3/2 < α < 2), the asymptotic distribution is non-Gaussian (Rosenblatt-type; see Theorem A.1 and [74, 85, 88, 91]). For any instance, by an argument analogous to (5), the standard estimator is consistent, i.e.,

EstandPξ. (9)

Table 1:

Asymptotic behavior of the TAMSD random vector (6) (see Theorem A.1).

parameter range rate of convergence asymptotic distribution
0 < α < 3/2 Nτ1τα Gaussian
α = 3/2 NlogN1τ2 Gaussian
3/2 < α < 2 N2ατ2 2 non-Gaussian

Table 2:

Asymptotic behavior of the standard TAMSD-based estimator (7) (see Corollary A.1).

parameter range rate of convergence joint asymptotic distribution consistency
Lstand Astand
0 < α < 3/2 Nτ1τα1logτ Nτ1τα Gaussian Yes
α = 3/2 NlogN1τ21logτ NlogN1τ2 Gaussian Yes
3/2 < α < 2 N2ατ21logτ N2ατ2 2 non-Gaussian Yes

The family of stochastic processes for which the limits in distribution in Tables 1 and 2 hold is broad and contains a number of popular models. Three examples are fractional Brownian motion (fBm), fractional instances of the generalized Langevin equation (GLE) and the (integrated) fractional Ornstein-Uhlenbeck process (ifOU).

Example 2.2

Together with the continuous time random walk, fBm is one of the most popular models of anomalous diffusion [87, 2]. For some value of the so-named Hurst parameter H ∈ (0, 1) and a variance parameter D > 0, a fBm BH(t) is the only Gaussian, stationary increment process with covariance function

BH(s)BH(t)=D{|t|2H+|s|2H|ts|2H},s,t. (10)

The particular parameter value H = 1/2 corresponds to the ordinary Brownian motion (Wiener process). In view of (10), which implies exact self-similarity, for fBm the MSD scaling relation (1) holds as an equality, i.e.,

BH2(t)=σ2tα,t, (11)

where

σ2=2D,α=2H. (12)

Example 2.3

The GLE has been used as a universal model of anomalous diffusion in the biophysical field of microrheology [53, 96, 68, 66]. A subclass of interest of the GLE framework is the fractional GLE family [44, 43, 18], which is obtained almost surely as the solution of the stochastic differential equation

mdV(t)=λtΓ(ts)V(s)dsdt+dBH(t),1/2<H<1. (13)

In (13), m, λ > 0 and the memory kernel has the form Γ(t) = 2H(2H −1)|t|2H−2, t ≠ 0, which is a consequence of invoking the fluctuation-dissipation relation [19, 51]. The integrated fractional generalized Langevin process (ifGL) is given by X(t)=0tV(s)ds, t > 0, where {V (t)}t≥0 is a solution of the fractional GLE. For the ifGL, relation (1) holds with α = 2(1 − H) (subdiffusive) as t → ∞.

Example 2.4

The ifOU is given by X(t)=0tV(s)ds, t > 0, where the so-named fractional Ornstein-Uhlenbeck process {V (t)}t≥0 is the almost surely continuous solution to the fBm-driven Langevin equation

dV(t)=λV(t)dt+φdBH(t),t0,λ>0,0<H<1 (14)

(see [14, 70]). The ifOU process is a mathematically convenient model of anomalous diffusion. In the subdiffusive range, it displays a similar correlation structure to that of the ifGL process. For the ifOU, relation (1) holds with (12) as t → ∞.

Remark 2.1

The results in [20] do not cover some important anomalous diffusion models such as continuous time random walks. For the latter family of models, limit theorems typically involve distinct nonstandard asymptotic distributions depending on the assumptions (see, for instance, [55, 41] and references therein; for general guidelines on the use of the TAMSD, see [42]).

3. Improved TAMSD-based estimation

The standard estimator Estand = (Lstand, Astand) in (7) has at least two significant shortcomings: finite sample bias and suboptimal performance in the presence of correlation among the regression disturbance terms {εk}k=1,…,N. We propose a single-path improved estimation protocol that addresses these issues. Accordingly, it involves two components, which we describe next. These two components involve asymptotically valid mathematical expressions for finite-sample bias and variance. Hereinafter, different lag values are expressed as

τk=wkτ,w1<<wk, (15)

for fixed constants w., where τ = τ(N) grows as function of N.

(a) Bias correction

In TAMSD-based scaling analysis, there at least two sources of bias. First, bias appears if the particle movement is not exactly self-similar (not a fBm), i.e.,

X2(t)σ2tαoverarangeoft.

In fact, the deviation of the MSD from exact self-similarity or power scaling is generally controlled by the relation

|X2(t)σ2tα1|Ctδforlarget, (16)

for some constant σ2 > 0, where the deviation parameter δ > 0 mostly depends on the high frequency behavior of the particle motion (see Proposition A.1). Second, even under self-similarity, bias stems from the elementary fact that the logarithm of the ensemble average and the ensemble average of the logarithm are distinct (e.g., [92, 63, 62, 64]). In the context of (3), this means that

logMN(τ)logMN(τ)=αlogτ+logσ2,τ.

So, by reinterpreting log MN(τ) itself as an estimator of α log τ + log σ2, we can express the bias involved in TAMSD-based estimation as

logMN(τ)(αlogτ+logσ2)=τNβN(α,τ)+O(1τδ)+O(τN) (17)

for the same δ > 0 as in (16) (for 0 < α < 3/2 – see Theorem C.1; see also Remark 3.2 on the range 3/2 ≤ α < 2). The term of order O(τδ), then, is mostly determined by the high frequency behavior of the anomalously diffusive particle (see Figure 1 and expressions (52), (54)). In (17), the main bias factor is given by the function

βN(α,τ)=14τi=N+1N1(1|i|N){|iτ+1|α2|iτ|α+|iτ1|α}2. (18)

Note that (18) depends on the unknown parameter α. So we use Astand and (18) to define an estimator of the bias vector by

(βN(Astand,τk))k=1,,m. (19)

(b) Accounting for disturbance correlation

In linear estimation theory, the method for dealing with correlated random errors is called generalized least squares (GLS). In fact, the resulting GLS estimator is the best linear unbiased estimator, since it outperforms its OLS counterpart in terms of mean squared error (MSE) (see [15]).

In the context of TAMSD-based estimation, to better understand the difference between the standard, OLS-based estimator and the related GLS-based estimator, recast the vector system (3) as the regression model

z=Xξ+ε. (20)

In (20), the term ξ is as in (1), and the dependent variable and the regressor are given by, respectively,

z=(logMN(τk))k=1,,m,X=(1logτ11logτm). (21)

It is well known that the expression

Estand:=(XTX)1XTz=(Lstand,Astand)T (22)

gives the standard estimator (7) generated by the OLS solution to the system (20). By contrast, let

Υ(ξ)=(vk1,k2(ξ))k1,k2=1,,m, (23)

be the finite sample covariance matrix of the vector z as in (21). The GLS solution is given by

(XTΥ(ξ)1X)1XTΥ(ξ)1z, (24)

which involves the unknown matrix (23). In practice, then, one needs to estimate such matrix. For this purpose, we first establish the entrywise expansion

vk1,k2(ξ)=τNζN(α,τk1,τk2)+O(τ1δN)+o(τN),k1,k2=1,,m (25)

(for 0 < α < 3/2 – see Theorem C.2; see also Remark 3.2 on the range 3/2 ≤ α < 2). In (25), the main variance factor is given by

ςN(α,τk1,τk2)=12τi=N+1N1(1|i|N){|iτk1τk2+τk1τk2|α|iτk1τk2+τk2τk2τk2τk1|α|iτk1τk2|α+|iτk1τk2τk2τk1|α}2. (26)

Note that expression (26) does not involve the constant σ2, but it is still a function of the unknown parameter α. Second, and in view of this, we can use Astand and (26) to define an estimator of the covariance matrix by

Υ(Astand):=(τNζN(Astand,τk1,τk2))k1,k2=1,,m. (27)

Drawing upon (a) and (b), we can further construct an improved estimator of ξ by a quasi-GLS procedure based on the estimator Υ(Astand) and by replacing (3) with the bias-corrected regression system

logMN(τk)+τkNβN(Astand,τk)=logσ2+αlogτk+εk,k=1,,m. (28)

The resulting estimator can be expressed as

E=(L,A)=(XTΥ1(Astand)X)1XTΥ1(Astand)y, (29)

where X is again as in (21) and

y=(logMN(τ1)+τ1NβN(Astand,τ1)logMN(τm)+τmNβN(Astand,τm)). (30)

For the reader’s convenience, the construction of the estimator E is summarized in the form of pseudocode in Appendix D.

To compare the performances of E and Estand, we generated 1000 independent paths of length 210 and estimated the diffusion exponent based on the two methods. Figure 2 displays the results in terms of Monte Carlo bias, standard deviation and square root MSE. The improved estimator E outperforms the usual estimator Estand by any of the three criteria for different values of α.

Figure 2: Comparative analysis of bias, standard deviation and MSE.

Figure 2:

of the estimators E (see (29)) and Estand (see (7)) as a function of diffusion exponents α (x-axis). Left plot: estimation of α. Right plot: estimation of log σ2. Solid and dashed lines represent E and Estand, respectively. For any parameter value α, the proposed estimator E has smaller bias, standard deviation and square root MSE than Estand. The total number of Monte Carlo runs is 1000 based on paths of length 210.

Remark 3.1

Note that the main bias and variance factors βN(α, τ) and ςN(α,τk1,τk2) in (18) and (26), respectively, converge as N → ∞ (see Lemma C.1). Moreover, after standardization, the estimator (29) is provably asymptotically normal and consistent for 0 < α < 3/2 (see (35) in Section 4 and Proposition C.1). See also Remark 3.2 on the range 3/2 ≤ α < 2.

Remark 3.2

Although we do not provide proofs in this paper, the methods developed in this section and also in Section 4 can be extended to the strongly superdiffusive range 3/2 ≤ α < 2. For example, due to nonstandard convergence rates, expressions (17) and (25) hold after replacing O(τN) with O((τlogNN)) (for α = 3/2) or O((τN)42α) (for 3/2 < α < 2). Likewise, the asymptotic non-Gaussian distribution of the estimator (35) in Section 4, with nonstandard convergence rates, can be established. However, inference involving the nonstandard limiting distribution can be cumbersome, and the computational studies in this section and in Section 4 show that the methods in the proposed format work reasonably well for realistic path lengths. See also [20], Remark 2, on how to construct asymptotically valid confidence intervals for ξ based on the standard estimator Estand assuming prior knowledge that 3/2 < α < 2.

4. Testing heterogeneity

Single particle tracking experiments with viscoelastic diffusion often generate data in the form of multiple particle paths. As discussed in the Introduction, fluid heterogeneity can be tested in the intra- and interfluid senses. The pathwise framework constructed in Section 3 can be used in new testing protocols with good finite sample and asymptotic properties. We remind the reader that the mathematical statements cover the diffusion exponent range 0 < α < 3/2 (subdiffusive and mildly superdiffusive range), though we also include computational experiments for the strongly superdiffusive range 3/2 ≤ α < 2.

Tables 3 and 4 display the proposed framework. For each type of fluid heterogeneity, they show the appropriate hypotheses and testing procedures, respectively. In the remainder of this section, we provide a detailed description of the protocols. To set the notation, we recall that, for a given a hypothesis test, the conditional probability

(H0isrejected|ξsatisfiesH0)=:ϵ[0,1] (31)

is called the size (or significance level) of the test, whereas the function

ξ(H0isrejected|ξsatisfiesH1)[0,1] (32)

is called the power of the test.

Table 3:

Hypotheses

heterogeneity H0 Ha
intrafluid ξ1 = ··· = ξν ξiξj for some 1 ≤ i,jν
interfluid ξI = ξII ξIξII

Table 4:

Tests

heterogeneity rejection region test statistic number of paths
intrafluid Rintra (see (38)) S12, S22 (see (37)) ν
interfluid Rinter (see (46)) T1, T2 (see (45)) νI, νII

Intrafluid heterogeneity

Suppose ν ∈ ℕ bead diffusion paths of length N from a single fluid sample are available. If the fluid is physically homogeneous, it is expected to generate particle paths with nearly identical parameter values ξ. The alternative is that ξiξj for some pair i, j, namely, their anomalous diffusion parameters differ. These two possibilities, labeled H0 and Ha, respectively, are listed on the row “intrafluid” in Table 3.

Starting from the ν particle paths, let

Ei,i=1,,ν, (33)

be vector-valued estimators as in (29). For the purpose of constructing a test statistic, we need a normalized (standardized) estimator. Note that the variance of the GLS-type solution (24) is given by

(XTΥ1(ξ)X)1 (34)

(cf. [15]). So, define a standardized estimator by

Zi=(Zi,1Zi,2)=Λ1/2(Astand,i)Ei,i=1,,ν, (35)

where

Λ(Astand,i):=(XTΥ1(Astand,i)X)1 (36)

and the variance estimator Υ(Astand,i) is given by (27). Then, (35) converges in distribution to ν independent and identically distributed normal random vectors with uncorrelated entries (see Proposition C.1). So, for Z¯j=ν1i=1νZi,j, let

Sj2=1ν1i=1ν(Zi,jZ¯j)2,j=1,2, (37)

be the normalized and decorrelated sample variances of {Zi,j}i=1,…,ν, j = 1, 2, as in (35). Then, under H0,

((ν1)S12,(ν1)S22)d(X1,X2),Xjχν12,j=1,2,

as N → ∞, where X1 and X2 are independent random variables. To test heterogeneity at significance level ϵ, we can use Bonferroni-type correction (e.g., [15], section 5.3) and reject the null hypothesis H0 if

Rintra:(ν1)S12>χν1,ϵ/22or(ν1)S22>χν1,ϵ/22, (38)

where χν1,ϵ/22 is a chi-square quantile (c.f. Table 4, “intrafluid” rows).

To check the size of the test (38) over finite samples, we conducted a Monte Carlo study with 50 simulated paths of length 212 and recorded whether or not the null hypothesis H0 is rejected at ϵ = 0.05 significance level. This procedure was repeated 2000 times. Since each outcome is a Bernoulli trial (reject or not H0), the simulation rejection rate follows a binomial distribution with n = 2000 and p = 0.05. Thus, a normal approximation to the 95% confidence interval of the rejection rate gives (0.040, 0.060). As shown in Figure 3, left plot, the observed simulation rejection rate was around 0.05 and within the 95% confidence interval (for 0 < α < 3/2), as expected. Unreported computational experiments for different significance levels lead to analogous conclusions.

Figure 3: Intra- and interfluid heterogeneity: test sizes.

Figure 3:

Monte Carlo sizes of the intrafluid test (left plot; see (38)) and interfluid test (right plot; see (46)) as a function of the diffusion exponent α (x-axis). For every value of α, each of 2000 Monte Carlo runs consisted of generating 50 independent paths of length 212 and conducting a test at ϵ = 0.05 (see (31)) or, equivalently, 95% confidence level. The Monte Carlo rejection rate is very close to the theoretical value of ϵ = 0.05 for almost all values of α.

Figure 4 displays Monte Carlo power curves for the intrafluid test. The study was conducted with a total of ν = ν1 + ν2 paths, where ν1 and ν2 have diffusion exponents α1 = 1 and α2, respectively, and α2 = 0.8 (left plot) or α2 = 0.7 (right plot). In each plot, the x-axis represents the proportion of paths

ν2ν1+ν2 (39)

with diffusion exponent α = α2. The power curves quickly converge to 1 as a function of the ratio (39), especially for the more distinguishable value α2 = 0.7 < 1 = α1.

Figure 4: Intrafluid heterogeneity: test power.

Figure 4:

Simulations were conducted with a total of ν = ν1 +ν2 paths, where ν1 and ν2 of them displayed diffusion exponents α = α1 = 1 and α = α2, respectively. In each plot, the y-axis represents the observed test power, or rejection rates (see (32)), and the x-axis is the proportion of paths with α = α2 (see (39)), starting at 0.02. The total number of Monte Carlo runs is 2000. Left plot: α2 = 0.8. Right plot: α2 = 0.7.

When α > 3/2, under H0 the estimators (33) converge in distribution to ν independent and identically distributed non-Gaussian random vectors. Hence, so do the estimators (35). In this case, the marginal distributions of the decorrelated vector ((ν1)S12,(ν1)S22) do not approach chi-squared distributions. In computational experiments, the size of the intrafluid test (38) did not significantly deviate from the 0.05 target for α = 1.6, indicating that the nonstandard asymptotic behavior is not a concern for paths of length 212. Deviation was significant for the extreme value α = 1.8, suggesting that approximating the test size by simulation may be generally recommendable for greater accuracy (see Figure 3, left plot).

Interfluid heterogeneity

Now suppose νI and νII paths,

νI,νII, (40)

are obtained from two physically homogeneous fluid samples I and II, respectively. We are interested in testing whether the samples I and II are homogeneous, namely, whether or not particle diffusion in the fluid samples displays the same underlying parameter value ξ. These two possibilities, labeled H0 and Ha, respectively, are described on the row “interfluid” in Table 3.

Since multiple (independent) particle paths are assumed available for each fluid sample, we can construct an estimator involving all available TAMSD terms. In fact, first define the overall average mean squared displacement over ν TAMSD terms (AMSD) by

MN*(τ)=1ν=1νMN(τ). (41)

By independence,

MN*(τ)=MN(τ)=X2(τ),VarMN*(τ)=1νVarMN(τ). (42)

Then, AMSD-type estimators

E*=(L*,A*) (43)

can be obtained by applying the pseudocode in Appendix D after replacing TAMSD terms MN(τk) with their AMSD counterparts MN*(τk), k = 1, …, m. Given two fluid samples I and II, let EI* and EII* be their respective AMSD-type estimators. Their finite sample covariance matrices are given by νI1Λ(ξI) and νII1Λ(ξII), respectively (cf. (34)). By analogy with (25) and (36), we can define their AMSD-type estimators

1νIΛ(AI*),1νIIΛ(AII*). (44)

Figure 5 displays a study of the accuracy of Λ(A*) as an estimator. It plots Monte Carlo variances of the estimator E as well as their estimates Λ(A*) for several values of the parameter α. The latter nearly perfectly match the former in the subdiffusive range. A slight deviation appears in the strongly superdiffusive range, but still within an acceptable margin.

Figure 5: Comparative analysis.

Figure 5:

of the standard error of the estimator E = (L, A) (dashed line; see (29)) and its AMSD-type estimator Λ(A*) (solid line; see (44)) as a function of the diffusion exponent α. The latter closely matches the former, especially in the subdiffusive range α < 1. The number of Monte Carlo runs is 1000 based on particle paths of length 210.

Hence, we define the standardized estimators

Zj*=(Zj,1*Zj,2*)=νjΛ1/2(Aj*)Ej*,j=I,II.

In view of Proposition C.1, these estimators are also asymptotically normal for 0 < α < 3/2. Hence, let

T1=ZI,1*ZII,1*2,T2=ZI,2*ZII,2*2 (45)

be the associated test statistics. The rejection region is given by

Rinter:|T1|>zϵ/4or|T2|>zϵ/4, (46)

where zϵ/4 is a standard Normal quantile (c.f. Table 4, row “interfluid”). In (46), the probability ϵ/4 stems, first, from applying a Bonferroni-type correction to a double testing region (hence yielding ϵ/2 significance level in each), and second, from the fact that in each region the test statistic distribution is two-sided.

To check the test’s size over finite samples, we produced a 2000-run Monte Carlo study based on two sets of 50 paths with the same diffusion exponent, where tests were conducted at significance level ϵ = 0.05. As shown in Figure 3, right plot, the rejection rate was close to 0.05, as expected.

In Figure 6, we investigate the interfluid test power as a function of the path lengths and number of paths. The x-axis represents the difference between the diffusion exponents from two fluids, namely,

δα=|αIαII|, (47)

whereas the y-axis is the test power at ϵ = 0.05. From top to bottom, the three plots correspond to αmin = min{αI, αII} = 0.2, 1.0, 1.8, respectively, for various combinations of realistic values of N and ν = νI = νII. In all cases, the power curves start at around 0.05, as expected, and quickly approach 1 as a function of δα as defined in (47). Larger path lengths, larger number of particle paths as well as not very large values of αmin are associated with faster convergence of power curves to 1.

Figure 6: Interfluid heterogeneity: test power.

Figure 6:

For various values of N (see (2)) and ν = νI = νII (see (40)), we plot test power (y-axis; see (32)) as a function of δα (x-axis; see (47)). The parameter values are given by αmin := min{αI, αII} in the range αmin = 0.2, 1.0, 1.8 (top, middle and bottom, respectively).

Interfluid heterogeneity: more or longer paths under technical–experimental constraints?

Figure 6 also illustrates the following phenomenon. For the subdiffusive and diffusive cases, there is no visible difference between doubling the path lengths or the number of paths. However, in the strongly superdiffusive range, doubling the number of paths increases the test power more than doubling the path lengths.

In real world lab conditions, conducting single particle experiments involves coping with technical restrictions. For example, there may be limited camera recording time, tracer particles may slip out of the field of view or there may be a limit on the total number of tracer particles per fluid sample while still assuming that particles diffuse independently. So, assuming technical– experimental restrictions are in place, it is relevant to ask: what is the difference between

  • Method I : recording the movement of a larger number of particles (ν) over a fixed period of time (hence, keeping constant the average sample path length N); and

  • Method II : recording the same number of particles ν over a longer period of time (hence, yielding a larger average N)?

We answer this question in the framework of interfluid heterogeneity testing.

In the regimes of Methods I and II, we investigate the performance of the AMSD-type estimator (43) in terms of bias, standard deviation and square root MSE. Bearing in mind expression (42), by a similar reasoning to the one leading to expression (17) for a single observed path, the bias of log2MN*(τ) as an estimator of α log τ + log σ2 is given by

O(τδ)(MN*(τ)X2(τ))22X2(τ)2=O(τδ)1ν(MN(τ)X2(τ))22X2(τ)2. (48)

Moreover, by the independence of particle paths, we can approximate the variance of log2MN*(τ) by

Var(logMN*(τ))VarMN(τ)νX2(τ)4. (49)

The performance of the estimator (43) in the two regimes depends on the interplay between the bias and variance components (48) and (49), respectively. In a computational experiment, we applied the following procedure.

  1. Start out in the same setting: 24 paths of length 28 for each method, run 500 Monte Carlo simulations to get the bias, standard deviation and square root MSE of A for Method I and II;

  2. for Method I, fix the path length and at each step generate 24 times the previous number of paths and redo the Monte Carlos experiments;

  3. for Method II, fix the number of paths and at each step generate paths of length 24 times the previous length, multiply all lags by 2 and redo the Monte Carlos experiments;

  4. repeat 2. and 3. three times.

For ease of comparison, Table 5 displays the multiple instances generated. Note that, at each step, the total number of points recorded

ν×N (50)

is identical for the two methods.

Table 5:

Methods I and II.

Method I Method II
step N ν ν × N N ν ν × N
1 28 24 212 28 24 212
2 28 28 216 212 24 216
3 28 212 220 216 24 220
4 28 216 224 220 24 224

We compare the results in Figure 7, top and middle plots, where the diffusion exponent is set to α = 0.6 and 1.0, respectively. Method II has smaller bias and square root MSE. The reason is that, when ν is large enough, the term O(τδ) dominates the bias. Thus, increasing the number of paths ν does not reduce the bias. However, increasing the path length N means that the TAMSD terms MN(τ) with larger lag values τ can be used in the regression procedure. This implies a reduction in magnitude of the term O(τδ), and hence, smaller bias. Method I displays smaller standard deviation because a 16-fold increase in ν reduces the standard error by a factor of 1/4. Meanwhile, noting that 〈X2(τ)〉 ~ σ2τα, expression (55) implies that for Method II the standard deviation is proportional to τ/N. By multiplying N by 16 and τ by 2, the standard error is reduced by a factor of 1/22.

Figure 7: Bias, standard deviation, and square root MSE for Methods I (blue) and II (red).

Figure 7:

The x-axis denotes the total number of recorded data points (see (50)). Top: α = 0.6. Middle: α = 1.0. Bottom: α = 1.8.

In Figure 7, bottom plot, we set α = 1.8. For this parameter value, the convergence rate of the TAMSD-based estimators is slower than 1N. Method II still shows a smaller bias by comparison to Method I, as expected. However, since δ increases as a function of α (see expression (54)), then O(τδ) shrinks with α. Therefore, the component O(τδ) carries less weight in the estimator’s bias for the superdiffusive case than for the subdiffusive case. Since α > 3/2, again by expression (55) (see also Remark 3.2) the standard deviation for Method II is proportional to (τ/N)2−α = (τ/N)0.2. Thus, again assuming a 16-fold increase in N and a 2-fold increase in τ, the standard error is reduced by a factor of 1/80.2, which is much slower than the standard error reduction factor of 1/22 for Method I. These are the two reasons why Method I displays smaller square root MSE than Method II.

5. Analysis of experimental data: heterogeneity of treated P. aeruginosa biofilms

The Hill and Schoenfisch Labs at UNC-Chapel Hill produced data from experiments on disruption and eradication of P. aeruginosa biofilms using nitric oxide-releasing chitosan oligosaccharides [73]. For the reader’s convenience, we provide a brief description of the experiments.

Cystic fibrosis (CF) lung disease is caused by defective chloride transport, resulting in thickened, dehydrated mucus. The latter restricts bacterial motility and promotes P. aeruginosa biofilm formation. Inhaled tobramycin is currently the only antibiotic recommended for the treatment of both initial and chronic P. aeruginosa infections in patients with CF. While inhaled tobramycin is effective at eradicating bacteria within biofilms, it fails to physically remove the structural remnants of the biofilm from the airways. This may lead to biofilm regrowth and the development of antibiotic-resistant infections. Therefore, an ideal anti-biofilm therapeutic for CF would both eradicate bacteria and physically degrade the biofilm, facilitating clearance from the airways.

On the other hand, nitric oxide (NO) is an endogenously produced diatomic free radical with significant antibacterial activity against P. aeruginosa biofilms. Atomic force microscopy revealed that NO exposure causes structural damage to the membranes of planktonic Gram-negative bacteria, including P. aeruginosa. The interest lies in the utility of NO-releasing chitosan oligosaccharides to both eradicate and physically alter P. aeruginosa biofilms and in comparing its effect with tobramycin. In order to measure the physical changes to bacterial biofilms caused by NO-releasing chitosan oligosaccharides, movements of fluorescent tracer particles embedded in P. aeruginosa biofilms were imaged at 60 frames per second for 30 seconds on an inverted microscope at 40× magnification. The tracer particle displacement as a function of time was quantified using Video Spot Tracker software (Center for Computer Integrated Systems for Microscopy and Manipulation, University of North Carolina at Chapel Hill); see [73] for more details.

Fluid heterogeneity has been correlated with increased viscoelasticity for complex biological materials such as sputum [16]. In the experiments we describe, the effect of antibacterial treatment on biofilm heterogeneity was thus evaluated at different concentrations based on tracer particle displacement data. In Table 6, we use the data to test the intrafluid heterogeneity of P. aeruginosa biofilms after tobramycin treatments at concentrations levels 25, 50, 100, 200, and 400 μg ml−1. From each of these fluid samples, we randomly select 100 paths of length N = 1800. An application of the intrafluid test (38) produces strong evidence (negligible p-values) of intrafluid heterogeneity in every sample. This conclusion matches those reported in [73]. Since no homogeneous fluid samples are detected from any of these five samples, we do not perform the interfluid heterogeneity test (46).

Table 6:

Intrafluid biofilm heterogeneity testing after treatment with tobramycin at concentration levels 25, 50, 100, 200, 400 μg ml−1. 100 independent paths of length 1800 were randomly selected for each concentration level.

Tobramycin (μg ml−1) p-value
25 < 10−16
50 < 10−16
100 < 10−16
200 < 10−16
400 < 10−16

In Table 7, we apply (38) in the testing of intrafluid heterogeneity of P. aeruginosa biofilms after COS2-NO treatment at concentration levels 1, 2, 4, 8, and 16 mg ml−1. COS2 releases NO, which cause the physical disruption and eradication of biofilms [73], and also reduces the viscoelastic properties of mucus [72]. As before, 100 paths of length 1800 were randomly selected for each concentration level. At concentrations 1 or 2 mg ml−1, the p-values are still less than machine error, which indicates strongly significant heterogeneity. As the concentration level increases to 4 and 8 mg ml−1, the p-values also increase. At concentration level 16 mg ml−1, the p-value reaches 0.18. Hence, we fail to reject the null hypothesis of intrafluid homogeneity. This provides evidence that the COS2-NO treatment is effective at eradicating P. aeruginosa biofilms. Once again, this analysis confirms the conclusions reported in [73]. In Table 8, by applying (46), we test the interfluid heterogeneity of P. aeruginosa biofilms after COS2-NO treatment at concentration level 16 mg ml−1. From each fluid sample (A, B and C), we selected 100 paths of length 1800 and conducted the test. It turns out that there is no evidence whatsoever of heterogeneity among fluid samples A, B and C.

Table 7:

Intrafluid biofilm heterogeneity testing after treatment with COS2-NO at concentration levels 1, 2, 4, 8, 16 mg ml−1. 100 independent paths of length 1800 were randomly selected for each concentration level.

COS2-NO (mg ml−1) p-value
1 < 10−16
2 < 10−16
4 2 × 10−13
8 3 × 10−9
16 0.18

Table 8:

Interfluid biofilm heterogeneity testing after treatment with COS2-NO at concentration level 16 mg ml−1. 100 independent paths of length 1800 were randomly selected for each group.

COS2-NO 16 mg ml−1 p-value
Group A vs Group B 0.9996
Group A vs Group C 0.9998
Group B vs Group C 0.9999

6. Conclusion

Motivated by applications in viscoelastic diffusion, in this paper we start from the TAMSD’s asymptotic distribution for a broad class of Gaussian fractional stochastic processes [20] to propose statistical protocols that make use of single-particle tracking data in the detection of fluid heterogeneity.

The testing methodology is based on an improved TAMSD-type estimator. To construct this estimator, we tackle two of the main issues involved in TAMSD-based estimation, namely, we mathematically characterize: (a) the finite-sample bias in log-TAMSD-based methods; and (b) the effect of disturbance correlation. The theoretical results on (a) and (b) allow us to propose a nearly optimal estimator by combining a bias-correction procedure and a generalized least squares-type regression solution. The improved TAMSD-based estimator (29) is asymptotically normal for 0 < α < 3/2, and computational experiments show that the new estimator outperforms the standard TAMSD-based estimator both in terms of bias and square root MSE for values of α over the whole parameter range (0, 2).

The estimator (29) is used in the construction of protocols for fluid heterogeneity detection in two different experimental situations, namely, when testing: (i) whether different regions of the same fluid are heterogeneous (intrafluid heterogeneity); or (ii) whether two samples from each homogenous fluid are heterogeneous (interfluid heterogeneity). Reflecting the asymptotic behavior of (29), for 0 < α < 3/2 the test statistics (37) and (45) for intra- and interfluid heterogeneity are asymptotically chi-square and asymptotically normally distributed, respectively. This ensures the tests and associated quantiles are asymptotically valid. Computational experiments confirm that the tests’ significance levels are accurate over finite samples, and that the tests display high power even for relatively small deviations from the null hypotheses.

In all cases, for the sake of completeness, we discuss and provide computational studies on the strong superdiffusivity range 3/2 ≤ α < 2. Although this may not affect physical areas of application where subdiffusion is prevalent, this research points to one difficulty involved in TAMSD-based modeling, namely, a potentially non-Gaussian (Rosenblatt-type) limiting distribution with an intricate cumulant structure (cf. expression (60)).

The constructed framework helps to shed light on the effect of common technical experimental constraints such as limited camera recording time: we characterize the difference between observing longer particle paths and using a larger number of particle paths of given length.

We apply the protocols in physical practice by making inferences on fluid viscoelasticity with data from the Hill and Schoenfisch Labs (UNC-Chapel Hill), as first reported and described in [73]. The testing protocols reveal that COS2-NO treatment is effective in eradicating P. aeruginosa biofilms, since greater concentration levels of the treatment clearly lead to greater fluid homogeneity as detected by tracer particle displacement data.

The research contained in this paper points to a number of interesting questions. From a modeling standpoint, it would be convenient to construct a heterogeneity testing framework for single particle experiments that, unlike TAMSD-based methods, mathematically covers the full anomalous diffusion parametric range 0 < α < 2 under the same limit parametric family of distributions. Moreover, ideally a heterogeneity testing framework should be robust with respect to nuisance trends and added experimental noise. In another research direction, the same questions can be asked for classes of anomalous diffusion models not covered by the results in [20] such as, for example, continuous time random walks and related stochastic processes.

A. The asymptotic distribution of the TAMSD

In Theorem A.1, we provide the asymptotic distribution of the TAMSD random vector after centering, which in Corollary A.1 allows developing the asymptotic distribution of the OLS estimator (7) of the diffusivity coefficient and diffusion exponent.

Theorem A.1

(Didier and Zhang [20], Theorem 1) Suppose the particle motion is a Gaussian, stationary increment process whose covariance function admits the harmonizable (Fourier domain) representation

X(s)X(t)=Cα2(eisx1)(eitx1)x2f(x)dx,α(0,2),Cα0. (51)

In (51), the spectral density has the generic form f(x)=s(x)|x|α/21/2, where the-valued high frequency function s(x) is bounded and satisfies the relations

|s(0)|2=1,||s(x)|21|C0|x|δ0,x(ε0,ε0), (52)

for constants C0, δ0, ε0 > 0. Suppose the growth of the lag value term τ = τ(N) ∈ ℕ as in (15) with respect to the sample size N is given by

τ(N)log2(N)N+Nτ(N)1+δ/20,N, (53)

where

δ=min{α/2,δ0/2}. (54)

Also consider the rates of convergence

{0<α<3/2:η(N)=N,ζ(τ)=τα+1/2;α=3/2:η(N)=Nlog(N),ζ(τ)=τ2;3/2<α<2:η(N)=Nα1,ζ(τ)=τ2. (55)

Then, as N → ∞,

(Nkη(Nk)ζ(τk)(MN(τk)X2(τk)))k=1,,mdZ, (56)

where, for k = 1, …, m, Nk = Nτk is the number of available terms in each TAMSD sum (2) and τk is given by (15). In (56), the distribution of the random vector Z can be described as follows.

  1. If 0 < α < 3/2, then ZN(0,Σ) (a m-variate Gaussian distribution), where the entry k1, k2 of the matrix Σ = Σ(α) is given by
    Σk1,k2=2wk1α1/2wk2α1/2(CαCH)4G^(y;wk1,wk2)L2()2, (57)
    k1, k2 = 1, …, m. In (57), we define
    CH=π1HΓ(2H)sin(Hπ)
    and
    G^(y;wk1,wk2)=CH2(eiwk1y1)(eiwk2y1)|y|α+1; (58)
  2. if α = 3/2, then ZN(0,Σ), where the entry k1, k2 of the matrix Σ = Σ(α) is given by
    Σk1,k2=4ϑ2,k1,k2=1,,m, (59)
    and
    ϑ=(CαCH)2α(α1)2;
  3. if 3/2 < α < 2, Z follows a multivariate Rosenblatt-type distribution whose characteristic function is given by
    ϕZ(t)=exp{12s=2[2iϑk=1mtk]sscs(α)} (60)
    around the origin. In (60), for s ≥ 2, cs(α) is given by
    [0,1]s|x1x2|α2|x2x3|α2|xsx1|α2dx1dxs. (61)

Consider again the regression system (20) and recall that expression (22) gives the standard estimator generated by the OLS solution to the system (20). The following corollary describes the asymptotic distribution of the standard estimator (22).

Corollary A.1

(Didier and Zhang [20], Corollary 1) Suppose the assumptions of Theorem A.1 hold. Then, as N → ∞,

Nταη(N)ζ(τ)(1logτ(Lstandlogσ2)Astandα)d(UTUT)AZ. (62)

In (62),

A=A(σ2,α)=diag(ζ(w1)/(σ2w1α),,ζ(wm)/(σ2wmα)), (63)

η(·), ζ(·) and Z are as in Theorem A.1, and

UT=1cw(k=1mlog(wk/w1),,k=1mlog(wk/wm)) (64)

with constant

cw=mk=1mlog2(wk)(k=1mlog(wk))2. (65)

In particular, the standard estimator (22) is consistent, namely, relation (9) holds.

Proposition A.1

(Didier and Zhang [20], Proposition 1, (i)) Under the assumptions of Theorem A.1, there is a constant σ2 > 0 such that (16) holds for some C > 0, where δ > 0 is given by (54).

B. Some lemmas

In this section, we present some lemmas that are used to prove Theorems C.1 and C.2 in Section C. Throughout this section, we assume 0 < α < 3/2 and the conditions of Theorem A.1. In proofs, whenever convenient C denotes a constant that may change from one line to the next.

In light of (56), define the standardized statistic

ϖ(τ)=MN(τ)X2(τ). (66)

In particular,

ϖ(τ)P1,N, (67)

so that a Taylor expansion can be applied to logϖ(τ) around 1. Meanwhile, we define the standardized increment

Wj(τ)=X(j+τ)X(j)X2(τ). (68)

We will use the following results in our proofs. The first one is the classical Isserlis theorem, which reduces the higher moments of a multivariate normal vector to its second moments. The second one is a concentration inequality that will allow us to establish sharp bounds on the tails of centered quadratic forms.

Theorem B.1

(Isserlis, [37]). Let (Z1, Z2, …, Z2N) be a zero mean, multivariate normal random vector. Then,

Z1Z2Z2N=ZiZj,

where the notation stands for summing over all distinct ways of partitioning Z1, …, Z2N into pairs Zi, Zj and each summand is a product of these N pairs.

Theorem B.2

[47, 5] Let Z1,,ZNi.i.dN(0,1) and consider constants η1, …, ηN ≥ 0, not all zero. Letη2 andη be the Euclidean square and sup norms of the vector η = (η1, …, ηN)T.

Also, define the random variable X=i=1Nηi,N(Zi21). Then, for every x > 0,

(X2η2x+2ηx)exp(x),
(X2η2x)exp(x).

The following lemma describes some basic properties of the central moments of (66).

Lemma B.1

As N → ∞,

(ϖ(τ)1)2=O(τN). (69)

Moreover, any moment of ϖ(τ)1 is bounded in N, i.e.,

|(ϖ(τ)1)κ|=O(1),κ. (70)

Proof. Expression (70) (for κ ≥ 3) can be proved by adapting the argument for establishing expression (C.24) in [94], while making use of the bound (76) and Lemma B.2 (expressions (73) and (74)). So, for the reader’s convenience, we establish (69) (for κ = 2). The left-hand side of (69) can be rewritten as

1N2k1=1Nk2=1N((X(τ+k1)X(k1))2X2(τ)1)((X(τ+k1)X(k2))2X2(τ)1)=1N2k1=1Nk2=1N(Wk12(τ)1)(Wk22(τ)1). (71)

By applying the Isserlis theorem (Theorem B.1),

(Wk12(τ)1)(Wk22(τ)1)=Wk12(τ)Wk22(τ)Wk12(τ)Wk22(τ)+1=2Wk1(τ)Wk2(τ)2=2X2(τ)2γτ2(k1k2),

where

γh(k1k2)=(X(k1+h)X(k1))(X(k2+h)X(k2)).

Thus, (71) can be recast as

2N2X2(τ)2k1,k2=1Nγτ2(k1k2), (72)

Note that

2N2X2(τ)2=O(1N2τ2α)=O(τNζ2(τ)η2(N)),

where ζ(τ), η(N) are defined by (55). Then, by (72), Lemmas C.1 – C.4 in [20], expression (71) is of the order O(τN), as claimed. □

The next lemma draws upon Theorem B.2 and Lemma B.1 to construct a concentration inequality for (66) (see also [22]).

Lemma B.2

Fix −∞ < r < 1/2 < 3/2 < rˊ. Then, for any 0 < ξ < 1/2 and some C > 0,

(ϖ(τ)r)exp{C(Nτ)ξ} (73)

and

(ϖ(τ)r)exp{C(Nτ)ξ}. (74)

Proof. Let Wj(τ), j = 1, …, N be as in (68). Then, for ϖ(τ) as in (67), we can write

ϖ(τ)=1Nj=1NWj2(τ)=1NWNTWN,

where WN = (W1(τ), …, WN(τ))T is a multivariate normal vector with covariance matrix Γ. Consider the spectral decomposition QΛQT = Γ, where Q is a N × N orthogonal matrix and Λ = diag{λ1, …, λN} is a N × N diagonal matrix. Then, WN=dQΛ1/2ZN, where ZN=(Z1,,ZN)TN(0,IN), IN is the N×N identity matrix, and =d denotes equality in distribution. Therefore,

ϖ(τ)=d1N(QΛ1/2ZN)TQΛ1/2ZN=1NZNTΛZN=j=1Nηj,NZj2, (75)

where ηj,N=λjN. Let ηN = (ηi,N)i=1,…,N be the vector of coefficients η·,N. By expression (69) in Lemma B.1,

ηN2ηN22=Varϖ(τ)=(ϖ(τ)1)2=O(τN). (76)

By Theorem B.2, by using the same argument as in the proof of Lemma C.3 in [94], and applying the bound (76),

(ϖ(τ)r)=(j=1Nηj,N(Zj21)r1)exp{CηN22}exp{CNτ}

for some C > 0. Thus, (73) follows. To show (74), it suffices to adapt the proof of expression (C.34) in [94]. In fact, by (76), 0 < ξ < 1/2 and Theorem B.2,

(ϖ(τ)r)=(j=1Nηj,N(Zj21)r1)(j=1Nηj,N(Zj21)2ηN2(Nτ)ξ2+2ηN(Nτ)ξ)exp{C(Nτ)ξ}.

The following lemma is used in the proof of Lemma B.6.

Lemma B.3

Let p ≥ 1, there is a constant Kp only depending on p such that

|logϖ(τ)|pKp (77)

Proof. By (75), ϖ(τ) is a nonnegative weighted sum of independent chi-squared random variables, where not all weights are zero. Then, relation (77) is a consequence of expression (96) in [63], p. 184. □

The following lemma can be shown based on Lemma B.2 and an adaptation of the proof of expressions (C.38) and (C.39) in [94], which pertains to higher order (cross)moments of wavelet variance terms.

Lemma B.4

Let κ1, κ2 ∈ ℕ ∪ {0}, κ1 + κ2 ≥ 3, and fix 0 < r < 1/2. Then, as N → ∞,

(ϖ(τ1)1)κ1(ϖ(τ2)1)κ2=O[(τN)2].
(ϖ(τ1)1)κ1(ϖ(τ2)1)κ21{min{ϖ(τ1),ϖ(τ2)}>r}=O[(τN)2]. (78)

Lemmas B.5, B.6 and B.7, stated and shown next, are used in the proofs of Theorems C.1 and C.2. The lemmas provide expressions for (cross)moments and (cross)moments of logarithms of the random variables (66) at different lag values.

Lemma B.5

(ϖ(τk1)1)(ϖ(τk2)1)=12ni=N+1N1(1|i|N)××{|iτk1τk2+τk1τk2|α|iτk1τk2+τk1τk2τk2τk1|α|iτk1τk2|α+|iτk1τk2τk2τk1|α}2(1+O(τδ)). (79)

Proof. For notational simplicity, assume k1 =1 and k2 = 2. By (68), the left-hand side of (79) can be rewritten as

1N2j1,j2=1N(Wj12(τ1)1)(Wj22(τ2)1)=1N2j1,j2=1NWj12(τ1)Wj22(τ2)1. (80)

By Theorem B.1 (Isserlis),

Wj12(τ1)Wj22(τ2)=Wj12(τ1)Wj22(τ2)+2Wj1(τ1)Wj2(τ2)2=1+2Wj1(τ1)Wj2(τ2)2. (81)

By Lemma A.1 in [20],

(X(j1+τ1)X(j1))(X(j2+τ2)X(j2))X2(τ1)X2(τ2)=(BH(j1+τ1)BH(j1))(BH(j2+τ2)BH(j2))BH2(τ1)BH2(τ2)(1+O(τδ)), (82)

where BH is a standard fBm with Hurst parameter given by the relation (12). By (68), (82) and expression (10) for the covariance function of fBm,

Wj1(τ1)Wj2(τ2)=12{|j1j2τ1τ2+τ1τ2|α|j1j2τ1τ2+τ1τ2τ2τ1|α|j1j2τ1τ2|α+|j1j2τ1τ2τ2τ1|α}(1+O(τδ)). (83)

Since 〈Wj1 (τ1)Wj2 (τ2)〉 = 〈Wj1+k (τ1)Wj2+k (τ2)〉, then by expression (81) we can rewrite (80) as

12Ni=N+1N11Nj1j2=i,j1,j2=1N(2Wj1(τ1)Wj2(τ2))2. (84)

Relation (79) is now a consequence of (83) and (84). □

Lemma B.6

logϖ(τk1)logϖ(τk2)=(ϖ(τk1)1)(ϖ(τk2)1)+o(τN). (85)

Proof. For notational simplicity, assume k1 = 1 and k2 = 2. Let

S1=logϖ(τ1)logϖ(τ2)logϖ(τ1)logϖ(τ2)1{min{ϖ(τ1),ϖ(τ2)}>r},S2=logϖ(τ1)logϖ(τ2)1{min{ϖ(τ1),ϖ(τ2)}>r}(ϖ(τ1)1)(ϖ(τ2)1)1{min{ϖ(τ1),ϖ(τ2)}>r},S3=(ϖ(τ1)1)(ϖ(τ2)1)1{min{ϖ(τ1),ϖ(τ2)}>r}(ϖ(τ1)1)(ϖ(τ2)1).

Note that

logϖ(τ1)logϖ(τ2)=S1+S2+S3.

Therefore, establishing (85) is equivalent to showing that S1+S2+S3=o(τN). It suffices to show that

max{|S1|,|S2|,|S3|}=o(τN). (86)

Let 0 < r < 1/2. We start off with S2 by writing out the almost sure Taylor expansion

logϖ(τ)1{ϖ(τ)>r}={(ϖ(τ)1)12(ϖ(τ)1σ+2(ϖ(τ)))2}1{ϖ(τ)>r}, (87)

where σ+2(ϖ(τ))[min{ϖ(τ),1},max{ϖ(τ),1}]. Then,

logϖ(τ1)logϖ(τ2)1{min{ϖ(τ1),ϖ(τ2)}>r}=(ϖ(τ1)1)(ϖ(τ2)1)1{min{ϖ(τ1),ϖ(τ2)}>r}12(ϖ(τ1)1)(ϖ(τ2)1σ+2(ϖ(τ2)))21{min{ϖ(τ1),ϖ(τ2)}>r}12(ϖ(τ1)1σ+2(ϖ(τ1)))2(ϖ(τ2)1)1{min{ϖ(τ1),ϖ(τ2)}>r}+14(ϖ(τ1)1σ+2(ϖ(τ1)))2(ϖ(τ2)1σ+2(ϖ(τ2)))21{min{ϖ(τ1),ϖ(τ2)}>r} (88)

The second, third and fourth terms can be bounded by a similar argument, so we only develop the latter. Recast

(ϖ(τ)1σ+2(ϖ(τ)))21{ϖ(τ)>r}=(ϖ(τ)1σ+2(ϖ(τ)))2(1{1/2>ϖ(τ)>r}+1{ϖ(τ)1/2})(ϖ(τ)1r)21{1/2>ϖ(τ)>r}+(ϖ(τ)11/2)21{ϖ(τ)1/2}. (89)

Therefore, we can rewrite the fourth term in (88) as

(ϖ(τ1)1σ+2(ϖ(τ1)))2(ϖ(τ2)1σ+2(ϖ(τ2)))21{min{ϖ(τ1),ϖ(τ2)}>r}1r4(ϖ(τ1)1)21{1/2>ϖ(τ1)>r}(ϖ(τ2)1)21{1/2>ϖ(τ2)>r}+1(r/2)2(ϖ(τ1)1)21{ϖ(τ1)1/2}(ϖ(τ2)1)21{1/2>ϖ(τ2)>r}+1(r/2)2(ϖ(τ1)1)21{1/2>ϖ(τ2)>r}(ϖ(τ2)1)21{ϖ(τ2)1/2}+1(1/2)4(ϖ(τ1)1)21{ϖ(τ2)1/2}(ϖ(τ2)1)21{ϖ(τ2)1/2} (90)

By (78), the fourth term in (90) is bounded by

O[(τN)2]. (91)

By the Cauchy-Schwarz inequality, (73) and (78), the first term in the sum (90) is bounded by

1r4(ϖ(τ1)1)4(ϖ(τ1)1)41{1/2>ϖ(τ1)>r}1{1/2>ϖ(τ2)>r}1r4O(τN)(1/2>ϖ(τ1)>r)(1/2>ϖ(τ2)>r)1r4O(τN)exp{C(Nτ)1ξ}=o(τN). (92)

Again by the Cauchy-Schwarz inequality, (73) and (78), the second term in the sum (90) is bounded by

4r2(ϖ(τ1)1)4(ϖ(τ1)1)41{ϖ(τ1)1/2}1{1/2>ϖ(τ2)>r}4r2O(τN)(ϖ(τ1)1/2)(1/2>ϖ(τ2)>r)4r2O(τN)exp{C(Nτ)1ξ}=o(τN). (93)

An analogous bound holds for the third term in the sum (90). Therefore, |S2| is bounded by the right-hand side of (86). To tackle S3, rewrite it as

(ϖ(τ1)1)(ϖ(τ2)1)(1{ϖ(τ1)>r}1{ϖ(τ2)r}+1{ϖ(τ1)r}1{ϖ(τ2)>r}+1{ϖ(τ1)r}1{ϖ(τ2)r}). (94)

By the Cauchy-Schwarz inequality, (73) and (78), the first term on the right-hand side of (94) is bounded by

(ϖ(τ1)1)2(ϖ(τ2)1)2(ϖ(τ2)r)O(τN)exp{C(Nτ)1ξ}=o(τN).

Similar bounds hold for the remaining terms on the right-hand side of (94). Therefore, |S3| is also bounded by the right-hand side of (86). As for S1, it can be reexpressed as

logϖ(τ1)logϖ(τ2)(1{ϖ(τ1)>r}1{ϖ(τ2)r}+1{ϖ(τ1)r}1{ϖ(τ2)>r}+1{ϖ(τ1)r}1{ϖ(τ2)r}). (95)

Note that, by Lemma B.3, log4ϖ(τ) is bounded. Then, by applying the Cauchy-Schwarz inequality twice, the first term on the right-hand side of (95) is bounded by

log2ϖ(τ1)log2ϖ(τ2)(ϖ(τ2)r)(log4ϖ(τ1)log4ϖ(τ2))1/4exp{C(Nτ)1ξ}=o(τN).

Similar bounds hold for the remaining terms on the right-hand side of (95). Therefore, |S1| is bounded by the right-hand side of (86). This shows (85). □

Lemma B.7

logϖ(τ)+12(ϖ(τ)1)2=O(τN). (96)

Proof. Fix 0 < r < 1/2. Let

T1=logϖ(τ)logϖ(τ)1{ϖ(τ)>r},T2=logϖ(τ)1{ϖ(τ)>r}+12(ϖ(τ)1)21{ϖ(τ)>r},T3=12(ϖ(τ)1)212(ϖ(τ)1)21{ϖ(τ)>r}.

Recall that, by Lemma B.3, log2ϖ(τ) is bounded. Thus, by the Cauchy-Schwarz inequality and by Lemma B.2,

T1=logϖ(τ)1{ϖ(τ)r}log2ϖ(τ)(ϖ(τ)r)log2ϖ(τ)exp{C(Nτ)1ξ}=o(τN). (97)

By a similar reasoning, we can further prove that

T3=o(τN). (98)

Now, we turn to T2. By an almost sure Taylor expansion,

logϖ(τ)1{ϖ(τ)>r}={(ϖ(τ)1)12(ϖ(τ)1)2+13(ϖ1σ+2(ϖ))3}1{ϖ(τ)>r},

where σ+2(ϖ(τ))[min{ϖ(τ),1},max{ϖ(τ),1}]. Then, T2 is bounded by

|(ϖ(τ)1)1{ϖ(τ)>r}|+13|(ϖ1σ+2(ϖ))31{ϖ(τ)>r}|. (99)

Since ϖ(τ)1=0, by the Cauchy-Schwarz inequality and Lemmas B.1 and B.2, the first term in (99) can be bounded by

|ϖ(τ)1)1{ϖ(τ)>r}|=|(ϖ(τ)1)1{ϖ(τ)>r}ϖ(τ)1|=|(ϖ(τ)1)1{ϖ(τ)r}|(ϖ(τ)1)2(ϖ(τ)r)=o(τN).

Meanwhile, by the Cauchy-Schwarz inequality and Lemmas B.2 and B.4, the second term in (99) is bounded by

13r3|(ϖ1)31{1/2>ϖ(τ)>r}|+13(1/2)3|(ϖ1)31{ϖ(τ)1/2}|O(τN) (100)

Thus,

T2=O(τN). (101)

Relations (97), (98) and (101) imply (96). □

C. Bias and variance of Estand and the asymptotic distribution of E

We are now in a position to prove Theorems C.1 and C.2 and Proposition C.1, which give, respectively, asymptotically valid characterizations of the bias and variance involved in TAMSD-based estimation, and the asymptotic distribution of the standardized estimator (35).

The proof of Theorem C.1 is a consequence of a Taylor expansion, followed by using estimates of the decay of TAMSD moments. Constructing the latter requires using a concentration inequality (e.g., [48, 5]), which was done in Section B.

Theorem C.1

For 0 < α < 3/2, under the assumptions of Theorem A.1, (17) holds.

Proof. The left-hand side of (17) can be rewritten as

logMN(τ)X2(τ)+logX2(τ)σ2τα=logϖ(τ)+logX2(τ)σ2τα. (102)

By Proposition 1 in [20], we can rewrite the second sum term on the right-hand side of (102) as

log(1+O(τδ))=O(τδ),N.

By Lemmas B.5 and B.7, we can recast the first sum term on the right-hand side of (102) as

12(ϖ(τ)1)2+O(τN)=14Ni=N+1N1(1|i|N){|iτ+1|α2|iτ|α+|iτ1|α}2+O(τδ)+O(τN).

Thus, (17) follows. □

Next, the proof of Theorem C.2 relies on Taylor expansions of the moments of the logarithm of the TAMSD.

Theorem C.2

For 0 < α < 3/2, under the assumptions of Theorem A.1, expression (25) holds. Proof. For k1, k2 = 1, …, m, rewrite

vk1,k2=Cov(logMN(τk1),logMN(τk2))=[logMN(τk1)logMN(τk1)][logMN(τk1)logMN(τk1)]=[logϖ(τk1)logϖ(τk1)][logϖ(τk2)logϖ(τk2)]=logϖ(τk1)logϖ(τk2)logϖ(τk1)logϖ(τk2). (103)

By Lemmas B.1 and B.7,

logϖ(τk1)=O(τN).

Therefore, (103) can be reexpressed as

logϖ(τk1)logϖ(τk2)+o(τN). (104)

By Lemmas B.5, B.6 and C.1 (expression (121)), expression (25) holds. □

The proof of Proposition C.1 builds upon Taylor expansions and characterizing the asymptotic behavior of the standardization term in the definition of the estimator (35).

Proposition C.1

Under the assumptions of Theorem A.1, suppose 0 < α < 3/2. Then, the estimator (35) satisfies

Λ1/2(Astand)(Eξ)dN(0,I),N, (105)

where the vector ξ is given by (1). In particular, the estimator is consistent, i.e,

EPξ.

Proof. Recast the estimator (35) as

Z=(XTΥ1(Astand)X)1/2(XTΥ1(Astand)X)1XTΥ1(Astand)y. (106)

Rewrite Υ(α) = Υ(α, N) as to express the dependence of the latter matrix on N. Define

(NτΥ(Astand,N))1=:S(Astand,N)=(sk1,k2(Astand,N))k1,k2=1,,m, (107)
sN(Astand)=k1=1mk2=1msk1,k2(Astand,N).

By (23), (25) and (121), we can write

S(Astand,N)PS(α)=(sk1,k2(α))k1,k2=1,,m,N, (108)

and

s(α):=k1=1mk2=1msk1,k2(α) (109)

for some constant matrix S(α). For notational simplicity, write sN = sN(Astand, N) and sk1,k2(N)=sk1,k2(Astand,N). Then,

XTS(ξ)X=(sNk1=1mk2=1m(logτk1)sk1,k2(N)k1=1mk2=1m(logτk1)sk1,k2(N)k1=1mk2=1m(logτk1logτk2)sk1,k2(N)).

By a simple calculation and relation (108),

cw(N):=det(XTS(Astand,N)X)=sNk1=1mk2=1mlogwk1logwk2sk1,k2(N)(k1=1mk2=1mlogwk1sk1,k2(N))2Ps(α)k1=1mk2=1mlogwk1logwk2sk1,k2(α)(k1=1mk2=1mlogwk1sk1,k2(α))2=cw(α).

Moreover, by (107),

(XTΥ1(Astand,N)X)1XTΥ1(Astand,N)=(XTS(Astand,N)X)1XTS(Astand,N) (110)
=1cw(N)(logτ{k1=1mk2=1mlogwk1sk1,k2(N)(k=1msk,j(N))sNk=1mlogwksk,j(N)}+(k1=1mk2=1mlogwk1logwk2sk1,k2(N)k=1msk,j(N)k1=1mk2=1mlogwk1sk1,k2(N)k=1mlogwksk,j(N))sNk=1mlogwksk,j(N)k1=1mk2=1mlogwk1sk1,k2(N)(k=1msk,j(N)))j=1,,m=:(logτan,j+bn,jan,j)j=1,,m, (111)

where the sequences of constants {an,j}N∈ℕ and {bn,j}N∈ℕ, converge to constants aj and bj, respectively, for j = 1, …, m.

Recall that, for a symmetric positive definite matrix

M=(m11m12m12m22),

we can write its square root in closed form as

M1/2=1tr(M)+2det(M)(m11+det(M)m12m12m22+det(M)).

Therefore,

(XTS(Astand)X)1/2=1tr(XTS(Astand)X)+2cw(N)(sN+cw(N)k1=1mk2=1m(logτk1)sk1,k2(N)k1=1mk2=1m(logτk1)sk1,k2(N)k1=1mk2=1m(logτk1logτk2)sk1,k2(N)+cw(N)). (112)

Note that

tr(XTS(Astand)X)+2cw(N)log2τsN. (113)

By expressions (108), (109), (111), (112) and (113),

(XTΥ1(Astand,N)X)1/2(XTΥ1(Astand,N)X)1XTΥ1(Astand,N)=Nτ(XTS(Astand,N)X)1/2(XTS(Astand,N)X)1XTS(Astand,N)=Nτ1tr(XTS(Astand,N)X)+2cw(N)1cw(N)(an,j[(logτ)(sN+cw(N))k1=1mk2=1m(logτk1)sk1k2(N)]+bn,j(sN+cw(N))an,j[(logτ)k1=1mk2=1m(logτk1)sk1k2(N)k1=1mk2=1m(logτk1logτk2)sk1k2(N)cw(N)]+bn,j[k1=1mk2=1m(logτk1)sk1k2(N)])j=1,,m=Nτ1tr(XTS(Astand,N)X)+2cw(N)1cw(N)(an,j[(logτ)cw(N)k1=1mk2=1m(logwk1)sk1k2(N)]+bn,j(sN+cw(N))an,j[(logτ)k1=1mk2=1m(logwk1)sk1k2(N)k1=1mk2=1m(logwk1logwk2)sk1k2(N)cw(N)]+bn,j[(logτ)sN+k1=1mk2=1m(logwk1)sk1k2(N)])j=1,,mPNτ1s(α)1cw(α)(ajcw(α)bjs(α)aj[k1=1mk2=1m(logwk1)sk1k2(α)])j=1,,m=:NτΨ2×m, (114)

as N → ∞. For y and X as in (30), rewrite the left-hand side of expression (105) as

(XTΥ1(Astand,N)X)1/2(Eξ)=(XTΥ1(Astand,N)X)1/2(XTΥ1(Astand,N)X)1XTΥ1(Astand,N)(yXξ). (115)

Recast

y=(log[MN(τk)eτkNβN(Astand,τk)])k=1,,m.

By entrywise first order Taylor expansions,

yXξ=(log(MN(τk)eτkNβN(Astand,τk)σ2τkα))k=1,,m=(MN(τk)eτNβN(Astand,τk)σ2τkα1)k=1,,m+(O(MN(τk)eτNβN(Astandτk)σ2τkα1)2)k=1,,m (116)

However, for k = 1, …, m, the first term on the right-hand side of (116) can be reexpressed as

MN(τk)σ2τkα(eτkNβN(Astand,τk)1)+MN(τk)σ2τkα1. (117)

Again by a first order Taylor expansion,

eτkNβN(Astand,τk)1=τkNβN(Astand,τk)+oP(τkN). (118)

Therefore, by (114), (117) and (118), we can rewrite relation (115) as

(Ψ+oP(1))Nτ(MN(τk)σ2τkα1)k=1,,m+oP(τN). (119)

Expression (105) is a consequence of (119) and Theorem A.1, where the estimator (106) is asymptotically standardized. □

The following lemma establishes the convergence of the main bias and variance factors and is used in the proofs of Proposition C.1 and Theorem C.2.

Lemma C.1

For 0 < α < 3/2, consider the main bias and variance factors (18) and (26), respectively, under the assumptions of Theorem A.1. Then, there are functions β(α,·) > 0 and ς(α,·,·) > 0 such that

βN(Astand,τk)P(β(α,τk))k=1,,m, (120)
(ζN(Astand,τk1,τk2))k1,k2=1,,mP(ζ(α,k1,k2))k1,k2=1,,m, (121)

as N → ∞.

Proof. From expression (25) for the variance term vk1,k2(ξ), recast

ζN(α,τk1,τk2)=12{|i|τ+τ+1|i|N1}(1|i|N){|iτwk1wk2+wk1wk2|α|iτwk1wk2+wk1wk2wk2wk1|α|iτwk1wk2|α+|iτwk1wk2wk2wk1|α}21τ. (122)

The second sum term on the right-hand side of (122) is bounded by

12(1τN)τ+1|i|N1{|iτwk1wk2+wk1wk2|α|iτwk1wk2+wk1wk2wk2wk1|α|iτwk1wk2|α+|iτwk1wk2wk2wk1|α}21τ.12(1τN)τ+1|y|n{|ywk1wk2+wk1wk2|α|ywk1wk2+wk1wk2wk2wk1|α|ywk1wk2|α+|ywk1wk2wk2wk1|α}2dy0, (123)

as N → ∞. On the other hand, by condition (53) and the dominated convergence theorem, the first sum term on the right-hand side of (122) converges to

ζ(α,k1,k2):=1211{|ywk1wk2+wk1wk2|α|ywk1wk2+wk1wk2wk2wk1|α|ywk1wk2|α+|ywk1wk2wk2wk1|α}2dy>0. (124)

Moreover, by Corollary A.1, AstandPα. So, pick a small enough ϵ0 such that α ∈ (ϵ0, 3/2 − ϵ0). Let A={ω:Astand(ω)(αϵ02,α+ϵ02)}. In the set A, by a simple adaptation of the argument leading to (123) and the convergence to (124),

ζN(Astand,τk1,τk2)ζ(α,k1,k2),N, (125)

where ℙ(A) → 1. This shows (121). A similar argument can be used to show (120). □

D. Pseudocode for generating the improved TAMSD-based estimator E

Generating the improved pathwise estimator E (see (29))

Input:
• one observed particle path {X1, X2, …, XN}N∈ℕ of length N;
• regression lag values τk, k = 1, …, m (typically, τk = τwk, w1 < … < wm, τN);
• the expression for the asymptotic covariance matrix Υ(α) as a function of α;
Step 1: obtain the standard estimator Astand over the chosen lag values;
Step 2: estimate the asymptotic covariance matrix Υ(ξ) by means of Υ(Astand) (see (27));
Step 3: use Astand and the estimator (19) of the bias vector to produce the bias-corrected regression system (28);
Step 4: obtain the estimator E by means of Υ(Astand)-based GLS on the bias-corrected regression system (28).

Acknowledgments

D.B.H. was partially supported by the awards DMS 1462992 (National Science Foundation), AI1 12029 and HL 108808 (National Institutes of Health), and Hill16XX0 (Cystic Fibrosis Foundation). G.D. was partially supported by the prime award no. W911NF-14–1-0475 from the Biomathematics subdivision of the Army Research Office, USA. The authors would like to thank John Fricks for his suggestions and comments on this paper.

References

  • [1].Andreanov A and Grebenkov D. Time-averaged MSD of Brownian motion. Journal of Statistical Mechanics: Theory and Experiment, 2012(07):P07001, 2012. [Google Scholar]
  • [2].Barkai E, Garini Y, and Metzler R. Strange kinetics of single molecules in living cells. Physics Today, 65(8):29–35, 2012. [Google Scholar]
  • [3].Berglund AJ. Statistics of camera-based single-particle tracking. Physical Review E, 82(1):011917, 2010. [DOI] [PubMed] [Google Scholar]
  • [4].Bertseva E, Grebenkov DS, Schmidhauser P, Gribkova S, Jeney S, and Forró L. Optical trapping microrheology in cultured human cells. European Physical Journal E, 35(7):63, 2012. [DOI] [PubMed] [Google Scholar]
  • [5].Boucheron S, Lugosi G, and Massart P. Concentration inequalities: a nonasymptotic theory of independence. Oxford University Press, 2013. [Google Scholar]
  • [6].Boyer D, Dean DS, Mejía-Monasterio C, and Oshanin G. Optimal estimates of the diffusion coefficient of a single Brownian trajectory. Physical Review E, 85(3):031136, 2012. [DOI] [PubMed] [Google Scholar]
  • [7].Boyer D, Dean DS, Mejía-Monasterio C, and Oshanin G. Distribution of the leastsquares estimators of a single Brownian trajectory diffusion coefficient. Journal of Statistical Mechanics: Theory and Experiment, 2013(04):P04017, 2013. [Google Scholar]
  • [8].Briane V, Kervrann C, and Vimond M. A statistical analysis of particle trajectories in living cells. arXiv:1707.01838, pages 1–38, 2017. [DOI] [PubMed]
  • [9].Burnecki K. FARIMA processes with application to biophysical data. Journal of Statistical Mechanics: Theory and Experiment, 2012(05):P05015, 2012. [Google Scholar]
  • [10].Burnecki K, Kepten E, Garini Y, Sikora G, and Weron A. Estimating the anomalous diffusion exponent for single particle tracking data with measurement errors-an alternative approach. Scientific Reports, 5(11306):1–11, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Burnecki K, Kepten E, Janczura J, Bronshtein I, Garini Y, and Weron A. Universal algorithm for identification of fractional Brownian motion. a case of telomere subdiffusion. Biophysical Journal, 103(9):1839–1847, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Burnecki K, Muszkieta M, Sikora G, and Weron A. Statistical modelling of subdiffusive dynamics in the cytoplasm of living cells: a FARIMA approach. Europhysics Letters, 98(1):10004, 2012. [Google Scholar]
  • [13].Burov S, Jeon J-H, Metzler R, and Barkai E. Single particle tracking in systems showing anomalous diffusion: the role of weak ergodicity breaking. Physical Chemistry Chemical Physics, 13(5):1800–1812, 2011. [DOI] [PubMed] [Google Scholar]
  • [14].Cheridito P, Kawaguchi H, and Maejima M. Fractional Ornstein-Uhlenbeck processes. Electronic Journal of Probability, 8(3):1–14, 2003. [Google Scholar]
  • [15].Christensen R. Plane answers to complex questions: the theory of linear models. Springer Science & Business Media, 2011. [Google Scholar]
  • [16].Dawson M, Wirtz D, and Hanes J. Enhanced viscoelasticity of human cystic fibrotic sputum correlates with increasing microheterogeneity in particle transport. Journal of Biological Chemistry, 278(50):50393–50401, 2003. [DOI] [PubMed] [Google Scholar]
  • [17].Deng W and Barkai E. Ergodic properties of fractional Brownian-Langevin motion. Physical Review E, 79(1):011112, 2009. [DOI] [PubMed] [Google Scholar]
  • [18].Didier G and Fricks J. On the wavelet-based simulation of anomalous diffusion. Journal of Statistical Computation and Simulation, 84(4):697–723, 2014. [Google Scholar]
  • [19].Didier G, McKinley SA, Hill DB, and Fricks J. Statistical challenges in microrheology. Journal of Time Series Analysis, 33(55):724–743, September 2012. [Google Scholar]
  • [20].Didier G and Zhang K. The asymptotic distribution of the pathwise mean squared displacement in single particle tracking experiments. Journal of Time Series Analysis, 38(3):395–416, May 2017. [Google Scholar]
  • [21].Dobrushin R and Major P. Non-central limit theorems for non-linear functional of Gaussian fields. Probability Theory and Related Fields, 50(1):27–52, 1979. [Google Scholar]
  • [22].Gajda J, Wyłomańska A, Kantz H, Chechkin AV, and Sikora G. Large deviations of time-averaged statistics for Gaussian processes. Statistics & Probability Letters, 2018. [Google Scholar]
  • [23].Ghosh SK, Cherstvy AG, Grebenkov DS, and Metzler R. Anomalous, non-Gaussian tracer diffusion in crowded two-dimensional environments. New Journal of Physics, 18(1):013027, 2016. [Google Scholar]
  • [24].Giraitis L and Surgailis D. Clt and other limit theorems for functionals of Gaussian processes. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 70(2):191, 1985. [Google Scholar]
  • [25].Giraitis L and Surgailis D. A central limit theorem for quadratic forms in strongly dependent linear variables and its application to asymptotical normality of Whittle’s estimate. Probability Theory and Related Fields, 86(1):87–104, 1990. [Google Scholar]
  • [26].Gorman J and Greene EC. Visualizing one-dimensional diffusion of proteins along DNA. Nat. Struct. Mol. Biol, 15(8):768–774, 2008. [DOI] [PubMed] [Google Scholar]
  • [27].Grebenkov D. Probability distribution of the time-averaged mean-square displacement of a Gaussian process. Physical Review E, 84(3):031124, 2011. [DOI] [PubMed] [Google Scholar]
  • [28].Grebenkov D. Time-averaged quadratic functionals of a Gaussian process. Physical Review E, 83(6):061117, 2011. [DOI] [PubMed] [Google Scholar]
  • [29].Grebenkov D. Optimal and suboptimal quadratic forms for noncentered Gaussian processes. Physical Review E, 88(3):032140, 2013. [DOI] [PubMed] [Google Scholar]
  • [30].Grebenkov DS, Vahabi M, Bertseva E, Forró L, and Jeney S. Hydrodynamic and subdiffusive motion of tracers in a viscoelastic medium. Physical Review E, 88(4):040701, 2013. [DOI] [PubMed] [Google Scholar]
  • [31].Guyon Xavier and León José. Convergence en loi des H-variations d’un processus Gaussien stationnaire sur R. Annales de l’IHP: Probabilités et Statistiques, 25(3):265–282, 1989. [Google Scholar]
  • [32].Halford SE and Marko JF. How do site-specific DNA-binding proteins find their targets? Nucleic Acids Res., 32(10):3040–3052, 2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Helenius J, Brouhard G, Kalaidzidis Y, Diez S, and Howard J. The depolymerizing kinesin MCAK uses lattice diffusion to rapidly target microtubule ends. Nature, 441(7089):115–119, 2006. [DOI] [PubMed] [Google Scholar]
  • [34].Hess ST, Girirajan TPK, and Mason MD. Ultra-high resolution imaging by fluorescence photoactivation localization microscopy. Biophysical Journal, 91(11):4258–4272, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Hill DB, Vasquez PA, Mellnik J, McKinley SA, Vose A, Mu F, Henderson AG, Donaldson SH, Alexis NE, Boucher RC, and Forest MG. A biophysical basis for mucus solids concentration as a candidate biomarker for airways disease. PloS one, 9(2):e87681, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Hozé N and Hochman D. Statistical methods for large ensembles of super-resolution stochastic single particle trajectories in cell biology. Annual Review of Statistics and its Application, 4:189–223, 2017. [Google Scholar]
  • [37].Isserlis L. On certain probable errors and correlation coefficients of multiple frequency distributions with skew regression. Biometrika, 11:185190, 1916. [Google Scholar]
  • [38].Jeon J-H, Barkai E, and Metzler R. Noisy continuous time random walks. Journal of Chemical Physics, 139(12):121916, 2013. [DOI] [PubMed] [Google Scholar]
  • [39].Jeon J-H and Metzler R. Analysis of short subdiffusive time series: scatter of the time-averaged mean-squared displacement. Journal of Physics A: Mathematical and Theoretical, 43(25):252001, 2010. [Google Scholar]
  • [40].Kepten E, Bronshtein I, and Garini Y. Improved estimation of anomalous diffusion exponents in single-particle tracking experiments. Physical Review E, 87(5):052713, 2013. [DOI] [PubMed] [Google Scholar]
  • [41].Kepten E, Weron A, Sikora G, Burnecki K, and Garini Y. Correlated continuous time random walks. Statistics and Probability Letters, 79:1194–1202, 2009. [Google Scholar]
  • [42].Kepten E, Weron A, Sikora G, Burnecki K, and Garini Y. Guidelines for the fitting of anomalous diffusion mean square displacement graphs from single particle tracking experiments. PLoS One, 10(2):e0117722, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Kou SC. Stochastic modeling in nanoscale biophysics: subdiffusion within proteins. Annals of Applied Statistics, 2(2):501–535, 2008. [Google Scholar]
  • [44].Kou SC and Xie XS. Generalized Langevin equation with fractional Gaussian noise: subdiffusion within a single protein molecule. Physical Review Letters, 93(18):180603, 2004. [DOI] [PubMed] [Google Scholar]
  • [45].Lai SK, Wang YY, Cone R, Wirtz D, and Hanes J. Altering mucus rheology to solidify human mucus at the nanoscale. PLoS One, 4(1):e4294, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Lasne D, Blab GA, Berciaud S, Heine M, Groc L, Choquet D, Cognet L, and Lounis B. Single nanoparticle photothermal tracking (SNaPT) of 5-nm gold beads in live cells. Biophysical Journal, 91(12):4598–4604, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Laurent B and Massart P. Adaptive estimation of a quadratic functional by model selection. Annals of Statistics, pages 1302–1338, 2000. [Google Scholar]
  • [48].Ledoux M. The concentration of measure phenomenon. Number 89. American Mathematical Society, 2005. [Google Scholar]
  • [49].Levine AJ and TC Lubensky. One-and two-particle microrheology. Physical Review Letters, 85(8):1774–1777, 2000. [DOI] [PubMed] [Google Scholar]
  • [50].Lieleg O, Vladescu I, and Ribbeck K. Characterization of particle translocation through mucin hydrogels. Biophysical Journal, 98(9):1782, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [51].Lysy M, Pillai N, Hill DB, Forest MG, Mellnik J, Vasquez P, and McKinley SA. Model comparison for single particle tracking in biological fluids. To appear in Journal of the American Statistical Association, pages 1–44, 2016. [Google Scholar]
  • [52].Major P. Limit theorems for non-linear functionals of Gaussian sequences. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 57(1):129–158, 1981. [Google Scholar]
  • [53].Mason TG and Weitz DA. Optical measurements of the linear viscoelastic moduli of complex fluids. Physical Review Letters, 74:1250–1253, 1995. [DOI] [PubMed] [Google Scholar]
  • [54].Matsui H, Wagner VE, Hill DB, Schwab UE, Rogers TD, Button B, Taylor RM, Superfine R, Rubinstein M, Iglewski BH, and Boucher RC. A physical linkage between cystic fibrosis airway surface dehydration and Pseudomonas aeruginosa biofilms. Proceedings of the National Academy of Sciences, 103(48):18131, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [55].Meerschaert M and Scheffler H-P. Limit theorems for continuous-time random walks with infinite mean waiting times. Journal of Applied Probability, 41:623–638, 2004. [Google Scholar]
  • [56].Mellnik JWR, Lysy M, Vasquez PA, Pillai NS, Hill DB, Cribb J, McKinley SA, and Forest MG. Maximum likelihood estimation for single particle, passive microrheology data with drift. Journal of Rheology, 60(3):379–392, 2016. [Google Scholar]
  • [57].Meroz Y and Sokolov IM. A toolbox for determining subdiffusive mechanisms. Physics Reports, 573:1–29, 2015. [Google Scholar]
  • [58].Metzler R, Jeon J-H, and Cherstvy AG. Non-Brownian diffusion in lipid membranes: experiments and simulations. Biochimica et Biophysica Acta, 1858(10):2451–2467, 2016. [DOI] [PubMed] [Google Scholar]
  • [59].Metzler R, Tejedor V, Jeon JH, He Y, Deng WH, Burov S, and Barkai E. Analysis of single particle trajectories: from normal to anomalous diffusion. Acta Physica Polonica B, 40(5):1315–1331, 2009. [Google Scholar]
  • [60].Michalet X and Berglund AJ. Optimal diffusion coefficient estimation in single-particle tracking. Physical Review E, 85(6):061916, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Minoura I, Katayama E, Sekimoto K, and Muto E. One-dimensional Brownian motion of charged nanoparticles along microtubules: a model system for weak binding interactions. Biophysical Journal, 98(8):1589–1597, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Moulines E, Roueff F, and Taqqu MS. Central limit theorem for the log-regression wavelet estimation of the memory parameter in the Gaussian semi-parametric context. Fractals, 15(04):301–313, 2007. [Google Scholar]
  • [63].Moulines E, Roueff F, and Taqqu MS. On the spectral density of the wavelet coefficients of long-memory time series with application to the log-regression estimation of the memory parameter. Journal of Time Series Analysis, 28(2):155–187, 2007. [Google Scholar]
  • [64].Moulines E, Roueff F, and Taqqu MS. A wavelet Whittle estimator of the memory parameter of a nonstationary Gaussian time series. Annals of Statistics, pages 1925–1956, 2008. [Google Scholar]
  • [65].Nandi A, Heinrich D, and Lindner B. Distributions of diffusion measures from a local mean-square displacement analysis. Physical Review E, 86(2):021926, 2012. [DOI] [PubMed] [Google Scholar]
  • [66].Nguyen HD and McKinley SA. Anomalous diffusion and the generalized Langevin equation. https://arxiv.org/abs/1711.00560, pages 1–40, 2017.
  • [67].Nishimura SY, Lord SJ, Klein LO, Willets KA, He M, Lu Z, Twieg RJ, and Moerner WE. Diffusion of lipid-like single-molecule fluorophores in the cell membrane. J. Phys. Chem. B, 110(15):8151–8157, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Ottobre M and Pavliotis G. Asymptotic analysis for the generalized Langevin equation. Nonlinearity, 24(5):1629, 2011. [Google Scholar]
  • [69].Pipiras V and Taqqu MS. Long-Range Dependence and Self-Similarity Cambridge Series on Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge, United Kingdom, 2017. [Google Scholar]
  • [70].Prakasa Rao BLS. Statistical Inference for Fractional Diffusion Processes. Wiley Series in Probability and Statistics, 2010. [Google Scholar]
  • [71].Qian H, Sheetz M, and Elson E. Single particle tracking. Analysis of diffusion and flow in two-dimensional systems. Biophysical Journal, 60(4):910–921, 1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [72].Reighard KP, Ehre C, Rushton ZL, Ahonen MJR, Hill DB, and Schoenfisch MH. Role of nitric oxide-releasing chitosan oligosaccharides on mucus viscoelasticity. ACS Biomaterials Science & Engineering, 3(6):1017–1026, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [73].Reighard KP, Hill DB, Dixon GA, Worley BV, and Schoenfisch MH. Disruption and eradication of P. aeruginosa biofilms using nitric oxide-releasing chitosan oligosaccharides. Biofouling, 31(9–10):775–787, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Rosenblatt M. Independence and dependence. In Proceedings of the 4th Berkeley symposium on mathematical statistics and probability, volume 2, pages 431–443, 1961. [Google Scholar]
  • [75].Sandev T, Metzler R, and Tomovski Ž. Velocity and displacement correlation functions for fractional generalized Langevin equations. Fractional Calculus and Applied Analysis, 15(3):426–450, 2012. [Google Scholar]
  • [76].Saxton MJ. Anomalous diffusion due to obstacles: a monte carlo study. Biophysical Journal, 66(2):394–401, 1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [77].Saxton MJ. Anomalous diffusion due to binding: a monte carlo study. Biophysical Journal, 70(3):1250–1262, 1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Shiryaev AN. Probability Theory. Springer-Verlag, New York, 2000. [Google Scholar]
  • [79].Sikora G, Teuerle M, Wyłomańska A, and Grebenkov D. Statistical properties of the anomalous scaling exponent estimator based on time-averaged mean-square displacement. Physical Review E, 96(2):022132, 2017. [DOI] [PubMed] [Google Scholar]
  • [80].Smith MB, Karatekin E, Gohlke A, Mizuno H, Watanabe N, and Vavylonis D. Interactive, computer-assisted tracking of speckle trajectories in fluorescence microscopy: application to actin polymerization and membrane fusion. Biophysical Journal, 101(7):1794–1804, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Sokolov IM. Statistics and the single molecule. Physics, 1:8, 2008. [Google Scholar]
  • [82].Sonesson AW, Elofsson UM, Callisen TH, and Brismar H. Tracking single lipase molecules on a trimyristin substrate surface using quantum dots. Langmuir, 23(16):8352–8356, 2007. [DOI] [PubMed] [Google Scholar]
  • [83].Suh J, Dawson M, and Hanes J. Real-time multiple-particle tracking: applications to drug and gene delivery. Advanced Drug Delivery Reviews, 57:63–78, 2005. [DOI] [PubMed] [Google Scholar]
  • [84].Tafvizi A, Mirny LA, and van Oijen AM. Dancing on DNA: kinetic aspects of search processes on DNA. Chem. Phys. Chem, 12(8):1481–1489, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [85].Taqqu MS. Weak convergence to fractional Brownian motion and to the Rosenblatt process. Probability Theory and Related Fields, 31(4):287–302, 1975. [Google Scholar]
  • [86].Taqqu MS. Convergence of integrated processes of arbitrary Hermite rank. Probability Theory and Related Fields, 50(1):53–83, 1979. [Google Scholar]
  • [87].Taqqu MS. Fractional Brownian motion and long range dependence In Theory and Applications of Long-Range Dependence (Doukhan P, Oppenheim G and Taqqu MS, eds.), pages 5–38. Birkhäuser, Boston, 2003. [Google Scholar]
  • [88].Taqqu MS. The Rosenblatt process. In The selected works of Murray Rosenblatt (Davis RA and Lii K-S and Politis DN, eds.), pages 29–45. Springer, 2011. [Google Scholar]
  • [89].Vale RD, Soll DR, and Gibbons IR. One-dimensional diffusion of microtubules bound to flagellar dynein. Cell, 59(5):915–925, 1989. [DOI] [PubMed] [Google Scholar]
  • [90].Valentine M, Kaplan P, Thota D, Crocker J, Gisler T, Prudomme R, Beck M, and Weitz DA. Investigating the microenvironments of inhomogeneous soft materials with multiple particle tracking. Physical Review E, 64(6):061506, 2001. [DOI] [PubMed] [Google Scholar]
  • [91].Veillette M and Taqqu MS. Properties and numerical evaluation of the rosenblatt distribution. Bernoulli, 19(3):982–1005, 2013. [Google Scholar]
  • [92].Veitch D and Abry P. A wavelet-based joint estimator of the parameters of long-range dependence. IEEE Transactions on Information Theory, 45(3):878–897, 1999. [Google Scholar]
  • [93].Vestergaard CL, Blainey PC, and Flyvbjerg H. Optimal estimation of diffusion coefficients from single-particle trajectories. Physical Review E, 89(2):022726, 2014. [DOI] [PubMed] [Google Scholar]
  • [94].Wendt H, Didier G, Combrexelle S, and Abry P. Multivariate Hadamard self-similarity: testing fractal connectivity. Physica D: Nonlinear Phenomena, 356–357:1–36, 2017. [Google Scholar]
  • [95].Wieser S and Schütz GJ. Tracking single molecules in the live cell plasma membrane – do’s and don’t’s. Methods, 46(2):131–140, 2008. [DOI] [PubMed] [Google Scholar]
  • [96].Zwanzig R. Nonequilibrium Statistical Mechanics. Oxford University Press, 2001. [Google Scholar]

RESOURCES