Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2021 Apr;42:100438. doi: 10.1016/j.spasta.2020.100438

Developments in statistical inference when assessing spatiotemporal disease clustering with the tau statistic

Timothy M Pollington a,c,, Michael J Tildesley b, T Déirdre Hollingsworth c,1, Lloyd AC Chapman d,1
PMCID: PMC7985614  PMID: 33816096

Abstract

The tau statisticτ uses geolocation and, usually, symptom onset time to assess global spatiotemporal clustering from epidemiological data. We test different methods that could bias the clustering range estimate based on the statistic or affect its apparent precision, by comparison with a baseline analysis of an open access measles dataset.

From re-analysing this data we find evidence against no clustering and no inhibition, p-value[0,0022] (global envelope test). We develop a tau-specific modification of the Loh & Stein spatial bootstrap sampling method, which gives bootstrap tau estimates with 24% lower sampling error and a 110% higher estimated clustering endpoint than previously published (61⋅0 m vs. 29 m) and an equivalent increase in the clustering area of elevated disease odds by 342%. These differences could have important consequences for control efforts.

Correct practice of graphical hypothesis testing of no clustering and clustering range estimation of the tau statistic are illustrated in the online Graphical abstract. We advocate proper implementation of this useful statistic, ultimately to reduce inaccuracies in control policy decisions made during disease clustering analysis.

Keywords: Second order dependence, Pointwise confidence interval, Bias corrected accelerated BCa, Percentile confidence interval, Spatial bootstrap, Graphical hypothesis test

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • Point estimation methods can heavily bias disease clustering range estimates.

  • A 110% radial clustering bias is amplified to 342% on an areal intervention scale.

  • A modified Loh & Stein spatial bootstrap loses less unique pair information.

  • Clustering endpoint estimates appear more precise using this modified bootstrap.

  • BCa CIs not percentile CIs are preferred for asymmetric bootstrap distributions.

1. Introduction

Assessing if spatiotemporal clustering is present and measuring its magnitude and range is informative for epidemiologists working to control infectious diseases. The tau statistic (Section 2) is more appropriate than most statistics for this task as it measures spatiotemporal rather than just spatial clustering, produces non-parametric estimates (without process assumptions) and, unlike the K function (Gabriel and Diggle, 2009), offers a relative magnitude in the difference of risk, rate or odds of disease versus the background level (Section 2.1) (Lessler et al., 2016, Pollington et al., 2019a). The tau statistic herein should not be confused with ‘Kendall’s tau statistic/rank correlation coefficient’ (Bland, 2000). This study is motivated by a review of its use that found that its current implementation inflates type I errors (incorrectly rejecting a true null hypothesis) when testing for no clustering and no inhibition, and may bias estimates of the range of clustering (Pollington et al., 2019a).

We investigate these aspects by analysing a well-studied open access measles dataset containing household geolocations and symptom onset times of cases (Section 3.1). It represents a spatially discrete process since infection is only recorded and can only occur at discrete household locations, so the (statistical) support is not spatially continuous (Diggle et al., 2010).

We adopt an ordered approach: we first test for no clustering and no inhibition (Section 3.3) and then, conditional on finding evidence against this null hypothesis, we estimate the clustering endpoint and its first sampling error estimate (Section 3.4; online Graphical abstract). This approach is contrary to the current methods applied to the tau statistic and similar statistics (Pollington et al., 2019a), which incorrectly combine graphical hypothesis testing for no clustering and estimation of the clustering range (Section 3.2). We hope these improved methods will encourage proper application of this burgeoning statistic.

2. The tau statistic

The tau statistic τ is a non-parametric global clustering statistic which takes a disease frequency measure (risk, odds or rate) within a certain annulus around a case and compares it to the background measure (at any distance) and averaged over all cases (Salje et al., 2012, Lessler et al., 2016, Pollington et al., 2019a). It measures the tendency of case pairs to spatially cluster while implicitly accounting for how related they are in terms of transmission using temporal information, making it a spatiotemporal statistic.

2.1. Tau statistic (odds ratio estimator)

We describe the most common tau estimator τˆodds, which is based on the relative odds of disease (Lessler et al., 2016), rather than other forms of the statistic (including a new rate ratio estimator), which are described in a detailed review (Pollington et al., 2019a) from which this subsection draws heavily.

The distance form of the tau statistic τodds is the ratio of (i) the odds θ(dl,dm) of finding any case j that is ‘related’ to any case i, within a half-closed annulus [dl,dm), (l,mZ+, m=l+1), around case i, to (ii) the odds θ(0,) of finding any case j related to any case i at any distance separation (dij0) for n total cases (Eq. (1) & Fig. 1).

τˆodds(dl,dm)θˆ(dl,dm)θˆ(0,) where θˆ(dl,dm)=i=1nj=1,jin1(zij=1,dldij<dm)i=1nj=1,jin1(zij=0,dldij<dm) (1)

The odds estimate θˆ in Eq. (1) is the ratio of the number of related case pairs (zij=1) within [dl,dm), versus the number of unrelated case pairs (zij=0) within [dl,dm). The main computation is effectively a double sum over ‘relatedness’ indicator functions 1() for case pairs.

Fig. 1.

Fig. 1

A single distance band [dl,dm), the primary argument of the distance form of a tau estimator function: a half-closed annulus of radii dl,dm with a case j inside, around case i, separated by distance dij.

τˆ(dl,dm) is then evaluated over B total distance bands to give a distance band set Δ_{[dl,dm),l,mZ+,m=l+1,lB}. Tau values signify either the presence of spatiotemporal clustering (τ>1), no clustering and no inhibition (τ=1) or inhibition (τ<1). Sometimes an expanding disc is described by setting dl=0 and relabelling dm=d to give τˆ(d) instead. Although τˆ is strictly evaluated for a given distance band [dl,dm), when a τ-distance graph is drawn a value of τˆ(d) can be obtained through linearly interpolating between the distance band endpoints. The half-closed annulus is a correction to the original open interval (Lessler et al. (2016): appendix 5); it was incorporated in December 2018 into the IDSpatialStats R package (which calculates the tau statistic) (Giles et al., 2018, Giles et al., 2019).

The relatedness of a case pair zij is commonly determined using temporal information (e.g. difference in onset times of cases i and j, i.e. tjti) (Pollington et al., 2019a). The serial interval is the period between the onset times of symptoms in the infector ti and their infectee tj. Typically cases are defined as being temporally related when their onset times are within a single mean serial interval of each other.

τ

In the following sections (Sections 3, 4) we provide a descriptive analysis of the data, before systematically testing several aspects of the tau statistic’s implementation and their impact on the estimated clustering range and its sampling error.

3. Methods

3.1. The dataset and baseline analysis

We analyse an infectious disease dataset of measles from case households in Hagelloch, Germany in 1861 (Pfeilsticker, 1863, Oesterle, 1992, Neal and Roberts, 2004, Höhle et al., 2019). Computations were coded in R (R Core Team, 2019) with further detail in the Supplementary material. We have reproduced Lessler et al.’s (unpublished) analysis as a baseline result (Fig. 3); using their interpretation of Fig. 3, spatiotemporal clustering is reported up to 30 m (Lessler et al., 2016). As τ is a global statistic, ideally we would explore a distance band set covering the majority of pairwise distances (200 m). However here we restrict the plots to 120 m for diagnostic (Fig. 3) or point estimation (Figs. 5, 6, 8 & B.3) purposes, to be consistent with the baseline analysis.

Fig. 3.

Fig. 3

Baseline result: a reproduction of a previous analysis (Lessler et al., 2016 Fig. 4C); note their x-axis used the midpoint of the distance band rather than the endpoint. The superimposition of their envelope and ours validates our implementation of tau functions from their IDSpatialStatsR package (Giles et al., 2018). According to Lessler et al.’s convention, the end of the clustering range is where the lower bound of the envelope intersects τ=1 (Dˆbase=29 m). We do not endorse this convention however, and take the point where the point estimate line τˆ intersects (Dˆ=61 m). Additionally we generally recommend the use of BCa pointwise CIs for diagnostic plots such as this.

Their 29 m value was previously reported as 15 m due to a misinterpretation in the midpoint distance graph reading, as confirmed by Lessler (personal comm.)). Instead we use the endpoint so that a point (d,τ(d)) read off the graph pertains to the single distance band [dm1,dm=d), and more commonly [0,d) when dl=0. Whereas the midpoint requires an additional step by the reader that can easily introduce error.

Fig. 5.

Fig. 5

Point estimation: Effect of number of samples on Dˆ sampling error, when using RISB sampling. Both CIs used 100% of simulations. Dˆ=610 m; N=100: 95% percentile CI (29⋅0, 83⋅0 m); N=2500: CI (29⋅2, 83⋅5 m). Distance band set as Fig. 4.

Fig. 6.

Fig. 6

Point estimation: Effect of spatial bootstrap sampling method on Dˆ sampling error. RISB 95% BCa CI (29⋅3, 84⋅4 m); MMPSB CI (29⋅8, 71⋅8 m); both CIs used 100% of simulations. Distance band set as Fig. 4, Dˆ=610 m; N=2500.

Fig. 8.

Fig. 8

Point estimation: Effect of distance band set on Dˆ sampling error using MMPSB sampling. Overlapping set Δ_overlap{[0,10),[0,12),[0,14),,[0,50),[2,52),[4,54),,[70,120)m} (Lessler et al., 2016) and distinct Δ_dis{[0,7),[7,15),[15,20),[20,25),[25,30),,[115,120)m}. Δ_dis yields lower Dˆ=189 m and more erratic point estimate τˆ yet tighter 95% BCa CI for Dˆ (18⋅4, 28⋅6 m) versus Δ_overlap with Dˆ=610 m and CI (29⋅8, 71⋅8 m) however on further investigation the distribution D_ for the Δ_dis is heavily bimodal; both CIs used 100% of simulations.

3.2. Our approach to graphical hypothesis testing and point estimation

An envelope is loosely defined as a collection of connected-line (syn. piecewise linear) functions in the Cartesian plane, with some bound applied above and below. Central/null envelopes describe the line function, i.e. whether it originates from bootstrap simulations of a point estimate or time-mark permuted null distribution (Section 3.3), respectively; whereas global envelope or pointwise confidence interval (syn. confidence band) refer to the way line functions are bounded. A global envelope is a confidence interval (CI) for a collection of line functions but does not represent a single distance band of one tau point estimate τˆ(dl,dm) (i.e. a pointwise CI), but rather the entire distance band set Δ_. At say a 95% significance level, in 95% of outcomes of constructing a global envelope, the random envelope would contain the true value of τ(dl,dm),[dl,dm)Δ_ (Baddeley et al., 2015).

Our graphical hypothesis test (Section 3.3) and point estimation methods (Section 3.4; online Graphical abstract) offer corrections to the implementations of the tau statistic or similar statistics used in many papers reviewed in Pollington et al. (2019a), i.e. (Salje et al., 2012, Salje et al., 2016a, Salje et al., 2016b, Salje et al., 2017, Salje et al., 2018, Bhoomiboonchoo et al., 2014, Grabowski et al., 2014, Levy et al., 2015, Grantz et al., 2016, Hoang Quoc et al., 2016, Lessler et al., 2016, Azman et al., 2018, Rehman et al., 2018, Succo et al., 2018, Truelove et al., 2019). These studies incorrectly used an envelope about the point estimates or simulated null distribution, constructed from pointwise CIs, as a graphical hypothesis test to reject clustering which amounts to multiple hypothesis testing and inflated type I errors (Figs. 2a & b) (Pollington et al., 2019a). Additionally they defined the clustering endpoint D as the distance at which the lower bound of the first pointwise (percentile) CI belonging to a point estimate above τ=1, touches τ=1 (Fig. 2a), or where the point estimate line touches the upper bound of the null envelope formed from pointwise CIs (Fig. 2b); thus mixing graphical hypothesis testing with point estimation (Pollington et al., 2019a).

Fig. 2.

Fig. 2

Previous naive methods: several authors (Section 3.2) choose one envelope type as ‘central’ (a) or ‘null’ (b), then simultaneously test the hypothesis of no clustering and estimate the clustering endpoint parameter Dˆ (Pollington et al., 2019a). The single red line τ=1 represents no spatiotemporal clustering nor inhibition. Grey lines indicate (a) a collection of spatial bootstrap estimates τˆ (denoted by , see Section 3.4.1) from a typical tau estimator characterised by negative exponential lines with Normal noise, or (b) simulations of the tau estimator on time-mark permuted data for null envelope construction, represented here as lines at τ=1 with Normal noise; black lines mark out the envelope bounds constructed from pointwise confidence intervals. The solid blue line characterises an empirical tau point estimate τˆ(d).

Instead, we split the method into the separate steps of graphical hypothesis testing (Fig. 4) and point estimation (e.g. Fig. 5) in Sections 3.3 & 3.4, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

3.3. Graphical hypothesis test of no clustering and no inhibition

We instead construct a global envelope around the distribution of the null hypothesis (H0: τ=1, no spatiotemporal clustering nor inhibition) (Myllymäki and Mrkvička, 2019a). This is generated by randomly permuting the time marks (onset timei) of the spatiotemporal data points Xi= (x-coordinatei, y-coordinatei, onset timei), i={1,,n} to scramble any spatiotemporal clustering present and simulate what τˆ would be under H0. We assess if a subset of distance bands δ_ of Δ_ exists (as contiguous or disjoint regions) where the tau point estimate τˆ(d) is ever above/below the upper/lower bound, respectively, of this global null envelope. This global envelope is of extreme rank type (“defined as the minimum of pointwise ranks”) with 95% significance level and extreme rank length p-value interval (note: a range, not a single p-value) as constructed by the GET R package (Myllymäki et al., 2019b) (see online Graphical abstract). We compute 2500 time-mark permuted tau simulations for an optimal test (Myllymäki et al., 2017).

The test is two-tailed, which is necessary as only once the graph is plotted is the presence of clustering or inhibition apparent (alternative hypothesis H1: τ1). One should use a global Δ_ covering all pairwise distances, but at large distances null simulations commonly diverge from τ=1. This however can be assessed by tracking the upper & lower quartiles of null simulations (see code in Supplementary material) and choosing a shorter maximum distance; here the maximum pairwise distance is 319 m and we choose 220 m.

3.4. Point estimation of the clustering endpoint and its sampling error

If graphical hypothesis testing establishes evidence against no spatiotemporal clustering nor inhibition within a subset of distance bands δ_ (Section 3.3) and visual graph inspection indicates clustering, then it is acceptable to estimate the clustering endpoint Dˆ for the clustering range [d1=0 (assumed),dm=Dˆ); this is where the point estimate line τˆ(d) intercepts τ=1, i.e. τˆ(Dˆ)=1. Due to discrete distance bands we linearly interpolate between their endpoints, i.e. [dl,dm) with τˆ>1, and [dl+1,dm+1) with τˆ<1, to obtain Dˆ.

To estimate the sampling error of Dˆ we use spatial bootstrap (denoted by , see Section 3.4.1) tau estimates τ_ˆ. For every bootstrap simulation (that represents a connected line of simulated tau estimates for increasing d, i.e. {τˆ(dl,dm):[dl,dm)Δ_}), we record those that originate from above τ=1 and then intersect τ=1 at some greater distance D within Δ_, i.e. those that satisfy {τˆ(D)=1:DΔ_}. We use N=2500 samples which is more than sufficient for a typical bootstrap (Efron and Tibshirani, 1998). We then take this horizontal set of values D_ and use it to obtain a CI to describe the sampling error in Dˆ (see online Graphical abstract).

We now investigate spatial bootstrap methods (Section 3.4.1), CI construction (Section 3.4.2) and distance band sets (Section 3.4.3).

3.4.1. Spatial bootstrap sampling methods for τˆ

The method that generates spatial bootstrap tau estimates τˆ may impact the sampling error of the CI of the clustering endpoint Dˆ. Through bootstrap theory, the sampling distribution τ_ˆ may serve as a proxy for the actual distribution of τˆ on the data. Furthermore the (central) envelopes constructed from τ_ˆ may approximate the envelope of τˆ on the data, and as a corollary the D_ envelope formed by the intercept of τ_ˆ with τ=1 (Efron, 1979).

We compare three spatial bootstrap methods that define the bootstrap either on the data beforehand (see Resampled-index spatial bootstrap (RISB)), or bootstrap the local τ functions (see Modified point spatial bootstrap (MPSB) in Appendix A.4) or locally-(un/)related mark functions m (see Modified marked point spatial bootstrap (MMPSB)). All are non-parametric because they randomly resample the data, or local τ or m functions, without imposing a distribution (Loh, 2008).

Resampled-index spatial bootstrap (RISB).

We start again with spatiotemporal data set X=Xii={1,,n} where Xi = (x-coordinatei, y-coordinatei, onset timei). Using the Uniform distribution we resample with replacement the data’s index vector i_=(1,,n) n times (equal to the number of cases), to generate a spatial bootstrap of the data, with sample index vector i_=(i1,,in) and subsequent data set X=Xi; i_ and i_ have the same length, but i_ is bound to contain duplicated indices due to sampling with replacement. To obtain N bootstrap τ estimates τ_ˆ=(τˆ1,,τˆN) we apply the tau odds estimator individually to N bootstrap data sets X1,,XN; the same approach could be applied to other τ estimators.

Loh critiques this “naive” sampling with replacement of the points Xi of a spatial dataset Xi to produce a spatial bootstrap sample, because “the spatial dependence structure has to be preserved as much as possible” (Loh, 2008) “to reflect properties of the original process” (Loh and Stein, 2004). Lessler et al. (2016) and others used this method and additionally for any i, j resampled index pairs, dropped pairs (Xi,Xj) for which Xi=Xj, as they represented the same point, to avoid ‘self comparisons’.

Modified marked point spatial bootstrap (MMPSB).

Like MPSB (discussed in Appendix A.4) we bootstrap local functions not the data (RISB), but like all three methods still use i_ to decide the sample. Unlike MPSB rather than using local τ functions (Equation A.2), we go deeper and compute the number of locally-related/unrelated cases from mark functions mi(k) local to case i, according to their binary time-relatedness k.

The number of time-related cases (#related) out of all empirical cases j, within a distance [dl,dm) around a case i (chosen in the bootstrap sample) is:

#related(dl,dm,k=1,i)mi(dl,dm,k=1)jj_,ji1(dldij<dm,zij=1) (2)

and then an average is taken over the n cases in the bootstrap sample of indices i_:

#related(dl,dm)¯m(k=1)=1nii_jj_,ji1(dldij<dm,zij=1), (3)

and similar steps for time-unrelated cases yield:

#unrelated(dl,dm)¯m(k=0)=1nii_jj_,ji1(dldij<dm,zij=0), (4)

and finally the odds and odds ratio estimators can be calculated as before:

θ(dl,dm)=#related(dl,dm)¯#unrelated(dl,dm)¯=ii_jj_,ji1(dldij<dm,zij=1)ii_jj_,ji1(dldij<dm,zij=0) (5)
τMMPSB(dl,dm)=θ(dl,dm)θ(0,) (6)

3.4.2. Confidence interval (CI) construction

Bias-corrected and accelerated (BCa) CIs can cope with asymmetrical distributions (like D_ defined in Section 3.4) better than percentile CIs. For non-parametric problems Carpenter and Bithell (2000) consistently found Efron’s BCa method best, due to its low theoretical coverage errors for approximating the exact CI. BCa had “second-order correct coverage” errors under some assumptions, while a percentile CI was first-order correct at best (Efron, 1987). The BCa algorithm transforms a distribution of bootstrap calculations by normalisation to stabilise its variance so that a CI can be constructed, then back-transforms it (Efron, 1987). We calculated it using the coxed R package (Kropko and Harden, 2019).

3.4.3. Distance band sets

The tau statistic is non-unique as it depends on the distance band set chosen (Pollington et al., 2019a), so the potential variation in τ estimates from this choice is of interest; we explore this briefly in a non-systematic way. From analysing cases’ pairwise distances we propose a reasonable distinct (non-overlapping) distance band set, i.e. Δ_dis{[0,7), [7,15), [15,20), [20,25), [25,30), …, [115,120 m)} as a comparison to the overlapping set in the baseline analysis Δ_overlap{[0,10), [0,12), [0,14), …, [0,50), [2,52), [4,54), …, [70,120 m)} (Lessler et al., 2016), and test these using N=2500 samples under MMPSB sampling.

4. Results & discussion

4.1. Dataset description

The epidemic over a small 280 x 240 m2 area lasted nearly three months and five distinct generations can be discerned from the epidemic curve (Fig. B.1). Out of the 197 under-14 year olds, 185 became infected, along with three teenagers, leaving 377 remaining teenagers and adults uninfected (Neal and Roberts, 2004). Fig. B.2 indicates a weak signal of direct transmission between cases, as cases with onsets close together in time (shown by similar colours) tended to be nearby to each other.

4.2. Graphical hypothesis test: global envelope vs. pointwise CIs

There is moderate evidence against the hypothesis of no spatiotemporal clustering nor inhibition (p-value [0,0022]) based on constructing the global envelope around τ=1 under the null hypothesis (Fig. 4), and thus we conclude that the data X is inconsistent with the null model (H0: τ=1). So we turn to the alternative hypothesis, that there is clustering and/or inhibition. Fig. 4 suggests clustering at short distances and 190 m, and inhibition at long distances. Unfortunately it is not possible to compare our results with those of previous papers (see Section 3.2), since they used an incorrect pointwise CI approach to assess clustering, for which a p-value is not available.

Fig. 4.

Fig. 4

Graphical hypothesis testing: Global envelope test, ‘extreme rank’ type, two-sided at 95% significance level using 2500 simulations of the null hypothesis (H0: no spatiotemporal clustering nor inhibition, i.e. τ=1), p-value [0,0022]. Note there is a region where τˆ just exits the global envelope lower bound (suggesting inhibition at 100 m) as well as the obvious departures above the upper bound (suggesting clustering at close distances and 190 m). We are confident that we are simulating H0 well because the median simulation stays close to τ=1 throughout. Distance band set {[0,10),[0,12),[0,14),,[0,50),[2,52),[4,54),,[170,220)m}.

4.3. Impact on the estimated clustering endpoint

Given the evidence of no clustering with the clear clustering signal in Fig. 4 we continue with parameter calculation. We estimate the clustering endpoint as Dˆ= 61⋅0 m with a 95% percentile CI of (29⋅0, 83⋅0 m) over 100 bootstrap simulations using RISB sampling (Fig. 5), or (29⋅2, 83⋅5 m) over 2500 simulations (using 100% of simulations, see Appendix A.2); more bootstrap simulations do not appear to affect the sampling error.

The clustering identified at 190 m (Fig. 4) is ignored as we are interested in the first clustering range for baseline comparison; in general this may provide useful information of medium-range spatial structure to understand disease spread, but secondary to control policy around an index case household.

The point estimate Dˆ= 61⋅0 m is 110% higher than the baseline clustering endpoint (Dˆbase=29 m) (Fig. 3). Previous estimates derived via the improper method of finding the distance at which the lower bound of the central envelope (around τˆ) touches τ=1, have probably substantially underestimated this range. The plateauing shape of τˆ(d) before it reaches τ=1 contributes to the increased imprecision in the estimate of Dˆ. This highlights the utility of visually assessing the graph rather than rigidly using a τ=1 threshold, as it is likely that disease control over say a 60 m radius around an average case would see the biggest gains over the first 30 m with diminishing returns at wider radii (Fig. 6).

The 110% increase in the radial parameter Dˆ due to the corrected estimation method (Section 3.4) is further compounded for public health interventions, as their time and cost are more closely proportional to area not radius, and the areal increase is 342% (since π(Dˆ2Dˆbase2)πDˆbase2=342, assuming dl=0).

4.4. Spatial bootstrap impact on sampling error: modified marked point vs. resampled-index

Using the modified marked point spatial bootstrap (MMPSB) (Section 3.4.1) yields a 24% narrower envelope than the resampled-index spatial bootstrap (RISB), leading to a 95% BCa CI for Dˆ of (29⋅8, 71⋅8 m) (Fig. 6); both CIs used 100% of simulations.

If through simpler spatiotemporal structures or more evenly spread households the tau point estimate τˆ had been smoother in approach to the τ=1 intercept, then the range of clustering would be far larger and the benefit of this estimation method (Section 3.4) more apparent. Given the reasons why MMPSB is better (Section 3.4.1), we believe RISB will poorly sample Dˆ and unfairly represent its precision.

MMPSB outperforms RISB because the latter loses more pair information from resampling indices and avoiding self-comparisons. This was checked empirically for the measles data: the tau point estimate is computed on 188 x 187 = 35156 pairs. In comparison, on average from 1000 simulations, RISB samples from 119 unique people, leading to 119 x 118 = 14042 unique pairs evaluated or 399% of the original pairs. Of course many additional duplicate pairs are used in RISB but we are only interested in unique pair information that is retained. MMPSB only has 119 unique mark functions, but each of them is compared with the other 187 cases, leading to 63⋅3% of pairs being retained.

4.5. CI type: BCa vs. percentile

Histograms of D_={Dˆi:τˆ(Dˆi)=1,i=1,,N} by number of bootstrap samples and sampling method strongly indicate asymmetric distributions, which support the principled use of BCa (Figs. 7a-d). Furthermore Figs. 7b & d show how MMPSB tau estimates τ_ˆ intercept at values of d closer to Dˆ than RISB. Despite this, percentile (29⋅2, 83⋅5 m) and BCa (29⋅3, 84⋅4 m) CIs, using RISB and N=2500, differ marginally (Figs. 5 & 6). RISB appears to introduce a slight positive skew (mean > median) in D_ whereas MMPSB with sufficient samples (N=2500) has a negative skew (Figs. 7a-d). At N=2500, MMPSB noticeably reduces the bias (D¯_Dˆ), between mean/median estimates of D_ and the point estimate Dˆ from 10 m to 5 m, or 8% of Dˆ.

Fig. 7.

Fig. 7

Point estimation: The distribution of D_, formed by the sampled values of Dˆi, i.e. D_={Dˆi:τˆ(Dˆi)=1,i=1,,N} (illustrated in online Graphical abstract), by number of bootstrap samples N = 100 (top row) or N = 2500 (bottom) and by spatial bootstrap sampling method RISB (left column) or MMPSB (right). Vertical dotted lines indicate the point estimate Dˆ=610 m (red); with the mean (green) and median (blue) of the sampled estimates Dˆi, obtained from where the bootstrap tau estimates τ_ˆ intercept τ=1. RISB and MMPSB (N=100) have positive skew as the mean estimate is greater than the median estimate, whereas for MMPSB (N=2500) it has a negative skew. All bootstrap estimations have a negative bias with respect to mean or median summary measures versus the point estimate Dˆ. At N=2500 this is 10 m for RISB and 5 m for MMPSB. The data points used to construct the 95% BCa CIs (purple line on horizontal axis) from the Dˆi estimates in (a) are copied from Fig. 5 (N = 100 simulations) while those for (c) & (d) are from Fig. 6, while (b) has been freshly calculated. All four CIs used 100% of simulations. Distance band set as Fig. 4. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

4.6. Distance bands: overlapping vs. distinct

Overlapping distance band sets Δ_overlap appear to produce Dˆ estimates with higher variance (95% BCa CI (29⋅8, 71⋅8 m)) than distinct Δ_dis (CI (18⋅4, 28⋅6 m)) (Fig. 8), but a clearer and smoother trend in tau with increasing distance (both CIs used 100% of simulations). The distribution of D_ is strongly bi-modal for Δ_dis because the simulations are more erratic about τ=1. The increased volatility of τˆ also results in multiple intercepts with τ=1, but for usability we prefer a single range of clustering, given in this case by Δ_overlap.

5. Conclusion and recommendations for improved use

We have shown that the way the endpoint of the clustering range Dˆ is currently calculated using the tau statistic can lead to biased estimates. Using a redefined D_ for the Hagelloch measles dataset, resulted in bias reductions equivalent to increasing the clustering area of elevated odds by 342%. An improved spatial bootstrap sampling method delivered Dˆ estimates with 24% lower sampling error. These improvements will appear in future versions of the IDSpatialStats R package (Giles et al., 2018). Our results (Section 4) support the following recommendations:

  • using the point d where the tau point estimate line τˆ(d) intercepts τ=1 to define the clustering endpoint estimate Dˆ avoids underestimating it like earlier papers. However this estimation should be conditional on graphical hypothesis testing and visual plot inspection.

  • the modified marked point spatial bootstrap should be used to simulate τˆ instead of the resampled-index method, for CIs that better represent the precision of the clustering endpoint Dˆ.

  • BCa, rather than percentile, CIs should be used as they give better coverage since the distribution of bootstrap tau simulations τ_ˆ or clustering endpoint estimates D_ is commonly asymmetric.

Tau statistic limitations.

The distance band set choice [dl,dm)Δ_ clearly biases Dˆ and affects its smoothness and sampling error. A better understanding of how to choose distance bands for a given purpose is now needed. It is also unknown how the time-relatedness interval choice [T1,T2] (where zij=1((tjti)[T1,T2])) biases the tau statistic through inclusion of extraneous co-primary or secondary cases. It is unclear how second-order correlation functions like the tau statistic and Ripley’s K function (Gabriel and Diggle, 2009), originally founded in spatiotemporal point processes with continuous support in R2, behave for this spatially discrete process. Finally the number of bootstrap samples required for graphical hypothesis testing and estimation purposes is unknown; we believe that related research by Davidson and MacKinnon (2000) could inform a heuristic algorithm.

τ

We encourage the adoption of the statistical protocol described (see online Graphical abstract) to properly test for clustering, and, if appropriate, estimate its range. Control programmes have already been informed by the tau statistic and applying these bias-reduction methods will improve its accuracy in future health policy decisions. In addition to modellers or epidemiologists working on real-time outbreaks or post-study analysis, we hope statisticians are inspired to apply this statistic to spatiotemporal branching processes in new fields.

CRediT contribution statement

TMP: Conceptualisation, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing - original draft, Writing - review & editing, Visualisation. MJT: Conceptualisation, Writing - review & editing, Supervision. TDH: Conceptualisation, Writing - review & editing, Supervision, Funding acquisition. LACC: Conceptualisation, Software, Validation, Data curation, Writing - review & editing, Supervision. Justin Lessler: Data curation. Peter J. Diggle: Methodology, Writing - review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

TMP would also like to thank Shaun Truelove & John Giles for useful discussions at a poster presentation (Pollington et al., 2019b); Mari Myllymäki for GET R package support; and The Editors and Reviewers of Spatial Statistics who made pertinent suggestions.

TMP, LACC & TDH gratefully acknowledge funding of the NTD Modelling Consortium by the Bill & Melinda Gates Foundation (BMGF) (grant number OPP1184344) and LACC acknowledges funding of the SPEAK India consortium by BMGF (grant number OPP1183986). Views, opinions, assumptions or any other information set out in this article should not be attributed to BMGF or any person connected with them. TMP’s PhD is supported by the Engineering & Physical Sciences Research Council, Medical Research Council and University of Warwick (grant number EP/L015374/1). TMP thanks Big Data Institute for hosting him during this work. All funders had no role in the study design, collection, analysis, interpretation of data, writing of the report, or decision to submit the manuscript for publication.

Figure summary

The figures above illustrate the spatial coverage of the statistic (Fig. 1), show previous methods and the baseline analysis (Figs. 2a-b & 3), perform a graphical hypothesis test (Fig. 4), or investigate effects on the parameter point estimate Dˆ and distribution D_ (unless stated the distance band set is ‘overlapping’, see Fig. 4 caption):

  • number of samples N=100 or 2500, on RISB sampling using percentile CIs (Fig. 5)

  • RISB vs. MMPSB, or MMPSB vs. MPSB sampling (Figs. 6 & B.3, respectively) (using N=2500 and BCa CIs)

  • RISB vs. MMPSB sampling and N=100 or 2500 (with BCa CIs) (Figs. 7a-d)

  • overlapping vs. distinct distance band sets (using N=2500, MMPSB sampling and BCa CIs) (Fig. 8)

Footnotes

Online supplementary material ( https://doi.org/10.1016/j.spasta.2020.100438) contains Appendices A.1-5, detailing the computation methods and two of the spatial bootstrap sampling methods, while Appendix B contains Figs. B.1-3 that were not central to the manuscript. The analysis code in R Markdown is available from https://github.com/t-pollington/developments_tau_statistic under a GNU General Public License v3.0 licence.

A. Supplementary material

The following is the Supplementary material related to this article.

MMC S1

The supplementary material is composed of two appendices. Appendix A details the computation methods and MPBS and MMPBS spatial bootstrap sampling methods. Appendix B contains figures that were not central to the manuscript.

mmc1.pdf (1.4MB, pdf)

References

  1. Azman A.S., Luquero F.J., Salje H., Mbaïbardoum N.N., Adalbert N., Ali M., Bertuzzo E., Finger F., Toure B., Massing L.A., Ramazani R., Saga B., Allan M., Olson D., Leglise J., Porten K., Lessler J. Micro-hotspots of risk in urban cholera epidemics. J. Inf. Dis. 2018;218(7):1164–1168. doi: 10.1093/infdis/jiy283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baddeley A., Rubak E., Turner R. first ed. CRC Press/Taylor & Francis; Boca Raton: 2015. Spatial Point Patterns: Methodology and Applications with R. [DOI] [Google Scholar]
  3. Bhoomiboonchoo P., Gibbons R.V., Huang A., Yoon I.-K., Buddhari D., Nisalak A., Chansatiporn N., Thipayamongkolgul M., Kalanarooj S., Endy T., Rothman A.L., Srikiatkhachorn A., Green S., Mammen M.P., Cummings D.A.T., Salje H. The spatial dynamics of dengue virus in Kamphaeng Phet, Thailand. PLoS Neglected Trop. Dis. 2014;8(9):6–11. doi: 10.1371/journal.pntd.0003138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bland J.M. third ed. Oxford medical publications, OUP Oxford; New York, USA: 2000. An Introduction to Medical Statistics. [Google Scholar]
  5. Carpenter J., Bithell J. Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat. Med. 2000;19:1141–1164. doi: 10.1002/(SICI)1097-0258(20000515)19:9&#x000a1;1141::AID-SIM479&#x000bf;3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
  6. Davidson R., MacKinnon J.G. Bootstrap tests: how many bootstraps? Econometric Rev. 2000;19(1):55–68. doi: 10.1080/07474930008800459. [DOI] [Google Scholar]
  7. Diggle P.J., Kaimi I., Abellana R. Partial-likelihood analysis of spatio-temporal point-process data. Biometrics. 2010;66(2):347–354. doi: 10.1111/j.1541-0420.2009.01304.x. [DOI] [PubMed] [Google Scholar]
  8. Efron B. Bootstrap methods: Another look at the jackknife. Ann. Statist. 1979;7(1):1–26. doi: 10.1214/aos/1176344552. [DOI] [Google Scholar]
  9. Efron B. Better bootstrap confidence intervals. J. Amer. Statist. Assoc. 1987;82(397):171–185. doi: 10.1080/01621459.1987.10478410. [DOI] [Google Scholar]
  10. Efron B., Tibshirani R. Chapman & Hall/CRC; Boca Raton, London: 1998. An Introduction to the Bootstrap. [Google Scholar]
  11. Gabriel E., Diggle P.J. Second-order analysis of inhomogeneous spatio-temporal point process data. Stat. Neerlandica. 2009;63(1):43–51. doi: 10.1111/j.1467-9574.2008.00407.x. [DOI] [Google Scholar]
  12. Giles J., Salje H., Lessler J. 2018. IDSpatialStatsR package development version v0.3.9. https://github.com/HopkinsIDD/IDSpatialStats. [Google Scholar]
  13. Giles J., Salje H., Lessler J. The IDSpatialStats R package: quantifying spatial dependence of infectious disease spread. The R Journal. 2019;11(2):308–327. https://journal.r-project.org/archive/2019/RJ-2019-043 [Google Scholar]
  14. Grabowski M.K., Lessler J., Redd A.D., Kagaayi J., Laeyendecker O., Ndyanabo A., Nelson M.I., Cummings D.A.T., Baptiste Bwanika J., Mueller A.C., Reynolds S.J., Munshaw S., Ray S.C., Lutalo T., Manucci J., Tobian A.A.R., Chang L.W., Beyrer C., Jennings J.M., Nalugoda F., Serwadda D., Wawer M.J., Quinn T.C., Gray R.H. The role of viral introductions in sustaining community-based HIV epidemics in rural Uganda: Evidence from spatial clustering, phylogenetics, and egocentric transmission models. PLoS Med. 2014;11(3) doi: 10.1371/journal.pmed.1001610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Grantz K.H., Rane M.S., Salje H., Glass G.E., Schachterle S.E., Cummings D.A.T. Disparities in influenza mortality and transmission related to sociodemographic factors within Chicago in the pandemic of 1918. Proc. Natl. Acad. Sci. USA. 2016;113(48):13839–13844. doi: 10.1073/pnas.1612838113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hoang Quoc C., Salje H., Rodriguez-Barraquer I., In-Kyu Y., Chau N.V.V., Hung N.T., Tuan H.M., Lan P.T., Willis B., Nisalak A., Kalayanarooj S., Cummings D.A.T., Simmons C.P. Synchrony of dengue incidence in Ho Chi Minh City and Bangkok. PLoS Neglected Trop. Dis. 2016;10(12):1–18. doi: 10.1371/journal.pntd.0005188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Höhle M., Meyer S., Paul M., Held L., Burkom H., Correa T., Hofmann M., Lang C., Manitz J., Riebler A., Sabanés Bové D., M. Salmon, Schumacher D., Steiner S., Virtanen M., Wei W., Wimmer V., R Core Team . 2019. surveillanceR package v1.18.0. https://cran.r-project.org/package=surveillance. [Google Scholar]
  18. Kropko J., Harden J. 2019. coxedR package v0.3.2. https://cran.r-project.org/package=coxed. [Google Scholar]
  19. Lessler J., Salje H., Grabowski M.K., Cummings D.A.T. Measuring spatial dependence for infectious disease epidemiology. PLoS One. 2016;11(5):1–13. doi: 10.1371/journal.pone.0155249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Levy J.W., Bhoomiboonchoo P., Simasathien S., Salje H., Huang A., Rangsin R., Jarman R.G., Fernandez S., Klungthong C., Hussem K., Gibbons R.V., Yoon I.-K. Elevated transmission of upper respiratory illness among new recruits in military barracks in Thailand. Influenza Respir. Viruses. 2015;9(6):308–314. doi: 10.1111/irv.12345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Loh J.M. A valid and fast spatial bootstrap for correlation functions. Astrophys. J. 2008;681(1):726–734. doi: 10.1086/588631. [DOI] [Google Scholar]
  22. Loh J.M., Stein M.L. Bootstrapping a spatial point process. Statist. Sinica. 2004;14(1):69–101. [Google Scholar]
  23. Myllymäki M., Mrkvička T. 2019. GET: Global envelopes in R. arXiv/stat.ME. [Google Scholar]
  24. Myllymäki M., Mrkvička T., Grabarnik P., Hahn U., Kuronen M., Rost M., Seijo H. 2019. GETR Package v0.1-6. https://cran.r-project.org/package=GET. [Google Scholar]
  25. Myllymäki M., Mrkvička T., Grabarnik P., Seijo H., Hahn U. Global envelope tests for spatial processes. J. Royal Stat. Soc. Ser. B. 2017;79(2):381–404. doi: 10.1111/rssb.12172. [DOI] [Google Scholar]
  26. Neal P.J., Roberts G.O. Statistical inference and model selection for the 1861 Hagelloch measles epidemic. Biostatistics. 2004;5(2):249–261. doi: 10.1093/biostatistics/5.2.249. [DOI] [PubMed] [Google Scholar]
  27. Oesterle H. Eberhard-Karls-Universitäat Tübingen; 1992. Statistische Reanalyse Einer Masernepidemie 1861 in Hagelloch. (PhD thesis) [Google Scholar]
  28. Pfeilsticker A. Eberhard-Karls-Universität Tübingen; 1863. Beiträge zur Pathologie Der Masern Mit Besonderer Berücksichtigung Der Statistischen VerhäLtnisse. (PhD thesis) [Google Scholar]
  29. Pollington T.M., Tildesley M.J., Hollingsworth T.D., Chapman L.A.C. 2019. The spatiotemporal tau statistic: a review. https://arxiv.org/abs/1911.11476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Pollington T.M., Tildesley M.J., Hollingsworth T.D., Chapman L.A.C. 2019. Epidemics7 conference poster: Use global envelope tests not pointwise CIs, for graphical hypothesis tests of spatiotemporal clustering and tau statistic τ. [DOI] [Google Scholar]
  31. R Core Team . 2019. R v3.6.3: A language and environment for statistical computing. https://www.r-project.org. [Google Scholar]
  32. Rehman N.A., Salje H., Kraemer M.U.G., Subramanian L., Cauchemez S., Saif U., Chunara R. 2018. Quantifying the impact of dengue containment activities using high-resolution observational data. bioRxiv. [DOI] [Google Scholar]
  33. Salje H., Cauchemez S., Alera M.T., Rodriguez-Barraquer I., Thaisomboonsuk B., Srikiatkhachorn A., Lago C.B., Villa D., Klungthong C., Tac-An I.A., Fernandez S., Mark Velasco J., Roque Jr V.G., Nisalak A., Macareo L.R., Levy J.W., Cummings D.A.T., Yoon I.-K. Reconstruction of 60 years of chikungunya epidemiology in the Philippines demonstrates episodic and focal transmission. J. Inf. Dis. 2016;213(4):604–610. doi: 10.1093/infdis/jiv470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Salje H., Cummings D.A.T., Rodriguez-Barraquer I., Katzelnick L.C., Lessler J., Klungthong C., Thaisomboonsuk B., Nisalak A., Weg A., Ellison D., Macareo L.R., Yoon I.-K., Jarman R., Thomas S., Rothman A.L., Endy T., Cauchemez S. Reconstruction of antibody dynamics and infection histories to evaluate dengue risk. Nature. 2018;557(7707):719–723. doi: 10.1038/s41586-018-0157-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Salje H., Lessler J., Berry I.M., Melendrez M.C., Endy T., Kalayanarooj S., A-Nuegoonpipat A., Chanama S., Sangkijporn S., Klungthong C., Thaisomboonsuk B., Nisalak A., Gibbons R.V., Iamsirithaworn S., Macareo L.R., Yoon I.-K., Sangarsang A., Jarman R.G., Cummings D.A.T. Dengue diversity across spatial and temporal scales: Local structure and the effect of host population size. Science. 2017;355(6331):1302–1306. doi: 10.1126/science.aaj9384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Salje H., Lessler J., Endy T.P., Curriero F.C., Gibbons R.V., Nisalak A., Nimmannitya S., Kalayanarooj S., Jarman R.G., Thomas S.J., Burke D.S., Cummings D.A.T. Revealing the microscale spatial signature of dengue transmission and immunity in an urban population. Proc. Natl. Acad. Sci. USA. 2012;109(24):9535–9538. doi: 10.1073/pnas.1120621109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Salje H., Lessler J., Paul K.K., Azman A.S., Rahman M.W., Rahman M., Cummings D.A.T., Gurley E.S., Cauchemez S. How social structures, space, and behaviors shape the spread of infectious diseases using chikungunya as a case study. Proc. Natl. Acad. Sci. USA. 2016;113(47):13420–13425. doi: 10.1073/pnas.1611391113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Succo T., Noël H., Nikolay B., Maquart M., Cochet A., Leparc-Goffart I., Catelinois O., Salje H., Pelat C., de Crouy-Chanel P., de Valk H., Cauchemez S., Rousseau C. Dengue serosurvey after a 2-month long outbreak in Nîmes, France, 2015: was there more than met the eye? Eurosurveillance. 2018;23(23) doi: 10.2807/1560-7917.ES.2018.23.23.1700482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Truelove S.A., Graham M., Moss W.J., Metcalf C.J.E., Ferrari M.J., Lessler J. Characterizing the impact of spatial clustering of susceptibility for measles elimination. Vaccine. 2019;37(5):732–741. doi: 10.1016/j.vaccine.2018.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

MMC S1

The supplementary material is composed of two appendices. Appendix A details the computation methods and MPBS and MMPBS spatial bootstrap sampling methods. Appendix B contains figures that were not central to the manuscript.

mmc1.pdf (1.4MB, pdf)

RESOURCES