Skip to main content
Oxford University Press logoLink to Oxford University Press
. 2023 Sep 21;86(1):177–193. doi: 10.1093/jrsssb/qkad104

Spatial confidence regions for combinations of excursion sets in image analysis

Thomas Maullin-Sapey 1,✉,b, Armin Schwartzman 2,3, Thomas E Nichols 4
PMCID: PMC10852994  PMID: 38344135

Abstract

The analysis of excursion sets in imaging data is essential to a wide range of scientific disciplines such as neuroimaging, climatology, and cosmology. Despite growing literature, there is little published concerning the comparison of processes that have been sampled across the same spatial region but which reflect different study conditions. Given a set of asymptotically Gaussian random fields, each corresponding to a sample acquired for a different study condition, this work aims to provide confidence statements about the intersection, or union, of the excursion sets across all fields. Such spatial regions are of natural interest as they directly correspond to the questions ‘Where do all random fields exceed a predetermined threshold?’, or ‘Where does at least one random field exceed a predetermined threshold?’. To assess the degree of spatial variability present, our method provides, with a desired confidence, subsets and supersets of spatial regions defined by logical conjunctions (i.e. set intersections) or disjunctions (i.e. set unions), without any assumption on the dependence between the different fields. The method is verified by extensive simulations and demonstrated using task-fMRI data to identify brain regions with activation common to four variants of a working memory task.

Keywords: confidence regions, excursion sets, linear model, spatial statistics

1. Introduction

The collection and analysis of imaging data, modelled as a random field regularly sampled on a spatial domain, is central to a broad range of scientific disciplines such as neuroimaging, climatology, and cosmology. By combining spatial data from n observations of a random field, researchers can estimate a spatially varying target function μ:SR, where the spatial region S is a closed subset of RN. Examples of such outcomes include cosmic microwave background in astronomy, temperature data for climate, and changes in blood oxygenation in the brain. Empirical interest often lies in identifying locations where the target function exceeds a certain value, c; for example ‘Where has a significant change in temperature occurred?’, or ‘Which part of the brain demonstrated a strong response to stimuli?’. Such questions are addressed by the study of excursion sets, sets of the form {sS:μ(s)c}.

Previous literature has documented many geometric and topological properties of excursion sets of random functions (c.f. Adler, 1981; Adler & Taylor, 2009; Azas & Wschebor, 2009; Cao, 1999; Torres, 1994). These include the Euler characteristic (a measure of topological structure), the Hausdorff dimension (indicating how fractal the set may be), Lipschitz-Killing curvatures (describing high-dimensional volumes and areas), and Betti numbers (describing how many stationary points appear above the threshold, c) (Adler, 1977; Adler et al., 2017; Pranav et al., 2019; Worsley, 1996). However, much of this work is limited to homogeneous stationary processes, i.e. those with zero mean and a spatial covariance structure dependent upon only distance. Only the most recent literature, including the present paper, concerns processes with a non-zero mean function and a potentially heterogeneous covariance function.

At present, little is published concerning how excursion sets may be compared to one another. Often researchers collect data from two or more study conditions and contrast the results to draw some meaningful inference. For example, comparing temperature changes in winter vs. summer, heat measurements gathered using different imaging modalities, or brain activity in healthy subjects across a range of tasks. Such settings are modelled as M spatially aligned, potentially correlated, estimates, μ^n1,μ^n2,μ^nM:SR, of M target functions, μ1,μ2,μM.

In such settings, research questions are often expressed as logical statements involving logical conjunctions, negations, and disjunctions. For example, a climatologist may ask where significant changes in temperature were observed during either winter or summer. Alternatively, a neuroscientist may ask where was cognitive behaviour observed in the brain during one task and not another. Formally, these questions may be expressed as ‘Where does μ1orμ2 exceed c?’ and ‘Where does μ1, and notμ2, exceed c?’, respectively.

Images corresponding to different study conditions are often compared via rudimentary visual inspection (see Dijkstra et al., 2021; Ferreira et al., 2021; Zhang et al., 2021 for recent examples of qualitative assessment of ‘overlap’ and ‘differences’ in neuroimaging). Such a subjective practice may lead to biased or misleading results. To formalise these inferences, recent work has proposed confidence regions (CRs), subsets, and supersets that bound the excursion set of a single target function with a given confidence (Bowring et al., 2019, 2021; Cuevas et al., 2006; French & Sain, 2013; Ren et al., 2022; Sommerfeld et al., 2018). However, current CR theory does not offer a means for investigating logical conjunctions and disjunctions of exceedance statements, i.e. intersections and unions of excursion sets. In this work, we provide a new method for generating CRs for ‘conjunction inference’ in image analysis, and then discuss how the proposed method may be adapted to allow the investigation of statements containing disjunctions and negations. An example application is provided in Figure 1, which shows CRs generated for the region of the brain that was active in all four variants of a working memory task (see Section 5 for further detail).

Figure 1.

Figure 1.

Functional magnetic resonance imaging (fMRI) % blood oxygenation level dependent (BOLD) images (top) and 95% confidence regions (CRs) (bottom left) for the Human Connectome Project working memory task for each of the four visual stimuli types, alongside conjunction CRs assessing the overlap of the four thresholded %BOLD images (bottom right). The thresholds employed were c=1% BOLD change, for all images, and α=0.05, for the CRs. Upper and lower CRs are displayed in red and blue, respectively. Across the bottom row, the yellow sets are the point estimates A^c1,A^c4 and F^c respectively. The red conjunction set has localised regions in the superior frontal gyrus (top) and the angular gyri (left/right), for which we can assert with 95% confidence that, for all four study conditions, there was (at least) a 1% change in BOLD response. Displayed is axial slice z=46 mm.

Formally, we primarily focus on the intersection of M excursion sets, over which the conjunction statement ‘μ1(s)candμ2(s)candandμM(s)c’ holds. We denote M={1,,M}, and for each iM define the ith true and estimated excursion sets as

Aci={sS:μi(s)c}andA^ci={sS:μ^ni(s)c},

respectively. Next, we define Fc:=i{Aci}iM and F^c:=i{A^ci}iM. The spatial region we are interested in, and our estimate of that region, are now given by

Fc={sS:miniMμi(s)c}andF^c={sS:miniMμ^ni(s)c},

respectively. An illustration of Fc in a setting in which M=2 is provided in Figure 2. We note that the above construction can be adjusted to allow c to vary across study conditions (i.e. ‘μ1(s)c1andandμM(s)cM’ for c1,,cMR) by substituting each {μi}iM for {μic+ci}iM). For ease, we assume c is equal for all study conditions. In addition, although the results we shall present theoretically hold in arbitrary dimensions, we shall focus primarily on two-dimensional simulations and applications to investigate the method’s performance.

Figure 2.

Figure 2.

Illustration of the set Fc for a setting in which M=2. Shown in (a) are two spatially varying target functions, μ1(s) (blue) and μ2(s) (green), overlaid and thresholded at the level c. In (b), the regions at which μ1(s)c and μ2(s)c are displayed as red and yellow circles, respectively, with the intersection set, Fc, illustrated in purple. In (c), potential CRs, F^c+ (grey) and F^c (orange), for Fc are illustrated.

Our goal is to obtain CRs F^c+ and F^c such that the below holds as n tends to :

P[F^c+FcF^c]=1α (1)

for a predefined tolerance level α (say, α=0.05). Pictorially, the above states that the sets F^c, Fc, and F^c+ are nested within one another as shown in Figure 2c with probability (1α). To generate such regions, we build on the theory of Sommerfeld et al. (2018) to show that if F^c+ and F^c are defined as appropriately chosen level sets of a standardised minimum process, the above probability may be approximated using quantiles of a well-defined random variable. Using a wild t-bootstrapping procedure, under the assumption of a Gaussian central limit theorem, we will then demonstrate that the relationship between this random variable and the above probability can be used to generate F^c+ and F^c for any desired value of α. Unlike previous literature on random excursion sets, in this work no model is assumed for the processes’ means or spatial covariance structure, and we do not make any assumption of between-‘study condition’ independence (i.e. {μ^ni}iM may be correlated with one another).

If the CRs F^c+ and F^c closely resemble one another, this may be interpreted as signifying that F^c is a reliable estimate of Fc. Alternatively, little resemblance may suggest that there is a high degree of spatial variability present in the data and that the estimated shape, size, and locale of F^c are not particularly reliable. Such statements are of particular value in imaging applications where the aim of a statistical analysis is typically to assess how reliably some form of activity may be localised to a particular spatial region. For instance, in the fMRI example of Figure 1, F^c is the yellow conjunction region displayed in the bottom right panel, which represents an estimate of the brain regions which were active in response to all four variants of a working memory task. Here, the poor resemblance between the blue and red CRs may be interpreted as suggesting that more data is required for F^c to be reliable as an estimate of Fc.

Conjunction and disjunction inference is an active area of research in the broader statistical literature. A typical approach for such conjunction or disjunction inference is to employ null-hypothesis testing. A hypothesis test of a disjunction of nulls to assess evidence for a conjunction of alternative hypotheses can be conducted with an ‘intersection-union’ test. An intersection-union test rejects at level α when all of the individual nulls are rejected at level α. This approach has proven useful for tests of bioequivalence (Berger, 1997; Berger & Hsu, 1996), but does not consider the spatial aspect that is our focus here.

This work focuses on the quantification of spatial uncertainty present in the locale of intersections and unions of random excursion sets, which can be visualised using CRs. CR methodology has previously been developed for individual excursion sets in a wide range of contexts. For instance, Davenport et al. (2022) derive CRs for local extrema in Gaussian fields, whilst Craigmile et al. (2005) quantify uncertainty in extrema estimates within a Bayesian framework. Another active area of research concerns kernel density estimation, in which many authors have proposed CRs for excursion sets of kernel estimates (Chen et al., 2017; Mammen & Polonik, 2013; Qiao & Polonik, 2019). Further, in the setting of hotspot identification for geospatial data, CRs have been proposed to bound a target functions derived from Bayesian spatial mixed effects models (French & Hoeting, 2015; Hazra & Huser, 2021), as well as more general latent Gaussian models (Bolin & Lindgren, 2015; Mejia et al., 2020). Despite this, to our knowledge, no prior work exists that explicitly considers the generation of CRs for conjunction or disjunction inference. Moreover, such inference concerns spatial regions with piece-wise continuous boundaries, which are not typically considered in the CRs literature. The combinatoric nature of the boundary decompositions considered in this work adds an additional layer of complexity to the results and proofs provided, and has not previously been considered in the CRs literature.

In the following sections, we first describe the notation and assumptions upon which our theory relies. Following this, we provide the central theoretical result of this work, which relates equation (1) to an exceedance statement for a well-defined random variable. Next, via the use of the wild t-bootstrap, we detail how this result may be employed to obtain CRs for conjunction inference in the setting of linear regression modelling under the assumption of a Gaussian CLT. Finally, we validate our results with simulations and a real data example based on fMRI data taken from the Human Connectome Project (HCP) (Van Essen et al., 2013). Proofs of the theory presented in this work, alongside further illustration and extensive simulation results, are provided as online supplementary material.

2. Confidence regions for excursion set combinations

2.1. Notation

In the following sections, we shall need to denote the numerous possible low-dimensional sub-manifolds which can arise from intersecting the boundaries of M excursion sets. To do so, we first denote the power set of a finite set, A, as P(A), and define P+(A) as P+(A)=P(A){}. For each iM, we define Aci as the level set {sS:μi(s)=c}. Similarly, we define Fc as the level set {sS:miniMμi(s)=c}. For a spatial set, BS, its complement in S is denoted as Bc=SB, its topological closure is denoted as B¯, its interior is denoted as B, and its boundary is denoted as B. We note here that the notation Aci and B may conflict with one another. This conflict is resolved, however, by the assumptions of Section 2.2 which ensure that the boundaries of {Aci}iM and Fc are equal to the level sets {Aci}iM and Fc (at least locally, in the vicinity of Fc, which is sufficient for our purposes).

We can now partition the level set Fc into sub-manifolds using the set of possible intersections of the M excursion set boundaries. For all ϕP+(M), we define the boundary segment ϕFc as follows:

ϕFc=Fc(iϕAci)(jMϕ(Acj)c).

We note it is possible that, for some ϕP+(M), the boundary segment ϕFc is empty (in fact, this is often the case in practice). This notation is crucial to the statement of Theorem 2.1 and is illustrated in Figure 3.

Figure 3.

Figure 3.

Annotated images of ‘overlapping’ excursion sets (top) with corresponding illustrations of boundary partitions (bottom) for settings in which the number of study conditions, M, is equal to 2 (left) or 3 (right). Shown in Panels (a)-(d) are two overlapping excursion sets Ac1 and Ac2. In Panels (e) and (f), a third set has also been included. Panels (b) and (f) illustrate settings in which the excursion sets share boundaries, Panels (a), (d) and (e) exemplify instances where Fc has sharp corners, and Panels (c) and (f) show that the excursion sets may be nested.

In the fMRI example presented in Section 1, M={1,2,3,4} is a set of indexes representing the four working memory conditions and Fc represents the brain regions that are active in response to all four conditions. In this setting, an example of the notation ϕFc is given by the boundary segment {1}Fc, which represents the edges of Fc which arose solely due to the first working memory condition.

2.2. Assumptions

In this section, we outline and discuss the assumptions upon which our theory relies. Further detailed discussion of Assumption 2.2.3, alongside illustration, is given in online Supplementary Theory Section S2.

Assumption 2.2.1

For each iM, there exists a bounded function σi:SR+ and positive sequence τn0, such that the below joint central limit theorem (CLT) holds:

{μ^ni(s)μi(s)τnσi(s)}sS,iMd{Gi(s)}sS,iM,

where {Gi(s)}sS,iM is a well-defined multivariate random field with continuous sample paths in S and d represents convergence in distribution.

Remark

This assumption introduces the notation σi(s) and τn. In typical applications, σi(s) represents standard deviation across observations (for study condition i, at spatial location s) and τn is a decreasing function of sample size. If μ^ni were derived as the mean of n i.i.d. observed Gaussian random fields {X1i,,Xni}, then Assumption 2.2.1 would reduce to a standard Gaussian functional CLT with μi=E[X1i],σi=Var(X1i), τn=n12 and Gi being a Gaussian random field with unit variance.

For conciseness in the following text, we now define {gi}iM and {g^ni}iM as

gi(s)=μi(s)cσi(s)andg^ni(s)=μ^ni(s)cσi(s).

Pictorially, each {gi} may be visualised as a transformation of the surfaces depicted in Figure 2. Such a transformation would first vertically translate the entire image so that the purple plane, {Y=c}, coincides with the plane formed by the s1 and s2 axis. It would then stretch, or shrink, the resulting function away from, or towards, this plane. From this, it can be seen that the excursion sets of {μi} and {μ^i} at the level c are equal to those of {gi} and {g^ni} at the level 0 (i.e. Aci={sS:gi(s)0} and A^ci={sS:g^ni(s)0}).

Assumption 2.2.2

We assume that:

  1. The functions {gi}iM are continuous on S.

  2. For n large enough, the functions {g^ni}iM are continuous on S.

Remark

For the practical settings we consider, such as fMRI neuroimaging, astronomy and climate, the above assumptions of continuity are ubiquitous in the literature. Common reasons for this include: (1) the underlying estimator is assumed to be drawn from a smooth process and (2) spatial smoothing is often applied to the data during preprocessing to improve the signal-to-noise ratio (SNR) (see Adler & Taylor, 2009; Brett et al., 2004; Poldrack et al., 2011).

Assumption 2.2.2 ensures that all limits taken across space in the proofs in online Supplementary Theory Section S4 are well defined. However, it does not guarantee that the sets {Aci}iM will resemble the well-defined ‘blobs’ which may be expected. Without further assumptions, it is still possible for {Aci}iM to appear extremely erratic, possessing jagged edges and displaying fractal-like behaviour. A well-known example of a random field that satisfies Assumption 2.2.2 but which is not well-behaved in the sense we desire is the highly fractal Wiener process (also known as Brownian motion, c.f. Adler, 1981). To restrict our attention to situations encountered in imaging studies, the below assumption is required.

Assumption 2.2.3

We assume that:

  1. Every open ball around a point on the boundary Fc has a non-empty intersection with (Fc). In addition, for all ϕP+(M) every open ball around a point in ϕFc has a non-empty intersection with the set:
    Jcϕ:=(iϕ(Aci)c)(jMϕ(Acj)),
    where, if Mϕ is empty, the last term is defined to be equal to S.
  2. There is an open neighbourhood of Fc over which the functions {gi}iM and, for n large enough, {g^ni}iM are C1 with finite, non-zero, gradients. The same condition holds for the restrictions of each {gi}iM and {g^ni}iM to S.

  3. For every point sFc and ϕP+(M), if s(iϕAci) then the set (iϕAci) partitions every sufficiently small open ball around s into exactly two components, each of which is path-connected.

Remark

The above statements ensure that Fc is non-empty and has a well-defined boundary which is equal to the level set Fc={sS:miniMgi(s)=0}. All three statements ensure that no changes in topology occur at the level c by requiring that Fc does not contain plateaus, local minima or maxima. In addition, each statement handles pathological cases that the others do not. Statement (a) handles cases in which {Aci}iM coincide exactly with S, Statement (b) ensures that the notion of the gradient is well defined in a neighbourhood of Fc, and Statement (c) handles unusual, potentially unintuitive, fractal-like behaviour which is not ruled out by the first two statements. Further nuanced discussion and illustrations of the degenerate fractal-like cases that are ruled out by Statement (c) are provided in online Supplementary Theory Section S2.

Each of the examples presented in Figure 3 satisfies Assumptions 2.2.2 and 2.2.3 and highlight several features worthy of note. Firstly, as shown in examples (d) and (e), we do not assume Fc is a C1 curve, but rather piece-wise C1. Secondly, the assumptions allow for the possibility that the excursion sets could be nested within one another (c.f. examples (c) and (f)). Thirdly, the assumptions permit the excursion sets to share common boundaries (see examples (b), (c), and (f)). The last two observations are of practical relevance as, in many applications, it may be expected that the target functions for different study conditions exhibit a strong similarity. This relevance is exemplified by the four bottom-left panels of Figure 1 in Section 1, in which the regions of the brain that displayed activation in response to four study conditions are displayed in yellow. It can be seen that these regions are notably similar in terms of anatomy. Further, we note that, throughout this work, there is no requirement that Fc be (globally) path-connected. For instance, in the fMRI example shown in Figure 1, the estimated yellow set F^c in the bottom-right panel has many disconnected components.

2.3. Theory

We now present the central result of this work, Theorem 2.1. To do so, we now define the nested sets F^c and F^c+ by thresholding the statistic τn1miniMg^ni(s):

F^c±:=F^c±(a)={sS:τn1miniMg^ni(s)±a},

using some constant aR+. To find an appropriate value of a, such that equation (1) holds for a desired tolerance level (e.g. α=0.05), Theorem 2.1 is required.

Theorem 2.1

Under the assumptions of Section 2.2, the below holds:

limnP[F^c+FcF^c]=P[Ha], (2)

where the variable H is defined as follows:

H=maxϕP+(M)(supsϕFc|miniϕ(Gi(s))|).

The above theorem shows that violations of the inclusion statement, F^c+FcF^c, are dependent on the extreme values of limiting processes {Gi} along the boundary Fc. In the context of the fMRI example of Section 1, this means that the probability that the red and blue CRs in Figure 1 correctly bound the conjunction region for the four working memory conditions depends solely on the measurement noise distribution along the boundary of this region. The exact relationship between the measurement noise along a given boundary segment, and the probability of the inclusion failing in a vicinity of that segment, depends directly on which study conditions the segment was derived from. For instance, in the example of Figure 3c, it may be expected that the probability of observing a violation along the blue boundary segment, {1}Fc, will depend upon only the limiting noise distribution of the first study condition, G1. However, for the green boundary segment, {1,2}Fc, violations of the subset condition may depend on the noise distributions of both study conditions.

From the above, it follows that if aR satisfies P[Ha]=1α then asymptotically, the inclusion probability, P[F^c+FcF^c], will also equal 1α. Thus, given the (1α)th quantile of the distribution of H, CRs F^c and F^c+ may be constructed which satisfy equation (1). We shall take up the task of evaluating the quantiles of H in Section 3. Using De Morgan’s laws and direct algebraic substitution, Theorem 2.1 may also be employed to answer questions about logical disjunctions and negations. For brevity, the extraneous details are not given here but are instead discussed at length with worked examples in online Supplementary Theory Section S3.

3. Spatial conjunction inference for the linear model

In this section, we build upon the work of Sommerfeld et al. (2018) and Bowring et al. (2019), in which CRs were generated for a single target function, μ(s), derived from a linear regression model. In Section 3.1 we introduce the spatially varying linear model (LM) for M study conditions and, in Section 3.2, we apply the wild t-bootstrap to estimate quantiles of the variable H. Further implementation details, for when data are sampled from a discrete lattice rather than across a continuous space, alongside pseudocode, are provided in online Supplementary Results Section S2. This section focuses solely on conjunction inference. However, the methods described here may also be employed to perform inference on other logical statements via the corollaries of Section 2.3.

3.1. Model specification

For each study condition, study condition iM, we assume that the data follow a spatially varying LM defined at location sS as

Yi(s)=Xiβi(s)+ϵi(s),ϵi(s)N(0,Σi(s)). (3)

The known quantities in the model are: the (n×1) vector of responses, Yi(s), and the (n×p) design matrix, Xi. The unknown model parameters are: the (p×1) vector of regression coefficients, βi(s), and the (n×n) random error covariance matrix, Σi(s). We emphasise that βi(s) and Σi(s) model the signal observed at location sS, and do not incorporate any spatial information. In particular, Σi(s) denotes the covariance between the n observed values at s, and not the spatial covariance of the field.

To illustrate the above notation in practice, we briefly detail its usage for the fMRI example of Section 1. Here, Yi represents the blood oxygenation level dependent (BOLD) signal, a proxy measure of brain function derived from the properties of blood oxygenation, for n subjects in response to study condition i. Xi represents covariates of interest (e.g. subject age, gender, body mass index, etc.) which are assumed not to vary across the brain. βi models the relation between Xi and Yi, and ϵi represents random measurement noise.

For each spatial location, sS, the generalised least squares estimator of the parameter vector βi(s), β^i(s), is given by

β^i(s)=(XiΣi(s)1Xi)1XiΣi(s)1Yi(s).

In this context, under the ith study condition, at each spatial location s, our interest lies in assessing whether linear relationships hold between the elements of the parameter vector βi(s). Such relationships may be expressed using a contrast vector Li of dimension (p×1). In the notation of the previous sections, the target functions for the spatially varying LM (i.e. the surfaces depicted in Figure 2) are given as μi(s)=Liβi(s), with corresponding estimator functions μ^ni(s)=Liβ^i(s). τn1σi(s) is the contrast standard error, given as

τn1σi(s)=[Li(XiΣi(s)1Xi)1Li]12.

Although we have presented {β^i(s)}iM as being estimated separately using M different models, there is nothing in our theory that prevent the same model being used for all M study conditions. For example, it is possible for the model matrices, {Yi(s)}iM and {Xi}iM, and parameter vector, {βi(s)}iM, to be equal for all iM, with the only distinction between study conditions being represented by the contrast vector Li. This observation is noteworthy as it allows researchers to compile all study conditions into a single LM to obtain a heightened statistical power.

In the settings we are motivated by, such as neuroscience and geospatial data, it is common to assume that Yi(s), Σi(s) and βi(s) are smooth functions over space. Such assumptions are reasonable due to the point spread function of imaging devices or spatial smoothing applied during data preprocessing. As a result, we assume that derivative measures, such as contrast estimates and statistical values, are also spatially smooth. In the fMRI example of Section 1, the yellow excursion sets in the four bottom-left panels of Figure 1 were derived from a linear regression as excursion sets of the form A^ci:={sS:Lβ^i(s)c}, and each represent the BOLD response to the ith working memory stimuli type, respectively.

To apply Theorem 2.1 to the above LM, we now assume that Assumptions 2.2.12.2.3 hold. Such assumptions are commonly satisfied by standard conditions placed on the continuity and boundedness of the increments and moments of {βi(s)}sS and {ϵi(s)}sS. A sufficient list of such conditions is provided in online Supplementary Theory Section S5 and further detail may be found in Sommerfeld et al. (2018). We note here that the covariance matrix, Σi(s), is usually unknown in practice, meaning the function τn1σi(s) cannot be calculated. As demonstrated in Sommerfeld et al. (2018), however, Σi(s) can be replaced by any consistent estimator Σ^i(s) and the assumptions given in Section 2.2 are still satisfied. Such estimation is common in the statistics literature and is typically achieved by assuming that some fixed correlation structure applies to Σi(s) (e.g. diagonal independence, auto-regressive, etc.) so that the number of independent variance parameters which must be estimated is small.

3.2. The wild t-bootstrap

In practice, the distribution of H is unknown and must be estimated. In this section, for the model specification described in Section 3.1, we employ a wild t-bootstrap resampling scheme to obtain the quantile, a, of H which satisfies P[Ha]=1α.

To do so, we first define the (n×1)-dimensional decorrelated residual vector, Ri, for the LM of the ith study condition (iM) as

[R1i(s),,Rni(s)]=Ri(s)=[Σ^i(s)]12(Yi(s)Xiβ^i(s)).

The wild t-bootstrap proceeds by, in each bootstrap instance, generating n i.i.d. Rademacher random variables, {r1,,rn} (random variables which take the values 1 and +1 with equal probability) independently of the data. For the ith study condition, at spatial location s, a new bootstrap sample is then obtained by multiplying the elements of the decorrelated residual vector by the Rademacher variables (i.e. the new sample is given by {r1R1i(s),,rnRni(s)}). Once the boostrap sample has been generated, the standard deviation of the bootstrap sample, σ^*,i(s), is calculated. A bootstrap instance of the variable Gi, G~i, is now generated as follows:

G~i(s)=n12l=1nrlRli(s)σ^*,i(s).

A bootstrap instance of the variable H, H~, may then be computed using the bootstrap variables {G~i(s)}sS,iM as follows:

H~=maxϕP+(M)(supsϕF^c|miniϕ(G~i(s))|). (4)

Quantiles of the distribution of H may now be estimated empirically from the observed sampling distribution of H~. Discussion of how the above supremum, taken across continuous space, is evaluated in practice is deferred to online Supplementary Results Section S2.1. We note here that, as the locations of the boundary segments {ϕFc}, depicted in Figure 3, are in practice unknown, the boundary segments {ϕFc} have been substituted for estimates {ϕF^c}. The performance of substitutions of this form is discussed and verified extensively in the works of Sommerfeld et al. (2018), Bowring et al. (2019), and Bowring et al. (2021). In keeping with these publications, all simulations in this work have been replicated using both the true and estimated boundary segments, {ϕFc} and {ϕF^c}, with a full comparison being provided in online Supplementary Results Sections S5 and S6. We note that, due to this substitution, our proposed method may not be applied when F^c is empty.

Telschow and Schwartzman (2021) show that the above procedure provides asymptotically consistent estimates of the distribution of H if each {Gi}iM satisfies the below.

Assumption 3.2.1

For each {Gi}iM, we assume:

  • {Gi}iM is L2 Lipschitz with respect to the Euclidean distance metric. In other words, there exists a random variable A satisfying E[|A|2]< such that |Gi(s)Gi(s)|A|ss| for all s,sS.

  • {Gi}iM has finite l second moment. That is, E[|Gi(s)|2]<.

The above procedure is reliant on the Gaussianity of each Gi. However, it has been suggested that this approach may also be suitable for estimating quantiles of maxima distributions for general symmetric Gi (see Efron & Tibshirani, 1994; Hesterberg, 2015; Wu, 1986).

In order to utilise the results of Section 2 when the random variables {Gi}iM are not symmetric, the wild t-bootstrap must be adjusted to account for skewness. One approach commonly adopted for this purpose is to replace the Rademacher variables in the above with Mammen variables (variables which take values (51)/2 and (5+1)/2 with probabilities (5+1)/(25) and (51)/(25)) (c.f. Mammen, 1993). This procedure preserves the third moment of the bootstrap distribution but at the expense of an inflated kurtosis which may negatively impact the estimation of the distribution tail probabilities. For this reason, alternative resampling procedures, such as, for instance, the generalised Jackknife (c.f. Gomes et al., 2015), may provide improved performance over the Mammen-based wild t-bootstrap. As a comparison of such resampling methodology is beyond the scope of this work, in the following sections, we make the common simplifying assumption that {Gi}iM are Gaussian, with unknown spatial and between-‘study condition’ covariance structure, and refer the reader to Efron and Tibshirani (1994) for further discussion of resampling techniques. Here we stress that, whilst the theory presented in Section 2 places no restrictions on the distribution of {Gi}iM, throughout Section 3{Gi}iM are assumed to be Gaussian.

4. Simulations

4.1. Simulation settings

To assess the correctness and performance of the method, a range of simulations were conducted using synthetic data. Each simulation was designed to investigate how the empirical coverage (i.e. the proportion of simulation instances in which the inclusion F^c+FcF^c held) was affected by various secondary factors of practical interest. In this section, we describe three simulations designed to investigate how the methods’ performance was influenced by: (1) the degree of overlap between excursion sets, (2) the presence of correlation between the noise fields for different study conditions, and (3) the magnitude of the spatial rate of change of the signal.

In all simulations, the model used to generate the synthetic data took the form:

Yi(s)=μi(s)+ϵi(s),ϵi(s)N(0,σi(s)In), for eachiM, (5)

where n was allowed to vary from 40 to 500 in increments of 20. To induce spatial correlation between observations, ϵi(s) was smoothed using an isotropic Gaussian filter with a full-width half maximum (FWHM) of 3 pixels. The aim, across all simulations, was to assess the performance of the method when employed for conjunction inference on the true signal {μi}iM. This setting corresponds to asking research questions of the form ‘Where do all of the {μi}iM exceed c?’.

The mechanism used to generate the true signal, {μi}iM, varied across simulations. In Simulations 1 and 2, {μi}iM were generated using two binary images of circles and squares, respectively, positioned in a ‘Venn diagram’ arrangement. Each binary image was scaled by a predefined amount and then smoothed using an isotropic Gaussian filter with an FWHM of 5 pixels. The resulting signal for Simulation 1 strongly resembled that depicted in Figure 2, whilst Simulation 2’s signal resembled Figure 3b in order to test the effect of {1,2}Fc having non-zero length. In Simulation 3, μ1 and μ2 were simulated as a horizontal and vertical linear ramp, respectively, with predefined gradients. For each simulation, μ1 is depicted in Figure 4a, with corresponding binary images for Ac1, Ac2, and Fc shown in Figure 4b.

Figure 4.

Figure 4.

(a) The circular, square, and ramp high-SNR signals used for μ1 in Simulations 1, 2, and 3, respectively. (b) Binary images for Ac1,Ac2, and Fc, for Simulations 1, 2, and 3. (c) Instances of the high- and low-SNR data generated for study condition 1 in Simulation 1. (d) The estimated mean signal for study condition 1, μ^1, in Simulation 1, generated using n=120 high-SNR, and low-SNR, simulated data instances.

Each simulation was performed twice, once using noisier ‘low SNR’ synthetic data and again using less noisy ‘high-SNR’ synthetic data. To generate the high-SNR synthetic data for Simulations 1 and 2, all simulated {μi}iM were scaled by a factor of 3 prior to smoothing. This data generation process is identical to that employed in Bowring et al. (2019), in which it is noted that synthetic data generated using these parameters strongly resembles that of an fMRI analysis when voxels are of dimension 2mm3. In Simulation 3, for the high-SNR data, the linear ramps were initially simulated with a gradient of 1 per 50 pixels. Across all simulations, the low-SNR data was generated by reducing the signal magnitude of the high-SNR data by a factor of 4. In all simulations, thresholds of c=1/2 and c=2 were employed for the low-SNR data and high-SNR data, respectively. Data realisations and signal estimates employed for Simulations 1–3 are illustrated in Figure 4c and d, respectively.

As Simulation 1 aimed to investigate the method’s performance as the overlap between excursion sets increased, in this simulation, the distance between the centre of the circles was varied, ranging from 0 pixels (full overlap) to 50 pixels (barely overlapping) in increments of 2 pixels. Similarly, as Simulation 2 aimed to assess the effect of correlation between ϵ1 and ϵ2, in this simulation, this correlation was varied between 1 and 1 in increments of 0.1 (in all other simulations, the noise fields were generated independently). As Simulation 3 served to assess how sensitive the empirical coverage was to the rate of change of μ1 and μ2 over space, in this simulation, the initial ramp gradients were divided by a factor of k, where k was varied from 0.25 to 1.75 in 0.05 increments.

All simulation results were obtained as averages taken across 2,500 simulation instances and compared to the nominal coverage using 95% binomial confidence intervals. In all simulation instances, the image dimensions were (100×100) pixels, CRs were computed for a tolerance level of α=0.05 using 5,000 bootstrap realisations, σi(s) was computed using the ordinary least squares estimator and τn was computed as τn=n0.5. In each simulation instance, the assessment of whether the inclusion statement, F^c+FcF^c, held was verified using the interpolation-based methods of Bowring et al. (2019). In this approach, the inclusion statement was deemed to hold if, and only if, the sets of pixels which were identified as F^c, Fc and F^c+ were appropriately nested and, in addition, interpolation along the boundary Fc confirmed that no violations of the inclusion statement had occurred.

A further 12 simulations which investigated the influence of other secondary factors of interest were also conducted. The secondary factors considered by these simulations include: the presence of spatial structure in the noise, the effect of {1}Fc and {2}Fc having differing lengths, the effect of ϵ1 having a much larger variance than ϵ2, spatial smoothing on {μi} and {ϵi}, raising and lowering the threshold c, {ϵi} having heavy-tailed non-Gaussian symmetric distributions, and the impact of varying the value of M. Also considered was the effect of {Aci} having higher order shared boundary segments of the form depicted in Figure 3f. A full discussion of these simulations is deferred to online Supplementary Results Section S4. In addition, in keeping with the previous literature, tolerance levels of α=0.1 and α=0.2, as well as the substitution of Fc for F^c in equation (4), were also considered for simulation (c.f. Bowring et al., 2019, 2021; Sommerfeld et al., 2018). These simulation results may be found in online Supplementary Results Sections S5 and S6.

4.2. Simulation results

For a majority of the simulations conducted, the empirical coverage estimates for the proposed method were tightly distributed around the expected nominal coverage. In particular, for the high-SNR data, across all simulations, the 95% binomial confidence intervals for the proposed method consistently captured the nominal coverage at the predicted rate. However, when the simulations employed the low-SNR data, the proposed CRs method was conservative under certain unfavourable conditions. The low-SNR results for Simulations 1–3 are shown in Figure 5.

Figure 5.

Figure 5.

Simulation results generated using the low-SNR synthetic data. Nominal coverage is shown as a dashed line, with a corresponding binomial confidence interval also shaded. All data points are averages taken across 2,500 simulation instances.

For example, in Figure 5a, it can be seen that some over-coverage was observed for Simulation 1 when the data was noisy, and the number of subjects was low. This observation is corroborated by the remaining low-SNR synthetic data simulations (c.f. online Supplementary Results Section S5). However, in all cases, the degree of agreement between the empirical and nominal coverage improved as sample size increased, and no such over-coverage was observed for the equivalent high-SNR simulations. This observation matches expectation, as Theorem 2.1 holds only asymptotically, and is supported by the previous literature on CRs, in which similar results were observed for the single-‘study condition’ setting (c.f. Bowring et al., 2019; Sommerfeld et al., 2018).

The presence of strong negative correlation between the study conditions and the rate at which the signal varied across space both also appeared to influence the methods’ performance (c.f. Figure 5b and c, respectively). In both instances, it is likely that the observed over-coverage is due to difficulties in estimating the true location of the boundary Fc, as each of these factors make it harder to identify the locations at which small changes in miniMμi occur. When high-SNR data was used in the place of low-SNR data, both of these factors had a substantially reduced impact on the empirical coverage.

Despite the above observations, the simulation results overwhelmingly supported the claim that the proposed method is robust to a range of secondary factors. Included in the list of such factors are: variation in the shape and size of Fc, spatial correlation in the noise, variation in the lengths of individual boundary segments, between-‘study condition’ correlation, variation in the magnitude of the noise, high spatial smoothness of the signal and noise, raising and lowering the threshold, heavy-tailed noise distributions, large numbers of study conditions, and situations in which boundary segments are shared by many excursion sets. For further detail, see online Supplementary Results Sections S5 and S6.

In terms of time efficiency, the wild t-bootstrap provided extremely fast computational performance. For instance, using an Intel(R) Xeon(R) Gold 6126 2.60 GHz processor with 16 GB RAM, and averaging over the 2,500 simulation instances conducted for n=500 high-SNR observations, in Simulation 1, the time taken to generate CRs for a circle separation of 20 pixels was 5.98 s. For a comprehensive summary of computation times, see online Supplementary Results Section S7.

5. Real data application

In Section 1, an fMRI application based on using data from the HCP dataset (Van Essen et al., 2013) was presented. Here, we provide further detail for this application. In this example, task-fMRI data were collected as part of a block design from 80 healthy, unrelated, young adults as part of a working memory task that had four distinct components, each using a different stimuli type: pictures of places, tools, faces, and body parts. In online Supplementary Results Section S3, we provide a brief overview of the imaging acquisition protocol, task paradigm, preprocessing stages and first-level analysis employed for generating this dataset (for further detail, see Barch et al., 2013; Glasser et al., 2013). Following first-level analysis, the data consisted of 4 images for each of the 80 subjects, measuring the %BOLD response to each of the 4 stimuli types. In this example, our interest lies in identifying, at the group-level, which regions of the brain are associated with working memory regardless of stimuli type. In other words, ‘Which regions are active in response to all four working memory stimuli types?’.

As this work has focussed on two-dimensional excursion sets, a single slice of the brain was chosen for analysis (z=46mm, covering portions of the frontal gyrus involved in working memory). For each stimuli type, an n=80 group-level LM was constructed. Using the proposed method, CRs were then generated at the 5% confidence level to assess where the group-level percentage BOLD change exceeded 1% for all four stimuli types. To compare the proposed method to standard fMRI inference procedures, a group-level contrast was also generated using the Big LM toolbox for each of the four stimuli types. In addition, single-‘study condition’ CRs were also generated using the methods of Sommerfeld et al. (2018) for each of the four stimuli types.

The results are those shown in Figure 1. The conjunction inference identified several localised regions, including the superior frontal gyrus, which is well known for its involvement in working memory (Alagapan et al., 2019; Boisgueheneuc et al., 2006; Vogel et al., 2016), and the angular gyri, which are well known to be involved in memory retrieval, attention, and spatial cognition (Bréchet et al., 2018; Seghier, 2013).

Whilst the CRs for conjunction inference illustrated in Figure 1 exhibit a clear resemblance to the four contrast images, we note that the red set, F^c+, is very small in comparison to the blue set, F^c, which is large and diffuse. This observation conveys important information about the spatial variability of the regions identified. In particular, it can be seen that, in the regions immediately surrounding the superior frontal gyrus and angular gyri, there is a much higher resemblance between the blue (F^c) and yellow (F^c) sets than is seen across the rest of the brain. This resemblance provides an insight into the degree of spatial variation present surrounding these regions. In this case, the strength and localisation of this resemblance suggest that the spatial variability of F^c is less severe in and around the aforementioned anatomical regions than it is across the rest of the brain. It may be concluded that the estimated yellow ‘blobs’ corresponding to the superior frontal gyrus and the angular gyri have been more reliably localised than the other yellow ‘blobs’ appearing in the image.

In general, the single-‘study condition’ CRs can be seen to be much larger than those observed for conjunction inference. This observation matches expectation as the overlap of the four excursion sets is smaller than each set individually. In addition, in this example, the conjunction method employed the entire experimental design for analysis, as opposed to the single-‘study condition’ method which employed only the data concerning the stimuli of interest. It is therefore expected to have a higher statistical power and, thus, tighter confidence bounds.

6. Conclusion and discussion

In this work, we have produced a method for generating CRs for intersections and unions of multiple excursion sets. The CRs generated by the method generalise the notion of confidence intervals to arbitrary spatial dimensions and possess the usual frequentist interpretation that, were the 0.95 procedure repeated on numerous samples, the proportion of CRs, F^c+ and F^c, that correctly enclose Fc in the manner depicted in Figure 2c, would tend to be 0.95. Such CRs serve as an indicator of the reliability of F^c as an estimate of Fc.

We stress here that, whilst F^c is equal to the intersection of {A^ci}iM, the CRs F^c± are not the intersections of the single-‘study condition’ CRs {A^c±,i}iM which would be obtained using the methods of Sommerfeld et al. (2018). In other words, the intersection of the yellow sets in the four bottom-left panels of Figure 1 is equal to the yellow set in the fifth, but the corresponding statement does not hold for the blue or red sets. This claim follows from the fact that {A^c±,i}iM are defined using separate quantile estimates from different bootstraps and can be heavily influenced by the behaviour of {Aci}iM in spatial regions far from Fc. In fact, treating the naive intersection of the (1α) CRs {A^c±,i}iM as (1α) CRs for Fc is not a valid method in general and can result in undesirable asymptotic coverage lying anywhere inside the range [1Mα,1]. This claim is further discussed in online Supplementary Theory Section S6 and verified in Simulations 14 and 15 of the online Supplementary Results document, which show that naive intersections can provide over- and under-coverage, respectively. To our knowledge, the only valid approach for generating CRs for conjunction inference is that outlined in this work.

One potential limitation of the specific methods of Section 3 is that they allow for CRs to be generated only when the target functions, {μi}iM, are linear combinations of regression parameter estimates. As noted by Bowring et al. (2021), it is often preferable for an analysis to consider standardised effect sizes, such as Cohen’s d (Cohen, 2013) or Hedges’ G (Hedges, 1981), instead of regression parameter estimates, as standardised effect sizes allow for information about statistical power to be incorporated into the analysis. To this end, Bowring et al. (2021) outlined an approach for generating CRs (in the single-‘study condition’ setting) for images of Cohen’s d effect sizes. Here, we suggest that a potential avenue for future work is to combine the methods of Bowring et al. (2021) and those proposed in this work to allow for conjunction, or disjunction, inference to be performed on standardised effect sizes rather than regression parameter estimates.

It must also be noted that the wild t-bootstrap has only been shown to consistently estimate quantiles of the supremum in equation (2) when the {Gi} are Gaussian random fields satisfying Assumption 3.2.1 (c.f. Telschow & Schwartzman, 2021). As discussed in Section 3.2, it has been suggested that this approach may also be used for estimating suprema quantiles for symmetric non-Gaussian random fields, and further adapted to non-symmetric cases by substituting the Rademacher variables for Mammen variables. However, to our knowledge, the consistency of such approaches has not yet been shown to hold theoretically. Our preliminary simulations have indicated that consistency may hold in the symmetric case (c.f. online Supplementary Results Simulation 13), and thus we suggest theoretical exploration of this topic may provide a basis for future investigation.

Another limitation noted here is that, whilst our proposed method theoretically works for data of arbitrary dimensions, the simulations of Section 4 verify the performance of our method only for two-dimensional data. Whilst it is typical for many practical applications to focus on two-dimensional data (e.g. climate maps, surface images), higher dimensional datasets are abundant in a wealth of disciplines such as brain imaging and astrological mapping. For this reason, we intend to investigate further the performance of the method in higher dimensions in future work.

A possible direction for future exploration is to investigate the applicability of the method within the framework of latent Gaussian models. Such models seek to explain non-Gaussian outcomes as a combination of Gaussian processes transformed via a link function. By using the proposed method, it may be possible to assess the regions of space in which a combination of latent processes predicted the outcome. If successful, such an approach would have high utility for the analysis of spatial data with a count or binary structure, making it a promising direction for further research.

Supplementary Material

qkad104_Supplementary_Data

Acknowledgments

The authors would like to thank Dr. Fabian J.E. Telschow for his early involvement in formulating the project, and Professor Gary R. Lawlor for his email correspondence regarding the differentiability requirements for L’hospital’s rule for multivariable functions.

Contributor Information

Thomas Maullin-Sapey, Nuffield Department of Population Health, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.

Armin Schwartzman, Division of Biostatistics, University of California, San Diego, CA, USA; Halicioğlu Data Science Institute, University of California, San Diego, CA, USA.

Thomas E Nichols, Nuffield Department of Population Health, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.

Funding

This work was partially supported by the NIH under Grant [R01EB026859].

Data availability

Data were provided by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. The code used to obtain the results of Sections 4 and 5 is freely available and may be found at: https://github.com/TomMaullin/ConfSets

Supplementary material

Supplementary material is available online at Journal of the Royal Statistical Society: Series B.

References

  1. Adler R. J. (1977). Hausdorff dimension and Gaussian fields. Annals of Probability, 5(1), 145–151. 10.1214/aop/1176995900 [DOI] [Google Scholar]
  2. Adler R. J. (1981). The geometry of random fields. Wiley Series in Probability and Mathematical Statistics. J. Wiley. ISBN 9780471278443. [Google Scholar]
  3. Adler R. J., Bartz K., Kou S. C., & Monod A. (2017). Estimating thresholding levels for random fields via Euler characteristics. arXiv preprint arXiv:1704.08562 [math.ST]. 10.48550/ARXIV.1704.08562. [DOI]
  4. Adler R. J., & Taylor J. E. (2009). Random fields and geometry. Springer Monographs in Mathematics. Springer. ISBN 9780387481166. [Google Scholar]
  5. Alagapan S., Lustenberger C., Hadar E., Shin H. W., & Fröhlich F. (2019). Low-frequency direct cortical stimulation of left superior frontal gyrus enhances working memory performance. NeuroImage, 184, 697–706. 10.1016/j.neuroimage.2018.09.064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Azas J. M., & Wschebor M. (2009). Level sets and extrema of random processes and fields. (1st ed.). Wiley. 10.1002/9780470434642 [DOI] [Google Scholar]
  7. Barch D. M., Burgess G. C., Harms M. P., Petersen S. E., Schlaggar B. L., Corbetta M., Glasser M. F., Curtiss S., Dixit S., Feldt C., Nolan D., Bryant E., Hartley T., Footer O., Bjork J. M., Poldrack R., Smith S., Johansen-Berg H., Snyder A. Z., … Van Essen D. C. (2013). Function in the human connectome: Task-fMRI and individual differences in behavior. NeuroImage, 80, 169–189. 10.1016/j.neuroimage.2013.05.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Berger R. L. (1997). Likelihood ratio tests and intersection-union tests (pp. 225–237). Birkhäuser. ISBN 978-1-4612-2308-5. 10.1007/978-1-4612-2308-5_15 [DOI]
  9. Berger R. L., & Hsu J. C. (1996). Bioequivalence trials, intersection-union tests and equivalence confidence sets. Statistical Science, 11(4), 283–319. 10.1214/ss/1032280304 [DOI] [Google Scholar]
  10. Boisgueheneuc F. d., Levy R., Volle E., Seassau M., Duffau H., Kinkingnehun S., Samson Y., Zhang S., & Dubois B. (2006). Functions of the left superior frontal gyrus in humans: A lesion study. Brain, 129(12), 3315–3328. ISSN 0006-8950. 10.1093/brain/awl244 [DOI] [PubMed] [Google Scholar]
  11. Bolin D., & Lindgren F. (2015). Excursion and contour uncertainty regions for latent Gaussian models. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 77(1), 85–106. ISSN 13697412, 14679868. 10.1111/rssb.12055 [DOI] [Google Scholar]
  12. Bowring A., Telschow F., Schwartzman A., & Nichols T. E. (2019). Spatial confidence sets for raw effect size images. NeuroImage, 203, 116187. 10.1016/j.neuroimage.2019.116187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bowring A., Telschow F. J. E., Schwartzman A., & Nichols T. E. (2021). Confidence sets for Cohen’s d effect size images. NeuroImage, 226, 117477. 10.1016/j.neuroimage.2020.117477 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bréchet L., Grivaz P., Gauthier B., & Blanke O. (2018, November). Common recruitment of angular gyrus in episodic autobiographical memory and bodily self-consciousness. Frontiers in Behavioral Neuroscience, 12, 270. 10.3389/fnbeh.2018.00270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Brett M., Penny W., & Kiebel S. (2004). An introduction to random field theory. Human Brain Function (2nd ed., pp. 867–879). Academic Press. 10.1016/B978-012264841-0/50046-9 [DOI] [Google Scholar]
  16. Cao J. (1999). The size of the connected components of excursion sets of χ2, t and F fields. Advances in Applied Probability, 31(3), 579–595. 10.1239/aap/1029955192 [DOI] [Google Scholar]
  17. Chen Y.-C., Genovese C. R., & Wasserman L. (2017). Density level sets: Asymptotics, inference, and visualization. Journal of the American Statistical Association, 112(520), 1684–1696. 10.1080/01621459.2016.1228536 [DOI] [Google Scholar]
  18. Cohen J. (2013). Statistical power analysis for the behavioral sciences. Elsevier Science. ISBN 9781483276489. [Google Scholar]
  19. Craigmile P., Cressie N., Santner T., & Rao Y. (2005). A loss function approach to identifying environmental exceedances. Extremes, 8(3), 143–159. 10.1007/s10687-006-7964-y [DOI] [Google Scholar]
  20. Cuevas A., González-Manteiga W., & Rodríguez-Casal A. (2006). Plug-in estimation of general level sets. Australian & New Zealand Journal of Statistics, 48(1), 7–19. 10.1111/anzs.2006.48.issue-1 [DOI] [Google Scholar]
  21. Davenport S., Nichols T. E., & Schwarzman A. (2022). ‘Confidence regions for the location of peaks of a smooth random field’, arXiv, arXiv:2208.00251, preprint: not peer reviewed.
  22. Dijkstra N., van Gaal S., Geerligs L., Bosch S. E, & van Gerven M. (2021). No overlap between unconscious and imagined representations. PsyArXiv. 10.31234/osf.io/ctdmk [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Efron B., & Tibshirani R. J. (1994). An introduction to the bootstrap (1st ed.). Chapman and Hall/CRC. [Google Scholar]
  24. Ferreira R. A., Vinson D., Dijkstra T., & Vigliocco G. (2021). Word learning in two languages: Neural overlap and representational differences. Neuropsychologia, 150, 107703. 10.1016/j.neuropsychologia.2020.107703 [DOI] [PubMed] [Google Scholar]
  25. French J. P., & Hoeting J. A. (2015). Credible regions for exceedance sets of geostatistical data. Environmetrics, 27(1), 4–14. 10.1002/env.v27.1 [DOI] [Google Scholar]
  26. French J. P., & Sain S. R. (2013). Spatio-temporal exceedance locations and confidence regions. The Annals of Applied Statistics, 7(3), 1421–1449. 10.1214/13-AOAS631 [DOI] [Google Scholar]
  27. Glasser M. F., Sotiropoulos S. N., Wilson J. A., Coalson T. S., Fischl B., Andersson J. L., Xu J., Jbabdi S., Webster M., Polimeni J. R., Van Essen D. C., & Jenkinson M. (2013). The minimal preprocessing pipelines for the human connectome project. NeuroImage, 80, 105–124. 10.1016/j.neuroimage.2013.04.127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Gomes M. I., Martins M. J., Neves M., & Figueiredo F. (2015). Resampling methodologies and reliable tail estimation. South African Statistical Journal, 49(1), 1–20. 10.10520/EJC168309 [DOI] [Google Scholar]
  29. Hazra A., & Huser R. (2021). Estimating high-resolution red sea surface temperature hotspots, using a low-rank semiparametric spatial model. The Annals of Applied Statistics, 15(2), 572–596. 10.1214/20-AOAS1418 [DOI] [Google Scholar]
  30. Hedges L. V. (1981). Distribution theory for Glass’s estimator of effect size and related estimators. Journal of Educational Statistics, 6(2), 107–128. ISSN 03629791. 10.3102/10769986006002107 [DOI] [Google Scholar]
  31. Hesterberg T. C. (2015). What teachers should know about the bootstrap: Resampling in the undergraduate statistics curriculum. The American Statistician, 69(4), 371–386. 10.1080/00031305.2015.1089789 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mammen E. (1993). Bootstrap and wild bootstrap for high dimensional linear models. Annals of Statistics, 21(1), 255–285. 10.1214/aos/1176349025 [DOI] [Google Scholar]
  33. Mammen E., & Polonik W. (2013). Confidence regions for level sets. Journal of Multivariate Analysis, 122, 202–214. 10.1016/j.jmva.2013.07.017 [DOI] [Google Scholar]
  34. Mejia A. F., Yue Y. R., Bolin D., Lindgren F., & Lindquist M. A. (2020). A Bayesian general linear modeling approach to cortical surface fMRI data analysis. Journal of the American Statistical Association, 115(530), 501–520. 10.1080/01621459.2019.1611582 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Poldrack R. A., Mumford J. A., & Nichols T. E. (2011). Handbook of functional MRI data analysis. Cambridge University Press. [Google Scholar]
  36. Pranav P., van de Weygaert R., Vegter G., Jones B. J. T., Adler R. J, Feldbrugge J., Park C., Buchert T., & Kerber M. (2019, February). Topology and geometry of Gaussian random fields I: On Betti numbers, Euler characteristic, and Minkowski functionals. Monthly Notices of the Royal Astronomical Society, 485(3), 4167–4208. 10.1093/mnras/stz541 [DOI] [Google Scholar]
  37. Qiao W., & Polonik W. (2019). Nonparametric confidence regions for level sets: Statistical properties and geometry. Electronic Journal of Statistics, 13(1), 985–1030. 10.1214/19-EJS1543 [DOI] [Google Scholar]
  38. Ren J., Telschow F. J. E., & Schwartzman A. (2022). ‘Inverse set estimation and inversion of simultaneous confidence intervals’, arXiv, arXiv:2210.03933, preprint: not peer reviewed.
  39. Seghier M. L. (2013). The angular gyrus: Multiple functions and multiple subdivisions. The Neuroscientist, 19(1), 43–61. ISSN 1089-4098. 10.1177/1073858412440596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Sommerfeld M., Sain S., & Schwartzman A. (2018). Confidence regions for spatial excursion sets from repeated random field observations, with an application to climate. Journal of the American Statistical Association, 113(523), 1327–1340. 10.1080/01621459.2017.1341838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Telschow F., & Schwartzman A. (2021). Simultaneous confidence bands for functional data using the Gaussian kinematic formula. Journal of Statistical Planning and Inference, 216, 70–94. 10.1016/j.jspi.2021.05.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Torres S. (1994, March). Topological analysis of COBE-DMR cosmic microwave background maps. The Astrophysical Journal, 423(1), L9–L12. 10.1086/187223 [DOI] [Google Scholar]
  43. Van Essen D., Smith S., Barch D., Behrens T., Yacoub E., & Ugurbil K. (2013). The WU-Minn human connectome project: An overview. NeuroImage, 80, 62–79. 10.1016/j.neuroimage.2013.05.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Vogel T., Smieskova R., Schmidt A., Walter A., Harrisberger F., Eckert A., Lang U. E., Riecher-Rössler A., Graf M., & Borgwardt S. (2016). Increased superior frontal gyrus activation during working memory processing in psychosis: Significant relation to cumulative antipsychotic medication and to negative symptoms. Schizophrenia Research, 175(1–3), 20–26. 10.1016/j.schres.2016.03.033 [DOI] [PubMed] [Google Scholar]
  45. Worsley K. J. (1996). The geometry of random images. Chance, 9(1), 27–40. 10.1080/09332480.1996.10542483 [DOI] [Google Scholar]
  46. Wu C. F. J. (1986). Jackknife bootstrap and other resampling methods in regression analysis. The Annals of Statistics, 14(4), 1261–1295. 10.1214/aos/1176350142 [DOI] [Google Scholar]
  47. Zhang X., Li L., Huang G., Zhang L., Liang Z., Shi L., & Zhang Z. (2021). A multisensory fMRI investigation of nociceptive-preferential cortical regions and responses. Frontiers in Neuroscience, 15, 635733. 10.3389/fnins.2021.635733 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

qkad104_Supplementary_Data

Data Availability Statement

Data were provided by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. The code used to obtain the results of Sections 4 and 5 is freely available and may be found at: https://github.com/TomMaullin/ConfSets


Articles from Journal of the Royal Statistical Society. Series B, Statistical Methodology are provided here courtesy of Oxford University Press

RESOURCES