Simultaneous confidence corridors for mean functions in functional data analysis of imaging data

Yueying Wang; Guannan Wang; Li Wang; R Todd Ogden

doi:10.1111/biom.13156

Author manuscript; available in PMC: 2020 Jun 23.

Published in final edited form as: Biometrics. 2019 Nov 6;76(2):427–437. doi: 10.1111/biom.13156

PMCID: PMC7310608 NIHMSID: NIHMS1593433 PMID: 31544958

Abstract

Motivated by recent work involving the analysis of biomedical imaging data, we present a novel procedure for constructing simultaneous confidence corridors for the mean of imaging data. We propose to use flexible bivariate splines over triangulations to handle an irregular domain of the images that is common in brain imaging studies and in other biomedical imaging applications. The proposed spline estimators of the mean functions are shown to be consistent and asymptotically normal under some regularity conditions. We also provide a computationally efficient estimator of the covariance function and derive its uniform consistency. The procedure is also extended to the two-sample case in which we focus on comparing the mean functions from two populations of imaging data. Through Monte Carlo simulation studies, we examine the finite sample performance of the proposed method. Finally, the proposed method is applied to analyze brain positron emission tomography data in two different studies. One data set used in preparation of this article was obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database.

Keywords: bivariate splines, functional principal component analysis, image analysis, semiparametric efficiency, triangulation

1 |. INTRODUCTION

In recent years, as digital technology advanced significantly, valuable imaging data of body structures and organs can be easily collected during routine clinical practice. This new paradigm presents new opportunities to innovate in both research and clinical settings. Medical imaging technology has revolutionized health care over the past three decades, allowing doctors to find or detect tumors and other abnormalities and evaluate the effectiveness of treatment. Functional data analysis (FDA) provides modern analytical tools for imaging data, which can be viewed as realizations of random functions. Let Ω be a two-dimensional bounded domain, and z = (z₁, z₂) be a point in Ω. The model we consider is

Y_{i} (z) = μ (z) + η_{i} (z) + σ (z) ε_{i} (z), i = 1, \dots, n, z \in Ω,

(1)

which is one instance of the general function-on-scalar regression model. In model (1), Y_i (z) denotes the imaging measurement at location z ∈ Ω; μ(z) is the mean function; η_i (z) is a stochastic process indexed by z, which characterizes subject-level image variations; and σ (z) is a positive deterministic function. We assume that η_i (z) and ε_i (z) are mutually independent; that the η_i (z) are i.i.d. copies of an L₂ stochastic process η(z) with mean zero and covariance function G_η(z, z′); and that the ε_i (z) are i.i.d. copies of a stochastic process ε(z) with mean zero and covariance function Cov{ε(z), ε(z′)} = I (z = z′).

For biomedical imaging data, the objects (eg, tumor tissues, brain regions, etc) appearing in the images are typically irregularly shaped. Many smoothing methods in the literature, such as tensor product smoothing, kernel smoothing, and wavelet smoothing, suffer from the problem of “leakage” across the complex domains, that is, poor estimation over difficult regions as a result of smoothing inappropriately across boundaries of features.

In this article, we endeavor to address these challenges by applying bivariate splines over triangulations (Lai and Wang, 2013) to preserve important features (shape, smoothness) of imaging data. Spline functions defined this way offer more flexibility and varying amounts of smoothness, allowing us to better approximate the mean functions. We study the asymptotic properties of the spline estimators of μ(z) by using bivariate penalized splines (BPS) defined on triangulations and show that our estimator is consistent and asymptotically normal.

In addition, when analyzing biomedical imaging data, such as brain images, typical questions lie in estimating the mean function, μ(z), together with quantifying the estimation uncertainty and making comparisons between populations. However, making a statistically rigorous inference for imaging data is challenging, and one of the main obstacles is the complicated spatial correlation structure. The prevailing analytic technique, termed the “mass univariate” approach, involves regarding each pixel/voxel as a unit, and for each unit, making a traditional univariate statistical inference, such as a simple t test. The obvious multiple comparisons issue can be dealt with in many ways; popular approaches include the Bonferroni correction, the random field theory (Worsley et al., 2004; Adler and Taylor, 2007; Siegmund et al., 2011), and the cluster threshold-based approach (Forman et al., 1995).

However, many of the multiple testing methods are ad hoc and involve setting the threshold by eye, based on the practitioner’s experience and knowledge. Our simulation study in Supporting Information Appendix A also demonstrates that those ad hoc methods depend heavily on the choice of the threshold. In this article, we propose an alternative approach that treats the imaging data as an instance of functional data, regarded as being continuously defined but observed on a regular grid. If we consider the imaging data as being functional, attention naturally turns from considering each pixel/voxel as the basic analytical unit toward analyzing the entire image simultaneously, for instance, by calculating simultaneous confidence corridors (SCCs; also called “simultaneous confidence bands” or “uniform confidence bands/regions”). As pointed out in Choi and Reimherr (2018) and Degras (2017), conventional multiple comparison methods are less useful in the functional data setup because the infinite cardinality of the domain would lead to unbounded confidence regions.

In statistics, SCCs are vital and fundamental tools for inference on the global behavior of functions (Degras, 2017). However, they have received relatively little attention in the FDA literature. Moreover, existing SCC work for FDA has concentrated on the one-dimensional case. For the development of SCCs for mean curves of functional data, see the simulation-based techniques (Degras, 2011; Cao et al., 2012; Zheng et al., 2014; Cao and Wang, 2018), the functional principal component (FPC) decomposition-based approach (Goldsmith et al., 2013), and the geometric approach by Choi and Reimherr (2018) in Hilbert spaces. Zhu et al. (2012) proposed SCCs for the regression coefficient functions in the multivariate varying coefficient model for functional responses. Gu et al. (2014) and Chang et al. (2017) proposed SCCs for coefficient functions in the function-on-scalar regression model. However, there is scant literature on SCCs for imaging data or other more general two-dimensional (2D) functions. Although the geometric method in Choi and Reimherr (2018) can be used to construct SCCs in Hilbert spaces over rectangular domains, it does not work well for objects over complex domains with arbitrary shape, which are very common in biomedical imaging studies. In addition, the geometric method is conservative because it is essentially based on a modification of Scheffé’s method.

In this article, we derive SCCs with exact coverage probability for the 2D functional mean function μ(z), z ∈ Ω, in (1) via the extreme value theory of Gaussian processes (Adler, 1990) and approximating mean functions with bivariate splines. Our simulation studies indicate that the proposed SCCs are computationally efficient and have the correct coverage probability for finite samples. We also show that the spline estimator and the accompanying SCC are asymptotically the same as if all the images are observed without noise.

Motivated by the need to statistically quantify the difference between two imaging data sets arising in medical imaging studies, we further consider two-sample inference and extend our SCC construction procedure to a two-sample problem. Specifically, we focus on constructing SCC for the difference of the mean functions from two independent samples. The comparison of mean functions is particularly useful for imaging analysis in some biomedical settings such as comparing imaging outcomes for groups randomized either to placebo or to active treatment. Any mean differences may be localized and irregularly shaped, and so an estimation method should be flexible enough to allow for such differences. The approach developed here allows comparison of treatments simultaneously across the entire domain of interest.

We organize our article as follows. Section 2 describes the BPS estimators, and establishes their asymptotic properties for imaging data. Section 3 proposes asymptotic pointwise confidence intervals and SCCs that are constructed based on the BPS estimators. In Section 4, we discuss how to estimate the unknown components involved in the SCC construction and other issues of implementation. Section 5 reports findings from a simulation study. In Section 6, we apply the proposed methods to two real brain imaging data sets. In Section 7, we conclude the article with some discussions. Proofs of the theoretical results and additional numerical results are provided in Supporting Information.

2 |. MODELS AND ESTIMATION METHOD

In practice, the functional imaging response variable, Y_i (⋅), is only measured on a regular grid of pixels, z_j ∈ Ω, j = 1, …, N. For notational simplicity, we let Y_ij = Y_i (z_j) be the imaging response of subject i at location j, and the actual data set consists of {(Y_ij, z_j)}, i = 1, …, n, j = 1, …, N, which can be modeled as

Y_{i j} = μ (z_{j}) + η_{i} (z_{j}) + σ (z_{j}) ε_{i j} .

(2)

2.1 |. Bivariate spline basis approximation over triangulations

For model (2), we first consider the estimation of the mean function, μ(⋅). Medical imaging data are typically observed on an irregular domain Ω. We approximate the mean function in (2) by the bivariate splines that are piecewise polynomial functions over a 2D triangulated domain; see Lai and Wang (2013). In the following, we briefly introduce the techniques of triangulations and describe the BPS smoothing method.

Triangulation is an effective tool for handling data distributed on irregular regions with complex boundaries and/or interior holes. In the following, we use T to denote a triangle, that is, the convex hull of three points that are not collinear. A collection Δ = {T₁, …, T_M} of M triangles is called a triangulation of $Ω = \cup_{m = 1}^{M} T_{m}$ if any nonempty intersection between a pair of triangles in Δ is either a shared vertex or a shared edge. Given a triangle T ∈ Δ, let |T| be its longest edge length; then the size of Δ is defined as |Δ| := max {|T|, T ∈ Δ}, that is, the length of the longest edge of all triangles in Δ.
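As a small illustration of the definition of |Δ|, the following sketch computes the longest edge of each triangle and of the whole triangulation. The data layout (a list of vertex coordinates plus vertex-index triples) and the function names are assumptions made for this example only; they are not taken from the BPST or Triangulation packages.

```python
from math import hypot

def triangle_size(tri):
    """|T|: longest edge length of a triangle given as three (x, y) vertices."""
    a, b, c = tri
    return max(
        hypot(a[0] - b[0], a[1] - b[1]),
        hypot(b[0] - c[0], b[1] - c[1]),
        hypot(c[0] - a[0], c[1] - a[1]),
    )

def triangulation_size(vertices, triangles):
    """|Delta|: longest edge length over all triangles (vertex-index triples)."""
    return max(triangle_size([vertices[i] for i in t]) for t in triangles)

# Hypothetical toy triangulation: the unit square split along one diagonal.
vertices = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
triangles = [(0, 1, 2), (0, 2, 3)]
size = triangulation_size(vertices, triangles)
```

Here the two triangles share the diagonal edge, so the triangulation condition (shared vertex or shared edge) holds and |Δ| is the diagonal length.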

For an integer r ⩾ 0, let $C^{r} (Ω)$ be the collection of all r-times continuously differentiable functions over Ω. Given a triangulation Δ, let $S_{d}^{r} (Δ) = \{s \in C^{r} (Ω) : s|_{T} \in ℙ_{d} (T), T \in Δ\}$ be the spline space of degree d and smoothness r over the triangulation Δ, where s|_T is the polynomial piece of the spline s restricted to the triangle T, and $ℙ_{d}$ is the space of all polynomials of degree less than or equal to d. We use Bernstein basis polynomials to represent the bivariate splines. For any triangle T ∈ Δ and any fixed point z ∈ Ω, let b₁, b₂, and b₃ be the barycentric coordinates of z relative to T. Then, the Bernstein basis polynomials of degree d relative to triangle T are defined as $B_{i j k}^{T, d} (z) = {(i! j! k!)}^{- 1} d! \, b_{1}^{i} b_{2}^{j} b_{3}^{k}$, i + j + k = d. Let $\{B_{m}\}_{m \in \mathcal{M}}$ be the set of degree-d bivariate Bernstein basis polynomials for $S_{d}^{r} (Δ)$, where $\mathcal{M}$ stands for an index set of Bernstein basis polynomials. Denote by B the evaluation matrix of Bernstein basis polynomials, where the jth row of B is given by $B^{T} (z_{j}) = \{B_{m} (z_{j}), m \in \mathcal{M}\}$, for j = 1, …, N. We can approximate the mean function μ(z) by $μ (z) ≈ B^{T} (z) γ$, where $γ^{T} = (γ_{m}, m \in \mathcal{M})$ is the spline coefficient vector. The above bivariate spline basis can be easily constructed via the R package BPST.
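To make the Bernstein basis concrete, the sketch below evaluates the barycentric coordinates of a point relative to a single triangle and then all degree-d basis polynomials on that triangle. It is written in Python for illustration and is not the BPST implementation; the function names and the toy triangle are assumptions.

```python
from math import factorial

def barycentric(z, tri):
    """Barycentric coordinates (b1, b2, b3) of the point z relative to triangle tri."""
    (x1, y1), (x2, y2), (x3, y3) = tri
    x, y = z
    det = (y2 - y3) * (x1 - x3) + (x3 - x2) * (y1 - y3)  # twice the signed area
    b1 = ((y2 - y3) * (x - x3) + (x3 - x2) * (y - y3)) / det
    b2 = ((y3 - y1) * (x - x3) + (x1 - x3) * (y - y3)) / det
    return b1, b2, 1.0 - b1 - b2

def bernstein_basis(z, tri, d):
    """Values B_{ijk}(z) of all degree-d Bernstein polynomials, keyed by (i, j, k)."""
    b1, b2, b3 = barycentric(z, tri)
    values = {}
    for i in range(d + 1):
        for j in range(d - i + 1):
            k = d - i - j
            coef = factorial(d) / (factorial(i) * factorial(j) * factorial(k))
            values[(i, j, k)] = coef * b1 ** i * b2 ** j * b3 ** k
    return values

# Hypothetical example: reference triangle, interior point, degree d = 4.
tri = ((0.0, 0.0), (1.0, 0.0), (0.0, 1.0))
basis = bernstein_basis((0.25, 0.25), tri, d=4)
```

On one triangle there are (d + 1)(d + 2)/2 basis polynomials, and by the multinomial theorem they sum to one at every point, which is a convenient sanity check for an implementation.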

To define the penalized spline method, for any function g(z) and direction z_h, h = 1, 2, let $\nabla_{z_{h}}^{v} g (z)$ denote the v-th order derivative in the direction z_h at the point z. We consider the following penalized least squares problem: $\min_{g \in S_{d}^{r} (Δ)} \sum_{i = 1}^{n} \sum_{j = 1}^{N} \{Y_{i j} - g (z_{j})\}^{2} + ρ_{n} E (g)$, where $E (g) = \sum_{T \in Δ} \int_{T} \sum_{i + j = 2} \binom{2}{i} {(\nabla_{z_{1}}^{i} \nabla_{z_{2}}^{j} g)}^{2} d z_{1} d z_{2}$ is the roughness penalty, and ρ_n is the roughness penalty parameter. To meet the smoothness requirement of the splines, we need to impose linear constraints on the spline coefficients γ, specifically Hγ = 0. See Section B.2 of the Supplementary Material of Yu et al. (2019) for a simple example of H. Thus, we have to minimize $\sum_{i = 1}^{n} \sum_{j = 1}^{N} \{Y_{i j} - B^{T} (z_{j}) γ\}^{2} + ρ_{n} γ^{T} P γ$, subject to Hγ = 0, where P is the block diagonal penalty matrix satisfying $γ^{T} P γ = E (B^{T} γ)$.

We first remove the constraint via the QR decomposition of $H^{T}$: $H^{T} = Q R = (Q_{1} \; Q_{2}) \begin{pmatrix} R_{1} \\ R_{2} \end{pmatrix}$, where Q is orthogonal and R is upper triangular, the submatrix Q₁ consists of the first p columns of Q, where p is the rank of H, and R₂ is a matrix of zeros. Next, we reparametrize using γ = Q₂θ for some θ, which guarantees that Hγ = 0 because $H Q_{2} = R_{1}^{T} Q_{1}^{T} Q_{2} = 0$. The minimization problem is thus converted to a conventional unrestricted penalized regression problem:

\sum_{i = 1}^{n} \sum_{j = 1}^{N} \{Y_{i j} - {\tilde{B}}^{T} (z_{j}) θ\}^{2} + ρ_{n} θ^{T} Q_{2}^{T} P Q_{2} θ,

(3)

where $\tilde{B} (z) = Q_{2}^{T} B (z)$ . Denote ${\bar{Y}}_{\cdot, j} = n^{- 1} \sum_{i = 1}^{n} Y_{i j}$ , $\bar{Y} = {({\bar{Y}}_{\cdot, 1}, \dots, {\bar{Y}}_{\cdot, N})}^{T}$ , U = BQ₂, and $D = Q_{2}^{T} P Q_{2}$ . Then, minimizing (3) is equivalent to minimizing

{‖ \bar{Y} - B Q_{2} θ ‖}^{2} + n^{- 1} ρ_{n} θ^{T} Q_{2}^{T} P Q_{2} θ = ‖ \bar{Y} - U θ ‖^{2} + n^{- 1} ρ_{n} θ^{T} D θ,

and the solution is given by $\hat{θ} = (U^{T} U + n^{- 1} ρ_{n} D)^{- 1} U^{T} \bar{Y}$. Thus, the estimators of γ and μ(⋅) are $\hat{γ} = Q_{2} \hat{θ}$ and $\hat{μ} (z) = B {(z)}^{T} \hat{γ}$.
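The QR reparametrization and the closed-form solution can be sketched in a few lines of NumPy. This is an illustration under stated assumptions rather than the authors' implementation: the basis matrix B, constraint matrix H, and penalty matrix P below are small hypothetical stand-ins (in practice they would come from the spline construction), and the sketch assumes that H has full row rank so that an unpivoted QR decomposition isolates the range of $H^{T}$ in the first p columns of Q.

```python
import numpy as np

def null_space_basis(H):
    """Q2 from the complete QR decomposition of H^T; its columns satisfy H @ Q2 = 0."""
    H = np.atleast_2d(np.asarray(H, dtype=float))
    p = np.linalg.matrix_rank(H)
    if p != H.shape[0]:
        # With dependent rows, an unpivoted QR need not isolate the range in Q1.
        raise ValueError("this sketch assumes H has full row rank")
    Q, _ = np.linalg.qr(H.T, mode="complete")  # H^T = Q R with Q square orthogonal
    return Q[:, p:]

def bps_solve(B, H, P, Y, rho):
    """Constrained penalized fit: returns (theta_hat, gamma_hat, mu_hat at the pixels)."""
    n = Y.shape[0]
    Ybar = Y.mean(axis=0)            # pixelwise average image
    Q2 = null_space_basis(H)
    U = B @ Q2                       # reparametrized design matrix
    D = Q2.T @ P @ Q2                # reparametrized penalty matrix
    theta = np.linalg.solve(U.T @ U + (rho / n) * D, U.T @ Ybar)
    gamma = Q2 @ theta
    return theta, gamma, B @ gamma

# Hypothetical toy inputs: random "basis", two linear constraints, identity penalty.
rng = np.random.default_rng(0)
n, N, m = 20, 60, 6
B = rng.normal(size=(N, m))
H = np.array([[1.0, -1.0, 0.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, -1.0, 0.0, 0.0]])
P = np.eye(m)
Y = rng.normal(size=(n, N))
theta_hat, gamma_hat, mu_hat = bps_solve(B, H, P, Y, rho=1.0)
```

Solving the linear system directly, rather than forming the matrix inverse, is a numerical design choice of this sketch; the resulting coefficient vector satisfies the constraint Hγ = 0 by construction.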

2.2 |. Functional principal component analysis

For the second component, η_i (z), in model (2), we consider a spectral decomposition of its covariance function G_η(z, z′). Denote the eigenvalue and eigenfunction sequences of the covariance operator G_η(z, z′) as $\{λ_{k}\}_{k = 1}^{\infty}$ and $\{ψ_{k} (z)\}_{k = 1}^{\infty}$, in which $λ_{1} ⩾ λ_{2} ⩾ \dots ⩾ 0$, $\sum_{k = 1}^{\infty} λ_{k} < \infty$, and $\{ψ_{k}\}_{k = 1}^{\infty}$ form an orthonormal basis of L₂ (Ω). It follows from spectral theory that $G_{η} (z, z^{'}) = \sum_{k = 1}^{\infty} λ_{k} ψ_{k} (z) ψ_{k} (z^{'})$. The ith stochastic process {η_i (z), z ∈ Ω} allows the Karhunen-Loève L₂ representation $η_{i} (z) = \sum_{k = 1}^{\infty} ξ_{i k} ϕ_{k} (z)$, where $ϕ_{k} (z) = λ_{k}^{1 / 2} ψ_{k} (z)$, and the coefficients ξ_ik are uncorrelated random variables with mean 0 and E (ξ_ik ξ_ik′) = I (k = k′); ξ_ik is referred to as the kth FPC score of the ith subject in classical functional principal component analysis (FPCA). Thus, the response measurements in (2) can be represented as follows:

Y_{i j} = μ (z_{j}) + \sum_{k = 1}^{\infty} ξ_{i k} ϕ_{k} (z_{j}) + σ (z_{j}) ε_{i j} .

Next, we describe the method of estimating the FPCA components: the covariance function G_η(z, z′) and its eigenvalues and eigenfunctions. For any i = 1, …, n, j = 1, …, N, let ${\hat{R}}_{i j} = Y_{i j} - \hat{μ} (z_{j})$ be the residual. We estimate η_i (z) individually by applying the bivariate spline smoothing method to $\{({\hat{R}}_{i j}, z_{j})\}_{j = 1}^{N}$. To be more specific, for each i = 1, …, n, we define the spline estimator of η_i (z) as ${\hat{η}}_{i} (z) = \arg \min_{g_{i} \in S_{d}^{r} (Δ^{*})} \sum_{j = 1}^{N} \{{\hat{R}}_{i j} - g_{i} (z_{j})\}^{2} + ρ_{n}^{*} E (g_{i})$, where the triangulation Δ* and smoothness penalty $ρ_{n}^{*}$ may be different from those introduced in Section 2.1 for estimating μ(z). Next, define the estimator of G_η(z, z′) as

{\hat{G}}_{η} (z, z^{'}) = n^{- 1} \sum_{i = 1}^{n} {\hat{η}}_{i} (z) {\hat{η}}_{i} (z^{'}),

(4)

and we estimate the eigenfunctions ψ_k (⋅) using the following eigenequations:

\int_{Ω} {\hat{G}}_{η} (z, z^{'}) {\hat{ψ}}_{k} (z) d z = {\hat{λ}}_{k} {\hat{ψ}}_{k} (z^{'}),

(5)

where the ${\hat{ψ}}_{k}$’s are subject to $\int_{Ω} {\hat{ψ}}_{k}^{2} (z) d z = 1$ and $\int_{Ω} {\hat{ψ}}_{k} (z) {\hat{ψ}}_{k^{'}} (z) d z = 0$ for k′ < k. If N is sufficiently large, the left-hand side of (5), evaluated at z′ = z_j′, can be approximated by $\sum_{j = 1}^{N} {\hat{G}}_{η} (z_{j}, z_{j^{'}}) {\hat{ψ}}_{k} (z_{j}) A (z_{j})$, where A(z_j) is the area of the pixel z_j.
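One way to solve the discretized eigenequations numerically is to symmetrize the area-weighted covariance matrix and call a symmetric eigensolver. The sketch below assumes the smoothed deviations ${\hat{η}}_{i} (z_{j})$ are already available as an n × N array; the function name, the toy grid, and the toy deviations are hypothetical and serve only to exercise the code.

```python
import numpy as np

def discretized_fpca(eta_hat, area, n_components):
    """eta_hat: (n, N) smoothed deviations at the pixels; area: (N,) pixel areas A(z_j).
    Returns the leading eigenvalues and the eigenfunctions evaluated at the pixels."""
    n = eta_hat.shape[0]
    G = eta_hat.T @ eta_hat / n                 # G_hat(z_j, z_j') as in (4)
    w = np.sqrt(area)
    S = w[:, None] * G * w[None, :]             # A^{1/2} G A^{1/2}, symmetric
    evals, evecs = np.linalg.eigh(S)            # eigenvalues in ascending order
    order = np.argsort(evals)[::-1][:n_components]
    lam = evals[order]
    psi = evecs[:, order] / w[:, None]          # undo the weighting: psi = A^{-1/2} v
    return lam, psi

# Hypothetical toy data on an 8 x 8 grid of pixel centers in the unit square.
rng = np.random.default_rng(2)
g = (np.arange(8) + 0.5) / 8
z1, z2 = (a.ravel() for a in np.meshgrid(g, g, indexing="ij"))
area = np.full(z1.size, 1.0 / z1.size)
scores = rng.standard_normal((100, 2)) * np.sqrt([0.5, 0.2])
eta_hat = scores @ np.vstack([np.sin(np.pi * z1), np.cos(np.pi * z2)])
lam_hat, psi_hat = discretized_fpca(eta_hat, area, n_components=2)
```

Because the eigenvectors of the symmetrized matrix have unit Euclidean norm, the rescaled columns of `psi_hat` satisfy the discrete version of the normalization and orthogonality constraints with respect to the pixel areas.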

2.3 |. Theoretical properties of the estimators

We investigate the asymptotic properties of the proposed spline estimators. To discuss these properties, we first introduce some notation. For any function g over the closure of domain Ω, denote by $‖ g ‖_{L_{2} (Ω)}^{2} = \int_{Ω} g^{2} (z) d z$ the regular L₂ norm of g, and by $‖ g ‖_{\infty, Ω} = \sup_{z \in Ω} | g (z) |$ the supremum norm of g. Let $| g |_{v, \infty, Ω} = \max_{i + j = v} ‖ \nabla_{z_{1}}^{i} \nabla_{z_{2}}^{j} g ‖_{\infty, Ω}$ be the maximum norm of all the v-th order derivatives of g over Ω. For notational simplicity, we suppress the subscript Ω below. Given random variables S_n for n ⩾ 1, we write S_n = O_P (b_n) if $\lim_{c \to \infty} \limsup_{n} P (| S_{n} | ⩾ c b_{n}) = 0$. Similarly, we write S_n = o_P (b_n) if $\lim_{n} P (| S_{n} | ⩾ c b_{n}) = 0$ for any constant c > 0.

The following theorem provides the L₂ and uniform convergence rate of $\hat{μ} (\cdot)$ . The detailed proofs of this theorem are given in Web Appendix B.3 of Supporting Information.

Theorem 1. Suppose Assumptions (A1) to (A4) in Web Appendix B of Supporting Information hold, and $N^{1 / 2} | Δ | \to \infty$ as N → ∞. Then the bivariate penalized spline estimator $\hat{μ} (\cdot)$ is consistent and satisfies

‖ \hat{μ} - μ ‖_{L_{2}} = O_{P} \left\{ \frac{ρ_{n}}{n N | Δ |^{3}} | μ |_{2, \infty} + \left( 1 + \frac{ρ_{n}}{n N | Δ |^{5}} \right) | Δ |^{d + 1} | μ |_{d + 1, \infty} + \frac{1}{\sqrt{n}} + \frac{1}{\sqrt{n N} | Δ |} \right\} .

In addition, if Assumptions (A1) to (A5) hold, we have $‖ \hat{μ} - μ ‖_{\infty} = o_{P} \{(n^{- 1} \log (n))^{1 / 2}\}$ and $‖ \hat{μ} - μ ‖_{L_{2}} = O_{P} (n^{- 1 / 2})$.

Theorem 2 characterizes the uniform weak convergence of ${\hat{G}}_{η} (z, z^{'})$ and the convergence of ${\hat{ψ}}_{k}$ and ${\hat{λ}}_{k}$ .

Theorem 2. Under Assumptions (A1) to (A7) in Web Appendix B of Supporting Information, we have the following results: (a) The spline estimator ${\hat{G}}_{η} (z, z^{'})$ in (4) uniformly converges to G_η(z, z′) in probability, that is, $\sup_{(z, z^{'}) \in Ω^{2}} | {\hat{G}}_{η} (z, z^{'}) - G_{η} (z, z^{'}) | = o_{P} (1)$ ; (b) $‖ {\hat{ψ}}_{k} - ψ_{k} ‖ = o_{P} (1)$ , $| {\hat{λ}}_{k} - λ_{k} | = o_{P} (1)$ , for k = 1, …, κ.

In theory, the Karhunen-Loève representation of the covariance function consists of an infinite number of terms. In applications, however, it is typical to truncate the spectral decomposition at an integer κ chosen so as to account for some predetermined proportion of the variance. One can also select the number of principal components using the Akaike information criterion (AIC; Yao et al., 2005) or Bayesian information criterion (BIC; Li et al., 2013).
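The proportion-of-variance rule for the truncation level can be sketched as follows. The function name, the example eigenvalues, and the 95% threshold are illustrative assumptions; the text above does not prescribe a particular proportion.

```python
import numpy as np

def n_components_by_variance(eigenvalues, threshold=0.95):
    """Smallest number of components whose cumulative variance share reaches threshold."""
    lam = np.clip(np.sort(np.asarray(eigenvalues, dtype=float))[::-1], 0.0, None)
    share = np.cumsum(lam) / lam.sum()
    # First index whose cumulative share is at least the threshold, capped at len(lam).
    kappa = min(int(np.searchsorted(share, threshold)) + 1, lam.size)
    return kappa, share

# Hypothetical estimated eigenvalues.
kappa, share = n_components_by_variance([0.5, 0.2, 0.05, 0.01], threshold=0.95)
```

Negative eigenvalue estimates, which can arise numerically, are clipped to zero before the shares are computed.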

3 |. SIMULTANEOUS CONFIDENCE CORRIDORS

3.1 |. One sample

Let G_η(⋅,⋅) be a positive definite function defined as $G_{η} (z, z^{'}) = \sum_{k = 1}^{κ} λ_{k} ψ_{k} (z) ψ_{k} (z^{'})$, z, z′ ∈ Ω. Denote by ζ (z), z ∈ Ω, a standardized Gaussian process such that Eζ (z) = 0, Eζ² (z) = 1, with covariance function $E ζ (z) ζ (z^{'}) = G_{η} (z, z^{'}) \{G_{η} (z, z) G_{η} (z^{'}, z^{'})\}^{- 1 / 2}$, z, z′ ∈ Ω. Denote by $q_{1 - α}$ the 100(1 − α)th percentile of the distribution of the absolute maximum of ζ (z), z ∈ Ω, that is, $P \{\sup_{z \in Ω} | ζ (z) | ⩽ q_{1 - α}\} = 1 - α$, α ∈ (0,1).

Define the “oracle” estimator $\bar{μ} (z) = μ (z) + n^{- 1} \sum_{i = 1}^{n} η_{i} (z)$, which is infeasible due to the finite pixel grid {z_j: j = 1, …, N} and the measurement error. The following theorem presents the asymptotic properties of $\bar{μ} (z)$ and shows that the difference between the BPS estimator $\hat{μ} (z)$ and the “oracle” smoother $\bar{μ} (z)$ is uniformly of order $o_{P} (n^{- 1 / 2})$.

Theorem 3. Under Assumptions (A1) to (A6) in Web Appendix B of Supporting Information, for any α ∈ (0,1), as N → ∞, n → ∞,

P \{\sup_{z \in Ω} n^{1 / 2} | \bar{μ} (z) - μ (z) | G_{η} {(z, z)}^{- 1 / 2} ⩽ q_{1 - α}\} \to 1 - α, \quad \text{and} \quad \sup_{z \in Ω} | \bar{μ} (z) - \hat{μ} (z) | = o_{P} (n^{- 1 / 2}) .

Based on Theorems 1 and 3, we obtain the following asymptotic SCCs for μ(z), z ∈ Ω.

Corollary 1. Under the assumptions of Theorem 3, for any α ∈ (0,1), as N → ∞, n → ∞, an asymptotic 100(1 − α)% exact SCC for μ(z) is $\hat{μ} (z) \pm n^{- 1 / 2} q_{1 - α} G_{η} {(z, z)}^{1 / 2}$ .

3.2 |. Extension to two-sample case

While one-sample SCCs are of primary interest in many situations, in some brain imaging analysis, interest lies in comparing two groups, for example, patients and normal control subjects. Next, we extend our method to two-sample problems, constructing SCCs for the difference between mean functions from two independent groups, analogous to a two-sample t test.

Suppose we are given two groups of imaging observations with sample sizes n₁ and n₂, respectively, defined on a common region Ω. For H = 1, 2, let $G_{H η} (z, z^{'}) = \sum_{k = 1}^{κ_{H}} ϕ_{H k} (z) ϕ_{H k} (z^{'})$ be a positive definite function and ${\hat{μ}}_{H}$ be the spline estimate of the group mean function μ_H. Let V (z, z′) = G_1η(z, z′) + τG_2η(z, z′), where $τ = \lim_{n_{1} \to \infty} n_{1} / n_{2}$. Denote by W (z), z ∈ Ω, a standardized Gaussian process such that EW (z) = 0, EW² (z) = 1, with covariance $E [W (z) W (z^{'})] = \{V (z, z)\}^{- 1 / 2} V (z, z^{'}) \{V (z^{'}, z^{'})\}^{- 1 / 2}$. Denote by $q_{12, α}$ the 100(1 − α)th percentile of the distribution of the absolute maximum of W (z), z ∈ Ω.

Theorem 4. Under Assumptions (A1) to (A6) in Web Appendix B of Supporting Information, for any α ∈ (0,1), as N → ∞, n₁ → ∞,

P \left\{ \sup_{z \in Ω} \frac{n_{1}^{1 / 2} | ({\hat{μ}}_{1} - {\hat{μ}}_{2}) (z) - (μ_{1} - μ_{2}) (z) |}{\sqrt{V (z, z)}} ⩽ q_{12, α} \right\} \to 1 - α .

Theorem 4 suggests that an asymptotic 100(1 − α)% exact SCC for (μ₁ − μ₂)(z) can be constructed as $({\hat{μ}}_{1} - {\hat{μ}}_{2}) (z) \pm n_{1}^{- 1 / 2} q_{12, α} \{V (z, z)\}^{1 / 2}$.
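The form of V (z, z′) can be motivated by a short covariance calculation for two-sample analogues of the “oracle” estimator of Section 3.1. The notation $\bar{μ}_{H}$ and $η_{H i}$ below is an assumption introduced for this sketch (group-specific versions of $\bar{μ}$ and η_i), and the calculation uses only the independence of the two samples and of subjects within each sample:

```latex
\begin{aligned}
\bar{\mu}_H(z) &= \mu_H(z) + n_H^{-1} \sum_{i=1}^{n_H} \eta_{Hi}(z), \qquad H = 1, 2, \\
\operatorname{Cov}\!\left[ n_1^{1/2} \{(\bar{\mu}_1 - \bar{\mu}_2) - (\mu_1 - \mu_2)\}(z),\;
  n_1^{1/2} \{(\bar{\mu}_1 - \bar{\mu}_2) - (\mu_1 - \mu_2)\}(z') \right]
 &= n_1 \left\{ n_1^{-1} G_{1\eta}(z, z') + n_2^{-1} G_{2\eta}(z, z') \right\} \\
 &= G_{1\eta}(z, z') + \frac{n_1}{n_2}\, G_{2\eta}(z, z')
 \;\longrightarrow\; V(z, z') \quad \text{as } n_1 / n_2 \to \tau .
\end{aligned}
```

Standardizing by $\{V (z, z)\}^{1 / 2}$ then yields a process with the covariance structure of W (z).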

4 |. IMPLEMENTATION

Without loss of generality, we describe the implementation of the proposed SCCs for the one-sample case. The procedure can be similarly adapted to the two-sample case.

4.1 |. Quantile estimation and smoothing parameter selection

The quantile $q_{1 - α}$ used to construct the SCCs in Corollary 1 cannot be obtained analytically; however, it can be approximated by numerical simulation as follows. First, we simulate $ζ_{b} (z) = {\hat{G}}_{η}^{- 1 / 2} (z, z) \sum_{k = 1}^{κ} {\hat{λ}}_{k}^{1 / 2} Z_{k, b} {\hat{ψ}}_{k} (z)$, where the Z_k,b are i.i.d. standard normal variables with 1 ⩽ k ⩽ κ and b = 1, …, B for a preset large integer B. Then, we take the maximal absolute value of each copy ζ_b (z) over z ∈ Ω and estimate $q_{1 - α}$ by the empirical 100(1 − α)th percentile of these B maxima. To construct the SCC for the two-sample case, denote $\hat{V} (z, z^{'}) = {\hat{G}}_{1 η} (z, z^{'}) + τ {\hat{G}}_{2 η} (z, z^{'})$, where in practice τ is replaced by n₁/n₂. We simulate ${\hat{W}}_{b} (z) = \{\hat{V} (z, z)\}^{- 1 / 2} \{\sum_{k = 1}^{κ_{1}} {\hat{λ}}_{1 k}^{1 / 2} Z_{1 k, b} {\hat{ψ}}_{1 k} (z) - {(n_{1} / n_{2})}^{1 / 2} \sum_{k = 1}^{κ_{2}} {\hat{λ}}_{2 k}^{1 / 2} Z_{2 k, b} {\hat{ψ}}_{2 k} (z)\}$, z ∈ Ω. Then, $q_{12, α}$ can be estimated by the empirical 100(1 − α)th percentile of the B simulated $‖ {\hat{W}}_{b} ‖_{\infty}$’s, b = 1, …, B.
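The one-sample quantile simulation can be sketched in NumPy as follows. The sketch evaluates the process on the pixel grid only, so the supremum over Ω is approximated by a maximum over pixels, and it computes ${\hat{G}}_{η} (z, z)$ from the truncated eigen-expansion; both choices, along with the function names and the toy eigenpairs, are assumptions of this illustration.

```python
import numpy as np

def simulate_scc_quantile(lam, psi, alpha=0.05, n_draws=1000, seed=0):
    """lam: (kappa,) eigenvalues; psi: (N, kappa) eigenfunctions at the pixels.
    Returns the simulated quantile and the diagonal G_hat(z_j, z_j)."""
    rng = np.random.default_rng(seed)
    phi = psi * np.sqrt(lam)                       # lambda_k^{1/2} psi_k(z_j)
    G_diag = (phi ** 2).sum(axis=1)                # truncated G_hat(z_j, z_j)
    Z = rng.standard_normal((n_draws, lam.size))   # Z_{k,b}
    zeta = (Z @ phi.T) / np.sqrt(G_diag)           # zeta_b(z_j), unit variance
    sup_abs = np.abs(zeta).max(axis=1)             # max over pixels for each draw
    return np.quantile(sup_abs, 1.0 - alpha), G_diag

def scc(mu_hat, G_diag, q, n):
    """Lower and upper corridor limits mu_hat -/+ n^{-1/2} q G_hat(z, z)^{1/2}."""
    half = q * np.sqrt(G_diag / n)
    return mu_hat - half, mu_hat + half

# Hypothetical eigenpairs on a 20 x 20 grid of pixel centers in the unit square.
g = (np.arange(20) + 0.5) / 20
z1, z2 = (a.ravel() for a in np.meshgrid(g, g, indexing="ij"))
psi = np.column_stack([0.988 * np.sin(np.pi * z1) + 0.5,
                       2.157 * np.cos(np.pi * z2) - 0.084])
lam = np.array([0.5, 0.2])
q95, G_diag = simulate_scc_quantile(lam, psi, alpha=0.05)
lower, upper = scc(np.zeros(z1.size), G_diag, q95, n=100)
```

The two-sample quantile can be simulated along the same lines by drawing independent normal scores for each group and standardizing by $\hat{V} (z, z)^{1 / 2}$.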

Next, for a good fit of the data, it is necessary to choose a suitable value of the smoothing parameter ρ_n. A large value of ρ_n enforces a smoother fitted function at the cost of larger fitting errors, while a small ρ_n may result in overfitting of the data. Since the in-sample fitting errors cannot gauge the prediction accuracy of the fitted function, we select a criterion function that attempts to measure the out-of-sample performance of the fitted model. Minimizing the generalized cross-validation (GCV) criterion is one computationally efficient approach to selecting smoothing parameters that also has good theoretical properties. We choose the smoothing parameter by minimizing $GCV (ρ_{n}) = ‖ \bar{Y} - S (ρ_{n}) \bar{Y} ‖^{2} / (N [1 - tr \{S (ρ_{n})\} / N]^{2})$ over a grid of values of ρ_n, where $S (ρ_{n}) = U (U^{T} U + n^{- 1} ρ_{n} D)^{- 1} U^{T}$.
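A grid search over the GCV criterion can be sketched as below. The matrices U and D here are hypothetical stand-ins (a cosine basis on the unit interval with a diagonal roughness penalty) chosen so that the example is self-contained; in the actual procedure they would be the reparametrized spline design and penalty matrices. The trace of S(ρ_n) is computed through the identity tr{U M⁻¹ U^Τ} = tr{M⁻¹ U^Τ U}, which avoids forming the N × N smoother matrix.

```python
import numpy as np

def gcv_select(U, D, Ybar, n, rho_grid):
    """Return the grid value minimizing GCV and the GCV score at each grid value."""
    N = Ybar.size
    UtU, UtY = U.T @ U, U.T @ Ybar
    scores = []
    for rho in rho_grid:
        M = UtU + (rho / n) * D
        fitted = U @ np.linalg.solve(M, UtY)          # S(rho) Ybar
        trace_S = np.trace(np.linalg.solve(M, UtU))   # tr S(rho)
        rss = np.sum((Ybar - fitted) ** 2)
        scores.append(rss / (N * (1.0 - trace_S / N) ** 2))
    scores = np.array(scores)
    return rho_grid[int(np.argmin(scores))], scores

# Hypothetical toy problem: cosine basis and a diagonal roughness penalty.
rng = np.random.default_rng(1)
N, m, n = 200, 15, 50
x = (np.arange(N) + 0.5) / N
k = np.arange(m)
U = np.cos(np.pi * np.outer(x, k))
D = np.diag((np.pi * k) ** 4.0)
Ybar = np.sin(2 * np.pi * x) + rng.normal(scale=0.1, size=N)
rho_grid = 10.0 ** np.arange(-6, 3)
best_rho, scores = gcv_select(U, D, Ybar, n, rho_grid)
```

The grid of candidate values is an assumption of this example; in practice its range would be adapted to the scale of the data and of the penalty matrix.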

4.2 |. Spline basis and triangulation selection

To construct the SCC, we need to choose the spline basis functions and triangulation used in the BPS, a notoriously difficult task for constructing nonparametric pointwise confidence intervals or simultaneous confidence bands.

When the resolution of the imaging is relatively high and the mean image seems to be a realization of some smooth function without sharp edges, we suggest using smoothness parameter r = 1 with degree d ⩾ 4. When d ⩾ 5, the proposed spline achieves full estimation power asymptotically (Lai and Wang, 2013). It is generally believed that the subject-level image variations η_i are less smooth than the mean function. Thus, we suggest considering lower order splines, such as d = 2, when estimating the η_i’s.

An optimal triangulation is a partition of the domain that is best according to some criterion measuring the shape, size, or number of triangles. For example, a “good” triangulation usually refers to one with well-shaped triangles, that is, with no small angles and/or no obtuse angles. We suggest building the triangulated meshes using typical triangulation construction methods such as Delaunay triangulation (De Loera et al., 2010). The Matlab code DistMesh and the R package Triangulation can be used to construct the triangulation. When estimating the mean function μ(⋅), we suggest choosing the triangulation Δ_μ based on leave-images-out k-fold cross-validation (CV). In the estimation of the η_i (⋅)’s, we suggest choosing the triangulation Δ_η so as to minimize a bootstrap estimator of the coverage error of the SCCs. In Algorithm A1 in Supporting Information, we describe our selection scheme for the one-sample case, which can be extended straightforwardly to the two-sample case.

4.3 |. Variance estimation for measurement errors and SCC adjustment

For certain imaging types and modalities, our Assumptions (A2) and (A3) about the measurement errors may not be completely satisfied. We propose a modification to the SCC procedure in Section 3 to deal with images with relatively large measurement errors.

For the one-sample SCC, for any i = 1, …, n and j = 1, …, N, let ${\hat{ε}}_{i j} = {\hat{R}}_{i j} - {\hat{η}}_{i} (z_{j})$, and we estimate σ²(z_j) by ${\hat{σ}}^{2} (z_{j}) = n^{- 1} \sum_{i = 1}^{n} {\hat{ε}}_{i j}^{2}$. Next, denote $\hat{ε} (z) = {(n N)}^{- 1} \tilde{B} {(z)}^{T} Γ_{N, ρ}^{- 1} \sum_{i = 1}^{n} \sum_{j = 1}^{N} \tilde{B} (z_{j}) σ (z_{j}) ε_{i j}$. We estimate the covariance function of $\hat{ε} (z)$, ${\tilde{G}}_{ε} (z, z^{'}) = Cov \{\hat{ε} (z), \hat{ε} (z^{'})\}$, by

{\hat{G}}_{ε} (z, z^{'}) = n^{- 1} N^{- 2} \tilde{B} {(z)}^{T} Γ_{N, ρ}^{- 1} \left\{ \sum_{j = 1}^{N} \tilde{B} (z_{j}) {\hat{σ}}^{2} (z_{j}) \tilde{B} {(z_{j})}^{T} \right\} Γ_{N, ρ}^{- 1} \tilde{B} (z^{'}),

where Γ_N,ρ is given in (B.5) in Supporting Information.

Denote $\hat{Σ} (z, z^{'}) = {\hat{G}}_{η} (z, z^{'}) + n {\hat{G}}_{ε} (z, z^{'})$ . We adjust the approximation procedure of quantile q_1−α as follows: first, we simulate

ζ_{b} (z) = {\hat{Σ}}^{- 1 / 2} (z, z) \left\{ \sum_{k = 1}^{κ} {\hat{λ}}_{k}^{1 / 2} {\hat{ψ}}_{k} (z) Z_{k, ξ}^{(b)} + N^{- 1} \tilde{B} {(z)}^{T} Γ_{N, ρ}^{- 1} \sum_{j = 1}^{N} \tilde{B} (z_{j}) \hat{σ} (z_{j}) Z_{j, ε}^{(b)} \right\},

where $Z_{k, ξ}^{(b)}$ and $Z_{j, ε}^{(b)}$ are i.i.d standard normal variables with 1 ⩽ k ⩽ κ, 1 ⩽ j ⩽ N; next, we estimate the quantile q_1−α by the corresponding empirical quantile of the B simulated ${‖ ζ_{b} ‖}_{\infty}$ ; finally, we construct the SCC as $\hat{μ} (z) \pm n^{- 1 / 2} q_{1 - α} \hat{Σ} {(z, z)}^{1 / 2}, z \in Ω$ .

For the two-sample case, we can similarly modify the procedure by defining ${\hat{Σ}}_{H} (z, z^{'}) = {\hat{G}}_{H η} (z, z^{'}) + n_{H} {\hat{G}}_{H ε} (z, z^{'})$, for H = 1, 2, and $\hat{Ξ} (z, z^{'}) = {\hat{Σ}}_{1} (z, z^{'}) + (n_{1} / n_{2}) {\hat{Σ}}_{2} (z, z^{'})$. Let ${\hat{σ}}_{H} (z)$ be the estimator of σ_H (z), for H = 1, 2. To estimate q_12,α, we simulate

{\hat{W}}_{b} (z) = \{\hat{Ξ} (z, z)\}^{- 1 / 2} \left\{ \sum_{k = 1}^{κ_{1}} {\hat{λ}}_{1 k}^{1 / 2} Z_{1 k, ξ}^{(b)} {\hat{ψ}}_{1 k} (z) - {\left( \frac{n_{1}}{n_{2}} \right)}^{1 / 2} \sum_{k = 1}^{κ_{2}} {\hat{λ}}_{2 k}^{1 / 2} Z_{2 k, ξ}^{(b)} {\hat{ψ}}_{2 k} (z) + \tilde{B} {(z)}^{T} Γ_{N, ρ_{1}}^{- 1} \frac{1}{N} \sum_{j = 1}^{N} \tilde{B} (z_{j}) {\hat{σ}}_{1} (z_{j}) Z_{1 j, ε}^{(b)} - {\left( \frac{n_{1}}{n_{2}} \right)}^{1 / 2} \tilde{B} {(z)}^{T} Γ_{N, ρ_{2}}^{- 1} \frac{1}{N} \sum_{j = 1}^{N} \tilde{B} (z_{j}) {\hat{σ}}_{2} (z_{j}) Z_{2 j, ε}^{(b)} \right\},

where $Z_{H k, ξ}^{(b)}$ and $Z_{H j, ε}^{(b)}$ are i.i.d standard normal variables with 1 ⩽ k ⩽ κ_H, 1 ⩽ j ⩽ N for H = 1, 2. Then, q_12,α can be estimated by the empirical quantile of the B simulated ${‖ {\hat{W}}_{b} ‖}_{\infty}$ ’s, b = 1, …, B. A modified SCC for μ₁(z) − μ₂(z) can thus be constructed as $({\hat{μ}}_{1} - {\hat{μ}}_{2}) (z) \pm n_{1}^{- 1 / 2} q_{12, α} {\hat{Ξ} (z, z)}^{1 / 2}$ .

5 |. SIMULATION STUDIES

In this section, we describe two Monte Carlo simulations to examine the finite sample performance of the proposed method.

5.1 |. One sample SCC

In this simulation study, the measurements on the images are generated from the model:

Y_{i j} = μ (z_{j}) + \sum_{k = 1}^{2} λ_{k}^{1 / 2} ξ_{i k} ψ_{k} (z_{j}) + σ (z_{j}) ε_{i j}, \quad i = 1, \dots, n, \; j = 1, \dots, N,

where z_j = (z_1j, z_2j) ∈ Ω ⊂ [0,1]², and Ω is the same as the domain of the brain images shown in Section 6. To demonstrate the practical performance of our theoretical results, we consider the following four mean functions:

(quadratic) μ(z) = 20{(z₁ − 0.5)² + (z₂ − 0.5)²},
(exponential) μ(z) = 5 exp[−15{(z₁ − 0.5)² + (z₂− 0.5)²}] + 0.5,
(cubic) μ(z) = 3.2(−z₁³ + z₂³) + 2.4,
(sine) μ(z) = −10[sin{5π (z₁ + 0.22)} − sin {5π (z₂ − 0.18)}] + 2.8,

shown in the first column of Figures A1 to A6 in Supporting Information.

To simulate the within-image dependence, we generate $ξ_{i k} \overset{i.i.d.}{\sim} N (0, 1)$, for k = 1, 2. For the eigenvalues, we set λ₁ = 0.5, λ₂ = 0.2. For the eigenfunctions, we let ψ₁(z) = c₁ sin(πz₁) + c₂, ψ₂ (z) = c₃ cos(πz₂) + c₄, where c₁ = 0.988, c₂ = 0.5, c₃ = 2.157, and c₄ = −0.084 to guarantee that the eigenfunctions are orthonormal. We generate heterogeneous measurement errors with σ (z) = 0.25{1 − (z₁ − 0.5)² − (z₂ − 0.5)²}. We consider n = 50, 100, 200, and for each image, we consider two resolutions: 40 × 40 and 79 × 79, with N = 921 and 3682 pixels falling inside the domain, respectively.
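The data-generating mechanism of this simulation can be sketched as follows. As a simplifying assumption, the sketch places pixels at the cell centers of the full unit square rather than restricting them to the brain-shaped domain Ω, so the pixel count differs from the N reported above; the function names are hypothetical.

```python
import numpy as np

def quadratic_mean(z1, z2):
    """The quadratic mean function from the list of mean functions."""
    return 20.0 * ((z1 - 0.5) ** 2 + (z2 - 0.5) ** 2)

def simulate_images(n, grid_size, mean_fn, seed=0):
    """Generate n noisy images on a grid_size x grid_size grid over the unit square."""
    rng = np.random.default_rng(seed)
    g = (np.arange(grid_size) + 0.5) / grid_size           # pixel centers in (0, 1)
    z1, z2 = (a.ravel() for a in np.meshgrid(g, g, indexing="ij"))
    lam = np.array([0.5, 0.2])                              # eigenvalues
    psi = np.column_stack([0.988 * np.sin(np.pi * z1) + 0.5,
                           2.157 * np.cos(np.pi * z2) - 0.084])
    sigma = 0.25 * (1.0 - (z1 - 0.5) ** 2 - (z2 - 0.5) ** 2)
    xi = rng.standard_normal((n, 2))                        # FPC scores xi_ik
    eps = rng.standard_normal((n, z1.size))                 # measurement errors eps_ij
    eta = (xi * np.sqrt(lam)) @ psi.T                       # subject-level variation
    Y = mean_fn(z1, z2) + eta + sigma * eps
    return Y, (z1, z2)

Y, (z1, z2) = simulate_images(n=50, grid_size=40, mean_fn=quadratic_mean)
```

Restricting to an irregular domain would amount to dropping the columns of `Y` whose pixel centers fall outside Ω.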

To apply our method, we consider three different triangulations, which are also shown in the first column of Figures A1 to A6 in Supporting Information. The first triangulation (Δ₁) contains 49 triangles and 38 vertices; the second triangulation (Δ₂) contains 80 triangles and 54 vertices; and the third triangulation (Δ₃) contains 144 triangles and 87 vertices. The estimated mean functions based on these three triangulations are shown in the second columns of Figures A1 to A6, and the corresponding 99% SCCs are given in the last two columns. From these figures, one can see that all three triangulations result in almost the same estimates and SCCs. One can also see that even when the number of images is only moderately large, the estimation is very accurate regardless of the type of underlying mean function.

Table 1 and Table A1 in Supporting Information summarize the estimated coverage rates of the SCCs based on 1000 replications for N = 921 and 3682, respectively. The numbers in parentheses are the average widths of the SCCs. These two tables also confirm that there is little difference among the three triangulations and that the coverage rate is closer to the nominal confidence level for larger values of n.
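The two summaries reported in the tables can be computed as in the following sketch. The function names are hypothetical, and two conventions are assumed rather than taken from the text: a replication counts as "covered" only if the true mean lies inside the corridor at every pixel, and the width is averaged over both pixels and replications. The second helper gives the usual binomial standard error of an estimated coverage rate, assuming independent replications.

```python
import math

import numpy as np


def coverage_and_width(lower, upper, truth):
    """Empirical simultaneous coverage and average width.

    lower, upper: arrays of shape (R, N) with the corridor limits for each
    of R replications at N pixels; truth: array of shape (N,).
    """
    lower, upper, truth = (np.asarray(a, dtype=float) for a in (lower, upper, truth))
    inside = (lower <= truth) & (truth <= upper)   # pixelwise containment
    covered = inside.all(axis=1)                   # simultaneous over pixels
    return float(covered.mean()), float((upper - lower).mean())


def monte_carlo_se(p, replications):
    """Binomial standard error of a coverage estimate p."""
    return math.sqrt(p * (1.0 - p) / replications)
```

With 1000 replications, `monte_carlo_se(0.95, 1000)` is roughly 0.007, so differences of a few thousandths between neighboring entries of Table 1 are within what simulation noise alone could produce.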

TABLE 1.

Empirical coverage rates and average widths of the simultaneous confidence corridors (N = 921)

	α = 0.10			α = 0.05			α = 0.01
n	Δ₁	Δ₂	Δ₃	Δ₁	Δ₂	Δ₃	Δ₁	Δ₂	Δ₃
μ (z) = 20{(z₁ − 0.5)² + (z₂ − 0.5)²}
50	0.858 (0.651)	0.860 (0.651)	0.874 (0.659)	0.928 (0.739)	0.929 (0.739)	0.935 (0.747)	0.977 (0.908)	0.981 (0.908)	0.981 (0.916)
100	0.891 (0.473)	0.893 (0.473)	0.897 (0.474)	0.944 (0.535)	0.947 (0.535)	0.949 (0.537)	0.979 (0.657)	0.979 (0.657)	0.980 (0.659)
200	0.896 (0.335)	0.897 (0.336)	0.897 (0.337)	0.942 (0.379)	0.949 (0.380)	0.948 (0.381)	0.987 (0.465)	0.988 (0.466)	0.988 (0.467)
μ (z) = 5 exp[−15{(z₁ − 0.5)² + (z₂ − 0.5)²}] + 0.5
50	0.877 (0.664)	0.879 (0.666)	0.879 (0.667)	0.939 (0.752)	0.941 (0.754)	0.937 (0.755)	0.983 (0.921)	0.983 (0.923)	0.982 (0.924)
100	0.888 (0.473)	0.892 (0.474)	0.892 (0.474)	0.942 (0.535)	0.944 (0.536)	0.945 (0.537)	0.979 (0.657)	0.980 (0.658)	0.980 (0.659)
200	0.904 (0.341)	0.890 (0.336)	0.902 (0.342)	0.947 (0.385)	0.942 (0.381)	0.949 (0.386)	0.986 (0.470)	0.986 (0.466)	0.986 (0.472)
μ (z) = 3.2(−z₁³ + z₂³) + 2.4
50	0.876 (0.639)	0.879 (0.639)	0.880 (0.639)	0.934 (0.727)	0.937 (0.728)	0.938 (0.728)	0.980 (0.896)	0.981 (0.896)	0.981 (0.897)
100	0.870 (0.455)	0.876 (0.455)	0.884 (0.457)	0.929 (0.517)	0.935 (0.517)	0.938 (0.519)	0.979 (0.639)	0.980 (0.640)	0.980 (0.642)
200	0.890 (0.326)	0.889 (0.325)	0.906 (0.329)	0.941 (0.370)	0.942 (0.370)	0.953 (0.373)	0.984 (0.456)	0.986 (0.456)	0.985 (0.459)
μ (z) = −10[sin {5π (z₁ + 0.22)} − sin {5π (z₂ − 0.18)}] + 2.8
50	0.882 (0.734)	0.869 (0.740)	0.879 (0.754)	0.937 (0.821)	0.930 (0.828)	0.939 (0.843)	0.981 (0.989)	0.976 (0.996)	0.980 (1.011)
100	0.886 (0.522)	0.901 (0.534)	0.880 (0.536)	0.938 (0.584)	0.946 (0.596)	0.935 (0.598)	0.982 (0.705)	0.983 (0.718)	0.982 (0.721)
200	0.877 (0.370)	0.891 (0.378)	0.887 (0.384)	0.937 (0.414)	0.951 (0.423)	0.947 (0.429)	0.985 (0.499)	0.986 (0.508)	0.984 (0.514)

5.2 |. Two-sample simultaneous confidence corridor

In this simulation study, we examine the power of detecting a difference in mean images based on the proposed two-sample SCC. Two groups of images are generated from the model

Y_{H, i j} = μ_{H} (z_{j}) + \sum_{k = 1}^{κ} λ_{k}^{1 / 2} ξ_{i k} ψ_{k} (z_{j}) + σ (z_{j}) ε_{i j}, \quad H = 1, 2,

where ψ_k’s are generated as in the simulation in Section 5.1. We consider the following:

H_{0} : μ_{1} (z) = μ_{2} (z) \text{ for all } z \in Ω \quad \text{vs} \quad H_{a} : μ_{1} (z) \neq μ_{2} (z) \text{ for some } z \in Ω . \qquad (6)

The mean functions for the two groups considered here are μ₁(z) = 20{(z₁ − 0.5)² + (z₂ − 0.5)²} and $μ_{2} (z) = μ_{1} (z) + δ (- z_{1}^{3} + z_{2}^{3})$. The value of δ controls the difference between the two groups. The eigenvalues λ_k’s, eigenfunctions ψ_k’s and the measurement errors ε_ij’s are generated in the same way as in the simulation presented in Section 5.1, and we set σ(z) = 0.1.

Figure 1 and Table A2 in Supporting Information summarize the estimated probability of rejecting H₀ in (6) with nominal level α = 0.10, 0.05, and 0.01. When δ = 0, the probability should be close to the nominal level, and when δ is large, the estimated power should be close to 1. From Figure 1 and Table A2, one can see that even when the numbers of images n₁ and n₂ are only moderately large, the size of the test is very close to the nominal level. The estimated power increases quickly as n₁ and n₂ increase. The performance of the procedure is similar and consistent across the different triangulations.
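The rejection probabilities summarized here can be tallied as in the sketch below, which assumes the natural decision rule for an SCC-based test: H₀ in (6) is rejected when the corridor for μ₁ − μ₂ excludes zero at one or more pixels. The function names are hypothetical, and the corridor limits are taken as given (their construction is described in the earlier sections).

```python
import numpy as np


def rejects_null(lower, upper):
    """True if zero lies outside the corridor [lower, upper] at any pixel."""
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    return bool(np.any((lower > 0) | (upper < 0)))


def rejection_rate(lowers, uppers):
    """Fraction of replications in which H0 is rejected.

    lowers, uppers: sequences of per-replication corridor limits for the
    difference of the two mean functions.
    """
    decisions = [rejects_null(lo, up) for lo, up in zip(lowers, uppers)]
    return float(np.mean(decisions))
```

Under δ = 0 this fraction estimates the size of the test, and under δ ≠ 0 it estimates the power.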

Type I error and empirical power of two-sample tests for different α’s: (A) α = 0.10, (B) α = 0.05, and (C) α = 0.01

6 |. APPLICATIONS TO BRAIN IMAGING DATA

In this section, we apply the proposed SCCs to brain imaging data. In particular, we consider data taken from positron emission tomography (PET) studies in two different settings: one using the tracer [C¹¹]WAY100635, which has an affinity for the serotonin 1A receptor, in a study of major depressive disorder (MDD); and one using the fluorodeoxyglucose tracer [F¹⁸]FDG, a glucose analog, in a study of dementia. The imaging data are naturally three-dimensional in each case, but we focus here on one strategically selected slice in each setting. For the MDD study, we select the horizontal slice that passes through the midbrain and the amygdala, two regions implicated in MDD (Parsey et al., 2010). As pointed out by Marcus et al. (2014), within the brain, the anatomical regions that are commonly affected by Alzheimer’s disease are the bilateral superior medial frontal, anterior, middle cingulate and bilateral parietal cortices, while regions such as the bilateral medial temporal lobes are usually less affected. Therefore, for the [F¹⁸]FDG study, we focus on the 48th horizontal slice of the brain, since it passes through the frontal and parietal lobes. In each case, we consider the hypotheses in (6) for the difference between two mean functions.

For the [C¹¹]WAY100635 data, we have 40 subjects who are classified as normal controls and 26 who have been diagnosed with MDD (Parsey et al., 2006). Figure 2 displays the results of applying the proposed procedure to these data. The portions of the SCCs not containing zero can be seen in (A); the estimated mean difference between the two groups is shown in (B), and the lower and upper SCCs are shown in (C) and (D).

Simultaneous confidence corridor (SCC) for comparison between normal control (CON) and major depressive disorder (MDD): (A) coverage of zero, (B) ${\hat{μ}}_{MDD} - {\hat{μ}}_{CON}$, (C) lower SCC, and (D) upper SCC. In (A), yellow color indicates zero falls above the upper SCC and blue color indicates zero falls beneath the lower SCC
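The color coding in panel (A) amounts to a three-way classification of each pixel. A minimal sketch, with a hypothetical function name and an arbitrary integer coding (+1 for "yellow", −1 for "blue", 0 where the corridor contains zero), is:

```python
import numpy as np


def zero_coverage_map(lower, upper):
    """Classify each pixel by where zero falls relative to the corridor.

    Returns an integer array with the same shape as the inputs:
      +1  zero falls above the upper SCC    (upper < 0, "yellow")
      -1  zero falls beneath the lower SCC  (lower > 0, "blue")
       0  the corridor contains zero
    """
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    out = np.zeros(lower.shape, dtype=int)
    out[upper < 0] = 1
    out[lower > 0] = -1
    return out
```

Because lower ≤ upper at every pixel, the two conditions cannot hold at once, so the order of the two assignments does not matter.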

Next, we illustrate these procedures using the PET data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI; adni.loni.usc.edu). One of the primary goals of the ADNI study is to test whether PET and some other biological markers can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD).

We use the method proposed in Section 4.2 to choose the triangulation and spline basis functions. Among the three triangulations (Δ₁−Δ₃) considered in the simulation studies, we choose Δ₃ when estimating the mean functions, and Δ₁ when estimating the covariance functions. We use smoothness parameter r = 1 with degree d = 5 for the estimation of the mean function and d = 2 for the estimation of the η_i’s. The first row of Figure 3 displays the areas in which zero is not contained within the 95% SCC comparing each pair of diagnostic groups. This suggests that the AD group has widespread mean differences from each of the other two groups. We also stratify the data according to sex and age, and the breakdowns of these data in terms of these variables are given in Table A1 in Supporting Information. Within each stratum, we examine the SCC for the difference between all pairs of diagnostic groups, and the results are also shown in Figure 3. The large apparent differences in the full group analysis can be seen (but to a lesser extent) in the comparisons among the males and among the relatively younger population, but are less pronounced in the other subgroup analyses.
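As a rough guide to the size of the spline spaces implied by these choices, recall that a bivariate polynomial of degree d has (d + 1)(d + 2)/2 coefficients, so a piecewise polynomial over T triangles has T(d + 1)(d + 2)/2 coefficients before any smoothness conditions across shared edges are imposed. The sketch below only evaluates these counts; the helper names are hypothetical, and the dimension of the actual spline space after imposing the smoothness conditions is smaller and is not computed here.

```python
from math import comb


def coefficients_per_triangle(d):
    """Number of coefficients of a bivariate polynomial of degree d."""
    return comb(d + 2, 2)  # equals (d + 1) * (d + 2) / 2


def unconstrained_coefficients(num_triangles, d):
    """Coefficient count over a triangulation before smoothness constraints."""
    return num_triangles * coefficients_per_triangle(d)


# Triangle counts as stated in Section 5.1: 144 for the third triangulation,
# 49 for the first.
mean_count = unconstrained_coefficients(144, 5)
eta_count = unconstrained_coefficients(49, 2)
```

This gives 21 coefficients per triangle for d = 5 and 6 for d = 2, which is an upper bound on the number of free parameters in each fit.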

Coverage of zero of simultaneous confidence corridor (SCC) for pairwise comparisons among CON, mild cognitive impairment (MCI) and Alzheimer’s disease (AD). Yellow color indicates zero falls above the upper SCC and blue color indicates zero falls beneath the lower SCC

7 |. DISCUSSION

We develop SCCs for mean functions of imaging data in the functional data framework. We show that the proposed procedure has desirable statistical properties: the estimators are asymptotically efficient as if all images were observed with no error. One main advantage of our method is its computational efficiency and feasibility for large-scale imaging data. It greatly enhances the application of SCCs to imaging data in biomedical studies.

A few more issues still merit further research. For instance, the triangulation selection using the CV and wild bootstrap works well in practice, but a stronger theoretical justification for their use is still needed. In recent years, there has been a great deal of work on functional regression, and it would be interesting to extend the proposed methodology to functional regression models. The construction of SCCs in such models is a significant challenge and requires more in-depth investigation. Last but not least, it would also be interesting to develop SCCs for large-scale longitudinal imaging data, in which accounting for the dependence within the subject as well as for the longitudinal design is crucial for making inference.

Supplementary Material

biom13156-sup-0003-FDA_Image_SCC_R2_supporting_information.pdf

NIHMS1593433-supplement-biom13156-sup-0003-FDA_Image_SCC_R2_supporting_information_pdf.pdf (3.1 MB, PDF)

biom13156-sup-0001-Eg1_OneSample.R

NIHMS1593433-supplement-biom13156-sup-0001-Eg1_OneSample_R.r (2.6 KB, R)

biom13156-sup-0002-Eg2_TwoSample.R

NIHMS1593433-supplement-biom13156-sup-0002-Eg2_TwoSample_R.r (3.4 KB, R)

ACKNOWLEDGMENTS

The authors are truly grateful to the editor, the associate editor, and two reviewers for their constructive suggestions that led to significant improvement of the article. Li Wang’s research was supported in part by NSF awards DMS-1916204 and DMS-1542332. Todd Ogden’s work was partially supported by NIH grants 5 R01 EB024526 and 2 P50 MH090964. Data used in preparation of this article were obtained from the ADNI database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.

Funding information

National Institute of Biomedical Imaging and Bioengineering, Grant/Award Number: EB024526; National Institute of Mental Health, Grant/Award Number: MH090964; Division of Mathematical Sciences, Grant/Award Numbers: 1542332, 1916204

Footnotes

SUPPORTING INFORMATION

Web Appendices, Tables, and Figures referenced in Sections 1, 5, and 6 are available with this article at the Biometrics website on Wiley Online Library. The R package for the proposed method ImageSCC is available at https://github.com/funstatpackages/ImageSCC. For the implementations of R packages of Triangulation and BPST, see https://github.com/funstatpackages/Triangulation and https://github.com/funstatpackages/BPST.

REFERENCES

Adler RJ (1990) An Introduction to Continuity, Extrema, and Related Topics for General Gaussian Processes. Hayward, CA: Institute of Mathematical Statistics.
Adler RJ and Taylor JE (2007) Random Fields and Geometry. New York: Springer.
Cao G and Wang L (2018) Simultaneous inference for the mean of repeated functional data. Journal of Multivariate Analysis, 165, 279–295.
Cao G, Yang L and Todem D (2012) Simultaneous inference for the mean function based on dense functional data. Journal of Nonparametric Statistics, 24, 359–377.
Chang C, Lin X and Ogden RT (2017) Simultaneous confidence bands for functional regression models. Journal of Statistical Planning and Inference, 188, 67–81.
Choi H and Reimherr M (2018) A geometric approach to confidence regions and bands for functional parameters. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80, 239–260.
De Loera JA, Rambau J and Santos F (2010) Triangulations: Structures for Algorithms and Applications. Berlin: Springer.
Degras DA (2011) Simultaneous confidence bands for nonparametric regression with functional data. Statistica Sinica, 21, 1735–1765.
Degras DA (2017) Simultaneous confidence bands for the mean of functional data. Wiley Interdisciplinary Reviews: Computational Statistics, 9, e1397.
Forman SD, Cohen JD, Fitzgerald M, Eddy WF, Mintun MA and Noll DC (1995) Improved assessment of significant activation in functional magnetic resonance imaging (fMRI): use of a cluster-size threshold. Magnetic Resonance in Medicine, 33, 636–647.
Goldsmith J, Greven S and Crainiceanu C (2013) Corrected confidence bands for functional data using principal components. Biometrics, 69, 41–51.
Gu L, Wang L, Härdle WK and Yang L (2014) A simultaneous confidence corridor for varying coefficient regression with sparse functional data. Test, 23, 806–843.
Lai MJ and Wang L (2013) Bivariate penalized splines for regression. Statistica Sinica, 23, 1399–1417.
Li Y, Wang N and Carroll RJ (2013) Selecting the number of principal components in functional data. Journal of the American Statistical Association, 108, 1284–1294.
Marcus C, Mena E and Subramaniam RM (2014) Brain PET in the diagnosis of Alzheimer’s disease. Clinical Nuclear Medicine, 39, e413.
Parsey RV, Ogden RT, Miller JM, Tin A, Hesselgrave N and Goldstein E (2010) Higher serotonin 1A binding in a second major depression cohort: modeling and reference region considerations. Biological Psychiatry, 68, 170–178.
Parsey RV, Oquendo MA, Ogden RT, Olvet DM, Simpson N, Huang YY et al. (2006) Altered serotonin 1A binding in major depression: a [carbonyl-C-11]WAY100635 positron emission tomography study. Biological Psychiatry, 59, 106–113.
Siegmund D, Zhang N and Yakir B (2011) False discovery rate for scanning statistics. Biometrika, 98, 979–985.
Worsley KJ, Taylor JE, Tomaiuolo F and Lerch J (2004) Unified univariate and multivariate random field theory. Neuroimage, 23, S189–S195.
Yao F, Müller H-G and Wang J-L (2005) Functional data analysis for sparse longitudinal data. Journal of the American Statistical Association, 100, 577–590.
Yu S, Wang G, Wang L, Liu C and Yang L (2019) Estimation and inference for generalized geoadditive models. Journal of the American Statistical Association, 1–27.
Zheng S, Yang L and Härdle WK (2014) A smooth simultaneous confidence corridor for the mean of sparse functional data. Journal of the American Statistical Association, 109, 661–673.
Zhu H, Li R and Kong L (2012) Multivariate varying coefficient model for functional responses. The Annals of Statistics, 40, 2634–2666.


