Bayesian Wombling: Curvilinear Gradient Assessment Under Spatial Process Models

Sudipto Banerjee; Alan E Gelfand

doi:10.1198/016214506000000041

. Author manuscript; available in PMC: 2010 Mar 9.

Published in final edited form as: J Am Stat Assoc. 2006 Dec 1;101(476):1487–1501. doi: 10.1198/016214506000000041

Bayesian Wombling: Curvilinear Gradient Assessment Under Spatial Process Models

Sudipto Banerjee ¹, Alan E Gelfand ²

PMCID: PMC2835372 NIHMSID: NIHMS127622 PMID: 20221318

Abstract

Large-scale inference for random spatial surfaces over a region using spatial process models has been well studied. Under such models, local analysis of the surface (e.g., gradients at given points) has received recent attention. A more ambitious objective is to move from points to curves, to attempt to assign a meaningful gradient to a curve. For a point, if the gradient in a particular direction is large (positive or negative), then the surface is rapidly increasing or decreasing in that direction. For a curve, if the gradients in the direction orthogonal to the curve tend to be large, then the curve tracks a path through the region where the surface is rapidly changing. In the literature, learning about where the surface exhibits rapid change is called wombling, and a curve such as we have described is called a wombling boundary. Existing wombling methods have focused mostly on identifying points and then connecting these points using an ad hoc algorithm to create curvilinear wombling boundaries. Such methods are not easily incorporated into a statistical modeling setting. The contribution of this article is to formalize the notion of a curvilinear wombling boundary in a vector analytic framework using parametric curves and to develop a comprehensive statistical framework for curvilinear boundary analysis based on spatial process models for point-referenced data. For a given curve that may represent a natural feature (e.g., a mountain, a river, or a political boundary), we address the issue of testing or assessing whether it is a wombling boundary. Our approach is applicable to both spatial response surfaces and, often more appropriately, spatial residual surfaces. We illustrate our methodology with a simulation study, a weather dataset for the state of Colorado, and a species presence/absence dataset from Connecticut.

Keywords: Arc-length measure, Bayesian modeling, Directional derivative, Flux, Gaussian process, Line integral, Parametric curve, Wombling

1. INTRODUCTION

Spatially referenced datasets and their analysis arise in diverse areas of scientific interest, including geological and environmental sciences (Webster and Oliver 2001), ecological systems (Scheiner and Gurevich 2001), digital terrain cartography (Jones 1997), computer experiments (Santner, Williams, and Notz 2003), image analysis (Winkler 2003), and public health (Waller and Gotway 2004). Very often, such data will be referenced over a fixed set of locations in a region of study. These locations can be with regions or areas with well-defined neighbors (e.g., pixels in a lattice, counties in a map), in which case they are called areally-referenced or lattice data. Alternatively, they may be simply points with coordinates (e.g., latitude–longitude, easting–northing), in which case they are called point-referenced or geostatistical data. Statistical theory and methods for modeling and analyzing such data depend on these configurations. The last decade has seen enormous developments in such modeling (see, e.g., Cressie 1993; Chileú and Delfiner 1999; M¸ller 2003; Schabenberger and Gotway 2004; Banerjee, Carlin, and Gelfand 2004 for various methods and applications).

Customary inferential interest resides in estimation of model parameters, in spatial prediction, and, more generally, in assessing the estimated spatial surface over the spatial domain. Statistical estimation and interpolation of the spatial surface often help detect “features” such as cliffs, edges, and boundaries in the topography. In the fields of image analysis and pattern recognition, there has been much research on using statistical models for capturing “edge” and “line” effects. Such models are based on probability distributions (such as Gibbs distributions or Markov random fields) that model pixel intensities as conditional dependencies using the neighborhood structure (see, e.g., Chellappa and Jain 1993). Explicit modeling of the edges or pixel boundaries using probability distributions has been discussed by Geman and Geman (1984) and, more recently, by Dass and Nair (2003) with multivariate observations and missing data (see also Winkler 2003). Modeling objectives include identification of edges based on distinctly different image intensities in adjacent pixels. Another line of approach, often called image segmentation, uses a constructive approach to represent image “interfaces” that partition images into disjoint sets of pixels. This approach builds on work of Mumford and Shah (1989) using optimization of PDE-based objective functionals, unsupervised learning methods (e.g., Caillol, Wojciech, and Hillion 1997), or transforms such as the Hough transform (see, e.g., Walsh and Raftery 2002) to find interface curves. These algorithms help segment images by constructing contour lines and identifying image discontinuities using variational techniques. Morel and Solimini (1994) have provided a comprehensive review of these algorithms.

Much of the foregoing development uses the pixel-based spatial domain, where the edges are well-defined lines that separate adjacent pixels. In fact, the Markov random field models are constructed on these adjacency structures. However, in such disciplines as geology, environmental sciences, and ecology, data arise from irregular spatial locations that do not have natural neighborhoods but are referenced by their coordinates. Such data are widely modeled using spatial processes that model association as a function of intersite distance. Spatial process models assume, for a region of study D, a collection of random variables Y(s), where s indexes the points in D. The set {Y(s): s ∈ D} can be viewed as a randomly realized surface over D, which in practice is observed only at a finite set of locations in Inline graphic = {s₁, s₂, …, s_n}. Once such an interpolated surface has been obtained, investigation of rapid change on the surface may be of interest. For instance, environmental scientists are interested in ascertaining whether natural boundaries (e.g., mountains, forest edges) represent a zone of rapid change in weather, ecologists are interested in determining curves that delineate differing zones of species abundance, and public health officials want to identify changes in health care delivery across municipal boundaries, counties, or states.

The foregoing objectives require the notion of gradients and in particular assigning gradients to curves (curvilinear gradients) to identify curves that track a path through the region where the surface is rapidly changing. Such boundaries are commonly referred to as difference boundaries or wombling boundaries, after Womble (1951), who discussed their importance in understanding scientific phenomena (also see Fagan, Fortin, and Soykan 2003). As a concept, wombling is useful because it attempts to quantify spatial information in such objects as curves and paths that are not easy to model as regressors. It is similar to image analysis in that it also seeks to capture lurking “spatial effects” on curves. However, unlike images, where edges and lines represent discontinuities or breaks, wombling boundaries capture rapid surface change; cutting across a wombling boundary should tend to reveal a drop in elevation or, equivalently, a sharp gradient. Evidently, gradients are central to wombling, and the concerned spatial surfaces must be sufficiently smooth. This precludes such methods as wavelets that have been used to detect image discontinuities, such as ridges and cliffs (e.g., Csillag and Kabos 2002) but do not admit gradients.

Visual assessment of the surface over D often proceeds from contour and image plots of the surface fitted from the data using surface interpolators. Surface representation and contouring methods range from tensor product interpolators for gridded data (e.g., Cohen, Riesenfeld, and Elber 2001) to more elaborate adaptive control lattice or tessellation-based interpolators for scattered data (e.g., Akima 1996; Lee, Wolberg, and Shin 1997). Mitas and Mitasova (1999) have provided a review of several such methods available in Global Informational Systems (GIS) software (e.g., GRASS, available at http://grass.itc.it/). These methods are often fast and simple to implement and produce contour maps that reveal topographic features; however, they do not account for association and uncertainty in the data. Far from being competitive with statistical methods, they play a complementary role, creating descriptive plots from the raw data in the premodeling stage and providing visual displays of estimated response or residual surfaces in the postmodeling stage. It is worth pointing out that although contours often provide an idea of the local topography, they are not the same as wombling boundaries. Contour lines connect points with the same spatial elevation and may or may not track large gradients, so they may or may not correspond to wombling boundaries.

Existing wombling methods for point-referenced data concentrate on finding points with large gradients and attempting to connect them in an algorithmic fashion, which then defines a boundary. Such algorithms have been widely used in computational ecology, anthropology, and geography. For example, Barbujani, Jacquez, and Ligi (1990) and Barbujani, Oden, and Sokal (1989) used wombling on red blood cell markers to identify genetic boundaries in Eurasian human populations by different processes restricting gene flow; Bocquet-Appel and Bacro (1994) investigated genetic, morphometric, and physiologic boundaries; Fortin (1994, 1997) delineated boundaries related to specific vegetation zones; Fortin and Drapeau (1995) applied wombling on real environmental data; and Jacquez and Greiling (2003) estimated boundaries for breast, lung, and colorectal cancer rates in males and females in Nassau, Suffolk, and Queens Counties in New York. This last application is somewhat different from the others in that the data were areally referenced with counties. Unlike image pixels, these counties are not regularly spaced but still have a well-defined neighborhood structure (a topological graph), allowing direct application of the image analysis methods. The gradient is not explicitly modeled; boundary effects are considered edge effects and modeled using Markov random field specifications. A Bayesian framework for areal boundary analysis has been provided by Lu and Carlin (2006).

Our interest here lies in point-referenced curvilinear gradient or boundary analysis, a conceptually more difficult problem due to the lack of definitive candidate boundaries. Spatial process models help estimate not only response surfaces, but also residual surfaces after covariate and systematic trends have been accounted. Depending on the scientific application, boundary analysis may be desirable on either. Current methods treat statistical estimates of the surface as “data” and apply interpolation-based wombling to obtain boundaries. Although such methods produce useful descriptive surface plots, they preclude formal statistical inference. Indeed, boundary assessment using such reconstructed surfaces will suffer from inaccurate estimation of uncertainty. One might also contemplate predicting the surface on an overlaid pixel-based domain and applying edge detection methods for image analysis. Again, the uncertainty in spatial prediction will be difficult to account for, the optimal resolutions at which to predict become an issue, and implementation for spatial residual surfaces will be problematic.

Recent developments in GIS and related software have made it possible to extract points along natural boundaries and thus to achieve polygonal representations of curves. We then seek to test whether or not such a given curve is a wombling boundary. Therefore, from an implementation standpoint, we require a well-defined curve on which to construct our tests. Typically, these curves are supplied by the scientists; they may arise as features of interest, such as mountainous paths in weather analysis or forest edges in species abundance studies. In the absence of scientific candidates, some preliminary topographic analyses using simple surface interpolation may also suggest such curves. Once these curves are chosen, curvilinear gradient analysis can be performed on statistically estimated spatial surfaces to ascertain whether these features contain significant spatial information (in terms of local surface topography). Note that existing wombling algorithms construct curves algorithmically and do not accommodate evaluation of gradients along curves obtained externally from GIS or other software. Thus they do not use gradient information to address either question. We offer a statistical framework that incorporates inference on curvilinear boundaries within the diverse class of hierarchical spatial models for point-referenced data.

In the context of point-referenced data, we consider directional derivatives of spatial processes and subsequently use formal vector analysis to develop a wombling measure that evaluates the average gradient along a curve. This is done in a continuous sense in the direction normal to the curve, and leads to a formal definition of whether or not the curve is a wombling boundary. Line integrals emerge, and, using Gaussian spatial processes, we provide distributional details for such integrals. We also show how to implement full inference in a computationally feasible manner. This is offered in a Bayesian framework (e.g., Gelman, Carlin, Stern, and Rubin 2004) for the following reasons. First, it allows exact inference, avoiding possibly inappropriate asymptotics associated with likelihood methods (Stein 1999). Second, it supplies an entire posterior distribution for arbitrary functions of these gradients, such as those introduced in Section 3. A particularly attractive feature of our approach is that all desired gradient analysis can be done after the model for the surface is fitted. Assuming that Markov chain Monte Carlo (MCMC) methods (Gelman et al. 2004) are used to fit the model, we need to retain only the posterior samples of the model parameters. This greatly enhances the applicability of our proposed methods.

The article is organized as follows. Section 2 provides a brief overview of directional derivative processes. Section 3 formalizes curvilinear gradients for spatial processes with a vector analytic treatment. Section 4 discusses the relevant distribution theory for implementing relevant statistical inference for curvilinear gradients. Section 5 provides a simulated illustration that helps validate our methods with topographic maps and provides a comparison with the purely algorithmic methods. Section 6 follows by illustrations in spatial regression contexts, first with a weather dataset supplied by the National Center for Atmospheric Research (NCAR) in Boulder, Colorado and then with a binary regression model for species presence/absence from Connecticut. Section 7 concludes the article with a summary and discussion.

2. SPATIAL GRADIENTS AND DIRECTIONAL DERIVATIVE PROCESSES

As mentioned earlier, the concept of rapid change in the spatial surface is central to wombling and is mathematically formalized using spatial gradients. Adler (1981) and Mardia, Kent, Goodall, and Little (1996) discussed derivatives (more generally, linear functionals) of Gaussian processes, and Banerjee, Gelfand, and Sirmans (2003) laid out an inferential framework for directional gradients on a spatial surface. The latter methodology can be used for statistical estimation (in the form of a posterior distribution) of nonlinear functionals, such as the value of the maximum slope at any point and the direction of this maximum slope. This can be done over the entire spatial domain, irrespective of the configuration of the sampling locations. We seek to extend framework this to gradient processes along a curve, but first we provide a brief review of the statistical theory and inferential framework for gradients and directional derivative processes that arise from spatial processes.

Let Y(s) be a univariate stationary random field. (Stationarity is not required; we use it only to ensure smoothness of realizations and to simplify forms for the induced covariance function.) The process {Y(s): s ∈ ℜ^d} is L₂ (or mean squared) continuous at s₀ if lim_s _→ _s_₀ E(|Y(s) − Y(s₀)|)² = 0. The notion of a mean squared differentiable process can be formalized using the analogous definition of total differentiability of a function in ℜ^d in a nonstochastic setting (see, e.g., Banerjee et al. 2003). In particular, Y(s) is mean-squared differentiable at s₀ if it admits a first-order linear expansion for any scalar h and any unit vector u ∈ ℜ^d,

Y (s_{0} + h u) = Y (s_{0}) + h 〈 \nabla Y (s_{0}), u 〉 + o (h),

(1)

in the L₂ sense as h → 0, where ∇Y(s₀) is a d × 1 vector called the gradient vector and 〈·, ·〉 is the usual Euclidean inner product on ℜ^d. In other words, for any unit vector u, we require

lim_{h \to 0} E {(\frac{Y (s_{0} + h u) - Y (s_{0})}{h} - 〈 \nabla Y (s_{0}), u 〉)}^{2} = 0.

(1′)

A directional gradient process on ℜ^d is defined as the following L₂ limit:

D_{u} Y (s) = lim_{h \to 0} \frac{Y (s + h u) - Y (s)}{h} .

It immediately follows from (1′) that D_uY(s) = 〈∇Y(s), u〉, where the equality is again in the L₂ sense. Note that there is no loss of generality in choosing u as a unit vector. Any vector v can be written as ||v||u, where u is a unit vector collinear to v, so the foregoing linearity implies that D_vY(s) = ||v||D_uY(s) = 〈∇Y(s), v〉. The gradient, ∇Y(s), is invariant over the choice of the direction vector, and we could consistently define

D_{v} Y (s) = lim_{h \to 0} \frac{Y (s + h v) - Y (s)}{h}

for any vector v. In fact, the physical interpretation of this directional gradient depends on the choice of v. When v is a unit vector, the directional derivative evaluated at s₀ is the slope at s₀ of the curve traced out by slicing the surface Y in the direction v. This geometric interpretation is desirable for spatial gradients; in contrast, derivatives arising in physical experiments may be better interpreted with nonunit vectors. For example, in heat dynamic experiments, if Y(s) is taken as a spatial process that models temperature, then v is the velocity vector of the molecules in the domain at that point.

More generally, collecting a set of p directions in ℜ^d into the d × p matrix, U = [u₁, …, u_p], we can write D_UY(s) as the p × 1 vector, D_UY(s) = (D_u_₁ Y(s), …, Du_p Y(s))^T, so that D_UY(s) = U^T∇Y(s). In particular, setting p = d and taking U as the d × d identity matrix (i.e., taking the canonical basis, {e₁, …, e_d}, as our directions), we have D_IY(s) = ∇Y(s), which gives a representation of ∇Y(s) in terms of the partial derivatives of the components of Y(s). Explicitly,

\nabla Y (s) = {(\frac{\partial Y (s)}{\partial s_{1}}, \dots, \frac{\partial Y (s)}{\partial s_{d}})}^{T},

where $s = \sum_{i = 1}^{d} s_{i} e_{i}$ , so the s_i’s are the coordinates of s with respect to the canonical basis and D_UY(s) = U^TD_IY(s). Thus the derivative process in a set of arbitrary directions is a linear transformation of the partial derivatives in the canonical directions, and all inference about the directional derivatives can be built from this relationship.

An inferential framework for directional gradients is developed by letting Y(s) be a univariate stationary random field, GP(μ(s, β), K(·; φ)), on ℜ^d, where θ = (β, φ) is a set of possibly unknown model parameters to be estimated from the data. We will often suppress the parameters for notational convenience, writing μ(s) and K(·) instead. Suppose that we have observed the random field at locations, say Y = (Y(s₁), …, Y(s_n))^T, and let s₀ be a point at which we seek to investigate directional gradients. Based on our earlier discussion, it suffices to consider the predictive distribution P(∇Y(s₀)|Y), where ∇Y(s₀) is the d × 1 gradient vector. The joint process (Y(s₀), ∇Y(s₀)) has a valid stationary cross-covariance matrix function (see, e.g., Banerjee et al. 2003),

\begin{array}{l} (\begin{matrix} cov (Y (s_{0}), Y (s_{0} + δ)) & cov (Y (s_{0}), \nabla Y (s_{0} + δ)) \\ cov (\nabla Y (s_{0}), Y (s_{0} + δ)) & cov (\nabla Y (s_{0}), \nabla Y (s_{0} + δ)) \end{matrix}) \\ = (\begin{matrix} K (δ) & - {(\nabla K (δ))}^{T} \\ \nabla K (δ) & - H_{K} (δ) \end{matrix}), \end{array}

where H_K(δ) = ((∂²K(δ)/∂ δ_i ∂ δ_j)) is the d × d Hessian matrix of K(δ). The cross-covariance matrix induces a valid joint distribution P(Y, ∇Y(s₀)| θ), allowing predictive inference not only for the gradient at arbitrary points, but also for functions thereof, including the direction of the maximal gradient (∇Y(s₀)/|| ∇Y(s₀)||) and the size of the maximal gradient (||∇Y(s₀)||).

Simplifications arise when the mean surface, μ(s), admits a gradient ∇μ(s). Let μ = (μ(s₁), …, μ(s_n)), let ∑_Y denote the n × n dispersion matrix for the data Y, and let γ^T = (∇K(δ₀₁), …, ∇K(δ₀_n)) be the d × n matrix with δ₀_j = s₀ − s_j. Then P(Y, ∇Y(s₀)|θ) is distributed as the (d + n)-dimensional normal distribution

N_{d + n} ((\begin{matrix} μ \\ \nabla μ (s_{0}) \end{matrix}), (\begin{matrix} \sum_{Y} & γ \\ γ^{T} & - H_{K} (0) \end{matrix})),

(2)

and the conditional predictive distribution for the gradient, P(∇Y(s₀)|Y, θ), is the d-dimensional normal distribution

N_{d} (\nabla μ (s_{0}) - γ^{T} \sum_{Y}^{- 1} (Y - μ), - H_{K} (0) - γ^{T} \sum_{Y}^{- 1} γ) .

(3)

The posterior predictive distribution P(∇Y(s₀)|Y) is obtained as ∫ P(∇Y(s₀)|Y, θ)P(θ|Y) dθ. Simulation from this distribution is by composition; for each θ_l obtained from P(θ|Y), we draw ∇Y⁽^l⁾(s₀) from (3). As long as ∇μ(s₀) is computable, obtaining samples from the foregoing distribution is routine. In practice, we could have μ(s, β) = μ, a constant, in which case ∇μ(s) = 0. More generally, we would have μ(s, β) = f(s)^T β, where f(s) may be a vector of spatial covariates. If f(s)^T β describes a trend surface, then explicit calculation of ∇μ(s) will be possible. For a continuous covariate such as elevation, we can interpolate a surface and approximate ∇μ(s) at any location s. Note that it might suffice to consider gradients of a residual (or intercept) spatial process, say W(s), where Y(s) = μ(s; β) + W(s) + ε(s), where W(s) ~ GP(0, K(·; φ)) is a mean-0 Gaussian process and ε(s) ~ N(0, τ²) is a white noise process, often called a nugget. Inference will then proceed from the posterior distribution of P(∇ W(s₀)|Y).

Based on the foregoing distribution theory, formal statistical inference on gradients can be performed. For instance, given any direction u and any location s₀, a statistically “significant” directional gradient would mean that a 95% posterior credible interval for D_uW(s₀) would not include 0. Because D₋_uW(s₀) = −D_uW(s₀), inference for u is the same as that for −u. In addition, assessing significance of the spatial residual process W(s) is more general than for the parent process Y(s). Indeed, when ∇ μ(s₀) exists and there is no nugget (τ² = 0), then the former is equivalent to testing the significance of D_uY(s₀) as a departure from the trend surface gradient D_uμ(s₀) (the null value). But even when ∇ μ(s₀) is inaccessible, or there is a nugget τ² > 0, assessment of spatial gradients for W(s₀) is still legitimate.

3. WOMBLING MEASURE ASSOCIATED WITH A CURVE

We now extend the theory of the previous section to an inferential framework for curvilinear boundaries. The conceptual challenge in moving from points to curves is the formulation of a measure to associate with a curve to assess whether it can be declared a wombling boundary. In this regard, we can consider open or closed curves. In Section 3.1 we formally develop the notion of an average gradient to associate with an open curve. In Section 3.2 we turn to closed curves, the notion of flux, and special results for this case. In applications such curves might be proposed as, for instance, topographic or legislated boundaries or perhaps as level curves arising from a contouring routine. Finally, in Section 3.3 we offer a formal definition of a wombling boundary.

We use differential geometric notions for parametric boundaries as developed by, for example, Rudin (1976) or Frankel (2003). Because most spatial modeling is done on domains in ℜ², we restrict our attention to that case, focusing on a real-valued process Y(s) with the spatial domain as an open subset of ℜ². Thus we offer an independent development of gradients along planar curves without resorting to geometry on manifolds. For hypercurves in general ℜ^d, the theory is more complicated (especially if d > 3) and must involve the development of calculus on abstract manifolds.

3.1 Open Curves

Let C be an open curve in ℜ², and suppose that we want to ascertain whether such a curve is a wombling boundary with regard to Y(s). To do so, we seek to associate an average gradient with C. In particular, for each point s lying on C, we let D_n₍_s₎Y(s) be the directional derivative in the direction of the unit normal n(s). (Again, the rationale for the choice of direction normal to the curve is that for a curve tracking rapid change in the spatial surface, lines orthogonal to the curve should reveal sharp gradients.) We can define the wombling measure of the curve either as the total gradient along C,

\int_{C} D_{n (s)} Y (s) d ν = \int_{C} 〈 \nabla Y (s), n (s) 〉 d ν,

(4)

or perhaps as the average gradient along C,

\frac{1}{ν (C)} \int_{C} D_{n (s)} Y (s) d ν = \frac{1}{ν (C)} \int_{C} 〈 \nabla Y (s), n (s) 〉 d ν,

(4′)

where ν(·) is an appropriate measure. For (4) and (4′), ambiguity arises with respect to the choice of measure. For example, ν(C) = 0 if we take ν as two-dimensional Lebesgue measure, and indeed this is true for any ν that is mutually absolutely continuous with respect to Lebesgue measure. On reflection, an appropriate choice for ν turns out to be arc-length. This can be made clear by a parametric treatment of the curve C.

In particular, a curve C in ℜ² is a set parameterized by a single parameter t ∈ ℜ¹, where C = {s(t): t ∈ T}, with T ⊂ ℜ¹. We call s(t) = (s₍₁₎(t), s₍₂₎(t)) ∈ ℜ² the position vector of the curve; s(t) traces out C as t spans its domain. Then, assuming a differentiable curve with nonvanishing derivative s′(t) ≠ = 0 (such a curve is often called regular), we obtain the (componentwise) derivative s′(t) as the “velocity” vector, with unit velocity (or tangent) vector s′(t)/||s′(t)||. Letting n(s(t)) denote the parameterized unit normal vector to C, again if C is sufficiently smooth, then 〈 s′(t), n(s(t)) 〉 = 0, a.e. Inline graphic . In ℜ², we see that

n (s (t)) = \frac{(s_{(2)}^{'} (t), - s_{(1)}^{'} (t))}{∥ s^{'} (t) ∥} .

(5)

Under the foregoing parameterization (and the regularity assumption), the arc-length measure ν can be defined as

ν (T) = \int_{T} ∥ s^{'} (t) ∥ d t .

(6)

In fact, ||s′(t)|| is analogous to the “speed” (the norm of the velocity) at “time” t, so the foregoing integral is interpretable as the distance traversed or, equivalently, the arc-length ν (C) or ν ( Inline graphic ). In particular, if is an interval, say [t₀, t₁], then we can write

ν (T) = ν_{t_{0}} (t_{1}) = \int_{t_{0}}^{t_{1}} ∥ s^{'} (t) ∥ d t .

Thus we have dν_t_₀(t) = ||s′(t)|| dt, and, taking ν as the arc-length measure for C, we have the wombling measures in (4) (total gradient) and (4′) (average gradient) as

\begin{array}{l} Γ_{Y (s)} (T) = \int_{C} 〈 \nabla Y (s), n (s) 〉 d ν \\ = \int_{T} 〈 \nabla Y (s (t)), n (s (t)) 〉 ∥ s^{'} (t) ∥ d t and \\ {\bar{Γ}}_{Y (s)} (T) = \frac{1}{ν (T)} Γ_{Y (s)} (T) . \end{array}

(7)

This result is important because we want to take ν as the arc-length measure, but it will be easier to use the parametric representation and work in t space. Moreover, it is a consequence of the implicit mapping theorem in mathematical analysis (see, e.g., Rudin 1976) that any other parameterization s^*(t) of the curve C is related to s(t) through a differentiable mapping g such that s^*(t) = s(g(t)). This immediately implies [using (7)] that our proposed wombling measure is invariant to the parameterization of C and, as desired, is a feature of the curve itself.

For some simple curves, the wombling measure can be evaluated quite easily. For instance, when C is a segment of length 1 of the straight line through the point s₀ in the direction u = (u₍₁₎, u₍₂₎), we have C = {s₀ + tu: t ∈ [0, 1]}. Under this parameterization, s′(t)^T = (u₍₁₎, u₍₂₎), ||s′(t)|| = 1, and ν_t_₀ (t) = t. Clearly, n(s(t)) = (u₍₂₎, −u₍₁₎) (independent of t), which we write as u^⊥, the normal direction to u. Therefore, Γ_Y₍_s₎( Inline graphic ) in (7) becomes

\int_{0}^{1} 〈 \nabla Y (s (t)), n (s (t)) 〉 d t = \int_{0}^{1} D_{u^{⊥}} Y (s (t)) d t .

Another example is when C is the arc of a circle with radius r. For example, suppose that C is traced out by s(t) = (r cos t, r sin t) as t ∈ [0, π/4]. Then, because ||s′(t)|| = r, the average gradient is more easily computed as

\begin{array}{l} \frac{1}{ν (C)} \int_{0}^{π / 4} 〈 \nabla Y (s (t)), n (s (t)) 〉 r d t \\ = \frac{4}{r π} \int_{0}^{π / 4} 〈 \nabla Y (s (t)), n (s (t)) 〉 r d t \\ = \frac{4}{π} \int_{0}^{π / 4} 〈 \nabla Y (s (t)), n (s (t)) 〉 d t . \end{array}

In either case, n(s(t)) is given by (5).

Note that whereas the normal component, D_n₍_s₎Y(s), seems to be more appropriate for assessing whether a curve provides a wombling boundary, we may also consider the tangential direction, u(t) = s′(t)/||s′(t)||, along a curve C. In this case the average gradient will be given by

\begin{array}{l} \frac{1}{ν (C)} \int_{C} 〈 \nabla Y (s (t)), u (t) 〉 ∥ s^{'} (t) ∥ d t \\ = \frac{1}{ν (C)} \int_{C} 〈 \nabla Y (s (t)), s^{'} (t) 〉 d t . \end{array}

In fact, we have

\begin{array}{l} \int_{C} 〈 \nabla Y (s (t)), s^{'} (t) 〉 d t = \int_{t_{0}}^{t_{1}} 〈 \nabla Y (s (t)), s^{'} (t) 〉 d t \\ = \int_{s_{0}}^{s_{1}} 〈 \nabla Y (s), d s 〉 = Y (s_{1}) - Y (s_{0}), \end{array}

where s₁ = s(t₁) and s₀ = s(t₀) are the endpoints of C. That is, the average directional gradient in the tangential direction is independent of the path C, depending only on the endpoints of the curve C.

3.2 Closed Curves

A related concept is that of the “flux” of a region bounded by C (e.g., C might be the boundary of a county, census unit, or school district). Then C is a closed curve, and the integral over a closed curve is denoted by ∮_C. The average gradient in the normal direction to the curve C is denoted by

\begin{array}{l} \frac{1}{ν (C)} \oint_{C} 〈 \nabla Y (s), n (s) 〉 d s \\ = \frac{1}{ν (C)} \oint_{C} 〈 \nabla Y (s (t)), n (s (t)) 〉 ∥ s^{'} (t) ∥ d t . \end{array}

If we assume that the surface Y(s) is twice mean-squared differentiable, then the closed-line integral can be written as a double integral over the domain of s, and no explicit parameterization by t is required. In fact, let F(s) = ∇Y(s) = (F₁(s), F₂(s))^T, so that F₁(s) = ∂Y/∂s₁ and F₂(s) = ∂Y/s₂. Then

\begin{array}{l} \oint_{C} 〈 \nabla Y (s), n (s) 〉 d s \\ = \oint_{C} 〈 F (s), n (s) 〉 d s \\ = \oint_{C} 〈 F (s (t)), n (s (t)) 〉 ∥ s^{'} (t) ∥ d t \\ = \oint_{C} (F_{1} \frac{s_{2}^{'} (t)}{∥ s^{'} (t) ∥} - F_{2} \frac{s_{1}^{'} (t)}{∥ s^{'} (t) ∥}) ∥ s^{'} (t) ∥ d t \\ = \oint_{C} F_{1} {d s}_{2} - F_{2} {d s}_{1} \\ = \int \int_{D} (\frac{\partial F_{1}}{\partial s_{1}} + \frac{\partial F_{2}}{\partial s_{2}}) {d s}_{1} {d s}_{2}, \end{array}

where the last step follows from Green’s theorem in the plane (see the App. for a simple proof). Noting that F(s) = ∇Y(s), we rewrite the foregoing in terms of average directional gradients as

\frac{1}{ν (C)} \oint_{C} 〈 \nabla Y (s), n (s) 〉 d s = \frac{1}{ν (C)} \int \int_{D} 〈 \nabla^{2} Y (s), e_{1} + e_{2} 〉 d s,

(8)

where e₁ = (1, 0) and e₂ = (0, 1). This result has computational advantages because the right-side integral can be computed by sampling within the region, which in general is simpler than sampling along a curve. However, if we work exclusively with line segments as described in Sections 4 and 5, then the computational advantage to (8) disappears. Finally, from path independence, it follows that the total gradient in the tangential direction along a closed boundary is 0 because s(t₀) = s(t₁) = s₀, and so

\begin{array}{l} \oint_{C} 〈 \nabla Y (s (t)), s^{'} (t) 〉 d t = \int_{t_{0}}^{t_{1}} 〈 \nabla Y (s (t)), s^{'} (t) 〉 d t \\ = Y (s_{0}) - Y (s_{0}) = 0. \end{array}

Thus, for a closed curve, with respect to total or average, gradient it does not make sense to consider the tangential direction.

3.3 Wombling Boundary

With the foregoing formulation in place, we now offer a formal definition of a curvilinear wombling boundary.

Definition

A curvilinear wombling boundary is a curve C that reveals a large wombling measure, Γ_Y₍_s₎( Inline graphic ) or Γ̄_Y₍_s₎() [as given in (7)] in the direction normal to the curve.

If the surface were fixed, then we would have to set a threshold to determine what “large” (say, in absolute value) means. Because the surface is a random realization, Γ_Y₍_s₎( Inline graphic ) and Γ̄_Y₍_s₎() are random. Hence we declare a curve to be a wombling boundary if, say, a 95% credible set for Γ̄_Y₍_s₎() does not contain 0. It is worth pointing out that although one normal direction [as defined in (5)] is used in (7), −n(s(t)) also would have been a valid choice. Because D₋_n₍_s₍_t₎₎Y(s(t)) = −D_n₍_s₍_t₎₎Y(s(t)), we note that the wombling measure with respect to one is simply the negative of the other. Thus, in the foregoing definition, large positive as well as large negative values of the integral in (7) would signify a wombling boundary. Being a local concept, an uphill gradient is equivalent to a downhill gradient across a curve, as are the fluxes radiating outward or inward for a closed region.

We also point out that, being a continuous average (or sum) of the directional gradients, the wombling measure may “cancel” the overall gradient effect. For instance, imagine a curve C that exhibits a large positive gradient in the n(s) direction for the first half of its length and a large negative gradient for the second half, thereby canceling the total or average gradient effect. A potential remedy is to redefine the wombling measure using absolute gradients, |D_n₍_s₎Y(s)|, in (4) and (4′). The corresponding development does not entail any substantially new ideas, but would destroy the attractive distribution theory in Section 4 and make the computation less tractable. In particular, it would make calibration of the resulting measure with regard to significance much more difficult. (For example, how do we select a threshold?) Moreover, in practice a descriptive contour representation is usually available in which sharp gradients will usually reflect themselves, and one could instead compute the wombling measure for appropriate subcurves of C. Although somewhat subjective, identifying such subcurves is usually unambiguous and leads to robust scientific inference. More fundamentally, in certain applications a signed measure may actually be desirable; one might want to classify a curve as a wombling boundary if it reflects either an overall “large positive” or a “large negative” gradient effect across it. For these reasons, we confine ourselves to working with D_n₍_s₎Y(s) and turn to the distribution theory for the wombling measure in the next section.

4. DISTRIBUTION THEORY

Curvilinear wombling amounts to performing predictive inference for a line integral parameterized over Inline graphic . Let us suppose that is an interval, [0, T], which generates the curve C = {s(t): t ∈ [0, T]}. For any t^* ∈ [0, T], let ν(t^*) denote the arc length of the associated curve C_t_^*. The line integrals for total gradient and average gradient along C_t_^* are given by Γ_Y₍_s₎(t^*) and Γ̄_Y₍_s₎(t^*) as

\begin{array}{l} Γ_{Y (s)} (t^{*}) = \int_{0}^{t^{*}} D_{n (s (t))} Y (s (t)) ∥ s^{'} (t) ∥ d t and \\ {\bar{Γ}}_{Y (s)} (t^{*}) = \frac{1}{ν (t^{*})} Γ_{Y (s)} (t^{*}) . \end{array}

(9)

We seek to infer about Γ_Y₍_s₎(t^*) based on data Y = (Y(s₁), …, Y(s_n)). Because D_n₍_s₍_t₎₎Y(s(t)) = 〈∇Y(s(t)), n(s(t))〉 is a Gaussian process (from Sec. 2), Γ_Y₍_s₎(t^*) is a Gaussian process on [0, T], equivalently on the curve C. Note that although D_n₍_s₎Y(s) is a process on ℜ^d, our parameterization of the coordinates by t ∈ Inline graphic ⊆ ℜ¹ induces a valid process on . In fact, Γ_Y₍_s₎(t^*) is GP(μ_Γ_Y₍_s₎(t^*), K_Γ_Y₍_s₎(·, ·)), where

{μ_{Γ}}_{Y (s)} (t^{*}) = \int_{0}^{t^{*}} D_{n (s (t))} μ (s (t)) ∥ s^{'} (t) ∥ t

and

\begin{array}{l} K_{Γ_{Y (s)}} (t_{1}^{*}, t_{2}^{*}) = \int_{0}^{t_{1}^{*}} \int_{0}^{t_{2}^{*}} n^{T} (s (t_{1})) H_{K} (Δ (t_{1}, t_{2})) n (s (t_{2})) \\ \times ∥ s^{'} (t_{1}) ∥ ∥ s^{'} (t_{2}) ∥ {d t}_{1} {d t}_{2}, \end{array}

with Δ(t₁, t₂) = s(t₂) − s(t₁). In particular,

\begin{array}{l} var (Γ_{Y (s)} (t^{*})) = K_{Γ_{Y (s)}} (t^{*}, t^{*}) \\ = \int_{0}^{t^{*}} \int_{0}^{t^{*}} n^{T} (s (t_{1})) H_{K} (Δ (t_{1}, t_{2})) n (s (t_{2})) \\ \times ∥ s^{'} (t_{1}) ∥ ∥ s^{'} (t_{2}) ∥ d t_{1} d t_{2} . \end{array}

Evidently, Γ_Y₍_s₎(t^*) is mean squared continuous. But from the foregoing, note that even if Y(s) is a stationary process, Γ_Y₍_s₎(t^*) is not. For any s_j in the domain of Y,

\begin{array}{l} cov (Γ_{Y (s)} (t^{*}), Y (s_{j})) \\ = \int_{0}^{t^{*}} cov (D_{n (s (t))} Y (s (t)), Y (s_{j})) ∥ s^{'} (t) ∥ d t \\ = \int_{0}^{t^{*}} D_{n (s (t))} K (Δ_{j} (t)) ∥ s^{'} (t) ∥ d t, \end{array}

(10)

where Δ_j(t) = s(t) − s_j. Based on data Y = (Y(s₁), …, Y(s_n)), we seek the predictive distribution P(Γ_Y₍_s₎(t^*)|Y), but note that Y(s) and Γ_Y₍_s₎(t^*) are processes on different domains; the former is over a connected region in ℜ², while the latter is on a parameterized curve, s(t), indexed by Inline graphic . Nevertheless, Γ_Y₍_s₎(t^*) is derived from Y(s), and we have a valid joint distribution (Y, Γ_Y₍_s₎(t^*)) for any t^* ∈ , given by

N_{n + 1} ((\begin{matrix} μ \\ μ_{Γ_{Y (s)}} (t^{*}) \end{matrix}), (\begin{matrix} \sum_{Y} & γ_{Γ, Y} (t^{*}) \\ γ_{Γ, Y}^{T} (t^{*}) & K_{Γ} (t^{*}, t^{*}) \end{matrix})) .

(11)

Here μ = (μ(s₁), …, μ(s_n)) and

γ_{Γ, Y}^{T} (t^{*}) = (cov (Y (s_{1}), Γ_{Y (s)} (t^{*})), \dots, cov (Y (s_{n}), Γ_{Y (s)} (t^{*}))),

each component being evaluated from (10).

Suppose that we have regression parameters β in μ(s; β) and variance–covariance parameters η in K(·; η). For now, we assume that μ(s; β) is a smooth function in s [as would be needed to do prediction for Y(s)]. Using MCMC, these model parameters, θ = (β, η), are available to us as samples, {θ_l}, from their posterior distribution P(θ|Y). Therefore, P(Γ_Y₍_s₎(t^*)|Y) = ∫ P(Γ_Y₍_s₎(t^*)|Y, θ)P(θ|Y) dθ will be obtained by sampling, for each θ_l, $Γ_{Y (s)}^{l} (t^{*})$ from P(Γ_Y₍_s₎ (t^*)|Y, θ_l, which, using (11), is normally distributed as

\begin{array}{l} N (μ_{Γ_{Y (s)}} (t^{*}; β_{l}) - γ_{Γ, Y}^{T} (t^{*}; η_{l}) \sum_{Y}^{- 1} (η_{l}) (Y - μ), \\ K_{Γ_{Y (s)}} (t^{*}, t^{*}; η_{l}) - γ_{Γ, Y} {(t^{*}; η_{l})}^{T} \sum_{Y}^{- 1} (η_{l}) γ_{Γ, Y} (t^{*}; η_{l})) . \end{array}

(12)

In particular, for rectilinear wombling (i.e., where C_t_^* = {s₀ + tu: t ∈ [0, t^*]} is a line segment of length t^* joining s₀ and s₁ = s₀ + t^*u), we have seen after (7) that Γ_Y₍_s₎(t^*) equals $\int_{0}^{t^{*}} D_{u^{⊥}} Y (s (t)) d t$ . Thus, defining Δ₀_j = s₀ − s_j, we have

\begin{array}{l} μ_{Γ_{Y (s)}} (t^{*}; β) = \int_{0}^{t^{*}} 〈 u^{⊥}, \nabla μ (s (t); β) 〉 d t; \\ {(γ_{Γ, Y} (t^{*}; η))}_{j} = \int_{0}^{t^{*}} D_{u^{⊥}} K (Δ_{0 j} + t u; η) d t, j = 1, \dots, n; \end{array}

and

K_{Γ_{Y (s)}} (t^{*}, t^{*}; η) = \int_{0}^{t^{*}} \int_{0}^{t^{*}} - {(u^{⊥})}^{T} H_{K (η)} (Δ (t_{1}, t_{2})) u^{⊥} d t_{1} d t_{2} .

These integrals need to be computed for each θ_l = (β_l, η_l). Although they may not be analytically tractable [depending on our choice of μ(·) and K(·)], they are one- or two-dimensional integrals that can be efficiently computed using quadrature. Furthermore, because the θ_l’s will already be available, the quadrature calculations (for each θ_l) can be performed ahead of the predictive inference, perhaps using a separate quadrature program, and the output stored in a file for use in the predictive program. The only needed inputs are s₀, u, and the value of t^*. For a specified line segment, we will know these. In fact, for a general curve C, its representation on a given map is as a polygonal curve. As a result, the total or average gradient for C can be obtained through the Γ_Y₍_s₎(t^*) associated with the line segments that compose C.

Specifically, using GIS software we can easily extract (at high resolution) the coordinates along the boundary, thus approximating C by line segments connecting adjacent points. Thus $C = \cup_{k = 1}^{M} C_{k}$ , where the C_k’s are virtually disjoint (only one common point at the “join”) line segments and $ν (C) = \sum_{k = 1}^{M} ν (C_{k})$ . If we parameterize each line segment as before and compute the line integral along each C_k by the foregoing steps, then the total gradient is the sum of the piecewise line integrals. To be precise, if $Γ_{k} (s (t_{k}^{*}))$ is the line-integral process on the linear segment C_k, then we will obtain predictive samples, $Γ_{k}^{(l)} (s (t_{k}^{*}))$ , from each $P (Γ_{k} (s (t_{k}^{*})) ∣ Y)$ , k = 1, …, M. Inference on the average gradient along C will stem from posterior samples of

\frac{1}{\sum_{k = 1}^{M} ν (C_{k})} \sum_{k = 1}^{M} Γ_{k}^{(l)} (s (t_{k}^{*})) .

Thus, with regard to boundary analysis, wombling measure reduces a curve to an average gradient and inference to examination of the posterior of the average gradient.

For the flux of a closed region bounded by the curve C, the foregoing framework will still be applicable. Here C_M will join C₁. But, if ∇²Y(s) exists, then (8) suggests a simpler alternative. Because the average gradient is a double integral on the region Inline graphic , we may easily approximate the flux as a Monte Carlo average. Whether or not the consequent computational ease warrants the smoothness assumption would depend on the application.

When Y(s) is an isotropic Gaussian process with constant mean, μ(s) = μ, and correlation function K(||Δ||; σ², φ) = σ² exp(−φ|| Δ||²), the calculations are simplified. We have ∇ μ(s) = 0,μ_Γ(t^*) = 0, and H_K (Δ) = −2σ²φ exp(−φ|| Δ||²) × (I − 2φ ΔΔ^T). Further calculations reveal that (γ_YZ(t^*; σ², φ))_j can be computed as

\begin{array}{l} c (σ^{2}, φ, u^{⊥}, Δ_{0 j}) \\ \times (Φ (\sqrt{2 φ} (t^{*} + 〈 u, Δ_{0 j} 〉)) - Φ (\sqrt{2 φ} 〈 u, Δ_{0 j} 〉)), \end{array}

(13)

where Φ(·) is the standard Gaussian cdf and

\begin{array}{l} c (σ^{2}, φ, u^{⊥}, Δ_{0 j}) \\ = - 2 σ^{2} \sqrt{π φ} 〈 u^{⊥}, Δ_{0 j} 〉 exp (- φ ∣ 〈 u^{⊥}, Δ_{0 j} 〉 ∣^{2}) . \end{array}

These computations can be performed using the Gaussian cdf function, and quadrature is needed only for var(Γ_Y₍_s₎(t^*)). We use this model to demonstrate a simulation of curvilinear wombling. In real data settings, such as our example in Section 6, the Matérn is favored, providing more flexible control of surface smoothness.

Returning to the model Y(s) = x^T(s)β + W(s) + ε(s) with x(s) a general covariate vector, W(s) ~ GP(0, σ²ρ (·, φ)), and ε(s) a zero-centered white-noise process with variance τ², consider boundary analysis for the residual surface W(s). In fact, boundary analysis on the spatial residual surface is feasible in generalized linear modeling contexts with exponential families, where W(s) may be viewed as a nonparametric latent structure in the mean of the parent process; see Section 7.2.

Letting Γ_W₍_s₎(t) and Γ̄_W₍_s₎(t) denote the total and average gradient processes [as defined in (7)] for W(s), we seek the posterior distributions P(Γ_W₍_s₎(t^*)|Y) and P(Γ̄_W₍_s₎(t^*)|Y). Note that

\begin{array}{l} P (Γ_{W (s)} (t^{*}) ∣ Y) \\ = \int P (Γ_{W (s)} (t^{*}) ∣ W, θ) P (W ∣ θ, Y) P (θ ∣ Y) d θ d W, \end{array}

(14)

where W = (W(s₁), …, W(s_n)) denotes a realization of the residual process and θ = (β, σ², φ, τ²). Sampling of this distribution again proceeds in a posterior predictive fashion using posterior samples of θ and is expedited in a Gaussian setting because P(W|θ, Y) and P(Γ_W₍_s₎(t^*)|W, θ) are both Gaussian distributions.

Formal inference for a wombling boundary is done more naturally on the residual surface W(s) [i.e., for Γ_W₍_s₎(t^*) and Γ̄_W₍_s₎(t^*)] because W(s) is the surface containing any nonsystematic spatial information on the parent process Y(s). Because W(s) is a mean-0 process, μ_Γ_W₍_s₎ (t^*; β) = 0, and thus we need to check for the inclusion of this null value in the resulting 95% credible intervals for Γ_W₍_s₎(t^*) or, equivalently, for Γ̄_W₍_s₎(t^*). Again, this clarifies the issue of the normal direction mentioned in Section 3.3; significance using n(s(t)) is equivalent to significance using −n(s(t)). We need only select and maintain a particular orthogonal direction relative to the curve. In accordance with our remarks concerning absolute gradients in Section 3.3, we could compute (9) using |D_n₍_s₍_t₎₎Y(s(t))| using a Riemann sum, but this would be computationally expensive and would not offer a Gaussian calibration of significance.

5. A SIMULATED ILLUSTRATION OF ALGORITHMIC AND STATISTICAL WOMBLING

We illustrate curvilinear wombling with the following simulated data example. We first generated data from a stationary random field, Y(s), following GP(μ, σ² exp(−φ||Δ||²)). We use the Gaussian correlation function here simply for the convenience of closed-form (up to the Gaussian cdf) expressions for the posterior predictive distribution of Γ(t^*) [see eq. (13)]. In particular, we generated 100 observations from a 10 × 10 square region with parameters μ = 5, σ² = 1, and φ =.5. The maximum observed intersite distance was 12.6 units.

We analyzed the data using a flat prior for μ, an inverted-gamma IG(2,.001) prior for σ² (so that 1/σ² has mean 2,000 and variance 2.0E + 6), and a G(.5,.01) prior for φ. An MCMC algorithm was designed with Gibbs updates for μ and σ², and a Metropolis update for φ was used. Burn-in was diagnosed at 5,000 iterations, and posterior samples of size 6,000 were retained. Table 1 shows that the posterior estimation was successful in capturing the true value for each of the parameters. Posterior predictive samples were used to predict the random field Y(s) on a grid over the entire domain to produce the mean response surface in Figure 1, where the “+” indicate our 100 observation locations. The darker shades represent lower values of Y(s), and lighter shades represent higher values. Note that the overlaid contour lines provide additional details about the response surface topography and can help identify regions with high gradients. However, as mentioned in Section 1, contours are formed by connecting points with the same elevation and are not directly related to gradients.

Table 1.

Parameter Estimates in the Simulated Example

Parameters	50% (2.5%, 97.5%)	True value
μ	5.002 (4.661, 5.329)	5.0
σ²	.887 (.662, 1.271)	1.0
φ	.526 (.431,.858)	.5

Open in a new tab

The Predicted Mean Response Surface Plot for the Simulated Random Field With Overlaid Contours and the 100 Observation Locations Indicated by “+”.

Existing algorithmic wombling methods use deterministic interpolators. These interpolators are typically of the form $f (s) = \sum_{i = 1}^{n} w_{i} (S; s) f (s_{i})$ , where f (s_i) = Y(s_i)’s are the observed values in Inline graphic and w_i(; s) are weight functions of s that also depend on the entire configuration of observed sites . The choice of the interpolator depends on the regularity in the configuration of the locations, a triangulation of the domain, the smoothness of the surface, the speed of implementation, and numerical error analysis (see Cohen et al. 2001; Phillips 2003 for further details regarding such procedures). Here we demonstrate a typical implementation, using the popular wombling software BoundarySEER (www.terraseer.com), where the domain is first triangulated using a Delaunay triangulation with the observed locations forming the vertices of the triangles [see Figs. 2(a) and 2(b)]. Next, the bivariate linear interpolator f (s₁, s₂) = a + bs₁ + cs₂ is fitted to each triangular plate, with the coefficients a, b, and c determined from the observed data at the three vertices. The gradient vector ∇f and the maximal gradient value over all directions, ||∇f|| = [(∂f/∂s₁)² + (∂f/∂s₂)²]^1/2, are then computed at the centroids of the triangles, and those with the highest 5% or 10% of maximal gradients are identified as points that should form a part of the boundary. The corresponding triangles are then shaded as “zones” in which boundaries are likely to be found. Finally, these centroids are connected if they belong to adjacent zones (other angle-based connection rules also apply). Figures 2(a) and 2(b) show the output from such an implementation, with (a) corresponding to the 5% threshold and (b) corresponding to the 10% threshold.

Algorithmic Point-Referenced Wombling Using BoundarySEER [(a) and (b)] and a Predictive Surface of the Maximal Gradient Process (c).

Although such algorithms are fast and simple to implement, they rely on the particular triangulation to obtain the zones of rapid change that determine the boundary representations. Thus boundaries are “internally constructed,” and one cannot compute the average gradient along a curve obtained from GIS or external software. Furthermore, the construction of the boundaries is discrete and may not capture the local topography of the response surface. Uncertainty is not accounted for, and there is no way to test or validate these boundaries. Note that the directional derivative processes discussed in Section 2 can also be used to obtain zones of rapid change (also see Allard, Gabriel, and Bacro 2005). In fact, the image-contour plot in Figure 2(c) shows the mean predicted surface of ||∇Y(s)|| over the spatial domain. Unfortunately, it is not clear how to use these plots to ascertain curves that have large gradients.

Turning to the illustration of statistical inference with the same data, Figure 3 displays the mean predicted response surface but with certain curves and segments marked on them. It is important to note that it does not make sense to talk about “true” wombling boundaries here, because we do not know what the “true” equation of the surface is. Hence validation of our proposed methods must use the topographic information from contour plots. For this, we consider five different curves or segments, which we test for average gradients. The segments AB, CD, and EF are all of unit length with differing local topographies. The segment AB lies in the space between two contour lines, CD lies on relatively flat land with not much contours, and EF cuts across a stream of contour lines. We also investigate two closed curves for gradients and flux that are actually shown as contours: GHIJG and KLMNK. Finally, we evaluate gradients along PQ, QR, ST, UV, and WX, which are the “boundaries” proposed by BoundarySEER.

Curvilinear Boundary Analysis for Simulation Example.

Table 2 presents the results of our analysis. We present the average directional gradients along n(s(t)). We take n(s(t)) to be the uphill direction with positive median gradients. Again, we need not include the results for −n(s(t)) because the latter is just a reflection of the former. Recall that a curve will form a wombling boundary if the 95% credible interval for its wombling measure does not include 0. The true values in Table 2 correspond to the conditional mean, E[Γ(t)|Y, μ, σ², φ], with the true parameter values plugged in. The results are quite consistent with the local topography of the curves. Consider the rectilinear path of unit length, AB, going in the east–west direction. With u = (1, 0) and u^⊥ = (0, −1), we see a significant average gradient cutting across AB; in other words, AB forms a wombling boundary. This is not surprising, because the contours reveal an uphill gradient as one moves northward, cutting across AB. In contrast, CD is a unit-length rectilinear segment on relatively flat land. There are no visible gradients, as evidenced by the absence of contour lines, and indeed from Table 2 we see an insignificant average gradient across CD. Generally, significant boundaries are found parallel to the contour lines, not cutting across contours. This is because significant directional gradients are found in directions cutting across contour lines, and we take the average directional gradients perpendicular to the curve. This is illustrated by the curve EF, which cuts across a flow of north–south contours. The gradient here is parallel to EF (i.e., along u), and hence the directional gradients along u^⊥ are insignificant, producing an insignificant average.

Table 2.

Curvilinear Gradient Assessment From Figure 3 for the Mean Response Surface in the Simulation Example

Curve	Average gradient [n(s(t))]	True value [n(s(t))]
AB	1.242 (.886, 1.574)	1.275
CD	.055 (−.648,.745)	.110
EF	.140 (−.780, 1.060)	.032
Curve GHIJG
GH	.675 (.011, 1.312)	.735
HI	.899 (.545, 1.262)	.892
IJ	1.745 (1.313, 2.153)	1.799
JG	.935 (.590, 1.283)	.930
Flux	1.063 (.624, 1.493)	1.086
Curve KLMNK
KL	1.015 (.424, 1.619)	.967
LM	1.646 (.473, 2.842)	1.777
MN	2.291 (1.721, 2.854)	2.310
NK	2.119 (1.411, 2.819)	2.178
Flux	1.794 (1.024, 2.569)	1.839
PQ	2.152 (1.776, 2.572)	2.124
QR	.322 (−.163,.827)	.289
ST	2.627 (2.117, 3.125)	2.634
UV	.315 (−.198,.773)	1.381
WX	1.843 (1.355, 2.360)	1.779

Open in a new tab

NOTE: For each segment or curve, the estimates in the direction corresponding to a positive gradient are presented.

The next segment of Table 2 presents our results on curvilinear wombling for the contours GHIJG and KLMNK in Figure 3. The former lies in the slopes of a valley; the latter, on the slopes of a plateau. We first assess the significance of each curvilinear segment making up the contour, then evaluate the statistical flux of the enclosed region. Following Section 4, the statistical flux here is just a weighted average of the average gradients of each piece making up the contour. To be precise, suppose that C is the closed contour given by GHIJG. Once we have computed the curvilinear gradients for GH, HI, IJ, and JG, we can simply take the weighted average of the curvilinear average gradients of the four segments, with weights ν(C_k)/ν(C) yielding the statistical flux reported in Table 2. Each of the segments as well as the flux are significant, again corroborated by the topography. Note that n(s(t)) is always the uphill vector, which means that it is the “outward normal” for GHIJG and the “inward normal” for KLMNK. Evaluating the flux using Green’s result, we obtained a median “outward” flux of 1.070 with a credible interval of (.633, 1.512) for GHIJG and corresponding values of 1.758 and (1.002, 2.571) for the “inward” flux of KLMNK. These values concur with those obtained as weighted averages in Table 2.

The last segment of Table 2 presents results on testing significance of the boundaries proposed by algorithmic wombling. Indeed P, Q, R, S, T, U, and V are the selected centroids forming the top 5% of maximal gradients, and the connection algorithm produces the five boundaries shown. The significance of PQ is quite expected, lying along a stream of contours. Cutting across PQ would indeed indicate a smooth jump in elevation. Similarly, ST and WX are also formally tested to be wombling boundaries, although WX is somewhat less pronounced. QR and UV, on the other hand, do not have significant gradients and fail the formal test of being a wombling boundary. Although both lie in zones of possibly rapid change, they tend to cut across contours (much like EF) rather than move along them. Even though the points Q, R, U, and V all have significant maximal gradients, the tessellation-based connection of these points is unable to capture the local topography and the continuous gradient along the curve.

6. SPATIAL REGRESSION EXAMPLES

We illustrate boundary analysis on the spatial residual surfaces from two spatial regression models. The first of these models temperature against precipitation from a weather dataset obtained from the National Center for Atmospheric Research (NCAR) in Boulder, Colorado. The second uses a spatial logistic regression model for species presence/absence data from Connecticut. We note that Bayesian analysis of these models using posterior sampling is now available in user-friendly software such as WinBUGS (www.mrc-bsu.cam.ac.uk/bugs), and R packages such as geoR and geoRglm available from CRAN (comprehensive R archives network) (http://cran.us.r-project.org). Keep in mind, however, that the spatial processes modeled there usually assume a fixed Matérn smoothness parameter ν. Valid Bayesian boundary analysis can be performed by setting ν > 1 (or ν > 2 for evaluating the flux using Green’s theorem). The powered exponential family is also usually available. However, we present our analysis keeping ν unknown, as in the previous section, implementing our models in C/C++ with posterior summarizations and graphics in R.

6.1 A Spatial Gaussian Regression Example

We have mean temperature measurements (in 10°C units) obtained at 50 sites in July 1997 as our dependent variable Y(s). Also supplied are precipitation measurements (in 100-meter units) at each site, so that the covariate vector x(s) comprises an intercept and a precipitation. A univariate spatial model (see, e.g., Banerjee et al. 2004, chap. 5), Y(s) = x^T(s)β + W(s) + ε(s), is used to explain temperature given precipitation, allowing for both pure error [ $ε (s) \overset{iid}{\sim} N (0, τ^{2} I)$ ] and spatial correlation in the data [W(s) ~ GP(0, σ²ρ (·;φ, ν))].

We adopted a flat prior for β and relatively vague inverted-gamma, IG(2,.001), priors for σ² and τ². We used the Matérn correlation, ρ (φ, ν; d), with a gamma prior for the correlation decay parameter, φ, specified so that the prior spatial range has a mean of about half of the maximum intersite distance in our data, and a U(1, 2) prior for the smoothness parameter ν. That is, ν > 1 is needed for mean squared differentiability, whereas, following Stein (1999), the data will be unable to distinguish ν = 2 from values of ν > 2. Three parallel MCMC chains were run for 10,000 iterations. Convergence was diagnosed by monitoring mixing, Gelman–Rubin diagnostics, autocorrelations, and cross-correlations. In each case, 5,000 iterations were deemed enough to allow sufficient mixing of the chains, and the remaining 15,000 samples (5,000 × 3) were retained for posterior analysis.

Table 3 gives the parameter estimates for the spatial regression model. Precipitation seems to have a moderately positive impact on temperature. Note that the residual spatial story is quite strong here with σ²/(σ² + τ²) ≈.72, implying that the spatial variance explains about 72% of the residual variation. This is corroborated by Figure 4, which shows an image plot with contours of the posterior mean residual surface W(s) over the spatial domain, with the “+” indicating our 50 sampling locations. The coordinates are obtained from a sinusoidal planar projection of latitude and longitude and are in kilometers. The contours reveal an interesting topography with temperature residuals rich in spatial variation. Posterior samples of the spatial range are obtained by solving ρ (φ, ν; d) =.05 for each posterior draw of φ and ν. The estimated spatial range is about 262 km, which seems reasonable in our domain, with the maximum intersite distance of approximately 780 km.

Table 3.

Parameter Estimates for the Gaussian Spatial Regression Example

Parameters	50% (2.5%, 97.5%)
Intercept	2.827 (2.131, 3.866)
Precipitation	.037 (.002,.072)
σ²	.134 (.051, 1.245)
φ	7.39E–3 (4.71E–3, 51.21E–3)
ν	1.577 (1.210, 1.931)
Range (in km)	261.6 (37.8, 418.3)
τ²	.051 (.022,.092)

Open in a new tab

Curvilinear Boundary Analysis on the Spatial Residual Surface of Temperature Given Precipitation, Using GIS Extracted Boundary Segments for the Colorado Data.

It is expected that elevation can help explain the spatial temperature residuals. If elevation information is available over the entire domain, then one could, for example, model elevation as a regressor and study this impact. But when interest is focused on the impact of a curve or path of high elevation, estimating curvilinear gradients seems more appropriate. For choosing paths of high elevation to test for gradients, we used the open-source GIS software GRASS (http://grass.itc.it/) to identify topographic features. For example, in Figure 4 the linear segment AB and the curvilinear segment CD (along the illustrated contour) both represent mountain slopes, whereas the enclosed region bounded by KLMNOPK forms a valley.

To test for significant gradients on W(s) along the curves specified earlier, we compute the average gradients along AB, CD, and the curves forming the region KLMNOPK. These are computed using posterior predictive samples using (14). These gradients, along with the statistical flux of KLMNOPK, are presented in Table 4. As one might expect, we find a significant average gradient for both AB and CD, being in the middle of a dense flow of contour lines. As in the simulated example, difference boundaries on spatial residual surfaces are also likely to be seen parallel to dense flows of contours. The next part of Table 4 shows gradient evaluation of the curve KLMNOPK and the statistical flux of the enclosed region. The results seem quite consistent with the topography revealed in Figure 4. From the practitioner’s viewpoint, statistically confirming boundaries with high gradients is a way of capturing spatial information that is difficult to incorporate in the trend surface. Such analysis assists scientists in formulating physical models incorporating such information.

Table 4.

Boundary Assessment on the Spatial Residual Surface From Figure 4

Curve	Average gradient
AB	2.713 (2.328, 3.076)
CD	1.711 (1.280, 2.087)
KL	1.219 (.609, 1.830)
LM	.938 (−.003, 1.863)
MN	.842 (−.027, 1.750)
NO	1.668 (.727, 2.593)
OP	1.667 (.682, 2.542)
PK	1.129 (.587, 1.649)
Flux	1.257 (.448, 2.045)

Open in a new tab

6.2 A Spatial Logistic Regression Example

We consider data collected from 603 locations in Connecticut with presence/absence and abundance scores for some individual invasive plant species, plus environmental covariates. The covariates are available only at the sample locations, not on a grid. The response variable Y(s) is a presence–absence binary indicator (0 for absence) for one species, Celastrus orbiculatus, at location s. There are four categorical covariates: habitat class (representing the current state of the habitat) of four different types, land use and land cover (LULC) types (land use/cover history of the location; e.g., always forest, formerly pasture now forest) at five levels, and a 1970 category number (LULC at one point in the past, 1970; e.g., forest, pasture, residential) with six levels. In addition, we have an ordinal covariate, canopy closure percentage (percent of the sky blocked by “canopy” of leaves of trees; a location under mature forest would have close to 100% canopy closure, whereas a forest edge would have closer to 25%) with four levels in increasing order, a binary variable for heavily managed points (0 if “no”; “heavy management” implies active landscaping or lawn mowing), and a continuous variable measuring the distance from the forest edge in the logarithm scale. Figure 5 is a digital terrain image of the study domain, with the labeled curves indicating forest edges extracted using the GIS software ArcView (http://www.esri.com/). Ecologists are interested in evaluating spatial gradients along these 10 natural curves and identifying them as wombling boundaries.

A Digital Image of the Study Domain in Connecticut Indicating the Forest Edges as Marked Curves. These are assessed for significant gradients. Eastings range from 699148 to 708961; northings range from 4604089 to 4615875 for the picture.

We fit a logistic regression model with spatial random effects,

log (\frac{P (Y (s) = 1)}{P (Y (s) = 0)}) = x^{T} (s) β + W (s),

where x(s) is the vector of covariates observed at location s and W(s) ~ GP(0, σ² ρ(·; φ, ν)) is a Gaussian process with ρ (·; φ, ν) as a Matérn correlation function. Whereas Y(s) is a binary surface that does not admit gradients, conducting boundary analysis on W(s) is perfectly legitimate. The residual spatial surface reflects unmeasured or unobservable environmental features in the mean surface. Again, attaching curvilinear wombling boundaries to the residual surface tracks rapid changes in the departure from the mean surface.

We adopt priors similar to those in the preceding section: a completely noninformative flat prior for β, an inverted-gamma IG(2,.001) prior for σ², and the Matérn correlation function with a gamma prior for the correlation decay parameter, φ, specified so that the prior spatial range has a mean of about half of the observed maximum intersite distance (the maximum distance is 11,887 m based on a UTM projection), and a U(1, 2) prior for the smoothness parameter ν. Again, we ran three parallel MCMC chains for 15,000 iterations each, 10,000 iterations revealed sufficient mixing of the chains, and the remaining 15,000 samples (5,000 × 3) were used for posterior analysis.

Table 5 presents the posterior estimates of the model parameters. We do not have a statistically significant intercept, but most of the categorical variables reveal significance. Types 2 and 4 for habitat class have significantly different effects than type 1, all four types of LULC show significant departure from the baseline type 1, and for the 1970 category number, category 2 shows a significant negative effect, whereas categories 4 and 6 show significant positive effects compared with category 1. Canopy closure is significantly positive, implying higher presence probabilities of Celastrus orbiculatus with higher canopy blockage, whereas points that are more heavily managed appear to have a significantly lower probability of species presence, as does the distance from the nearest forest edge. Posterior summaries of the spatial process parameters are also presented, and the effective spatial range is approximated to be around 1,109.3 meters. These produce the mean posterior surface of W(s), shown in Figure 6 with the 20 endpoints of the forest edges from Figure 5 labeled to connect the figures.

Table 5.

Parameter Estimates for the Logistic Spatial Regression Example

Parameters	50% (2.5%, 97.5%)
Intercept	.983 (−2.619, 4.482)
Habitat class (baseline: type 1)
Type 2	−.660 (−1.044, −.409)
Type 3	−.553 (−1.254,.751)
Type 4	−.400 (−.804, −.145)
Land use land cover types (baseline: level 1)
Type 2	.591 (.094, 1.305)
Type 3	1.434 (.946, 2.269)
Type 4	1.425 (.982, 1.974)
Type 5	1.692 (.934, 2.384)
1970 category types (baseline: category 1)
Category 2	−4.394 (−6.169, −3.090)
Category 3	−.104 (−.504,.226)
Category 4	1.217 (.864, 1.588)
Category 5	−.039 (−.316,.154)
Category 6	.613 (.123, 1.006)
Canopy closure	.337 (.174,.459)
Heavily managed points (baseline: no)
Yes	−1.545 (−2.027, −.975)
Log edge distance	−1.501 (−1.891, −1.194)
σ²	8.629 (7.005, 18.401)
φ	1.75E–3 (1.14E–3, 3.03E–3)
ν	1.496 (1.102, 1.839)
Range (in m)	1109.3 (632.8, 1741.7)

Open in a new tab

The Spatial Residual Surface From the Presence–Absence Application for the 603 Observed Locations (not shown) in Connecticut. Also shown are the 20 endpoints for the 10 curves in Figure 5 to connect the figures.

Finally, Table 6 presents the formal curvilinear gradient analysis for the 10 forest edges in Figure 5. We see that 6 of the 10 edge curves (all except CD, EF, KL, and MN) are formally tested to be wombling boundaries. Our methodology proves useful here, because some of these edge curves meander along the terrain for very long distances. Indeed, although the residual surface in Figure 6 reveals a general pattern of spatial variation (higher residuals in the South), it is difficult to make visual assessments on the size (and significance) of average gradients for the longer curves. Furthermore, with nongridded data as here, the surface interpolators [in this case the Akima (1996) interpolator in R] often find it difficult to extrapolate beyond a convex hull of the site locations. Consequently, parts of the curve [e.g., endpoints C, G, and (almost) T] lie outside the fitted surface, making local visual assessment on them impossible.

Table 6.

Curvilinear Gradient Assessment for the 10 Forest Edges Labeled in Figure 5 for the Logistic Regression Example

Curve	Average gradient
AB	1.021 (.912, 1.116)
CD	.131 (−.031,.273)
EF	.037 (−.157,.207)
GH	1.538 (1.343, 1.707)
IJ	.586 (.136,.978)
KL	.036 (−.154,.202)
MN	.005 (−.021,.028)
OP	.227 (.087,.349)
QR	.282 (.118,.424)
ST	.070 (.017,.117)

Open in a new tab

Quickly and reliably identifying forest edges could be useful in determining boundaries between areas of substantial anthropogenic activity and minimally managed forest habitats. Such boundaries are important because locations where forest blocks have not been invaded by exotic plant species may be subject to significant seed rain from these species. These boundaries thus might form important “front lines” for efforts at monitoring or controlling invasive species.

7. SUMMARY AND DISCUSSION

Heretofore, a fully inferential approach to boundary analysis was not available in the literature. Although subjective assessments regarding wombling boundaries can be drawn from contour plots, formal inference (incorporating uncertainty) may be desirable. In contrast to edge detection and contouring problems in image analysis and pattern recognition, our primary focus has been on the modeling of spatial gradients along curves from possibly irregularly spaced point-referenced data, without resorting to alignment algorithms for mapping such data into pixels. Starting with recently developed theory on directional derivatives at points on a random realization of a Gaussian process, we have moved from points to curves, using formal vector analytic methods to determine whether a proposed curve should be declared a curvilinear wombling boundary. Our methods are particularly geared toward statistical modelers, allowing boundary analysis on response and residual surfaces from spatial regression models. In fact, these can be easily extended to nonstationary models with weighted mixtures of Gaussian processes, and even to nonparametric spatial models using continuous mixtures of Gaussian processes.

The vector-analytic framework with parametric curves also opens up a number of new directions for further research. For instance in practice it would be of interest to develop a stochastic or model-based method to construct wombling boundaries. As the earlier discussion suggests, it is both natural and easiest to construct boundaries using polygonal curves, and hence to consider only piecewise linear boundaries. Thus such a construction requires a starting point, say $s_{0}^{(1)}$ , and then travels an unknown distance $t_{(1)}^{*}$ in a direction $u_{opt}^{(1)}$ to a new point $s_{0}^{(2)}$ , which becomes the new starting point. Here $u_{opt}^{(1)} = u_{max}^{⊥}$ would be, say, the perpendicular direction to $u_{max} = \nabla Y (s_{0}^{(1)}) / ∥ \nabla Y (s_{0}^{(1)}) ∥$ , the direction of steepest spatial gradient at $s_{0}^{(1)}$ . However, the choice of starting point is not clear. It should be a point with a large maximum gradient, but this does not uniquely determine a point. In fact, if the maximum gradient at s₀ is “significant,” then our smoothness assumptions imply that this will be so for all points in a neighborhood of s₀. Moreover, there will be many parts of the region where we may look for s₀’s, many places to start. Next, even with $u_{opt}^{(1)}$ , according to sign, we have two choices for moving from s₀. Furthermore, if the maximum gradient at s₀ is the highest as we move in direction $u_{opt}^{(1)}$ , then, if we decided to select $t_{(1)}^{*}$ to maximize the average gradient along the line segment in this direction, we would not move; any departure from s₀ would yield a smaller average. Finally, even if we did move and created a sequence of segments, if a closed boundary were appropriate, then such a construction would close with probability 0. To avoid endless circling, ad hoc intervention must be imposed. In summary, we have examined “algorithms” that resolve these various problems by reference to a contour plot and found that these perform well in comparison with currently available wombling boundary software. However, due to the ad hoc decisions that are required, we cannot offer a satisfactory version at present.

More fundamentally, here we have confined ourselves to curves that track zones of rapid change. However, as we alluded to earlier, zones of rapid change are areal notions; description by a curve may be an unsatisfying simplification. Describing zones as areal quantities (i.e., as sets of nonzero Lebesgue measure in ℜ²) is an alternative. When proceeding, the crucial issue is to formalize shape-oriented definitions of a wombling boundary. Although much work has been done on statistical shape analysis, its use in the point-referenced spatial data context that we have set out herein is unexplored. There are many possibilities using formal differential geometry and calculus of variations, providing directions for future research. Finally, we also note that our approach is built entirely on the specification of a point-referenced spatial process model. One might examine the boundary analysis problem from an alternative modeling perspective, in which curves or zones arise as random processes. Possibilities include line processes, random tessellations, and random areal units.

Acknowledgments

The work of the first author was supported in part by National Institutes of Health (NIH) grant 1-R01-CA95995, and that of the second author was supported in part by NIH grant 2-R01-ES07750. The authors thank Andrew Latimer and John Silander, University of Connecticut for several useful discussions.

APPENDIX: GREEN’S THEOREM

Green’s theorem on the plane is a special case of Stokes’s theorem on general manifolds (see, e.g., Rudin 1976). We provide a direct derivation for simple closed regions. Consider the function F(s) = (F₁(s), F₂(s)), where each F_i(s) is a real-valued function and s = (s₁, s₂) references the coordinates in ℜ². Let C be a curve traced out by the parameter t ∈ ℜ¹, so that C = {s(t): t ∈ ℜ¹}. For the present context, assume that C = C₁ ∪ C₂ is composed of two curves, C₁ and C₂, such that C₁ = {(t, g₁(t)): a ≤ t ≤ b} and C₂ = {(t, g₂(t)): a ≤ t ≤ b}. Then the region enclosed by C is denoted by Inline graphic and can be expressed as = {(s₁, s₂): a ≤ s₁ ≤ b, g₁(s₁) ≤ s₂ ≤ g₂(s₁)}. Now consider the double integral over ,

\begin{array}{l} \int \int_{D} (\frac{\partial F_{1}}{\partial s_{1}} + \frac{\partial F_{2}}{\partial s_{2}}) {d s}_{1} {d s}_{2} \\ = \int \int_{D} \frac{\partial F_{1}}{\partial s_{1}} {d s}_{1} {d s}_{2} + \int \int_{D} \frac{\partial F_{2}}{\partial s_{2}} {d s}_{1} {d s}_{2} . \end{array}

Consider the second integral on the right side,

\begin{array}{l} \int \int_{D} \frac{\partial F_{2}}{\partial s_{2}} {d s}_{1} {d s}_{2} \\ = \int_{a}^{b} \int_{g_{1} (s_{1})}^{g_{2} (s_{1})} (\frac{\partial F_{2}}{\partial s_{2}} {d s}_{2}) {d s}_{1} \\ = \int_{a}^{b} (F_{2} (s_{1}, g_{2} (s_{1})) - F_{2} (s_{1}, g_{1} (s_{1}))) {d s}_{1} \\ = \int_{a}^{b} F_{2} (s_{1}, g_{2} (s_{1})) {d s}_{1} - \int_{a}^{b} F_{2} (s_{1}, g_{1} (s_{1})) {d s}_{1} \\ = - \int_{C_{2}} F_{2} (s_{1}, g_{2} (s_{1})) {d s}_{1} - \int_{C_{1}} F_{2} (s_{1}, g_{1} (s_{1})) {d s}_{1} \\ = - \oint_{C} F_{2} {d s}_{1} . \end{array}

Similarly,

\int \int_{D} \frac{\partial F_{1}}{\partial s_{1}} {d s}_{1} {d s}_{2} = \oint_{C} F_{1} {d s}_{2} .

Combining these, we obtain

\int \int_{D} (\frac{\partial F_{1}}{\partial s_{1}} + \frac{\partial F_{2}}{\partial s_{2}}) {d s}_{1} {d s}_{2} = \oint_{C} F_{1} {d s}_{2} - F_{2} {d s}_{1} .

(A.1)

Contributor Information

Sudipto Banerjee, Assistant Professor Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55414 (E-mail: sudiptob@biostat.umn.edu).

Alan E. Gelfand, J. B. Duke Professor of Statistics, Institute of Statistics and Decision Sciences, Duke University, Durham, NC 27708 (E-mail: alan@stat.duke.edu)

References

Adler RJ. The Geometry of Random Fields. Chichester, U.K.: Wiley; 1981. [Google Scholar]
Akima H. Algorithm 761: Scattered-Data Surface Fitting That Has the Accuracy of a Cubic Polynomial. ACM Transactions on Mathematical Software. 1996;22:362–371. [Google Scholar]
Allard D, Gabriel E, Bacro JN. Estimating and Testing Zones of Abrupt Changes for Spatial Data, research report. Unité de Biométrie, INRA; 2005. [Google Scholar]
Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]
Banerjee S, Gelfand AE, Sirmans CF. Directional Rates of Change Under Spatial Process Models. Journal of the American Statistical Association. 2003;98:946–954. [Google Scholar]
Barbujani G, Jacquez GM, Ligi L. Diversity of Some Gene Frequencies in European and Asian Populations, V: Steep Multilocus Clines. American Journal of Human Genetics. 1990;47:867–875. [PMC free article] [PubMed] [Google Scholar]
Barbujani G, Oden NL, Sokal RR. Detecting Areas of Abrupt Change in Maps of Biological Variables. Systematic Zoology. 1989;38:376–389. [Google Scholar]
Bocquet-Appel JP, Bacro JN. Generalized Wombling. Systematic Biology. 1994;43:442–448. [Google Scholar]
Caillol H, Wojciech P, Hillion A. Estimation of Fuzzy Gaussian Mixture and Unsupervised Statistical Image Segmentation. IEEE Transactions on Image Processing. 1997;6:425–439. doi: 10.1109/83.557353. [DOI] [PubMed] [Google Scholar]
Chellappa R, Jain AK. Markov Random Fields. New York: Academic Press; 1993. [Google Scholar]
Chilés JP, Delfiner P. Geostatistics: Modelling Spatial Uncertainty. New York: Wiley; 1999. [Google Scholar]
Cohen E, Riesenfeld RF, Elber G. Geometric Modeling With Splines: An Introduction. Natick, MA: A.K. Peters; 2001. [Google Scholar]
Cressie NAC. Statistics for Spatial Data. 2. New York: Wiley; 1993. [Google Scholar]
Csillag F, Kabos S. Wavelets, Boundaries and the Analysis of Landscape Pattern. Ecoscience. 2002;9:177–190. [Google Scholar]
Dass SC, Nair VN. Edge Detection, Spatial Smoothing, and Image Reconstruction With Partially Observed Multivariate Data. Journal of the American Statistical Association. 2003;98:77–89. [Google Scholar]
Fagan WF, Fortin MJ, Soykan C. Integrating Edge Detection and Dynamic Modeling in Quantitative Analyses of Ecological Boundaries. BioScience. 2003;53:730–738. [Google Scholar]
Fortin MJ. Edge Detection Algorithms for Two-Dimensional Ecological Data. Ecology. 1994;75:956–965. [Google Scholar]
Fortin MJ. Effects of Data Types on Vegetation Boundary Delineation. Canadian Journal of Forest Research. 1997;27:1851–1858. [Google Scholar]
Fortin MJ, Drapeau P. Delineation of Ecological Boundaries: Comparisons of Approaches and Significance Tests. Oikos. 1995;72:323–332. [Google Scholar]
Frankel T. The Geometry of Physics: An Introduction. Cambridge, U.K.: Cambridge University Press; 2003. [Google Scholar]
Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. 2. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]
Geman S, Geman D. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1984;6:721–741. doi: 10.1109/tpami.1984.4767596. [DOI] [PubMed] [Google Scholar]
Jacquez GM, Greiling DA. Geographic Boundaries in Breast, Lung and Colorectal Cancers in Relation to Exposure to Air Toxins in Long Island, New York. International Journal of Health Geographics. 2003;2:4. doi: 10.1186/1476-072X-2-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jones CB. Geographical Information Systems and Computer Cartography. Harlow, Essex, U.K.: Addison-Wesley, Longman; 1997. [Google Scholar]
Lee S, Wolberg G, Shin SY. Scattered Data Interpolation With Multilevel B-Splines. IEEE Transactions on Visualization and Computer Graphics. 1997;3:228–244. [Google Scholar]
Lu H, Carlin BP. Bayesian Areal Wombling for Geographical Boundary Analysis. Geographical Analysis. 2006 to appear. [Google Scholar]
Mardia KV, Kent JT, Goodall CR, Little JA. Kriging and Splines With Derivative Information. Biometrika. 1996;83:207–221. [Google Scholar]
Mitas L, Mitasova H. Spatial Interpolation. In: Longley P, Goodchild MF, Maguire DJ, Rhind DW, editors. Geographical Information Systems: Principles, Techniques, Management and Applications, GeoInformation International. New York: Wiley; 1999. pp. 481–492. [Google Scholar]
Møller J, editor. Spatial Statistics and Computational Methods. New York: Springer-Verlag; 2003. [Google Scholar]
Morel JM, Solimini S. Variational Methods in Image Segmentation, With Seven Image Processing Experiments. Boston: Birkhäuser; 1994. [Google Scholar]
Mumford D, Shah J. Optimal Approximation by Piecewise Smooth Functions and Associated Variational Problems. Communications in Pure and Applied Mathematics. 1989;42:577–685. [Google Scholar]
Phillips GM. Interpolation and Approximation by Polynomials. New York: Springer-Verlag; 2003. [Google Scholar]
Rudin W. Principles of Mathematical Analysis. New York: McGraw-Hill; 1976. [Google Scholar]
Santner TJ, Williams BJ, Notz WI. The Design and Analysis of Computer Experiments. New York: Springer-Verlag; 2003. [Google Scholar]
Schabenberger O, Gotway CA. Statistical Methods for Spatial Data Analysis. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]
Scheiner SM, Gurevitch J. Design and Analysis of Ecological Experiments. 2. Oxford, U.K.: Oxford University Press; 2001. [Google Scholar]
Stein ML. Interpolation of Spatial Data: Some Theory of Kriging. New York: Springer-Verlag; 1999. [Google Scholar]
Waller LA, Gotway CA. Applied Spatial Statistics for Public Health Data. New York: Wiley-Interscience; 2004. [Google Scholar]
Walsh DCI, Raftery AE. Accurate and Efficient Curve Detection in Images: The Importance Sampling Hough Transform. Pattern Recognition. 2002;35:1421–1431. [Google Scholar]
Webster R, Oliver MA. Geostatistics for Environmental Scientists. New York: Wiley; 2001. [Google Scholar]
Winkler G. Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction. New York: Springer-Verlag; 2003. [Google Scholar]
Womble W. Differential Systematics. Science. 1951;114:315–322. doi: 10.1126/science.114.2961.315. [DOI] [PubMed] [Google Scholar]

[R1] Adler RJ. The Geometry of Random Fields. Chichester, U.K.: Wiley; 1981. [Google Scholar]

[R2] Akima H. Algorithm 761: Scattered-Data Surface Fitting That Has the Accuracy of a Cubic Polynomial. ACM Transactions on Mathematical Software. 1996;22:362–371. [Google Scholar]

[R3] Allard D, Gabriel E, Bacro JN. Estimating and Testing Zones of Abrupt Changes for Spatial Data, research report. Unité de Biométrie, INRA; 2005. [Google Scholar]

[R4] Banerjee S, Carlin BP, Gelfand AE. Hierarchical Modeling and Analysis for Spatial Data. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]

[R5] Banerjee S, Gelfand AE, Sirmans CF. Directional Rates of Change Under Spatial Process Models. Journal of the American Statistical Association. 2003;98:946–954. [Google Scholar]

[R6] Barbujani G, Jacquez GM, Ligi L. Diversity of Some Gene Frequencies in European and Asian Populations, V: Steep Multilocus Clines. American Journal of Human Genetics. 1990;47:867–875. [PMC free article] [PubMed] [Google Scholar]

[R7] Barbujani G, Oden NL, Sokal RR. Detecting Areas of Abrupt Change in Maps of Biological Variables. Systematic Zoology. 1989;38:376–389. [Google Scholar]

[R8] Bocquet-Appel JP, Bacro JN. Generalized Wombling. Systematic Biology. 1994;43:442–448. [Google Scholar]

[R9] Caillol H, Wojciech P, Hillion A. Estimation of Fuzzy Gaussian Mixture and Unsupervised Statistical Image Segmentation. IEEE Transactions on Image Processing. 1997;6:425–439. doi: 10.1109/83.557353. [DOI] [PubMed] [Google Scholar]

[R10] Chellappa R, Jain AK. Markov Random Fields. New York: Academic Press; 1993. [Google Scholar]

[R11] Chilés JP, Delfiner P. Geostatistics: Modelling Spatial Uncertainty. New York: Wiley; 1999. [Google Scholar]

[R12] Cohen E, Riesenfeld RF, Elber G. Geometric Modeling With Splines: An Introduction. Natick, MA: A.K. Peters; 2001. [Google Scholar]

[R13] Cressie NAC. Statistics for Spatial Data. 2. New York: Wiley; 1993. [Google Scholar]

[R14] Csillag F, Kabos S. Wavelets, Boundaries and the Analysis of Landscape Pattern. Ecoscience. 2002;9:177–190. [Google Scholar]

[R15] Dass SC, Nair VN. Edge Detection, Spatial Smoothing, and Image Reconstruction With Partially Observed Multivariate Data. Journal of the American Statistical Association. 2003;98:77–89. [Google Scholar]

[R16] Fagan WF, Fortin MJ, Soykan C. Integrating Edge Detection and Dynamic Modeling in Quantitative Analyses of Ecological Boundaries. BioScience. 2003;53:730–738. [Google Scholar]

[R17] Fortin MJ. Edge Detection Algorithms for Two-Dimensional Ecological Data. Ecology. 1994;75:956–965. [Google Scholar]

[R18] Fortin MJ. Effects of Data Types on Vegetation Boundary Delineation. Canadian Journal of Forest Research. 1997;27:1851–1858. [Google Scholar]

[R19] Fortin MJ, Drapeau P. Delineation of Ecological Boundaries: Comparisons of Approaches and Significance Tests. Oikos. 1995;72:323–332. [Google Scholar]

[R20] Frankel T. The Geometry of Physics: An Introduction. Cambridge, U.K.: Cambridge University Press; 2003. [Google Scholar]

[R21] Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. 2. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]

[R22] Geman S, Geman D. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1984;6:721–741. doi: 10.1109/tpami.1984.4767596. [DOI] [PubMed] [Google Scholar]

[R23] Jacquez GM, Greiling DA. Geographic Boundaries in Breast, Lung and Colorectal Cancers in Relation to Exposure to Air Toxins in Long Island, New York. International Journal of Health Geographics. 2003;2:4. doi: 10.1186/1476-072X-2-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Jones CB. Geographical Information Systems and Computer Cartography. Harlow, Essex, U.K.: Addison-Wesley, Longman; 1997. [Google Scholar]

[R25] Lee S, Wolberg G, Shin SY. Scattered Data Interpolation With Multilevel B-Splines. IEEE Transactions on Visualization and Computer Graphics. 1997;3:228–244. [Google Scholar]

[R26] Lu H, Carlin BP. Bayesian Areal Wombling for Geographical Boundary Analysis. Geographical Analysis. 2006 to appear. [Google Scholar]

[R27] Mardia KV, Kent JT, Goodall CR, Little JA. Kriging and Splines With Derivative Information. Biometrika. 1996;83:207–221. [Google Scholar]

[R28] Mitas L, Mitasova H. Spatial Interpolation. In: Longley P, Goodchild MF, Maguire DJ, Rhind DW, editors. Geographical Information Systems: Principles, Techniques, Management and Applications, GeoInformation International. New York: Wiley; 1999. pp. 481–492. [Google Scholar]

[R29] Møller J, editor. Spatial Statistics and Computational Methods. New York: Springer-Verlag; 2003. [Google Scholar]

[R30] Morel JM, Solimini S. Variational Methods in Image Segmentation, With Seven Image Processing Experiments. Boston: Birkhäuser; 1994. [Google Scholar]

[R31] Mumford D, Shah J. Optimal Approximation by Piecewise Smooth Functions and Associated Variational Problems. Communications in Pure and Applied Mathematics. 1989;42:577–685. [Google Scholar]

[R32] Phillips GM. Interpolation and Approximation by Polynomials. New York: Springer-Verlag; 2003. [Google Scholar]

[R33] Rudin W. Principles of Mathematical Analysis. New York: McGraw-Hill; 1976. [Google Scholar]

[R34] Santner TJ, Williams BJ, Notz WI. The Design and Analysis of Computer Experiments. New York: Springer-Verlag; 2003. [Google Scholar]

[R35] Schabenberger O, Gotway CA. Statistical Methods for Spatial Data Analysis. Boca Raton, FL: Chapman & Hall/CRC; 2004. [Google Scholar]

[R36] Scheiner SM, Gurevitch J. Design and Analysis of Ecological Experiments. 2. Oxford, U.K.: Oxford University Press; 2001. [Google Scholar]

[R37] Stein ML. Interpolation of Spatial Data: Some Theory of Kriging. New York: Springer-Verlag; 1999. [Google Scholar]

[R38] Waller LA, Gotway CA. Applied Spatial Statistics for Public Health Data. New York: Wiley-Interscience; 2004. [Google Scholar]

[R39] Walsh DCI, Raftery AE. Accurate and Efficient Curve Detection in Images: The Importance Sampling Hough Transform. Pattern Recognition. 2002;35:1421–1431. [Google Scholar]

[R40] Webster R, Oliver MA. Geostatistics for Environmental Scientists. New York: Wiley; 2001. [Google Scholar]

[R41] Winkler G. Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction. New York: Springer-Verlag; 2003. [Google Scholar]

[R42] Womble W. Differential Systematics. Science. 1951;114:315–322. doi: 10.1126/science.114.2961.315. [DOI] [PubMed] [Google Scholar]

PERMALINK

Bayesian Wombling: Curvilinear Gradient Assessment Under Spatial Process Models

Sudipto Banerjee

Alan E Gelfand

Abstract

1. INTRODUCTION

2. SPATIAL GRADIENTS AND DIRECTIONAL DERIVATIVE PROCESSES

3. WOMBLING MEASURE ASSOCIATED WITH A CURVE

3.1 Open Curves

3.2 Closed Curves

3.3 Wombling Boundary

Definition

4. DISTRIBUTION THEORY

5. A SIMULATED ILLUSTRATION OF ALGORITHMIC AND STATISTICAL WOMBLING

Table 1.

Figure 1.

Figure 2.

Figure 3.

Table 2.

6. SPATIAL REGRESSION EXAMPLES

6.1 A Spatial Gaussian Regression Example

Table 3.

Figure 4.

Table 4.

6.2 A Spatial Logistic Regression Example

Figure 5.

Table 5.

Figure 6.

Table 6.

7. SUMMARY AND DISCUSSION

Acknowledgments

APPENDIX: GREEN’S THEOREM

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Bayesian Wombling: Curvilinear Gradient Assessment Under Spatial Process Models

Sudipto Banerjee

Alan E Gelfand

Abstract

1. INTRODUCTION

2. SPATIAL GRADIENTS AND DIRECTIONAL DERIVATIVE PROCESSES

3. WOMBLING MEASURE ASSOCIATED WITH A CURVE

3.1 Open Curves

3.2 Closed Curves

3.3 Wombling Boundary

Definition

4. DISTRIBUTION THEORY

5. A SIMULATED ILLUSTRATION OF ALGORITHMIC AND STATISTICAL WOMBLING

Table 1.

Figure 1.

Figure 2.

Figure 3.

Table 2.

6. SPATIAL REGRESSION EXAMPLES

6.1 A Spatial Gaussian Regression Example

Table 3.

Figure 4.

Table 4.

6.2 A Spatial Logistic Regression Example

Figure 5.

Table 5.

Figure 6.

Table 6.

7. SUMMARY AND DISCUSSION

Acknowledgments

APPENDIX: GREEN’S THEOREM

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases