Abstract
Area-based measures of neighborhood characteristics simply derived from enumeration units (e.g., census tracts or block groups) ignore the potential of spatial spillover effects, and thus incorporating such measures into multilevel regression models may underestimate the neighborhood effects on health. To overcome this limitation, we describe the concept and method of areal median filtering to spatialize area-based measures of neighborhood characteristics for multilevel regression analyses. The areal median filtering approach provides a means to specify or formulate “neighborhoods” as meaningful geographic entities by removing enumeration unit boundaries as the absolute barriers and by pooling information from the neighboring enumeration units. This spatializing process takes into account for the potential of spatial spillover effects and also converts aspatial measures of neighborhood characteristics into spatial measures. From a conceptual and methodological standpoint, incorporating the derived spatial measures into multilevel regression analyses allows us to more accurately examine the relationships between neighborhood characteristics and health. To promote and set the stage for informative research in the future, we provide a few important conceptual and methodological remarks, and discuss possible applications, inherent limitations, and practical solutions for using the areal median filtering approach in the study of neighborhood effects on health.
Keywords: USA, Areal data, Area-based measures, Median filter, Multilevel regression analysis, Spatial approach, Spatial median
Introduction
Several review articles have summarized a considerable amount of research conducted during the past two decades to understand the neighborhood effects on health.1–12 Throughout this paper, “previous studies” and “existing studies” generally refer to research articles summarized in these review articles. In previous studies,1–12 multilevel regression models13–16 have been used as a statistical tool to combine traditionally distinct individual-level and ecological analyses, and to avoid individualistic and ecological fallacies.17 Note that multilevel regression models are also known as hierarchical, mixed, mixed-effect, nested, or random-effect regression models in different fields of study. By separating the effects of contextual- and individual-level factors, previous studies1–12 have demonstrated the associations of neighborhood characteristics with various health behaviors and health outcomes after accounting for individual sociodemographic characteristics (e.g., age, gender, race/ethnicity, educational achievement, income level, employment status, and/or marital status). On the whole, both theoretical and empirical foundations of neighborhoods and health research offer a strong basis to suggest that “where we live” matters for our health over and above “who we are.”18
While a large number of studies have been conducted to date, the current state of knowledge about neighborhoods and their links to health is subject to several limitations.19,20 Notably, the vast majority of existing studies1–12 do not account for the potential of spatial spillover effects (or, more generally, the spatial relationships between geographic locations) in their multilevel regression analyses. Therefore, Diez Roux,19,20 Spielman and Yoo,21 and Perchoux et al.,22 have criticized that the neighborhood characteristics surrounding the residents are not adequately represented and analyzed in existing studies.1–12 Such a conceptual and methodological criticism primarily pertains to the definition of neighborhood boundary. In the United States (US), for example, census tracts (sometimes block groups or zip-code tabulation areas (ZCTAs)) have been routinely used to denote “neighborhoods” in previous studies.1–12 The use of census tracts has been favored in US studies due to two main reasons: (i) unlike block groups or ZCTAs, census tracts are relatively homogeneous with regard to population characteristics, economic status, and living conditions; they generally consist of about 4000 inhabitants,23 and (ii) census tracts are a national creation of democratic governance informed by local inputs, are also delineated historically in accordance with uniform standards, and tend to be stable over time.24 These desirable properties make census tracts reliable enumeration units for data aggregation by the US Census Bureau, and thus they have been widely used in US studies. For these reasons, using census tracts as neighborhoods is now considered as a standard practice in multilevel regression analyses.
Despite the desirable properties, census tracts (as well as block groups and ZCTAs) are artificial demarcations overlaid onto a geographic space. By and large, census tract boundaries follow visible and identifiable physical features (e.g., streets and rivers). The use of census tracts in neighborhoods and health research, therefore, assumes that residents’ activity space (i.e., the local area in which people move or travel) in their place of residence, and thus residents’ exposure to their residential environments take place only within such boundaries. However, neighborhoods are not confined to arbitrarily define enumeration unit boundaries; a meaningful specification or formulation of neighborhoods needs to embody the spatial nature of geographic locations.25,26 From a conceptual standpoint, using census tracts oversimplify residents’ daily mobility patterns in their place of residence.19,22 For instance, residents living near the edge of a census tract are likely to cross a street (which corresponds to the invisible boundary) and to be on the other side of the census tract on a daily basis. From a methodological standpoint, a misspecification of exposure to residential environments may produce unreliable or misleading results from a regression analysis.20,21 In particular, existing studies1–12 may have underestimated the neighborhood effects on health.21 Taken together, an unrealistic conceptualization of neighborhoods, and thus an inadequate statistical analysis of neighborhood characteristics in multilevel regression analyses would lead to incomplete (if not biased) knowledge about the role of neighborhood context in health (i.e., either an adverse, a protective, or a null effect).
In order to address some of these conceptual and methodological concerns,19–22 we describe the concept and method of areal median filtering to account for the potential of spatial spillover effects in multilevel regression analyses. We provide didactic examples both in conceptual and methodological ways. For demonstration purposes, we show how commonly used area-based measures of neighborhood affluence-deprivation and suburban-urban continua can be modified into spatial measures, and also demonstrate the differences between traditional and areal median filtering approaches. To promote and set the stage for informative research in the future, we provide a few important conceptual and methodological remarks, and also discuss possible applications, inherent limitations, and practical solutions. We use St. Louis, MO, as an example to demonstrate and illustrate these points.
Quantifying Neighborhood Characteristics
Toward a more accurate examination of the neighborhood effects on health,19–22 spatial spillover effects are an important source of externality in quantifying neighborhood characteristics associated with demographic composition and physical condition. An inquiry into the importance or relevance of externality in the spatial patterns of human and physical phenomena dates back a few decades,27,28 and thus the notion of spatial spillover effects or spatial externalities is widely accepted as the pivotal component in the analysis of area, environment, place, and space.29,30 The underlying motivation for such spatial thinking is grounded in Tobler’s first law of geography:31 “everything is related to everything else, but near things are more related than distant things” (p. 236). These, in turn, convey a spatial perspective that the characteristics of a neighborhood are not merely shaped by a particular bounded location, but are also shaped by the characteristics of its surrounding locations.
Moving beyond the traditional realm of city- and metropolitan-scale analysis,29,30 a number of studies have highlighted the role of spatial spillover effects in understanding the neighborhood effects on health. For example, Durlauf32 indicated that the ability of wealthier families to separate themselves from the low socioeconomic groups has led to the significant stratification of neighborhoods by income; wealthy families tend to live in an affluent neighborhood surrounded by similarly affluent neighborhoods, whereas poor families tend to live in a deprived neighborhood surrounded by equally deprived neighborhoods. He conceived spatial spillover effects to heighten the level of separation between the most affluent and most derived neighborhoods. In a parallel realm of importance, Morenoff and Sampson33 showed that high (or low) levels of homicide around a given neighborhood led to greater (or lesser) population losses; in their analysis, high levels of homicide and increases in the spatial proximity to homicide over time were significant predictors of such population changes. Another study by Sampson et al.34 also showed that the levels of social capital and collective efficacy for children in a given neighborhood were linked to the relative spatial position of that neighborhood within the city; they noted children living in a given neighborhood benefited from (or harmed by) their proximities to neighborhoods with high (or low) levels of shared parental expectations for child social control. Building upon these theoretical and empirical foundations, Galster25,26 suggested that the potential of spatial spillover effects needs to be considered in quantifying neighborhood characteristics associated with demographic composition and physical condition.
To account for the potential of spatial spillover effects, for example, a binary weight35 or a spatial kernel36,37 has been implemented into areal data. In brief, binary weight considers the spatial externalities of neighboring census tracts and gives equal weights to all neighboring census tracts, whereas spatial kernel considers the local intensity of the spatial externalities of all census tracts in a given area and gives more weights to census tracts located near than those farther away. Commonly considered in geographical research, spatial kernel captures the distance decay effect,38–42 which reflects the nature of spatial relationships that decline with increasing distances.31 However, it has long been emphasized that the form of geospatial data (i.e., areal or point-referenced data) and the (ir)regularity of geographic features determine the suitable approach in modeling spatial relationships.43,44 In fact, spatial kernels and associated distance-decay parameters are strongly influenced by the spatial structure, including the size and the geometric configuration of areal data.45–48 Specifically, spatial kernels are suited for modeling equally spaced grid-like patterns where the rate of distance decay is uniform across geographic space. In handling areal data, it is important to recall that the rate of distance decay between census tracts (as well as block groups and ZCTAs) is not uniform because the size of census tracts increases from urban to rural areas in relation to the density of settlement. As a result, spatial kernels tend to underestimate or even ignore the spatial relationships of irregular census tracts with varying spatial extents.
In addition to the computational difficulties associated with implementing a spatial kernel into areal data,45–48 a binary weight is not as ill-suited as long been concerned by different authors.36,37 Particularly for US studies, it can reasonably take into account for the potential of spatial spillover effects in urban settings. For example, recent studies conducted in three different regions of the US showed that the local area of participants’ daily mobility overlapped with their census tract of residence and its neighboring census tracts.49–51 Moreover, two studies conducted in different US cities showed that resident-defined neighborhood boundaries were very close to the size of a census tract (in terms of land area), but typically included portions of at least one neighboring census tract (rather than a complete single census tract),52 and that resident-defined neighborhood boundaries contained their census tract of residence and its neighboring census tracts.53 Given the diverse settings in which these five US studies were conducted, these findings substantiate the value to the desirable properties of census tracts.23,24 Put differently, when a census tract is used to denote a neighborhood, it may be adequate for quantifying the neighborhood characteristics of residents living at or near the center of the census tract, but may not be for those living near the border or edge of the census tract. Therefore, to account for the potential of spatial spillover effects, area-based measures of neighborhood characteristics need to consider not only the census tract of residence but also its neighboring census tracts.49–53
To summarize, neighborhoods may be viewed as the local areas corresponding to residents’ activity spaces.54,55 Recent US studies in urban settings49–53 corroborate the idea that the census tract of residence and its neighboring census tracts can be used in this regard. Therefore, using a binary weighting scheme to define the memberships of neighboring census tracts is preferred over the spatial kernel scheme. By accounting for the potential of spatial spillover effects, incorporating the derived area-based measures of neighborhood characteristics into multilevel regression analyses addresses some of the conceptual and methodological concerns raised by Diez Roux,19,20 Spielman and Yoo,21 and Perchoux et al.22 Hence, an integration of spatial thinking into multilevel thinking will allow us to more accurately examine the neighborhood effects on health.
Materials and Methods
Data
Area-based sociodemographic data at the census tract level were obtained from the 2005–2009 American Community Survey (ACS) for St. Louis, MO (St. Charles County, St. Louis County, and St. Louis City). Note that the 5-year ACS estimates are based on a larger sample size and thus more reliable than the 1- and 3-year estimates. The 2000 US census tract boundary shapefile was obtained from the US Census Bureau. Since census tract boundaries extend into rivers and/or include large ponds and lakes, such water bodies were removed from the boundary shapefile when the total land area (in square kilometers) was recalculated in GIS (ArcGIS 10.2; ESRI Inc., Redlands, CA, USA). This recalculation procedure better represents the actual land surface area in which people move or travel. A shapefile of water bodies was obtained from the Data & Maps Collection for ArcGIS on DVD, and all shapefiles were projected using the NAD 1983, State Plane Coordinate System.
Areal Median Filtering Approach
When census tract boundaries are simply used to denote neighborhoods, each census tract is treated as an “island” because information about the spatial relationships between census tracts is not considered. By design, area-based measures of neighborhood characteristics simply derived from such boundaries are aspatial in nature. Hereafter, “aspatial” refers to the insensitivity of analytical results to the spatial arrangement of enumeration units (e.g., census tracts or block groups). Rather than relying solely on census tract boundaries, a better approach is to remove those boundaries as the absolute barriers and to incorporate information about the spatial relationships between census tracts. This “spatializing” process converts aspatial measures of neighborhood characteristics into spatial measures.
To derive a spatial measure of neighborhood characteristics, Wong35 introduced the concept of composite population to more realistically portray the experience of residents living in a given census tract by accounting for the spatial interaction (i.e., the movement of people across geographic space) across neighboring census tracts. Adopting the concepts in modeling spatial autocorrelation,56,57 he suggested using the function cij(.) to remove the census tract boundaries as the absolute barriers for spatial interaction.35 Let census tract i be the reference unit, cij(.) is the element of a modified binary (0, 1) connectivity matrix where (i) cij = 1 indicates that census tracts i and j are neighbors, and cij = 0 otherwise, and (ii) i can equal to j and thus cii = 1. By implementing the function cij(.), for example, the composite population count of group G in census tract i (cgi) is modeled as
where gj is the population count of group G in census tract j. Therefore, cgi is the sum of the population count in census tract i plus the population counts in its neighboring census tracts j. This specification implicitly accounts for the spatial interaction of residents across census tracts.
Wong’s approach35 provides a more realistic representation of residents’ activity space49–53 than the traditional approach that relies solely on a single census tract. However, the concept of composite population35 was designed to handle only discrete variables (i.e., population counts), and thus cannot handle continuous variables (e.g., median household income and income-to-poverty ratio). As a possible way to handle continuous variables, the concept of composite population35 can be modified by integrating the concept of median filter (also referred to as moving medians).58 We define the integration of these two concepts as the concept of areal median filtering. Note that the median is a more robust measure of central tendency than the mean when the data are skewed. Therefore, the role of median filter in the areal median filtering approach is to measure a central tendency of neighborhood characteristics.
In handling continuous variables, the areal median filtered measure of census tract i (mqi) is modeled as
where qj is the interval or ratio measure of census tract j. Therefore, mqi is the median of the interval or ratio measure of census tract i and its neighboring census tracts j. This specification implicitly removes census tract boundaries as the absolute barriers and also pools information from neighboring census tracts to quantify a representative description of neighborhood characteristics surrounding the residents living in census tract i. Figure 1 illustrates the difference between traditional and areal median filtering approaches in quantifying the interval or ratio measure of neighborhood characteristics.
FIG. 1.
Illustration of the difference between traditional and areal median filtering approaches. The function c ij(.)35 is used to remove the enumeration unit boundaries as the absolute barriers for spatial interaction, and the median filter58 is used to measure a central tendency of neighborhood characteristics. The areal median filtering approach is designed to handle continuous variables (i.e., interval or ratio measures). This spatializing process addresses some of the conceptual and methodological concerns raised by different authors.19 – 22.
Among previous studies,1–12 neighborhood characteristics along the affluence-deprivation and suburban-urban continua were two common area-based measures used in neighborhoods and health research. The areal median filtering approach can be used to convert aspatial measures of these two neighborhood characteristics into spatial measures.
First, the aspatial measure of neighborhood deprivation (Mi, also known as the measure of material deprivation), which captures the affluence-deprivation continuum, is defined as
| 1 |
where hi is the median household income (US $) in census tract i and f−1(.) is the inverse function. Here, median household income was used because Oka59 showed that it was highly, but negatively correlated (−0.85 ≤ r ≤ −0.92) with area-based composite measures of socioeconomic position (SEP),60 socioeconomic deprivation (SED),61 and deprivation (DEP)62 in St. Louis, MO. Note that SEP, SED, and DEP are comprised of 6, 17, and 8 area-based socioeconomic variables, respectively. Although median household income was also highly and positively correlated with median family income (r = 0.94), median household income had less missing data than median family income. Since the increase in median household income refers to the increase in neighborhood affluence, its inverse correspondingly refers to the increase in neighborhood deprivation.59 Based on the areal median filtering approach, the spatial measure of neighborhood deprivation (SMi) is defined as
| 2 |
where mhi is the median of the median household income in census tract i and its neighboring census tracts j.
Second, the aspatial measure of neighborhood urbanness (Ui), which captures the suburban-urban or rural-suburban-urban continuum, is defined as
| 3 |
where di is the population density (the number of total population divided by the size of land area in square kilometers) in census tract i. Based on the areal median filtering approach, the spatial measure of neighborhood urbanness (SUi) is defined as
| 4 |
where mdi is the median of the population density in census tract i and its neighboring census tracts j.
Using census tracts as the basic enumeration units, aspatial and spatial measures of neighborhood deprivation (Mi and SMi, respectively) and neighborhood urbanness (Ui and SUi, respectively) were computed in R.63,64 As defined above, the function cij(.) implemented in SMi and SUi is a list of polygon neighbors that includes the census tract itself (i.e., cii = 1) and census tracts sharing a common boundary with or touching a boundary of census tract i (i.e., cij = 1). Then, the median is calculated to derive the areal median-filtered measures (see Fig. 1) after excluding census tracts that do not share or touch a boundary (i.e., cij = 0). The R codes for computing SMi and SUi are given below:
library(spdep)
# The function cij(.)
c_ij = nb2mat(include.self(poly2nb(Data, queen = TRUE)), style = “B”, zero.policy = TRUE)
# Replacing 0 s with NAs
c_ij[c_ij==0] = NA
# Spatial Measure of Neighborhood Deprivation
Median_Household_Income_c_ij = mapply(“*”, c_ij, Median_Household_Income, SIMPLIFY = TRUE)
Median_Household_Income_c_ij_matrix = matrix(Median_Household_Income_c_ij, nrow = nrow(c_ij), ncol = nrow(c_ij), byrow = TRUE)
SM = (−1)*apply(Median_Household_Income_c_ij_matrix, 1, FUN = “median”, na.rm = TRUE)
# Spatial Measure of Neighborhood Urbanness
Population_Density_c_ij = mapply(“*”, c_ij, Population_Density, SIMPLIFY = TRUE)
Population_Density_c_ij_matrix = matrix(Population_Density_c_ij, nrow = nrow(c_ij), ncol = nrow(c_ij), byrow = TRUE)
SU = apply(Population_Density_c_ij_matrix, 1, FUN = “median”, na.rm = TRUE)
Note that the inverse function f−1(.) in Mi and SMi is multiplying area-based measures by −1 (negative one).
Analysis
To demonstrate the differences between aspatial and spatial measures of neighborhood deprivation and neighborhood urbannness, scatterplots and Pearson product–moment correlation coefficients (r) were computed in R65 for St. Louis, MO (Fig. 2). As a supplemental illustration, histograms of these aspatial and spatial measures are shown in the lower panels of Fig. 2. For visual analysis, the geographical distributions of area-based measures derived from Mi, SMi, Ui, and SUi were mapped in GIS for St. Louis, MO (Fig. 3). For interpretation purposes, these four area-based measures were normalized to their range, such that all measures are bounded between 0 and 1. Here, the purpose of rescaling aspatial and spatial measures is to provide an interpretable comparison of the changes from low to high values in a consistent manner. These normalizations are justifiable as the purpose of comparing aspatial and spatial measures is to capture within-city variations, not between-city differences. A quantile classification scheme was used to display the levels of neighborhood deprivation and neighborhood urbanness.
FIG. 2.
Relationships between aspatial and spatial measures of neighborhood characteristics in St. Louis, MO (340 census tracts). a area-based measures of neighborhood deprivation and b area-based measures of neighborhood urbanness. The upper panels show scatterplots and Pearson product-moment correlation coefficients (r) between aspatial and spatial measures, and the lower panels show histograms of those measures.
FIG. 3.
Geographic distributions of aspatial and spatial measures of neighborhood characteristics in St. Louis, MO (340 census tracts). a aspatial measure of neighborhood deprivation, b spatial measure of neighborhood deprivation, c aspatial measure of neighborhood urbanness, and d spatial measure of neighborhood urbanness.
The normalization process retains rank order and the relative degree of separation between area-based measures within a given study area. However, the degree of separation between affluent and deprivation neighborhoods and the intensity of settlements vary from city to city. In other words, the highest and lowest values of median household income and population density in St. Louis, MO are unlikely to be the highest and lowest values, for example, in New York, NY, Los Angeles, CA, Chicago, IL, Houston, TX, and Philadelphia, PA. Therefore, the absolute area-based measures should be used when the purpose of comparing aspatial and spatial measures is to capture between-city differences.
To quantitatively demonstrate the difference between aspatial and spatial measures (i.e., Mi and Ui versus SMi and SUi) in the context of multilevel regression analysis, we constructed a hypothetical data of 3000 individuals living across 300 census tracts in St. Louis, MO (Table 1). As an example, we use obesity as the outcome of interest: non-obese versus obese. By incorporating neighborhood- and individual-level covariates, two multilevel logistic regression analyses were carried out in R66,67 for St. Louis, MO (Table 2).
TABLE 1.
Description of a hypothetical data in St. Louis, MO (3000 individuals living across 300 census tracts)
| Individual-level outcome | ||
|---|---|---|
| Non-obese | BMI <30 | 70.0 % |
| Obese | BMI ≥30 | 30.0 % |
| Individual-level sociodemographics | ||
| Age | Minimum | 25.0 |
| Mean | 39.2 | |
| Median | 39.0 | |
| Maximum | 65.0 | |
| Gender | Female | 50.0 % |
| Male | 50.0 % | |
| Education | ≥16 years | 25.1 % |
| 13–15 years | 25.1 % | |
| 12 years | 32.9 % | |
| <12 years | 11.9 % | |
| NA | 5.0 % | |
| Income | ≥$100,000 | 15.1 % |
| $75,000–$99,999 | 15.8 % | |
| $50,000–$74,999 | 18.4 % | |
| $25,000–$49,999 | 28.7 % | |
| <$25,000 | 15.0 % | |
| NA | 7.1 % | |
BMI body mass index, NA not available
TABLE 2.
Hypothetical multilevel logistic regression analysis of obesity in St. Louis, MO
| Model 1 | Model 2 | ||
|---|---|---|---|
| Fixed effects | OR (95 % CI) | OR (95 % CI) | |
| Neighborhood-level covariates | |||
| Neighborhood deprivation (M i) | 1.20 (1.13, 1.28) | ||
| Neighborhood urbanness (U i) | 0.54 (0.26, 1.11) | ||
| Neighborhood deprivation (SM i) | 1.24 (1.15, 1.33) | ||
| Neighborhood urbanness (SU i) | 0.44 (0.20, 0.95) | ||
| Individual-level covariates | |||
| Age | 1.05 (1.04, 1.06) | 1.05 (1.04, 1.06) | |
| Gender | Female | Ref. | Ref. |
| Male | 0.50 (0.42, 0.61) | 0.51 (0.42, 0.61) | |
| Education | ≥16 years | Ref. | Ref. |
| 13–15 years | 1.25 (0.94, 1.66) | 1.26 (0.95, 1.68) | |
| 12 years | 1.59 (1.22, 2.09) | 1.61 (1.23, 2.11) | |
| <12 years | 1.58 (1.11, 2.24) | 1.59 (1.12, 2.25) | |
| NA | 1.65 (0.85, 3.19) | 1.62 (0.84, 3.13) | |
| Income | ≥$100,000 | Ref. | Ref. |
| $75,000–$99,999 | 1.06 (0.75, 1.51) | 1.08 (0.76, 1.54) | |
| $50,000–$74,999 | 1.10 (0.78, 1.55) | 1.12 (0.80, 1.58) | |
| $25,000–$49,999 | 1.34 (0.96, 1.86) | 1.38 (0.99, 1.92) | |
| <$25,000 | 1.60 (1.10, 2.33) | 1.63 (1.12, 2.37) | |
| NA | 0.90 (0.50, 1.62) | 0.93 (0.52, 1.67) | |
| Random effects | Variance (SE) | Variance (SE) | |
| Census tract | 0.477 (0.690) | 0.478 (0.691) | |
| Goodness-of-fit measures | |||
| Akaike information criterion (AIC) | 3278.327 | 3281.703 | |
| Bayesian information criterion (BIC) | 3368.423 | 3371.798 | |
| Log-likelihood | −1624.164 | −1625.851 | |
| Deviance | 3248.327 | 3251.703 | |
OR odds ratio, CI confidence interval, NA not available, Ref. reference category, SE standard error
Results
Figure 2 shows the correlations between aspatial and spatial measures of neighborhood deprivation and neighborhood urbanness. Comparing the aspatial and spatial measures of neighborhood deprivation (upper scatterplot in Fig. 2a), Mi is highly and positively correlated with SMi (r = 0.90). Similarly, comparing the aspatial and spatial measures of neighborhood urbanness (upper scatterplot in Fig. 2b), Ui is highly and positively correlated with SUi (r = 0.83). While scatterplots display linear associations between these aspatial and spatial measures, clear discrepancies can be observed among the varying levels of neighborhood deprivation and neighborhood urbanness measures. These discrepancies are attributable to the skewness of those measures where Mi and SMi are negatively skewed, whereas Ui and SUi are positively skewed (lower histograms in Fig. 2).
For visual assessment of the difference between traditional and areal median filtering approaches, the spatial arrangements of aspatial and spatial measures of neighborhood deprivation and neighborhood urbanness are shown in Fig. 3. By comparing the geographical distributions of Mi and SMi (Fig. 3a versus Fig. 3b) as well as Ui and SUi (Fig. 3c versus Fig. 3d), it is clear that aspatial measures and its spatial counterparts do not exactly resemble the same spatial patterns; noticeably, SMi and SUi show much “smoother” spatial patterns than Mi and Ui, respectively. The “smoothness” of spatial measures and thus the dissimilar spatial patterns compared with its aspatial counterparts is attributable to the potential of spatial spillover effects27–30 in areal data. These, in turn, illustrate the importance of a spatial perspective that area-based measures of neighborhood characteristics are not merely shaped by a particular bounded census tract, but are also shaped by its neighboring census tracts.
The areal median filtering approach, which smooths the areal data spatially, is helpful not only for the visualization of spatial patterns, but also for the detection of spatial clusters.68 For example, highly deprived neighborhoods clustered in the eastern parts of the city in Fig. 3a are much more recognizable in Fig. 3b because those neighborhoods are adjacent to and/or surrounded by equally high levels of neighborhood deprivation. In a similar fashion, densely populated neighborhoods clustered in the eastern, northern, and southeastern parts of the city in Fig. 3c are much more profound in Fig. 3d since those neighborhoods are adjacent to and/or surrounded by similarly high levels of neighborhood urbanness. These “smoothed” maps can serve as the foundations for exploratory spatial data analysis (ESDA)69–71 to synthesize knowledge gained from existing and future studies toward informing public policies and decision-making. As a potential contribution, the visualization of high-risk neighborhood clusters through ESDA can assist health researchers and their collaborators in identifying where the resources should be allocated for designing effective community-based interventions or health promotions.
As a way to demonstrate the difference between traditional and areal median filtering approaches in the context of multilevel regression analysis, the associations of aspatial and spatial measures of neighborhood deprivation and neighborhood urbanness with obesity are shown in Table 2. Note that the parameter estimates are based on a hypothetical data of obesity described in Table 1, and thus they must not be considered as empirical evidence. Nevertheless, results from the two multilevel regression analyses (model 1 versus model 2) highlight the importance of using spatial measures of neighborhood deprivation (SMi) and neighborhood urbanness (SUi) over its aspatial counterparts (Mi and Ui, respectively). That is, the aspatial measure of neighborhood urbanness (Ui) is not associated with obesity in model 1 (OR 0.54, 95 % CI 0.26–1.11), but the spatial measure of neighborhood urbanness (SUi) is associated with obesity in model 2 (OR 0.44, 95 % CI 0.20–0.95). Otherwise, changes in the estimation of other fixed effects, random effects, and goodness-of-fit measures are negligible. These suggest that a multilevel regression analysis based on the aspatial measures may underestimate the potential risk of obesity among people living in suburban neighborhoods. Rooted in the core concepts of spatial thinking,27–34 a comparison of model 1 with model 2 in Table 2 underscores the notion that spatial spillover effects are an important source of externality in the study of neighborhood effects on health.
When incorporating the areal median filtering approach into multilevel regression analyses, however, some caution is required in future studies. For instance, the association of SUi with obesity in model 2 was conditional on SMi (i.e., SUi was not associated with obesity in a multilevel logistic regression analysis without SMi). Therefore, the covariate inclusion and exclusion criteria should be based on theoretical and scientific insights, rather than the statistical significance in a preliminary analysis. In like manner, spatial spillover effects may have a lesser or greater relevance in a multilevel regression analysis depending on the outcome of interest (e.g., health behaviors versus health outcomes). To better understand the role of spatial spillover effects in neighborhoods and health research, a series of empirical studies are needed for various types of health indicators in different cities (e.g., sparse and dense urban settings) and regions (e.g., Eastern and Western states) across the USA.
From an analytical standpoint, the generalizability and transferability of empirical findings to different populations, geographic settings, contexts, and/or points in time are essential components in health research. Yet, the study of neighborhood effects on health is heavily influenced by how it is studied and how results are interpreted; the results obtained from a multilevel regression analysis can be contingent on what was considered (or was not considered) in the model. Hence, model building and the interpretation of results should be handled with care.
Discussion
A realistic representation of neighborhood characteristics is needed for multilevel regression analyses; otherwise, only limited (if not biased) knowledge can be gained through existing and future studies.19–22 By integrating the concepts of composite population35 and median filter,58 we therefore described the areal median filtering approach to account for the potential of spatial spillover effects in quantifying a representative description of neighborhood characteristics surrounding the residents living in a given census tract. This spatializing process allows us to specify or formulate neighborhoods as meaningful geographic entities in multilevel regression analyses. Notwithstanding the usefulness of the areal median filtering approach, there are arguably comparable, but slightly different, perspectives toward the integration of spatial smoothers (i.e., median filter versus mean filter) and the choice of enumeration units (i.e., census tracts versus block groups) that may arise in future studies. Hence, we discuss our point of view on these matters.
Regarding the spatial smoother, mean filter (also referred to as moving mean or moving average)72,73 may be used to introduce the areal mean filtering approach. This alternative approach simply replaces the median filter with the mean filter, and thus can be used to derive, for example, SM*i and SU*i. Borrowing the concept from time-series analysis, Curry72 discussed possible uses of mean filter for geospatial data, and Haining73 provided a detailed theoretical treatment of using mean filter in modeling spatial interactions. By and large, both the concepts of mean and median filter, as well as other types of spatial smoothers, have been implemented in various modeling techniques.74–81 However, the use of median filter has long been advocated by Tukey58 as a measure of central tendency. More importantly, the nature of local variability in a given region69–71 needs to be considered when choosing between the two filters. On one hand, mean filter tends to lessen unusual values in the data, and thus may be more suitable for handling geospatial data where localized outliers are generally not expected. On the other hand, median filter tends to preserve unusual values in the data, and thus may be more appropriate for handling geospatial data where localized outliers are expected. Taking these into consideration and potential applications in future studies, the areal median filtering approach is preferred over its alternative areal mean filtering approach.
With regard to the geographic unit of analysis, block groups (or its equivalent enumeration units) may be used when using census tracts deemed inappropriate or impractical in certain settings. For example, the size of census tracts becomes much larger in rural areas. Unfortunately, recent US studies on residential activity spaces have been conducted in urban settings.49–53 While little is known about the urban-suburban-rural differences in residential activity spaces, it is likely that block groups (instead of census tracts) may be well-suited to represent residents’ activity space in their place of residence, and thus residents’ exposure to their residential environments in rural settings. In such a case, the areal median filtering approach can be implemented into block groups (but following the same specification given above). Another possible scenario for using block groups is that enumeration units equivalent to the US census tracts may not exist in some countries. For instance, census sections (which are analogous to block groups in the US) have been used to denote neighborhoods in Spanish studies.82–87 If block groups were used in urban settings outside the USA, however, the function cij(.) should be modified to include higher-order (e.g., the second-, third-, or fourth-order) units. The inclusion of higher-order enumeration units is to compensate for the smaller area of block groups, which may not reasonably represent the extent of residents’ daily mobility in their place of residence. The modified version of cij(.) can be denoted as c′ij(.) to model mq′i (similar to the specification given above).
While further efforts may focus on the use of other spatial smoothers and/or smaller enumeration units, the underlying principle remains the same. The key importance is that enumeration unit boundaries may not delimit residents’ daily mobility patterns in their place of residence, and thus residents’ exposure to their residential environments. As Getis88 emphasized, prediction is imperative for understanding the movement of people between places, not the degree of spatial autocorrelation present in a geospatial data or the statistical significance in a (multilevel) regression analysis. To this end, the areal median filtering approach provides a means to specify or formulate neighborhoods as meaningful geographic entities by removing the census tract boundaries as the absolute barriers and by pooling information from the neighboring census tracts to more realistically represent the characteristics of a neighborhood. These specifications or formulations address some of the conceptual and methodological concerns raised by different authors.19–22 Hence, the integration of mean filter (instead of median filter) and/or the use of block groups (instead of census tracts) can be viewed as an extension of, which may not be superior to, the areal median filtering approach.
As a technical note, we list a few words of caution for using the areal median filtering approach in future studies. First, an implementation of spatial function (binary weight35 and spatial kernel36,37 alike) will be subject to the boundary or edge effect. Such an effect introduces bias into the identification of spatial distribution and the parameter estimates of spatial processes.89 Since areal associations and geographic distributions extend beyond the demarcated boundary (e.g., county boundary), when the study area is “cropped” or “lifted” from its surrounding geography in the analysis, the measures or statistics computed within the study area would be biased. In particular, census tracts closer to the border of the study area will be affected more than those closer to the center of the study area. Several solutions have been proposed during the past decades, but none of them can fully solve the problem.90,91 Because the function cij(.) implemented in the areal median filtering approach involves only the first-order adjacency relation (Fig. 1), a buffer zone including census tracts immediately outside the study area will be sufficient for accounting the boundary or edge effect. If the function c′ij(.) was to be used in rural areas of the USA or in non-USA settings, then the buffer size should be adjusted to include higher-order adjacency relations outside the study area.
Second, an implementation of spatial function (binary weight35 and spatial kernel36,37 alike) smooths the spatial distribution of area-based measures. As shown in Fig. 3, it leads to a better visualization and detection of the systematic spatial patterns and spatial clusters within the study area. However, implementing a spatial function also magnifies the level of spatial autocorrelation (i.e., residuals that vary systematically over space), which in turn violate the assumption of independence. The violation of independence yield inflated R-square values, deflated standard errors, and overestimated t tests in the regression analysis.92,93 Because commonly used multilevel regression models only account for the within-group dependency, but not the between-group dependency,13–16 spatial autocorrelation between area-based measures of neighborhood characteristics (aspatial and spatial measures alike) must be handled in the multilevel regression analysis. One possible solution is to use (Bayesian) generalized geoadditive mixed models (GGAMMs),94 which can be considered as a multilevel version of (Bayesian) generalized geoadditive models. Given the complexity of such statistical models, however, the use of GGAMMs, particularly when using Bayesian GGAMMs, is only recommended for those who have a clear comprehension of handling the multilevel and spatial data structures.
Third, an application of areal median filtering approach for measuring other types of neighborhood characteristics may face challenges due to the quality of ACS data. After the 2000 decennial census, the US Census no longer gathers detailed information on household’s socioeconomic status and living conditions through the so-called long form. Replacing the long form is the ACS, which is an ongoing survey conducted by the US Census Bureau since 2005 and remains the major source of socioeconomic and housing data. However, due to the changes made in the ACS,95 estimates for smaller geographical units (e.g., census tracts, but more so for block groups) are not as reliable (with relatively large margin of errors) as the past decennial censuses. The inaccuracy of ACS data is primarily due to the reliance on small samples, particularly in sparsely population areas.96,97 Therefore, ACS data must be handled with care.
Conclusion
The study of neighborhood effects on health requires a spatial and multilevel thinking. In the context of neighborhoods and health research,18 the use of multilevel regression models has become the standard statistical approach during the past two decades.1–12 However, spatial thinking27–30 has been largely overlooked in existing studies.1–12 Only a limited number of studies have been conducted to account for the potential of spatial spillover effects in their multilevel regression analyses.98–102 These five studies used local spatial segregation measures based on the concept of composite population.103,104 The dearth of spatial thinking is linked to the fact that no spatial measures, similar to the local spatial segregation measures, exist today to quantify other types and/or dimensions of neighborhood characteristics. The areal median filtering approach provides a means to fill the gap by spatializing area-based measures of neighborhood characteristics for multilevel regression analyses. Hence, the concept and method described in this paper contribute to the integration of spatial thinking into the already evident multilevel thinking in neighborhoods and health research.
The areal median filtering approach is designed to quantify continuous variables (i.e., interval or ratio measures), and thus we focused on deriving spatial measures of neighborhood deprivation (SMi) and neighborhood urbanness (SUi). If a discrete variable (i.e., population count) is of interest, then the composite population approach35 should be considered instead. For example, residential segregation has long been considered to shape health in the USA105–108 and has become an increasing concern in Europe.109 Because simple percentages (e.g., percent non-Hispanic black) cannot capture the spatial nature of segregation,110,111 we recommend using spatial measures of residential segregation developed by Wong.103,104 See Oka and Wong110,111 for helpful theoretical and methodological discussions on the measurement of residential segregation and for a useful critique on the misuse of ineffective and insufficient segregation measures. Capitalizing on the shared principle of using the function cij(.), the composite population and areal median filtering approaches can be used to handle discrete and continuous variables, respectively.
Besides spatial measures of neighborhood deprivation (SMi) and neighborhood urbanness (SUi) considered in this paper, the areal median filtering approach can be applied to capture various types of neighborhood characteristics for multilevel regression analyses. However, it is important to recognize that we provided the concept and method for areal data, and not for point-referenced data. In some instances, retail store densities (e.g., convenience stores, grocery stores, supermarkets, and tobacco stores) have been used to characterize the neighborhood physical (or built) environments. Widely recognized as a “malpractice” in handling geospatial data, aggregating point-reference data into census tracts (as well as any other enumeration units) would lead to the creation of artificial spatial patterns, a source of bias known as the modifiable areal unit problem.112 In order to foster informative research, we do not recommend the use of areal median filtering approach to combine two different types of geospatial data (i.e., areal and point-referenced data).
The use of multilevel regression models is one way to examine how spatial, sociocultural, institutional, and national contexts (e.g., neighborhoods, peers, families, religions, schools, workplaces, cities or metropolis, regions, and countries) may shape the health of individuals within them.113–116 While spatial regression models117–121 may be used to account for the potential of spatial spillover effects in neighborhoods and health research, they are not designed to handle multilevel data structures and thus cannot adequately analyze multiple contexts. Since the areal median filtering approach addresses some of the limitations inherent in multilevel regression analyses,19–22 future studies should focus not only on individual- and neighborhood-level factors (i.e., a simple two-level model), but also on factors operating at different contexts (i.e., a cross-classified model). Toward gaining a more well-rounded insight into the “web of causation,”122 the continued uses of multilevel regression models,13–16 more preferably the uses of GGAMMs,94 provide a fruitful research paradigm for better understanding how micro-, meso-, and macro-level factors shape health.
Acknowledgments
This work was supported by the European Research Council (ERC) under the European Union’s Seventh Framework Programme (FP/2007-2013)/ERC Starting Grant “Heart Healthy Hoods” (Agreement No. 336893). The ERC had no further role in the study design, collection, analysis, and interpretation of data, writing of the article, or decision for publication. The opinions and assertions contained herein are the private ones of the authors and are not considered as official or reflecting the views of the ERC.
References
- 1.Robert SA. Socioeconomic position and health: the independent contribution of community context. Annu Rev Sociol. 1999;25:489–516. doi: 10.1146/annurev.soc.25.1.489. [DOI] [Google Scholar]
- 2.Pickett KE, Pearl M. Multilevel analyses of neighborhood socioeconomic context and health outcomes: a critical review. J Epidemiol Community Health. 2001;55(2):111–122. doi: 10.1136/jech.55.2.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Diez Roux AV. Residential environments and cardiovascular risk. J Urban Health. 2003;80(4):569–589. doi: 10.1093/jurban/jtg065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Booth KM, Pinkston MM, Poston WS. Obesity and the built environment. J Am Diet Assoc. 2005;105(5):S110–S117. doi: 10.1016/j.jada.2005.02.045. [DOI] [PubMed] [Google Scholar]
- 5.Papas MA, Alberg AJ, Ewing R, Helzlsouer KJ, Gary TL, Klassen AC. The built environment and obesity. Epidemiol Rev. 2007;29(1):129–143. doi: 10.1093/epirev/mxm009. [DOI] [PubMed] [Google Scholar]
- 6.Black JL, Macinko J. Neighborhoods and obesity. Nutr Rev. 2008;66(1):2–20. doi: 10.1111/j.1753-4887.2007.00001.x. [DOI] [PubMed] [Google Scholar]
- 7.Kim D. Blues from the neighborhood? Neighborhood characteristics and depression. Epidemiol Rev. 2008;30(1):101–117. doi: 10.1093/epirev/mxn009. [DOI] [PubMed] [Google Scholar]
- 8.Mair C, Diez Roux AV, Galea S. Are neighbourhood characteristics associated with depressive symptoms? A review of evidence. J Epidemiol Community Health. 2008;62(11):940–946. doi: 10.1136/jech.2007.066605. [DOI] [PubMed] [Google Scholar]
- 9.Chaix B. Geographic life environments and coronary heart disease: a literature review, theoretical contributions, methodological updates, and a research agenda. Annu Rev Public Health. 2009;30:81–105. doi: 10.1146/annurev.publhealth.031308.100158. [DOI] [PubMed] [Google Scholar]
- 10.Kramer MR, Hogue CR. Is segregation bad for your health? Epidemiol Rev. 2009;31(1):178–194. doi: 10.1093/epirev/mxp001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Diez Roux AV, Mair C. Neighborhoods and health. Ann N Y Acad Sci. 2010;1186:125–145. doi: 10.1111/j.1749-6632.2009.05333.x. [DOI] [PubMed] [Google Scholar]
- 12.Meijer M, Röhl J, Bloomfield K, Grittner U. Do neighborhoods affect individual mortality? A systematic review and meta-analysis of multilevel studies. Soc Sci Med. 2012;74(8):1204–1212. doi: 10.1016/j.socscimed.2011.11.034. [DOI] [PubMed] [Google Scholar]
- 13.Raudenbush SW, Bryk AS. Hierarchical linear models in social and behavioral research: applications and data analysis methods. 2nd ed. Newbury Park, CA: Sage Publications; 2002.
- 14.Gelman A, Hill J. Data analysis using regression and multilevel/hierarchical models. New York: Cambridge University Press; 2007. [Google Scholar]
- 15.Hox J. Multilevel analysis: techniques and applications. 2nd ed. New York, NY: Routledge; 2010.
- 16.Snijders TAB, Bosker RJ. Multilevel analysis: an introduction to basic and advanced multilevel modeling. 2nd ed. Thousand Oaks, CA: SAGE Publications; 2012.
- 17.Subramanian SV, Jones K, Kaddour A, Krieger N. Revisiting robinson: the perils of individualistic and ecologic fallacy. Int J Epidemiol. 2009;38(2):342–360. doi: 10.1093/ije/dyn359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kawachi I, Berkman LF. Neighborhoods and Health. New York, NY: Oxford University Press; 2003.
- 19.Diez Roux AV. Investigating neighborhood and area effects on health. Am J Public Health. 2001;91(11):1783–1789. doi: 10.2105/AJPH.91.11.1783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Diez Roux AV. Next steps in understanding the multilevel determinants of health. J Epidemiol Community Health. 2008;62(11):957–959. doi: 10.1136/jech.2007.064311. [DOI] [PubMed] [Google Scholar]
- 21.Spielman SE, E-h Y. The spatial dimensions of neighborhood effects. Soc Sci Med. 2009;68(6):1098–1105. doi: 10.1016/j.socscimed.2008.12.048. [DOI] [PubMed] [Google Scholar]
- 22.Perchoux C, Chaix B, Cummins S, Kestens Y. Conceptualization and measurement of environmental exposure in epidemiology: accounting for activity space related to daily mobility. Health Place. 2013;21:86–93. doi: 10.1016/j.healthplace.2013.01.005. [DOI] [PubMed] [Google Scholar]
- 23.United States Census Bureau. Geographic Areas - Definitions. American Community Survey Office. 2015. Available at: http://www.census.gov/acs/www/data_documentation/custom_tabulation_request_form/geo_def.php. Accessed December 18, 2015.
- 24.Krieger N. A century of census tracts: health & the body politic (1906–2006) J Urban Health. 2006;83(3):355–361. doi: 10.1007/s11524-006-9040-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Galster GC. What is neighbourhood?: an externality-space approach. Int J Urban Reg Res. 1986;10(2):243–263. doi: 10.1111/j.1468-2427.1986.tb00014.x. [DOI] [Google Scholar]
- 26.Galster GC. On the nature of neighbourhood. Urban Stud. 2001;38(12):2111–2124. doi: 10.1080/00420980120087072. [DOI] [Google Scholar]
- 27.Papageorgiou GJ. Spatial externalities i: theory. Ann Assoc Am Geogr. 1978;68(4):465–476. doi: 10.1111/j.1467-8306.1978.tb01210.x. [DOI] [Google Scholar]
- 28.Papageorgiou GJ. Spatial externalities ii: applications. Ann Assoc Am Geogr. 1978;68(4):477–492. doi: 10.1111/j.1467-8306.1978.tb01211.x. [DOI] [Google Scholar]
- 29.Goodchild MF, Anselin L, Appelbaum RP, Harthorn BH. Toward spatially integrated social science. Int Reg Sci Rev. 2000;23(2):139–159. doi: 10.1177/016001700761012701. [DOI] [Google Scholar]
- 30.Anselin L. Spatial externalities, spatial multipliers, and spatial econometrics. Int Reg Sci Rev. 2003;26(2):153–166. doi: 10.1177/0160017602250972. [DOI] [Google Scholar]
- 31.Tobler WR. A computer movie simulating urban growth in the Detroit region. Econ Geogr. 1970;46(S):234–240. doi: 10.2307/143141. [DOI] [Google Scholar]
- 32.Durlauf SN. Spillovers, stratification, and inequality. Eur Econ Rev. 1994;38(3–4):836–845. doi: 10.1016/0014-2921(94)90120-1. [DOI] [Google Scholar]
- 33.Morenoff JD, Sampson RJ. Violent crime and the spatial dynamics of neighborhood transition: Chicago, 1970–1990. Soc Forces. 1997;76(1):31–64. doi: 10.1093/sf/76.1.31. [DOI] [Google Scholar]
- 34.Sampson RJ, Morenoff JD, Earls F. Beyond social capital: spatial dynamics of collective efficacy for children. Am Sociol Rev. 1999;64(5):633–660. doi: 10.2307/2657367. [DOI] [Google Scholar]
- 35.Wong DWS. Measuring multiethnic spatial segregation. Urban Geogr. 1998;19(1):77–87. doi: 10.2747/0272-3638.19.1.77. [DOI] [Google Scholar]
- 36.Reardon SF, O’Sullivan D. Measures of spatial segregation. Sociol Methodol. 2004;34(1):121–162. doi: 10.1111/j.0081-1750.2004.00150.x. [DOI] [Google Scholar]
- 37.Feitosa FF, Câmara G, Monteiro AMV, Koschitzki T, Silva MPS. Global and local spatial indices of urban segregation. Int J Geogr Inf Sci. 2007;21(3):299–323. doi: 10.1080/13658810600911903. [DOI] [Google Scholar]
- 38.Wilson AG. A statistical theory of spatial distribution models. Transp Res. 1967;1(3):253–269. doi: 10.1016/0041-1647(67)90035-4. [DOI] [Google Scholar]
- 39.Hayes MC, Wilson AG. Spatial interaction. Socioecon Plann Sci. 1971;5(1):73–95. doi: 10.1016/0038-0121(71)90042-5. [DOI] [Google Scholar]
- 40.Curry L. A spatial analysis of gravity flows. Reg Stud. 1972;6(2):131–147. doi: 10.1080/09595237200185141. [DOI] [Google Scholar]
- 41.Wilson AG. Some new forms of spatial interaction model: a review. Transp Res. 1975;9(2–3):167–179. doi: 10.1016/0041-1647(75)90054-4. [DOI] [Google Scholar]
- 42.Griffith DA. Spatial structure and spatial interaction: 25 years later. Rev Reg Stud. 2007;37(1):28–38. [Google Scholar]
- 43.Besag J. Spatial interaction and the statistical analysis of lattice systems. J R Stat Soc B Methodol. 1974;36(2):192–236. [Google Scholar]
- 44.Besag J. Statistical analysis of non-lattice data. J R Stat Soc D Stat. 1975;24(3):179–195. [Google Scholar]
- 45.Griffith DA, Jones K. Explorations into the relationship between spatial structure and spatial interaction. Environ Plan A. 1980;12(2):187–201. doi: 10.1068/a120187. [DOI] [Google Scholar]
- 46.Fotheringham AS. Spatial structure and distance-decay parameters. Ann Assoc Am Geogr. 1981;71(3):425–436. [Google Scholar]
- 47.Fotheringham AS, Webber MJ. Spatial structure and the parameters of spatial interaction models. Geogr Anal. 1981;12(1):33–46. doi: 10.1111/j.1538-4632.1980.tb00016.x. [DOI] [Google Scholar]
- 48.Griffith DA. Geometry and spatial interaction. Ann Assoc Am Geogr. 1982;72(3):332–346. doi: 10.1111/j.1467-8306.1982.tb01829.x. [DOI] [Google Scholar]
- 49.Basta LA, Richmond TS, Wiebe DJ. Neighborhoods, daily activities, and measuring health risks experienced in urban environments. Soc Sci Med. 2010;71(11):1943–1950. doi: 10.1016/j.socscimed.2010.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zenk SN, Schulz AJ, Matthews SA, et al. Activity space environment and dietary and physical activity behaviors: a pilot study. Health Place. 2011;17(5):1150–1161. doi: 10.1016/j.healthplace.2011.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Jones M, Pebley AR. Redefining neighborhoods using common destinations: social characteristics of activity spaces and home census tracts compared. Demography. 2014;51(3):727–752. doi: 10.1007/s13524-014-0283-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Coulton CJ, Korbin J, Chan T, Su M. Mapping residents’ perceptions of neighborhood boundaries: a methodological note. Am J Community Psychol. 2001;29(2):371–383. doi: 10.1023/A:1010303419034. [DOI] [PubMed] [Google Scholar]
- 53.Colabianchi N, Coulton CJ, Hibbert JD, McClure SM, Ievers-Landis CE, Davis EM. Adolescent self-defined neighborhoods and activity spaces: spatial overlap and relations to physical activity and obesity. Health Place. 2014;27:22–29. doi: 10.1016/j.healthplace.2014.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kwan M-P. From place-based to people-based exposure measures. Soc Sci Med. 2009;69(9):1311–1313. doi: 10.1016/j.socscimed.2009.07.013. [DOI] [PubMed] [Google Scholar]
- 55.Matthews SA. The salience of neighborhood: some lessons from sociology. Am J Prev Med. 2008;34(3):257–259. doi: 10.1016/j.amepre.2007.12.001. [DOI] [PubMed] [Google Scholar]
- 56.Getis A, Ord JK. The analysis of spatial association by use of distance statistics. Geogr Anal. 1992;24(3):189–206. doi: 10.1111/j.1538-4632.1992.tb00261.x. [DOI] [Google Scholar]
- 57.Anselin L. Local indicators of spatial association–LISA. Geogr Anal. 1995;27(2):93–115. doi: 10.1111/j.1538-4632.1995.tb00338.x. [DOI] [Google Scholar]
- 58.Tukey JW. Exploratory data analysis. Reading, MA: Addison-Wesley; 1977.
- 59.Oka M. Measuring a neighborhood affluence-deprivation continuum in urban settings: descriptive findings from four US cities. Demogr Res. 2015;32(54):1469–1486. doi: 10.4054/DemRes.2015.32.54. [DOI] [Google Scholar]
- 60.Krieger N, Chen JT, Waterman PD, Soobader M-J, Subramanian SV, Carson R. Choosing area based socioeconomic measures to monitor social inequalities in low birth weight and childhood lead poisoning: the public health disparities geocoding project (US) J Epidemiol Community Health. 2003;57(3):186–199. doi: 10.1136/jech.57.3.186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Singh GK. Area deprivation and widening inequalities in US mortality, 1969–1998. Am J Public Health. 2003;93(7):1137–1143. doi: 10.2105/AJPH.93.7.1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Messer LC, Laraia BA, Kaufman JS, et al. The development of a standardized neighborhood deprivation index. J Urban Health. 2006;83(6):1041–1062. doi: 10.1007/s11524-006-9094-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Bivand R, Hauke J, Kossowski T. Computing the Jacobian in Gaussian spatial autoregressive models: an illustrated comparison of available methods. Geogr Anal. 2013;45(2):150–179. doi: 10.1111/gean.12008. [DOI] [Google Scholar]
- 64.Bivand R, Piras G. Comparing implementations of estimation methods for spatial econometrics. J Stat Softw. 2015;63(18):1–36. doi: 10.18637/jss.v063.i18. [DOI] [Google Scholar]
- 65.R Core Team. R: a language and environment for statistical computing (computer program). Version 3.2.2. Vienna, Austria: R Foundation for Statistical Computing; 2015.
- 66.Bates D, Maechler M, Bolker B, Walker S. lme4: linear mixed-effects models using ‘eigen’ and S4 (computer program); 2015.
- 67.Bates D, Maechler M, Bolker BM, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48. doi: 10.18637/jss.v067.i01. [DOI] [Google Scholar]
- 68.Carr DB, Pickle LW. Visualizing data patterns with micromaps. Boca Raton, FL: Chapman and Hall/CRC; 2010.
- 69.Anselin L. How (not) to lie with spatial statistics. Am J Prev Med. 2006;30(2S):S3–S6. doi: 10.1016/j.amepre.2005.09.015. [DOI] [PubMed] [Google Scholar]
- 70.Unwin A, Unwin D. Exploratory spatial data analysis with local statistics. J Roy Stat Soc D Stat. 1998;47(3):415–421. doi: 10.1111/1467-9884.00143. [DOI] [Google Scholar]
- 71.Anselin L. Interactive techniques and exploratory spatial data analysis. In: Longley PA, Goodchild MF, Maguire DJ, Rhind DW, eds. Geographical information systems: principles, techniques, management and applications. New York, NY: Wiley; 1999: 253–266.
- 72.Curry L. Univariate spatial forecasting. Econ Geogr. 1970;46(Suppl):241–258. doi: 10.2307/143142. [DOI] [Google Scholar]
- 73.Haining RP. The moving average model for spatial interaction. Trans Inst Br Geogr. 1978;3(2):202–225. doi: 10.2307/622202. [DOI] [Google Scholar]
- 74.Cressie N. Kriging nonstationary data. J Am Stat Assoc. 1986;81(395):625–634. doi: 10.1080/01621459.1986.10478315. [DOI] [Google Scholar]
- 75.Buja A, Hastie T, Tibshirani R. Linear smoothers and additive models. Ann Stat. 1989;17(2):453–510. doi: 10.1214/aos/1176347115. [DOI] [Google Scholar]
- 76.Cressie N, Chan NH. Spatial modeling of regional variables. J Am Stat Assoc. 1989;84(406):393–401. doi: 10.1080/01621459.1989.10478783. [DOI] [Google Scholar]
- 77.Cressie N, Read TR. Spatial data analysis of regional counts. Biom J. 1989;31(6):699–719. doi: 10.1002/bimj.4710310607. [DOI] [Google Scholar]
- 78.Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Stat Math. 1991;43(1):1–59. doi: 10.1007/BF00116466. [DOI] [Google Scholar]
- 79.Kafadar K. Choosing among two-dimensional smoothers in practice. Comput Stat Data Anal. 1994;18(4):419–439. doi: 10.1016/0167-9473(94)90160-0. [DOI] [Google Scholar]
- 80.Kafadar K. Smoothing geographical data, particularly rates of disease. Stat Med. 1996;15(23):2539–2560. doi: 10.1002/(SICI)1097-0258(19961215)15:23<2539::AID-SIM379>3.0.CO;2-B. [DOI] [PubMed] [Google Scholar]
- 81.Mugglin AS, Carlin BP, Zhu L, Conlon E. Bayesian areal interpolation, estimation, and smoothing: an inferential approach for geographic information systems. Environ Plan A. 1999;31(8):1337–1352. doi: 10.1068/a311337. [DOI] [Google Scholar]
- 82.Borrell C, Marí-Dell’Olmo M, Serral G, Martínez-Beneito M, Gotsens M, other MEDEA members Inequalities in mortality in small areas of eleven Spanish cities (the multicenter MEDEA project) Health Place. 2010;16(4):703–711. doi: 10.1016/j.healthplace.2010.03.002. [DOI] [PubMed] [Google Scholar]
- 83.Cano-Serral G, Azlor E, Rodríguez-Sanz M, et al. Socioeconomic inequalities in mortality in Barcelona: a study based on census tracts (MEDEA project) Health Place. 2009;15(1):186–192. doi: 10.1016/j.healthplace.2008.04.004. [DOI] [PubMed] [Google Scholar]
- 84.Gotsens M, Marí-Dell’Olmo M, Martínez-Beneito MÁ, et al. Socio-economic inequalities in mortality due to injuries in small areas of ten cities in Spain (MEDEA project) Accid Anal Prev. 2011;43(5):1802–1810. doi: 10.1016/j.aap.2011.04.013. [DOI] [PubMed] [Google Scholar]
- 85.Gotsens M, Marí-Dell’Olmo M, Pérez K, Palència L, Borrell C. Trends in socio-economic inequalities in injury mortality among men in small areas of 26 Spanish cities, 1996–2007. Accid Anal Prev. 2013;51:120–128. doi: 10.1016/j.aap.2012.10.020. [DOI] [PubMed] [Google Scholar]
- 86.Marí-Dell’Olmo M, Gotsens M, Borrell C, et al. Trends in socioeconomic inequalities in ischemic heart disease mortality in small areas of nine Spanish cities from 1996 to 2007 using smoothed ANOVA. J Urban Health. 2014;91(1):46–91. doi: 10.1007/s11524-013-9799-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Nolasco A, Melchor I, Pina JA, et al. Preventable avoidable mortality: evolution of socioeconomic inequalities in urban areas in Spain, 1996–2003. Health Place. 2009;15(3):732–741. doi: 10.1016/j.healthplace.2008.12.003. [DOI] [PubMed] [Google Scholar]
- 88.Getis A. Spatial interaction and spatial autocorrelation: a cross-product approach. Environ Plan A. 1991;23(9):1269–1277. doi: 10.1068/a231269. [DOI] [Google Scholar]
- 89.Griffith DA. The boundary value problem in spatial statistical analysis. J Reg Sci. 1983;23(3):377–387. doi: 10.1111/j.1467-9787.1983.tb00996.x. [DOI] [PubMed] [Google Scholar]
- 90.Griffith DA. Towards a theory of spatial statistics. Geogr Anal. 1980;12(4):325–339. doi: 10.1111/j.1538-4632.1980.tb00040.x. [DOI] [Google Scholar]
- 91.Griffith DA. Toward a theory of spatial statistics: another step forward. Geogr Anal. 1987;19(1):69–82. doi: 10.1111/j.1538-4632.1987.tb00114.x. [DOI] [Google Scholar]
- 92.LeSage JP. Regression analysis of spatial data. J Reg Anal Pol. 1997;27(2):83–94. [Google Scholar]
- 93.Martin RL. On spatial dependence, bias and the use of first spatial differences in regression analysis. Area. 1974;6(3):185–194. [Google Scholar]
- 94.Fahrmeir L, Kneib T, Lang S. Penalized structured additive regression for space-time data: a Bayesian perspective. Stat Sin. 2004;14:731–761. [Google Scholar]
- 95.Herman E. The American Community Survey: an introduction to the basics. Gov Inf Q. 2008;25(3):504–519. doi: 10.1016/j.giq.2007.08.006. [DOI] [Google Scholar]
- 96.Bazuin JT, Fraser JC. How the ACS gets it wrong: the story of the American Community Survey and a small, inner city neighborhood. Appl Geogr. 2013;45:292–302. doi: 10.1016/j.apgeog.2013.08.013. [DOI] [Google Scholar]
- 97.Wong DWS, Sun M. Handling data quality information of survey data in GIS: a case of using the American Community Survey data. Spat Demogr. 2013;1(1):3–16. doi: 10.1007/BF03354884. [DOI] [Google Scholar]
- 98.Grady SC. Racial disparities in low birthweight and the contribution of residential segregation: a multilevel analysis. Soc Sci Med. 2006;63(12):3013–3029. doi: 10.1016/j.socscimed.2006.08.017. [DOI] [PubMed] [Google Scholar]
- 99.Grady SC, McLafferty SL. Segregation, nativity, and health: reproductive health inequalities for immigrant and native-born black women in New York City. Urban Geogr. 2007;28(4):377–397. doi: 10.2747/0272-3638.28.4.377. [DOI] [Google Scholar]
- 100.Grady SC, Ramírez IJ. Mediating medical risk factors in the residential segregation and low birthweight relationship by race in New York City. Health Place. 2008;14(4):661–667. doi: 10.1016/j.healthplace.2007.10.011. [DOI] [PubMed] [Google Scholar]
- 101.Greer S, Casper M, Kramer M, et al. Racial residential segregation and stroke mortality in Atlanta. Ethn Dis. 2011;21(4):437–443. [PubMed] [Google Scholar]
- 102.White K, Borrell LN, Wong DW, Galea S, Ogedegbe G, Glymour MM. Racial/ethnic residential segregation and self-reported hypertension among US- and foreign-born blacks in New York City. Am J Hypertens. 2011;24(8):904–910. doi: 10.1038/ajh.2011.69. [DOI] [PubMed] [Google Scholar]
- 103.Wong DWS. Modeling local segregation: a spatial interaction approach. Geogr Environ Model. 2002;6(1):81–97. doi: 10.1080/13615930220127305. [DOI] [Google Scholar]
- 104.Wong DWS. A local multi-dimensional approach to evaluate changes in segregation. Urban Geogr. 2008;29(5):455–472. doi: 10.2747/0272-3638.29.5.455. [DOI] [Google Scholar]
- 105.Williams DR. Race, socioeconomic status, and health: the added effects of racism and discrimination. Ann N Y Acad Sci. 1999;896:173–188. doi: 10.1111/j.1749-6632.1999.tb08114.x. [DOI] [PubMed] [Google Scholar]
- 106.Williams DR, Collins C. Racial residential segregation: a fundamental cause of racial disparities in health. Public Health Rep. 2001;116(5):404–416. doi: 10.1016/S0033-3549(04)50068-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Williams DR, Jackson PB. Social sources of racial disparities in health. Health Aff (Millwood) 2005;24(2):325–334. doi: 10.1377/hlthaff.24.2.325. [DOI] [PubMed] [Google Scholar]
- 108.Williams DR, Wyatt R. Racial bias in health care and health: challenges and opportunities. JAMA. 2015;314(6):555–556. doi: 10.1001/jama.2015.9260. [DOI] [PubMed] [Google Scholar]
- 109.Musterd S. Social and ethnic segregation in Europe: levels, causes, and effects. J Urban Aff. 2005;27(3):331–348. doi: 10.1111/j.0735-2166.2005.00239.x. [DOI] [Google Scholar]
- 110.Oka M, Wong DWS. Capturing the two dimensions of residential segregation at the neighborhood level for health research. Front Public Health. 2014;2:118. doi: 10.3389/fpubh.2014.00118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Oka M, Wong DWS. Spatializing segregation measures: an approach to better depict social relationships. Cityscape. 2015;17(1):97–113. [Google Scholar]
- 112.Openshaw S. The Modifiable areal unit problem, vol. 38. Norwich, England: Geo Books; 1984.
- 113.Duncan C, Jones K, Moon G. Context, composition and heterogeneity: using multilevel models in health research. Soc Sci Med. 1998;46(1):97–117. doi: 10.1016/S0277-9536(97)00148-2. [DOI] [PubMed] [Google Scholar]
- 114.Diez Roux AV. Bringing context back into epidemiology: variables and fallacies in multilevel analysis. Am J Public Health. 1998;88(2):216–222. doi: 10.2105/AJPH.88.2.216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Diez Roux AV. Multilevel analysis in public health research. Annu Rev Public Health. 2000;21(1):171–192. doi: 10.1146/annurev.publhealth.21.1.171. [DOI] [PubMed] [Google Scholar]
- 116.Bingenheimer JB, Raudenbush SW. Statistical and substantive inferences in public health: issues in the application of multilevel models. Annu Rev Public Health. 2004;25(1):53–77. doi: 10.1146/annurev.publhealth.25.050503.153925. [DOI] [PubMed] [Google Scholar]
- 117.Anselin L. Spatial econometrics: methods and models. Boston, MA: Kluwer Academic Publisher; 1988.
- 118.Cliff AD, Ord JK. Spatial processes: models and applications. London, England: Pion Limited; 1981.
- 119.Elliott P, Wakefield JC, Best NG, Briggs DJ. Spatial epidemiology: methods and applications. New York, NY: Oxford University Press; 2001.
- 120.Griffith DA. Spatial autocorrelation and spatial filtering: gaining understanding through theory and scientific visualization. New York, NY: Springer-Verlag Berlin Heidelberg; 2003.
- 121.Schabenberger O, Gotway CA. Statistical methods for spatial data analysis. Boca Raton, FL: Chapman & Hall/CRC; 2005.
- 122.Krieger N. Epidemiology and the web of causation: has anyone seen the spider? Soc Sci Med. 1994;39(7):887–903. doi: 10.1016/0277-9536(94)90202-X. [DOI] [PubMed] [Google Scholar]



