Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Nov 29.
Published in final edited form as: Stat Med. 2019 Dec 10;39(5):544–561. doi: 10.1002/sim.8423

Robust estimation of the causal effect of time-varying neighborhood factors on health outcomes

Michael W Robbins 1, Beth Ann Griffin 2, Regina A Shih 3, Mary Ellen Slaughter 3
PMCID: PMC9706720  NIHMSID: NIHMS1846854  PMID: 31820833

Abstract

The fundamental difficulty of establishing causal relationships between an exposure and an outcome in observational data involves disentangling causality from confounding factors. This problem underlies much of neighborhoods research, which abounds with studies that consider associations between neighborhood characteristics and health outcomes in longitudinal data. Such analyses are confounded by selection issues; individuals with above average health outcomes (or associated characteristics) may self-select into advantaged neighborhoods. Techniques commonly used to assess causal inferences in observational longitudinal data, such as inverse probability of treatment weighting (IPTW), may be inappropriate in neighborhoods data due to unique characteristics of such data. We advance the IPTW toolkit by introducing a procedure based on a multivariate kernel density function which is more appropriate for neighborhoods data. The proposed weighting method is applied in conjunction with a marginal structural model. Our empirical analyses use longitudinal data from the Health and Retirement Study; our exposure of interest is an index of neighborhood socioeconomic status (NSES), and we examine its influence on cognitive function. Our findings illustrate the importance of the choice of method for IPTW—the comparison weighting methods provide poor balance across the set of covariates (which is not the case for our preferred procedure) and yield misleading results when applied in the outcomes models. The utility of the multivariate kernel is also validated via simulation. In addition, our findings emphasize the importance of IPTW—controlling for covariates within a regression without IPTW indicates that NSES affects cognition, whereas IPTW-weighted models fail to show a statistically significant effect.

Keywords: causality, doubly robust, inverse probability of treatment weighting, kernel density, longitudinal, neighborhood

1 |. INTRODUCTION

One of the most difficult problems in statistical inference involves determination of whether associations observed in non-experimental data between an exposure (eg, a neighborhood characteristic) and an outcome (eg, health status) have causal implications or if they are instead explained by confounding factors. Longitudinal data offer a plethora of advantages when approaching this problem; however, those advantages induce a wealth of methodological challenges. Consequentially, the field of causal inference has expanded markedly over the past 15 years, producing a rich mixture of state-of-the-art methods for handling longitudinal observational data including methods to handle continuous exposures.110 Available methods range from utilizing fully parametric approaches1114 to fully nonparametric,15 with a series on semiparametric methodologies as well.16,17 Many of these papers shed light on how to utilize inverse probability of treatment weighting (IPTW) with time-varying exposures. IPTW, which captures the likelihood of an individual experiencing a particular exposure history, is used to weight individuals so that all the observed characteristics that are included in the model for IPTW are no longer correlated with the exposure, thereby allowing for robust estimation of the marginal impact of the exposure on outcomes.

Methods for causal inference with observational data such as IPTW are particularly important within neighborhoods research, which constitutes a substantial arm of modern epidemiologic science. Although randomized experiments in neighborhoods studies have been performed (see, for example, the Moving to Opportunity study1820), the vast majority of studies of neighborhood impacts are observational due to difficulties randomizing individuals into neighborhoods. The goal in neighborhoods research is often to discern whether a neighborhood level-exposure, such as socioeconomic status2124 or immigrant enclaves,25 affects a health outcome, such as body mass index26,27 or depressive symptoms.2729 However, the literature is replete with debate about whether causal links can be established in neighborhoods research.3034 In particular, individuals might knowingly or unknowingly select into neighborhoods based on a number of confounding factors including family poverty,35 health conditions,3639 preferences for neighborhoods with greater walkability, healthy food access, etc,40 and safety considerations.4143 These factors create neighborhoods in which residents differ on a number of underlying characteristics which are associated with outcomes, including age, gender, race/ethnicity, and general health status.30,32,44,45 It is difficult to remove these imbalances when estimating associations between neighborhood factors and outcomes.

In our current study, we aim to assess if there is a causal link between neighborhood socioeconomic status (NSES) and cognition among a representative sample of older adults living in the United States. The public health and economic impact of cognitive impairment and dementia onset has been well-document in the US.46 Yet, evidence is limited and inconsistent regarding the effectiveness of interventions targeting individual risk factors to slow cognitive decline.47 As such, there is interest in gaining a better understanding of the contextual determinants of cognitive decline that may be a more fruitful for intervention for delaying the onset of cognitive impairment and dementia at a population level.48 Recent studies have examined the role of neighborhood and other geographic level factors, collectively referred to hereafter as context, on cognitive aging, with the vast majority of studies focusing on measures of community socioeconomic status and disadvantage (for a review, see the work of Wu et al49). It is of interest in this study to assess the potential role NSES might play on cognitive decline in older adults living in the United States (US) sample with a focus on how best to control for confounding effects. As noted above, it is pivotal that our analyses control for key confounders that involve the choice individuals in our sample make about which neighborhoods they want to live in.

In light of the challenges due to individual selection into neighborhoods, applications of IPTW in conjunction with marginal structural models (MSMs) in neighborhood research appear to be on the rise since such the methods are designed to handle confounding due to observed factors like neighborhood selection. Using MSMs with IPTW, causal links have been found between increased neighborhood poverty and increased alcohol consumption,50 increased mortality risk,51 and increased odds of being obese.52 Furthermore, there has also been work that found a causal link between neighborhood context and epigenetic aging.53 In addition, it has been shown that disadvantaged neighborhoods have a causal link to hindered verbal ability54 and lower graduation rates.55 The majority of these studies apply IPTW using MSMs on their outcomes following the methodology proposed by Robins and colleagues.15,56 In addition, doubly robust methodology,8,5759 which uses regression modeling and IPTW in tandem, has grown in popularity recently with some implementation in studies that investigate links between neighborhood factors and health outcomes.60,61

With the growing use of IPTW in neighborhoods research, it is critical to consider the special distributional characteristics of neighborhood-level data that may make the use of IPTW estimated via simple parametric models inappropriate. In particular, neighborhood-level exposures in longitudinal data tend to be static across most years of observation (ie, one’s neighborhood characteristics change meaningfully only if he or she relocates or moves). Traditional parametric approaches to estimate IPTW for longitudinal data are not built to handle such data characteristics. That is, the methods require that one evaluates residual densities for a conditional model that predicts the level of exposure at a given time point using prior values of the exposure. Commonly used Gaussian assumptions seem inappropriate since such models yield residuals with large masses near zero when applied to neighborhood-level exposures. Here, we propose a new method for estimating IPTW that is more appropriate for complex distributional structures seen in neighborhoods data. Specifically, we suggest a multivariate kernel density that circumvents the need for building conditional models of the exposure at one time period on the exposure at prior time periods.

In Section 2, we outline the outcomes models that will be used in our analyses as well as the candidate IPTW estimation methods that we will compare. Specifically, we consider an MSM that is a function of the cumulative value of the neighborhood exposure as suggested by Robins,1,56 as well as doubly robust versions of this model. In addition, we outline our proposed method of estimating IPTW in Section 2, as well as five other methods that form a basis for comparison. We aim to compare the performance of the different estimation methods for the IPTW in terms of ability to balance across the set of observed covariates as well as inferences yielded in analyses involving outcomes. In Section 3, we present the findings from our motivating study using the proposed outcome models and IPTW estimators in an effort to estimate the effect of NSES on cognitive function among older adults living in the United States. Both the data application and simulations indicate that weights calculated via the multivariate kernel density estimator outperform all other methods considered. Section 4 presents findings from a detailed simulation study whose structure was inspired by our motivating case study. Lastly, discussion is provided in Section 5, and remnant content is given in supplementary materials.

2 |. STATISTICAL METHODS

Here, we describe the statistical methodology that will be used to draw causal inferences regarding the effect of neighborhood on cognitive function among older adults living in the US. We begin with some notation. Assume that we have data for n individuals at T time points (T = 6 in our data). Let Z0,i denote a set of observed baseline covariates (eg, race, gender), also referred to as confounders, for individual i that are not time varying, and let Zt,i denote a set of time-varying covariates or confounders (eg, stroke, blood pressure, diabetes) for t = 1, …, T. Furthermore, let Xt,i denote the value of the exposure variable (here NSES), and let Yt,i denote the value of the outcome variable (here cognitive function) for individual i at time t for t = 1, …, T. Let X¯t,i=X1,i,,Xt,i and Z¯t,i=Z0,i,Z1,i,,Zt,i denote the exposure and covariate histories up to time t. Thus, in our analysis, X¯t,i represents the individual’s neighborhood history of NSES values up to time t.

For our approach, we utilize an IPTW potential outcomes framework.62,63 To elaborate, let Yix¯T,i=Y1,ix¯T,i,,YT,ix¯T,i denote the potential value of the outcome trajectory that would have been observed at time t if individual i’s neighborhood exposure history had been x¯T,i. The causal effect of experiencing an exposure history of x¯T,i in comparison to x¯T,i is defined as the difference in expected potential outcomes, which is given by

EYix¯T,iYix¯T,i=EYix¯T,iEYix¯T,i.

Thus, in our example, this is the expected difference in two potential outcomes for an individual had he/she had two different histories in neighborhood exposures. This quantity is not observed directly for all possible exposure histories (since the same individual does not experience multiple treatment histories) and cannot be estimated using conditional models that control for covariates. Moreover, the true causal effects of interest might vary depending on the given interests of a study. Nonetheless, models that do not control for confounding covariates that predict the neighborhood exposure level and the outcome of interest are subject to confounding bias when trying to estimate the needed expected values. Therefore, it is prudent to reframe the analysis within a pseudo-population wherein the exposure is no longer confounded by the observed covariates—therein, the association between the outcome and exposure can be assessed marginally using marginal structural models (MSMs) as described in Section 2.1. IPTW is used to generate this pseudo-population. Weights are determined by calculating each individual’s likelihood of observing the treatment history that the individual observed given his or her observed confounding covariate values—residual densities are used to assess these likelihoods. Then, weighting based upon these densities balances the exposure history groups with respect to the observed covariate confounders by giving more weight to individuals with treatment histories that are underrepresented in the pseudo-population and less weight to exposure or neighborhood histories that are overrepresented. In the end, the IPTW serve to create comparable exposure history groups across the observed set of confounders that are used in the IPT model.

There are four assumptions that are required for robust inference from an MSM plus IPTW approach: consistency, exchangeability, positivity, and no misspecification of the model used to estimate weights. Consistency requires that a subject’s potential outcome under the exposure received (here, their observed neighborhood history) is equal to the subject’s observed outcome. That is, Yt,i=Yt,iX¯T,i. Exchangeability requires that there are no unobserved confounders that have not been included in the covariates used to estimate weights. In other terms, this implies

Yt,iX¯T,iX¯t,iX¯t1,i,Z¯t,i,

where ⟂ indicates independence and thus that no important confounders were excluded from the IPT model when estimating the weights. Positivity requires that there is a nonzero probability of everyone in the sample receiving every level of the exposure. Consider, for example, that if a given level of the exposure were only observable among women, we could not consistently estimate the exposure effects among men. For the most part, the first two assumptions are generally determined as appropriate depending on the applied example at hand. The last assumption (also called the overlap assumption) is generally assessed using descriptive statistics to determine if there are any areas of the covariate distribution that might not be fully observed for all levels of the exposure (see the supplemental materials). Finally, the unobserved confounding assumption is typically assessed using sensitivity analyses (see, for example, other works6467).

Once the weights have been calculated, the causal parameter of interest from the pseudo-population may be extracted using a weighted regression model. When this regression does not control for covariates, it is an MSM in the vein of Robins.1,56 However, doubly robust57 versions of such a model can be employed by conditioning on covariates in addition to weighting the regression.

2.1 |. Calculation of weights for IPTW

Since IPTW with neighborhood data is complicated by the unique characteristics of such data (and therefore capturing the requisite densities is nontrivial), we consider a variety of methods for estimating the weights. We begin by outlining the traditional method for IPTW with longitudinal data and continuous exposures that was originally proposed by Robins1,56 (this is manner in which IPTW is commonly applied within neighborhood research). We then discuss procedures which provide advancements, culminating in our preferred approach for IPTW when using neighborhood-level exposures, which is based on a multivariate kernel density.

In accordance with Robins,1,56 IPTW is performed by first calculating a weight for individual i at time t for t = 1, …, T:

wt,i=fXt,iX¯t1,ifXt,iX¯t1,i,Z¯t,i, (1)

where f(·) denotes a density function. The numerator term fXt,iX¯t1,i acts as a stabilizer. For analysis, one uses the cumulative product of weights

wi=t=1Twt,i, (2)

which is the weight assigned to individual i for all time points.

An obstacle for implementation of IPTW is the fact that the true densities listed in the numerator and denominator of (1) are unknown. To approximate these densities, one could fit an appropriate density function to the residuals from the corresponding conditional models. However, the conditional densities are difficult to model in neighborhoods data. A neighborhood-level exposure (in our case, NSES) will be relatively stable over time for most study participants; however, when there is a change, it tends to be larger—this leads to residuals that have heavy tails. Therefore, we consider four methods of calculating weights for IPTW by modeling the residual densities needed in the calculation of wt,i.

Let rt,i1 denote the residuals from a linear regression of Xt,i on X1,i, …, Xt−1,i and an intercept term. Likewise, let rt,i2 denote the residuals from a linear regression of Xt,i on X1,i, …, Xt−1,i and Z¯t,i. If the density of rt,i(k), which we represent by ft(k)() for j = 1, 2, were known, one could set wt,i=ft(1)rt,i(1)/ft(2)rt,i(2). We approximate ft(k)() for t = 1, …, T and k = 1, 2 via the following methods:

  1. Univariate Normalft(k)() is represented by a normal density with mean zero and variance set to the sample variance of rt,i(k).

  2. Student’s tft(k)() is represented by a Student’s t distribution with 3 degrees of freedom. This mandates that rt,i(k) first be scaled to have a variance of 3 prior to calculating the weights. We select 3 degrees of freedom because it is the smallest integer value that yields a density with finite variance. Finite variance is needed because we have to scale the residuals to have the same variance as the density that is being fit to them.

  3. Mixture – We set ft(k)()=ρgt(k)()+(1ρ)ht(k)() for some mixing probability ρ ∈ (0, 1) where gt(k)() and ht(k)() are distinct normal densities. The parameters (ρ and the means and variances of the normal densities) are estimated by modeling rt,i(k); we use the mixtools package in R for estimation.68

  4. Univariate Kernel – We estimate ft(k)rt,i(k) by calculating

f^t(k)rt,i(k)=1nhj=1nKrt,i(k)rt,j(k)h, (3)

where K(·) is a kernel function and h is a bandwidth parameter. We use a standard normal density as the kernel function.

Distribution families not included above have been considered in the literature. For example, Do et al51 model an exposure that is bounded between 0 and 1 using a beta distribution which aims to more appropriately model the distribution of neighborhood-level exposures in a similar vain to our proposed work. However, the beta distribution does not work in this setting since we considering exposures that are unbounded.

Any inaccuracies in the estimation of the conditional densities will be exacerbated through the use of the product of weights that is seen in (2). Therefore, we propose a method for IPTW that circumvents the conditional models in (1). Note that wi can be rewritten as a ratio of joint densities:

wi=fX¯T,ifX¯T,iZ¯T,i, (4)

which follows from the fact that joint densities may be expressed as a product of a sequence of conditional densities. This relationship assumes that the exposure at a given time is conditionally independent of future values. Therefore, we propose to calculate weights for IPTW by directly targeting the joint densities seen in (4). This should, in theory, help us bypass estimation problems that are encountered when building weights using residuals from a model that predicts the current exposure level from the prior year’s exposure.

Let ri(1) denote the (multivariate) residuals from a linear regression of X¯T,i on only an intercept term, and let ri(2) denote the (multivariate) residuals from a linear regression of X¯T,i on Z¯T,i. We let f (k)(·) denote the multivariate density function of ri(k) for k = 1, 2– if f (k)(·) were known, we would set wi=f(1)ri(1)/f(2)ri(2). We outline two procedures for approximating the joint densities and compare their performance to the four methods list above.

  1. Multivariate Normalf (k)(·) is represented by a multivariate normal density with mean zero and variance set to the sample covariance matrix of rt,i(k).

  2. Multivariate Kernel – We estimate f (k)(·) via a multivariate kernel density function, which is denoted here as f^(k)().

Specifically, we set

f^(k)(r)=1ni=1nKHrri(k) (5)

for any length-T vector r, where H is a T × T matrix of bandwidth parameters, KH(r)=|H|1/2KH1/2r, and K(·) is a symmetric multivariate kernel function. It is common to let KH(·) be represented by a multivariate normal density with a covariance matrix equivalent to the identity matrix. Lastly, the quantity in (4) is approximated with

w^i=f^(1)ri(1)f^(2)ri(2).

In the empirical analyses provided in subsequent sections, the bandwidth parameter is calculated through Silverman’s rule of thumb.69 When the multivariate kernel density is used, we set H=0.1Σ^, where Σ^ is the sample correlation matrix of the appropriate multivariate residuals. Other more rigorous options were considered,7072 but these did not prove to be as effective at maintaining balance across the set of covariates. We use only Gaussian kernels.

To avoid distorting the underlying properties of the weights, we do not trim outlying weight values. In addition, note that weights calculated in accordance with (4) model an exposure on future values of the covariates. This functionality is not incorporated within (1). We argue that this facet of our methods enhances their robustness since weights calculated in the proposed manner will remove all associations between the exposure and the confounders. However, if needed, estimation of the weights in (4) can be performed in a manner that does not model current exposure on future covariate values.

2.1.1 |. Balance assessment

Imperative to all methods that invoke IPTW is the assumption that the weights obtain balance across the observed covariate set with respect to the exposure. Balance assessment with continuous treatment is more nuanced than with binary treatment. We follow guidance put forth by Zhu et al,17 wherein it is suggested to use sample correlations to assess balance. That is, if calculated correctly, the IPTW weights outlined here can be used to remove any correlation between the exposure/neighborhood history, X¯T,i, and the set of covariates, Z¯T,i.

2.1.2 |. Adjustments for survey weighting

Survey weights are often attached to microlevel data. For instance, the data may have been collected under a design that is not simple random sampling, and data are usually subject to nonresponse and/or attrition; survey weights inherently account for these complexities. In that vein, the Health and Retirement Survey (HRS) data set studied in Section 3 is subject to both nonresponse and design adjustments. The issue of performing propensity-score analysis (or IPTW analogously) using survey-weighted data has received attention in the literature. Ridgeway et al73 and Griffin and Robbins74 argue that when applying IPTW, survey weights should be included in the estimation of the weights for IPTW as well as in the estimation of the outcomes models. The latter is accomplished by using a final weight in outcomes analyses that is the product of the survey weight and the weight for IPTW.

Survey weights factor into the process of calculating IPTW weights as follows. Weighted least squares (with the survey weights) is used to approximate the residuals (ie, rt,i(1), rt,i(2), ri(1), and ri(2)). Any parameters (eg, variances) that are calculated from the estimated residuals are calculated using weighted estimation techniques. Furthermore, the kernel density estimators of (3) and (5) are weighted; for example, we calculate f^(k)(r)=i=1nviKHrri(k)/i=1nvi, where {vi} are the nonresponse weights.

2.2 |. Outcome models

As an alternative to structural nested models and related g-estimation,7578 Robins1,56 presents MSMs for the potential outcome Yt,iX¯t,i. To facilitate causal inferences with MSMs, he introduces IPTW in that context to remove the effects of the observed confounding covariates on the exposure effect estimates. Specifically, he considers MSMs of the form

EYt,iX¯t,i=gX¯t,i;β, (6)

for some function g(·) and parameters β. This model is marginal because it examines the effect of the exposure histories (like our neighborhood histories) on the marginal distributions of the potential outcomes. Likewise, the term structural is invoked since counterfactual random variates are modeled.

Robins1 and Hernán et al5 refine (6) by suggesting that the function g(·) depend primarily upon the cumulative exposure cumX¯t,i=j=1tXj,i. We refine this model further in manner that is more appropriate for the neighborhood data in our study. Specifically, we consider

Yt,i=β0+β0,i+β1t+β2t2+αcumX¯t,i+Ut,i, (7)

for i = 1, …, n and t = 1, …, T, where β0,i is an intercept random effect for individual i and where Ut,i is a mean zero error term. Here, 𝛼 is the causal parameter of interest and represents the average effect of a one-unit increase in NSES on cognition at the next time point. The sequence {Ut,i}t,i ≥ 1 is independent across t and i and is independent of the β0,i. We orient this model in the paradigm of potential outcomes through the use of IPTW to weight the regression. That is, IPTW removes the correlation between the observed confounding covariates Z¯T,i and any function of the neighborhood exposure X¯T,i.17 Ergo, IPTW is applicable for the general class of MSMs seen in (6), which includes (7). Recall that the average causal effect can be expressed is EYt,iX¯t,iEYt,iX¯t,i, which under the MSM in (7) is αcumX¯t,i; therefore, α has a causal interpretation.

IPTW has been applied previously in neighborhoods research,5055 and such applications tend to use a MSM that invokes cumulative sum (or moving average) of the exposure as in (7). Note that several authors, eg, Hernán et al5 and Cerdá et al,50 consider MSMs that model Yt,iX¯t,i as a function of cumX¯t1,i; this is to impose the assumption that the exposure does not have an instantaneous effect on the outcome. However, in the setting of our application, we feel it is reasonable to assume that current exposure levels can affect the outcome. (At the point when outcome measurements are taken, an individual has probably established him or herself in his or her respective neighborhood of a nonnegligible length of time.)

We also consider a version of (7) that controls for observed covariates within the regression. That is,

Yt,i=β0+β0,i+β1t+β2t2+α1cumX¯t,i+γ1Z0,i+γ2Zt,i+Ut,i. (8)

We apply the following outcome models in the ensuing sections:

  • Model 1: A weighted version of the MSM (7) where covariates are not included and IPTW weights are used.

  • Model 2: A weighted version of (8) where covariates are included and IPTW weights are used.

Note that Model 2 is considered doubly robust57 because of the use of weights together with regression adjustment. As Robins and Rotnitzky79 point out, so long as only one of the models, either that for the conditional mean of the outcome given covariates (the regression model) or that for the treatment indicator given covariates (the IPTW model) is correctly specified, the resulting estimator will be consistent. Robins and Rotnitzky79 provide the needed proof of doubly robustness by showing that combining such IPTW with regression adjustment has consistency and demonstrated under the assumption that a parametric model applies to either the IPTW or the regression function, but not necessarily to both. Here, we use parametric modeling in our outcome model.

In the supplemental materials, we consider a separate outcome model that is in the vein of the disaggregation model presented by Jokela34—this model is designed to disentangle between- and within-person effects of a neighborhood-level exposure. Therein, we provide results for IPTW-weighted versions of this model.

3 |. THE EFFECT OF NSES ON COGNITION

3.1 |. Study sample

To assess the influence of neighborhood on cognitive function, we use data from the Health and Retirement Study (HRS), which is a nationally representative, multicohort longitudinal panel of US adults aged 51 and older. The HRS provides one of the most comprehensive data sources on health in older Americans, with detailed data on health, residential status, and confounders such as economic and social well-being; further details are found in the work of Hauser and Willis.80 Although some HRS data are publicly available (https://hrs.isr.umich.edu/data-products), the specific data used for this study are confidential.

HRS data are collected biennially. To focus on one decade of address data geocoded using the same 2000-based Census definitions for census tracts, we isolate to the six waves of survey data collected from 2000 to 2010. We examined the largest HRS cohort of individuals born between 1931 and 1941; as such, we avoid modeling NSES and covariate effects differently by cohort.

HRS data are subject to attrition (eg, only 58% of all individuals in our cohort are measured at each survey wave). Since our methodology mandates that only individuals with data observed at all time points for pertinent variables are used, attrition is handled by deleting cases with missing follow-up and using nonresponse weighting to account for nonresponse bias. The final cohort used in analyses contains 4253 individuals. Authors that have studied attrition in the HRS conclude that although there is some evidence of differential patterns of attrition across certain characteristics, differences are not particularly strong and can be captured by nonresponse weights.81,82 Nonresponse weights corresponding to 2010 are factored into all analyses presented here, which ensures that our cohort is representative of the baseline sample (see the work of Heeringa and Connor83 for further details on weighting in the HRS).

3.2 |. Measures

Outcome.

To assess cognitive function, we use a modified version of the telephone interview for cognitive status (TICS) (see the work of Crooks et al84 for discussion of this instrument). We use the 27-point scale TICS that consisted of three tasks: an immediate and delayed word recall of a list of 10 nouns, a Serial Sevens subtraction test from 100, and a backwards count from 20 for 10 continuous numbers. Higher TICS means better cognitive status. For respondents who are unable to participate in the study, cognitive function is assessed through a series of questions to proxy respondents; however, only self-respondent data are included in the cognitive function score provided in the HRS and therefore utilized here.

Neighborhood socioeconomic status.

Neighborhood advantage is measured with an index of socioeconomic status that is a composite of tract-level median household income, percentage of female-headed households, percentage of adult workers who are unemployed, percentage of households below the poverty line, percentage of persons with a high school degree, and percentage of persons with a Bachelor’s degree.85 This index is referred to as NSES and serves as our primary exposure (ie, treatment) variable.

Individual-level covariates.

Our analysis controls for several individual-level variables that could confound neighborhood associations. As noted below, our IPTW procedure is designed to balance our cohort on several covariates that may confound the association between cognitive function and NSES. Therefore, we include a number of individual-level covariates in our IPTW model including time invariant covariates such as gender, race/ethnicity, education, and age at baseline. Furthermore, we include time-varying values of sociodemographics including household wealth and retirement status, health behaviors including physical activity and alcohol use, and the presence of health conditions such as stroke, heart problems, high blood pressure, diabetes, chronic conditions, and depressive symptoms.

3.3 |. Descriptive statistics of the data

Table 1 provides descriptive statistics of covariates when calculated across the cohort used in analysis. It is also prudent to consider relevant characteristics of our cognitive outcome (TICS) and its unadjusted relationship with our exposure variable of interest (NSES) given the concerns we have about how this relationship might be biased by observed confounders in the HRS data set. First, there is a steady decline in cognitive function over time due to aging. The average TICS score across our analytic cohort in 2000 was 16.7; in 2010, the average score declined to 14.8. Figure 1 shows scatter plots of TICS vs NSES for each year from 2000 to 2010. It highlights that within each year there is an unadjusted positive association between TICS and NSES (that is relatively stable over time), suggesting that people living in higher NSES neighborhoods tend to have higher mean cognitive function. The correlation between these two variables is 0.269 in 2000 and is 0.274 in 2010. Note that the decline in mean TICS is small in comparison to the variability inherent in the outcome.

TABLE 1.

Characteristics of the Health and Retirement Study cohort (n = 4253) at baseline (2000) and the final follow-up (2010). ns are not weighted; percentages and means (with standard errors) are weighted to account for sample design and attrition

Baseline (2000) Last follow-up (2010)
n/Mean %/SE n/Mean %/SE
SOCIODEMOGRAPHICSs
Age (baseline)
< 62 Years 1546 35.2%
62–64 Years 1234 28.2%
65–67 Years 1038 25.4%
68+ Years 435 11.2%
Years (mean/se) 63.15 0.06
Gender
Male 1698 42.3%
Female 2555 57.8%
Race
African-American 587 7.2%
Hispanic 347 5.0%
Other 3319 87.8%
Education
0–11 Years 916 19.0%
12 Years 1589 37.7%
13–16 Years 1265 30.9%
17+ Years 483 12.5%
Years of education (mean/se) 12.8 0.05
Household wealth
Quintile 1 (poorest) 609 13.7% 477 10.5%
Quintile 2 771 16.2% 710 15.4%
Quintile 3 880 19.9% 881 19.7%
Quintile 4 950 23.6% 1007 24.1%
Quintile 5 (wealthiest) 1043 26.7% 1178 30.3%
Retired
No 2874 66.0% 1214 26.8%
Yes 1379 34.0% 3039 73.2%
HEALTH BEHAVIORS
Vigorous physical activity
No 2099 50.2% 3279 77.2%
Yes 2154 49.8% 974 22.9%
Alcohol use (no. of days/week with drinks)
0 Days 2909 66.0% 2822 64.5%
1–4 Days 904 22.2% 930 22.3%
5+ Days 440 11.9% 501 13.3%
HEALTH CONDITIONS
Stroke 137 3.7% 411 10.2%
Heart problems 579 15.6% 1341 33.6%
High blood pressure 1837 42.6% 2952 68.4%
Diabetes 510 12.5% 1156 27.0%
Chronic conditions (mean/se) 1.52 0.02 2.65 0.03
Depressive symptoms (CES-D ≥ 3)a 766 18.8% 786 19.5%
a

Depressive symptoms were measured using a Center for Epidemiologic Studies-Depression Scale score of 3 or greater.

FIGURE 1.

FIGURE 1

Scatter plots of telephone interview for cognitive status (TICS) versus neighborhood socioeconomic status (NSES) in 2000 (far left) through 2010 (far right) with best fitting lines in red [Colour figure can be viewed at wileyonlinelibrary.com]

Our doubly robust analysis is designed to assess whether this relationship continues to be maintained after we adjusted for confounding using IPTW and regression.

3.4 |. Assessing the quality of IPTW

Each of the six methods described in Section 2.1 are used to calculate weights for IPTW with the analytic HRS cohort. First, note that (as shown in Table A.10 of the supplemental materials) our cohort contains imbalance across the observed covariates considered here with respect to the exposure (NSES). For example, we see imbalance across race and wealth. However, the imbalance is not so strong as to suggest violations of the positivity assumption; that is, there is some degree of overlap across the categorizations of NSES with the various covariate categories.

Histograms of residuals from the sequence of conditional models may be used to assess the adequacy of the various methods of fitting univariate densities. As such, Figure 2 shows histograms of the studentized residuals from the conditional model for the NSES in 2010 when regressed on only the prior values of NSES (see the numerator model in the left panel) and then on the prior values of NSES along with the covariates (see the denominator model in the right panel). The fitted density curves from each of the four methods of univariate estimation are overlaid on both histograms. The supplementary materials contain analogous histograms that correspond to the other years. The histograms indicate that, as expected, the residuals have a large density near zero along with long tails (and excess kurtosis as a result). However, the mixture and univariate kernel methods appear to do the best job of capturing these characteristics.

FIGURE 2.

FIGURE 2

Histograms of the studentized residuals from the conditional models of the numerator and denominator seen in (1) for 2010 with density curves estimated from various methods [Colour figure can be viewed at wileyonlinelibrary.com]

Additionally, some brief descriptives of the weights calculated via all of the various methods for IPTW are provided in Table 2. Each set of weights is scaled to sum to the sample size prior to calculation of the descriptors. In the Table, the design effect, as approximated using the Kish approximation86 (calculated as Kish DEFF =nwi2/wi2), is listed for each set of weights. Note that a design effect quantifies the inflation in variance that results from weighting. The mixture and univariate kernel weights produce exorbitant design effects. The multivariate kernel yields the lowest design effect among the IPTW approaches.

TABLE 2.

Summary statistics of the various types of inverse probability of treatment weighting (IPTW) weights. All weights are scaled to sum to the sample size. Kish DEFF is the Kish approximation of the design effect

IPTW
Nonresp. Weights Univariate Normal t-dist. Mixture Uni. Kernel Multi. Normal Multi. Kernel
Mean 1.00 1.00 1.00 1.00 1.00 1.00 1.00
SD 0.52 8.22 3.90 39.97 45.69 3.93 1.84
Min 0.12 0.00 0.01 0.00 0.00 0.00 0.00
Max 5.20 334.42 159.90 2542.39 2951.76 199.36 57.75
DEFF 1.27 68.62 16.21 1598.03 2087.71 16.41 4.37

To assess whether IPTW provides balance for each estimation method considered, we calculate weighted correlations of NSES with covariates, which is in line with the suggestions of Zhu et al.17 Specifically, we calculate the sample correlation between the covariates and NSES with and without IPTW using the stacked data set (which has nT rows). If the weights are calculated correctly, the sample correlations will be close to zero. These correlations are shown in graphically in Figure 3, wherein box plots of the absolute value of the correlations when calculated using the various forms of IPTW are given. The correlations in Figure 3 are tabulated in the supplementary materials.

FIGURE 3.

FIGURE 3

Boxplots of absolute correlations with neighborhood socioeconomic status and covariates before and after applying inverse probability of treatment weighting (IPTW), where various methods are used for IPTW

Pre-IPTW weights are calculated using only nonresponse weights; moderate correlations with NSES existed prior to IPTW. For instance, NSES appears to be most heavily correlated with education (pre-IPTW correlation with NSES = 0.328), African American race (−0.313), and highest income category (0.312). IPTW with a multivariate kernel density appears to most effectively reduce the bulk of these correlations; therein, all weighted correlations are less than 0.129. The mixture and univariate kernel weights yield increased correlations; this may be due, in part, to the large design effects given by these weights. Note that balance assessment results are very similar when correlations are calculated using NSES-based terms that appear in our outcome models, (ie, cumX¯t,i).

3.5 |. Assessing the effect of NSES on cognitive function

We estimated the outcome models described earlier to determine the causal effects of NSES on cognitive function (TICS). We consider the two models outlined in Section 2.2: Model 1 (uses IPTW without controlling for covariates), and Model 2 (uses IPTW and controls for covariates). We also examine a version of Model 2 that does not use IPTW (but does use nonresponse weights)—an analogous version of Model 1 is not included since such a model would do nothing to account for confounding. Note that nonresponse weights factor into the calculation of IPTW weights as described in Section 2.1.2. That is, models used to calculate the IPTW weights are weighted via the nonresponse weights, and the IPTW-weighted models use the product of the nonresponse weights and the resulting IPTW weights as final weights. Results for the coefficients on cumulative NSES term are shown graphically in Figure 4 for each of the six methods of calculating IPTW. Complete results are provided in the supplementary materials for each method of IPTW.

FIGURE 4.

FIGURE 4

Coefficients for cumulative neighborhood socioeconomic status (with 95% confidence bands) for the six methods of inverse probability of treatment weighting (IPTW) and two outcome models

In the no-IPTW version of Model 2, we see a statistically significant positive effect cumulative NSES (regression coefficient = 0.049 and 95% CI = [0.024, 0.073]). This may suggest that living in more advantaged neighborhoods is beneficial for cognitive function. However, when IPTW with a multivariate kernel density is used (in either Model 1 or 2), this effect is reduced in magnitude and is no longer statistically significant (regression coefficient = 0.031 and 95% CI = [−.011, 0.072] for Model 2). The lack of statistical significance may be partly due to an increase in standard error that is caused by IPTW (ie, the weights induce a design effect).

Our assessments of the quality of the weights (which include balance assessments and simulations shown in the following section) indicated that the multivariate kernel weights perform the best. Therefore, the outcomes models for the other weights should not be used for inference and are provided only for the purposes of comparison. That said, Figure 4 illustrates that IPTW with poor weights can provide profoundly misleading results. Specifically, both Models 1 and 2 indicate a statistically significant negative effect of NSES on cognitive function when univariate normal weights are used. Additionally, mixture distribution weights indicate a strong positive effect in Models 1 and 2 that is more than six times as large as the effect seen in the version of Model 2 that excludes IPTW. We see an estimated effect that is nearly as large when the univariate kernel weights are used. Note that some of these large effects are not statistically significant due to the design effect induced by the respective kernel weights. Furthermore, the multivariate normal weights show a negative effect of NSES that is statistically significant in Model 1. In results presented here, weights found using a t-distribution provide findings that are fairly in line with those yielded by multivariate kernel weights. However, when the disaggregation model is considered in the supplemental materials, t-distribution weights yield potentially spurious findings.

To provide intuition behind the magnitude of the estimated effect of cumulative NSES on cognitive function, we present effect sizes as follows. We calculate the difference in the expected TICS in 2010 for an individual in the 10th percentile of cumulative NSES (in 2010) with the corresponding expected outcome for someone in the 90th percentile of NSES. This difference is divided by the standard deviation across our cohort of TICS in 2010 to yield the approximated effect size. In Model 1 with the multivariate kernel weights, we see a small effect size of 0.095. However, for other types of weights, this value is more substantial (eg, −0.564 for the univariate normal weights and 0.923 for the mixture weights).

It is important to revisit the key assumptions of IPTW analyses with MSMs. Diagnostics shown in the supplementary materials are consistent with the assumption of positivity holding among our cohort. A key limitation to our analyses and any analysis using IPTW is the assumption of no unobserved confounding. While our models control for a large set of key factors related to cognition and NSES (eg, age and health comorbidities), it is possible that our IPTW weights are missing a key factor. The best way to address the potential impacts of an unobserved confounder is through the use of sensitivity analyses. Typically, these sensitivity analyses aim to understand how strong an unobserved confounder would have to be to wipe away a significant finding.66,8789 In our case, we have no evidence of a significant impact of NSES on cognition. Thus, we would aim to understand how strong the relationship of an unobserved confounder would have to be to move our estimated effect into being a statistically significant finding. Such efforts are nontrivial in the continuous exposure setting and with repeated measures data. Future research should aim to develop such methods that allow for easier testing of the potential role unobserved confounding might have on these analyses.

4 |. SIMULATIONS

In this section, we present a simulation study that is used to evaluate the effectiveness of the methods proposed herein. We attempt to simulate data in the vein of the neighborhood data studied in Section 3. Specifically, the simulated values of a data unit’s exposure are the same as seen in the prior time period unless that individual was determined to have “moved.”

For each iteration of this simulation study, we generate synthetic versions of the exposure (NSES) and the outcome (TICS) as functions of the real values of a time invariant covariate and a time varying covariate from the HRS data. As such, our sample size (n = 4, 253) and number of time periods (T = 6) are the same as in our HRS cohort. For the time invariant covariate, which is denoted by Z0,i for individual i, we use years of education. For the time-varying covariate (denoted by Zt,i for individual i), we use household wealth (since household wealth has five categories, Zt,i is four-dimensional after dropping a reference category).

The baseline value of synthetic NSES is generated using

X1,i=0.15+λ1Z0,i+λZ1,i+0.50V1,i. (9)

In (9), we set λ1 = 0.17 and λ = (−0.11, −0.83, 0.07, 0.11)′, and Vt,i for t = 1, …, T is a t-distributed random variable with 3 degrees of freedom that has been rescaled to have unit variance. Then, for t > 1, we generate a binary random variable that indicates if individual i moves neighborhoods between time t − 1 and t. That is, a move occurs if Ri,t = 1, where P(Ri,t = 1) = 𝜋i,t with

logπi,t/1πi,t=ϕ+0.18Z0,i+κZt,i+0.09j=1t1(1/2)tj1Xj,i. (10)

We use κ = (−0.08, −0.06, 0.05, 0.08)′ and ϕ = −2.6 in (10). Then, we generate the time varying exposure using

Xt,i=Xt,i=0.15+λ1Z0,i+λZt,i+0.3j=1t1(1/2)tj1Xj,i+0.50Vt,i,if Ri,t=1,Xt,i=Xt1,i+0.02Vt,iotherwise. (11)

In (11), we also set λ1 = 0.17 and λ = (−0.11, −0.83, 0.07, 0.11)′ . Note that the year-by-year variation in NSES for the neighborhood (indicated by the standard deviation of 0.02 in the above formula) is markedly less than the variation in NSES for movers (given by 0.50).

Lastly, the synthetic value of cognition (TICS) is generated with

Yt,i=14.79+1.47Z0,i+ξZt,i+3.80Ut,i,

for t = 1, …, T, where ξ=(0.36,0.26,0.24,0.37) and where Ut,i has the same distribution as Vt,i. Note that all Vt,i and Ut,i are generated independently of one another for all t and i. The values of all regression coefficients used here are chosen so as to make the simulated data resemble as closely as possible the real HRS data subject to the above model structure. Note also that, under this setting, approximately 30% of individuals move at least once over the six time periods considered.

The data generation process described above is repeated independently for K = 1000 replications. Within each replication, the balance assessment process applied in the prior sections is used. That is, for all six weighting methods outlined in Section 2.1, we calculate the weighted correlation between the exposure (Xt,i) and the time-invariant covariate (Z0,t) as well as the time-varying covariate (Zt,i) using the tall (or stacked) version of the simulated data. For comparison, we also provide the unweighted correlations. When kernel densities are used, bandwidth parameters are determined in the same manner as described in the prior sections. Boxplots of the estimated correlations across all K replications are provided in Figure 5.

FIGURE 5.

FIGURE 5

Box plots of the simulated unweighted and weighted correlations between the exposure and the time-invariant covariate, education (top panel), in addition to the correlations between the exposure and a time-varying covariate (bottom panel), the highest wealth category, across K = 1000 replications [Colour figure can be viewed at wileyonlinelibrary.com]

Within each of the K replications, we estimate Models 1 and 2 (ie, weighted versions of (7) and (8), respectively) as described in Section 2.2. Individual-level random effects and temporal trend are, however, excluded here. For each coefficient that corresponds to a term involving the exposure variable within each model, we calculate bias and root-mean squared error as

 bias =1Kk=1Kη^[k]andrMSE=1Kk=1Kη^[k]2,

respectively, where η^[k] is the value of a coefficient η as estimated within the kth iteration. Note that there should be no relationship between the outcome and the exposure that is not explained by covariates; ergo, for all relevant coefficients, η = 0. We calculate bias and rMSE for unweighted models and models calculated using the six weighting methods. Simulated bias and rMSE are shown in Table 3.

TABLE 3.

Bias and rMSE in estimators of a regression coefficient for various models and weighting methods across K = 1000 simulated data sets

Biasa rMSE
Model 1 Model 2 Model 1 Model 2
Unweighted 0.3424 −0.0444 0.3429 0.0459
Normal −0.0289 0.0070 0.1946 0.2023
t-dist. −0.1016 −0.0028 0.1452 0.0569
Mixture −0.2263 −0.0236 0.3754 0.3728
Uni. Kernel −0.2450 −0.0275 0.3873 0.3692
M.V. Normal −0.0284 0.0081 0.1893 0.2018
M.V. Kernel 0.0761 −0.0307 0.0814 0.0346
a

Note that all estimated biases for Models 1 and 2 are statistically significant at the 5% level

The results presented show that the weights calculated via a multivariate kernel outperform all other methods considered in the setting of this simulation. Specifically, Figure 5 illustrates that weights from a multivariate kernel reduce the correlation far more effectively than the other methods. Likewise, the multivariate kernel weights yield markedly smaller rMSE than all the other methods. In this simulation, controlling for covariates within the regression models (as in Model 2) appears to remove nearly all bias (which may not be the case in practice) with the multivariate kernel having rMSE that is a fraction of the rMSE of the other methods. Furthermore, note that the multivariate kernel tends to show slightly larger rMSE than the unweighted regression for Model 2. This is a consequence of variance inflation due to weighting and is an inherent trade-off in using IPTW to remove bias from observed confounders in estimation of causal effects.

Lastly, note that no method performs perfectly in this simulation study (all methods produce some degree of bias or excessive error as seen in Figure 5 and Table 3). This is, in part, because the data simulated here have very unique characteristics that no method is expected to capture perfectly. In fact, when comparing to the results in Section 3, we see that the simulated data may be harder for these methods to handle than the real data. For added rigor, the supplemental materials present an expanded version of this simulation study. In particular, simulation parameters are varied so as to alter the strength of the relationship between the exposure and covariates and to alter the frequency with which individuals move to new neighborhoods.

5 |. DISCUSSION

By proposing the use of a multivariate kernel density function in IPTW estimation, our work made contributions to statistical methods for causal inference utilizing IPTW and time-varying exposures. A multivariate kernel density allows one to circumvent the need to estimate a sequence of linear conditional models (one for each time point) as proposed by Robins.1 This greatly reduces the degree to which parameterization is needed in estimation of the weights. The kernel removes the need for the treatment (or exposure) at one time point to be a linear function of treatment at prior time points. This type of procedure is particularly necessary for modeling neighborhood-level exposures since an individual’s neighborhood characteristics are mostly invariant over time (therefore, models that condition one time period’s exposure on the prior period’s will tend to have nonstandard probabilistic distributions). When competing methods (including traditional procedures) for IPTW with longitudinal data were applied to the cohort studied in our data analysis, the resulting weights provided poor balance. The use of a multivariate kernel density yielded dramatic improvement.

With more standard data, it is most likely that there will be some cases where employing a multivariate kernel density will be useful and others where it will not be (eg, when Gaussian assumptions do not hold). We believe that it is important for researchers to have multiple tools in their back pocket when trying to estimate the causal effects of continuous exposures or treatments. The continuous exposure problem is a fundamentally harder causal problem to tackle than the binary or multigroup settings. In practice, researchers should try their preferred approach and see if they have obtained good balance using the selected approach. If good balance is not obtained, the researcher should then utilize additional estimators to see if a better balance might be obtained. Thus, we believe it is useful for researchers to understand that both the more standard approach of using a sequence of linear conditional models and the multivariate kernel density method have merits for more standard applications not involving neighborhood data.

Furthermore, IPTW with a multivariate kernel removed the statistical significance of an estimated positive effect of neighborhood on cognitive function. However, when weights are calculated with competing methods, one observes markedly different effects (in some cases, we saw a strong positive effect of neighborhood; in others, the effect was negative and statistically significant).

Note that as is standard in other literature that uses IPTW within neighborhood analyses, eg, in the works of Cerdá et al50 and Do et al,51 we do not consider neighborhood-level clustering within the estimation process for IPTW. This is reasonable in our applications since our data set is not large enough to contain substantial overlap of neighborhoods across cases but methodological adjustments akin to those suggested by Zubizarreta and Keele90 to incorporate such data characteristics may be worth exploring in future work.

To summarize, our work emphasizes the need for doubly robust estimation in neighborhood research. In our application, we saw evidence that the traditional regression model alone was not sufficient for capturing the complex associations that exist in longitudinal neighborhood-based data. The use of IPTW via a multivariate kernel density shows promise as a means for capturing these associations. Therefore, we recommend that the methods outlined here be considered whenever an analyst aims to draw causal inferences within neighborhood research on health with longitudinal, observational data sets.

DATA AVAILABILITY STATEMENT

Data are not publicly available, but pseudo-data are available upon contacting the authors.

Supplementary Material

Supplementary material

ACKNOWLEDGEMENTS

This work was made possible by Award R01AG043960 from the National Institute on Aging. Michael W. Robbins would also like to acknowledge funding from Award R21AG058123 from the National Institute on Aging, and Beth Ann Griffin would like to acknowledge funding from Award R01DA045049 from the National Institute on Drug Abuse.

Funding information

National Institute on Aging, Grant/Award Number: R01AG043960 and R21AG058123; National Institute on Drug Abuse, Grant/Award Number: R01DA045049

Footnotes

6 |

SUPPLEMENTARY MATERIALS

Web Appendix: This appendix contains results for a disaggregation outcome model, expanded simulations, miscellaneous tables, and figures that supplement those provided in the article, and R code that can be used to calculate weights for IPTW via a multivariate kernel.

SUPPORTING INFORMATION

Additional supporting information may be found online in the Supporting Information section at the end of the article.

REFERENCES

  • 1.Robins JM. Marginal structural models versus structural nested models as tools for causal inference. In: Statistical Models in Epidemiology, the Environment, and Clinical Trials New York, NY: Springer; 2000:95–133. [Google Scholar]
  • 2.Robins JM, Hernán MÁ, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology 2000;11:550–560. [DOI] [PubMed] [Google Scholar]
  • 3.Hernán MÁ, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology 2000;11(5):561–570. [DOI] [PubMed] [Google Scholar]
  • 4.Hernán MÁ, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. J Am Stat Assoc 2001;96(454):440–448. [Google Scholar]
  • 5.Hernán MÁ, Brumback BA, Robins JM. Estimating the causal effect of zidovudine on CD4 count with a marginal structural model for repeated measures. Statist Med 2002;21(12):1689–1709. [DOI] [PubMed] [Google Scholar]
  • 6.Gill RD, Robins JM. Causal inference for complex longitudinal data: the continuous case. Ann Stat 2001:1785–1811.
  • 7.Arjas E, Parner J. Causal reasoning from longitudinal data. Scand J Stat 2004;31(2):171–187. [Google Scholar]
  • 8.Seaman S, Copas A. Doubly robust generalized estimating equations for longitudinal data. Statist Med 2009;28(6):937–955. [DOI] [PubMed] [Google Scholar]
  • 9.Rotnitzky A, Lei Q, Sued M, Robins JM. Improved double-robust estimation in missing data and causal inference models. Biometrika 2012;99(2):439–456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Imai K, Ratkovic M. Robust estimation of inverse probability weights for marginal structural models. J Am Stat Assoc 2015;110(511):1013–1023. [Google Scholar]
  • 11.Imbens GW. The role of the propensity score in estimating dose-response functions. Biometrika 2000;87(3):706–710. [Google Scholar]
  • 12.Lechner M. Identification and estimation of causal effects of multiple treatments under the conditional independence assumption. In: Econometric Evaluation of Labour Market Policies Berlin, Germany: Springer; 2001:43–58. [Google Scholar]
  • 13.Hirano K, Imbens GW. The propensity score with continuous treatments. Appl Bayesian Model Causal Inference Incomplete-Data Perspect 2004;226164:73–84. [Google Scholar]
  • 14.Imai K, Van Dyk DA. Causal inference with general treatment regimes: generalizing the propensity score. J Am Stat Assoc 2004;99(467):854–866. [Google Scholar]
  • 15.Bia M, Flores CA, Flores-Lagunes A, Mattei A. A stata package for the application of semiparametric estimators of dose–response functions. Stata J 2014;14(3):580–604. [Google Scholar]
  • 16.Hirano K, Imbens GW, Ridder G. Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 2003;71(4):1161–1189. [Google Scholar]
  • 17.Zhu Y, Coffman DL, Ghosh D. A boosting algorithm for estimating generalized propensity scores with continuous treatments. J Causal Inference 2015;3(1):25–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Katz LF, Kling JR, Liebman JB. Moving to opportunity in boston: early results of a randomized mobility experiment. Q J Econ 2001;116(2):607–654. [Google Scholar]
  • 19.Leventhal T, Brooks-Gunn J. Moving to opportunity: an experimental study of neighborhood effects on mental health. Am J Public Health 2003;93(9):1576–1582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sanbonmatsu L, Katz LF, Ludwig J, et al. Moving to Opportunity for Fair Housing Demonstration Program: Final Impacts Evaluation. Research Report Washington DC: US Dept of Housing and Urban Development; 2011. [Google Scholar]
  • 21.Shih RA, Ghosh-Dastidar B, Margolis KL, et al. Neighborhood socioeconomic status and cognitive function in women. Am J Public Health 2011;101(9):1721–8. 10.2105/ajph.2011.300169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sisco SM, Marsiske M. Neighborhood influences on late life cognition in the ACTIVE study. J Aging Res 2012;2012:435826. 10.1155/2012/435826 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wight RG, Aneshensel CS, Miller-Martinez D, et al. Urban neighborhood context, educational attainment, and cognitive function among older adults. Am J Epidemiol 2006;163(12):1071–1078. 10.1093/aje/kwj176 [DOI] [PubMed] [Google Scholar]
  • 24.Zeki Al Hazzouri A, Haan MN, Osypuk T, Abdou C, Hinton L, Aiello AE. Neighborhood socioeconomic context and cognitive decline among older Mexican Americans: results from the Sacramento area Latino study on aging. Am J Epidemiol 2011;174(4):423–31. 10.1093/aje/kwr095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Weden MM, Miles JNV, Friedman EM, et al. The Hispanic paradox: race/ethnicity and nativity, immigrant enclave residence, and cognitive impairment among older US adults. J Am Geriatr Soc 2016;65(5):1085–1091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Dubowitz T, Ghosh-Dastidar M, Eibner C, et al. The women’s health initiative: the food environment, neighborhood socioeconomic status, BMI, and blood pressure. Obesity 2012;20(4):862–71. 10.1038/oby.2011.141 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Richardson AS, Meyer KA, Howard AG, et al. Multiple pathways from the neighborhood food environment to increased body mass index through dietary behaviors: a structural equation-based analysis in the CARDIA study. Health Place 2015;36:74–87. 10.1016/j.healthplace.2015.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Mair CF, Roux AVD, Galea S. Are neighborhood characteristics associated with depressive symptoms? A critical review. J Epidemiol Community Health 2008;62:940–946. [DOI] [PubMed] [Google Scholar]
  • 29.Paczkowski MM, Galea S. Sociodemographic characteristics of the neighborhood and depressive symptoms. Curr Opin Psychiatry 2010;23(4):337–341. [DOI] [PubMed] [Google Scholar]
  • 30.Oakes JM. The (mis)estimation of neighborhood effects: causal inference for a practicable social epidemiology. Soc Sci Med 2004;58(10):1929–1952. [DOI] [PubMed] [Google Scholar]
  • 31.Oakes JM. Rejoinder: causal inference and the relevance of social epidemiology. Soc Sci Med 2004;58(10):1969–1971. [DOI] [PubMed] [Google Scholar]
  • 32.Diez Roux AV. Estimating neighborhood health effects: the challenges of causal inference in a complex world. Soc Sci Med 2004;58(10):1953–1960. [DOI] [PubMed] [Google Scholar]
  • 33.Subramanian SV. The relevance of multilevel statistical methods for identifying causal neighborhood effects. Soc Sci Med 2004;58(10):1961–1967. [DOI] [PubMed] [Google Scholar]
  • 34.Jokela M Are neighborhood health associations causal? A 10-year prospective cohort study with repeated measurements. Am J Epidemiol 2014;180(8):776–784. [DOI] [PubMed] [Google Scholar]
  • 35.Desmond M Unaffordable america: poverty, housing, and eviction. Fast Focus Inst Res Poverty 2015;22:1–6. [Google Scholar]
  • 36.Arcaya MC, Graif C, Waters MC, Subramanian SV. Health selection into neighborhoods among families in the moving to opportunity program. Am J Epidemiol 2016;183(2):130–137. 10.1093/aje/kwv189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Dunn EC, Winning A, Zaika N, Subramanian SV. Does poor health predict moving, move quality, and desire to move? A study examining neighborhood selection in US adolescents and adults. Health Place 2014;30:154–64. 10.1016/j.healthplace.2014.08.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Friedman EM, Weden MM, Shih RA, Kovalchik S, Singh R, Escarce J. Functioning, forgetting, or failing health: which factors are associated with a community-based move among older adults? J Gerontol B Psychol Sci Soc Sci 2016;71(6):1120–1130. 10.1093/geronb/gbv075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Litwak E, Longino CF. Migration patterns among the elderly: a developmental perspective. The Gerontologist 1987;27(3):266–272. [DOI] [PubMed] [Google Scholar]
  • 40.Calise TV, Heeren T, DeJong W, Dumith SC, Kohl HW III. Do neighborhoods make people active, or do people make active neighborhoods? Evidence from a planned community in Austin, Texas. Prev Chronic Dis 2013;10:E102. 10.5888/pcd10.120119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Popkin SJ. Safety Is the Most Important Thing: How HOPE VI Helped Families. Technical Report Washington, DC: Urban Institute, Metropolitan Housing and Communities Center; 2007. [Google Scholar]
  • 42.Varady DP, Walker CC. Vouchering out distressed subsidized developments: does moving lead to improvements in housing and neighborhood conditions? Hous Policy Debate 2000;11(1):115–162. [Google Scholar]
  • 43.Goering J, Feins JD, Richardson TM. A cross-site analysis of initial moving to opportunity demonstration results. J Hous Res 2002;13(1):1. [Google Scholar]
  • 44.Diez Roux AV. Investigating neighborhood and area effects on health. Am J Public Health 2001;91(11):1783–1789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Macintyre S, Ellaway A, Cummins S. Place effects on health: how can we conceptualise, operationalise and measure them? Soc Sci Med 2002;55(1):125–139. [DOI] [PubMed] [Google Scholar]
  • 46.Hurd MD, Martorell P, Delavande A, Mullen KJ, Langa KM. Monetary costs of dementia in the United States. N Engl J Med 2013;368(14):1326–1334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Daviglus ML, Bell CC, Berrettini W, et al. National institutes of health state-of-the-Science conference statement: preventing Alzheimer disease and cognitive decline. Ann Intern Med 2010;153(3):176–181. [DOI] [PubMed] [Google Scholar]
  • 48.Ritchie K, Carrière I, Ritchie C, Berr C, Artero S, Ancelin M-L. Designing prevention programmes to reduce incidence of dementia: prospective cohort study of modifiable risk factors. BMJ 2010;341:c3885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wu YT, Prina AM, Brayne C. The association between community environment and cognitive function: a systematic review. Soc Psychiatry Psychiatr Epidemiol 2015;50(3):351–362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Cerdá M, Diez-Roux AV, Tchetgen ET, Gordon-Larsen P, Kiefe C. The relationship between neighborhood poverty and alcohol use: estimation by marginal structural models. Epidemiology 2010;21(4):482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Do DP, Wang L, Elliott MR. Investigating the relationship between neighborhood poverty and mortality risk: a marginal structural modeling approach. Soc Sci Med 2013;91:58–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Kravitz-Wirtz N Temporal effects of child and adolescent exposure to neighborhood disadvantage on black/white disparities in young adult obesity. J Adolesc Health 2016;58(5):551–557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lei MK, Simons RL, Beach SR, Philibert RA. Neighborhood disadvantage and biological aging: using marginal structural models to assess the link between neighborhood census variables and epigenetic aging. J Gerontol B Psychol Sci Soc Sci 2017:e50–e59. [DOI] [PMC free article] [PubMed]
  • 54.Sampson RJ, Sharkey P, Raudenbush SW. Durable effects of concentrated disadvantage on verbal ability among African-American children. Proc Natl Acad Sci 2008;105(3):845–852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wodtke GT, Harding DJ, Elwert F. Neighborhood effects in temporal perspective: the impact of long-term exposure to concentrated disadvantage on high school graduation. Am Sociol Rev 2011;76(5):713–736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Robins J Marginal structural models In: 1997 Proceedings or the American Statistical Association: Section on Bayesian Statistical Science; 1998; Alexandria, VA. [Google Scholar]
  • 57.Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics 2005;61(4):962–973. [DOI] [PubMed] [Google Scholar]
  • 58.Kang JD, Schafer JL. Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science 2007;22(4):523–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Shardell M, Hicks GE, Ferrucci L. Doubly robust estimation and causal inference in longitudinal studies with dropout and truncation by death. Biostatistics 2014;16(1):155–168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Do DP, Finch BK. The link between neighborhood poverty and health: context or composition? Am J Epidemiol 2008;168(6):611–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Pollack CE, Griffin BA, Lynch J. Housing affordability and health among homeowners and renters. Am J Prev Med 2010;39(6):515–521. [DOI] [PubMed] [Google Scholar]
  • 62.Holland PW. Statistics and causal inference. J Am Stat Assoc 1986;81(396):945–960. [Google Scholar]
  • 63.Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol 1974;66(5):688. [Google Scholar]
  • 64.Griffin BA, Eibner C, Bird CE, et al. The relationship between urban sprawl and coronary heart disease in women. Health Place 2013;20:51–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Li F, Zaslavsky AM, Landrum MB. Propensity score weighting with multilevel data. Statist Med 2013;32(19):3373–3387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Liu W, Kuramoto SJ, Stuart EA. An introduction to sensitivity analysis for unobserved confounding in nonexperimental prevention research. Prevention Science 2013;14(6):570–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Carnegie NB, Harada M, Hill JL. Assessing sensitivity to unmeasured confounding using a simulated potential confounder. J Res Educ Eff 2016;9(3):395–420. [Google Scholar]
  • 68.Benaglia T, Chauveau D, Hunter DR, Young D. mixtools: an R package for analyzing finite mixture models. J Stat Softw 2009;32(6):1–29. [Google Scholar]
  • 69.Silverman BW. Density Estimation for Statistics and Data Analysis Vol. 26; Boca Raton, FL: CRC Press; 1986. [Google Scholar]
  • 70.Wand M, Jones M. Multivariate plug-in bandwidth selection. Computational Statistics 1994;9(2):97–116. [Google Scholar]
  • 71.Duong T, Hazelton M. Plug-in bandwidth matrices for bivariate kernel density estimation. J Nonparametric Stat 2003;15(1):17–30. [Google Scholar]
  • 72.Chacón JE, Duong T. Multivariate plug-in bandwidth selection with unconstrained pilot bandwidth matrices. Test 2010;19(2):375–398. [Google Scholar]
  • 73.Ridgeway G, Kovalchik SA, Griffin BA, Kabeto MU. Propensity score analysis with survey weighted data. J Causal Inference 2015;3(2):237–249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Griffin BA, Robbins M. Estimating generalized propensity scores with survey weighted data 2019. Pre-print. [DOI] [PMC free article] [PubMed]
  • 75.Robins JM. Correcting for non-compliance in randomized trials using structural nested mean models. Commun Stat Theory Methods 1994;23(8):2379–2412. [Google Scholar]
  • 76.Robins JM. Analytic methods for estimating HIV-treatment and cofactor effects. In: Methodological Issues in AIDS Behavioral Research Boston, MA: Springer; 2002:213–288. [Google Scholar]
  • 77.Robins JM. Structural nested failure time models. Encycl Biostat 2005;7. [Google Scholar]
  • 78.Vansteelandt S, Joffe M. Structural nested models and G-estimation: the partially realized promise. Statistical Science 2014;29(4):707–731. [Google Scholar]
  • 79.Robins JM, Rotnitzky A. Semiparametric efficiency in multivariate regression models with missing data. J Am Stat Assoc 1995;90(429):122–129. [Google Scholar]
  • 80.Hauser RM, Willis RJ. survey design and methodology in the Health and Retirement Study and the Wisconsin Longitudinal Study. Popul Dev Rev 2004;30:209–235. [Google Scholar]
  • 81.Kapteyn A, Michaud P-C, Smith JP, Soest AV. Effects of Attrition and Non-Response in the Health and Retirement Study Santa Monica, CA: RAND Corporation; 2006. [Google Scholar]
  • 82.Michaud PC, Kapteyn A, Smith JP, van Soest A. Temporary and permanent unit non-response in follow-up interviews of the Health and Retirement Study. Longitud Life Course Stud 2011;2(2):145–169. [Google Scholar]
  • 83.Heeringa SG, Connor JH. Technical Description of the Health and Retirement Survey Sample Design. Technical Report Ann Arbor, MI: University of Michigan; 1995. [Google Scholar]
  • 84.Crooks VC, Clark L, Petitti DB, Chui H, Chiu V. Validation of multi-stage telephone-based identification of cognitive impairment and dementia. BMC Neurology 2005;5(1):8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Miles JN, Weden MM, Lavery D, Escarce JJ, Cagney KA, Shih RA. Constructing a time-invariant measure of the socio-economic status of US census tracts. J Urban Health 2016;93(1):213–232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Kish L Survey Sampling Hoboken, NJ: John Wiley & Sons; 1965. [Google Scholar]
  • 87.Schneeweiss S Sensitivity analysis and external adjustment for unmeasured confounders in epidemiologic database studies of therapeutics. Pharmacoepidemiol Drug Saf 2006;15(5):291–303. [DOI] [PubMed] [Google Scholar]
  • 88.Blackwell M A selection bias approach to sensitivity analysis for causal effects. Political Analysis 2014;22(2):169–182. [Google Scholar]
  • 89.VanderWeele TJ, Ding P. Sensitivity analysis in observational research: introducing the e-value. Ann Intern Med 2017;167(4):268–274. [DOI] [PubMed] [Google Scholar]
  • 90.Zubizarreta JR, Keele L. Optimal multilevel matching in clustered observational studies: a case study of the effectiveness of private schools under a large-scale voucher system. J Am Stat Assoc 2017;112(518):547–560. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

Data Availability Statement

Data are not publicly available, but pseudo-data are available upon contacting the authors.

RESOURCES