Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Jan 4.
Published in final edited form as: Am Stat. 2022 Jan 4;76(2):142–151. doi: 10.1080/00031305.2021.2003245

Statistical implications of endogeneity induced by residential segregation in small-area modelling of health inequities

Rachel C Nethery 1,*, Jarvis T Chen 2, Nancy Krieger 2, Pamela D Waterman 2, Emily Peterson 3, Lance A Waller 3, Brent A Coull 1
PMCID: PMC9070859  NIHMSID: NIHMS1762308  PMID: 35531350

Abstract

Health inequities are assessed by health departments to identify social groups disproportionately burdened by disease and by academic researchers to understand how social, economic, and environmental inequities manifest as health inequities. To characterize inequities, group-specific small-area health data are often modeled using log-linear generalized linear models (GLM) or generalized linear mixed models (GLMM) with a random intercept. These approaches estimate the same marginal rate ratio comparing disease rates across groups under standard assumptions. Here we explore how residential segregation combined with social group differences in disease risk can lead to contradictory findings from the GLM and GLMM. We show that this occurs because small-area disease rate data collected under these conditions induce endogeneity in the GLMM due to correlation between the model’s offset and random effect. This results in GLMM estimates that represent conditional rather than marginal associations. We refer to endogeneity arising from the offset, which to our knowledge has not been noted previously, as “offset endogeneity”. We illustrate this phenomenon in simulated data and real premature mortality data, and we propose alternative modeling approaches to address it. We also introduce to a statistical audience the social epidemiologic terminology for framing health inequities, which enables responsible interpretation of results.

Keywords: small-area analysis, generalized linear mixed models, disease mapping, offset endogeneity, interdisciplinary

1. Introduction

Modeling of small area disease/health event rates has long been critical to quantifying health inequities– that is, unjust, unnecessary, and in principle preventable differences in health status across social groups (Braveman and Gruskin 2003) – and to informing public health policies seeking to eliminate these inequities. Health departments employ such models to identify social groups disproportionately and unfairly burdened by disease and likely to benefit from public health resources (Association of State and Territorial Health Officials 2021; National Association of County Health Officials 2021). Academic researchers also model disease rates to investigate how social, economic, and environmental inequities across social groups manifest as health inequities and how risk patterns vary over space and time (Beckfield 2018). Small-area disease rates for social groups, e.g. racialized or economic groups, are often utilized for this purpose, and either generalized linear models (GLM) or generalized linear mixed models (GLMM) for count/rate outcomes are employed to estimate inequities (Burton et al. 2010; Chen 2013; Massachusetts Department of Public Health 2010).

Under the usual modeling assumptions (described in Section 4), the standard log-linear GLM and GLMM used to model inequities estimate the same marginal rate ratio comparing health outcomes across social groups (Demidenko 2007; Young et al. 2007; Zhang et al. 2012). In academic research, log-linear GLMMs, also known as disease mapping models when applied to health data from small enumeration areas, are often preferred due to their ability to estimate inequity measures with a marginal interpretation while accounting for spatial correlation in small-area data. These models also yield smoothed rate estimates for small areas where observed rates may be noisy due to small local sample sizes. However, GLMMs may not estimate marginal parameters if the data violate key modeling assumptions. For instance, endogeneity in the form of correlation between covariates and areal-level random effects in a GLMM can compromise the marginal interpretations of parameters in the log-linear GLMM (Bates et al. 2014; Neuhaus and McCulloch 2006).

Although the impacts of endogeneity in multilevel model frameworks have been well-studied, little to no work has focused on the implications for disease mapping models in health inequity research. We find that residential segregation combined with social group differences in on-average individual risk give rise to small-area data that induce endogeneity due to correlation between the offset and random effects in disease mapping models. While others have examined endogeneity due to correlation between covariates and random effects (Bates et al. 2014; Neuhaus and McCulloch 2006), to our knowledge this specific form of endogeneity, which we call “offset endogeneity”, has not been documented previously. This study examines and demonstrates how offset endogeneity leads to model misspecification and contradictory findings from the GLM and GLMM approaches.

Specifically, using the example of health inequities across racialized social groups, we demonstrate how racialized residential segregation across small areas combined with baseline on-average racialized differences in health outcomes, conditions which frequently co-occur, leads to offset endogeneity and contradictory inequity estimates from the GLM and GLMM. We evaluate the magnitude and implications of these discrepancies in real data and in simulations, and we propose an adjusted model specification that we term the “individual and neighborhood inequities” model (INE). This alternative formulation addresses the endogeneity problem by explicitly conditioning the models on both individual-level racialized social group membership and neighborhood racialized social group composition, and it can provide greater insights into the factors driving health inequities.

As a secondary objective of this paper, responding to recent calls for more thoughtful and fair uses of social data in health-related analyses (Chen et al. 2021; Krieger 2021), we employ social epidemiologic terminology for framing health inequities, which is critical to proper interpretation of studies employing these concepts. In Section 2, we explain and justify this terminology. In Section 3, we motivate this work by describing census tract (CT) premature mortality rate data for two racialized social groups in Massachusetts (MA), as defined by the US census: the Black population and the white non-Hispanic population (WNH) (US Census Bureau 2017, 2020). In Section 4, we formalize the GLM and GLMM approaches and examine how small-area disease/health event data stratified by racialized social group can induce offset endogeneity. In Sections 5 and 6, we evaluate the impact of this issue using simulated and real data, respectively. We conclude with a discussion in Section 7.

2. Language and terminology for health inequity studies

Because the aim of health inequity studies is to characterize unfair burdens of morbidity and mortality borne by social groups harmed by injustice (Braveman and Gruskin 2003; Krieger 2020), we utilize language that emphasizes the power dynamics and the agency (or denial thereof) of the groups that lead to inequities (Benjamin 2017; Krieger 2020, 2021). Recognizing that some of this terminology may be unfamiliar to a statistical audience, we explain and justify it here. We first note that, in statistics, much emphasis is placed on mathematical formalization of concepts, and, when working in interdisciplinary settings, formalizing the subject matter is equally important. This is perhaps uniquely critical when studying social constructs, as improper framing in such studies has historically been used to justify unjust beliefs and actions (Benjamin 2017; Krieger 2020, 2021). Because we examine a statistical problem brought about by complex social dynamics, formalizing and emphasizing groups’ relative power and agency is essential for accurate interpretation.

First, we use the term health inequity, as opposed to health disparity, to make clear that we are focused on preventable health differences across groups that arise because of social and structural inequities rather than biological differences (Beckfield 2018; Braveman and Gruskin 2003; Krieger 2020). We consider health inequities between two social groups co-defined by their power relations, which we refer to abstractly as the “privileged” and “marginalized” groups. This language highlights the fact that the marginalized group is denied agency in determining its status. Note that the term marginalized when applied to social groups has no connection to and should not be confused with the statistical concept of marginalization. In our motivating example, we utilize the term racialized social group, instead of the more commonly used but problematic term “race” (Benjamin 2017; Krieger 2021), to more precisely describe the theoretical social construct we aim to study. This terminology clarifies that “race” is socially (rather than biologically) defined and is not an innate individual “property” or trait. In practice, we interpret the data on what is termed “race” and “ethnicity” in the administrative health and census data employed in our analyses as indicators of racialized social group. For easy reference, this terminology is summarized in Table 1. Additional references on this topic are provided in the Supplementary Materials.

Table 1.

Health inequities terminology

Term Definition
Health inequity Unjust, unnecessary, and in principle preventable differences in health status across social groups (Beckfield 2018; Braveman and Gruskin 2003; Krieger 2020).
Privileged vs. marginalized groups Social group classifications for groups co-defined by unjust power relations, with these terms emphasizing the relative power and agency (or denial thereof) of the groups arising due to unfair social, economic, and political structures (Benjamin 2017; Krieger 2020, 2021).
Racialized social group Precise terminology that clarifies the category of “race” is socially (rather than biologically) defined and is not an innate individual “property” or trait (Benjamin 2017; Krieger 2020, 2021).

3. Motivating data

To motivate this work, we introduce the data we use to investigate inequities in CT-level premature mortality (death before age 65) in the Black and WNH populations in MA. We analyze data for the Black population, as opposed to the Black non-Hispanic population, because CT-level age-stratified population count data are not available for the latter (9.8% of Black individuals in MA also identified as Hispanic in the 2010 census). CTs are US Census Bureau administrative units designed to include roughly 4,000 residents (US Census Bureau 2019). They are commonly used to investigate spatial patterns in health (Krieger et al. 2005). Premature mortality is a widely used population health indicator for characterizing health inequities (Chen et al. 2006).

We obtained records for all premature deaths in MA during 2008–2012, dates chosen to be centered around the 2010 decennial census, from the MA Department of Health (55,836 total premature deaths). We geocoded the residential address at death to the corresponding CT, with only 0.4% of deaths unable to be geocoded at that level of precision. We extracted the racial/ethnic-, age- and sex-stratified population counts, as categorized with US census terminology, for each CT from the 2010 decennial census using the R package tidycensus (Walker 2020). We then multiplied each population count by five to obtain the at-risk person-years corresponding to our five years of mortality data. Our study includes all MA CTs with non-zero Black and WNH populations, totaling N = 1,465 CTs (99.1% of MA’s 1,478 CTs in 2010).

Let i = {1, …, N} index CT and j = {0, 1} index racialized social group within CT so that Yij is the observed count of premature deaths in racialized group j within CT i. We constructed standardized mortality ratios (SMRs) using the indirect standardization approach (Chen 2013). CT and racialized social group-specific expected premature death counts are calculated to serve as a denominator (the log of the denominator often serves as the offset in statistical modeling) to account for differential at-risk person-time and age and sex compositions. See the Supplementary Materials for more detail. We denote the expected count for racialized social group j in CT i by Pij.

The SMR is then given by SMRij = Yij/Pij. Figure 1A shows the distributions of the CT racialized social group-specific 5-year expected counts and SMRs. To illustrate the racialized segregation and the concentration of the Black population in a small number of CTs, Figure 2 maps the percent Black for all CTs in MA, and a magnified view of the city of Boston. The average percent Black across all MA CTs in 2010 was 7.5%, and approximately 39% of MA CTs had populations composed of <2% Black individuals. Boston was more diverse, with an average of 23.7% Black across the 177 Boston CTs.

Figure 1.

Figure 1.

For MA 2008–2012 5-year CT premature mortality data, (A) racialized group-stratified histograms of CT 5-year expected premature mortality counts (left) and SMRs (right); (B) observed vs expected 5-year premature mortality counts for Black vs. WNH populations. In the left panel, raw expected counts are used and in the right panel, the raw expected counts are adjusted by the CT baseline mortality rate.

Figure 2.

Figure 2.

CT percent Black for MA (left) and Boston (right), 2008–2012.

4. Methods

We focus on two traditional approaches to health inequity analysis using small-area data: GLMs, also called aggregated analyses, and GLMMs, also called disease mapping models, both of which we describe in detail below. For general exposition of the statistical concepts and methods, we abstract to two social groups: privileged and marginalized (in our motivating data, the WNH and Black populations). Analogous to the notation introduced above, let i index CT and j index social group.

A standard GLM specification for estimating health inequities across the study population’s social groups is

log(E(YijXij))=α0+α1Xij+log(Pij), (1)

where Xij denotes the indicator of social group (marginalized =1, privileged = 0). Yij and Pij are the CT- and social group-specific observed event count and expected event count, described in Section 3. This model is often estimated assuming a Poisson likelihood, although more flexible approaches such as the quasi-Poisson or the negative binomial distribution are also common. The rate ratio (RR) for comparing the CT-level rate of the health event in the marginalized vs. privileged population is exp(α1).

The GLM approach assumes that the independent unit of analysis is the CT- and social group-specific rate; however, the independence assumption is likely to be violated, as the rates of a health outcome in marginalized and privileged populations in the same neighborhoods are almost certain to be correlated (Kiang et al. 2019). In spite of its over-simplified specification, the GLM approach retains relevance because the RR estimator from the GLM can be shown to be equivalent to that obtained from an aggregated social group inequity analysis. The aggregated approach, often used by state health departments to monitor health inequities, aggregates incidence data from the entire state to generate a state-wide age-adjusted RR estimate comparing the marginalized and privileged groups (Massachusetts Department of Public Health 2010).

The GLMM extends the GLM approach by including a random intercept, often with a spatial correlation structure, along with the social group indicator fixed effect which is used to estimate the RR (Chen 2013; Kiang et al. 2019). In addition to estimating the overall RR, this model enables estimation of smoothed area-specific incidence/mortality rates, which can be used to further investigate spatial patterns in risk. A standard GLMM specification for estimating health inequities is

log(E(YijXij,bi))=β0+β1Xij+bi+log(Pij), (2)

where bi is a CT-specific random intercept. In the standard mixed model framework, the bi are assumed to be i.i.d. normal, i.e., bi~N(0,σb2). In addition to accounting for correlation in social group-specific health event rates from the same CT, the random effect can also be conceived as capturing some unmeasured CT-level factors that drive baseline rates. The spatial version of this model for areal data generally specifies a conditionally auto-regressive (CAR) structure on bi, which further allows for correlation in the event rates of spatially proximate CTs (Schmidt and Nobre 2018). Previous work has described conditions that can lead to discrepancies in fixed effect estimates from spatial and non-spatial mixed models (Hodges and Reich 2010). In this work, in order to isolate the impact of endogeneity resulting from residential segregation, we focus on the non-spatial GLMM, but the issues we illustrate here would extend to spatial analogues.

For many link functions, the fixed effects from GLMs and GLMMs are not directly comparable, as GLM parameters represent marginal associations while GLMM parameters represent conditional ones (given the CT-level random effects). Yet, under standard assumptions, a log-linear GLMM with normally distributed random intercepts is a special case in which the parameters in the two models coincide (Demidenko 2007; Young et al. 2007; Zeger, Liang, and Albert 1988; Zhang et al. 2012). This is apparent once one integrates over the random intercepts in the GLMM, yielding

E(YijXij)=Eb(E(YijXij,bi))=exp((β0+12σb2)+β1Xij+log(Pij)).

Thus, the marginal mean structure for this GLMM is equivalent to the GLM mean structure, with the exception of the constant multiplicative term exp(12σb2), a function of the variance of the random effects. While this term causes the intercepts from the two models to differ by 12σ2, the marginal RR for social group remains exp(β1). This feature of a log-linear GLMM with a random intercept, namely its ability to estimate marginal fixed effect parameters while accounting for complex correlation structures and enabling estimation of smoothed rates, makes it a popular choice for disease mapping.

4.1. Endogeneity

As shown above, exp(α1) and exp(β1) coincide when the true mean structure underlying the data is given by equation (1) or (2). However, we do not expect that these are the true, “causal” data generating mechanisms for complex health outcomes like premature mortality. In addition to describing patterns of unequal risk experienced by marginalized groups, researchers may also want to explore the mechanisms by which inequities arise, e.g., by including additional variables such as economic indicators or environmental exposures in statistical models, without necessarily making causal claims about “the effect” of social group. Although building a causal model may not be the goal of inequity analyses, certain types of model misspecification and violations of assumptions can impact model performance and interpretability of the results.

In biostatistics, one often overlooked assumption of the GLMM is that the random effects are uncorrelated with the fixed predictors on the right-hand side of the model (note that we use “fixed predictor/covariate/term” to refer to any covariates modeled with fixed rather than random effects). Most commonly discussed in the economics and education literature, correlation between random terms and fixed terms on the right-hand side of a regression model is referred to as the problem of endogeneity (Bates et al. 2014). GLMMs assuming i.i.d. normal random effects suffer from model misspecification in the context of endogeneity arising due to correlation between covariates and random effects, which has been noted in biostatistical contexts as well (Neuhaus and McCulloch 2006). Because endogeneity is a very general term, often referring to omitted variable bias in causal inference settings (where omitted variables are confounders of the “causal” effect of an exposure of interest), we clarify that our use of the term endogeneity throughout this paper refers to model misspecification arising from correlation between a fixed term and a random term in a non-causal setting.

Critically, when a fixed term is correlated with the random effect, fixed effect parameters in the properly-specified GLMM have a conditional, rather than marginal, interpretation, even in the special case of the log-linear model with a random intercept. Moreover, some literature on this topic suggests that, when a standard GLMM with i.i.d. random effects is erroneously specified and fit in the presence of this type of endogeneity, fixed effect parameter estimates are largely insensitive to the model misspecification (Neuhaus and McCulloch 2006), meaning that bias of the conditional effect estimator under endogeneity is typically small.

While the RR estimator from the GLMM estimates the conditional association parameter in the presence of endogeneity, the RR estimator from the GLM estimates the marginal association. Thus, endogeneity can lead to discrepancies between the GLM and GLMM inequity estimates. In the next section, we describe how endogeneity can arise in health inequity models due to correlation between the offset and the random intercept.

4.2. Example: Racialized residential segregation and baseline health differences

Here we demonstrate how the combination of racialized residential segregation and higher baseline rates of disease and mortality in neighborhoods with a high proportion of people of color– conditions that commonly co-occur in practice– induce a previously undocumented type of endogeneity, one relevant more broadly in the context of residential segregation and health inequities. In our setting, the history of racialized residential segregation at the neighborhood level results in within-CT homogeneity of residents’ racialized social group membership. As a result, CT expected event counts in Black and WNH groups, Pi1 and Pi0 respectively, will be negatively correlated in the presence of racialized residential segregation, with the magnitude of the correlation reflecting the degree of CT segregation. Moreover, if Black populations comprise a small portion of the study area’s total population and reside largely in a small number of neighborhoods, as in MA (Figure 2), Pi1 may be very small in most neighborhoods, resulting in unstable, zero-heavy SMRs for Black populations.

In the US, adverse racialized residential segregation is often accompanied by the residents’ greater exposure to community disinvestment and environmental hazards, as well as household- and individual-level exposures to racial discrimination and economic deprivation, together potentially increasing risk of poor health status among all persons living in this neighborhood (Duncan and Kawachi 2018; Roux and Mair 2010). The converse is that exclusionary racialized residential segregation, to protect privilege, adds to the health advantages of these areas’ residents (Duncan and Kawachi 2018; Roux and Mair 2010). Higher baseline rates of the health outcome in neighborhoods whose residents are predominantly people of color– in our example, Black– can be considered in the context of a GLMM-type data generating mechanism (equation 2) as higher values of the random effect, bi, occurring in CT with high Black population size/expected counts, Pi1. This leads to a setting in which bi is positively correlated with Pi1 and negatively correlated with Pi0—endogeneity arising from correlation between a random effect and the offset, which we refer to as “offset endogeneity”. Although the offset is in essence just a special type of covariate (with coefficient fixed to 1), to our knowledge it has never before been identified as a source of endogeneity. Note that, in this scenario, bi has no association with Xij, since each CT has exactly one count with Xij = 1 and one with Xij = 0 by design. Nonetheless, offset endogeneity introduced by the correlation between bi and Pi1 (and Pi0) creates a scenario in which the marginal and conditional associations between Yij and Xij differ, and the RR parameters from a GLM and GLMM diverge, even when using a log link.

Further insight into this phenomenon can be obtained by deriving the closed form estimators of the coefficients in the Poisson GLM and a simplified analogue of the Poisson GLMM. First, note that the GLM we specified is a saturated model, so that the estimator of α1 when using a Poisson distribution takes the form

α^1=log((iYi1)(iPi0)(iYi0)(iPi1))=log(Y1P0Y0P1). (3)

This confirms that the RR estimator from the GLM is equivalent to the RR estimator in an aggregated analysis, as stated in Section 1. Now, in order to develop closed-form expressions that highlight the impact of endogeneity, for the moment assume the random effects bi in the GLMM model are a priori known constants. In this case the bi can be absorbed into the offset term and the model is again a saturated model with closed form estimator of β1

β^1*=log((iYi1)(iPi0 exp(bi))(iYi0)(iPi1 exp(bi))) (4)

Because this model explicitly conditions on bi, β^1* has a conditional interpretation. From these closed form estimators, we can see that if Pi1 = πPi0 for some constant π ≥ 0, then β^1* and α^1 reduce to the same expression. When Pi1 is a constant proportion of Pi0 across areas, this association indicates the absence of racialized residential segregation, and in this setting, the marginal and conditional inequity RR estimators will coincide.

However, when there is both racialized residential segregation, Pi1πPi0, and baseline heterogeneity in rates of the outcome across areas, bi ≠ 0, equations (3) and (4) are not equal. Thus, the marginal and conditional estimators will differ. To gain further insight, we now equivalently write equation (4) as

β^1*=log((iYi1)(iPi0 φi)(iYi0)(iPi1 φi)) where φi=exp(bi)iexp(bi)

where the φi ∈ [0, 1] can be thought of as weights. This “weighted” expression makes clear that when the bi are large for large Pi1 (i.e., co-occurrence of racialized residential segregation and higher baseline health risk in predominately Black communities), the conditional estimator β^1* up-weights the Black expected counts in the denominator that are the largest, and down-weights the WNH counts in the numerator that are the largest. Comparing this to the estimator α^1 in equation (3), which takes the same form but with weights φi = 1/N, we see that when α^1>0, we also have α^1>β^1*. Thus, the marginal estimate will be larger than the conditional estimate in the presence of racialized residential segregation and higher baseline risks in predominately Black communities. This suggests that, in the presence of offset endogeneity, the health inequity estimate from the GLMM will be an underestimate of the marginal association.

Our MA premature mortality data illustrate this phenomenon (Figure 1B). In the left panel, the CT observed vs expected counts are plotted for Black and WNH populations, and a steeper positive relationship on average for the Black population is evident by the corresponding regression lines. In the right panel, we multiply the expected counts for each racialized group by the total-CT baseline mortality rate, which we call the “adjusted expected premature mortality”. After this adjustment, a steeper relationship on average for the WNH population emerges. The left panel roughly corresponds to the Black vs WNH relationship detected by the GLM and the right panel to the relationship detected by the GLMM. We delve deeper into this issue below with pseudo-simulated data and provide results for the MA premature mortality data.

4.3. Separating neighborhood composition from individual group membership in health inequities modelling: the “individual and neighborhood inequities” model

The phenomenon described above leads to a GLM RR estimate that reflects the marginal comparison of Black vs. WNH premature mortality across all CTs, while the GLMM RR estimate more closely approximates a conditional comparison of Black vs. WNH premature mortality within a given CT. Thus, the GLMM RR estimate is effectively comparing mortality rates for Black vs. WNH individuals living in the same neighborhood. Because the goal of small-area health inequity modeling is, in general, not to estimate the causal effect on the outcome of an individual belonging to a specified social group, but rather to understand how an accumulation of social factors results in social group inequities in morbidity or mortality, the GLMM may be “adjusting away” the association of interest in disease mapping studies of health inequities.

Because the GLMM may implicitly separate the neighborhood-level social group associations (random effect) from the individual-level social group associations (fixed effects), we suggest explicitly disentangling these factors in the models through the use of additional area-level fixed effects. In particular, we recommend the following modified GLMM specification:

log (E(YijXij,Wi,bi))=β0+β1Xij+β2Wi+bi+log(Pij) (5)

where Wi is the CT-level marginalized group composition (say, the percent Black in the CT). Here, exp(β2) is the multiplicative change in premature mortality risk corresponding to a one-unit increase in a CT’s percent marginalized residents. The term exp(β1) is the multiplicative association between premature mortality and individual-level marginalized group membership, conditional on the CT’s marginalized group composition. Analogously, can be included as a covariate in the GLM.

This alternative modeling approach, which we call the individual and neighborhood inequities (INE) model, has several implications. First, including Wi in the GLMM should reduce or eliminate the offset endogeneity issue. Thus, if Wi is included in both the GLM and GLMM, they should produce similar RR estimates. To explain heuristically why this addresses offset endogeneity using our real data example, think of the random intercept in equation (2) as capturing average residual structure across racialized groups with a CT, after adjusting for Xij and Pij. Even “adjustment” for the offset by including it in the fixed portion of the model does not account for disproportionately high premature mortality rates in CTs with larger Black populations, because there are separate racialized group-specific offsets for each CT (Pi1 and Pi0), and the model does not allow for different effects of these two offset terms (in fact the coefficient of the offset is fixed to one across the board). Thus, the increased premature mortality risk in CTs with large Black populations remains in the residual structure after conditioning on Xij and Pij, and the random intercept captures it, leading to a correlation between and bi and Pi1 inducing offset endogeneity. However, including CT proportion Black, Wi—which is common for both racialized groups within a CT—as a covariate effectively adjusts for the increased baseline risk in CTs with large proportion Black and removes that structure from the residuals, so that the random intercept does not capture it and is not highly correlated with Pi1 nor Pi0.

Including explicitly separates the association between neighborhood social group composition and health from the association between individual membership in the social group and health, conditioning each association on the other. In many contexts, separating these associations may provide additional insights into the social factors that lead to inequities. Moreover, the modified GLMM can still provide smoothed small-area estimates of incidence rates or rate ratios, which are often a target of inference in the GLMM approach. As discussed by Chen et al. (2006), when including area-level covariates in health inequity models (Wi), consideration should be given to whether the smoothed quantities of interest are best represented by the bi or by β2Wi + bi. If, instead, interest lies in the RR estimate that compares overall disease incidence across social groups, the standard GLM specification in equation (1) or a generalized estimating equations (GEE) approach, which may produce more accurate uncertainty estimates, will be most appropriate. Although they are not commonly used in this context, in our simulations and data analysis below, we fit GEE models as a comparator for the GLMs and GLMMs.

5. Simulations

This simulation study was conducted in R (R Core Team 2020), and the code is provided in the online Supplementary Materials and on Github. To illustrate how racialized residential segregation and differing baseline health event rates impact common models for health inequity analyses, we simulate CT-level racialized social group-specific data reflecting various combinations of these features. Following the GLMM notation above, CT event counts are generated via independent Poisson draws, Yij~Poisson(λij), with

log(λij)=β0+β1Xij+bi+log(Pij).

As detailed below, four different mean structures are created by altering the specification of the baseline rates, bi, and the expected counts Pij. All simulations have N = 1,465 CTs, corresponding to the number of MA CTs in our study.

Simulation 1: Racialized residential segregation, common baseline rate across CTs.

The CT-specific Black and WNH expected counts are negatively correlated. They are taken from the real MA CT data, to simulate a realistic degree of segregation and small Black populations in many CTs. A common baseline rate is achieved by setting bi = 0.

Simulation 2: No racialized residential segregation, differing baseline rates across CTs.

bi~N(0,σb2) i.i.d., is randomly generated to create differing CT baseline event rates. Expected counts do not differ by racialized social group within a CT, i.e., Pi1 = Pi0, to represent a setting with no racialized residential segregation. We generate Pi0~Unif(0.2,4) i.i.d.

Simulation 3: Racialized residential segregation, differing baseline rates across CTs. CT baseline rates independent of racialized group-specific expected counts.

bi~N(0,σb2) i.i.d., is randomly generated. CT-specific Black and WNH expected counts are negatively correlated. The expected counts are again taken from real MA CT data.

Simulation 4: Racialized residential segregation, differing baseline rates across CTs. CT baseline rates positively associated with Black expected counts and negatively associated with WNH expected counts.

Expected counts Pij are taken from real MA CT-level data. We generate the bi independently from bi~N(μi,σb42) where μi=P˜i0/5 and P˜i0 is the centered and scaled WNH expected count. This leads to a correlation of −0.36 between bi (the baseline rate) and the WNH expected count Pi0, and a correlation of 0.16 between bi and the Black expected count Pi1. This simulation corresponds to the structure anticipated in real health data from segregated areas in MA.

In Simulations 1–3, where the coefficients have marginal interpretations, we fix β0 = 0.02 and β1 = 0.32. In Simulation 4, where the coefficients have a conditional interpretation, we let β0 = 0.03 and β1 = −0.07. Where applicable, σb2=0.19 and σb42=0.25. We generate 200 datasets for each of the four simulation scenarios. To each dataset, we fit a Poisson and negative binomial GLM (equation 1), a Poisson and negative binomial GLMM (equation 2), and a GEE model with a Poisson-like quasi-likelihood and exchangeable working correlation structure within each CT. We report the bias of the Black vs. WNH RR estimate from each model (Figure 3), and the 95% confidence interval coverage probabilities for each RR estimate (Table S.1).

Figure 3.

Figure 3.

Simulation Results. Black vs. WNH RR estimates from the Poisson GLM and GLMM, the GEE, and the negative binomial GLM and GLMM fit to simulated data for each of the four simulation scenarios. True marginal RR used to generate the data is indicated by the black horizontal line and true conditional RR is indicated by the red horizontal line.

In Simulations 1–3, all models yield estimates of the RR with little empirical bias in settings with racialized residential segregation alone or differing CT baseline rates alone, or when both segregation and differing baseline rates are present but are uncorrelated. In Simulation 4, the scenario in which the baseline rates and racialized social group-specific expected counts are correlated, offset endogeneity yields discrepancies between the GLM and GLMM RR estimates. The GLMM estimates approximate the conditional association, while the Poisson GLM estimates the marginal association. The GEE estimates fall between the true marginal and conditional, although closer to the marginal.

In Simulation 4, we also fit INE models, including both an individual and neighborhood fixed effect for racialized social group as in equation (5), and assess the agreement between models (Figure 4). Note that because the INE model specification is not the true data generating model, we do not indicate true parameter values in the figure. The results show that the GLM, GLMM, and GEE estimates now largely agree.

Figure 4.

Figure 4.

Simulation Results. RR estimates from the INE model specification of the Poisson GLM and GLMM, the GEE, and the negative binomial GLM and GLMM fit to synthetic data from Simulation 4. The left panel shows estimates of the RRs for the individual Black indicator and the right panel shows estimates of the RRs for CT proportion Black.

6. Analysis of Massachusetts Premature Mortality Data

We now illustrate this phenomenon by investigating Black vs. WNH inequities in premature mortality in MA using the data described in Section 3. We also conduct secondary analyses restricted to Boston, where a large portion of MA’s Black population resides. The GLMs and GLMMs fit to these data are specified as in equations (1) and (2), respectively, using both Poisson and negative binomial likelihoods. The GEE models use the GLM mean structure with a Poisson-like quasi-likelihood and exchangeable working correlation structure within CTs. The INE models build on the standard models by adding CT proportion Black (centered and scaled) as a covariate. Each model is fit to the CT data for MA and, separately, to the Boston CTs only. All analyses were performed in R (R package citations in the Supplementary Materials).

The MA results mimic the findings from simulated data; i.e., the standard GLMs suggests that members of the Black compared to WNH population experience substantially higher rates of premature mortality, while the standard GLMMs find the opposite (Table 2). The GEE results fall between the GLM and GLMM results, though closer to those of the GLM. In contrast, the results of the INE GLMs, GLMMs, and GEE generally agree. Specifically, they suggest that CT proportion Black is associated with higher rates of premature mortality after conditioning on individual Black racialized social group membership, while individual Black racialized social group membership is associated with lower premature mortality risk conditional on CT proportion Black.

Table 2.

Fixed effect estimates (95% CI) from MA and Boston-only premature mortality models, years 2008–2012. Abbreviations: INE model= Individual and neighborhood inequities model; RR= rate ratio; CT= census tract; P-GLM= Poisson generalized linear model; P-GLMM= Poisson generalized linear mixed model; NB-GLM= negative binomial generalized linear model; NB-GLMM= negative binomial generalized linear mixed model; GEE= generalized estimating equations.

Massachusetts
P-GLM P-GLMM NB-GLM NB-GLMM GEE
Standard Model Intercept 1.02
(1.01,1.03)
1.03
(1.00,1.06)
1.15
(1.12,1.18)
1.03
(1.01,1.06)
1.02
(0.99,1.04)
Black RR 1.37
(1.33,1.42)
0.92
(0.88,0.95)
1.13
(1.07,1.19)
0.93
(0.89,0.96)
1.26
(1.19,1.33)
INE Model Intercept 1.06
(1.05,1.07)
1.04
(1.02,1.07)
1.15
(1.12,1.18)
1.04
(1.02,1.07)
1.06
(1.03,1.08)
Individual Black RR 0.89
(0.85,0.92)
0.85
(0.82,0.88)
0.98
(0.93,1.04)
0.86
(0.82,0.89)
0.82
(0.76,0.87)
CT Proportion Black RR 1.20
(1.19,1.22)
1.26
(1.23,1.29)
1.22
(1.2,1.25)
1.25
(1.22,1.29)
1.22
(1.19,1.25)
Boston
P-GLM P-GLMM NB-GLM NB-GLMM GEE
Standard Model Intercept 1.17
(1.12,1.21)
1.16
(1.07,1.26)
1.32
(1.21,1.45)
1.17
(1.08,1.26)
1.16
(1.06,1.28)
Black RR 1.52
(1.43,1.61)
1.22
(1.12,1.32)
1.36
(1.19,1.56)
1.24
(1.14,1.35)
1.51
(1.35,1.69)
INE Model Intercept 1.14
(1.09,1.18)
1.04
(0.95,1.13)
1.19
(1.09,1.3)
1.04
(0.96,1.13)
1.13
(1.03,1.24)
Individual Black RR 1.21
(1.11,1.31)
1.11
(1.02,1.21)
1.22
(1.06,1.39)
1.13
(1.03,1.24)
1.20
(1.05,1.39)
CT Proportion Black RR 1.07
(1.05,1.09)
1.12
(1.08,1.16)
1.11
(1.07,1.14)
1.12
(1.08,1.16)
1.07
(1.04,1.10)

Although Boston is more diverse than MA as a whole, Boston’s CTs are highly segregated in relation to the Black and WNH populations (Figure 2). In the analyses restricted to Boston (Table 2), inference from the standard GLMs and GLMMs agree that members of the Black population experience higher rates of premature mortality. However, the GLMM point estimates are substantially smaller than the GLM estimates, suggesting that endogeneity may still be impacting these analyses. Additionally, most INE models for Boston indicate that after conditioning on CT proportion Black, individuals categorized as Black have a higher risk of premature mortality, in contrast to the findings for MA. Hence, in Boston, both individual-level measures of membership in the Black population and neighborhood proportion Black contribute jointly to racialized inequities in premature mortality.

Beyond exemplifying offset endogeneity, our results, particularly from the INE models, offer novel substantive insights. First, they suggest that for MA in 2008–2012, the increased premature mortality risk for Black populations is explained mainly by area-level racialized social group composition, with individual-level Black race appearing to have a null or even protective association with mortality after adjusting for CT-level proportion Black. When analyzing Boston alone, these trends reverse, with individual-level membership in the Black racialized social group being equally or more important for explaining increased premature mortality risk as CT racialized social group composition. This geographic heterogeneity in associations further emphasizes the importance of accounting for space and context in social epidemiologic studies.

7. Discussion

In this paper, we report that the commonly co-occurring conditions of racialized residential segregation and increased baseline health risks in neighborhoods whose residents are predominately people of color can lead to divergent results in GLM and GLMM approaches often used to study racialized health inequities. This divergence occurs due to offset endogeneity, i.e., endogeneity induced by correlation between the offset (typically a measure of stratified population size) and the random intercept in the GLMM, and can occur even for models with a log link, for which the literature largely notes that marginal and conditional parameters coincide. In this case, regression coefficients in the log-linear GLMM have conditional interpretations, while the GLM coefficients represent marginal associations. We show that explicitly modeling separate, conditional associations between individual membership in racialized social groups and health outcomes, and between neighborhood racialized social group composition and health outcomes, may reduce offset endogeneity issues and may provide important insights into factors driving health inequities.

Critically, our findings suggest that the ability to estimate marginal association parameters, one of the most appealing features of log-linear GLMMs popular for disease mapping, may often be compromised in the presence of highly segregated study regions and heterogeneous area-level rates that are correlated with the area’s social group composition. In practice, health inequity estimates from standard GLMMs stratified by racialized social group may reflect inequities after conditioning on neighborhood, effectively comparing mortality/morbidity across groups living in the same neighborhood. Moreover, we have shown that the combination of racialized residential segregation and increased baseline health risks in predominately Black neighborhoods results in GLMM effect estimates that are typically lower than their marginal counterparts. Thus, past analyses that estimated racialized health inequities using GLMMs may have underestimated the magnitude of marginal health inequities.

The epidemiologic literature is replete with cautions about the potential for ecologic bias when interpreting the effect of an area-level measure in the absence of control for its individual counterpart. Our results emphasize that the reverse is also true: estimates of associations with individual-level membership in social groups should also consider area-level social context to avoid the “individualistic fallacy.” This observation, while previously noted in the social epidemiology literature, is relatively underappreciated (Diez-Roux 1998; Firebaugh 1978; Pearce 2000; Subramanian et al. 2009).

On the basis of this literature and our findings here, we recommend use of the INE model we have proposed in future small-area disease mapping for inequity studies, whether the social group under study are defined in relation to racialized groups, economic groups, or other social groups reflecting inequitable social relationships (Beckfield 2018; Krieger 2020). While the INE model is simple and straightforward, more complex GLMM specifications such as the shared component model, which allows for separate spatially correlated random effects across groups (Kiang et al. 2019), may also be suitable alternative approaches for dealing with offset endogeneity. We also note that studies of inequities in other fields—including sociology, economics, education, crime and criminal justice, policy, urban planning, and more—frequently utilize log-linear mixed model formulations similar to those presented here and include population denominators as offsets (see references to many such studies in the Supplementary Materials). This suggests that the offset endogeneity issue is likely to be relevant to the broader scientific community and that adoption of INE-type model specifications as standard practice may be advisable in inequity studies across fields.

Supplementary Material

Online Supplementary Materials

Acknowledgements

The authors gratefully acknowledge NIH grants 1K01ES032458-01, R01HD092580, R01AG060232-01A1, and P30ES000002.

References

  1. Association of State and Territorial Health Officials. 2021. “Health Equity and Social Determinants of Health.” Association of State and Territorial Health Officials. Retrieved April 19, 2021 (https://astho.org/Programs/Health-Equity/). [Google Scholar]
  2. Bates Michael David, Castellano Katherine E., Rabe-Hesketh Sophia, and Skrondal Anders. 2014. “Handling Correlations Between Covariates and Random Slopes in Multilevel Models.” Journal of Educational and Behavioral Statistics 39(6):524–49. doi: 10.3102/1076998614559420. [DOI] [Google Scholar]
  3. Beckfield Jason. 2018. Political Sociology and the People’s Health. Oxford University Press. [Google Scholar]
  4. Benjamin Ruha. 2017. “Cultura Obscura: Race, Power, and ‘Culture Talk’ in the Health Sciences.” American Journal of Law & Medicine 43(2–3):225–38. doi: 10.1177/0098858817723661. [DOI] [PubMed] [Google Scholar]
  5. Braveman P, and Gruskin S. 2003. “Defining Equity in Health.” Journal of Epidemiology & Community Health 57(4):254–58. doi: 10.1136/jech.57.4.254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Burton DC, Flannery B, Bennett NM, Farley MM, Gershman K, Harrison LH, Lynfield R, Petit S, Reingold AL, Schaffner W, Thomas A, Plikaytis BD, Rose CE, Whitney CG, and Schuchat A. 2010. “Socioeconomic and Racial/Ethnic Disparities in the Incidence of Bacteremic Pneumonia Among US Adults.” American Journal of Public Health 100(10):1904–11. doi: 10.2105/AJPH.2009.181313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chen IY, Pierson E, Rose S, Joshi S, Ferryman K, and Ghassemi M. 2021. “Ethical Machine Learning in Healthcare.” Annual Review of Biomedical Data Science 4(1):null. doi: 10.1146/annurev-biodatasci-092820-114757. [DOI] [Google Scholar]
  8. Chen JT, Rehkopf DH, Waterman PD, Subramanian SV, Coull BA, Cohen B, Ostrem M, and Krieger N. 2006. “Mapping and Measuring Social Disparities in Premature Mortality: The Impact of Census Tract Poverty within and across Boston Neighborhoods, 1999–2001.” Journal of Urban Health 83(6):1063–84. doi: 10.1007/s11524-006-9089-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen Jarvis T. 2013. “Multilevel and Hierarchical Models for Disease Mapping.” Pp. 183–208 in Geographic Health Data: Fundamental Techniques for Analysis. CABI. [Google Scholar]
  10. Demidenko Eugene. 2007. “Poisson Regression for Clustered Data.” International Statistical Review 75(1):96–113. doi: 10.1111/j.1751-5823.2006.00003.x. [DOI] [Google Scholar]
  11. Diez-Roux AV 1998. “Bringing Context Back into Epidemiology: Variables and Fallacies in Multilevel Analysis.” American Journal of Public Health 88(2):216–22. doi: 10.2105/AJPH.88.2.216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Duncan Dustin T., and Kawachi Ichiro. 2018. Neighborhoods and Health. Oxford University Press; Oxford, UK: [Google Scholar]
  13. Firebaugh Glenn. 1978. “A Rule for Inferring Individual-Level Relationships from Aggregate Data.” American Sociological Review 43(4):557–72. doi: 10.2307/2094779. [DOI] [Google Scholar]
  14. Hodges James S., and Reich Brian J.. 2010. “Adding Spatially-Correlated Errors Can Mess Up the Fixed Effect You Love.” The American Statistician 64(4):325–34. doi: 10.1198/tast.2010.10052. [DOI] [Google Scholar]
  15. Kiang MV, Krieger N, Buckee CO, Onnela JP, and Chen JT. 2019. “Decomposition of the US Black/White Inequality in Premature Mortality, 2010–2015: An Observational Study.” BMJ Open 9(11):e029373. doi: 10.1136/bmjopen-2019-029373. [DOI] [Google Scholar]
  16. Krieger Nancy. 2020. “Measures of Racism, Sexism, Heterosexism, and Gender Binarism for Health Equity Research: From Structural Injustice to Embodied Harm—An Ecosocial Analysis.” Annual Review of Public Health 41(1):37–62. doi: 10.1146/annurev-publhealth-040119-094017. [DOI] [Google Scholar]
  17. Krieger Nancy. 2021. “Structural Racism, Health Inequities, and the Two-Edged Sword of Data: Structural Problems Require Structural Solutions.” Frontiers in Public Health 9. doi: 10.3389/fpubh.2021.655447. [DOI] [Google Scholar]
  18. Krieger Nancy, Chen Jarvis T., Waterman Pamela D., Rehkopf David H., and Subramanian S. v.. 2005. “Painting a Truer Picture of US Socioeconomic and Racial/Ethnic Health Inequalities: The Public Health Disparities Geocoding Project.” American Journal of Public Health 95(2):312–23. doi: 10.2105/AJPH.2003.032482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Massachusetts Department of Public Health. 2010. Health of Massachusetts. Boston, MA. [Google Scholar]
  20. National Association of County Health Officials. 2021. “Health Equity and Social Justice.” National Association of County Health Officials. Retrieved April 19, 2021 (https://www.naccho.org/programs/public-health-infrastructure/health-equity). [Google Scholar]
  21. Neuhaus John M., and McCulloch Charles E.. 2006. “Separating Between- and within-Cluster Covariate Effects by Using Conditional and Partitioning Methods.” Journal of the Royal Statistical Society: Series B (Statistical Methodology) 68(5):859–72. doi: 10.1111/j.1467-9868.2006.00570.x. [DOI] [Google Scholar]
  22. Pearce Neil. 2000. “The Ecological Fallacy Strikes Back.” Journal of Epidemiology & Community Health 54(5):326–27. doi: 10.1136/jech.54.5.326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. R Core Team. 2020. R: A Language and Environment for Statistical Computing [computer program]. Version 3.6.3. https://www.R-project.org/.
  24. Roux Ana V. Diez, and Mair Christina. 2010. “Neighborhoods and Health.” Annals of the New York Academy of Sciences 1186(1):125–45. doi: 10.1111/j.1749-6632.2009.05333.x. [DOI] [PubMed] [Google Scholar]
  25. Schmidt Alexandra M., and Nobre Widemberg S.. 2018. “Conditional Autoregressive (CAR) Model.” Pp. 1–11 in Wiley StatsRef: Statistics Reference Online. American Cancer Society. [Google Scholar]
  26. Subramanian SV, Jones Kelvyn, Kaddour Afamia, and Krieger Nancy. 2009. “Revisiting Robinson: The Perils of Individualistic and Ecologic Fallacy.” International Journal of Epidemiology 38(2):342–60. doi: 10.1093/ije/dyn359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. US Census Bureau. 2017. “Research to Improve Data on Race and Ethnicity.” The United States Census Bureau. Retrieved April 19, 2021 (https://www.census.gov/about/our-research/race-ethnicity.html). [Google Scholar]
  28. US Census Bureau. 2019. “Glossary: Census Tract.” The United States Census Bureau. Retrieved April 19, 2021 (https://www.census.gov/programs-surveys/geography/about/glossary.html). [Google Scholar]
  29. US Census Bureau. 2020. “About Race.” The United States Census Bureau. Retrieved April 19, 2021 (https://www.census.gov/topics/population/race/about.html). [Google Scholar]
  30. Walker Kyle. 2020. tidycensus: Load US Census Boundary and Attribute Data as “tidyverse” and ‘sf’-Ready Data Frames [computer program]. Version 0.9.9.2. https://CRAN.R-project.org/package=tidycensus.
  31. Young Mary L., Preisser John S., Qaqish Bahjat F., and Wolfson Mark. 2007. “Comparison of Subject-Specific and Population Averaged Models for Count Data from Cluster-Unit Intervention Trials.” Statistical Methods in Medical Research 16(2):167–84. doi: 10.1177/0962280206071931. [DOI] [PubMed] [Google Scholar]
  32. Zeger Scott L., Liang Kung-Yee, and Albert Paul S.. 1988. “Models for Longitudinal Data: A Generalized Estimating Equation Approach.” Biometrics 44(4):1049–60. doi: 10.2307/2531734. [DOI] [PubMed] [Google Scholar]
  33. Zhang H, Yu Q, Feng C, Gunzler D, Wu P, and Tu XM. 2012. “A New Look at the Difference between the GEE and the GLMM When Modeling Longitudinal Count Responses.” Journal of Applied Statistics 39(9):2067–79. doi: 10.1080/02664763.2012.700452. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Online Supplementary Materials

RESOURCES