Skip to main content
BMJ Open Diabetes Research & Care logoLink to BMJ Open Diabetes Research & Care
. 2023 Mar 1;11(2):e003120. doi: 10.1136/bmjdrc-2022-003120

Mediation of an association between neighborhood socioeconomic environment and type 2 diabetes through the leisure-time physical activity environment in an analysis of three independent samples

Katherine A Moon 1,, Cara M Nordberg 2, Stephanie L Orstad 3,4, Aowen Zhu 5, Jalal Uddin 5, Priscilla Lopez 3, Mark D Schwartz 3,6, Victoria Ryan 7, Annemarie G Hirsch 2, Brian S Schwartz 1,2, April P Carson 8, D Leann Long 9, Melissa Meeker 7, Janene Brown 7, Gina S Lovasi 7,10, Samranchana Adhikari 4, Rania Kanchi 4, Sanja Avramovic 11, Giuseppina Imperatore 12, Melissa N Poulsen 2
PMCID: PMC9980357  PMID: 36858436

Abstract

Introduction

Inequitable access to leisure-time physical activity (LTPA) resources may explain geographic disparities in type 2 diabetes (T2D). We evaluated whether the neighborhood socioeconomic environment (NSEE) affects T2D through the LTPA environment.

Research design and methods

We conducted analyses in three study samples: the national Veterans Administration Diabetes Risk (VADR) cohort comprising electronic health records (EHR) of 4.1 million T2D-free veterans, the national prospective cohort REasons for Geographic and Racial Differences in Stroke (REGARDS) (11 208 T2D free), and a case–control study of Geisinger EHR in Pennsylvania (15 888 T2D cases). New-onset T2D was defined using diagnoses, laboratory and medication data. We harmonized neighborhood-level variables, including exposure, confounders, and effect modifiers. We measured NSEE with a summary index of six census tract indicators. The LTPA environment was measured by physical activity (PA) facility (gyms and other commercial facilities) density within street network buffers and population-weighted distance to parks. We estimated natural direct and indirect effects for each mediator stratified by community type.

Results

The magnitudes of the indirect effects were generally small, and the direction of the indirect effects differed by community type and study sample. The most consistent findings were for mediation via PA facility density in rural communities, where we observed positive indirect effects (differences in T2D incidence rates (95% CI) comparing the highest versus lowest quartiles of NSEE, multiplied by 100) of 1.53 (0.25, 3.05) in REGARDS and 0.0066 (0.0038, 0.0099) in VADR. No mediation was evident in Geisinger.

Conclusions

PA facility density and distance to parks did not substantially mediate the relation between NSEE and T2D. Our heterogeneous results suggest that approaches to reduce T2D through changes to the LTPA environment require local tailoring.

Keywords: Diabetes Mellitus, Type 2; Primary Prevention; Physical Fitness


WHAT IS ALREADY KNOWN ON THIS TOPIC

  • Inequitable access to the leisure-time physical activity environment (LTPA) could explain disparities in type 2 diabetes (T2D).

WHAT THIS STUDY ADDS

  • We evaluated whether the LTPA environment mediates the association between neighborhood socioeconomic environment (NSEE) and T2D. We found little evidence that the density of PA facilities or the distance to parks mediated NSEE-T2D associations.

OW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • Approaches to reduce T2D through changes to the LTPA environment likely require local tailoring.

Introduction

Eleven per cent of US adults (37.3 million) had diabetes in 2019.1 The economic burden of diabetes—including costs directly related to diabetes care and increased risk of developing neurological, peripheral vascular, cardiovascular, renal, endocrine/metabolic, ophthalmic, and other complications—accounted for more than one in eight healthcare dollars spent in 2017.2 The American Diabetes Association recently recommended social determinants of health, including socioeconomic status (SES) and the neighborhood physical and built environment, as priorities for research and intervention to prevent diabetes.3

Epidemiologic studies have found graded associations between greater area-level socioeconomic disadvantage, despite differences in measurement, and higher risk of type 2 diabetes (T2D) across a variety of study populations and designs.4–8 In an analysis of three study samples in the USA, greater socioeconomic disadvantage was generally associated with higher relative risk of T2D across study samples in most community types ranging from rural to urban in the Diabetes Location, Environmental Attributes, and Disparities (LEAD) Network.9 Stark geographic disparities in the incidence and prevalence of T2D observed across regions, counties, and neighborhoods may be attributable to community-level structural causes.10–14

Features of the social and physical environment can influence disease risk by promoting or hindering healthy behaviors such as physical activity (PA) and via chronic psychosocial or physical stressors.15 Higher levels of leisure-time PA (LTPA) are well known to reduce the risk of T2D.16 Proximity to or greater availability of LTPA resources, such as PA facilities and parks, have been associated with increased PA17–20 and reduced T2D risk,21–23 although most studies have examined associations in urban and suburban communities, with rural communities less studied. Studies have also found that lower SES neighborhoods have lower access to parks and PA facilities,24–26 suggesting that inequitable access to LTPA resources may be one mechanistic pathway through which neighborhood SES affects T2D risk. No prior studies have formally examined whether access to parks or PA facilities mediates the association between lower neighborhood SES and higher risk of T2D.

Evaluation of mediating pathways between neighborhood SES and T2D raises several methodologic challenges. Measures of neighborhood SES and PA environment are defined and operationalized heterogeneously across the literature27 and could operate differently across the urban to rural continuum. With associations between the PA environment and risk of T2D varying across study design, populations, and geographies, as evidenced by systematic reviews,17–20 23 harmonized analytic approaches are necessary to reduce heterogeneity between studies and generate epidemiologic inferences that can inform population-level interventions and policies. In this study by the Diabetes LEAD Network, we conducted complementary analyses across three unique study samples in diverse geographies in the USA to evaluate whether lower access to neighborhood resources for LTPA, including distance to PA facilities or distance to parks, could partially mediate the overall harmful associations previously observed between poor neighborhood SES and risk of T2D.

Methods

Overall approach

The Diabetes LEAD Network is a Centers for Disease Control and Prevention (CDC)-funded research collaboration of the following academic centers: Drexel University, Geisinger and Johns Hopkins University, New York University Grossman School of Medicine, and the University of Alabama at Birmingham.28 The goal of the Network is to identify modifiable community-level determinants of T2D and cardiometabolic conditions using electronic health records (EHRs) and survey data from across the USA. Led by the Drexel University Data Coordinating Center, the Diabetes LEAD Network partners conducted analyses to reduce, where possible, between-study heterogeneity and aid interpretation of results. First, we created harmonized measurements of key neighborhood-level variables, including SES, PA, land use, racial/ethnic composition, and community type. Second, we developed complementary analysis plans to estimate mediation effects within three independent study populations, described in detail further, with different populations, study designs, and geographic contexts. Baseline address data were geocoded and used to link individuals to census tract-level community features.

Veterans Administration Diabetes Risk (VADR) cohort study

The VADR cohort study enrolled veterans in the national Veterans Health Administration EHR for primary care with sufficient residential address information within 2 years after cohort entry (1 January 2008).29 Participants were either (1) T2D-free at cohort entry with at least two T2D-free primary care visits at least 30 days apart within any 5 year period since 1 January 2003 or (2) subsequently enrolled and T2D free at cohort entry through 31 December 2016. Prevalent and incident T2D diagnoses were defined using encounter diagnoses, medication orders, and laboratory test results prior to and during the study period (online supplemental table S1). Subjects were followed from the date of cohort entry until censoring, defined as date of incident T2D, death, no encounters with the EHR for 2 years, or the end of the study (31 December 2018).

Supplementary data

bmjdrc-2022-003120supp001.pdf (356.5KB, pdf)

REasons for Geographic and Racial Differences in Stroke (REGARDS) cohort study

The REGARDS study enrolled adults age 45 years and older at baseline (2003–2007) from the contiguous USA with oversampling in the Southeast.30 This analysis included 11 208 study participants without prevalent T2D at baseline and who completed a follow-up exam in 2013–2016. Incident T2D was defined as a fasting glucose ≥126 mg/dL or random glucose ≥200 mg/dL or use of T2D medication at the follow-up exam (online supplemental table S1). Follow-up time was measured as the time between baseline and follow-up home visits.

Geisinger EHR nested case–control study

Geisinger is an integrated health system that provides primary, specialty, urgent, and emergency healthcare services at community practice clinics and hospitals in central and northeastern Pennsylvania. The design of the nested case–control study of new onset T2D in the Geisinger EHR has been described previously.31 Between 2008 and 2016, individuals with T2D (n=15 888) were identified using T2D encounter diagnoses, medication orders, and laboratory test results (online supplemental table S1). We required at least two encounters on different days with a primary care provider prior to ensure that we could detect T2D if present. To exclude prevalent T2D, we required individuals to have at least one encounter with the health system without evidence of T2D at least 2 years prior to the T2D onset date. We randomly selected with replacement five control encounters for each case (n=79 435, with 65 084 unique persons) from individuals who never met any of the criteria used to define T2D cases and frequency matched to cases on age, sex, and year of encounter.

Harmonized community type

The distributions of the primary exposure of interest, area-level neighborhood socioeconomic disadvantage, and the mediator variables of interest, density of PA facilities and distance to parks, vary considerably across urban, suburban, and rural community contexts. We assigned census tracts to one of four community type categories (higher density urban, lower density urban, suburban/small town, and rural) using a previously described modification of the US Department of Agriculture Rural-Urban Commuting Area methodology to better differentiate urban cores from surrounding non-rural areas.32 We stratified all analyses by community type due to concerns about census tract-level residual confounding and non-positivity, which can lead to bias when exposure variables do not overlap within strata of confounding variables.

Harmonized neighborhood socioeconomic environment (NSEE)

As part of the Diabetes LEAD Network, we developed a harmonized definition of area-level socioeconomic disadvantage relevant across the US, hereafter referred to as neighborhood socioeconomic environment (NSEE). We defined NSEE as a z-score sum of six census tract variables (percentage of persons with less than a high-school education, persons unemployed, households earning less than $30 000/year, population with income below poverty level, households on public assistance, and occupied housing units with no cars), based on the work of Xiao and colleagues,33 from the 2000 decennial Census and 2010 5-year American Community Survey. Census data from the year 2000 were converted to 2010 tract boundaries using the interpolation tool from Brown Universities Longitudinal Tract Data Base.34 In VADR, participants who entered the cohort before 2010 were assigned NSEE based on the 2000 Census data; otherwise, they were assigned NSEE based on 2006–2010 ACS data. In REGARDS, participants were assigned NSEE based on the 2000 Census data. In Geisinger, cases and matched controls in years 2008–2012 were assigned NSEE based on 2000 Census, while cases and matched controls in years 2013–2016 were assigned NSEE based on 2006–2010 ACS data. Higher NSEE z-scores indicated greater disadvantage. To improve interpretability of comparisons within a community type and across studies, we rescaled NSEE to range from 0 to 100 and categorized NSEE separately within each community type.

Harmonized density of PA facilities within street network buffers

We defined PA facility density as the 5-year average (ending the year prior to study entry or selection) of the number of commercial facilities for fitness or recreation per square kilometer within a street network buffer centered on the population-weighted centroid of a census tract. We obtained annual data (1998–2014) on PA facilities from the Retail Environment and Cardiovascular Disease (RECVD) study,35 which classified health-related neighborhood amenities using the National Establishment Time Series (NETS) Database. Details on classification methods and the NETS data, which was based on information collected by Dun and Bradstreet (D&B, Short Hills, New Jersey, USA), has been described elsewhere.35 The RECVD team re-geocoded the data to improve locational accuracy and recategorized facilities using Standard Industrial Classification codes and word searches or name searches to enhance classification. Using ESRI ArcGIS9.3, different street network buffers were calculated depending on the LEAD community type: 1 mile (1.6 km) walking buffer in high density urban, 2 mile (3.2 km) driving buffer in lower density urban, and 6 mile (9.7 km) driving buffers in suburban/small town and rural. Buffer sizes were selected based on prior literature measuring the commercial PA facility environment, although few past studies could inform appropriate buffer sizes in rural areas.36–40

Harmonized population-weighted distance to seven closest parks

We defined neighborhood spatial access to parks for each census tract using a population-weighted measure of the closest seven parks, as described previously.25 41 Briefly, this measure accounts for distance (miles) and size of the seven closest parks while adjusting for the heterogeneous population distribution within a census tract. Data on national, state, and local parks in 2010, provided by the CDC Division of Population Health, were derived from two sources: The Homeland Security Infrastructure Program Gold database and ESRI ArcGIS 10.1 Data DVD.

Harmonized census tract-level covariates

We measured percent Hispanic and percent non-Hispanic Black and land use environment within each census tract as potential confounders. We obtained race/ethnicity data from the 2000 and 2010 decennial census. To measure the land use environment, we used factor scores from a multiple group confirmatory factor analysis, stratified by community type, which identified a single latent variable from seven components of the built environment: average block length, average block size, intersection density, street connectivity, establishment density, percent developed land, and household density. Data were derived from ESRI 2009 Vintage Street and computed via ArcGIS Pro 2.3. Details on the land use measurement model have been described previously.42 The z-score sums within each community type were scaled to range from 0 to 100, with a higher value indicating a more walkable neighborhood.

Individual-level covariates

Individual-level covariates, including age, sex, race and ethnicity, smoking status, and available proxies for individual or family SES, were defined within each study. Age was defined at baseline in VADR and REGARDS or at the time of case or control selection in Geisinger. We considered race/ethnicity as a proxy for race-based discrimination, a social construct that is correlated with social determinants of health. Categories of race/ethnicity were defined within each study (VADR: non-Hispanic White, non-Hispanic Black, Hispanic, Asian, and other/unknown; REGARDS: non-Hispanic White vs non-Hispanic Black; Geisinger: non-Hispanic Black, non-Hispanic White, or other race/ethnicity). SES was defined by baseline data on individual disability and income in VADR (disabled, low-income/non-disabled, or neither), as baseline household income in REGARDS (<$20 000, $20 000–$34 999, $35 000–$74 999, ≥$75 000, or refused to answer) and by receipt of medical assistance prior to case or control selection (0% vs >0% of time prior to case or control selection) in Geisinger.

Statistical analysis

Using a counterfactual framework, we estimated the average natural direct effect of NSEE on risk of T2D, not operating through the two hypothesized mediators, and the average natural indirect effect of NSEE on risk of T2D operating through the LTPA environment, adjusted a priori for confounding variables at the individual-level and census tract-level (online supplemental figure S1). When interpretable (ie, when direct and indirect effects were in the same direction), we calculated the proportion mediated (ie, the proportion of the total effect operating through the indirect pathway of interest).43 No sample size calculations were conducted.

In the VADR cohort study, we first examined exposure–outcome and mediator–outcome associations using piecewise exponential survival models44 with 2-year intervals and county random effects fit in R (V.4.04) using the ‘lme4’ R package.45 County random effects were used because of convergence issues when clustering by census tract. We used generalized linear mixed effects Poisson regression models with a log link function and an offset of logarithm of time-at-risk during each interval to estimate HRs assuming a constant hazard function within intervals over time. We estimated total, direct, and indirect effects46 as differences in 2-year T2D incidence rates, with quasi-Bayesian approximation 95% CIs, at each NSEE quartile compared with the first quartile, using the ‘mediation’ R package.47 Differences in incidence rates were multiplied by 100 for interpretability.

In the REGARDS cohort study, we examined exposure–outcome associations using Poisson mixed models with robust variance estimation to account for correlation of participants within census tracts, and exposure–mediator associations using linear mixed models with robust variance estimation in R (V.4.04) using the ‘lme4’ package.45 We estimated total, direct, and indirect effects46 as differences in T2D incidence rates, with bootstrapped 95% CIs, at each NSEE quartile compared with the first quartile, using the ‘mediation’ R package.47 Differences in incidence rates were multiplied by 100 for interpretability.

In the Geisinger EHR case–control study, we estimated total, direct, and indirect effects using logistic regression models as ORs, with bootstrapped 95% CIs,48 at each NSEE quartile compared with the first quartile, in R (VV.4.04) using the ‘medflex’” R package49 in order to fit models appropriate for our study design and to use tract-level bootstrap resampling to account for clustering within census tracts.

Final models were adjusted for Network-harmonized census tract-level race/ethnicity and land use environment (factor score) and study-specific age, sex, race/ethnicity, smoking status (REGARDS and Geisinger only), and SES. We treated age as a continuous variable with linear and quadratic terms. All VADR models included linear and quadratic age terms, while REGARDS and Geisinger models included quadratic age if the quadratic term was statistically significant (p<0.05). In VADR, we did not adjust for smoking status because of the large proportion of missingness (66%) at cohort entry.

Sensitivity analyses

We conducted sensitivity analyses to evaluate the robustness of the main effects. First, we refitted mediation models for PA facility density with larger street network buffers (2 mile driving buffers in higher density urban, 6 mile driving buffers in lower density urban, and 10 mile driving buffers in suburban/small town and rural communities). Second, we removed land use environment from models because this variable could be on the causal pathway. Third, we took advantage of the national VADR cohort to restrict VADR models to census tracts shared with either REGARDS or Geisinger. Although the effects across studies cannot be quantitatively compared due to differences in study design and analytic approaches, we qualitatively evaluated the consistency (eg, presence vs absence) of mediation results. We hypothesized that comparisons between primary mediation results from REGARDS or Geisinger and the geographically restricted VADR models, which compared different populations within the same census tracts, could suggest whether the observed discrepancies in primary findings across sites could partly be attributed to differences in geographic context or in population composition. Fourth, we conducted a post hoc analysis in the REGARDS rural community model in which we treated the PA facility density mediator as a binary variable, dichotomized at the median, to investigate whether the observed associations in REGARDS rural communities of mediation via PA facility density could be due to a misspecification of the PA facility density mediator as a continuous variable.

Results

Participant characteristics

Demographic characteristics and geographic coverage of participants in the three study samples are summarized in table 1 (individual level) and table 2 (tract level). Briefly, the VADR cohort included more than 4.1 million veterans (539 369 incident T2D cases) in 71 835 census tracts across most of the continental USA. The REGARDS cohort included 11 208 participants (1409 incident T2D cases) in 7502 census tracts, of which roughly half were in the southeast. The Geisinger nested case–control study sample included 15 888 new onset T2D cases and 79 435 controls in 785 census tracts in northeastern and central Pennsylvania. The VADR and REGARDS cohorts had a median follow-up of 5.0 and 9.5 years, respectively, while the median contact in the Geisinger EHR was 11.2 years.

Table 1.

Baseline individual-level characteristics of three independent study samples in the Diabetes LEAD Network

Study VADR REGARDS Geisinger
Geographic coverage 3108 counties across USA (71 835 census tracts) 1349 counties across USA (7502 census tracts) 37 counties in Pennsylvania
(785 census tracts)
Study design Retrospective EHR-based cohort Prospective population-based cohort Retrospective EHR-based nested case–control
Study period 2008–2018 2003–2016 2008–2016
Cases Control encounters
Follow-up or contact time, years, median (IQR)* 5.0 (2.4,9.0) 9.5 (8.7, 9.9) 11.2 (7.5, 14.1) 11.2 (7.5, 14.1)
Number of participants 4 100 650 11 208 15 888 79 435
Number of new onset diabetes cases 539 369 1409 NA NA
Age, years, mean (SD) 59.4 (17.2) 63.0 (8.5) 54.9 (15.1) 54.9 (15.3)
Age, n (%)
 10–17 NA NA 230 (1.5) 1184 (1.5)
 18–34 486 304 (11.9) NA 1361 (8.6) 6771 (8.5)
 35–49 637 553 (15.6) 649 (5.8) 4038 (25.4) 20 189 (25.4)
 50–64 1 332 938 (32.5) 5884 (52.5) 6202 (39.0) 31 007 (39.0)
 65–79 1 087 570 (26.5) 4321 (38.6) 3331 (21.0) 16 654 (21.0)
 80+ 556 227 (13.6) 354 (3.2) 726 (4.6) 3630 (4.6)
Sex, female, n (%) 321 013 (7.8) 6256 (55.8) 7798 (49.1) 38 988 (49.1)
Race/ethnicity, n (%) †
 Non-Hispanic White 2 783 756 (76.3) 7534 (67.2) 15 112 (95.1) 76 971 (96.9)
 Non-Hispanic Black 584 655 (16) 3674 (32.8) 293 (1.8) 905 (1.1)
 Hispanic 189 177 (5.2) NA 369 (2.3) 1094 (1.4)
 Asian 34 838 (1.0) NA 63 (0.4) 267 (0.3)
 Other/unknown 56 804 (1.6) NA 51 (0.32) 198 (0.3)
Smoking status, n (%)†
 Current smoker 610 506 (40.3) 1245 (11.1) 3272 (20.6) 14 831 (18.7)
 Non-smoker (former or never) 902 901 (59.7) 9926 (88.9) 12 223 (76.9) 63 241 (79.7)
HbA1c, mean (SD) ‡ † 5.7 (0.6) NA 7.3 (1.9) 5.6 (0.4)

*Follow-up time (VADR and REGARDS) or contact time (Geisinger): VADR: cohort subjects were followed from the date of cohort entry until censoring defined as date of incident T2D, death, no encounters with the VA system for 2 years, or the end of the study. REGARDS: follow-up time was measured as the time between the two in-home visits. Geisinger: time from first clinical encounter to T2D onset or clinical encounter during matched encounter year.

†Missing data: VADR: race/ethnicity, 451 420 (11% missing); smoking status, 2 587 243 (63% missing); HbA1C, 2 423 788 (59% missing); REGARDS: smoking status, 37 (0.3% missing); Geisinger: HbA1C: 8391 (53% missing) cases and 75 280 (95% missing) controls.

‡All baseline clinical measures: VADR: most recent measure prior or on cohort entry date. REGARDS: at enrollment. Geisinger: closest within 1 year prior to T2D diagnosis/matched encounter date.

HbA1c, hemoglobin A1c; IQR, Interquartile range (25th, 75th percentile); NA, not applicable; REGARDS, REasons for Geographic and Racial Differences in Stroke; T2D, type 2 diabetes; VADR, Veterans Administration Diabetes Risk.

Table 2.

Baseline census tract-level characteristics of three independent study samples in the Diabetes LEAD Network

Study
VADR REGARDS Geisinger
Cases Control encounters
Higher density urban
 Number of census tracts 17 035 1318 80 99
 Number of persons 478 668 1810 1039 4121
 NSEE Q1, n (%)* (lowest disadvantage) 50 404 (10.5) 415 (22.9) 243 (23.4) 1082 (26.3)
 NSEE Q2, n (%)* 99 851 (20.9) 509 (28.1) 310 (29.8) 1142 (27.7)
 NSEE Q3, n (%)* 127 723 (26.7) 486 (26.9) 288 (27.7) 1040 (25.2)
 NSEE Q4, n (%)* (highest disadvantage) 200 273 (41.9) 400 (22.1) 189 (19.1) 857 (20.8)
 Population-weighted distance to seven closest parks, miles, median (IQR) 0.5 (0.4, 0.8) 0.5 (0.4, 0.8) 0.78 (0.5, 1.2) 0.73 (0.59, 1.0)
 PA facility density within 1 mile walking buffer, count/km2, median (IQR) 1.3 (0.6, 2.4) 0.8 (0.4, 1.5) 2.1 (1.5, 2.7) 2.2 (1.6, 2.9)
 Land use environment score, median (IQR) −0.07 (−0.59, 0.50) 0.13 (−0.50, 0.64) 0.24 (−0.34, 0.66) 0.24 (−0.38, 0.70)
 % non-Hispanic Black, mean (SD) 25.1 (31.7) 64.4 (32.9) 6.2 (7.8) 5.1 (5.9)
 % Hispanic, mean (SD) 22.7 (24.3) 8.4 (11.4) 5.8 (7.3) 5.8 (8.2)
Lower density urban
 Number of census tracts 25 630 2973 101 148
 Number of persons 1 509 042 4524 1890 8665
 NSEE Q1, n (%)* (lowest disadvantage) 597 922 (39.6) 948 (21.0) 84 (4.4) 673 (7.8)
 NSEE Q2, n (%)* 467 687 (31) 1094 (24.2) 410 (21.7) 2144 (24.7)
 NSEE Q3, n (%)* 277 240 (18.4) 1205 (26.6) 547 (28.9) 2486 (28.7)
 NSEE Q4, n (%)* (highest disadvantage) 165 515 (11) 1277 (28.2) 849 (44.9) 3362 (38.8)
 Population-weighted distance to seven closest parks, miles, median (IQR) 0.8 (0.5, 1.4) 0.8 (0.5, 1.4) 1.3 (0.66, 2.1) 1.2 (.55. 2.1)
 PA facility density within 2 mile driving buffer, count/km2, median (IQR) 0.7 (0.4, 1.1) 0.5 (0.2, 0.8) 1.0 (0.72, 1.5) 1.0 (0.72, 1.5)
 Land use environment score, median (IQR) 0.09 (−0.58, 0.68) −0.00 (−0.63, 0.65) 0.31 (−0.47, 1.02) 0.15 (−0.57, 0.93)
 % non-Hispanic Black, mean (SD) 15.4 (22.8) 43.4 (35.9) 2.6 (3.3) 2.5 (2.9)
 % Hispanic, mean (SD) 14.8 (18.1) 5.0 (7.9) 3.0 (5.3) 2.7 (4.4)
Suburban/small town
 Number of census tracts 11 743 1491 159 209
 Number of persons 919 281 2224 5009 24 886
 NSEE Q1, n (%)* (lowest disadvantage) 331 549 (36.1) 482 (21.7) 779 (15.6) 4817 (19.4)
 NSEE Q2, n (%)* 303 951 (33.1) 493 (22.2) 906 (18.1) 5140 (20.7)
 NSEE Q3, n (%)* 189 142 (20.6) 560 (25.2) 1537 (30.7) 7509 (30.2)
 NSEE Q4, n (%)* (highest disadvantage) 94 385 (10.3) 689 (31.0) 1787 (35.7) 7420 (29.8)
 Population-weighted distance to seven closest parks, miles, median (IQR) 2.0 (1.1, 3.5) 2.1 (1.2, 4.1) 2.7 (1.7, 5.6) 2.4 (1.5, 4.6)
 PA facility density within 6 mile driving buffer, count/km2, median (IQR) 0.2 (0.1, 0.4) 0.2 (0.1, 0.3) 0.19 (0.14, 0.35) 0.19 (0.15, 0.39)
 Land use environment score, median (IQR) −0.06 (−0.70, 0.54) 0.02 (−0.64, 0.57) 0.71 (−0.54, 1.85) 0.26 (−0.68, 1.82)
 % non-Hispanic Black, mean (SD) 9.5 (15.9) 28.0 (30.3) 1.9 (4.4) 1.8 (4.3)
 % Hispanic, mean (SD) 9.8 (14.4) 3.3 (5.3) 1.5 (2.6) 1.5 (2.5)
Rural
 Number of census tracts 17 427 1719 235 296
 Number of persons 1 193 659 2650 7950 41 763
 NSEE Q1, n (%)* (lowest disadvantage) 79 421 (6.7) 500 (18.9) 2256 (28.4) 13 010 (31.2)
 NSEE Q2, n (%)* 332 667 (27.9) 531 (20.0) 1802 (22.7) 9.744 (23.3)
 NSEE Q3, n (%)* 507 151 (42.5) 638 (24.1) 2470 (31.1) 12 096 (29.0)
 NSEE Q4, n (%)* (highest disadvantage) 274 277 (23) 981 (37.0) 1422 (17.9) 6931 (16.6)
 Population-weighted distance to seven closest parks, miles, median (IQR) 5.9 (3.3, 9.8) 7.0 (3.9, 11.5) 6.3 (4, 8.5) 6 (3.9, 8.1)
 PA facility density within 6 mile driving buffer, count/km2, median (IQR) 0.04 (0.01, 0.09) 0.03 (0.01, 0.08) 0.08 (0.02, 0.14) 0.08 (0.02, 0.15)
 Land use environment score, median (IQR) −0.04 (−0.53, 0.59) 0.11 (−0.42, 0.79) 0.27 (−0.11, 0.67) 0.28 (−0.10, 0.67)
 % non-Hispanic Black, mean (SD) 6.6 (13.6) 31.5 (27.5) 2.2 (4.4) 2.2 (4.3)
 % Hispanic, mean (SD) 5.8 (10.5) 2.6 (5.1) 2.2 (3.6) 2.0 (3.3)

*Time-varying NSEE assignment: VADR: patients who entered the cohort before 2010 were assigned NSEE based on the 2000 Census data; otherwise, patients were assigned NSEE based on 2006–2010 ACS data. REGARDS: all participants were assigned NSEE based on the 2000 Census data. Geisinger: new cases/matched controls in years 2008–2012 were assigned NSEE based on 2000 Census; new cases/matched controls in years 2013–2016 were assigned NSEE based on 2006–2010 ACS data.

ACS, American Community Survey; IQR, Interquartile range (25th, 75th percentile); NA, not applicable; NSEE, neighborhood socioeconomic environment; PA, physical activity; Q, quartile; REGARDS, REasons for Geographic and Racial Differences in Stroke; VADR, Veterans Administration Diabetes Risk.

The Geisinger and REGARDS study samples had approximately equal proportions of males and females. The VADR cohort, reflecting overall veteran demographics, was 7.8% female. The average REGARDS participant was slightly older compared with VADR or Geisinger participants (63.0 vs 59.4 and 54.9 years, respectively). The REGARDS cohort had a greater proportion of non-Hispanic Black participants (32.8%) compared with the VADR and Geisinger study samples (16.0% and 1.8%, respectively). Within community type, median NSEE values did not vary substantially across study samples (online supplemental figure S2). The VADR cohort, the most geographically diverse study sample, had a greater range of NSEE values and longer upper tail (worse socioeconomic disadvantage) compared with the other study samples. PA facility density generally declined from urban to rural community types, while distance to parks generally increased (online supplemental figures S3 and S4, respectively). There was greater variability in the two LTPA environment measures in the VADR cohort compared with the other study samples.

Mediation via PA facility density

We found some limited evidence for mediation of the effect of NSEE on T2D via PA facility density in the VADR cohort, although of small magnitude, and in the REGARDS cohort, while there was no evidence of mediation in the Geisinger sample (table 3). The most consistent effects were observed in rural communities, where the indirect effects were positive and strengthened across increasing quartiles of NSEE, and where the proportion mediated was 3% in VADR and 51% in REGARDS (comparing the fourth vs first quartile of NSEE). In all other community types, we observed inconsistent exposure–outcome relations and directions of the indirect effects. In suburban/small town communities, we observed negative indirect effects in VADR (ie, inconsistent mediation opposing the direct effect) and positive indirect effects in REGARDS (ie, mediation in the same direction as the direct effect), although in the REGARDS cohort, the positive significant indirect effect was limited to the NSEE second vs first quartile only. In urban communities, we observed significant, although less consistent, indirect effects in the VADR cohort only. In VADR lower density urban communities, positive indirect effects in the second and third (vs first) quartiles of NSEE represented a proportion mediated of about 1%, whereas in higher density urban communities, indirect effects were negative.

Table 3.

Total, direct, and indirect effects of higher NSEE (higher socioeconomic disadvantage) on T2D via PA facility density within street network buffers

NSEE quartiles Mediation via PA facility density (count/km2)
Total effect (95% CI) Average direct effect (95% CI) Average indirect effect (95% CI)
Higher density urban (1-mile walking buffer)
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0017 (0.0009 to 0.0024) 0.0018 (0.0010 to 0.0026) 0.0111 (−0.0165 to 0.0055)
 Q3 versus Q1 0.0023 (0.0015 to 0.0030) 0.0025 (0.0017 to 0.0032) 0.0177 (−0.0247 to 0.0105)
 Q4 versus Q1 0.0024 (0.0015 to 0.0035) 0.0025 (0.0016 to 0.0036) 0.0117 (−0.0165 to 0.0055)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 4.15 (−1.06 to 9.24) 4.24 (−0.96 to 9.41) −0.10 (−0.67 to 0.40)
 Q3 versus Q1 5.76 (0.25 to 11.05) 5.78 (0.27 to 10.94) −0.02 (−0.35 to 0.25)
 Q4 versus Q1 3.17 (−2.36 to 8.55) 3.26 (−2.25 to 8.71) −0.09 (−0.65 to 0.41)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.19 (0.94 to 1.53) 1.14 (0.90 to 1.47) 1.04 (0.97 to 1.13)
 Q3 versus Q1 1.11 (0.88 to 1.41) 1.08 (0.86 to 1.37) 1.03 (0.95 to 1.11)
 Q4 versus Q1 0.78 (0.56 to 1.11) 0.80 (0.59 to 1.09) 0.97 (0.87 to 1.08)
Lower density urban (2-mile driving buffer)
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0010 (0.0007 to 0.0012) 0.0010 (0.0007 to 0.0012) 0.0017 (0.0013 to 0.0023)
 Q3 versus Q1 0.0019 (0.0002 to 0.0022) 0.0019 (0.0016 to 0.0022) 0.0010 (0.0008 to 0.0011)
 Q4 versus Q1 0.0012 (0.0008 to 0.0015) 0.0012 (0.0009 to 0.0015) 0.0012 (−0.0015 to 0.0008)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 0.92 (−2.61 to 4.07) 0.90 (−2.62 to 4.09) 0.01 (−0.06 to 0.12)
 Q3 versus Q1 4.40 (0.49 to 7.82) 4.45 (0.49 to 7.82) −0.05 (−0.27 to 0.15)
 Q4 versus Q1 3.94 (−0.47 to 7.98) 4.00 (−0.39 to 8.18) −0.07 (−0.06 to 0.12)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.55 (1.16 to 2.05) 1.55 (1.17 to 2.04) 1.00 (0.95 to 1.05)
 Q3 versus Q1 1.67 (1.26 to 2.23) 1.68 (1.26 to 2.23) 1.01 (0.97 to 1.06)
 Q4 versus Q1 1.70 (1.29 to 2.30) 1.68 (1.27 to 2.26) 1.01 (0.97 to 1.06)
Suburban/small town (6-mile driving buffer)
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0013 (0.0009 to 0.0017) 0.0013 (0.0009 to 0.0017) −0.0035 (0.0058 to 0.0006)
 Q3 versus Q1 0.0023 (0.0020 to 0.0028) 0.0024 (0.0021 to 0.0029) 0.0085 (−0.0147 to 0.0039)
 Q4 versus Q1 0.0028 (0.0023 to 0.0032) 0.0030 (0.0024 to 0.0033) 0.0109 (−0.0157 to 0.0050)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 −0.31 (−4.81 to 3.97) −1.21 (−5.77 to 3.05) 0.90 (0.04 to 1.99)
 Q3 versus Q1 2.33 (−2.53 to 6.68) 0.77 (−4.37 to 5.39) 1.55 (−0.001 to 3.27)
 Q4 versus Q1 4.97 (−0.64 to 10.89) 2.69 (−3.33 to 8.74) 2.28 (−0.09 to 4.73)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.06 (0.90 to 1.26) 1.05 (0.91 to 1.23) 1.01 (0.97 to 1.04)
 Q3 versus Q1 1.15 (0.98 to 1.35) 1.17 (0.99 to 1.38) 0.99 (0.95 to 1.02)
 Q4 versus Q1 1.22 (1.02 to 1.45) 1.23 (1.03 to 1.47) 0.99 (0.95 to 1.03)
Rural (6-mile driving buffer)
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0007 (0.0003 to 0.0011) 0.0006 (0.0002 to 0.0011) 0.0025 (0.0011 to 0.0034)
 Q3 versus Q1 0.0011 (0.0008 to 0.0015) 0.0011 (0.0002 to 0.0011) 0.0048 (0.0028 to 0.0075)
 Q4 versus Q1 0.0020 (0.0019 to 0.0023) 0.0020 (0.0018 to 0.0022) 0.0066 (0.0038 to 0.0099)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 5.63 (0.91 to 10.59) 4.69 (0.15 to 9.49) 0.94 (0.18 to 1.90)
 Q3 versus Q1 3.55 (−1.11 to 8.25) 2.31 (−2.55 to 7.14) 1.24 (0.16 to 2.39)
 Q4 versus Q1 3.00 (−2.24 to 8.21) 1.48 (−4.35 to 6.67) 1.53 (0.25 to 3.05)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.04 (0.93 to 1.16) 1.04 (0.92 to 1.16) 1.00 (0.99 to 1.02)
 Q3 versus Q1 1.15 (1.06 to 1.26) 1.15 (1.05 to 1.26) 1.00 (0.99 to 1.02)
 Q4 versus Q1 1.17 (1.06 to 1.32) 1.17 (1.05 to 1.32) 1.00 (0.99 to 1.02)

Bolded values indicate statistically significant (p<0.05).

*For the cohort studies (VADR and REGARDS), mediation effects are presented as differences in incidence rates and rescaled by multiplying by 100 for interpretability. For the case–control study (Geisinger EHR), mediation effects are presented as ORs.

EHR, electronic health record; NSEE, neighborhood socioeconomic environment; PA, physical activity; Q, quartile; REGARDS, REasons for Geographic and Racial Differences in Stroke; T2D, type 2 diabetes; VADR, Veterans Administration Diabetes Risk.

Mediation via distance to parks

We found limited evidence for mediation of the effect of NSEE on T2D via distance to parks in the VADR cohort only (table 4), and specifically, in rural communities. In these communities, the population-weighted distance to the seven closest parks mediated a small proportion (<1%) of the total effect of NSEE on T2D comparing the NSEE fourth vresus first quartiles. In the VADR cohort, there was minimal evidence for mediation in suburban/small town and lower density communities and no evidence for mediation in higher density urban communities.

Table 4.

Total, direct, and indirect effects of higher NSEE (higher socioeconomic disadvantage) on T2D via population-weighted distance to seven nearest parks

NSEE quartiles Mediation via population-weighted distance to seven nearest parks (miles)
Total effect (95% CI) Average direct effect (95% CI) Average indirect effect (95% CI)
Higher density urban
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0018 (0.0013 to 0.0022) 0.0018 (0.0013 to 0.0022) 0.0001 (−0.0005 to 0.0006)
 Q3 versus Q1 0.0025 (0.0021 to 0.0030) 0.0025 (0.0021 to 0.0030) −0.0002 (−0.0020 to 0.0021)
 Q4 versus Q1 0.0022 (0.0010 to 0.0031) 0.0022 (0.0010 to 0.0031) 0.0000 (−0.0015 to 0.0018)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 4.12 (−0.87 to 9.15) 4.00 (−1.11 to 8.95) 0.12 (−0.10 to 0.47)
 Q3 versus Q1 5.78 (0.38 to 11.38) 5.77 (0.25 to 11.49) 0.01 (−0.25 to 0.30)
 Q4 versus Q1 3.00 (−2.69 to 8.06) 3.08 (−2.65 to 8.23) −0.08 (−0.39 to 0.12)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.19 (0.93 to 1.54) 1.15 (0.91 to 1.47) 1.03 (0.98 to 1.09)
 Q3 versus Q1 1.11 (0.89 to 1.40) 1.11 (0.90 to 1.39) 1.00 (0.95 to 1.04)
 Q4 versus Q1 0.78 (0.56 to 1.09) 0.78 (0.57 to 1.08) 1.00 (0.93 to 1.07)
Lower density urban
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0011 (0.0008 to 0.0013) 0.0011 (0.0008 to 0.0013) 0.0007 (−0.0010 to to 0.0003)
 Q3 versus Q1 0.0019 (0.0017 to 0.0022) 0.0019 (0.0017 to 0.0022) 0.0008 (−0.0022 to to 0.0002)
 Q4 versus Q1 0.0011 (0.0009 to 0.0014) 0.0011 (0.0010 to 0.0014) 0.0013 (−0.0044 to to 0.0014)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 1.15 (−2.18 to 4.43) 1.19 (−2.13 to 4.50) −0.04 (−0.15 to 0.04)
 Q3 versus Q1 4.38 (0.59 to 7.93) 4.50 (0.80 to 8.01) −0.12 (−0.43 to 0.11)
 Q4 versus Q1 3.97 (−0.16 to 7.89) 4.09 (−0.15 to 8.11) −0.12 (−0.43 to 0.14)
Geisinger nested case control (OR)*
 Q2 versus Q1 1.55 (1.16 to 2.07) 1.53 (1.17 to 2.01) 1.01 (0.93 to 1.10)
 Q3 versus Q1 1.67 (1.26 to 2.23) 1.64 (1.25 to 2.14) 1.02 (0.94 to 1.12)
 Q4 versus Q1 1.70 (1.29 to 2.26) 1.66 (1.29 to 2.17) 1.02 (0.92 to 1.13)
Suburban/small town
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0013 (0.0010 to 0.0016) 0.0013 (0.0010 to 0.0016) 0.0002 (0.0000 to 0.0003)
 Q3 versus Q1 0.0022 (0.0019 to 0.0026) 0.0022 (0.0019 to 0.0016) 0.0003 (−0.0001 to 0.0008)
 Q4 versus Q1 0.0026 (0.0019 to 0.0034) 0.0026 (0.0019 to 0.0034) −0.0002 (−0.0005 to 0.0001)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 −0.77 (−5.07 to 3.65) −0.95 (−5.13 to 3.40) 0.18 (−0.18 to 0.59)
 Q3 versus Q1 1.81 (−2.81 to 6.28) 1.48 (−3.21 to 5.86) 0.34 (−0.32 to 0.99)
 Q4 versus Q1 4.04 (−1.36 to 9.64) 3.53 (−2.01 to 9.11) 0.51 (−0.51 to 1.56)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.06 (0.90 to 1.27) 1.04 (0.89 to 1.24) 1.02 (0.98 to 1.05)
 Q3 versus Q1 1.15 (0.97 to 1.36) 1.16 (0.99 to 1.36) 0.99 (0.97 to 1.01)
 Q4 versus Q1 1.22 (1.02 to 1.45) 1.20 (1.01 to 1.43) 1.01 (0.97 to 1.05)
Rural
VADR cohort (difference in incidence rates)*
 Q2 versus Q1 0.0005 (0.0002 to 0.0008) 0.0005 (0.0002 to 0.0008) 0.0007 (0.0002 to 0.0012)
 Q3 versus Q1 0.0011 (0.0008 to 0.0014) 0.0011 (0.0008 to 0.0014) 0.0014 (0.0005 to 0.0022)
 Q4 versus Q1 0.0017 (0.0014 to 0.0020) 0.0016 (0.0014 to 0.0020) 0.0015 (0.0000 to 0.0043)
REGARDS cohort (difference in incidence rates)*
 Q2 versus Q1 5.48 (0.60 to 10.09) 6.06 (1.25 to 10.86) −0.58 (−1.37 to 0.10)
 Q3 versus Q1 3.56 (−1.06 to 7.85) 4.38 (−0.32 to 8.91) −0.82 (−1.85 to 0.07)
 Q4 versus Q1 3.18 (−1.99 to 8.69) 4.04 (−1.18 to 9.74) −0.86 (−1.84 to 0.08)
Geisinger nested case–control (OR)*
 Q2 versus Q1 1.04 (0.94 to 1.15) 1.02 (0.92 to 1.14) 1.02 (0.99 to 1.05)
 Q3 versus Q1 1.15 (1.06 to 1.26) 1.12 (1.02 to 1.23) 1.03 (0.99 to 1.07)
 Q4 versus Q1 1.17 (1.06 to 1.31) 1.14 (1.03 to 1.28) 1.03 (0.99 to 1.07)

Bolded values indicate statistically significant (p<0.05).

*For the cohort studies (VADR and REGARDS), mediation effects are presented as differences in incidence rates and rescaled by multiplying by 100 for interpretability. For the case–control study (Geisinger EHR), mediation effects are presented as ORs.

EHR, electronic health record; NSEE, neighborhood socioeconomic environment; PA, physical activity; Q, quartile; REGARDS, REasons for Geographic and Racial Differences in Stroke; T2D, type 2 diabetes; VADR, Veterans Administration Diabetes Risk.

Sensitivity analyses

In analyses using larger buffer sizes to define PA facility density, results for REGARDS, but not VADR or Geisinger, differed from the primary analysis (online supplemental table S2). In REGARDS, using larger buffers resulted in slightly larger indirect effect sizes in lower density urban, suburban/small town, and rural communities. Removing the land use environment measure from mediation models did not substantially affect the magnitude or direction of the indirect effects (results not shown). When we restricted VADR analyses to REGARDS census tracts, we no longer observed any statistically significant indirect effects via PA facility density that were present in the REGARDS suburban/small town and rural communities (online supplemental table S3). For the corresponding analysis examining mediation via distance to parks, we did not observe substantial qualitative differences compared with the main REGARDS analysis (online supplemental table S3). In VADR models restricted to Geisinger census tracts, the indirect effects via both PA facilities and distance to parks (online supplemental table S4) were not qualitatively different than the Geisinger main analysis (ie, close to null). In the sensitivity analysis examining the PA facility density mediator as a binary variable, the indirect association in REGARDS rural communities was slightly attenuated in magnitude but remained statistically significant (online supplemental table S5).

Discussion

In a causal mediation analysis of three independent study samples, with harmonized exposure and neighborhood-level confounding variables, and geographic coverage across rural, suburban, and urban communities in the US, we found little evidence that the LTPA environment mediates the association between NSEE and new onset of T2D. We observed some statistically significant mediation via PA facility density and distance to parks; however, the magnitude of the indirect effects was small, especially in the VADR cohort, and in most models, the LTPA environment mediated only a small proportion of the effect of NSEE on T2D. Overall, the observed indirect effects through the LTPA environment are of insufficient magnitude to inform policy decisions aimed at reducing the impact of NSEE on T2D. Despite the lack of evidence for policy-relevant indirect effects via the LTPA environment, our heterogeneous mediation findings by community type affirm that future research should consider community type as a key modifying factor by which neighborhood context affects T2D.

Mediation via PA facility density

In non-rural community types, findings were inconsistent in direction and study sample and provided little evidence that PA facility density explains NSEE’s harmful effects on T2D. We found some evidence for mediation of NSEE on T2D via PA facility density in rural communities—small magnitude and small proportion mediated in VADR and a relatively large proportion mediated in REGARDS—but this finding warrants caution in its interpretation. Although no prior studies of mediation of NSEE-T2D associations via access to PA facilities exist, evidence for a relation between neighborhood SES and access to PA facilities24 26 50 51 and between access to PA facilities and T2D21 23 52 support the plausibility of this mediation pathway. Given that most studies of PA facility access and T2D risk have been limited to urban and suburban areas, our study’s finding of mediation in rural communities provides some evidence that the harmful effect of NSEE on T2D risk may be in part due to low access to PA facilities in rural communities, which have fewer PA facilities compared with more urbanized areas.26 However, given that we only observed a substantial proportion mediated in one study sample and the limitations in the supporting data, this finding should be interpreted with caution. In REGARDS rural communities, the distribution of PA facility density was relatively narrow and skewed (ie, many census tracts had no PA facilities), especially among tracts with high NSEE. In addition, confounding by individual-level factors is more likely when a study population has few persons per census tract; of the three study samples, the number of persons per census tract was lowest in REGARDS. To address concerns about the skewed distribution of PA facility density, we conducted a sensitivity analysis in which PA facility density was treated as a binary, rather than continuous, mediator. Indirect effects remained statistically significant, suggesting this association was not due to misspecification of the exposure–mediator or mediator–outcome exposure–response relation. While this bolsters evidence for the mediation finding, our sensitivity analysis restricting the VADR analysis to overlapping census tracts with the other study samples suggested that the mediation effects we observed in REGARDS may have been due to differences in population composition rather than contextual differences. The lack of mediation effects observed for Geisinger remains unclear but could potentially reflect overall regional differences in T2D risk12 or the lower T2D risk in rural areas within the Geisinger study region.31

In our study’s main analysis, we used street network buffer sizes for PA facility density in the two types of urban communities that were comparable with past studies of PA facility density and T2D, which have typically used 1-mile circular buffers.21 53 With limited prior studies on which to base our selection of street network buffers in rural areas, we selected 6-mile driving buffers in suburban/small town and rural communities. In a sensitivity analysis, we found that expanding the size of the buffers used to measure PA facility density resulted in slightly larger indirect effect sizes in REGARDS in most community types, suggesting that the relevant activity space for measuring LTPA resources may be larger in less dense, car dependent communities. Alternatively, using larger buffer sizes may have captured additional PA facilities in adjacent communities, increasing the effect sizes observed for the REGARDS cohort. There is no consensus in the literature on the most relevant activity spaces for LTPA across the rural to urban continuum. Empirical studies measuring how activity spaces relevant for LTPA differ across rural, suburban, and urban areas are an important area for future research. In addition, future mediation analyses should evaluate other aspects of the LTPA environment, such as the role of quality or affordability of PA facility access.

Mediation via distance to parks

We found only a small positive indirect effect (<1% mediation) via population-weighted distance to the seven closest parks that was limited to VADR rural communities. No prior studies have evaluated whether access to parks mediates the relation between NSEE and T2D; however, mediation via park access is plausible based on evidence of associations between NSEE and park access and between park access and T2D. As previously shown, higher poverty tracts in rural areas were farther from parks—as measured by the same definition in our study, the population-weighted distance from census tract centroid to the seven closest parks—whereas in urban and suburban areas, higher poverty tracts were closer to parks.25 Although few studies have specifically examined park access, similar exposures such as greenspace and public open spaces have been associated with lower risk of T2D.54 Our measure of park access may not influence individual LTPA as we hypothesized. Despite the strengths of our distance to parks measure, which accounted for the population distribution within a census tract, the size of the parks, and was unconstrained by tract boundaries, our largely null mediation results could indicate that this may not be an appropriate measure of park access for these populations or geographies. A systematic review of the relation between proximity and density of parks and LTPA found inconsistent associations across studies, which suggests that perceived measures of park access—as opposed to objective measures used in our study—may have more relevance to LTPA.20 Other features of the LTPA environment that could be more relevant to increasing LTPA, such as affordability, features, and quality, should be examined in future studies of mediation via the LTPA environment.21 26

Strengths

This study is the first to employ a formal causal mediation framework to examine whether features of the LTPA environment mediate the observed harmful association between NSEE and T2D. The main strengths of this analysis, made possible through collaboration of the Diabetes LEAD Network, was the development of complementary analysis plans across sites, including creating harmonized analytic variables (exposure and key neighborhood-level variables) and conducting statistical modeling as consistently as possible, despite the differences in study designs. This allowed us to address often-cited limitations in the comparison of epidemiologic study findings, namely, the extent to which methodological heterogeneity contributes to lack of replication across studies. Furthermore, our study samples represented both national and region-specific contexts and a broad spectrum of community types, which allowed exploration of heterogeneity across these dimensions of interest.

Limitations

Although our methodological approach eliminated important sources of variation between the three analyses, replication in different study samples and geographies does not allow for a full disentanglement of contextual and composition effects. Our study samples differed in their distributions of race/ethnicity, sex, age, SES, veteran status, and in the geographic regions covered. Residual confounding may have remained, such as from individual-level SES, which was measured differently in the three study samples. We were unable to completely harmonize the diagnostic criteria for T2D across samples, which could have contributed to the heterogeneity of results across studies. For some community types, small sample sizes for particular combinations of NSEE quartiles and PA mediators in the REGARDS and Geisinger study samples may have constrained our ability to detect the small mediation effects in the substantially larger VADR sample, but the small effect sizes detected in the VADR cohort may not be meaningful for informing policy or interventions to reduce the impact of NSEE on T2D. The LTPA environment variables were defined from population-weighted centroids, rather than individual addresses, which may not capture the relevant activity space of participants. For causal inference, these mediation analyses assume no unmeasured exposure–outcome confounding, exposure–mediator confounding, or mediator–outcome confounding. Stratifying statistical models by community type and evaluating NSEE as within-community quartiles reduced potential bias from census tract-level residual confounding and non-positivity, but this also restricted our ability to interpret mediation effects across community types. With cross-sectional address data, we could not address possible bias from residential self-selection, where health-related behaviors and preferences predict neighborhood choice and health outcomes.

Conclusions

In the first causal mediation analysis of the NSEE-T2D association through neighborhood LTPA resources, we found little evidence for partial mediation via the density of PA facilities or distance to parks in three unique study samples. Given the small magnitude of the indirect effects observed, alongside the heterogeneity by study sample and community type, the relevance of our findings for policy or interventions to reduce T2D is not straightforward. Our differential findings by community type highlight the importance of considering community type as a key factor in the contextual impacts on T2D, and by extension, suggest that approaches to change the LTPA environment to reduce T2D will require local tailoring to maximize population health benefits. Inequities in local destinations for walking, social engagement, and PA are widening across the socioeconomic gradient and by race and ethnicity in the USA.55 Our understanding of the best approaches to measure the LTPA environment in rural communities is particularly limited. Research evaluating alternative measures of the LTPA environment is needed, particularly methods that capture features beyond proximity and using multiple spatial scales.

Footnotes

Contributors: KM:

Accepts full responsibility for the finished work and/or the conduct of the study, had access to the data, and controlled the decision to publish;

KM and SO: writing (original draft, review and editing draft), statistical analysis and interpretation of statistical analysis; CN: writing (review and editing draft), statistical analysis and interpretation of statistical analysis; AZ, JU, PL and VR: writing (review and editing draft), statistical analysis and interpretation of results; MS: writing (review and editing draft); AGH, BSS, APC, and DLL: conceptualization, funding acquisition, writing (review and editing draft), statistical analysis and interpretation of results; MM: writing (review and editing draft), data curation; JB: writing (review and editing draft); GSL: conceptualization, funding acquisition, writing (review and editing draft); SA: writing (review and editing draft); RK: writing (review and editing draft); SA: writing (review and editing draft); GI: writing (review and editing draft); MP: conceptualization; funding acquisition, writing (review and editing draft), statistical analysis and interpretation of results.

Funding: This research was conducted by the Diabetes LEAD Network, funded by the Centers for Disease Control cooperative agreements U01DP006293 (Drexel University), U01DP006296 (Geisinger-Johns Hopkins University), U01DP006299 (New York University School of Medicine), and U01DP006302 (University of Alabama at Birmingham), along with collaboration with the US CDC Division of Diabetes Translation. This research used data created by The Retail Environment and Cardiovascular Disease (RECVD) study, which was supported by the National Institute on Aging (NIA) (1R01AG049970, 3R01AG049970-04S1), Commonwealth Universal Research Enhancement (C.U.R.E) program funded by the Pennsylvania Department of Health – 2015 Formula award – SAP #4100072543, the Urban Health Collaborative at Drexel University, and the Built Environment and Health Research Group at Columbia University. The REasons for Geographic and Racial Differences in Stroke (REGARDS) project was supported by cooperative agreement U01 NS041588 cofunded by the National Institute of Neurological Disorders and Stroke and the NIA, National Institutes of Health, Department of Health and Human Service.

Disclaimer: The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.

Competing interests: DLL received investigator-initiated research support from Amgen, Inc for work unrelated to this manuscript. All other authors declare that they have no conflicts of interest.

Provenance and peer review: Not commissioned; externally peer reviewed.

Supplemental material: This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Data availability statement

Data are available on reasonable request. Data are available on reasonable request. Deidentified data are available on request with IRB approval and a data use agreement.

Ethics statements

Patient consent for publication

Not applicable.

Ethics approval

The study was approved by each institution’s institutional review boards. Geisinger Institutional Review Board (2017-0534), Drexel University Institutional Review Board (1707005552), New York University School of Medicine Institutional Review Board (i17-01428), Subcommittee for Human Studies, Research and Development Committee, Department of Veterans Affairs, New York Harbor Healthcare System (01667), University of Alabama at Birmingham (IRB-300000957). The Geisinger Institutional Review Board approved this study and waived informed consent. Data were collected for non-research purposes, and consent was waived because it would be a burden on the patient if they would have to come into a visit to review and sign a consent, as some patients are not seen regularly. It would be impractical for both the patient as well as study staff to contact thousands of patients to participate in the study. For NYU and the VADR cohort, the request for waiver or alteration of the informed consent requirement for all aspects of the study was approved. All REGARDS participants provided written informed consent and signed medical record release forms allowing REGARDS investigators to retrieve medical records for research purposes.

References

  • 1.Centers for Disease Control and Prevention . National diabetes statistics report. 2022. Available: https://www.cdc.gov/diabetes/data/statistics-report/index.html
  • 2.Yang W. Economic costs of diabetes in the U.S. in 2017. Diabetes Care 2018;41:917–28. 10.2337/dci18-0007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hill-Briggs F, Adler NE, Berkowitz SA, et al. Social determinants of health and diabetes: a scientific review. Diabetes Care 2020;44:258–79. 10.2337/dci20-0053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bilal U, Auchincloss AH, Diez-Roux AV. Neighborhood environments and diabetes risk and control. Curr Diab Rep 2018;18:62. 10.1007/s11892-018-1032-2 [DOI] [PubMed] [Google Scholar]
  • 5.Lê-Scherban F, Ballester L, Castro JC, et al. Identifying neighborhood characteristics associated with diabetes and hypertension control in an urban African-American population using geo-linked electronic health records. Prev Med Rep 2019;15:100953. 10.1016/j.pmedr.2019.100953 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Piccolo RS, Duncan DT, Pearce N, et al. The role of neighborhood characteristics in racial/ethnic disparities in type 2 diabetes: results from the Boston area community health (Bach) survey. Soc Sci Med 2015;130:79–90. 10.1016/j.socscimed.2015.01.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Richardson AS, Meyer KA, Howard AG, et al. Neighborhood socioeconomic status and food environment: a 20-year longitudinal latent class analysis among cardia participants. Health Place 2014;30:145–53. 10.1016/j.healthplace.2014.08.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Krishnan S, Cozier YC, Rosenberg L, et al. Socioeconomic status and incidence of type 2 diabetes: results from the black women’s health study. Am J Epidemiol 2010;171:564–70. 10.1093/aje/kwp443 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Thorpe LE, Adhikari S, Lopez P, et al. Neighborhood socioeconomic environment and risk of type 2 diabetes: associations and mediation through food environment pathways in three independent study samples. Diabetes Care 2022;45:798–810. 10.2337/dc21-1693 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Barker LE, Kirtland KA, Gregg EW, et al. Geographic distribution of diagnosed diabetes in the U.S.: a diabetes belt. Am J Prev Med 2011;40:434–9. 10.1016/j.amepre.2010.12.019 [DOI] [PubMed] [Google Scholar]
  • 11.Shrestha SS, Thompson TJ, Kirtland KA, et al. Changes in disparity in county-level diagnosed diabetes prevalence and incidence in the United States, between 2004 and 2012. PLoS ONE 2016;11:e0159876. 10.1371/journal.pone.0159876 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hipp JA, Chalise N. Spatial analysis and correlates of county-level diabetes prevalence, 2009-2010. Prev Chronic Dis 2015;12:E08. 10.5888/pcd12.140404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lord J, Roberson S, Odoi A. Investigation of geographic disparities of pre-diabetes and diabetes in Florida. BMC Public Health 2020;20:1226. 10.1186/s12889-020-09311-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dendup T, Feng X, Clingan S, et al. Environmental risk factors for developing type 2 diabetes mellitus: a systematic review. Int J Environ Res Public Health 2018;15:78. 10.3390/ijerph15010078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Diez Roux AV, Mair C. Neighborhoods and health. Ann N Y Acad Sci 2010;1186:125–45. 10.1111/j.1749-6632.2009.05333.x [DOI] [PubMed] [Google Scholar]
  • 16.Smith AD, Crippa A, Woodcock J, et al. Physical activity and incident type 2 diabetes mellitus: a systematic review and dose-response meta-analysis of prospective cohort studies. Diabetologia 2016;59:2527–45. 10.1007/s00125-016-4079-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kaczynski AT, Henderson KA. Parks and recreation settings and active living: a review of associations with physical activity function and intensity. J Phys Act Health 2008;5:619–32. 10.1123/jpah.5.4.619 [DOI] [PubMed] [Google Scholar]
  • 18.Orstad SL, McDonough MH, Stapleton S, et al. A systematic review of agreement between perceived and objective neighborhood environment measures and associations with physical activity outcomes. Environment and Behavior 2017;49:904–32. 10.1177/0013916516670982 [DOI] [Google Scholar]
  • 19.Bauman AE, Reis RS, Sallis JF, et al. Correlates of physical activity: why are some people physically active and others not? Lancet 2012;380:258–71. 10.1016/S0140-6736(12)60735-1 [DOI] [PubMed] [Google Scholar]
  • 20.Bancroft C, Joshi S, Rundle A, et al. Association of proximity and density of Parks and objectively measured physical activity in the United States: a systematic review. Soc Sci Med 2015;138:22–30. 10.1016/j.socscimed.2015.05.034 [DOI] [PubMed] [Google Scholar]
  • 21.Christine PJ, Auchincloss AH, Bertoni AG, et al. Longitudinal associations between neighborhood physical and social environments and incident type 2 diabetes mellitus: the multi-ethnic study of atherosclerosis (MESA). JAMA Intern Med 2015;175:1311–20. 10.1001/jamainternmed.2015.2691 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Auchincloss AH, Diez Roux AV, Mujahid MS, et al. Neighborhood resources for physical activity and healthy foods and incidence of type 2 diabetes mellitus: the multi-ethnic study of atherosclerosis. Arch Intern Med 2009;169:1698–704. 10.1001/archinternmed.2009.302 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.den Braver NR, Lakerveld J, Rutters F, et al. Built environmental characteristics and diabetes: a systematic review and meta-analysis. BMC Med 2018;16:12. 10.1186/s12916-017-0997-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Jones SA, Moore LV, Moore K, et al. Disparities in physical activity resource availability in six US regions. Prev Med 2015;78:17–22. 10.1016/j.ypmed.2015.05.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Wen M, Zhang X, Harris CD, et al. Spatial disparities in the distribution of parks and green spaces in the USA. Ann Behav Med 2013;45 Suppl 1:S18–27. 10.1007/s12160-012-9426-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Powell LM, Slater S, Chaloupka FJ, et al. Availability of physical activity-related facilities and neighborhood demographic and socioeconomic characteristics: a national study. Am J Public Health 2006;96:1676–80. 10.2105/AJPH.2005.065573 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Pickett KE, Pearl M. Multilevel analyses of neighbourhood socioeconomic context and health outcomes: a critical review. J Epidemiol Community Health 2001;55:111–22. 10.1136/jech.55.2.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hirsch AG, Carson AP, Lee NL, et al. The diabetes location, environmental attributes, and disparities network: protocol for nested case control and cohort studies, rationale, and baseline characteristics. JMIR Res Protoc 2020;9:e21377. 10.2196/21377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Avramovic S, Alemi F, Kanchi R, et al. US veterans administration diabetes risk (VADR) national cohort: cohort profile. BMJ Open 2020;10:e039489. 10.1136/bmjopen-2020-039489 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Howard VJ, Cushman M, Pulley L, et al. The reasons for geographic and racial differences in stroke study: objectives and design. Neuroepidemiology 2005;25:135–43. 10.1159/000086678 [DOI] [PubMed] [Google Scholar]
  • 31.Schwartz BS, Pollak J, Poulsen MN, et al. Association of community types and features in a case-control analysis of new onset type 2 diabetes across a diverse geography in Pennsylvania. BMJ Open 2021;11:e043528. 10.1136/bmjopen-2020-043528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.McAlexander TP, Algur Y, Schwartz BS, et al. Categorizing community type for epidemiologic evaluation of community factors and chronic disease across the United States. Soc Sci Humanit Open 2022;5:100250. 10.1016/j.ssaho.2022.100250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Xiao Q, Berrigan D, Powell-Wiley TM, et al. Ten-year change in neighborhood socioeconomic deprivation and rates of total, cardiovascular disease, and cancer mortality in older us adults. Am J Epidemiol 2018;187:2642–50. 10.1093/aje/kwy181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Logan JR, Xu Z, Stults BJIUS. Interpolating U.S. decennial census tract data from as early as 1970 to 2010: a longtitudinal tract database. Prof Geogr 2014;66:412–20. 10.1080/00330124.2014.905156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hirsch JA, Moore KA, Cahill J, et al. Business data categorization and refinement for application in longitudinal neighborhood health research: a methodology. J Urban Health 2021;98:271–84. 10.1007/s11524-020-00482-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Garg PK, Platt JM, Hirsch JA, et al. Association of neighborhood physical activity opportunities with incident cardiovascular disease in the cardiovascular health study. Health Place 2021;70. 10.1016/j.healthplace.2021.102596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Diez Roux AV, Evenson KR, McGinn AP, et al. Availability of recreational resources and physical activity in adults. Am J Public Health 2007;97:493–9. 10.2105/AJPH.2006.087734 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Boone-Heinonen J, Evenson KR, Song Y, et al. Built and socioeconomic environments: patterning and associations with physical activity in U.S. adolescents. Int J Behav Nutr Phys Act 2010;7:45. 10.1186/1479-5868-7-45 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Sallis JF, Hovell MF, Hofstetter CR, et al. Distance between homes and exercise facilities related to frequency of exercise among San Diego residents. Public Health Rep 1990;105:179–85. [PMC free article] [PubMed] [Google Scholar]
  • 40.Gordon-Larsen P, Nelson MC, Page P, et al. Inequality in the built environment underlies key health disparities in physical activity and obesity. Pediatrics 2006;117:417–24. 10.1542/peds.2005-0058 [DOI] [PubMed] [Google Scholar]
  • 41.Zhang X, Lu H, Holt JB. Modeling spatial accessibility to parks: a national study. Int J Health Geogr 2011;10:1–14. 10.1186/1476-072X-10-31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Meeker MA, Schwartz BS, Bandeen-Roche K, et al. Assessing measurement invariance of a land use environment construct across levels of urbanicity. Geohealth 2022;6:e2022GH000667. 10.1029/2022GH000667 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ananth CV. Proportion mediated in a causal mediation analysis: how useful is this measure? BJOG 2019;126:983. 10.1111/1471-0528.15691 [DOI] [PubMed] [Google Scholar]
  • 44.Friedman M. Piecewise exponential models for survival data with covariates. Ann Statist 2007;10. 10.1214/aos/1176345693 [DOI] [Google Scholar]
  • 45.Bates D, Mächler M, Bolker BM, et al. Fitting linear mixed-effects models using lme4. J Stat Softw 2015;67. 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  • 46.Imai K, Keele L, Tingley D. A general approach to causal mediation analysis. Psychol Methods 2010;15:309–34. 10.1037/a0020761 [DOI] [PubMed] [Google Scholar]
  • 47.Tingley D, Yamamoto T, Hirose K, et al. Mediation: R package for causal mediation analysis. J Stat Softw 2014;59:1–38.26917999 [Google Scholar]
  • 48.Vanderweele TJ, Vansteelandt S. Odds ratios for mediation analysis for a dichotomous outcome. Am J Epidemiol 2010;172:1339–48. 10.1093/aje/kwq332 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Steen J, Loeys T, Moerkerke B, et al. Medflex: an R package for flexible mediation analysis using natural effect models. J Stat Softw 2017;76. 10.18637/jss.v076.i11 [DOI] [Google Scholar]
  • 50.Estabrooks PA, Lee RE, Gyurcsik NC. Resources for physical activity participation: does availability and accessibility differ by neighborhood socioeconomic status? Ann Behav Med 2003;25:100–4. 10.1207/S15324796ABM2502_05 [DOI] [PubMed] [Google Scholar]
  • 51.Moore LV, Diez Roux AV, Evenson KR, et al. Availability of recreational resources in minority and low socioeconomic status areas. Am J Prev Med 2008;34:16–22. 10.1016/j.amepre.2007.09.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Auchincloss AH, Diez Roux AV, Brown DG, et al. Neighborhood resources for physical activity and healthy foods and their association with insulin resistance. Epidemiology 2008;19:146–57. 10.1097/EDE.0b013e31815c480 [DOI] [PubMed] [Google Scholar]
  • 53.Gebreab SY, Hickson DA, Sims M, et al. Neighborhood social and physical environments and type 2 diabetes mellitus in African Americans: the Jackson heart study. Health Place 2017;43:128–37. 10.1016/j.healthplace.2016.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Twohig-Bennett C, Jones A. The health benefits of the great outdoors: a systematic review and meta-analysis of greenspace exposure and health outcomes. Environ Res 2018;166:628–37. 10.1016/j.envres.2018.06.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Hirsch JA, Grengs J, Schulz A, et al. How much are built environments changing, and where?: patterns of change by neighborhood sociodemographic characteristics across seven U.S. metropolitan areas. Soc Sci Med 2016;169:97–105. 10.1016/j.socscimed.2016.09.032 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data

bmjdrc-2022-003120supp001.pdf (356.5KB, pdf)

Data Availability Statement

Data are available on reasonable request. Data are available on reasonable request. Deidentified data are available on request with IRB approval and a data use agreement.


Articles from BMJ Open Diabetes Research & Care are provided here courtesy of BMJ Publishing Group

RESOURCES