Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Jul 16.
Published in final edited form as: Soc Sci Med. 2011 Dec 14;74(4):625–636. doi: 10.1016/j.socscimed.2011.10.021

Health Disparities in Mid-to-Late Life: The Role of Earlier Life Family and Neighborhood Socioeconomic Conditions

Rucker C Johnson 1, Robert F Schoeni 2, Jeannette A Rogowski 3
PMCID: PMC4100798  NIHMSID: NIHMS345436  PMID: 22212443

Abstract

The relationship between neighborhoods of residence in young adulthood and health in mid to late life in the United States are examined using the 1968-2005 waves of the Panel Study of Income Dynamics (PSID). The sample consists of persons who were aged 20-30 in 1968 and are followed for a period of 38 years (N=2,730). Four-level hierarchical random effects models of self-assessed general health status as a function of individual, family, and neighborhood factors are estimated. Using the original sampling design of the PSID, we analyze adult health trajectories of married couples and neighbors followed from young adulthood through elderly ages to assess the magnitudes of the possible causal effects of family and neighborhood characteristics in young adulthood on health in mid to late life. Estimates suggest disparities in neighborhood conditions in young adulthood account for one-quarter of the variation in mid-to-late-life health. Living in poor neighborhoods during young adulthood is strongly associated with negative health outcomes in later life. This result is robust even in the presence of a reasonably large amount of potential unobservable individual and family factors that may significantly affect both neighborhood of residence and subsequent health status. Racial differences in health status in mid to late life are also associated with family and neighborhood socioeconomic conditions earlier in life. Three quarters of the black-white gap in health status at ages over 55 can be accounted for by differences in childhood socioeconomic status and neighborhood and family factors in young adulthood.

Keywords: USA, self-assessed health, family, neighborhood effects, life course, racial & socioeconomic disparities, multilevel modeling

Introduction

Increasingly, research on the determinants of adult and old-age health is recognizing the need to incorporate earlier life circumstances. The dearth of available longitudinal, nationally-representative US data with extensive information on socioeconomic status and health status has presented formidable challenges to understanding how early, middle, and late life factors influence the life cycle trajectory of health in this country.

We investigate the extent to which later-life health inequality in the United States is related to the different types of contexts into which individuals are born, and within which they grow up, live, and work at earlier stages of the life cycle. Unraveling the sources of racial and socioeconomic differences in adult health outcomes has eluded studies that have attempted to explain them using only individual-level factors and contemporaneous socioeconomic factors. Accordingly, this study uses data from the Panel Study of Income Dynamics (PSID), spanning nearly four decades, to investigate whether and how neighborhoods affect the aging process over the life course.

There is strong theoretical rationale supporting the hypothesis that neighborhood factors influence health, and many empirical studies have found strong associations between health and specific neighborhood variables (see Kawachi and Berkman, 2003 and Ellen et al 2001, for reviews). The critical and unresolved question is whether these health status differences by neighborhood quality can be causally attributed to neighborhood environment, or whether they are simply due to pre-existing differences between individuals living in different neighborhoods. Because differences between individuals and families in poor neighborhoods and those in affluent neighborhoods will bias estimates of neighborhood effects, it is necessary to control for such differences. This problem has been well-documented and is variously referred to in different disciplines as selection bias, omitted variable bias, confounding, or unobserved heterogeneity (Oakes, 2004; Manski, 1993).

There is little evidence in the literature about how late-life health is affected by neighborhoods of residence earlier in life in the United States. Evidence from international studies, however, demonstrate the importance of earlier-life neighborhood conditions on health in late life in such countries as England and Canada (Curtis et al., 2004; Curtis et al., 2009).

This paper exploits the unique features of the PSID to address these challenges. The analysis is carried out in three stages. We first bound the proportion of inequality in late-life health that may be attributed to child socioeconomic status and disparities in neighborhood and family characteristics during early-to-mid adulthood, using an aggregated neighborhoods estimation strategy. Second, we attempt to identify the specific childhood and young adulthood family and neighborhood-level characteristics that influence later-life health. Sensitivity analyses are then used to probe the robustness of the results for causal inference. Specifically, we use a novel empirical approach, recently proposed by Altonji et al (2005) and Krauth (2006), to test the robustness of the estimated effects of young adult neighborhood poverty to selection bias due to omitted variables. We assess how large the unobservable factors would need to be, relative to the observable factors included, to invalidate the results. Finally, in addition to addressing how neighborhoods of residence over the life course affect elderly health, our empirical analysis makes a further contribution by producing evidence on the role of neighborhood environments in contributing to socioeconomic and racial health disparities later in life.

Conceptual Framework

This paper takes a life-course perspective on health and hypothesizes that inequalities in health status are a product of the interaction between the developmental opportunities and vulnerabilities at each stage of the life course, on the one hand, with widely varying attributes of socioeconomic neighborhood and family conditions, on the other. In the life-course approach, long-term health consequences of earlier life adverse socioeconomic conditions will occur through multiple pathways. While there is no single life course approach that is widely accepted, some of the pathways that have been postulated in the literature include such concepts as “latency”, “cumulative”, and “pathway” processes. For instance, latency refers to resultant effects of adverse conditions early in life that cause the body’s physiology and metabolism to be fundamentally altered, with some of the consequences of these changes manifesting in visible health problems much later in life (Barker, 1998). Alternately, the cumulative process explanation posits that persistent exposures to disadvantaged neighborhood and family conditions may have a cumulative toll in the form of “weathering” (Geronimus, 1996). This conceptual framework is consistent with a stress and adaptation perspective on how neighborhood conditions may influence health trajectories. Finally, pathway processes refer to linkages between health and socioeconomic attainment where childhood socioeconomic and family factors affect adult socioeconomic outcomes such as education, which in turn influences health later in life (Kuh and Wadsworth, 1993). An additional pathway process may arise because socioeconomic attainment in adulthood affects the quality of residential neighborhood and therefore health. Latent, cumulative, and pathway explanations for the linkages between health and socioeconomic status are not mutually exclusive and may produce synergistic influences.

We extend the SES-health literature and the literature on racial disparities in the United States by increased attention to both family- and neighborhood-level processes. It would be an atomistic fallacy to assume that individual-level processes operate the same at the neighborhood level. We hypothesize that neighborhood SES conditions may influence health and well-being over and above the independent influence of individual and family-level SES factors. Living in a neighborhood with concentrated poverty may have consequences above and beyond those of growing up in a poor family because of social isolation, crime, weakened social institutions, unrelenting stress, inferior health care accessibility, and other factors. Corroborating evidence has shown that community-wide job losses induce community-wide impacts beyond the individuals that lose jobs; i.e., estimates reflect negative effects on workers and families who maintain employment but are impacted by their friends’ and neighbors’ loss of employment and the resulting changes to their communities (Ananot et al., 2011). Aggregate changes in family stress and socioeconomic resources may also help explain the effects of community economic conditions on subsequent health. While separately identifying how latent, cumulative, and other pathway effects may combine with current circumstances to impact later-life health is beyond the scope of this paper, the underlying framework informs the modeling approach taken in our empirical work which takes a life course perspective.

Methodological Challenges in Estimating Neighborhood Effects

The primary methodological challenge in estimating the causal effects of neighborhoods on health status is that unobserved factors that affect health may also be correlated with neighborhood factors, leading to biased estimates of neighborhood effects. The problem of endogeneity of residential location (in the form of “selection bias”) is a paramount concern because families choose where to live in part based on the characteristics they value (although factors such as racial discrimination and exclusionary zoning may constrain these decisions). Thus, families who care more about investing in health-promoting activities may be less likely to choose to reside in a community with a poor health care system or high pollution. Many of the multidimensional aspects that influence residential location decisions cannot be easily measured, which makes it particularly difficult to disentangle the causal influences of individual-, family-, and neighborhood-level factors.

Most health outcomes are likely to be a product of cumulative exposures to environments spanning decades or exhibit long latent periods before problems manifest. A problematic characteristic of many neighborhood studies is that they examine the relationship between contemporaneous health and neighborhood conditions without regard to changing circumstances during the life cycle or the persistence of exposure to neighborhood conditions. If exposure to neighborhood conditions at an earlier stage of the life cycle influences current health but is not accounted for, then the estimated relationship between contemporaneous neighborhoods and health may be misleading. Moreover, the typical methods used to address endogeneity in observational studies (e.g., instrumental variables and fixed effect approaches) (Wooldridge, 2002) have significant limitations in this context. Because most methods for overcoming endogenous residential location are based on small, short-run changes in the neighborhood environment, these approaches might be limited to uncovering effects only for rapidly-responding intermediate outcomes such as health behaviors (e.g., smoking/drinking, exercise/diet). An additional issue is that neighborhood variables change slowly over time (based on our own analysis and Kunz et al., 2003), implying that much of the measured year-to-year variation may be measurement error.

The most powerful way to address selection is through a randomized trial. But an experimental design where neighborhoods are randomly assigned is rare. A significant exception is the evaluation of the Move to Opportunity (MTO) program, where an experimental design is used to estimate the effects of offering housing assistance that allows individuals to move out of low-income, poor neighborhoods. Several papers demonstrate that MTO had beneficial effects on child and adult health (Katz, Kling, Liebman, 2001; Leventhal and Brooks-Gunn, 2003).

Data

The PSID began interviewing a national probability sample of families in 1968 and re-interviewed them in each year through 1997, when interviewing became biennial. All individuals are followed until death or attrition, and data through the 2005 wave is utilized in the analysis. The PSID maintains extremely high wave-to-wave core re-interview response rates of 95-98%. Studies have concluded that the PSID remains representative of the U.S. population (Fitzgerald et al, 1998a,b; Lillard and Panis, 1998; Becketti et al, 1988; Andreski, McGonagle, and Schoeni, 2009).

In 1968 the PSID used a cluster sample design which today provides the unusual opportunity to examine health outcomes for adults who were neighbors in 1968. Although the initial sample was clustered in 1968, PSID families were located in 2,570 different census tracts in 2005 because families and individuals moved throughout the nation. The major advantage of the PSID for our purposes is that it offers the potential for constructing earlier-life neighborhood effects, coupled with rich detail on early-life and mid-life socioeconomic data, and data on dimensions of health status over time.

The selected sample consists of PSID original sample members born between 1928 and 1948, which consists of adults who were in their 20s and 30s in the first wave of interviewing in 1968. We then obtain all available information on these individuals for each wave, 1968 to 2005. Therefore, by 2005 the oldest person in the sample is 77 and the youngest is 57. Due to the complexity of the health status changes for women during the childbearing years, we exclude observations for women in the years they were pregnant.

To increase the sample size as well as the proportion of poor and black families in the sample, we include both the Survey Research Center (SRC) component and the Survey of Economic Opportunity (SEO) component, commonly known as the “poverty sample,” of the PSID. We appropriately apply multi-level sample weights at the neighborhood and family levels to produce nationally-representative estimates. Estimates that exclude the SEO sample are similar to those reported in the paper but have much less precision. The sample consists of 36,121 person-year observations from 2,730 individuals in 1,894 families, 1,457 neighborhoods, and 306 US counties. The mean age is 53, with age ranging from 37 to 77, and an average of 13 observations per person. A total of 865 families contained married couples, and a total of 307 neighborhoods contained at least two different unrelated families.

Health outcome

The health outcome examined is self-assessed general health status (GHS) based on the question asked of household heads and spouses (if present) at each wave between 1984 and 2005: “Would you say your health in general is excellent, very good, good, fair, or poor?” This question also was asked of all family members in 1986. GHS is highly predictive of morbidity measured in clinical surveys, and is a powerful predictor of mortality, even when controlling for physician-assessed health status and health-related behaviors (Benyamini et al, 2000).

Differentials in the likelihood of reaching old age are important in understanding the health differences observed in old age (Johnson, 2008). The first wave of PSID interviewing in 1968 included 4,354 individuals who were in their 20s and 30s; 321 of these individuals died by 2005; 2,871 had at least one valid report of health status in mid to late adulthood. These individuals are included in the analyses for the years they are observed alive. Johnson (2009) analyzes the effects of family and neighborhood conditions earlier in life on adult mortality in mid-to-late life. The results suggest that any selective attrition with respect to mortality is likely to lead to an understatement of the impact of adverse conditions in childhood and young adulthood on health status in older adulthood because those who suffered premature death disproportionately grew up in the more disadvantaged family and neighborhood environments.

In order to scale the GHS categories, we use the health utility-based scale that was developed in the construction of the Health and Activity Limitation index (HALex) and used by the National Center for Health Statistics to estimate health-related quality of life measures (Healthy People 2000). We estimate all of the regression models of health status using the interval regression method. Using a 100-point scale where 100 equals perfect health, the interval health values associated with GHS (based on HALex) that are used in this paper are: [95, 100] for excellent, [85, 95) for very good, [70,85) for good, [30,70) for fair, and [1,30) for poor health. While the HALex approach with interval regressions is superior to alternatives (see Electronic Appendix for further details INSERT LINK TO THIS ONLINE APPENDIX), we have also estimated identical models to those reported in the tables but using poor/fair health as the dependent variable in a multi-level logit model. The substantive conclusions are unchanged.

Childhood factors

The measures of childhood socioeconomic status come from retrospective self-reports of childhood conditions collected in the early years of the PSID. These measures include childhood poverty (poor and non-poor), mother’s and father’s education (high school dropout, high school graduate and college educated), father’s occupation (white collar and blue collar), region of birth, and the type of community the individual grew up in (farm, large urban area, small town or different places). Additional childhood factors include year of birth, birth order, gender, and race (black and white). The accuracy of retrospective reports of childhood conditions have been examined and found to be reliable (Haas, 2007; Smith, 2009).

Young adult family/individual factors

Socioeconomic and health insurance factors in young adulthood are represented by several variables: average annual family income-to-needs ratio during that period 1967-1971 (ratio<1.0, ratio 1.0-3.0, and ratio>3.0), educational attainment (high school dropout, high school graduate, some college, and college graduate or higher), no private health insurance coverage at any point 1968-1972, cigarette smoking at any point 1968-1972, and annual expenditures on alcohol consumption 1968-1972.

Young adult neighborhood factors

A key aspect of the data is that each individual is geocoded to the census block of residence. We define the neighborhood of residence during young adulthood as the census block where the respondent lived in 1968. In the original wave of the PSID, a represented census block contains four sampled families, on average. The PSID cluster design is discussed in greater detail in Solon et al (2000). We utilize information on neighborhood characteristics from respondent self-reports in the survey and merged on neighborhood-level variables from the 1970 Decennial Census. The Census-based variables include the neighborhood poverty rate (as defined by the US federal poverty line), and the black-white dissimilarity index as a measure of racial residential segregation in the metropolitan area. We classify neighborhoods with low poverty levels as those in which less than 10% of the households are poor; neighborhoods with high poverty levels are those in which more than 30% of the households are poor; and neighborhoods with medium levels of poverty are those in which 10-30% of households are poor.

The self-reports of housing/neighborhood conditions, which were only collected in 1975, include: whether live in public subsidized housing; poor neighborhood for children, whether there exist plumbing problems, housing structural problems, security problems, cockroach or rat problems, insulation problems, neighborhood cleanliness problems, overcrowding, noise, or traffic problems, burglary, robbery, assault, drug use, or problems related to having too few police. We use these 1975 measures to proxy 1968 neighborhood characteristics, with evidence that 1968 families tended to move to neighborhoods that had observable neighborhood characteristics that were similar to their previous residential location (based on our own examination and evidence from Kunz et al, 2003). We find in our sample that individuals’ observable neighborhood characteristics exhibit a great deal of persistence across years during young adulthood. This suggests that the neighborhood an individual inhabits at a particular time in young adulthood is a reasonable proxy for one’s long-run environment during that life stage, and that relying on such a proxy produces only modest measurement error biases. Based on this information, we conclude that the neighborhood measures that we use are accurate indicators of the quality of the 1968 neighborhood where respondents lived.

Most prior studies that use survey responses of neighborhood conditions characterize neighborhood features by relying on only a single individual’s report of what happens in his/her neighborhood. A more reliable approach to measuring neighborhood socioeconomic conditions is to aggregate the reports of multiple respondents living in the same neighborhood (Kawachi and Berkman, 2003). Thus, the approach taken in this analysis to produce more reliable and valid measures of neighborhood characteristics is to exploit the PSID survey responses about neighborhood conditions from multiple respondents in the same neighborhood and use their collective assessment to build neighborhood indicators, as opposed to a strategy of constructing neighborhood indicators based on a single individual’s report of neighborhood characteristics due to same source bias. The mean number of sample families per neighborhood was 4 (ranging from 1 to 32).

Accordingly, a neighborhood with high reported crime levels is defined as one in which the average response among neighboring PSID households is a report of major crime-related problems (e.g., security problems, burglary, robbery, assault, drug use, or problems related to having too few police). Similarly, a “neighborhood housing quality index” is constructed based on the average response of the presence of housing insulation, plumbing, and/or housing structural problems, among all PSID households who live in the same neighborhood. We also make use of a unique set of indices, measured in young adulthood, that capture aspirations/motivation, long-term planning and connectedness to informal sources of help that were collected in the early years of the PSID. The latter measures social supports and social capital. Neighborhood-level measures were obtained by computing the average index scores, respectively, based on responses among neighboring PSID households. This survey information is used along with the 1970 census tract based measures of the neighborhood poverty rate. Appendix Table A0 lists the sources and years of all data elements along with details of the PSID survey questions used to construct these measures. Appendix Table A1 contains descriptive statistics for all childhood and young adult family and neighborhood measures for our sample by race.

Ideally, we would also want to incorporate information on detailed measures of the neighborhoods in which individuals grew up, but this information is not available. However, in a subset of analyses, we make use of PSID residential mobility histories over the 1968-2005 period, merged with neighborhood-level variables from the 1970, 1980, 1990 and 2000 Decennial Census, to construct contemporaneous measures of the neighborhood poverty rate and the duration of exposure to concentrated poverty.

METHODS

Outline of empirical approach

The first goal of the analysis is focused on an overall assessment of the relative contributions of individual, family and neighborhood influences during childhood and young adulthood on health in mid-to-late life. The overall scope of both young adult family and neighborhood factors on later-life health (implied by the spousal and neighbor correlations in health, reported in Table 2) provide the impetus for further investigation of what aspects of young adult family and neighborhood features influence subsequent health trajectories. We then analyze the relative contribution of a rich array of measured childhood socioeconomic conditions, young adult family and neighborhood, and individual covariates to the total variation from each component, and test hypotheses about the effects of specific characteristics of families and neighborhoods (reported in Table 3). Upon discovering a deleterious relationship between living in concentrated neighborhood poverty during young adulthood and health status in older ages, we probe the robustness of this finding for causal inference. In particular, we employ an innovative empirical approach, recently proposed by Altonji et al. (2005), to gauge how sensitive estimates of the effects of neighborhood poverty are to selection on unobserved variables. The results reveal that even a large amount of selection on unobservable factors does not eliminate the significant effect of neighborhood poverty on health status later in life (reported in Table 4). Finally, we use the estimated models to assess the extent to which earlier life neighborhood and family SES conditions can explain the significant racial health disparities observed at older ages (reported in Table 3). We discuss the methods used for each of these stages of the analysis in turn.

Table 2.

Spousal and Young Adult Neighbor Correlations in Later-Life Health Status

All Adulthood yrs (35+) Age 35-55 Over 55

Spousal
Correlation
Young Adult
Neighbor
Correlation
Spousal
Correlation
Young Adult
Neighbor
Correlation
Spousal
Correlation
Young Adult
Neighbor
Correlation
Unconditional 0.4598
(0.0067)
0.3313
(0.0077)
0.4718
(0.0068)
0.3485
(0.0085)
0.3771
(0.0089)
0.3060
(0.0089)
Adjusted*
(net of residential sorting of HHs w/similar family
bckgrd)
-- 0.2677
Conditional, control for childhood SES 0.3605
(0.0075)
0.1988
(0.0083)
0.3739
(0.0076)
0.2116
(0.0098)
0.2593
(0.0098)
0.1592
(0.0095)
Conditional, control for childhood + young adult
family/neighborhood factors
0.2760
(0.0080)
0.1287
(0.0079)
0.3003
(0.0081)
0.1409
(0.0094)
0.1658
(0.0104)
0.0880
(0.0093)
*

To compute the adjusted neighbor correlations, we first estimated within-neighborhood estimates of the effects of family income, education, race, family structure, health insurance coverage, health behaviors, connectedness to informal sources of help and housing quality on health in adulthood. Then, we used the within-neighborhood estimates of the later-life health effects of this array of family characteristics to assess how much of the raw neighbor correlation is due to young adult neighbors having similar (observable) family characteristics as opposed to neighborhood effects per se . The estimation procedures are described in detail in the methods section of the paper. (The full model results used to compute the adjusted neighbor correlations are not shown to conserve space, but are available from the authors upon request).

Table 3.

General Health Status in Mid to Late Life

Age 35 - 55 Over 55

Controls for
Childhood SES +
Young Adult
Family
Controls for
Childhood SES +
Young Adult
Nhood + Fam
Controls for
Childhood SES +
Young Adult
Backgrd +
Contemporaneous
NHood
Controls for
Childhood SES +
Young Adult
Family
Controls for
Childhood SES +
Young Adult Nhood
+ Fam
Controls for
Childhood SES +
Young Adult Backgrd
+ Contemporaneous
NHood
Childhood factors
Black
  Non-Hispanic white (reference category)
−6.4238***
(0.3035)
−5.2458***
(0.3210)
−5.3266***
(0.3218)
−5.2408***
(0.3851)
−3.6323***
(0.4049)
−3.8721***
(0.4058)
Childhood poverty
  Non-poor (reference category)
−0.9583***
(0.1670)
−0.8027***
(0.1662)
−0.8010***
(0.1663)
−3.0631***
(0.2197)
−2.9577***
(0.2193)
−2.9434***
(0.2196)
Mother’s education:
 High school dropout
  High school dropout
−1.8096***
(0.1660)
−1.8141***
(0.1659)
−1.8138***
(0.1660)
−2.3982***
(0.2227)
−2.4046***
(0.2241)
−2.4144***
(0.2244)
 College educated 1.6477***
(0.2504)
1.9316***
(0.2487)
1.8945***
(0.2489)
0.9967***
(0.3407)
1.2704***
(0.3399)
1.2114***
(0.3404)
Father’s education:
 High school dropout
  High school graduate (reference category)
−2.0027***
(0.1785)
−1.9255***
(0.1773)
−1.8857***
(0.1774)
−1.6665***
(0.2414)
−1.7798***
(0.2405)
−1.7860***
(0.2409)
 College educated −0.5665**
(0.2626)
−0.3117
(0.2608)
−0.2771
(0.2611)
0.3004
(0.3593)
0.1198
(0.3574)
0.1359
(0.3578)
Father’s occupation:
 White collar
 1998Blue collar (reference category)
0.2164
(0.2441)
0.3893+
(0.2427)
0.3786+
(0.2429)
0.4631+
(0.3334)
0.7858**
(0.3324)
0.7271**
(0.3329)
Grew up on farm
  Grew up in large, urban MSA (ref. category)
−0.9223***
(0.2153)
−1.0149***
(0.2184)
−1.1212***
(0.2187)
0.8425***
(0.2743)
0.7363***
(0.2764)
0.6976**
(0.2768)
Grew up in small town −1.6966***
(0.1879)
−1.6357***
(0.1885)
−1.7062***
(0.1886)
−1.6318***
(0.2407)
−1.7885***
(0.2419)
−1.7948***
(0.2422)
Grew up in different places 1.3571*
(0.7486)
2.1853***
(0.7418)
2.1411***
(0.7421)
−0.8392
(0.9055)
0.1602
(0.8995)
0.1722
(0.9004)
Young Adulthood factors
Family income-to needs ratio
avg during 1967-1972), spline:
 Income-to-needs ratio*ratio is <1 12.6425***
(1.1252)
12.0062***
(1.1222)
12.0201***
(1.1229)
14.4338***
(1.5567)
15.1875***
(1.5577)
15.4341***
(1.5593)
 Income-to-needs ratio* ratio is 1 to 3 2.2339***
(0.1510)
1.6325***
(0.1515)
1.6691***
(0.1516)
2.6004***
(0.1937)
2.0381***
(0.1938)
2.0775***
(0.1940)
 Income-to-needs ratio* ratio is >3 0.1179*
(0.0673)
0.0799
(0.0666)
0.0822
(0.0667)
0.2545***
(0.0844)
0.2012**
(0.0837)
0.2175***
(0.0838)
Educational attainment:
 High school dropout
  High school graduate (reference category)
−6.4392***
(0.2142)
−6.1404***
(0.2136)
−6.1589***
(0.2138)
−8.0484***
(0.2871)
−7.2543***
(0.2881)
−7.2735***
(0.2885)
 Some college 0.4847**
(0.1964)
0.6371***
(0.1957)
0.6620***
(0.1958)
0.9893***
(0.2626)
1.1771***
(0.2628)
1.1769***
(0.2632)
 College graduate or higher 3.7414***
(0.2039)
4.0982***
(0.2026)
4.1290***
(0.2028)
5.0618***
(0.2717)
5.4568***
(0.2715)
5.4627***
(0.2718)
No Private HI coverage, 1968-1972 −2.3005***
(0.1696)
−1.7638***
(0.1691)
−1.7383***
(0.1692)
−3.2845***
(0.2183)
−2.5922***
(0.2192)
−2.6260***
(0.2194)
Smoked cigarettes at some point, 1968-1972 −2.4748***
(0.1747)
−2.2742***
(0.1737)
−2.2738***
(0.1738)
−2.7553***
(0.2195)
−2.6894***
(0.2180)
−2.7168***
(0.2182)
Annual alcohol expenditures (in $100’s),
 5-year average 1968-1972
0.0185+
(0.0122)
0.0052
(0.0122)
0.0050
(0.0122)
0.0341**
(0.0155)
0.0198
(0.0155)
0.0197
(0.0155)
Young Adult Neighborhood factors
Neighborhood poverty rate (1970), spline:
  Low poverty neighborhood (reference category)
 Medium poverty neighborhood −2.2588***
(0.2752)
−2.3401***
(0.2759)
−2.7542***
(0.3558)
−3.0075***
(0.3568)
 Neighborhood poverty rate - 20)* rate is 10 to 30% −1.1892***
(0.3765)
−1.2353***
(0.3772)
−1.4913***
(0.4985)
−1.7541***
(0.4996)
 High poverty neighborhood −0.5325
(0.4500)
−0.5253
(0.4507)
−6.3390***
(0.5921)
−6.4341***
(0.5929)
Neighborhood crime problem −0.6165**
(0.2918)
−0.6449**
(0.2920)
−1.2592***
(0.3713)
−1.2929***
(0.3717)
Residential segregation dissimilarity index, 1970 (MSA) −2.0344+
(1.4509)
−2.4281*
(1.4528)
−4.9635***
(1.8422)
−5.2839***
(1.8447)
Neighborhood housing quality index −1.1363***
(0.0538)
−1.1406***
(0.0538)
−0.7885***
(0.0724)
−0.7852***
(0.0725)
Neighborhood connectedness to informal sources of help 0.5998***
(0.0589)
0.6015***
(0.0589)
0.7270***
(0.0750)
0.7260***
(0.0751)
Average Aspirations index in neighborhood 0.4689***
(0.0530)
0.4689***
(0.0531)
0.5967***
(0.0677)
0.5995***
(0.0678)
Contemporaneous Neighborhood
Neighborhood poverty rate 0.0714**
(0.0322)
0.5512***
(0.0459)

Random Effects, Unmeasured (Std Dev)
Young Adult Neighborhood component 5.3427***
(0.1794)
5.1491***
(0.1766)
5.1865***
(0.1760)
5.2475***
(0.2416)
4.8412***
(0.2576)
4.8621***
(0.2575)

Family component 5.6989***
(0.1948)
5.4752***
(0.1954)
5.4415***
(0.1972)
4.8759***
(0.3473)
4.5505***
(0.3707)
4.5092***
(0.3749)

Individual component 11.5072***
(0.0711)
11.4734***
(0.0705)
11.4871***
(0.0706)
14.8695***
(0.1040)
14.9012***
(0.1043)
14.9303***
(0.1045)

Transitory error component 6.7333***
(0.0082)
6.7333***
(0.0082)
6.7304***
(0.0082)
7.7175***
(0.0122)
7.7177***
(0.01218)
7.7130***
(0.0122)

Log-likelihood −1333818.6 −1333338.8 −1333230.3 −871477.14 −871153.91 −871078.29

Number of counties 306 306 306 306 306 306
Number of neighborhoods 1,416 1,416 1,416 1,129 1,129 1,129
Number of families 1,827 1,827 1,827 1,460 1,460 1,460
Number of individuals 2,625 2,625 2,625 2,019 2,019 2,019
Number of person-year observations 22,766 22,766 22,766 13,355 13,355 13,355
***

p<0.01

**

p<0.05

*

p<0.10

Note: All models include a constant and controls for age, age squared, age cubed, year of birth, gender, birth order, region of birth, and indices intended to capture long-term planning horizon (coefficients suppressed to conserve space).

Table 4.

Estimated Effect of Living in High Poverty Neighborhood during Young Adulthood on Later-life Health for a Proportional Correlation Model with Varying Values of the Relative Correlation

Relative
Correlation
Estimated Effect of
High Poverty Neighborhood during
Young Adulthood
(reference cat: Low Poverty Neighborhood)
0 (exogeneity) −4.7344***
(0.1198)
0.2 −5.1847***
(0.2804)
0.4 −2.4631***
(0.2194)
0.8 −13.1834***
(0.2136)
1 −18.6636***
(0.7760)

First, we estimated models based on the aggregated neighborhoods approach (Solon et al, 2000) in order to create an upper bound for the size of the neighborhood effect. The basic idea is to estimate the correlation of health status later in life of adults who were neighbors earlier in life. We assess the extent to which health status is correlated among neighbors above and beyond the correlation that arises due to family effects (i.e., net of the resemblance because neighbors have similar family characteristics due to residential sorting). Because all 1968 family members within a given family are followed throughout their lives, the correlation among neighbors can be adjusted to account for family-specific factors. This approach avoids the difficulty of defining neighborhood quality at the outset, and instead asks: relative to the correlation among spouses, do adult neighbors have highly correlated health status? The comparison of spousal correlations with young adult neighbor correlations in mid-to-late life health allows an assessment of the relative magnitudes of the effects of the neighborhood environment in adulthood versus family characteristics in adulthood, placing an upper bound on the neighborhood influence. Small adult neighbor correlations would indicate that adult neighborhood factors can explain only a minor portion of the variation in later-life health outcomes. Large neighbor correlations would leave open the possibility that neighborhoods contribute significantly to inequality in health outcomes, and further analyses of the effects of particular neighborhood characteristics would be warranted.

We begin by estimating the four-level hierarchical random effects model (Kawachi and Berkman, 2003; Rabe-Hesketh and Skrondal, 2008; Raudenbush and Bryk, 2002) given by

Htsfn=(β0000+β1000Aget)+(η000n)+(ϕ00fn)+(δ0sfn)+εtsfn) (1)

We estimate these models separately at middle age (ages 35-55) and later-life (ages over 55), in order to gain greater insight into the extent to which family and neighborhood factors earlier in life influence the trajectory of health in mid-to-late life. These unconditional baseline models also include controls for year of birth, quadratic terms for age (suppressed in the above notation), and gender. The indices t, s, f, and n denote time, individuals, families, and neighborhoods, respectively, where there are

t=1,2,,Osfnobservations over time of individualsin familyfin neighborhoodn;s=1,2,,Sfnindividuals in familyfin neighborhoodn;f=1,2,,Fnfamilies in neighborhoodn;n=1,2,,Nneighborhoods.

The neighborhood-, family-, and individual-level random effects capture unobserved characteristics of the neighborhood, family, and individual. The neighborhood random intercept coefficient is represented by η000n ; the family random intercept coefficient is represented by ϕ00fn; the individual random intercept coefficient is represented by δ0sfn; andεtsfn represents the individual transitory component of self-reported health (which includes measurement error). Each of these random effects are assumed to be normally distributed with a mean of 0, and var(η000n)=σ0n2, var(ϕ00fn)=σ0fn2, var(δ0sfn)=σ0sfn2 and var(εtsfn)=σtsfn2. Aget is the individual’s actual age at time t centered around the mean age in the sample. All standard errors are Huber-corrected, clustered on county (Wooldridge, 2002; Rabe-Hesketh and Skrondal, 2008).

Since neighborhoods are nested within counties, we also estimated five-level hierarchical models, where the hierarchical levels represented counties, neighborhoods, families, and individuals over time. Those models were estimated as a robustness check to ensure that the young adult neighborhood random effects components were not primarily driven by effects operating at higher geographic levels of aggregation. However, those models did not significantly improve the fit and the between-county random effects component was not statistically significant, which supports the use of the four-level hierarchical model.

Of primary interest is the decomposition of the variance of the level of health in mid-to-late life into their within-family, between-family within-neighborhood, and between-neighborhood components. In this model, individuals from the same neighborhood but not in the same family (i.e., neighbors) are correlated because they share the neighborhood random effect η000n, and married couples are correlated because they share the neighborhood and family random effects η000n and ϕ00fn. We evaluate the health correlation between spouses at the same age, and evaluate the health correlation between neighbors at the same age. In this model, the spousal correlation and neighbor correlation in the level of health can be computed, respectively, as:

ρspousal,healthlevel(age)=(σ0n2)+(σ0fn2)(σ0n2)+(σ0fn2)+(σ0sfn2)ρneighbor,healhlevel(age)=(σ0n2)(σ0n2)+(σ0fn2)+(σ0sfn2)

The spousal correlation is between Hsfn and Hsfn, evaluated at the same age; the neighbor correlation is between Hsfn and Hsfn, evaluated at the same age. Our interest is in the permanent (rather than the transitory) component of health, so we do not include the temporal variation of health in the denominator.

We then use the estimated spousal and young adult neighbor correlations in health at mid-life and later-life to construct an age-profile of spousal and neighbor health correlations. This enables us to better assess the later-life health consequences of neighborhood and family influences experienced earlier in the life cycle.

“Adjusted Neighbor Correlations”

We then estimate “adjusted neighbor correlations”, which are net of the similarity arising from young adult neighbors having similar observed family characteristics. To extract the impact of similar family characteristics out of the neighbor correlation, we first estimate the following regression, where for ease of exposition we omit the random effects terms that are included in the estimated model:

Htsfn=α0agetsfn+α1gendersfn+α2Xfn+α3(Xn¯)+εtsfn, (2)

where X••fn is the vector of young adult family/individual characteristics. Xn¯ is a vector of the 1968 neighborhood-level means of the same above variables.

Inclusion of family-level and neighborhood-level variables measuring the same concepts enables the vector α2 of coefficients to capture within-neighborhood effects of family characteristics. Using the within-neighborhood estimates of the family effects of income, education, race, family structure, health insurance coverage, health behaviors, connectedness to informal sources of help and housing quality on health in adulthood, will ensure the coefficients ( α2 ) will not be biased by omitted neighborhood variables. This follows from the fact that the neighborhood-level unmeasured factors can only be correlated with the neighborhood-level mean of the covariates. In combination, the resulting estimates can be taken as a conservative estimate of the independent effects of family characteristics.

We then estimate the between-neighborhood variance in α^Xfn by estimating a hierarchical random effects model of α^Xfn on neighborhood-level, family-level, and individual-level random effects. We then subtract the estimate of the between-neighborhood variance in α^Xfn from the estimate of the overall between-neighborhood variance in Hsfn. Dividing the resulting quantity by Var^(Hsfn) yields a tighter upper bound on the proportion of Var(Hsfn) that can be attributed to neighborhood effects.

The estimates of “adjusted neighbor correlation” enable us to ascertain how much of the raw neighbor correlation is due to young adult neighbors having similar (observable) family characteristics. We then investigate to what extent observable childhood socioeconomic status and young adult neighborhood- and family-level characteristics explain the observed spousal and neighbor correlations at middle age and late-life.

Sensitivity Analysis

As a preview of our results, we find concentrated neighborhood poverty is associated with significantly elevated risks of problematic health, even after controlling for an extensive set of individual and family-level factors. Rather than using traditional regression models with statistical controls, we provide a new approach to the study of neighborhood effects that allows for the assessment of the sensitivity of results to selection bias. We use sensitivity analysis to assess the impact of selection bias from unobserved covariates on estimates of neighborhood effects—in this case, the later-life health impacts of living in a poor neighborhood. Intuitively, these methods can be thought of as estimating neighborhood effects by comparing individuals living in poor and non-poor neighborhoods, who are otherwise identical on observable characteristics, and then testing the robustness of the results to the presence of unobserved covariates. The goal of the sensitivity analysis here is to assess how an unobserved covariate that affects both neighborhood of residence and adult health would alter our conclusions about the neighborhood effect. Sensitivity analysis asks, How do our inferences change given various hypothetical unobserved covariates?

Our aim is to assess how the point estimate and confidence interval of the effect of neighborhood poverty change under the presence of selection bias of varying strengths. We use an approach proposed by Altonji et al (2005) and Krauth (2006). This sensitivity analysis allows one to determine the threshold of selection on unobservable factors, if any, at which neighborhood poverty during young adulthood no longer has a significant effect on adult health. The approach uses the statistical relationship between observed explanatory variables as a guide to generate plausible estimates about the relationship between observed and unobserved variables. The sensitivity parameter, θ, can be defined as

corr(Xk,u)=θcorr(Xk,XβXkβk)

where θ indexes the magnitude of the correlation between observables and unobservables relative to the analogous correlation among observables themselves. In other words, the correlation between the neighborhood poverty rate and the (effect-weighted) unobservables is proportional to the correlation between the neighborhood poverty rate and the effect-weighted observables. The standard exogeneity assumption is the special case of θ=0. This approach provides a way to construct bounds on the effect of neighborhood poverty during young adulthood on mid-to-late life health based on the bounds one places on the sensitivity parameter θ (i.e., the relative correlation).

Altonji et al (2005) argue that if the observable determinants of an outcome are truly just a random subset of the complete determinants, selection on observable characteristics must be equal to selection on unobservables. Because the PSID was conducted to study family factors that affect well-being, we would expect selection on observables to be greater than selection on unobservable factors. In other words, the extensive measures of family and neighborhoods captured in the PSID are likely to be the most important determinants of adult health. Thus, estimates obtained under the assumption of equal selection, i.e., θ=1, will be biased downwards.

Results

The existence of a relationship between neighborhood poverty status in young adulthood and health status in mid-to-late life are evident in simple bi-variate comparisons. In the PSID sample, 45% of adulthood years are spent in fair or poor health among those who lived in poor neighborhoods in young adulthood, while that proportion is 15% among those who lived in non-poor neighborhoods. Further, the gaps in health status between groups living in more and less socioeconomically advantaged areas are large and increase across the life course.

We explore these relationships further by examining the unadjusted spousal and young adult neighbor correlations of health in mid-to-late life. We then examine how much of the adult neighbor correlations can be explained by the fact that families in a neighborhood tend to be similar as opposed to emanating from neighborhood effects per se. We then present estimates of the magnitude of the effects and attempt to explain the life-cycle pattern of spousal and neighbor correlations and to explore potential mechanisms that underlie the relative roles of neighborhood and family factors on the health trajectory over the course of adulthood. Finally, we explore the role of factors over the life course in explaining racial disparities in health in mid to late life.

Unadjusted Spousal and Young Adult Neighbor Correlations in Later-life Health

Table 1 presents the estimates from the baseline four-level hierarchical random effects models, separately at ages 35-55 and at ages over 55—these models control only for age, gender, and year of birth. The random effects estimates are all significant at each of the young adulthood neighborhood, family and individual levels.

Table 1.

Mid to Late-life Health: Importance of Young Adult Neighborhood & Family Background

(Dependent variable: general health status)
Hierarchical Random Effects Interval Regression Model: 100pt-scale, 100=perfect health
All Adulthood yrs
35+
Ages 35-55 Over 55
(1) (2) (3)
Constant 81.0112***
(0.1221)
81.1520***
(0.1224)
77.9502***
(0.1631)
Age - 50 −0.3909***
(0.0026)
−0.4340***
(0.0043)
Age - 60 −0.4451***
(0.0062)
Female −1.8432***
(0.1295)
−2.1336***
(0.1293)
−1.7299***
(0.1801)

Random Effects, Unmeasured (Std Dev)
Young Adult Neighborhood component
9.4605***
(0.1262)
9.5284***
(0.1343)
10.5244***
(0.1718)

Family component 5.8912***
(0.1921)
5.6670***
(0.2079)
5.0742***
(0.3605)

Individual component 12.0808***
(0.0727)
11.7314***
(0.0732)
15.0163***
(0.1070)

Transitory error component 7.9767***
(0.0075)
6.7325***
(0.0082)
7.7180***
(0.0122)

Log-likelihood −2235086 −1338205 −875160.66

Number of counties 306 306 306
Number of neighborhoods 1,457 1,416 1,129
Number of families 1,894 1,827 1,460
Number of individuals 2,730 2,625 2,019
Number of person-year observations 36,121 22,766 13,355
***

p<0.01

**

p<0.05

*

p<0.10

Note: Robust standard errors in parentheses and all standard errors are Huber-corrected, clustered on county. All models control for year of birth and include age squared and age cubed terms (coefficients suppressed to conserve space).

The baseline models in Table 1 measure the overall magnitude of variation in mid-to-late life health related to young adult neighborhood, family, and individual-level factors. The spousal and neighbor correlation estimates are based on the decomposition of variance over time into the fraction that lies between neighborhoods, families, and individuals. The unadjusted spousal and neighbor correlations calculated from the baseline models are summarized in the first row of Table 2. The spousal correlation in general health status is 0.46, on average, across ages 35 and older, and declines somewhat during ages after 55.

Spousal correlations by themselves cannot disentangle how much of the resemblance among their health outcomes is due to the effects of shared household environment and family background and how much is due to the effects of neighborhoods. Augmenting the spousal correlation estimates with corresponding neighbor correlation estimates reveals young adult neighborhood and family factors may both be important determinants of general health status in mid-to-late life. While the young adult neighbor correlations are smaller than the spousal correlations, they are significant through middle-age and old age. In particular, the young adult neighbor correlation in health at ages 35 and older averages 0.33 (Table 2). These magnitudes are particularly noteworthy given the fact that the vast majority of these individuals no longer live in the same neighborhoods they once shared as young adults (in this paper young adulthood is defined as individuals in their 20s and 30s). In the final section we will return to the issue of the extent of residential mobility and persistence in neighborhood quality across the life course, and its implications for these results.

Adjusted Spousal and Young Adult Neighbor Correlations in Later-life Health

From the adjusted neighbor correlation estimates, we find that observable family sorting (controlling for a broad array of young adult family characteristics including income, education, race, family structure, health insurance coverage, health behaviors, connectedness to informal sources of help and housing quality) explain some but not all of the resemblance in adulthood health status among individuals who lived in the same neighborhood as young adults. (The full model results used to compute the adjusted neighbor correlations are not shown to conserve space, but are available from the authors upon request). Specifically, the adjusted neighbor correlation is roughly 20% lower than the unadjusted neighbor correlation (Table 2, row 2), suggesting that differences in neighborhood quality during young adulthood may account for up to one-quarter of health disparities in mid-to-late life.

Magnitude of effects and potential pathways

What do these correlation estimates mean in terms of the absolute size of the effects of family and neighborhood factors, and what aspects of family socioeconomic status and young adult neighborhood matter to explain the lasting importance of earlier life conditions on late-life health? Estimates of the neighborhood random components (σn) indicate that neighborhood quality during young adulthood has significant and enduring association with general health status over the course of adulthood. It is also possible that some of this underlying relationship may be due to differences in childhood neighborhood quality and the correlation between childhood neighborhood and young adult neighborhood factors. (We cannot directly investigate this issue for the older birth cohorts analyzed in this paper because we do not have information on detailed neighborhood-level factors experienced during childhood).

With these questions in mind, we next estimate a series of models, reported in Table 3, building toward a full model specification that includes a rich array of observable child socioeconomic status measures and young adult family-level and neighborhood-level characteristics to attempt to identify earlier life determinants of health in mid-to-late life. Family income and neighborhood poverty are dimensions of family and neighborhood background that are a key focus of the analysis. We include neighborhood measures of crime, connectedness to informal sources of help, aspirations/motivation, long-term planning, and racial residential segregation in young adulthood. These factors may themselves be the product of living in a very poor neighborhood and may represent pathways through which exposure to depressed neighborhood environments earlier in life affect health trajectories later in life. However, controlling for this myriad of ways in which those who reside in very poor neighborhoods may differ from individuals who live in affluent neighborhood environments allows one to generate a more conservative estimate of the effect of neighborhood poverty itself, as well as shed light on the factors that affect adult health status.

Table 3 reports the regression results for adult ages 35-55, and ages 55 and older, respectively. For each age group, the series of results reported include models that control for childhood SES and young adult family characteristics (first column) and then adds young adult neighborhood characteristics (second column), and a full model that adds contemporaneous socioeconomic and neighborhood characteristics (third column). Separately identifying the causal pathways through which earlier life socioeconomic factors influence late-life health is an arduous task, and one must use caution in drawing causal inferences from these coefficient estimates. The estimates summarize the relationships between the health trajectory in adulthood with various dimensions of neighborhood and family background. The robustness of the results for causal inference is examined in detail in the final section of the paper.

The results demonstrate that the most salient young adult neighborhood factor is concentrated neighborhood poverty. To put the magnitudes in perspective, it is useful to consider that the estimated later-life health differences between those who lived in a neighborhood with medium poverty levels versus low poverty levels during young adulthood are on par with the estimated impacts of smoking as a young adult. As shown in column 5 of Table 3, the results for health at ages over 55 show that a 10 percentage-point increase in the neighborhood poverty rate (in young adulthood) from 10 to 20 percent is related to a 1.49 reduction in GHS, and living in a neighborhood with a high poverty level corresponds with a 6.34 lower GHS score in later life, relative to living in a neighborhood with a low poverty level as a young adult. This latter effect is equivalent to reaching a level of health deterioration roughly 14 years sooner for an individual who lived in a neighborhood with a high poverty level, relative to the simulated health trajectory experienced by one who lived in a neighborhood with a low poverty level earlier in adulthood. For purposes of causal inference, the robustness of this result to alternative thresholds of selection on unobservable factors is analyzed in the following section.

Several other related dimensions of neighborhood disadvantage experienced in young adulthood had substantial, independent associations with the health trajectory over the course of adulthood, including high crime, racial residential segregation, neighborhood housing problems, neighborhood connectedness to informal sources of support (which may serve as a proxy for social cohesion), and neighborhood-level average aspirations for socioeconomic attainment. These factors appear to have stronger relationships with health over the life-course, with stronger links to health at ages over 55 relative to middle-age. The age-profile of these estimated effects suggest that the linkages may be the result of cumulative exposure to disadvantaged environments taking a toll on health later in life that may be reinforced with how these factors earlier in life influence the socioeconomic mobility process. For example, as shown in column 2 of Table 3, for health status at ages over 55, living in a neighborhood with high reported crime as a young adult reduces GHS by 1.3 points; a 25 percentage-point increase in childhood residential segregation (dissimilarity index increase from 50 to 75) is related to a 1.2 point reduction in GHS (similar effects for both blacks and whites (results not shown)). Taken together, the cumulative set of childhood SES measures, and young adult neighborhood and family factors account for 54 percent of the neighborhood-level variance at ages over 55 (implied quasi-R2 at the neighborhood level). After controlling for observable childhood and young adult SES, the similarity of both spouses’ and young adult neighbors’ later-life health outcomes are less marked. Namely, conditional on childhood SES, and young adult neighborhood and family factors, the spousal correlation is 0.28 and the young adult neighbor correlation in mid-to-late life health is reduced to 0.13, which is roughly one-half the size of the adjusted neighbor correlations (see the third row of Table 2). Thus, we see that our observable measures of childhood SES and young adult neighborhood factors account for a sizable share of the patterns of health inequality in older adulthood and explain a significant portion of the resemblance of young adult neighbors’ later-life health outcomes.

Most of the neighborhood effects literature that has examined health outcomes has investigated whether contemporaneous neighborhood factors are significantly associated with health at a point in time (see, for example, literature reviews by Ellen et al (2001) and Kawachi and Berkman, 2003). In columns 3 and 6 of Table 3, we use our models to re-examine this issue. The final model includes both the full set of childhood SES and young adult neighborhood and family factors along with the contemporaneous adult neighborhood poverty rate. (Differences in the functional form specified for contemporaneous neighborhood poverty yielded nearly identical results). Contemporaneous adult neighborhood poverty was only weakly related to health status in mid-to-late life. The coefficient estimates on the childhood SES and young adult neighborhood and family factors remain significant and are not significantly reduced with the inclusion of the contemporaneous adult neighborhood poverty rate.

Results from Sensitivity Analysis

The estimates of the significant effects of neighborhood poverty during young adulthood on later-life health reported in Table 3 are based on models in which exogeneity is assumed. We conduct a sensitivity analysis to test the robustness of the estimated effects of young adult neighborhood poverty to selection bias due to an omitted variable. Table 4 presents the range of estimated coefficients and standard errors on young adult neighborhood poverty as a function of the ratio of selection on unobservables to selection on observables, i.e., θ. Under the assumption of exogeneity, i.e. , θ=0, the estimated effect is −4.7344 and highly significant. If instead, θ=0.4, the estimated effect is smaller but still substantial at −2.4631 and highly significant. In sum, we find that the effect of young adult neighborhood poverty on health status later in life remains large and significant even with a large amount of selection on unobservable factors.

While there is no single perfect solution to address the endogeneity of residential location, there are additional tests we employed to determine whether selection bias is driving the results. In additional model results not shown, we find that the strength of the estimated effects of neighborhood poverty during young adulthood vary with the number of known neighbors and duration of exposure to concentrated neighborhood poverty. The fact that the effect of concentrated neighborhood poverty is weaker when the duration of exposure is brief and individuals know few of their neighbors suggests that selection bias is not driving these results, and that the observed patterns of health inequality are not solely attributable to pre-existing differences between individuals living in different neighborhoods. If effects simply represented unmeasured family factors, then the number of years in the neighborhood and the number of neighbors known by name should not be associated with the strength of these effects. But that is not the case here, and the evidence is not consistent with alternative omitted-variables counter-explanations of the results, nor with the hypothesis that families predisposed to poor outcomes select into poor neighborhoods. Instead, the pattern of results, taken together, point towards true ‘causal’ effects of concentrated neighborhood poverty, rather than results purely driven by selection effects.

Health Disparities

Residential segregation patterns that lead to racial differences in neighborhood quality have been cited as a factor that perpetuates racial health disparities (Williams and Collins, 1995). We also used the PSID data to examine racial disparities in health over the life course. Gaps in health between blacks and whites are large and exist at all stages in life. The general health status (GHS) index in adulthood, on average, based on the PSID data is 13.9 points lower for blacks and this gap increases in levels and in proportionate terms in adulthood. A useful way to interpret the estimate is in relationship to the size of the effect of age on health, with the race gap in ages over 55 equivalent to blacks (on average) reaching a level of health deterioration about 30 years prior to their white counterparts. That is, GHS is 13.9 points lower for black adults, which is equal to roughly 30 years evaluated at an effect of age of −0.45. Thus, substantial disparities in health exist between blacks and whites over the life course.

In this section, we explore the role of childhood and young adult family/neighborhood measures on the black-white difference in health status later in life (at ages 55 and older). We find that these factors have a substantial influence. The black-white difference in health status at ages over 55 in models that include childhood SES and young adult family measures is reduced to one-third of its original raw age-adjusted gap. Most prominent among these young adult family factors is family income, with substantially larger impacts of income in the lower tail of the distribution. For example, a one-unit increase in the family income-to-needs ratio (during young adulthood) from half of the poverty line to 1.5 times the poverty line translates into an 8.5 point increase in adult GHS at ages over 55 (0.5*14.4338+0.5*2.6004), which is equivalent to 18.9 years younger. One’s own educational attainment, health insurance coverage, cigarette smoking and alcohol use in young adulthood are each also shown to be strongly associated with later-life health.

Further evidence of the influence of early life SES on late life health disparities is found in the estimates in columns 2 and 5 of Table 3. Once we control for child SES and both young adult neighborhood and family factors, the black-white difference in health status at ages over 55 is reduced by 76 percent. Further, the coefficients on the childhood SES measures are reduced significantly with the inclusion of the set of young adult family and neighborhood factors, but the childhood factors remain large and significant.

Finally, as shown in these columns 3 and 6 of Table 3, the results reveal that while three-quarters of racial differences in health at ages over 55 can be accounted for by childhood SES and young adult family and neighborhood factors, contemporaneous neighborhood factors independently account for little of this gap. Taken together, these results underscore the importance of earlier-life family and neighborhood SES conditions as keys to understand sources of health disparities at older ages.

Limitations of the study

Although the forty-year panel of data from the PSID provides a rich source of data from which to study the effects of neighborhoods on health, the analyses here have some limitations. While it is important to theorize potential pathways through which neighborhood influences earlier in life may have long-run impacts on later-life health, it is also important to bear in mind that the empirical work does not aim to separately identify all the causal pathways through which, for example, living in concentrated poverty neighborhoods matter. Instead, we produce estimates of the total (direct and indirect) impacts of these neighborhood conditions, and show that these neighborhood effects persist even in the presence of a reasonably large amount of potential unobservable individual and family factors that may significantly affect both neighborhood of residence and subsequent health status. Understanding and uncovering all of the potential causal mechanisms through which neighborhood SES matters remains fertile ground for future research. While we have a more extensive set of measures than most previous studies have employed across the life course regarding earlier-life family and neighborhood influences, we are nonetheless restricted to available measures in the PSID and Census data. Therefore, our measures may proxy for key underlying constructs of neighborhood SES and social capital such as community resources, social cohesion, and stressful environments. We have limited measures of childhood neighborhood conditions for this older birth cohort. Research using new measures of family- and neighborhood-level stress and potential interventions to moderate it are an important and promising area for future study. Finally, although our research design permits us to produce upper bounds of the potential scope of neighborhood influences, and sensitivity analyses allowed us to assess the robustness of results to varying degrees of selection, it remains difficult to fully address endogeneity issues in this work.

Conclusion

Previous research demonstrates that health status in the US varies strongly across local, state, and regional settings (Murray, Michaud, McKenna, and Marks, 1998). This paper investigates multilevel causation of these differences over the course of adulthood and provides insights into how family and neighborhood environments earlier in life influence the health trajectories of individuals in ways that cannot be reduced to the characteristics of the individuals themselves.

In this paper, we used correlations based on a nationally representative longitudinal sample of married couples and neighbors followed from young adulthood through elderly ages to estimate bounds on the possible causal effects of young adult family and neighborhood characteristics on general health status in mid-to-late life. The results show a significant scope for both young adult family and neighborhood factors. The estimates suggest that disparities in neighborhood conditions experienced in young adulthood account for up to one-quarter of the variation in health status in mid-to-late life. While the neighbor correlations must be strictly interpreted as upper bounds, the estimates suggest that neighborhood factors experienced earlier in the life course influence both contemporaneous and future health outcomes.

Research on how neighborhood and family background influence later-life health is one with potential endogeneity issues that are not amenable to the usual microeconometric corrections through use of instrumental variables or fixed effect approaches, and for which the extant experimental evidence is likely too short a time horizon to detect effects on overall health status. Instead of attempting to remove or avoid selection bias caused by unobserved factors, the methods employed in this paper assess how the presence of varying levels of selection bias would alter conclusions about the effect of living in a neighborhood with a high poverty level during young adulthood on mid-to-late life health. The results reveal that even a large amount of selection on unobservable factors does not eliminate the significant effect of neighborhood poverty on health status later in life. This evidence indicates further research on the effects of particular neighborhood characteristics is strongly warranted to identify the causal mechanisms through which concentrated neighborhood poverty effects operate.

The findings in this paper further our understanding of the underlying processes that produce health disparities between different racial and socioeconomic groups. The study finds that racial differences in later-life health can be largely accounted for by childhood socioeconomic status and young adult family and neighborhood factors, while contemporaneous neighborhood factors account for relatively little of this gap. The paper investigates differential effects of socioeconomic conditions on health over the life cycle and provides evidence that dimensions of neighborhood disadvantage that occur earlier in life are more predictive of morbidity during mid-to-late life than are contemporaneous neighborhood measures. The cumulative and dynamic nature of the relationship between socioeconomic conditions and health over the life course applies to dimensions of both family and neighborhood characteristics. The findings of this paper challenge future research to further our understanding of the pathways through which neighborhoods influence health and how the effects of neighborhood conditions differ over the life cycle. This knowledge is critical to help policy makers develop interventions (e.g., early childhood interventions or targeted policies for the geographic deconcentration of the poor) that build a bridge between childhood and early adulthood for impoverished families, so fewer individuals arrive at the doorstep of retirement with accumulated exposures that are irreversible.

Supplementary Material

01
02
03

Acknowledgments

This research was funded by the National Institute on Aging (R01AG024058). Funding for RAND’s Center for Population Health and Health Disparities was provided through NIEHS P50 ES12383. We thank the PSID staff for access to the confidential restricted-use PSID geocode data.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES

  1. Altonji J, Elder T, Taber C. Selection on Observed and Unobserved Variables: Assessing the Effectiveness of Catholic Schools. Journal of Political Economy. 2005;113(1):151–184. [Google Scholar]
  2. Ananat E, Gassman-Pines A, C.M. Gibson-Davis C. The Effect of Local Employment Losses on Children’s Educational Attainment. In: Duncan Greg J., Murnane Richard., editors. Social Inequality and Educational Disadvantage. Russell Sage Foundation Press; New York: (in press) forthcoming. [Google Scholar]
  3. Andreski P, McGonagle K, Schoeni R. An Analysis of the Quality of the Health Data in the Panel Study of Income Dynamics. 2009. (PSID Technical Series Paper #09-02).
  4. Barker D. Mothers, Babies and Health in Later Life. Churchill Livingstone; Edinburgh: 1998. [Google Scholar]
  5. Benyamini Y, Idler E, Leventhal H, Leventhal E. Positive Health and Function as Influences on Self-Assessments of Health: Expanding Our View Beyond Illness and Disability. Journal of Gerontology: Psychological Sciences. 2000;55B:107–116. doi: 10.1093/geronb/55.2.p107. [DOI] [PubMed] [Google Scholar]
  6. Becketti S, Gould W, Lillard L, Welch F. The PSID after Fourteen Years: an Evaluation. Journal of Labor Economics. 1988;6(4):472–492. [Google Scholar]
  7. Curtis S, Southall H, Congdon P, Dodgeon B. Area effects on health variation over the life-course: Analysis of the Longitudinal Study sample in England using new data on area of residence in childhood. Social Science and Medicine. 2004;58:57–74. doi: 10.1016/s0277-9536(03)00149-7. [DOI] [PubMed] [Google Scholar]
  8. Curtis S, Setia M, Quesnel-Vallee A. Socio-Geographic Mobility and Health Status: a longitudinal analysis using the National Population Health Survey of Canada. Social Science and Medicine. 2009;69:12, 1845–1853. doi: 10.1016/j.socscimed.2009.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Ellen I, Mijanovich T, Dillman K. Neighborhood Effects on Health: Exploring the Links and Assessing the Evidence. Journal of Urban Affairs. 2001;23(3):391–408. [Google Scholar]
  10. Fitzgerald J, Gottschalk P, Moffitt R. An Analysis of Sample Attrition in Panel Data. The Journal of Human Resources. 1998a;33(2):251–99. [Google Scholar]
  11. ______ An Analysis of the Impact of Sample Attrition on the Second Generation of Respondents in the Michigan Panel Study of Income Dynamics. The Journal of Human Resources. 1998b;33(2):300–44. [Google Scholar]
  12. Geronimus A. Black/White Differences in the Relationship of Maternal Age to Birthweight: A Population-Based Test of the Weathering Hypothesis. Social Science and Medicine. 1996;42(4):589–97. doi: 10.1016/0277-9536(95)00159-x. [DOI] [PubMed] [Google Scholar]
  13. Haas S. The Long-term Effects of Poor Childhood Health: An Assessment & Application of Retrospective Reports. Demography. 2007;44(1):113–135. doi: 10.1353/dem.2007.0003. [DOI] [PubMed] [Google Scholar]
  14. Healthy People . United States Department of Health and Human Services; 2000. www.healthypeople.gov. [Google Scholar]
  15. Hill M. The panel study of income dynamics: A user’s guide. Sage Foundation; Newbury Park, Ca: 1992. [Google Scholar]
  16. Katz L, Kling J, Liebman J. Moving to Opportunity in Boston: Early Results of a Randomized Mobility Experiment. Quarterly Journal of Economics. 2001;116(2):607–654. [Google Scholar]
  17. Kawachi I, Berkman L. Neighborhoods and Health. Oxford University Press; New York: 2003. New York. [Google Scholar]
  18. Krauth B. Simulation-based Estimation of Peer Effects. Journal of Econometrics. 2006;133(1):243–271. [Google Scholar]
  19. Kuh D, Wadsworth M. Physical Health Status at 36 Years in the British National Birth Cohort. Social Science and Medicine. 1993;37(7):905–916. doi: 10.1016/0277-9536(93)90145-t. [DOI] [PubMed] [Google Scholar]
  20. Kunz J, Page M, Solon G. Are Point-in-Time Measures of Neighborhood Characteristics Useful Proxies for Children’s Long-run Neighborhood Environment? Economics Letters. 2003;79(2):231–237. [Google Scholar]
  21. Lillard L, Panis C. Panel Attrition from the Panel Study of Income Dynamics. The Journal of Human Resources. 1998;33(2):437–57. [Google Scholar]
  22. Lillard L, Panis C. aML Version 2. User’s Guide and Reference Manual. EconWare; Los Angeles: 2003. [Google Scholar]
  23. Leventhal T, Brooks-Gunn J. New York City Site Findings: The Early Impacts of Moving to Opportunity on Children and Youth. In: Goering J, Feins JD, editors. Choosing a Better Life: Evaluating the Moving to Opportunity Social Experiment. Urban Institute Press; Washington, DC: 2003. pp. 213–243. [Google Scholar]
  24. Oakes J. The (Mis)estimation of Neighborhood Effects: Causal Inference for a Practicable Social Epidemiology. Social Science & Medicine. 2004;58(10):1929–1952. doi: 10.1016/j.socscimed.2003.08.004. [DOI] [PubMed] [Google Scholar]
  25. Manski C. Identification of Endogenous Social Effects: The Reflection Problem. The Review of Economic Studies. 1993;60(3):531–542. [Google Scholar]
  26. Murray C, Michaud C, McKenna M, Marks J. U.S. patterns of mortality by county and race: 1965-1994. Harvard Center for Population and Development Studies; Cambridge, MA: 1998. (Monograph Series). [Google Scholar]
  27. Rabe-Hesketh S, Skrondal A. Multilevel and Longitudinal Modeling Using Stata. Second Edition Stata Press; College Station, TX: 2008. [Google Scholar]
  28. Raudenbush S, Bryk AS. Hierarchical Linear Models: Applications and Data Analysis Methods. Second Edition Sage Publications; Thousand Oaks: 2002. [Google Scholar]
  29. Smith J. Reconstructing Childhood Health Histories. Demography. 2009;46(2):387–403. doi: 10.1353/dem.0.0058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Solon G, Page M, Duncan G. Correlations Between Neighboring Children in their Subsequent Educational Attainment. The Review of Economics and Statistics. 2000;82(3):383–392. [Google Scholar]
  31. Williams D, Collins C. US Socioeconomic and Racial Differences in Health: Patterns and Explanations. Annual Review of Sociology. 1995;21:349–386. [Google Scholar]
  32. Wooldridge J. Econometric Analysis of Cross Section and Panel Data. MIT Press; Cambridge, MA: 2002. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01
02
03

RESOURCES