Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Mar 1.
Published in final edited form as: Curr Epidemiol Rep. 2016 Feb 16;3(1):63–71. doi: 10.1007/s40471-016-0069-5

The Consistency Assumption for Causal Inference in Social Epidemiology: When a Rose is Not a Rose

David H Rehkopf 1, M Maria Glymour 2, Theresa L Osypuk 3
PMCID: PMC4912021  NIHMSID: NIHMS760712  PMID: 27326386

Abstract

The assumption that exposures as measured in observational settings have clear and specific definitions underpins epidemiologic research and allows us to use observational data to predict outcomes in interventions. This leap between exposures as measured and exposures as intervened upon is typically supported by the consistency assumption. The consistency assumption has received extensive attention in risk factor epidemiology but relatively little emphasis in social epidemiology. However, violations of the consistency assumption may be especially important to consider when understanding how social and economic exposures influence health. Efforts to clarify the definitions of our exposures, thus bolstering the consistency assumption, will help guide interventions to improve population health and reduce health disparities. This article focuses on the consistency assumption as considered within social epidemiology. We explain how this assumption is articulated in the causal inference literature and give examples of how it might be violated for three common exposure in social epidemiology research: income, education and neighborhood characteristics. We conclude that there is good reason to worry about consistency assumption violations in much of social epidemiology research. Theoretically motivated explorations of mechanisms along with empirical comparisons of research findings under alternative operationalizations of exposure can help identify consistency violations. We recommend that future social epidemiology studies be more explicit to name and discuss the consistency assumption when describing the exposure of interest, including reconciling disparate results in the literature.

Keywords: social class, epidemiology, causal inference, potential outcomes, bias


Rose is a rose is a rose is a rose. – Gertrude Stein (1)

Introduction

A single term may encompass a multitude of meanings. Ambiguity in the meaning of a construct can impede scientific research and the replication and translation of findings. In epidemiology, this challenge is acute when translating evidence from observational to intervention settings. To help make the leap from observational evidence to intervention design, epidemiologists typically invoke three assumptions: exchangeability, positivity, and consistency. Social epidemiology, addressing how social and economic determinants influence health and disease throughout the life course, has long struggled with the exchangeability (no confounding) assumption. The positivity assumption – that in all covariate strata some individuals are treated while others are untreated - has also received important research attention from social epidemiologists. But social epidemiologists have paid relatively little attention to the consistency assumption, which entails that the exposure is defined with enough specificity that different variants of the exposure do not have different effects on the outcome. Both theoretical evidence and the limited available empirical evidence suggest that violations of the consistency assumption are plausible for social exposures. Attending to such violations may help us design more effective interventions.

Causal criteria of consistency

Moving from an observed association between two factors to understanding whether one factor actually caused the other is a common goal for epidemiology research. We adopt a counterfactual or potential outcomes approach to defining a cause as: if the cause did not occur, the chance of the outcome occurring would be different than if the cause did occur. We can more concisely describe this using potential outcomes notation where Pr(YX=1) represents the probability distribution of Y if the variable X were set to the value 1. This is called a “potential” outcome because X may or may not take the value 1. We define causal effects as contrasts of such potential outcomes. For example the risk difference measure of the effect of X on Y would be defined as the difference in the probability that Y was 1 if we set everyone in the population to X=1 versus if we set everyone in the population to X=0 (or any other specific value besides 1):

Pr(YX=1=1)Pr(YX=0=1) (1)

Causal inference relies on the process of replacing the potential outcomes in the causal parameter with observed outcomes that can be estimated from data. For example, we may estimate the risk difference by contrasting the probability that Y is 1 among those people in the population with X=1 versus the probability that Y is 1 among those people for whom X=0:

Pr(Y=1X=1)Pr(Y=1X=0) (2)

But under what circumstances can we substitute observed outcomes, such as Pr(Y=1|X=0) for potential outcomes such as Pr(YX=0=1)? It is impossible to observe both YX=0 and YX=1 for even one person, much less for everyone in the population, as demanded in equation 1. In the analysis of quantitative data, the core criteria for causal inference are exchangeability, positivity, and consistency. These criteria form a link between observable features of the data and potential outcomes that define causal effects. Most efforts for deducing causal estimates in epidemiology have been focused on whether or not there are violations to the criterion of exchangeability (i.e. no confounding), that those individuals receiving the treatment should be considered as exchangeable (with respect to potential outcomes) with those not receiving the treatment and vice versa. That is, they should be identical on average for characteristics that may influence the outcome except for the treatment itself. This is the advantage of randomized trials, since exchangeability is inherent to the study design by virtue of random assignment. Specifically, because of randomization, we believe we can modify equation 1 to estimate the distribution of the potential outcomes Pr(YX=1=1) for the whole sample based only on those people who were randomized to receive X=1, and similarly estimate the distribution of the potential outcomes Pr(YX=0=1) based only on those people who were randomized to receive X=0:

Pr(YX=1=1X=1)Pr(YX=0=1X=0) (3)

In observational studies, exchangeability is achieved via other design or analytic techniques, including most commonly through covariate adjustment in multiple regression, but also by restriction, matching, or weight-based approaches, all of which are intended to mimic randomization (2, 3). But note that in equation 3, we still have a potential outcome. We need another assumption to be able to directly estimate those values as in equation (4), and this assumption seems so obvious it is sometimes overlooked. We need to assume that Pr(YX=1=1|X=1) = Pr(Y=1|X=1): among those people with X=1, their actual value of Y matches the value Y would take if we set X=1. The consistency assumption requires that there are no two “flavors” or versions of treatment such that X=1 under both versions but the potential outcome for Y would be different under the alternative versions. This modest looking assumption -- which in recent causal inference literature has been labeled the consistency assumption – is often overlooked.

It is important to note that the consistency assumption we discuss here is only very indirectly related to the Bradford Hill Criterion of “consistency” (4), which invokes the view that if most studies find a similar result the association is more likely to be causal than if different studies report different results (5). We are also not focused here on a consistent statistical estimator, i.e., an estimator that moves closer to the truth in larger sample sizes (6).

Rather, we are focused on describing the consistency criterion that is fundamental to the potential outcomes approach to causal inference. The consistency assumption implies that an individual’s potential outcome under his or her observed exposure history is the outcome that will actually be observed for that person. Based on an earlier conception (7) restated by Pearl: “for all A and B, if A is true, then if B would have prevailed (counterfactually) had A been true, it must be true already.”(8) Perhaps part of what has led to less attention to this criterion is that on its own it can seem obtuse and a bit circular. As noted by Pearl, consistency can, depending on the exact causal framework used, be either presented as an assumption to support inference or as an axiom to define counterfactuals (8). This perhaps offers another reason why consistency may be the most ignored of the assumptions for causal inference - it is invisible because it is so fundamental to causal inference itself. What consistency critically implies is that the exposure specified in the analysis must have enough precision that any variation within the exposure specification would not result in a different outcome.

Consider the effect of socioeconomic status (SES) on health. While many measures of SES exist, composite measures that combine a person’s income, education and occupational prestige are common. For descriptive social epidemiology, composite measures are sometimes advantageous for capturing a more general underlying construct that could be, for instance, compared across place and time, in particular when the variation in the components of that composite index may differ in ways that are not of substantive interest. However, for causal studies, in order to avoid violation of the consistency assumption, it is important that regardless of which component of SES is changed, the same effect would occur. Without analyzing it directly, we can consider the plausible validity of this assumption by turning to literature that has estimated separate associations between each SES component of income, education, and occupation, with a specific health outcome. Based on a large literature in this area, we conclude that for the construct of socioeconomic status as operationalized using a composite, the assumption of consistency is likely violated (9, 10). Interventions on income, education, or occupation may each have different impacts on a particular health outcome. An SES index could be considered to be a “compound treatment,” a common cause of violation of the consistency assumption (11). While not invoking the consistency assumption, this has led others to the same conclusion, that for etiologic work it is best to avoid composite measures of SES (12).

Many interpretations of the consistency assumption require that we grapple with the manipulability criterion, a controversial topic in social epidemiology. This debate for social epidemiology was laid out in an influential commentary by Kaufman and Cooper “Seeking Causal Explanations in Social Epidemiology,” (13) where they explored the difficulties of establishing a clear counterfactual when examining social factors’ influences on health. They recommended that researchers “seek causal explanations only for definable interventions.” Consistency is often motivated by linking to specific interventions rather than simply to specificity of exposure (e.g., by focusing on whether obesity is changed by diet or by exercise, rather than by focusing on whether obesity is due to excess central or peripheral adipose tissue (14)). Currently, researchers in social epidemiology have not reached consensus on the manipulability criterion. However, even people who believe that non-manipulable factors can be causes agree that the consistency assumption may be violated if an intervention to change the exposure is not clearly and specifically described. For example, if intervening on SES by increasing education has different health consequences than intervening on SES by increasing income, SES is not a well-defined intervention and violates the consistency criterion. Telling other researchers that increasing SES improves health is thus insufficient to guide the development of specific interventions.

Ambiguity in the definition of interventions to change exposure is central to violations of consistency (15). Cole and Frangakis define consistency in terms of the potential outcome definition of Yj(x,k), where for individual j the exposure X is set to a specific value x by intervention k. Therefore the consistency assumption holds if the observed Yj(x)= Yj(x,k) for all values of k that set X to x. As noted by VanderWeele (16), the range of possible k (the means by which exposure occurs) will vary depending on the specific x (the specific exposure tested) (16). The task then within social epidemiology is for an investigator to consider the range of possible interventions (k) that could elicit the same value of exposure (x). Interventions need to be described with enough specificity that any additional variation not specified is irrelevant. However, if the intervention description is needlessly specific, it will preclude future replications.

A second related but distinct concern is effect measure modification. Effect measure modification occurs when the effect of an exposure of interest on an outcome is different in some subgroups of the population as compared to others - for example if the effect of social support on heart disease differs between men and women. In many cases it is ambiguous whether a variation should be considered a feature of the population or a feature of the intervention. For example, we might find that beginning school at ages 3-4 years has larger cognitive benefits than enrolling in school at ages 6-7 years. We could say there are two versions of the treatment “begin school at age 3” versus “begin school at age 6”, or we could say there is one treatment “begin school”, which could be applied to 3 year olds or to 6 year olds. Focusing on two interventions would frame this as a consistency problem; focusing on the age of children who receive a treatment frames this as an effect modification problem. The key distinction is whether the attribute violating the consistency assumption is more clearly thought of as a separate factor or a characteristic of the exposure itself. Very closely related is the challenge of transportability (11), or how causes identified in one population can or cannot be applied to other populations (17). For practical purposes, consistency violations and effect measure modification have similar implications: to design an intervention you must decide precisely who will receive the intervention and what the content of the intervention will be. To be useful to intervention design, observational studies must therefore specify both the population and the exposure.

A third issue related to but distinct from consistency considers causal pathways, or mediation. Exposures that influence an outcome via multiple mediating pathways with different magnitudes of effect do not necessarily violate the consistency assumption. Multiple mediating pathways, however, may imply the potential for violations of consistency because it is easy to imagine several closely related interventions that trigger some but not all of the mediating pathways. For example, education is thought to influence health via improvements in knowledge and cognitive skills, credentials that are valued on the labor market, status improvements, and changes to the individual’s social network. It is easy to imagine variations on educational experiences that have larger or smaller effects on just one of these mechanisms, e.g., cognitive skills or prestige. Indeed, articulating the theorized mechanisms linking exposure and outcome may help us define clear and specific exposures that fulfill the consistency criterion.

Potential consistency assumption violations in social epidemiology

Although some social epidemiology research is directed towards adherence to the consistency assumption, it is usually framed in terms of the specificity of the exposure and generalizability, without formal linkages to the consistency assumption as a criterion for causal inferences. Thus while not evaluated directly, in this section of the paper we discuss examples of how the consistency assumption may be violated for research on three commonly studied factors in social epidemiology: income, education and neighborhoods. We describe below examples of research in social epidemiology and related fields that through their specific counterfactual contrast examine a particular k for these three common exposures, and provide some evidence that consistency violations are a concern in social epidemiology. For all of the exposures we describe, the possible k variations (the means by which exposure is enacted) based on timing and duration also apply.

Income

Multiple reviews and chapters on the relationship between income and health have noted the varied associations across studies; these reviews generally focus on the importance of duration and timing of exposures, the outcomes examined and violations of exchangeability (18). All three of these issues are important for defining specific causal effects of income on health, but some of the variability in effect estimates for the relationships between income and health may also be due to lack of attention to the consistency criteria. In examining income effects on health, several types of income changes have been evaluated. These include sources such as regular earnings (19), paychecks (20), tax refunds (21), gaming profit disbursements (22), cash transfers (23), inheritances (24) and lottery winnings (25). For some of these differences in k, the distinction may be subtle. For example, regular earnings are typically disbursed as a paycheck, so earnings and paychecks could be considered effectively the same treatment with respect to the consistency assumption. Analyses, however, have differentiated within these exposures by using methods to examine earnings over periods of time of months or years, or focus on the period of time when the fund from a paycheck are actually received, resulting in a potentially different effect. It is not currently clear whether, for a particular health outcome, these various mechanisms of receiving income result in a different effect.

While some of the literature is adherent to the consistency criterion by carefully framing results in reference to the exact type of income transfer examined, it is generally a mistake to extrapolate from a particular exposure to the effects of income more generally. For example, in describing the implications of the conditional cash transfer from the Oportunidades program, the authors wrote: “Our results suggest that the cash transfer component of Oportunidades is associated with better outcomes in child health, growth, and development” (23). In contrast, when describing the goal of research examining the effect of lottery winnings on health, the authors wrote: “This paper has asked whether money makes individuals healthier” (26), a description of a causal estimate that has a wider range of k.

There are a few examples where multiple types of income exposures have been examined in the same study using similar methods, sometimes with identical outcomes and populations. This literature suggests that income from different sources has different health effects. One of the best examples of attention to the consistency criterion is work that stipulates a specific k of short-term duration and compared the effects of other multiple k sources of money (i.e., from social security payments, regular wage payments for the military, tax rebates and Alaska fund payments) on short-term mortality (27). The similarity of results across these k income sources suggests that for short-term effects of income, these sources effectively meet the consistency assumption. In the examination of short-term effects of tax credits on risk factors for mortality, however, impacts were primarily beneficial (21), suggesting that k could differ for the alternative exposure of consistent, large benefits from the Earned Income Tax Credit (EITC). Qualitative work can also play an important role in considering violations of the consistency assumption. Work on EITC recipients has shown that individuals view the lump sum tax refund payment in a different way than regular paychecks, and that what they spend money on varies depending on the type of income received (28).

Education

The overwhelming majority of health research operationalizes education as either years completed (often in two or three categories, such as less than 12 versus 12 or more) or degree completion (often in two or three categories, such as a high school diploma or bachelor’s degree). Although many researchers acknowledge the potential importance of school quality (29), aspects of quality such as variations in school term duration (which now averages around 180 days per year but has historically varied by nearly a factor of two, especially across segregated schools) are not captured in years completed or degrees obtained (30). Differences in the timing of education with respect to developmental stage are also disregarded in conventional measures of education based on years or degrees completed. The literature suggests that the multiple attributes of education that are not typically specified may be differentially associated with health outcomes, thus use of standard measures of level of education may violate the consistency assumption. In contrast to years of education, Manly has conceptualized literacy – the capacity to use printed and written information to function in society – as a marker of school quality (31). Teaching literacy skills is a key goal of education, and literacy is plausibly a powerful mediator for schooling effects. Manly finds that accounting for literacy substantially accounts for racial disparities in age-related dementia outcomes. She has also shown large regionally based disparities in literacy in older adults (32). Literacy could be improved via multiple different treatments that are within the domain of education, such as higher quality schooling, longer duration of schooling, or mechanisms unrelated to formal schooling. However, if literacy mechanisms are essential to the effects of schooling, it implies that educational interventions focusing on improved literacy will have different health consequences than interventions that, for example, focus only on the number of years of education. Nguyen reports that literacy accounts for roughly 19% of the effect of education on mortality, although this may be an underestimate due to the limited measure of literacy available (33). Another approach to intervening within the overall construct of education is on class size. The Tennessee STAR (Student/Teacher Achievement Ratio) trial randomized children in 79 schools to small class sizes, with or without certified teacher’s aides to assist children in the classroom (34). Evidence on the health effects of these variations would help us understand whether consistency violations are a concern in typical definitions of education. To date, however, evidence on the health effects of Tennessee STAR is limited and mixed, although assignment to high-quality classrooms appeared to benefit earnings (35).

Research using compulsory schooling laws (CSLs) as natural experiments for the health effects of education has indirectly grappled with the consistency challenge, because CSL changes typically either introduce an extra year of schooling at an earlier age (e.g., reducing the school entry age from 7 years to 6 years) or an extra year of schooling in adolescence (e.g., increasing the school leaving or work permit age from 14 to 15). Several instrumental variable studies based on CSLs find larger effect estimates than conventional analyses (36-39), and one interpretation is that delivering an additional year of schooling at these ages has larger consequences than the effect of an additional year of schooling at later developmental periods. To our knowledge, there is no direct empirical evaluation of this explanation for the discrepancy. However, several studies suggest that early interventions focused on improved cognitive environments (e.g. among children under 5) have enduring health benefits (38, 40, 41). These studies typically include both cognitive and non-cognitive (e.g., nutritional) enrichment for children, so it is difficult to disentangle strictly educational exposures from other benefits.

Figure 2 illustrates the challenge of identifying interventions that correspond with the relevant mechanisms via which education may influence health. Of the several possible interventions shown, few would be reflected in typical measurements of education (e.g., years of education completed) and each way of intervening might have a different consequence on the mediators and thus on health. Conversely, the consequences of attending extra years of schooling might depend quite strongly on how much classroom instruction occurred during each year of school, the quality of the teachers, the norms, talents, and socioeconomic resources of classmates, the prestige of the school, and so forth. In short, the link between what we measure in most observational studies of education, and what matters for health, is not necessarily close. As a result, we are unsure how to best intervene.

Figure 2.

Figure 2

Links between possible interventions on education and potential mechanisms affecting health. Not all interventions influence the same mechanisms, and it is unknown which mechanisms are most important for particular health outcomes.

Neighborhoods

Studies examining how neighborhood context influences health are quite heterogeneous in terms of the specific neighborhood construct, as well as the operationalizations of specific constructs. The most common exposure among neighborhood effects studies is the SES of a neighborhood, which is typically used to measure the construct of underlying neighborhood quality. A common way to operationalize neighborhood SES is via an index of different dimensions of SES, e.g. median area income, proportion of neighborhood households below poverty line, proportion of residents without a high school education and proportion of unemployed males of working age (42). This exposure encounters the same threat of consistency violation described above as for individual SES indices. For example, if decreasing the proportion of people in poverty may have different effects than increasing the average high school graduation rate.

Another potential violation of consistency concerns the method by which the construct of neighborhood quality is changed. Two approaches are typically conceptualized as approaches to modifying neighborhood environments: (1) place-focused interventions, often called neighborhood revitalization policies, that improve the conditions in disadvantaged neighborhoods, and (2) people-focused interventions that help households obtain housing in higher quality neighborhoods. This second approach improves neighborhood quality for individuals by virtue of a specific household that moves. Housing policy experts recommend that both place and people based approaches are necessary to reduce neighborhood inequality (43), but specific interventions within each of these approach types may have different impacts on the health of neighborhoods. One of the strongest studies for assessing the changes of neighborhood and housing context for health is the Moving to Opportunity (MTO) study, where volunteer low income households were randomized to receive a government housing voucher to subsidize rental costs in private units, compared to in-place control groups in public housing. The MTO study demonstrated that receiving a rental voucher resulted in substantial improvements of neighborhood context, and strong improvements in mental health for low-income predominantly minority female household heads and their daughters in the treatment group compared to controls. Treatment effects were stronger for those without health vulnerabilities at baseline (44, 45). The randomized design often (but not necessarily) means that consistency is met because exposure is specific and assigned by the program: study participants were assigned to a specific experimental group, with a well defined treatment protocol in each arm. However, more typical of observational studies in the field, changes in neighborhood poverty exposure may be achieved via any of several mechanisms, including household moves unassisted by a mobility policy, neighborhood revitalization, or via neighborhood gentrification. One could easily imagine that the health consequences of moving from a high-poverty to a low-poverty neighborhood might differ from the health consequences of improvements to the current neighborhood. If these modes differ in their effects on health, this suggests a potential violation of consistency. Specifying whether neighborhood change is achieved by moving, as opposed to by place-based change, would improve inferences and provide better guidance to develop interventions.

While neighborhood revitalization is one key mechanism to improve neighborhood quality, there are many possible ways to operationalize it. Moreover, in practice, revitalization programs often implement multiple simultaneous changes with concentrated place-based investment, which influence multiple dimensions of a neighborhood (e.g. transportation, housing, economic development). This bundled treatment makes it difficult to meet the consistency assumption. Again, narrowing the exposure could help, if such improvements were implemented (or able to be evaluated) in isolation. For example, neighborhood redesign to adopt smart growth principles of street connectivity to improve walkability, or increasing greenspace and vegetation. However, much of this literature fails to manipulate an exposure or to isolate a specific exposure (46).

Notably, neither random assignment, nor manipulability, guarantees consistency because of ambiguities in the treatment protocols and bundled treatments. Many social and economic policies that have been rigorously designed and evaluated (i.e., via experiments) deliver bundled treatments (47). Bundled treatments may be problematic for the consistency assumption if particular dimensions of the treatment affect health differently.

Is consistency always important?

Ralph Waldo Emerson wrote “a foolish consistency is the hobgoblin of little minds” (48), and we agree that in some settings the consistency assumption is less important. Particularly, early in the arc of a research question, it may be important to cast a wide net, examine unclearly defined constructs, and try to integrate evidence across studies with measures that do not clearly correspond to a specific intervention. Vandenbroucke et al., note that one challenge in recent causal inference literature is that one does not necessarily know in advance if an intervention is well-specified (i.e., consistent), because we do not know exactly how the intervention will work (49). Just as research designs are often more vulnerable to confounding or violations of exchangeability when forging new scientific territory, initial research designs may also appropriately back-burner consistency with the goal of evaluating whether any component of a large set of related but somewhat ill-defined constructs influences the outcome. As a research area matures and researchers move towards efforts to intervene, however, the consistency assumption must be directly addressed to guide intervention development.

Conclusion

We have three recommendations for how social epidemiologists should address the consistency assumption in their work. First, studies should be explicit about the assumptions of their version of treatment or exposure (k) and draw on both theoretical and empirical evidence to identify potentially relevant variations. Second, until results come to light that suggest that certain dimensions of the version of treatment (k) do not violate the consistency assumption, studies should endeavor to examine exposures with a reduced range of such versions. Third, as VanderWeele suggests (16), whenever possible, researchers should explicitly test multiple different definitions of an exposure. Although there is not a definitive test for the consistency assumption, there are ways to evaluate how plausible it is for a specific exposure definition by using data from both within and across studies.

The epidemiologic literature that speaks to the substantive magnitude of differences in effect estimates is also useful for thinking about how important it may be to pay closer attention to consistency. For example, quantitative comparison of studies has shown that there is a close match between trials and observational studies when there is a precisely defined exposure, such as a medical treatment (50). When results of randomized trials diverge from observational evidence, it may be because studies did not use precisely the same exposure, for example in studies of beta-carotene (51-53). Although the exchangeability assumption is usually assumed to account for differences between RCTs and observational evidence, consistency assumption violations may also play a role.

Violations of the consistency assumption in social epidemiology may provide another explanation for why observational study results are often heterogeneous, in addition to differences deriving from the study populations or confounding structures. The call for consequential social epidemiology entails more attention to the consistency assumption(54, 55). Fulfilling the consistency assumption allows for a closer connection between observational studies and inferences about actions based on those studies. In practice, this aligns with the so-called health in all policies approach (56, 57), a cross-sectoral view of how policies shaping social determinants of health outside of the health sector (e.g. in the education, housing and/or workforce sectors) can be used to improve health.

Figure 1.

Figure 1

56 million £ lottery winners Nigel Page and Justine Laycock. Does a dollar of lottery winnings affect their health in the same way as a dollar of income from work?

Footnotes

Compliance with Ethics Guidelines

Conflict of Interest

David H. Rehkopf, M. Maria Glymour, and Theresa L. Osypuk declare that they have no conflict of interest.

Human and Animal Rights and Informed Consent

This article does not contain any studies with human or animal subjects performed by any of the authors.

References

Papers of particular interest, published recently, have been highlighted as:

• Of importance •• Of major importance

  • 1.Stein G, Four Seas Company., Four Seas Press . Geography and plays. Vol. 419. The Four Seas Company; Boston: 1922. p. 1. p. (last p. blank) p. [Google Scholar]
  • 2.Robins JM, Hernan MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–60. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
  • 3.Stuart EA. Matching methods for causal inference: A review and a look forward. Statistical science: a review journal of the Institute of Mathematical Statistics. 2010;25(1):1. doi: 10.1214/09-STS313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hill AB. The Environment and Disease: Association or Causation? Proc R Soc Med. 1965;58:295–300. PubMed PMID: 14283879; PMCID: PMC1898525. [PMC free article] [PubMed] [Google Scholar]
  • 5.Rothman K, Greenland S. Causation and causal inference in epidemiology. American journal of public health. 2005;95(S1):S144–S50. doi: 10.2105/AJPH.2004.059204. [DOI] [PubMed] [Google Scholar]
  • 6.Kiefer J, Wolfowitz J. Consistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters. The Annals of Mathematical Statistics. 1956:887–906. [Google Scholar]
  • * 7.Gibbard A, Harper WL. Counterfactuals and two kinds of expected utility: Springer. 1981 A fundamental development of counterfactual theory which describes the requirement of consistency for inference. [Google Scholar]
  • 8.Pearl J. On the consistency rule in causal inference: axiom, definition, assumption, or theorem? Epidemiology. 2010;21(6):872–5. doi: 10.1097/EDE.0b013e3181f5d3fd. [DOI] [PubMed] [Google Scholar]
  • 9.Braveman PA, Cubbin C, Egerter S, Chideya S, Marchi KS, Metzler M, Posner S. Socioeconomic status in health research: one size does not fit all. JAMA. 2005;294(22):2879–88. doi: 10.1001/jama.294.22.2879. doi: 10.1001/jama.294.22.2879. PubMed PMID: 16352796. [DOI] [PubMed] [Google Scholar]
  • 10.Adler NE, Rehkopf DH. US disparities in health: descriptions, causes, and mechanisms. Annu Rev Public Health. 2008;29:235–52. doi: 10.1146/annurev.publhealth.29.020907.090852. [DOI] [PubMed] [Google Scholar]
  • 11.Hernán MA, VanderWeele TJ. Compound treatments and transportability of causal inference. Epidemiology (Cambridge, Mass) 2011;22(3):368. doi: 10.1097/EDE.0b013e3182109296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Duncan GJ, Magnuson KA. Off with Hollingshead: Socioeconomic resources, parenting, and child development. Socioeconomic status, parenting, and child development. 2003:83–106. [Google Scholar]
  • 13.Kaufman JS, Cooper RS. Seeking causal explanations in social epidemiology. American journal of epidemiology. 1999;150(2):113–20. doi: 10.1093/oxfordjournals.aje.a009969. [DOI] [PubMed] [Google Scholar]
  • **14.Hernán M, Taubman S. Does obesity shorten life? The importance of well-defined interventions to answer causal questions. International Journal of Obesity. 2008;32:S8–S14. doi: 10.1038/ijo.2008.82. This article gives an overview of the importance of the consistency assumption for causal inference in epidemiology illustrated using the example of studies of the effects of obesity on mortality. [DOI] [PubMed] [Google Scholar]
  • * 15.Cole SR, Frangakis CE. The consistency statement in causal inference: a definition or an assumption? Epidemiology. 2009;20(1):3–5. doi: 10.1097/EDE.0b013e31818ef366. A summary of the importance of the consitency assumption. [DOI] [PubMed] [Google Scholar]
  • * 16.VanderWeele TJ. Concerning the consistency assumption in causal inference. Epidemiology. 2009;20(6):880–3. doi: 10.1097/EDE.0b013e3181bd5638. This article clarifies the basis of the consistency assumption and provides details about the range of treatments. [DOI] [PubMed] [Google Scholar]
  • 17.Bareinboim E, Pearl J. A general algorithm for deciding transportability of experimental results. Journal of Causal Inference. 2013;1(1):107–34. [Google Scholar]
  • 18.Kawachi I, Adler NE, Dow WH. Money, schooling, and health: Mechanisms and causal evidence. Annals of the New York Academy of Sciences. 2010;1186(1):56–68. doi: 10.1111/j.1749-6632.2009.05340.x. [DOI] [PubMed] [Google Scholar]
  • 19.Benzeval M, Judge K. Income and health: the time dimension. Social science & medicine. 2001;52(9):1371–90. doi: 10.1016/s0277-9536(00)00244-6. [DOI] [PubMed] [Google Scholar]
  • 20.Catalano R, McConnell W. Psychiatric emergencies: the check effect revisited. Journal of Health and Social Behavior. 1999:79–86. [PubMed] [Google Scholar]
  • 21.Rehkopf DH, Strully KW, Dow WH. The short-term impacts of Earned Income Tax Credit disbursement on health. International journal of epidemiology. 2014:dyu172. doi: 10.1093/ije/dyu172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wolfe B, Jakubowski J, Haveman R, Courey M. The income and health effects of tribal casino gaming on American Indians. Demography. 2012;49(2):499–524. doi: 10.1007/s13524-012-0098-8. [DOI] [PubMed] [Google Scholar]
  • 23.Fernald LC, Gertler PJ, Neufeld LM. Role of cash in conditional cash transfer programmes for child health, growth, and development: an analysis of Mexico's Oportunidades. The Lancet. 2008;371(9615):828–37. doi: 10.1016/S0140-6736(08)60382-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Smith JP. Healthy bodies and thick wallets: the dual relation between health and economic status. The journal of economic perspectives: a journal of the American Economic Association. 1999;13(2):144. [PMC free article] [PubMed] [Google Scholar]
  • 25.Gardner J, Oswald AJ. Money and mental wellbeing: A longitudinal study of medium-sized lottery wins. Journal of health economics. 2007;26(1):49–60. doi: 10.1016/j.jhealeco.2006.08.004. [DOI] [PubMed] [Google Scholar]
  • 26.Apouey B, Clark AE. Winning big but feeling no better? The effect of lottery prizes on physical and mental health. Health economics. 2015;24(5):516–38. doi: 10.1002/hec.3035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Evans WN, Moore TJ. The short-term mortality consequences of income receipt. Journal of Public Economics. 2011;95(11):1410–24. [Google Scholar]
  • 28.Romich JL, Weisner T. How families view and use the EITC: Advance payment versus lump sum delivery. National tax journal. 2000:1245–65. [Google Scholar]
  • 29.Cohen AK, Syme SL. Education: a missed opportunity for public health intervention. American journal of public health. 2013;103(6):997–1001. doi: 10.2105/AJPH.2012.300993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Glymour MM, Manly JJ. Lifecourse social conditions and racial and ethnic patterns of cognitive aging. Neuropsychol Rev. 2008;18(3):223–54. doi: 10.1007/s11065-008-9064-z. Epub 2008/09/26. doi: 10.1007/s11065-008-9064-z. PubMed PMID: 18815889. [DOI] [PubMed] [Google Scholar]
  • 31.Manly JJ, Schupf N, Tang MX, Stern Y. Cognitive Decline and Literacy Among Ethnically Diverse Elders. Journal of Geriatric Psychiatry and Neurology. 2005;18(4):213. doi: 10.1177/0891988705281868. [DOI] [PubMed] [Google Scholar]
  • 32.Liu SY, Glymour MM, Zahodne LB, Weiss C, Manly JJ. Role of Place in Explaining Racial Heterogeneity in Cognitive Outcomes among Older Adults. Journal of the International Neuropsychological Society. 2015;21(09):677–87. doi: 10.1017/S1355617715000806. doi: doi:10.1017/S1355617715000806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Nguyen T, Tchetgen Tchetgen E, Kawachi I, Gilman SE, Walter S, Glymour MM. Comparing alternative effect decompostion methods: the role of literacy in mediating educational effects on mortality. Epidemiology. 2015 doi: 10.1097/EDE.0000000000000517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Nye B, Hedges LV, Konstantopoulos S. The long-term effects of small classes: A five-year follow-up of the Tennessee class size experiment. Educational Evaluation and Policy Analysis. 1999;21(2):127–42. [Google Scholar]
  • 35.Chetty R, Friedman JN, Hilger N, Saez E, Schanzenbach DW, Yagan D. How does your kindergarten classroom affect your earnings? Evidence from Project STAR. National Bureau of Economic Research. 2010 doi: 10.1093/qje/qjr041. [DOI] [PubMed] [Google Scholar]
  • 36.Lleras-Muney A. The relationship between education and adult mortality in the United States. The Review of Economic Studies. 2005;72(1):189–221. [Google Scholar]
  • 37.Silles MA. The causal effect of education on health: Evidence from the United Kingdom. Economics of Education Review. 2009;28(1):122–8. [Google Scholar]
  • 38.Glymour MM, Kawachi I, Jencks CS, Berkman LF. Does childhood schooling affect old age memory or mental status? Using state schooling laws as natural experiments. Journal of Epidemiology and Community Health. 2008;62(6):532–7. doi: 10.1136/jech.2006.059469. doi: 10.1136/jech.2006.059469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Nguyen T, Tchetgen Tchetgen E, Kawachi I, Gilman SE, Walter S, Liu S, Manly J, Glymour M. Instrumental variable approaches to identifying the causal effect of educational attainment on dementia risk. Annals of Epidemiology. 2015 doi: 10.1016/j.annepidem.2015.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Campbell F, Conti G, Heckman JJ, Moon SH, Pinto R, Pungello E, Pan Y. Early childhood investments substantially boost adult health. Science. 2014;343(6178):1478–85. doi: 10.1126/science.1248429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Conti G, Heckman JJ, Pinto R. The Effects of Two Influential Early Childhood Interventions on Health and Healthy Behaviors. National Bureau of Economic Research. 2015 doi: 10.1111/ecoj.12420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Sampson RJ, Raudenbush SW, Earls F. Neighborhoods and violent crime: A multilevel study of collective efficacy. Science. 1997;277(5328):918–24. doi: 10.1126/science.277.5328.918. [DOI] [PubMed] [Google Scholar]
  • 43.Katz B. Neighbourhoods of choice and connection: The evolution of American neighbourhood policy and what it means for the United Kingdom: Joseph Rowntree Foundation. 2004 [Google Scholar]
  • 44.Osypuk TL, Tchetgen Tchetgen EJ, Acevedo-Garcia D, Earls FJ, Lincoln A, Schmidt NM, Glymour MM. Differential mental health effects of neighborhood relocation among youth in vulnerable families: results from a randomized trial. Archives of general psychiatry. 2012;69(12):1284–94. doi: 10.1001/archgenpsychiatry.2012.449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kling JR, Liebman JB, Katz LF. Experimental analysis of neighborhood effects. Econometrica. 2007;75(1):83–119. [Google Scholar]
  • 46.Lindberg RA, Shenassa ED, Acevedo-Garcia D, Popkin SJ, Villaveces A, Morley RL. Housing interventions at the neighborhood level and health: a review of the evidence. Journal of Public Health Management and Practice. 2010;16(5):S44–S52. doi: 10.1097/PHH.0b013e3181dfbb72. [DOI] [PubMed] [Google Scholar]
  • 47.Osypuk TL, Joshi P, Geronimo K, Acevedo-Garcia D. Do Social and Economic Policies Influence Health? A Review. Current epidemiology reports. 2014;1(3):149–64. doi: 10.1007/s40471-014-0013-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Emerson RW, John Davis Batchelder Collection (Library of Congress) Essays. Vol. 303. J. Munroe and company; Boston: 1841. p. p. [Google Scholar]
  • 49.Vandenbroucke JP, Broadbent A, Pearce N. Causality and Causal Inference in Epidemiology - the need for a pluralistic approach. Int J Epidemiol. doi: 10.1093/ije/dyv341. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies, and the hierarchy of research designs. New England Journal of Medicine. 2000;342(25):1887–92. doi: 10.1056/NEJM200006223422507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Druesne-Pecollo N, Latino-Martel P, Norat T, Barrandon E, Bertrais S, Galan P, Hercberg S. Beta-carotene supplementation and cancer risk: a systematic review and metaanalysis of randomized controlled trials. International journal of cancer. 2010;127(1):172–84. doi: 10.1002/ijc.25008. [DOI] [PubMed] [Google Scholar]
  • 52.Van Poppel G. Epidemiological evidence for beta-carotene in prevention of cancer and cardiovascular disease. European journal of clinical nutrition. 1996;50:S57–61. [PubMed] [Google Scholar]
  • 53.Peto R, Doll R, Buckley JD, Sporn M. Can dietary beta-carotene materially reduce human cancer rates? 1981 doi: 10.1038/290201a0. [DOI] [PubMed] [Google Scholar]
  • 54.Nandi A, Harper S. How consequential is social epidemiology? A review of recent evidence. Current epidemiology reports. 2014;2(1):61–70. [Google Scholar]
  • 55.Galea S. An argument for a consequentialist epidemiology. American journal of epidemiology. 2013:kwt172. doi: 10.1093/aje/kwt172. [DOI] [PubMed] [Google Scholar]
  • 56.Collins J, Koplan JP. Health impact assessment: a step toward health in all policies. JAMA. 2009;302(3):315–7. doi: 10.1001/jama.2009.1050. [DOI] [PubMed] [Google Scholar]
  • 57.Marmot MG, Allen J, Kawachi I. From science to policy. In: Berkman LF, Glymour M, editors. Social epidemiology. 2nd Oxford University Press; New York, NY: 2014. [Google Scholar]

RESOURCES