Abstract
Theoretical models of mortality selection have great utility in explaining otherwise puzzling phenomena. The most famous example may be the black-white mortality crossover: at old ages, blacks outlive whites, presumably because few frail blacks survive to old ages while some frail whites do. Yet theoretical models of unidimensional heterogeneity, or frailty, do not speak to the most common empirical situation for mortality researchers—where some important population heterogeneity is observed while some is not. I show that, when one dimension of heterogeneity is observed and another is unobserved, neither the observed nor the unobserved dimension need behave as classic, unidimensional frailty models predict. For example, in a multidimensional model, mortality selection can increase the proportion of survivors who are disadvantaged, or “frail,” and can lead black survivors to be more frail than whites, along some dimensions of disadvantage. Transferring theoretical results about unidimensional heterogeneity to settings with both observed and unobserved heterogeneity produces misleading inferences about mortality disparities. The unusually flexible behavior of individual dimensions of multidimensional heterogeneity creates previously unrecognized challenges for empirically testing selection models of disparities, such as models of mortality crossovers.
The classical mortality selection model is a triumph of formal demography. It starts from the premise that people vary systematically in mortality risk and derives the conclusion that cohorts are progressively reduced to a group of robust survivors. Models of mortality selection have been used to explain phenomena such as mortality crossovers, reversals in the sign of a disparity (e.g., Berkman et al. 1989; Dupre et al. 2006; Eberstein et al. 2008; Fenelon 2013; Guillot 2007; Hoffman 2008; Huang and Wu 2010; Kestenbaum 1992; Lynch et al. 2003; Manton et al. 1979; Nam et al. 1978; Nam 1995; Pearl 1922; Rogers 2002; Thornton 2004; Thornton and Nam 1968; Zeng and Vaupel 2003); mortality deceleration, the slowing of mortality’s rise with age (e.g., Beard 1959, 1971; Fukui et al. 1993; Horiuchi and Wilmoth 1997, 1998; Kannisto 1992; Lynch and Brown 2001; Lynch et al. 2003; Olshansky 1998; Thatcher et al. 1998; Vaupel et al. 1979; Vaupel and Yashin 1985); and mortality compression, the concentration of deaths into a small age range (e.g., Engelman et al. 2010; Kannisto 2000; Lynch and Brown 2001; Lynch et al. 2003).
But the classical mortality selection model does not speak to some of the most important questions in modern empirical mortality research, which concern the potential contribution of particular dimensions of heterogeneity when other important dimensions are unobserved. The classical model is unidimensional: the heterogeneity that mortality selection acts on is captured by a single unobserved scalar fact about an individual, i.e., in disciplinary jargon, whether the individual is “frail” or “robust.” All standard models of mortality selection are unidimensional in this sense, irrespective of other modeling choices.
The classic unidimensional model, in all its forms, developed at a time when old-age mortality data were limited, and creative theorizing made up for what could not yet be measured. Yet social science theories of stratification are multidimensional and intersectional, and substantive knowledge of health stratification suggests that there are many overlapping, yet distinct, risk factors for mortality (Bowleg 2012). Increasingly, covariate-rich datasets allow some of these distinct dimensions of population heterogeneity to be measured and offer new opportunities to analyze how particular heterogeneities contribute to changing mortality disparities, rather than treating “frailty” as a black box. But since measured heterogeneity is always partial, work that foregrounds selection still needs to engage with unmeasured heterogeneity alongside measured covariates.
The need for a theory of multidimensional mortality selection is underscored by recent empirical analyses, which use unidimensional theory to ask multidimensional questions about the black-white mortality crossover. The black-white mortality crossover is the phenomenon that black mortality exceeds white mortality at younger ages but falls below white mortality around age 85. The classical selection explanation for the crossover posits that blacks as a group are subject to greater selective pressure than whites, since their mortality is higher (e.g., Thornton and Nam 1968, Vaupel et al. 1979, Vaupel and Yashin 1985, Nam 1994, Lynch et al. 2003). Thus, old-age survivors include only the most robust members of the original black cohort, but a broader cross-section of the original white cohort. This longstanding theoretical explanation has increasingly been engaged by empirical studies (Berkman et al. 1989, Dupre et al. 2006, Sautter et al. 2012, Yao and Robert 2011) that try to identify which particular dimensions of heterogeneity might constitute this “frailty.” In current practice, such research draws theoretically on mortality selection models designed to compare full populations (e.g., blacks vs. whites) in the presence of unobserved heterogeneity, but asks questions that rely on complex, nested comparisons (e.g., blacks vs. whites with and without stratifying on a consequential health risk). This paper shows that the insights of the unidimensional selection model cannot be imported into the multidimensional setting and used to license the same kinds of predictions.
To address this gap between formal theory and empirical practice, I offer a model of the black-white crossover in the presence of multiple dimensions of heterogeneity and investigate its behavior. This work builds on prior research into the behavior of covariates in survival models.1 Yashin and Manton made early advances in incorporating unobserved covariates into more empirically realistic survival analyses (Yashin and Manton 1997) and in estimating unobserved heterogeneity from survival models with an observed covariate and an assumed baseline distribution of the unobserved covariate (Yashin et al. 1985). More broadly, an early line of research promised to meld the theoretical precision of mortality selection modeling with the empirical richness of new longitudinal data. This tradition explored multidimensional models that focused on single-population phenomena, such as mortality deceleration (Manton et al. 1994, 1995; Manton and Woodbury 1983; Woodbury and Manton 1983), and more recently was picked up in theoretical work by Finkelstein and Esaulova (2008) and Finkelstein (2012). Finkelstein (2012) considers a two-dimensional frailty model of mortality deceleration and suggests an approach used in the analysis here: successively breaking populations into heterogeneous subpopulations defined by a single dimension of frailty, and then breaking those subpopulations into homogeneous groups. But while studying mortality deceleration involves analyzing a single population, understanding mortality crossovers requires comparing selection processes unfolding in multiple populations (e.g., blacks and whites). Other analyses (Bretagnolle and Huber-Carol 1988; Henderson and Oman 1999; see discussion in Wienke 2010: 127–130) that, like the current paper, model multiple observed covariates in the presence of unobserved heterogeneity focus on quantifying the bias in estimated covariate effects.
Together, these strands of prior research form the lineage for the current paper, which asks a different question: what happens to a mortality disparity, such as the black-white disparity, when we incorporate a new covariate that we hypothesize to be part of the mortality selection process? Does the disparity change in a predictable way? In particular, how does the presence, absence, or timing of a mortality crossover change when the “frailty” that produces the crossover is partially adjusted for?
In short, I analyze whether the insights developed in unidimensional mortality selection theory can be extended to incorporate covariates representing partial measures of population heterogeneity. I show that, in general, they cannot. Multidimensional mortality selection and unidimensional mortality selection offer similar perspectives when all of the heterogeneity in a population is observed, or when none of it is observed. But unidimensional heterogeneity models offer no clear guidance about a multidimensional reality in which some dimensions of heterogeneity are observed while others remain unobserved. Yet this is the most common situation for social scientists studying mortality with datasets that include social and biological covariates representing some—but not all—of the heterogeneity within each population. The fact that individual dimensions of “frailty” need not behave like frailty as a whole implies that, when selection is occurring along multiple dimensions simultaneously, one cannot recover how it occurs along any one dimension (even qualitatively) without accounting for the other dimensions. This also implies that stratifying the crossover on observed heterogeneity offers quite limited information about the underlying selection processes. Nevertheless, I also show that it is possible to make some predictions about how the age at crossover responds to stratifying on key dimensions of heterogeneity, if certain assumptions can be made. This provides a direction for developing multidimensional selection theory.
I proceed by first presenting the core features of unidimensional mortality selection and then contrasting it with multidimensional mortality selection. For the multidimensional model, I outline two (alternative) predictions about how conditioning on an observed dimension of heterogeneity, in the presence of unobserved heterogeneity, should move the age at crossover. Neither prediction is supported: conditioning on partial measures of “frailty” has essentially unpredictable consequences without quite specific assumptions. I show that key facts about unidimensional heterogeneity do not hold for partially observed multidimensional heterogeneity, highlighting some previously unrecognized theoretical possibilities, such as frailty increases (mortality selection can lead populations to become more frail as they age) and frailty reversals (mortality selection can lead black survivors to be more frail than white survivors), that result from the intrinsic interactivity of multidimensional models.
Throughout, I adhere to the following terminological conventions. I consider two populations: blacks and whites. Populations may be stratified by one or two dimensions of heterogeneity, which may be unobserved or observed. The unobserved dimension of heterogeneity is always called frailty (in the unidimensional model) or residual frailty (in the multidimensional model), while the observed dimension of heterogeneity is called exposure. I call populations stratified by one dimension of heterogeneity subpopulations, e.g. the subpopulation of robust blacks or the subpopulation of exposed whites. I call populations stratified by two dimensions of heterogeneity groups, e.g. the group of exposed robust blacks, or the group of unexposed frail whites. All populations, subpopulations, and groups are analyzed as closed cohorts.
Mortality Selection with Unidimensional Heterogeneity
The classic model of mortality selection with unidimensional heterogeneity will serve as a baseline for the distinctive dynamics of mortality selection with multidimensional heterogeneity.
Unidimensional Mortality Selection Model
The classical mortality selection model (e.g., Vaupel et al. 1979, Vaupel and Yashin 1985) divides the black and white populations along a single dimension of heterogeneity, called frailty. I analyze frailty as a binary variable, which results in four internally homogeneous subpopulations defined by race $k \in \{b, w\}$ and frailty $j \in \{f, r\}$. Binary frailty allows the paper’s insights to be expressed in the simplest and most direct way; the implications of this modeling choice are discussed below, after I introduce the multidimensional model. Frailty is unobserved. The subpopulations have proportional Gompertz hazards,
| $\mu_{k,j}(a) = \alpha_{k,j}\, e^{\beta a}$ | (1) |
with shared slope β > 0 over age a ≥ 0 and intercepts $\alpha_{k,j}$. The subpopulation-specific intercepts are defined as
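| $\alpha_{w,r} = \alpha, \quad \alpha_{w,f} = \alpha f, \quad \alpha_{b,r} = \alpha b, \quad \alpha_{b,f} = \alpha b f$ | (2) |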
Thus, conditional on frailty, black subpopulations have higher mortality than white subpopulations in proportion b > 1 (the black mortality multiplier); and, conditional on race, frail subpopulations have higher mortality than robust subpopulations in proportion f > 1 (the frail mortality multiplier).2
Aggregate mortality for the black and white populations is a weighted average of the mortalities of the frail and robust subpopulations within each race,
| $\mu_k(a) = \pi_k(a)\, \mu_{k,f}(a) + [1 - \pi_k(a)]\, \mu_{k,r}(a)$ | (3) |
where $0 \le \pi_k(a) \le 1$ is the proportion of race k that is frail and $1 - \pi_k(a)$ is the proportion of race k that is robust at age a. The proportion frail, in turn, is given by
| $\pi_k(a) = \dfrac{c}{c + s_k(a)}$ | (4) |
where $c = \pi(0) / [1 - \pi(0)]$, a constant, is the ratio of frail to robust members of the population at baseline (assumed to be the same among blacks and whites3) and $s_k(a)$ is the ratio of robust to frail survivorship within each race at age a, where $s_k(a) = \exp\{(\alpha_{k,f} - \alpha_{k,r})(e^{\beta a} - 1)/\beta\}$. Since the frail die more quickly than the robust, the survivorship ratio $s_k(a)$ increases, and the proportion frail, $\pi_k(a)$, decreases monotonically with age. Since blacks always have higher mortality in each subpopulation, but not necessarily in the aggregate, the crossover is an example of Simpson’s paradox (e.g., Hernán et al. 2011, Hutchinton et al. 2000).
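The model in Equations 1 through 4 can be sketched numerically. The following is a minimal illustration, not the paper's own code: the parameter values are hypothetical, the function names are invented for this example, and the intercepts are assumed to combine the race and frailty multipliers multiplicatively.

```python
import math

# Hypothetical parameter values, chosen only for illustration (not from the paper).
ALPHA = 1e-5   # Gompertz intercept of the robust white subpopulation
BETA = 0.09    # shared Gompertz slope
B = 1.5        # black mortality multiplier (b > 1)
F = 20.0       # frail mortality multiplier (f > 1)
PI0 = 0.5      # proportion frail at baseline, identical for both races


def intercept(race, frailty):
    """Subpopulation intercept, assuming the multipliers combine multiplicatively."""
    return ALPHA * (B if race == "b" else 1.0) * (F if frailty == "f" else 1.0)


def hazard(age, race, frailty):
    """Gompertz hazard of one homogeneous subpopulation."""
    return intercept(race, frailty) * math.exp(BETA * age)


def survival(age, race, frailty):
    """Closed-form Gompertz survivorship from age 0 to `age`."""
    return math.exp(-(intercept(race, frailty) / BETA) * (math.exp(BETA * age) - 1.0))


def prop_frail(age, race):
    """Proportion of survivors of `race` who are frail at `age`."""
    frail = PI0 * survival(age, race, "f")
    robust = (1.0 - PI0) * survival(age, race, "r")
    return frail / (frail + robust)


def aggregate_hazard(age, race):
    """Race-level hazard: frailty-weighted average of the subpopulation hazards."""
    p = prop_frail(age, race)
    return p * hazard(age, race, "f") + (1.0 - p) * hazard(age, race, "r")


def crossover_age(max_age=110.0, step=0.1):
    """First grid age at which white aggregate mortality exceeds black, or None."""
    for n in range(int(round(max_age / step)) + 1):
        age = n * step
        if aggregate_hazard(age, "w") > aggregate_hazard(age, "b"):
            return age
    return None
```

Calling `crossover_age()` scans a grid of ages; whether a crossover appears at all depends on the assumed multipliers, since a small frail multiplier relative to the black multiplier leaves too little compositional difference to offset the subpopulation-level disadvantage.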
Four Facts about this Unidimensional Heterogeneity Model of Racial Disparities
The unidimensional frailty model of the black-white mortality crossover just presented makes two key assumptions, from which two results follow.
First, by assumption, conditional on frailty, black mortality exceeds white mortality at every age, $\mu_{b,j}(a) > \mu_{w,j}(a)$.
Second, by assumption, conditional on race, the frail have higher mortality than the robust at every age, $\mu_{k,f}(a) > \mu_{k,r}(a)$.
From the second assumption, it follows that, within each race, the proportion of survivors who are frail declines monotonically over age, $d\pi_k(a)/da < 0$ (Vaupel and Yashin 1985).
From the first and second assumptions together, it follows that, if blacks and whites have the same proportion frail at baseline, blacks have a smaller proportion of frail survivors than do whites at every subsequent age, $\pi_b(a) < \pi_w(a)$ for $a > 0$ (Vaupel et al. 1979). Thus, the racial difference in the frailty of survivors results from the interaction of the between-race disadvantage of blacks and the within-race disadvantage of the frail.
These four generalizations provide a crucial point of comparison for the multidimensional model below.4
Two roles of unidimensional frailty
The interaction between the disadvantage of blacks and the disadvantage of the frail is depicted visually in Figure 1a, which illustrates the functional relationships given in Equations (1)–(4) and provides a point of comparison for the multidimensional model introduced below. Figure 1a shows that the black mortality multiplier affects the mortality of robust and frail blacks, while the frail mortality multiplier affects the mortality of frail blacks and whites. The mortality of both frail and robust blacks affects the proportion of black survivors who are frail, and each of those three terms, in turn, affects aggregate black mortality; likewise for whites. (Figure 1a omits the parameters that all subpopulations share: α, β, and π(0).)
Figure 1a.
Functional relationships between the race and frailty multipliers and aggregate mortality in the unidimensional mortality selection model.
Figure 1b zooms in on the part of Figure 1a depicting the effects of the frail mortality multiplier, f, on aggregate race-specific mortality, with +/− marks indicating the sign of each effect.5 It shows that f plays two competing roles in aggregate mortality. On the one hand, increasing f raises aggregate mortality by raising the mortality of the frail. On the other hand, because f raises the mortality of the frail at each age, it also lowers aggregate mortality by reducing the proportion of the frail who survive to old age. The functional relationships shown qualitatively here are given quantitatively in Supplement 1.6 These two roles of frailty interact to create the potential for a crossover.
Figure 1b.
Two roles of unidimensional frailty in aggregate mortality.
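The two roles can be made explicit with a short worked derivative. This is a sketch that assumes the weighted-average form of aggregate mortality described above, writes hazards as $\mu$ (an assumed notation), and uses the fact that robust mortality does not depend on f; it is not reproduced from the supplement.

```latex
\begin{align*}
\mu_k(a) &= \pi_k(a)\,\mu_{k,f}(a) + [1 - \pi_k(a)]\,\mu_{k,r}(a) \\[4pt]
\frac{\partial \mu_k(a)}{\partial f}
  &= \underbrace{\pi_k(a)\,\frac{\partial \mu_{k,f}(a)}{\partial f}}_{\text{role 1: } > 0}
   \;+\; \underbrace{\bigl[\mu_{k,f}(a) - \mu_{k,r}(a)\bigr]\,
         \frac{\partial \pi_k(a)}{\partial f}}_{\text{role 2: } < 0 \text{ for } a > 0}
\end{align*}
```

The first term is positive because a larger f raises frail mortality. The second is negative at every age after baseline because frail mortality exceeds robust mortality while a larger f shrinks the surviving frail share. Which term dominates at a given age is what the crossover analysis turns on.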
The Black-White Mortality Crossover with Unidimensional Heterogeneity
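A crossover occurs when aggregate white mortality exceeds aggregate black mortality, $\mu_w(a) > \mu_b(a)$. This crossover condition can be decomposed into three terms by rearranging the expanded forms of black and white mortality as given in Equation 3:
| $\pi_b(a)\,[\mu_{b,f}(a) - \mu_{w,f}(a)] + [1 - \pi_b(a)]\,[\mu_{b,r}(a) - \mu_{w,r}(a)] + [\pi_b(a) - \pi_w(a)]\,[\mu_{w,f}(a) - \mu_{w,r}(a)] < 0$ | (5) |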
The key point is that the four facts about mortality with unidimensional heterogeneity fully determine the sign of all three terms.
The first term is the black-white difference in the mortality of the frail, weighted by the proportion of the black population that is frail, $\pi_b(a)$. The second term is the black-white difference in the mortality of the robust, weighted by the proportion of the black population that is robust, $1 - \pi_b(a)$. These two terms are always positive because black mortality is always higher than white mortality, conditional on frailty.
The third term is the black-white difference in the proportion frail, weighted by the frail-robust difference in the mortality of whites, $\mu_{w,f}(a) - \mu_{w,r}(a)$. This term is always negative because its two factors have different signs: the black-white difference in the proportion frail is always negative, whereas the frail-robust mortality difference among whites is always positive.7 I call this third term the frailty factor. It represents the contribution of frailty-induced mortality selection to the racial difference in mortality. The frailty factor illuminates the dynamics of the multidimensional heterogeneity model below.
Equation 5 highlights the tradeoff at the heart of the crossover in the unidimensional selection model: higher black disaggregated mortality, but lower frailty among black survivors. This makes crossover dynamics with unidimensional heterogeneity qualitatively simple: the only question is whether and when the white-black compositional difference will outweigh the black mortality disadvantage at the subpopulation level.
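The three-term decomposition can be recovered from the weighted-average form of aggregate mortality by adding and subtracting white subpopulation mortalities. The steps below are a reconstruction under that assumption, with hazards written as $\mu$ (an assumed notation) and the age argument suppressed for brevity.

```latex
\begin{align*}
\mu_b - \mu_w
 &= \pi_b\,\mu_{b,f} + (1-\pi_b)\,\mu_{b,r} - \pi_w\,\mu_{w,f} - (1-\pi_w)\,\mu_{w,r} \\
 &= \pi_b\,(\mu_{b,f} - \mu_{w,f}) + (1-\pi_b)\,(\mu_{b,r} - \mu_{w,r})
    + (\pi_b - \pi_w)\,\mu_{w,f} + (\pi_w - \pi_b)\,\mu_{w,r} \\
 &= \underbrace{\pi_b\,(\mu_{b,f} - \mu_{w,f})}_{>0}
  + \underbrace{(1-\pi_b)\,(\mu_{b,r} - \mu_{w,r})}_{>0}
  + \underbrace{(\pi_b - \pi_w)\,(\mu_{w,f} - \mu_{w,r})}_{<0}
\end{align*}
```

A crossover occurs at ages where the negative third term outweighs the sum of the first two, so that the whole expression falls below zero.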
The first column of Table 1 summarizes these key features of mortality selection with unidimensional heterogeneity, which will serve as a point of comparison with the multidimensional model.
Table 1.
Key comparisons between mortality selection with unidimensional and multidimensional heterogeneity
| | Unidimensional Heterogeneity | Multidimensional Heterogeneity¹ |
|---|---|---|
| I. | Two roles of frailty in population-level mortality | Three roles of residual frailty in population-level mortality² |
| | 1. Frailty increases the mortality of the frail subpopulation. | 1. Residual frailty increases the mortality of the exposed and non-exposed frail groups. |
| | 2. Frailty decreases the proportion of survivors who are frail in the full population. | 2. Residual frailty decreases the proportion of survivors who are residually frail in the exposed and non-exposed subpopulations. |
| | | 3. Residual frailty can increase or decrease the proportion of survivors who are exposed in the full population. |
| II. | Four Key Facts about the Unidimensional Heterogeneity Model of Racial Disparities | The Four Key Facts Need Not Apply |
| | 1. Conditional on frailty, black mortality exceeds white mortality. | 1. Conditional on residual frailty, black mortality can be higher or lower than white mortality (subpopulation crossovers are possible). |
| | 2. Conditional on race, frail mortality exceeds robust mortality. | 2. Conditional on race, residually frail mortality can be higher or lower than residually robust mortality (frailty crossovers are possible). |
| | 3. The share of survivors who are frail decreases with age. | 3. The share of survivors who are residually frail may increase or decrease with age (frailty increases are possible). |
| | 4. Black survivors are less likely than white survivors to be frail. | 4. Black survivors can be less likely or more likely than white survivors to be residually frail (frailty reversals are possible). |
| III. | Decomposition of population-level mortality crossover (Equation 5) | Decomposition of population-level mortality crossover (Equation 11) |
| | All terms have known sign. | All terms have unknown sign. |
| | Stratifying on frailty increases black-white mortality disparity. | Stratifying on residual frailty can increase or decrease black-white mortality disparity. |
| | Stratifying on frailty removes the crossover. | Stratifying on residual frailty can make age at crossover older or younger. |

¹ Everything stated about unobserved residual frailty, with respect to the multidimensional model in the right column, also pertains to the observed exposure dimension of heterogeneity.
² The roles of residual frailty depend on how mortality is decomposed. This table uses the decomposition of mortality given in the text and in Figure 5, in which exposure composition is represented at the population level and residual frailty composition is represented at the subpopulation level, in order to match empirical situations.
Mortality Selection with Multidimensional Heterogeneity
The unidimensional mortality selection model is the central reference point for work on mortality crossovers. But it is the wrong reference point for recent empirical work on the black-white mortality crossover, which is fundamentally multidimensional. These studies (Dupre et al. 2006, Sautter et al. 2012) ask what happens to the crossover when a particular dimension of heterogeneity is observed and other dimensions remain unobserved. They stratify on the observed dimension of heterogeneity and compare the ages at black-white crossover of the resulting subpopulations with the age at crossover of the aggregate populations. To formalize the theory implicit in this practice, I propose a model of mortality selection with partially-observed multidimensional heterogeneity, show that it behaves quite differently from the unidimensional model, and analyze the crossover age in the new model.
Multidimensional Mortality Selection Model
To demonstrate that multidimensional selection models with partially observed heterogeneity exhibit intrinsically different behaviors than the classical unidimensional model, I present a multidimensional model that differs from it in only one respect: each racial population is crosscut by, not one, but two dimensions of fixed heterogeneity. The observed dimension of heterogeneity describes whether or not people suffered a deleterious exposure (e.g., tobacco exposure in utero, since maternal smoking satisfies the model assumptions tolerably well: it raises mortality, is fixed at birth, and is relatively evenly distributed by race [Curtin and Mathews 2016]). The unobserved dimension of heterogeneity describes whether people are residually frail or residually robust.
The multidimensional model thus contains eight internally homogeneous groups defined by race $k \in \{b, w\}$, observed exposure $i \in \{t, n\}$, and unobserved residual frailty $j \in \{f, r\}$. The groups have proportional Gompertz hazards,
| $\mu_{k,i,j}(a) = \alpha_{k,i,j}\, e^{\beta a}$ | (6) |
with shared slope β > 0 over age a ≥ 0 and group-specific intercepts $\alpha_{k,i,j}$. The intercepts are defined as
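| $\alpha_{k,i,j} = \alpha \cdot b^{\mathbf{1}\{k=b\}} \cdot t^{\mathbf{1}\{i=t\}} \cdot (f^{*})^{\mathbf{1}\{j=f\}}$ | (7) |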
where b > 1 is the black mortality multiplier, as before; f* > 1 is the residual frailty mortality multiplier; and t > 1 is the exposure mortality multiplier. (The exposed groups are designated with t as in tobacco exposure, or treatment.) I assume that, at baseline, both unobserved residual frailty and observed exposure are distributed independently of race, though not necessarily of each other.
The group-specific mortalities are analogous to the unidimensional model’s subpopulation-specific mortalities. In the multidimensional model, each set of subpopulations defined by one dimension of heterogeneity, aggregating over the other dimension (e.g., tobacco-exposed whites, aggregated over residual frailty), is a separate instantiation of the unidimensional model.
If both dimensions of heterogeneity were observed, then the black and white populations could be analyzed straightforwardly in terms of their component groups. If neither dimension of heterogeneity were observed, then the black and white populations could be analyzed as having just one dimension of heterogeneity with four (rather than two) categories, i.e., as a version of the classical unidimensional heterogeneity model. This multidimensional selection model speaks to a third situation—at the heart of recent empirical work on the crossover—where one dimension of heterogeneity is observed and the other is unobserved.
Mortality in the subpopulation defined by race k and observed exposure i is the weighted average of the mortalities of the residually frail and residually robust groups in the subpopulation,
| $\mu_{k,i}(a) = \pi_{k,i}(a)\, \mu_{k,i,f}(a) + [1 - \pi_{k,i}(a)]\, \mu_{k,i,r}(a)$ | (8) |
where $\pi_{k,i}(a)$ is the proportion of frail members of the subpopulation with exposure i, and $1 - \pi_{k,i}(a)$ is the proportion robust.8 By assumption, $\mu_{k,i}(a)$ is observed, but its component parts are not.
Aggregate mortality of race k is a weighted average of the subpopulation-specific mortalities,
| $\mu_k(a) = T_k(a)\, \mu_{k,t}(a) + [1 - T_k(a)]\, \mu_{k,n}(a)$ | (9) |
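All terms of Equation 9 are observed, and $T_k(a)$ is the proportion of each race that is exposed,
| $T_k(a) = \dfrac{p_{t,f}\, S_{k,t,f}(a) + p_{t,r}\, S_{k,t,r}(a)}{\sum_{i \in \{t,n\}} \sum_{j \in \{f,r\}} p_{i,j}\, S_{k,i,j}(a)}$ | (10) |
where $p_{i,j}$ is the baseline share of each race with exposure i and residual frailty j, and $S_{k,i,j}(a)$ is the survivorship of group (k, i, j) to age a.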
By assumption, $T_k(a)$ is observed, but its component parts are not. Note that $T_k(a)$ is defined at the population level, whereas $\pi_{k,i}(a)$ is defined at the subpopulation level.9 The interaction between these two dimensions of heterogeneity drives the distinctive behavior of heterogeneity at the aggregate population level (as in $T_k(a)$) compared to heterogeneity at the subpopulation level (as in $\pi_{k,i}(a)$), which the following sections will elucidate.10
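The two-level aggregation just described can be sketched in code. This is a minimal illustration under stated assumptions, not the paper's implementation: the three multipliers are assumed to combine multiplicatively, all parameter values and the baseline joint distribution `P0` are hypothetical, and the function names are invented for this example.

```python
import math

# Hypothetical values for illustration only (not taken from the paper).
ALPHA, BETA = 1e-5, 0.09          # shared Gompertz intercept and slope
B, T, F_STAR = 1.5, 10.0, 2.0     # race, exposure, and residual-frailty multipliers
# Baseline joint shares of (exposure, residual frailty): identical for both races,
# with exposure and residual frailty negatively associated.
P0 = {("t", "f"): 0.05, ("t", "r"): 0.45, ("n", "f"): 0.45, ("n", "r"): 0.05}


def intercept(k, i, j):
    """Group intercept, assuming the three multipliers combine multiplicatively."""
    return (ALPHA * (B if k == "b" else 1.0) * (T if i == "t" else 1.0)
            * (F_STAR if j == "f" else 1.0))


def hazard(a, k, i, j):
    """Gompertz hazard of one homogeneous group."""
    return intercept(k, i, j) * math.exp(BETA * a)


def survivors(a, k, i, j):
    """Share of the original race-k cohort still alive at age a in group (i, j)."""
    return P0[(i, j)] * math.exp(-(intercept(k, i, j) / BETA) * (math.exp(BETA * a) - 1.0))


def prop_exposed(a, k):
    """Exposed share of race-k survivors (population level)."""
    exposed = sum(survivors(a, k, "t", j) for j in "fr")
    total = sum(survivors(a, k, i, j) for i in "tn" for j in "fr")
    return exposed / total


def prop_frail_within(a, k, i):
    """Residually frail share within exposure stratum i (subpopulation level)."""
    frail = survivors(a, k, i, "f")
    return frail / (frail + survivors(a, k, i, "r"))


def subpop_hazard(a, k, i):
    """Hazard of the race-by-exposure subpopulation, pooling over residual frailty."""
    p = prop_frail_within(a, k, i)
    return p * hazard(a, k, i, "f") + (1.0 - p) * hazard(a, k, i, "r")


def aggregate_hazard(a, k):
    """Race-level hazard, pooling the two exposure subpopulations."""
    t = prop_exposed(a, k)
    return t * subpop_hazard(a, k, "t") + (1.0 - t) * subpop_hazard(a, k, "n")


def prop_frail_overall(a, k):
    """Residually frail share of all race-k survivors, pooling over exposure."""
    frail = sum(survivors(a, k, i, "f") for i in "tn")
    total = sum(survivors(a, k, i, j) for i in "tn" for j in "fr")
    return frail / total
```

With these particular hypothetical values, where exposure is far more lethal than residual frailty and the two are negatively associated at baseline, the residually frail share falls with age within each exposure stratum yet rises in the pooled population, and at age 70 it is higher among black than among white survivors. Other parameter choices give the classical pattern instead.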
A note on model interpretation
Two key parametric forms dominate the frailty literature: binary (e.g., Vaupel and Yashin 1985, Lynch et al. 2003, Wrigley-Field 2014) and gamma-distributed (e.g., Vaupel et al. 1979, Manton et al. 1981, Horiuchi and Wilmoth 1998, Wienke et al. 2003, Gampe et al. 2010) frailty. In general, researchers use gamma-distributed frailty when the point is to better match empirical reality, because much consequential variation between individuals is continuous, and often use binary frailty when the point is to produce conceptual insight about the nature of selection dynamics (for example, the stylized selection models in the classic “Heterogeneity’s Ruses” [Vaupel and Yashin 1985]). This paper is in the latter tradition. It uses a binary frailty model because its aims are pedagogical and the paper’s core intuitions are easier to grasp in a simplified context. (A supplemental appendix introduced later on discusses some of the results using an alternative, gamma specification.)
In the model used here, it can seem natural to read both the frailty multiplier and the black mortality multiplier as implying intrinsic, perhaps even genetic, individual disadvantages. It is important to note that the model used here does not imply this—and indeed that these are unrealistic interpretations of both racial disadvantages and of the stable inequalities captured by “frailty.”
Binary frailty can be thought of as a simplification of a context in which most consequential disadvantages are heavily clustered rather than independently distributed, so that the population crystallizes into sharply distinct advantaged and disadvantaged social groups along just a few dimensions.
Similarly, the black mortality multiplier is a mathematical construct that simplifies the vast complexity of racism in the United States. It makes one key assumption: being socially categorized as black is disadvantageous for everyone who is categorized that way. This does not mean that all blacks are disadvantaged in total compared to all whites (indeed, if that were true, no crossover would be possible). Blacks may be disadvantaged by their race but also advantaged in ways represented as exposure and residual frailty, where whites may be disadvantaged. But it does mean that all blacks are disadvantaged relative to the mortality we should expect for them if they were treated as whites are currently treated; in that sense, the model assumes that racial disadvantage is ubiquitous.11
Nor does the black mortality multiplier (a cohort-specific parameter) imply that racial disadvantage is ahistorical or unchanging. Both the magnitude and real social content of the individual-level disadvantage associated with being socially categorized as black would be different for a cohort whose members grew up under near-universal segregation enforced by legal and extra-legal violence, than for a cohort growing up under formal legal equality, mass incarceration, and racialized poverty. The black mortality multiplier does imply that racial disadvantage does not attenuate over the life course for individuals. This assumption is not an empirical claim, but it is important for this paper’s pedagogical purpose: it ensures that the model’s crossovers reflect selection rather than life course dynamics.12
Thus, a model with binary frailty and a black mortality multiplier is used here because it focuses attention on the multidimensional selection dynamics that produce the surprising patterns shown below, while simplifying away other complexities not essential to those patterns. For example, capturing black disadvantage explicitly in a mortality multiplier, rather than implicitly in the frailty distribution, preserves in a clear and accessible fashion the distinction between black/white inequality at the individual level (represented in that multiplier) and population-level disparity. This distinction is at the heart of most interpretations of the crossover.13 Additionally, much of the paper compares aggregate populations with subpopulations that are defined by sharing a particular level of frailty along one dimension. This comparison is a tractable way to present results in a simple form when the population can be broken down into only a few subpopulations, such as the exposed and the non-exposed.14 However, in a model where both dimensions of heterogeneity are continuous, there are infinitely many subpopulations (or, if heterogeneity values are rounded, at least very many subpopulations). An analysis that statistically adjusts for the level of (for example) exposure must choose which exposure levels to use to define the subpopulations of interest. This paper brackets these issues by using a model with just two observed subpopulations and analyzing both. By treating both dimensions of heterogeneity as binary, it also highlights the mathematical symmetry between observed and unobserved dimensions.
How Stratifying on Partially-Observed Heterogeneity Might Change the Age at Crossover: Two Predictions
Identifying particular dimensions of heterogeneity that contribute to the aggregate black-white crossover requires a testable prediction derived from a multidimensional selection model. A natural place to look for testable predictions is in the outcome that dominates research on mortality selection: the age at onset for some mortality selection artifact (e.g., Berkman et al. 1989; Dupre et al. 2006; Horiuchi and Wilmoth 1997, 1998; Lynch and Brown 2001; Lynch et al. 2003, Sautter et al. 2012, Yao and Robert 2011), in this case the crossover. Thus, to connect the multidimensional heterogeneity model to empirical research, consider the question: What happens to the extent of racial disparities in mortality—and what happens to the age at crossover—when black and white mortality are stratified on an observed dimension of heterogeneity (“uterine tobacco exposure”) while another dimension (“residual frailty”) goes unobserved?
The two predictions described below do not follow directly from the unidimensional model, which is silent on multidimensional applications. Rather, they represent alternative attempts to generalize that model’s logic to the questions asked in empirical practice. I will assess how these predictions fare in describing the crossover under partial stratification.
Prediction 1.
This prediction is used in empirical literature that references only formal models of unidimensional, not multidimensional, heterogeneity. This recent empirical work on the black-white mortality crossover (Dupre et al. 2006; Sautter et al. 2012) first presents a hypothesis that mortality selection in black and white populations operates simultaneously on an observed dimension of heterogeneity (such as poverty, education, or religiosity) and an unobserved dimension of heterogeneity, residual frailty.15 It offers predictions about the ages at crossover in the aggregate and in subpopulations when the black and white populations are stratified by the observed heterogeneity, and tests these predictions in empirical data, concluding that the observed dimension is (in the case of poverty and religiosity) or is not (in the case of low education) a dimension of the heterogeneity that produces the crossover in the aggregate.
Both Sautter et al. (2012) and Dupre et al. (2006) use the criterion that a trait is “[a source] of heterogeneity in individual frailty that contribute[s] to the Black-White mortality crossover” (Sautter et al. 2012:1566) if two regression coefficients on mortality are statistically significant: the trait interacted with age and with race.16 They further seem to take this criterion as coextensive with the criterion that the observed trait is part of “frailty” (i.e. multidimensional heterogeneity) if and only if conditioning on the trait changes the age at crossover (in some direction). The Dupre/Sautter criterion, then, proposes testing a model with partially observed, multidimensional heterogeneity by conditioning on the observed dimension and assessing whether the age at crossover changes. This prediction is not derived from any formal model of multidimensional heterogeneity.
Prediction 2.
In translating the unidimensional model into a multidimensional setting, one might also expect a more specific prediction to hold. If each dimension of multidimensional heterogeneity—such as uterine tobacco exposure and residual frailty, or low education and residual frailty—behaved like unidimensional frailty, then each dimension would have a predictable effect on black-white disparities. Specifically, the tobacco-exposed would necessarily have higher mortality than the non-exposed, more surviving whites would necessarily be exposed to tobacco than surviving blacks, and tobacco exposure would necessarily raise aggregate white mortality relative to black. Stratifying on observed tobacco exposure would therefore raise black mortality relative to white, delaying the crossover to an older age: the aggregate population would necessarily reach crossover before the subpopulations. This would constitute a testable prediction of the multidimensional heterogeneity model.
In what follows, I will show that neither the Dupre/Sautter prediction, nor this more specific prediction about crossover order, follows from the multidimensional heterogeneity model.
Unexpected Behaviors of Multidimensional Heterogeneity: The Four Key Facts About Unidimensional Heterogeneity Do Not Apply
In the unidimensional model, I identified four key facts and a resulting decomposition for the black-white mortality crossover in which all terms had known sign. None of these generalizations extend to the individual dimensions of multidimensional heterogeneity. That is, in a multidimensional heterogeneity scenario where some—but not all—dimensions of heterogeneity are observed, neither the observed nor the unobserved dimensions necessarily behave like unidimensional frailty.
The distinctive behaviors of the multidimensional model include phenomena that I label subpopulation race crossovers, frailty crossovers, frailty increases, and frailty reversals. The first two possibilities are straightforward extensions of the unidimensional model to the multidimensional context; the latter two are more surprising departures from unidimensional selection.
I. Subpopulation race crossovers—
In the unidimensional model, conditional on frailty, j, black mortality is always higher than white mortality. By contrast, in the multidimensional model, the subpopulations can have their own race crossovers. Conditional on unobserved residual frailty, black mortality can be either higher or lower than white mortality at any given age. Analogously, conditional on observed exposure, black mortality can be either higher or lower than white mortality. For example, Figure 2 illustrates a cohort in which, in the exposed subpopulation, black mortality is higher than white mortality before age 70 and after age 76, but lower than white mortality in between. Figure 2 and all following numerical illustrations come from a large universe of simulations described and analyzed in Supplement 3; the specific parameter values for all illustrative figures are given in Table S3.2.
Figure 2.
Subpopulation crossover: Black-white mortality crossovers can occur in the exposed subpopulation and in the non-exposed subpopulation.
These black-white subpopulation crossovers can occur because each subpopulation defined by stratifying on the observed dimension of heterogeneity instantiates the unidimensional heterogeneity model given in Equations 1–3.
II. Frailty crossovers—
In the unidimensional model, within each race, frail mortality is always higher than robust mortality. In the multidimensional model, within each race, the residually frail subpopulation may have either higher or lower mortality than the residually robust subpopulation at any age. Similarly, the exposed subpopulation may have either higher or lower mortality than the non-exposed subpopulation. Frailty crossovers are exactly analogous to black-white subpopulation crossovers. Figure 3 shows a frailty crossover, for a cohort in which residually frail mortality falls below residually robust mortality for both blacks (ages 60–73) and whites (ages 71–84).
Figure 3.
Frailty crossover: Conditional on race, residually frail members can have higher or lower mortality than residually robust members.
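A frailty crossover of this kind can be reproduced in a few lines. The following is a minimal sketch, not the simulation code behind Figure 3. It assumes a fully multiplicative hazard (baseline hazard times an exposure multiplier t times a residual frailty multiplier f*), exposure and residual frailty that are independent at baseline, and a single race. It measures "age" by the cumulative baseline hazard H. The function name and all parameter values are hypothetical.

```python
import math

def frail_and_robust_mortality(H, t, f_star, p_exposed):
    """Relative mortality of residually frail and residually robust survivors.

    H is the cumulative baseline hazard (a monotone stand-in for age); the
    returned values are multiples of the baseline hazard at that point.
    Assumes multiplicative hazards and independence at baseline.
    """
    def group_mortality(own_multiplier):
        # Survivors of this frailty group, split by exposure status.
        exposed = p_exposed * math.exp(-t * own_multiplier * H)
        unexposed = (1.0 - p_exposed) * math.exp(-own_multiplier * H)
        share_exposed = exposed / (exposed + unexposed)
        # Average hazard multiplier over the group's exposed and non-exposed.
        return own_multiplier * (1.0 + (t - 1.0) * share_exposed)

    return group_mortality(f_star), group_mortality(1.0)

# Hypothetical values: exposure matters far more than residual frailty.
T_MULT, F_STAR, P_EXPOSED = 20.0, 1.2, 0.5

for H in (0.0, 0.05, 0.1, 0.2):
    frail, robust = frail_and_robust_mortality(H, T_MULT, F_STAR, P_EXPOSED)
    print(f"H={H:.2f}  frail={frail:.3f}  robust={robust:.3f}  crossed={frail < robust}")
```

With these illustrative values, the residually frail start with higher mortality than the residually robust, and by H = 0.1 their mortality has fallen below it, because so few of the residually frail survivors are still exposed.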
III. and IV. Frailty increases and frailty reversals—
In the unidimensional model, survivors are progressively less likely to be frail as the population ages (the third fact about the unidimensional model). Furthermore, given equal baseline frailty across races, black survivors are always less likely than white survivors to be frail after baseline (the fourth fact).
By contrast, in a multidimensional model, mortality selection can increase, as well as decrease, population-level residual frailty or population-level exposure. I call this possibility a frailty increase. Furthermore, mortality selection can make black survivors more or less likely than white survivors to be residually frail, or more or less likely to be exposed. I call the possibility that black survivors become more disadvantaged than white survivors a frailty reversal. Frailty increases and frailty reversals violate the most important insights into mortality selection derived from the unidimensional model.17
The formal conditions for frailty reversals are given in Supplement 2, but the intuition is straightforward. Just as unidimensional mortality selection creates a negative association between race and frailty among survivors, multidimensional mortality selection creates a negative association between tobacco exposure and residual frailty within each race. This negative association can become so strong that selecting against one of those dimensions of heterogeneity becomes selecting for the other. The dimension being selected for can thus increase over age (a frailty increase), or—because this selection is stronger among blacks—can become more common among blacks than among whites (a frailty reversal). When this occurs, the dimension selected for is always the one with a weaker effect on mortality, because selection for it is driven by complex associations created by selection against the stronger dimension. Thus, blacks will always be more selected than whites along the stronger dimension of heterogeneity, but not necessarily along the other dimension.
To illustrate frailty increases and a frailty reversal, Figure 4 shows the proportions of black and white survivors that are residually frail in a simulated cohort. Frailty increases occur for blacks from ages 83–94, and for whites from ages 90–101. These frailty increases result from frailty crossovers such that, in the black and white populations at these respective ages, the residually frail have lower mortality than the residually robust. Mortality selection at these ages therefore makes each population more residually frail.
Figure 4.
Frailty increases: The proportion of residually frail survivors increases among blacks and among whites. Frailty reversal: Whites can have a larger or smaller proportion of residually frail survivors over age.
Figure 4 also shows a frailty reversal that occurs from ages 86 to 97. Frailty reversals result from the interaction between the two dimensions of heterogeneity. In this cohort, exposure raises mortality a great deal at the individual level, while residual frailty raises mortality much less. Consequently, both races, and especially blacks, are heavily selected against exposure. Furthermore, all subpopulations, and especially the exposed, are selected against residual frailty. But since comparatively fewer exposed blacks than whites survive, selection against residual frailty occurs predominantly among whites. The interaction of selection against exposure and selection against residual frailty results in blacks being less selected against residual frailty than whites for an 11-year span. (Supplement 4 illustrates and analyzes a frailty reversal in a cohort simulated with an alternative, gamma-distributed frailty model, and serves as a bridge between this paper’s results and the gamma-Gompertz frailty literature.)
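The mechanics of a frailty increase and a frailty reversal can be illustrated numerically. The sketch below is not the model of Supplement 3. It assumes multiplicative hazards, a black mortality multiplier b applied to every black individual, and identical, independent baseline distributions of exposure and residual frailty in both races. H is the cumulative baseline hazard, standing in for age, and every name and parameter value is hypothetical.

```python
import math

def share_residually_frail(H, race_mult, t, f_star, p_exposed, q_frail):
    """Proportion of survivors who are residually frail at cumulative
    baseline hazard H, for a race with hazard multiplier race_mult."""
    def survivors(own_multiplier):
        # Survival of a frailty group, averaged over exposed and non-exposed.
        return (p_exposed * math.exp(-race_mult * t * own_multiplier * H)
                + (1.0 - p_exposed) * math.exp(-race_mult * own_multiplier * H))

    frail = q_frail * survivors(f_star)
    robust = (1.0 - q_frail) * survivors(1.0)
    return frail / (frail + robust)

# Hypothetical values: strong exposure effect, weak residual frailty effect.
B_MULT, T_MULT, F_STAR, P_EXPOSED, Q_FRAIL = 1.5, 20.0, 1.2, 0.5, 0.5

for H in (0.0, 0.05, 0.1, 0.15):
    white = share_residually_frail(H, 1.0, T_MULT, F_STAR, P_EXPOSED, Q_FRAIL)
    black = share_residually_frail(H, B_MULT, T_MULT, F_STAR, P_EXPOSED, Q_FRAIL)
    print(f"H={H:.2f}  white={white:.4f}  black={black:.4f}  reversal={black > white}")
```

Under these assumed values, the white share residually frail first falls below its baseline of 0.5 and then rises between H = 0.1 and H = 0.15 (a frailty increase), while at H = 0.1 the black share exceeds the white share (a frailty reversal). Consistent with the intuition above, the dimension that increases is the one with the weaker individual-level effect.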
Frailty increases and frailty reversals underscore just how much the multidimensional selection model differs from the unidimensional one. When there is only a single dimension of fixed heterogeneity that raises mortality, we can be certain that it declines monotonically over age and that, if blacks and whites start out with the same proportion frail, they end up with fewer frail at each subsequent age. Neither of these core generalizations necessarily extends to each fixed dimension of heterogeneity that raises mortality when there is more than one. In the next section, I show that the interaction between dimensions of heterogeneity that drives these possibilities stems from a distinctive third role of frailty unique to the multidimensional model. As explained in Supplement 2, this third role of frailty not only forestalls the four key facts about unidimensional heterogeneity, but also breaks the dependencies between those facts.
Three Roles of Frailty in Multidimensional Mortality Selection
The four key facts about unidimensional heterogeneity do not extend to the multidimensional model because each dimension of heterogeneity in the latter plays three, rather than two, roles in determining population-level mortalities. Figure 5 and Table 1 represent heterogeneity’s three roles, focusing on unobserved residual frailty for convenience. (Analogous arguments apply to observed exposure’s three roles.) Figure 5 and Table 1 are based on the decomposition of population-level mortality given in Equation 9.
Figure 5.
Three roles of residual frailty on aggregate mortality in the two-dimensional mortality selection model.
In the unidimensional model, the frail mortality multiplier, f, plays two roles in population-level mortality for each race: it simultaneously increases population-level mortality by increasing the mortality of the frail subpopulation and reduces population-level mortality by reducing the proportion of frail survivors. In the multidimensional model, unobserved residual frailty, j (and, by analogy, observed exposure, i), plays the same two roles. An increase in the residual frailty multiplier f* increases population-level mortality by increasing the mortality of the residually frail within each subpopulation defined by observed exposure, and it decreases population-level mortality by decreasing the proportion frail within each subpopulation defined by observed exposure. As in the unidimensional model, these two roles suffice to produce a black-white crossover in aggregate mortalities.
The third role of heterogeneity, by contrast, is new and considerably more complex: residual frailty in the multidimensional model affects population-level mortality by changing the proportion of the population that is exposed, Tk(a). This means that the two dimensions of heterogeneity—unobserved residual frailty and observed exposure—interact. Even if the two dimensions of heterogeneity start out distributed independently of one another, they will become associated as the cohort ages: survivors who are disadvantaged along one dimension are unlikely to also be disadvantaged along the other, since such multiply disadvantaged individuals are least likely to survive.18
The effect of residual frailty on population-level mortality via observed exposure composition is essentially unpredictable, for two reasons.
First, increasing the disadvantage associated with residual frailty, f*, can either increase or decrease the proportion of survivors who are exposed, Tk(a). Insofar as the disadvantage associated with residual frailty increases the mortality of the exposed subpopulation, it will decrease the proportion of survivors who are exposed. Insofar as the disadvantage associated with residual frailty increases the mortality of the non-exposed subpopulation, it will increase the proportion of survivors who are exposed.19 When the total effect of the two paths from f* into Tk(a) is positive in one population at some age, the result can be a “frailty crossover” between the exposed and non-exposed and a “frailty” (i.e., observed exposure) increase in that population at that age. When the total effect of the two paths into Tk(a) is larger among blacks than among whites for some span of ages, the result can be a “frailty” (i.e., observed exposure) reversal.20
Second, increasing the proportion of survivors who are exposed, Tk(a), can either increase or decrease population-level mortality. Increasing Tk(a) will increase population-level mortality when the exposed have higher mortality than the non-exposed. Increasing Tk(a) will decrease population-level mortality when the exposed have lower mortality than the non-exposed, that is, after a “frailty” (i.e., exposure) crossover. Thus, absent precise quantitative knowledge of the model parameters, residual frailty’s third role has an unpredictable effect on aggregate mortality.21
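The first of these two reasons, that raising f* can move the proportion exposed in either direction, can be checked with a small calculation. This is a sketch under assumed conditions rather than the paper's own code: hazards are taken to be multiplicative, exposure and residual frailty are independent at baseline, only one population is modeled, and H denotes the cumulative baseline hazard as a proxy for age. The function name and the parameter values are hypothetical.

```python
import math

def share_exposed(H, t, f_star, p_exposed, q_frail):
    """Proportion of survivors who are exposed, T(a), at cumulative
    baseline hazard H in a single population."""
    def survivors(own_multiplier):
        # Survival of an exposure group, averaged over frail and robust.
        return (q_frail * math.exp(-own_multiplier * f_star * H)
                + (1.0 - q_frail) * math.exp(-own_multiplier * H))

    exposed = p_exposed * survivors(t)
    unexposed = (1.0 - p_exposed) * survivors(1.0)
    return exposed / (exposed + unexposed)

# Hypothetical values: compare a weaker and a stronger residual frailty effect.
T_MULT, P_EXPOSED, Q_FRAIL = 3.0, 0.5, 0.5

for H in (0.1, 1.0):
    low = share_exposed(H, T_MULT, 3.0, P_EXPOSED, Q_FRAIL)
    high = share_exposed(H, T_MULT, 4.0, P_EXPOSED, Q_FRAIL)
    direction = "raises" if high > low else "lowers"
    print(f"H={H:.1f}: increasing f* from 3 to 4 {direction} the share exposed "
          f"({low:.4f} -> {high:.4f})")
```

In this illustration the same increase in f* lowers the share exposed early on, when the path through exposed mortality dominates, and raises it later, once few exposed survivors remain residually frail and the path through non-exposed mortality dominates.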
In sum, in the multidimensional model, the various dimensions of heterogeneity within each population interact with each other, making it extremely difficult to relate any one observed dimension of heterogeneity to clean predictions about population-level mortality. If it is difficult to relate any given observed heterogeneity to aggregate mortality in any one population, then it is doubly difficult to relate an observed dimension of heterogeneity to mortality differentials between populations. Next I show what this implies for empirical research: stratifying on an observed dimension of heterogeneity (while another dimension remains unobserved) can either increase or decrease the black-white disparity in mortality, and the resulting subpopulations can reach a crossover either before or after the aggregate population.
Decomposition of the Aggregate Crossover with Multidimensional Heterogeneity: Conditioning on observed heterogeneity can move the age at crossover in either direction
Equation 11 decomposes the black-white crossover in aggregate mortality, along the observed exposure dimension:
| (11) |
Equation 11 is exactly analogous to the decomposition of the black-white mortality disparity with unidimensional heterogeneity given in Equation 5, except that, in the unidimensional case, the signs of all three terms were known a priori because they were determined by the key facts about unidimensional heterogeneity. By contrast, here those facts need not apply, and each of the three terms in Equation 11 can be either positive or negative at any given age.
The first two terms of Equation 11 are, respectively, the black-white difference in the mortality of the exposed, weighted by the proportion of blacks who are exposed, Tb(a), and the black-white difference in the mortality of the non-exposed, weighted by the proportion of blacks who are non-exposed, 1 − Tb(a). These two terms can be either positive or negative because a black-white subpopulation crossover may or may not occur in each subpopulation defined by observed exposure.
The third term of Equation 11 is the frailty factor, representing the contribution of the racial compositional difference in observed exposure to the racial difference in aggregate mortality. It is the extent to which observed exposure is associated with higher mortality among whites, weighted by the black-white difference in observed exposure. This term can take either sign because each of its two factors can take either sign. The mortality difference will be positive, as in the unidimensional case, as long as whites have not had a frailty crossover along the observed exposure dimension, and negative if they have had a frailty crossover. And the compositional difference will be negative, as in the unidimensional case, as long as there has not been a frailty reversal along the observed exposure dimension, and positive if there has been a frailty reversal.
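For reference, the three terms just described can be written out as follows. This is a reconstruction from the verbal description, not a copy of the published equation. The symbols for mortality are assumptions: \mu_k(a) is aggregate mortality for race k (b for black, w for white), and \mu_k^{E}(a) and \mu_k^{N}(a) are mortality among the exposed and the non-exposed. T_k(a) is the proportion of survivors who are exposed, as in the text.

```latex
\begin{aligned}
\mu_b(a) - \mu_w(a)
  &= T_b(a)\,\bigl[\mu_b^{E}(a) - \mu_w^{E}(a)\bigr]
     && \text{(disparity among the exposed)} \\
  &\quad + \bigl[1 - T_b(a)\bigr]\,\bigl[\mu_b^{N}(a) - \mu_w^{N}(a)\bigr]
     && \text{(disparity among the non-exposed)} \\
  &\quad + \bigl[T_b(a) - T_w(a)\bigr]\,\bigl[\mu_w^{E}(a) - \mu_w^{N}(a)\bigr]
     && \text{(frailty factor)}
\end{aligned}
```

The identity follows from writing each race's aggregate mortality as \mu_k(a) = T_k(a)\,\mu_k^{E}(a) + [1 - T_k(a)]\,\mu_k^{N}(a) and then adding and subtracting T_b(a)\,\mu_w^{E}(a) + [1 - T_b(a)]\,\mu_w^{N}(a).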
When the frailty factor is negative, the black-white mortality disparity is less positive, or more negative, in the aggregate than in the subpopulations. Thus the aggregate can have a crossover when the subpopulations do not, or can have a more extreme crossover than they do. Conversely, when the frailty factor is positive, the black-white mortality disparity is more positive, or less negative, in the aggregate than in the subpopulations. Thus a crossover in aggregate mortality can be absent even when one or both of the subpopulations has a crossover.
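The sign logic in this paragraph can be seen in a numerical version of the decomposition. The sketch below assumes multiplicative hazards, a black multiplier b, and identical, independent baseline distributions of exposure and residual frailty in both races. It is an illustration under those assumptions, not the paper's simulation. All function names and parameter values are hypothetical, and mortality is expressed in units of the baseline hazard at cumulative baseline hazard H.

```python
import math

def race_summary(H, race_mult, t, f_star, p_exposed, q_frail):
    """Return (exposed mortality, non-exposed mortality, share exposed,
    aggregate mortality) among survivors of one race."""
    def stratum(exposure_mult, baseline_share):
        frail = q_frail * math.exp(-race_mult * exposure_mult * f_star * H)
        robust = (1.0 - q_frail) * math.exp(-race_mult * exposure_mult * H)
        mortality = race_mult * exposure_mult * (f_star * frail + robust) / (frail + robust)
        return mortality, baseline_share * (frail + robust)

    mu_exp, alive_exp = stratum(t, p_exposed)
    mu_non, alive_non = stratum(1.0, 1.0 - p_exposed)
    share = alive_exp / (alive_exp + alive_non)
    return mu_exp, mu_non, share, share * mu_exp + (1.0 - share) * mu_non

def decompose(H, b, t, f_star, p_exposed, q_frail):
    """Split the black-white gap in aggregate mortality into three terms."""
    b_exp, b_non, T_b, mu_b = race_summary(H, b, t, f_star, p_exposed, q_frail)
    w_exp, w_non, T_w, mu_w = race_summary(H, 1.0, t, f_star, p_exposed, q_frail)
    exposed_term = T_b * (b_exp - w_exp)
    non_exposed_term = (1.0 - T_b) * (b_non - w_non)
    frailty_factor = (T_b - T_w) * (w_exp - w_non)
    return mu_b - mu_w, exposed_term, non_exposed_term, frailty_factor

# Hypothetical parameter values.
gap, exposed_term, non_exposed_term, frailty_factor = decompose(
    H=0.1, b=1.5, t=20.0, f_star=1.3, p_exposed=0.5, q_frail=0.5)
print(f"aggregate gap    = {gap:+.4f}")
print(f"exposed term     = {exposed_term:+.4f}")
print(f"non-exposed term = {non_exposed_term:+.4f}")
print(f"frailty factor   = {frailty_factor:+.4f}")
```

With these assumed values, both subpopulation terms are positive (no subpopulation crossover), yet the frailty factor is negative and large enough that the aggregate gap is negative: the aggregate has crossed while neither subpopulation has.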
Consequently, aggregate mortality can reach a crossover before both subpopulations, after both subpopulations, or in between the subpopulations. Stratifying black and white mortality on any single dimension of heterogeneity therefore moves the crossover in an essentially unpredictable direction. These results are summarized in the third panel of Table 1. Figure 6 shows illustrative examples of all three scenarios, with solid vertical lines marking the onset of the aggregate crossover and dashed vertical lines marking the onset of the subpopulation crossovers.
Figure 6, Panel A.
A simulated cohort in which black and white mortalities cross in the aggregate population before they cross in the exposed and non-exposed subpopulations.
Importantly, the crossover order—whether the aggregate populations reach a crossover at a younger or older age than the subpopulations—can change in response to very minor shifts in the model parameters. Cohorts that share most of their parameters can nevertheless vary in their crossover order, as shown in Supplement 3, which analyzes over 1.5 million simulated cohorts. Consequently, there is no obvious a priori prediction, absent strong assumptions about the latent model parameters, about whether stratifying on a single dimension of heterogeneity will increase or decrease the black-white mortality disparity at any given age, or increase or decrease the age at crossover.
Implications for Empirical Research
These results cast doubt on the two potential tests of the multidimensional heterogeneity model based on stratifying black and white populations on an observed dimension of heterogeneity and comparing changes in the age at crossover to putative predictions based on the model.
One potential test was based on the Dupre/Sautter prediction that stratifying on a single dimension of multidimensional heterogeneity should move the crossover in some (unspecified) direction. This criterion is neither a necessary nor a sufficient condition for identifying dimensions of heterogeneity that contribute to the aggregate crossover. On one hand, the age at crossover will almost always shift in some direction when any trait associated with both race and mortality is controlled for. This is true regardless of whether that trait behaves like the frailty of a mortality selection model, that is, regardless of whether it approximates the model assumptions of being fixed in individuals and raising mortality at all ages. On the other hand, it is possible for the crossover to occur at the same age in the aggregate population and in a subpopulation, even if the trait does constitute a dimension of frailty. Such a confluence of crossovers requires only, in the language of Equation 11, that the frailty factor have a very similar magnitude to that of the other subpopulation’s contribution around the aggregate crossover age.22 Figure 7 shows an example of such a cohort. In this simulated cohort, the non-exposed subpopulation reaches a crossover 18 days earlier than the aggregate population, which is effectively simultaneous from the perspective of any real study of old-age mortality.23
Figure 7.
A simulated cohort in which aggregate mortality crosses essentially at the same time as the non-exposed (baseline) subpopulation. (These simultaneous crossovers begin just after the exposed subpopulation crossover ends.)
The second potential test of the model was based on the prediction that stratifying on the observed dimension of heterogeneity might necessarily increase the black-white mortality disparity and delay the crossover. This would be true if individual dimensions of heterogeneity behaved like unidimensional heterogeneity. The results here cast doubt on this criterion as well. The preceding section shows that, in the multidimensional context, aggregate and subpopulation crossovers can in fact occur in any order. Thus, empirically identifying particular dimensions of crossover-producing heterogeneity via such directional predictions similarly would not work.
The goal of identifying particular dimensions of heterogeneity that comprise a multidimensional analogue to “frailty” is an essential one for mortality research. But the results in this paper highlight the dangers of pursuing it without the benefit of an explicit model of multidimensional mortality selection. Moreover, they suggest that the goal may be surprisingly difficult to achieve. At least, it may require something other than the standard strategy of analyzing the age at which mortality selection artifacts begin—whether the crossover (Berkman et al. 1989, Dupre et al. 2006, Lynch et al. 2003, Sautter et al. 2012, Yao and Robert 2011) or mortality deceleration (e.g., Horiuchi and Wilmoth 1997, 1998; Lynch and Brown 2001; Lynch et al. 2003).
A question for further research is what other tests of the multidimensional mortality selection model might be possible. Supplement 3 shows that, while the crossover order varies with even small parameter changes over a large swath of simulated parameter space, some predictions nevertheless are possible, contingent on particular combinations of parameter values. The very presence of subpopulation crossovers implies that residual frailty is consequential and relatively common at baseline: whatever the measured exposure contributes to the aggregate crossover, the unmeasured heterogeneity is sufficient to generate a crossover.
In general, when the proportions of disadvantaged members (e.g., the residually frail and the exposed) are small at baseline, the crossover order is more constrained (largely because the aggregate dynamics will be dominated by the large group of more advantaged survivors); when the disadvantaged categories are larger at baseline, frequently, any order is possible even when the other parameters are fixed. Holding other parameters fixed, when baseline residual frailty is very high, it is relatively rare for the aggregate crossover to happen after both subpopulation crossovers when exposure is also very high at baseline, and it is relatively rare for the aggregate crossover to happen before both subpopulation crossovers when baseline exposure is low. (An aggregate crossover occurring between the two subpopulation crossovers is ubiquitous across the parameter space.)
The results here also imply that additional empirical tests of the multidimensional model may be possible in the special circumstance that the measured dimension of heterogeneity, t, can be assumed to represent a large portion of the total heterogeneity (although see the discussion in Supplement 4 that complicates this criterion for models in which the dimensions are not equally consequential for each race). Such a scenario is presumably atypical in the case of covariates like religious participation (which likely account for only a relatively small part of the stable heterogeneity in mortality risk within racial populations), but might be reasonable in the case of a covariate like a Charlson Comorbidity Index (Charlson et al. 1994), which summarizes a variety of chronic medical conditions that collectively strongly predict mortality. These results suggest that a good strategy for empirical researchers might be to focus on covariates structured to capture much of the variation in mortality risk, such as by amalgamating many other covariates into a total measure of observed risk, rather than focusing on single covariates whose effects on mortality are not overwhelmingly large.24 A covariate capturing much of the total heterogeneity licenses more predictions because it acts more like unidimensional heterogeneity. First, if t > f*, in this model, then “frailty” reversals and frailty crossovers along the measured (t) dimension are impossible.25 Measuring the proportion exposed over age in each race would therefore potentially allow this model to be falsified, given the assumption that t > f*.26 Unfortunately, given the more typical scenario that t < f* (i.e., unmeasured heterogeneity is more consequential for individual-level mortality than measured heterogeneity), the prediction that follows is about the unmeasured residual frailty dimension, and therefore not directly empirically testable.
A second test may be possible even if t < f*, if t is still “large.” Any crossover requires that the frailest white mortality exceed the most robust black mortality. If t and f* are similar in magnitude, or more generally if t is large, then it is possible for the frailest white mortality in the aggregate population to exceed the most robust black mortality while f* < b (even if f* > t). In this situation, the observed subpopulations defined by exposure would never reach a crossover even though the aggregate population might, which is an empirically testable conclusion.27
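This second test can be illustrated by scanning a simulated cohort for crossovers in each group. The sketch assumes multiplicative hazards, a black multiplier b, and identical, independent baseline distributions of exposure and residual frailty in both races, so it is a stylized stand-in for the paper's model rather than a reproduction of it. Function names, the grid over the cumulative baseline hazard H, and all parameter values are hypothetical.

```python
import math

def relative_mortality(H, race_mult, t, f_star, p_exposed, q_frail):
    """Mortality of the exposed, the non-exposed, and the aggregate,
    in units of the baseline hazard, at cumulative baseline hazard H."""
    cells = [  # (baseline share, hazard multiplier, is_exposed)
        (p_exposed * q_frail, race_mult * t * f_star, True),
        (p_exposed * (1 - q_frail), race_mult * t, True),
        ((1 - p_exposed) * q_frail, race_mult * f_star, False),
        ((1 - p_exposed) * (1 - q_frail), race_mult, False),
    ]

    def mean_hazard(keep):
        alive = [(w * math.exp(-m * H), m) for w, m, exposed in cells if keep(exposed)]
        return sum(s * m for s, m in alive) / sum(s for s, _ in alive)

    return {
        "exposed": mean_hazard(lambda exposed: exposed),
        "non_exposed": mean_hazard(lambda exposed: not exposed),
        "aggregate": mean_hazard(lambda exposed: True),
    }

def groups_with_crossover(b, t, f_star, p_exposed=0.5, q_frail=0.5,
                          h_max=3.0, steps=300):
    """Names of groups in which black mortality ever falls below white
    mortality on an evenly spaced grid of H values."""
    crossed = set()
    for i in range(steps + 1):
        H = h_max * i / steps
        white = relative_mortality(H, 1.0, t, f_star, p_exposed, q_frail)
        black = relative_mortality(H, b, t, f_star, p_exposed, q_frail)
        crossed.update(g for g in white if black[g] < white[g])
    return crossed

# Hypothetical values with f* < b but t large.
print(groups_with_crossover(b=1.5, t=20.0, f_star=1.3))
```

With f* < b, the black-white mortality ratio within each exposure stratum can never fall below b/f*, so only the aggregate crosses in this example. The large exposure multiplier is what allows the aggregate crossover.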
These empirical predictions—and the absence of similar predictions for measured covariates whose effect on mortality is small compared to the effect of the heterogeneity that remains unmeasured—suggest the value of explicit theorizing about mortality disparities in the presence of multiple dimensions of heterogeneity. They also suggest that, wherever possible, we attend to more localized parameter spaces, some of which will be more revealing than others. To determine whether particular observed dimensions of heterogeneity contribute to a crossover, we should focus on dimensions that are highly consequential for mortality at the individual level. More specific predictions are possible as more assumptions are made to limit the simulation space to cohorts that better resemble actual U.S. cohorts (shown in Supplement 3). To the extent that this is a meaningful exercise in models that remain highly stylized, it suggests more room for developing fruitful predictions in the future.
Conclusion
In this paper, I analyzed the black-white mortality crossover in the presence of multiple dimensions of heterogeneity within each race. The crossover represents a concrete example through which to understand the frequent circumstance that one dimension of heterogeneity is observed and another is theorized to exist, but is unobserved. This situation is not captured by standard unidimensional heterogeneity models of mortality selection, but it is common in empirical research on the crossover (e.g., Berkman et al. 1989, Dupre et al. 2006, Sautter et al. 2012, Yao and Robert 2011) and in empirical studies of mortality broadly. It is also likely to become more frequent as more datasets with rich covariates and sufficient coverage of old ages become available. As I have shown, the most basic facts about the unidimensional theoretical model do not necessarily extend to this situation. Neither the observed nor the unobserved dimensions of heterogeneity necessarily behave as the classic, unidimensional models would predict: individual dimensions of frailty do not behave the same way as frailty in total.
The standard, unidimensional mortality selection model of the crossover is well summarized by four key facts that together allow the crossover to occur and make its dynamics qualitatively simple. In the multidimensional model, none of the four key facts need hold: blacks may have either higher or lower mortality than whites; the frail may have either higher or lower mortality than the robust; frailty can increase or decrease over age; and black survivors may be either more likely or less likely than white survivors to be frail. Generalizations that apply to heterogeneity as a whole—including those that form the foundation of mortality selection’s account of the crossover—need not apply to each dimension of heterogeneity individually.
These possibilities arise because multidimensional mortality selection creates complex and, absent strong assumptions about parameter values, essentially unpredictable associations between the dimensions of heterogeneity. The four facts about unidimensional heterogeneity operate at the level of homogeneous subpopulations defined by race and frailty. But when heterogeneity is multidimensional, subpopulations defined by race and a single dimension of heterogeneity remain otherwise heterogeneous. These heterogeneous subpopulations change their composition over age, producing qualitatively complex behavior at the population level. Multidimensional mortality selection is complex because in it, the basic dynamic of unidimensional mortality selection occurs at many interacting levels simultaneously. The crossover with unidimensional heterogeneity is a straightforward example of Simpson’s paradox, occurring at a single level. But the crossover with multidimensional heterogeneity gives rise to Simpson’s paradoxes at several levels, so that the phenomena occurring at the surface level (population-level mortality) become less intuitive. Multidimensional populations are mixtures of unidimensional subpopulations, and the mixture need not behave like its ingredients.
These results have theoretical and practical consequences. Theoretically, they suggest that the intuitions derived from the longstanding tradition of unidimensional mortality selection theory do not apply to multidimensional mortality selection. These intuitions work well when frailty can be thought of as a cohesive whole, an amalgam of entirely unobserved traits. But when we want to get specific about “frailty” by identifying individual components of it, those intuitions can become deeply misleading. Even if dimensions of heterogeneity are independently distributed at birth, they become associated—with each other and with race—as a cohort ages due to their joint contribution to mortality. Any dimension of heterogeneity therefore carries information about all of the others. Conditioning on any single dimension is not merely conditioning on a noisy measure of overall heterogeneity—it is conditioning selectively on whatever dimension was observed—and, therefore, also on whatever dimension was not observed. The consequences of such selective stratification can only be described with a model that explicitly incorporates the joint distribution of each dimension of heterogeneity as it changes over age—whether the context is a mortality crossover or any other mortality trajectory.
Practically, the interaction between compositional changes in the observed and unobserved dimensions of heterogeneity produces unpredictable crossovers in the resulting subpopulations and aggregate populations. As a consequence, the crossover order is not an empirical confirmation or refutation of the general form of this model; the seemingly most natural way to test a multidimensional model may not work.
The implications of the results shown here extend well beyond the black-white mortality crossover. This paper joins several others in analyzing the consequences of selection along multiple dimensions simultaneously (Bretagnolle and Huber-Carol 1988, Manton et al. 1995, Henderson and Oman 1999, Finkelstein 2012). Collectively, this research motivates careful consideration of whether theoretical results about frailty as a whole extend to partially-observed heterogeneity. The results that are new in this paper concern the behavior of mortality disparities in a common research situation: when mortality is stratified by partial measures of heterogeneity.
The gap between theoretical work on mortality selection and recent empirical work on the crossover echoes a wider divergence between two traditions of demographic research in which the study of the crossover has traditionally been situated. Classical demography was an intellectually distinctive field that produced a series of models that excel at shifting perspectives between population aggregates and the individual-level status transitions that produce them. These models are able to reveal a great deal about population processes even without rich data; indeed, the striking creativity of formal demography in this era was presumably spurred by the need to wring as much information as possible from the limited data of the time. Classic mortality selection models, which interpret population-level patterns via theorized, unobserved frailty, are squarely in this tradition.
In contrast, much recent empirical work in demography can be characterized as part of a broader tradition of population studies drawing inspiration from myriad social sciences and engaging more substantively with social stratification. The sources of lifespan inequality are surely multiple and intersecting, and the advent of richer datasets allows some of these multiple heterogeneities to be measured. But recent work on the crossover, which sits uneasily between these traditions of formal and empirical demography, has tried to break open the black box of “frailty” without the benefit of any formal model of multidimensional selection.
Unidimensional frailty models are elegant and powerful tools for answering unidimensional questions, but multidimensional research questions need multidimensional theory. Explicitly multidimensional models of mortality crossovers can provide a first attempt to unite these two demographic traditions so that the substantive questions of recent empirical literature can be addressed with formal precision. Even as datasets grow ever richer, frailty remains an essential concept for mortality studies, as long as the heterogeneity that we do not measure remains as consequential as that which we do. The essential insight of all selection models—that observed associations, taken at face value, can mislead us about issues as fundamental as whether blacks or whites are truly disadvantaged in old age—remains powerful and necessary.
But knowing that selection against something must wholly or partially account for the crossover is only somewhat satisfying. Ultimately, we want to build and test theories about what constitutes frailty. The results here are a methodological step toward that goal, though they suggest that some plausible avenues of testing selection models are fraught with difficulty. As models are made more substantively realistic by incorporating multiple dimensions of heterogeneity, tests of them using the age at crossover will need to be based in far more specific, substantively grounded—and fallible—assumptions about unmeasured inequalities. Or, perhaps, it will turn out that we need other strategies altogether.
Supplementary Material
Figure 6, Panel B.
A simulated cohort in which black and white mortalities cross in the aggregate population after they cross in the exposed and non-exposed subpopulations.
Figure 6, Panel C.
A simulated cohort in which black and white mortalities cross in the aggregate population after they cross in the exposed subpopulation and before they cross in the non-exposed subpopulation.
Footnotes
The multidimensional mortality selection considered here differs from the multivariate mortality selection analyzed elsewhere, as in “shared frailty models” (e.g., Henderson and Oman 1999, Guo and Rodriguez 1992, Vaupel 1988, Wienke 2010: 131–160). The former deals with multiple independent variables and the latter, multiple (correlated) survival-time outcomes.
In the classical mortality selection literature about crossovers, disadvantage has been operationalized in different ways, including: greater mortality for blacks than whites at all levels of frailty, with black and white frailty equal at baseline (Vaupel et al. 1979), greater mortality for blacks than whites specifically among the frail (Vaupel et al. 1985), and greater mortality for blacks than whites among the frail and a larger initial proportion of frailty among blacks at baseline (Lynch et al. 2003). This paper’s model is consistent with the Vaupel et al. (1979) approach, which offers the neatest fit with the general preference for proportional hazard models (by assuming black disadvantage for all cohort members, not just the frail) and allows the paper to highlight how multidimensional selection produces flexible crossover results even without differences in black and white initial frailty distributions. An alternative model specification is considered in Supplement 4.
I assume the same baseline distributions of frailty in both populations in order to focus analysis cleanly on the dynamics of mortality selection, rather than other potential sources of racial difference in mortality. The main substantive points do not depend on this assumption. Nonetheless, if black and white frailty composition differed at birth, some aspects of the presentation of results would differ, as I remark below in footnote 8.
These two derivations from unidimensional crossover models (e.g., Vaupel et al. 1979) follow from the widely known fact that, in a proportional hazards context with one unmeasured (e.g., frailty) and one measured (e.g., race) covariate, the unmeasured covariate leads to an underestimation of the effect of the measured covariate (see, e.g., Aalen 1988, Henderson and Oman 1999, Hougaard et al. 1994). The second assumption is the defining assumption of fixed-frailty models (Finkelstein 2012), which have wide application beyond the crossover, while the first assumption is particular to crossover models. The great achievement of selection models of the crossover is to make this first assumption compatible with the existence of a crossover (Vaupel et al. 1979, Vaupel and Yashin 1985).
The arrows in Figure 1 represent multiplicative effects; thus, the overall sign of a path is the product of the signs on each arrow.
Increasing the frailty multiplier can increase as well as decrease mortality in each racial population. The total effect of f on aggregate mortality in a population depends on which path dominates. Thus, there can be spans of ages at which population-level mortality would be lower with a larger frailty multiplier because fewer frail survivors would remain.
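The two competing paths can be illustrated with a minimal Python sketch. It is not the paper's simulation: it assumes a single population with two fixed frailty groups, a constant baseline hazard `m`, and an initial frail proportion `p0`, all hypothetical values chosen only for illustration.

```python
import math

def aggregate_hazard(age, f, m=0.05, p0=0.5):
    """Population-level hazard at `age` with two fixed frailty groups.

    Assumed for illustration: robust members die at a constant hazard m,
    frail members at f * m, and a proportion p0 of the cohort is frail at birth.
    """
    surv_robust = math.exp(-m * age)
    surv_frail = math.exp(-f * m * age)
    # Proportion of survivors who are frail after selection up to `age`.
    prop_frail = p0 * surv_frail / (p0 * surv_frail + (1 - p0) * surv_robust)
    # Direct path: frail survivors die faster (f - 1 > 0).
    # Indirect path: a larger f leaves fewer frail survivors (prop_frail falls).
    return m * (1 + (f - 1) * prop_frail)

# Compare a smaller and a larger frailty multiplier at a young and an old age.
for age in (1, 40):
    print(age, aggregate_hazard(age, f=2), aggregate_hazard(age, f=4))
```

Under these assumed values, the larger multiplier raises aggregate mortality at age 1, where the direct path dominates, and lowers it at age 40, where the depletion of frail survivors dominates.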
A black-white crossover can occur regardless of the signs of the total effect of f on aggregate mortality in the black and white populations. When the total effect of frailty on mortality is less positive, or more negative, for the white population than for the black population at a given age, a crossover can occur. (Whether a crossover occurs additionally depends on whether the effect of frailty outweighs the black mortality disadvantage at the subpopulation level, as suggested by Figure 1a.)
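A hedged Python sketch of a unidimensional crossover follows. It assumes two frailty groups with the same frail proportion at birth in both races and a black mortality multiplier applied at every frailty level. The parameter values (`f = 20`, a black multiplier of 1.2) are hypothetical, and age enters only through `H`, the cumulative baseline hazard of robust whites, which rises with age under any baseline.

```python
import math

def race_hazard(H, race_mult, f=20.0, p0=0.5):
    """Aggregate hazard, per unit of baseline hazard, for one racial population.

    H         : cumulative baseline hazard of robust whites (increases with age)
    race_mult : multiplier on all hazards in this population (1.0 for whites)
    f, p0     : frailty multiplier and frail proportion at birth (assumed values)
    """
    # Frail share of survivors: p0*exp(-r*f*H) / (p0*exp(-r*f*H) + (1-p0)*exp(-r*H)),
    # rewritten by dividing numerator and denominator by exp(-r*f*H).
    prop_frail = p0 / (p0 + (1 - p0) * math.exp(race_mult * (f - 1) * H))
    return race_mult * (1 + (f - 1) * prop_frail)

BLACK_MULT = 1.2  # hypothetical black mortality multiplier
for H in (0.01, 0.15, 3.0):
    black, white = race_hazard(H, BLACK_MULT), race_hazard(H, 1.0)
    print(H, black, white, "black lower" if black < white else "black higher")
```

With these assumed values, black aggregate mortality starts above white mortality and falls below it once faster selection has removed enough frail black survivors. In this two-group sketch it rises above white mortality again once nearly all survivors in both populations are robust. With a much smaller frailty multiplier, the selection effect in this sketch never outweighs the black multiplier and no crossover occurs.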
The former fact depends on the assumption that blacks and whites are equally likely to be frail at birth. If blacks were more likely than whites to be frail at birth, then this term would become negative only if the greater selection against frailty among blacks eventually outweighed their initial excess frailty.
The formula for in the multidimensional model is analogous to the formula for in the unidimensional model given in Equation 4, replacing the subpopulation-level survivorships Sk,j(a) in Equation 4 with the corresponding group-level survivorships Sk,i,j(a) for the ith (exposed or non-exposed) subpopulation.
In the multidimensional model, I use uppercase Greek letters for composition defined at the population level (the [observed] proportion of each racial population that is exposed, aggregated over residual frailty, Tk(a), and the [unobserved] proportion of each racial population that is residually frail, Πk(a), aggregated over tobacco exposure) and lowercase Greek letters for composition defined at the subpopulation level (the [unobserved] proportion of each exposure subpopulation that is residually frail, πk,i(a), and the [unobserved] proportion of each residual frailty subpopulation that is tobacco-exposed, τk,j(a)).
One could instead decompose population-level mortality into the aggregate proportion of each racial population that is residually frail (Πk(a), unobserved) and the proportion of each of those subpopulations that is exposed (τk,j(a), unobserved). Regardless of how population-level mortality is decomposed, it reflects the distribution of each race along both dimensions of heterogeneity simultaneously.
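The two decompositions can be written out as a sketch in the notation of the preceding footnote. The symbols $\bar{\mu}_k(a)$ for population-level mortality and $\mu_{k,i,j}(a)$ for the mortality of the homogeneous group with race $k$, exposure status $i \in \{E, N\}$ (exposed, non-exposed), and residual frailty status $j \in \{F, R\}$ (frail, robust) are labels assumed here for illustration, not necessarily the paper's own.

```latex
\begin{align}
\bar{\mu}_k(a) &= T_k(a)\,\mu_{k,E}(a) + [1 - T_k(a)]\,\mu_{k,N}(a), \\
\mu_{k,i}(a)   &= \pi_{k,i}(a)\,\mu_{k,i,F}(a) + [1 - \pi_{k,i}(a)]\,\mu_{k,i,R}(a),
                  \qquad i \in \{E, N\}, \\
\bar{\mu}_k(a) &= \Pi_k(a)\,\mu_{k,F}(a) + [1 - \Pi_k(a)]\,\mu_{k,R}(a), \\
\mu_{k,j}(a)   &= \tau_{k,j}(a)\,\mu_{k,E,j}(a) + [1 - \tau_{k,j}(a)]\,\mu_{k,N,j}(a),
                  \qquad j \in \{F, R\}.
\end{align}
```

Both routes weight the same four homogeneous groups, so they agree whenever the compositions are mutually consistent: for example, $T_k(a)\,\pi_{k,E}(a) = \Pi_k(a)\,\tau_{k,F}(a)$, since each side is the proportion of race-$k$ survivors who are both exposed and residually frail.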
This formulation is not necessarily a socially coherent counterfactual: in reality, if the social treatment of people designated as black were substantially altered, presumably so would the social treatment of people designated as white. (For example, white mortality might rise if whites were no longer protected by racism from meritocratic competition from blacks; or white mortality might fall if social welfare programs were not highly racialized and denigrated.) In causal terms, the assumption beneath this simplification is the Stable Unit Treatment Value Assumption (SUTVA) (Morgan and Winship 2007: 37–40).
I have argued elsewhere (Wrigley-Field and Elwert 2017) that the crossover literature should more seriously attend to how selection dynamics interact with racial disadvantages that may shrink and grow over the life course. Incorporating dynamic frailty alongside multiple dimensions of frailty is mathematically complex.
The alternative model in which black/white inequality is implicit in the heterogeneity distribution, rather than explicit in a mortality multiplier, also allows for a conceptually neat distinction between individual inequality and population disparity, but in that setting, this distinction can only be defined with a more complex counterfactual (Wrigley-Field and Elwert 2017), rather than with a single, simple parameter.
Indeed, the empirical studies this paper is in closest dialogue with, Dupre et al. (2006) and Sautter et al. (2012), use binary observed heterogeneity without committing to any particular specification of unobserved heterogeneity.
The dimensions of heterogeneity explored in the empirical literature are traits that—unlike “frailty”—are acquired and lost by individuals over time. This extension of the classic mortality selection models to time-varying dimensions of heterogeneity can introduce significant complications (see Manton et al. 1994, 1995; Rogers 1992; Woodbury and Manton 1983; Vaupel et al. 1988; and Wrigley-Field 2013) that are not considered either in those papers or in this one. Here, I focus solely on how fixed dimensions of heterogeneity interact in the selection process.
This criterion is explicit in Dupre et al. (2006:146): “To investigate whether religious involvement operates as a source of heterogeneity, two conditions must be satisfied and are hypothesized separately. First, in accordance with prior research that shows that religious involvement is more protective for blacks, the following hypothesis must be true: the effect of religious involvement will have a greater impact among blacks on the risk of dying […]. Thus, blacks who attend services weekly or more will have a larger reduction in mortality than whites. Second, to support the claim that religion contributes to why hazard rates invert, the effect of religion must vary with age.” In Sautter et al. (2012), this criterion is implicit, but undergirds the empirical analysis.
It is well known that frailty can increase in populations in which individuals can newly acquire frailty during their lives (see Vaupel et al. [1988] for one systematic exploration of population dynamics that can result from such dynamic frailty). It is specifically in the context of frailty fixed in individuals that the frailty increases and frailty reversals illustrated here are deeply surprising.
In the language of causal inference, the association occurs because mortality is a collider for its risk factors. Conditional on survival, those risk factors become associated. See Elwert and Winship (2015). In the classical mortality selection model of the crossover, mortality is a collider for race and frailty. In the multidimensional model, mortality is a collider for race, observed exposure, and residual frailty, producing three-way associations between them over age.
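The collider mechanism can be made concrete with a small Python sketch. It assumes two binary dimensions (exposure and residual frailty) that are independent at birth and act multiplicatively on the hazard. The multipliers `e` and `f` and the frail proportion `p_frail` are hypothetical values, and `H` stands for the cumulative baseline hazard accumulated by a given age.

```python
import math

def prop_frail_among_survivors(H, exposed, e=2.0, f=3.0, p_frail=0.5):
    """Proportion residually frail among survivors of one exposure group.

    H       : cumulative baseline hazard accumulated so far (0 at birth)
    exposed : whether the group carries the exposure multiplier e
    f       : multiplier for residual frailty (all values are assumptions)
    """
    mult = e if exposed else 1.0
    surv_frail = math.exp(-mult * f * H)   # exposed-and-frail die fastest
    surv_robust = math.exp(-mult * H)
    return p_frail * surv_frail / (p_frail * surv_frail + (1 - p_frail) * surv_robust)

for H in (0.0, 0.5):
    print(H,
          prop_frail_among_survivors(H, exposed=True),
          prop_frail_among_survivors(H, exposed=False))
```

At birth (`H = 0`) the frail proportion is identical in both exposure groups, so exposure carries no information about residual frailty. Once the cohort is conditioned on survival, exposed survivors are less likely to be residually frail than non-exposed survivors, although neither trait changed in any individual.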
Whether the disadvantage associated with residual frailty, f*, has a larger effect on the mortality of the non-exposed or the exposed (and whether these effects have the same sign) depends on whether the increased mortality of the residually frail groups outweighs the increased selection of the residually frail groups in each exposure subpopulation. Both effects are greater among the exposed, making the total effects of their competing signs unpredictable a priori.
A frailty reversal in observed exposure, in which black survivors are more likely than white survivors to be exposed for some span of ages, can occur when the total effect of the paths into Tk, cumulative over all prior ages, is larger for blacks than for whites.
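One configuration that produces such a reversal can be sketched in Python. The setup is assumed, not taken from the paper's simulations: exposure is the weaker dimension (multiplier `e = 1.2`), residual frailty the stronger (`f = 20`), both are independent at birth with identical distributions in both races, and a hypothetical black multiplier of 1.5 applies to all hazards. `H` is the cumulative baseline hazard of robust non-exposed whites.

```python
import math

def prop_exposed(H, race_mult, e=1.2, f=20.0, p_exposed=0.5, p_frail=0.5):
    """Proportion of one race's survivors who are exposed (all values assumed)."""
    def group_survival(mult):
        # Survival of an exposure group, averaged over its frail and robust members.
        return p_frail * math.exp(-mult * f * H) + (1 - p_frail) * math.exp(-mult * H)

    surv_exposed = group_survival(race_mult * e)
    surv_nonexposed = group_survival(race_mult)
    return (p_exposed * surv_exposed
            / (p_exposed * surv_exposed + (1 - p_exposed) * surv_nonexposed))

BLACK_MULT = 1.5  # hypothetical black mortality multiplier
for H in (0.5 / 19, 2 / 19):  # an earlier and a later point in the cohort's life
    print(H, prop_exposed(H, BLACK_MULT), prop_exposed(H, 1.0))
```

At the earlier point, black survivors are less likely than white survivors to be exposed, as unidimensional intuition suggests. At the later point the ordering reverses: among the non-exposed, many residually frail members are still alive and dying quickly, while the exposed have already lost most of theirs, so the exposed share rebounds, and it rebounds earlier in the population with higher overall mortality.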
Whether the effects of covariates have a priori predictable or unpredictable sign is determined by the level of aggregation, not the dimension of heterogeneity. The mortality penalty associated with residual frailty, f*, always has a negative effect on the proportion of survivors who are residually frail at the subpopulation level. But since the dimensions of heterogeneity interact at the population level, as illustrated in Figure 5, f* can have either a negative or a positive effect on the proportion of survivors who are residually frail at the population level. Hence, the effect of f* on population mortality via its effect on the population-level proportion residually frail could be positive or negative.
Simulations show that the aggregate can cross simultaneously with either the exposed or the non-exposed subpopulation. Simultaneous crossovers are defined as crossovers occurring at the same survivorship of the robust non-exposed whites (thus, the same age), to three decimal places. The simulation procedure is described in Supplement 3.
One might suspect that the aggregate crossover is nearly simultaneous with the non-exposed crossover because, by the time the aggregate crossover occurs, virtually all survivors are non-exposed. But this is not the case. At age 82, when the aggregate and non-exposed crossovers occur, 27% of black survivors and 24% of white survivors are exposed. (Thus, there has been a “frailty” reversal in observed exposure.)
Any such covariates would need to be studied in a setting in which, or operationalized such that, they are fixed in individuals. For example, chronic illnesses acquired by middle adult or early-elderly ages might be used as a strong predictor of mortality at older ages.
Frailty reversals and increases might still occur along the residual frailty dimension, complicating the interpretation of the observed associations between tobacco exposure and mortality. See also the discussion in Supplement 4 that qualifies this criterion for alternative models in which one dimension of heterogeneity is more consequential for whites and the other for blacks.
Since frailty reversals and frailty crossovers will not always occur along the dimension of heterogeneity that less strongly increases mortality, only the presence, not the absence, of these phenomena constitutes a test of this model.
Since subpopulation crossovers will not always occur even in subpopulations whose frailest whites have higher mortality than the most robust blacks, only the presence, not the absence, of subpopulation crossovers constitutes a test of this model.
REFERENCES
- Aalen OO 1988. “Heterogeneity in survival analysis.” Statistics in Medicine 7(11).
- Beard RE (1959). Notes on some mathematical mortality models. In Wolstenholme GEW & O’Connor M. (Ed.) The Lifespan of Animals (pp. 302–311). Boston: Little, Brown.
- Beard RE (1971). Some aspects of theories of mortality, cause of death analysis, forecasting and stochastic processes. In Brass W. (Ed.) Biological Aspects of Demography (pp. 57–68). London: Taylor & Francis.
- Berkman L, Singer B, & Manton K. (1989). Black/white differences in health status and mortality among the elderly. Demography, 26(4): 661–678.
- Bretagnolle J & Huber-Carol C. (1988). Effects of omitting covariates in Cox’s model for survival data. Scandinavian Journal of Statistics, 15:125–138.
- Bowleg L. (2012). The Problem With the Phrase Women and Minorities: Intersectionality—an Important Theoretical Framework for Public Health. American Journal of Public Health, 102(7): 1267–1273.
- Charlson M, Szatrowski TP, Peterson J, & Gold J. 1994. Validation of a Combined Comorbidity Index. Journal of Clinical Epidemiology, 47(11): 1245–51.
- Curtin SC & Mathews TJ (2016). Smoking Prevalence and Cessation Before and During Pregnancy: Data From the Birth Certificate, 2014. National Vital Statistics Reports, 65(1).
- Dupre ME, Franzese AT & Parrado EA 2006. Religious attendance and mortality: Implications for the black-white mortality crossover. Demography, 43, 141–164.
- Elwert F. & Winship C. 2015. Endogenous Selection Bias: The Problem of Conditioning on a Collider Variable. Annual Review of Sociology, 40: 31–53.
- Engelman M, Canudas-Romo V, & Agree E. (2010). “The implications of increased survivorship for mortality variation in aging populations.” Population and Development Review, 36(3): 511–539.
- Fenelon A. 2013. “An examination of black/white differences in the rate of age-related mortality increase.” Demographic Research, 29(17): 441–472.
- Finkelstein M. 2012. “On ordered subpopulations and population mortality at advanced ages.” Theoretical Population Biology 81: 292–299.
- Finkelstein M. & Esaulova V. 2006. “Asymptotic behavior of a general class of mixture failure rates.” Advances in Applied Probability 38 (1), 244–262.
- Finkelstein M. & Esaulova V. 2008. “On asymptotic failure rates in bivariate frailty competing risks models.” Statistics and Probability Letters 78: 1174–1180.
- Fukui HH, Xiu L, & Curtsinger JW (1993). “Slowing of age-specific mortality rates in Drosophila melanogaster.” Experimental Gerontology, 28(6): 585–99.
- Gampe J. (2010). “Human mortality beyond age 110.” In Maier H, Gampe J, Jeune B, Robine J-M, & Vaupel J. (Ed.). Supercentenarians (Demographic Research Monographs 7) (pp. 219–230). Berlin: Springer.
- Guillot M. 2007. “Mortality in Kyrgyzstan since 1958: Real Patterns and Data Artifacts.” Espace Populations Societies 2007(1):113–26.
- Guo G. & Rodriguez G. 1992. Estimating a Multivariate Proportional Hazards Model for Clustered Data Using the EM Algorithm, with an Application to Child Survival in Guatemala. Journal of the American Statistical Association, 87(420): 969–976.
- Henderson R & Oman P. (1999). Effect of frailty on marginal regression estimates in survival analysis. Journal of the Royal Statistical Society, Series B, 61:367–379.
- Hernán MA, Clayton D, & Keiding N. 2011. The Simpson’s paradox unraveled. International Journal of Epidemiology, 40(3): 780–5.
- Hoffmann R. 2008. Socioeconomic Differences in Old Age Mortality. Springer: New York, NY.
- Horiuchi S. & Wilmoth JR 1997. Age patterns of the life table aging rate for major causes of death in Japan, 1951–1990. Journals of Gerontology Series A: Biological Sciences and Medical Sciences, 52, B67–B77.
- Horiuchi S. & Wilmoth JR 1998. Deceleration in the age pattern of mortality at older ages. Demography, 35, 391–412.
- Hougaard P, Myglegaard P, & Borch-Johnsen K. 1994. “Heterogeneity Models of Disease Susceptibility, with Application to Diabetic Nephropathy.” Biometrics 50(4): 1178–1188.
- Hutchinson JW, Kamakura WA, & Lynch JG Jr. 2000. Unobserved Heterogeneity as an Alternative Explanation for “Reversal” Effects in Behavioral Research. Journal of Consumer Research, 27(3): 324–344.
- Kannisto V. (1992). Frailty and survival. Genus, 47(3–4): 101–118.
- Kannisto V. (2000). Measuring the compression of mortality. Demographic Research, 3(6).
- Kestenbaum B. 1992. “A Description of the Extreme Aged Population Based on Improved Medicare Enrollment Data.” Demography 29(4): 565–80.
- Luo Y, Hawkley LC, Waite LJ, & Cacioppo JT 2012. Loneliness, health, and mortality in old age: a national longitudinal study. Social Science and Medicine, 74(6): 907–14.
- Lynch SM & Brown JS (2001). Reconsidering mortality compression and deceleration: An alternative model of mortality rates. Demography, 38, 79–95.
- Lynch SM, Brown JS & Harmsen KG 2003. Black-White differences in mortality compression and deceleration and the mortality crossover reconsidered. Research on Aging, 25, 456–483.
- Manton KG, Poss SS, & Wing S. 1979. The Black/White Mortality Crossover: Investigation from the Perspective of the Components of Aging. The Gerontologist, 19(3): 291–300.
- Manton KG & Stallard E. 1981. Methods for evaluating the heterogeneity of aging processes in human populations using vital statistics data: Explaining the black-white mortality crossover by a model of mortality selection. Human Biology, 53, 47–67.
- Manton KG, Stallard E, Woodbury MA & Dowd JE 1994. Time-varying covariates in models of human mortality and aging: Multidimensional generalizations of the Gompertz. Journals of Gerontology, 49, B169–B190.
- Manton KG, & Woodbury MA 1983. A Mathematical Model of the Physiological Dynamics of Aging and Correlated Mortality Selection: II. Application to the Duke Longitudinal Study. Journals of Gerontology, 38(4): 406–413.
- Manton KG, Woodbury MA & Stallard E. 1995. Sex differences in human mortality and aging at late ages: The effect of mortality selection and state dynamics. Gerontologist, 35, 597–608.
- Missov TI & Finkelstein MS (2011). Admissible mixing distributions for a general class of mixture survival models with known asymptotics. Theoretical Population Biology, 80(1): 64–70.
- Mohtashemi M. & Levins R. 2002. Qualitative analysis of the all-cause Black-White mortality crossover. Bulletin of Mathematical Biology, 64, 147–173.
- Olshansky SJ (1998). On the biodemography of aging: A review essay. Population and Development Review, 24, 381–393.
- Rogers RG (2002). Mortality differentials in a diverse society. In Denton NA & Tolnay S. (Ed.), American Diversity: A demographic challenge for the twenty-first century (pp. 129–154). Albany: SUNY Press.
- Sautter JM, Thomas PA, Dupre ME, & George LK 2012. Socioeconomic status and the Black-White mortality crossover. American Journal of Public Health, 102(8): 1566–71.
- Steinsaltz DR & Wachter KW 2006. “Understanding Mortality Rate Deceleration and Heterogeneity.” Mathematical Population Studies 13(1): 19–37.
- Thatcher AR, Kannisto V. & Vaupel JW (1998). The force of mortality at ages 80 to 120. Odense, Denmark: Odense University Press.
- Thornton R. 2004. The Navajo-U.S. Population Mortality Crossover Since the Mid-20th Century. Population Research and Policy Review, 23:3, 291–308.
- Vaupel JW 1988. Inherited frailty and longevity. Demography 25 (2), 277–287.
- Vaupel JW, Manton KG & Stallard E. 1979. Impact of heterogeneity in individual frailty on the dynamics of mortality. Demography, 16, 439–454.
- Vaupel JW, & Missov T. 2014. Unobserved population heterogeneity: A review of formal relationships. Demographic Research 31: 659–686.
- Vaupel JW & Yashin AI 1985. Heterogeneity’s ruses: Some surprising effects of selection on population dynamics. American Statistician, 39, 176–185.
- Vaupel JW, Yashin AI, & Manton KG 1988. “Debilitation’s Aftermath: Stochastic Process Models of Mortality.” Mathematical Population Studies 1(1): 21–48.
- Wienke A. 2010. Frailty Models in Survival Analysis. Chapman & Hall/CRC.
- Wrigley-Field E. 2014. Mortality deceleration and mortality selection: Three unexpected implications of a simple model. Demography, 51: 51–71.
- Wrigley-Field E. and Elwert F. 2016. “Mortality Crossovers from Dynamic Subpopulation Reordering.” In Schoen R, ed., Dynamic Demographic Analysis, Springer: 177–199.
- Wrigley-Field E. and Elwert F. 2017. “The Black-White Mortality Crossover: Integrating Mortality Selection and Life Course Explanations.” American Sociological Association; Montreal.
- Woodbury MA & Manton KG 1983. A mathematical model of the physiological dynamics of aging and correlated mortality selection. 1: Theoretical development and critiques. Journals of Gerontology, 38(4): 398–405.
- Yashin AI & Manton KG 1997. Effects of Unobserved and Partially Observed Covariate Processes on System Failure: A Review of Models and Estimation Strategies. Statistical Science, 12(1): 20–34.
- Yashin AI, Manton KG, & Vaupel JW 1985. Mortality and Aging in a Heterogeneous Population: A Stochastic Process Model with Observed and Unobserved Variables. Theoretical Population Biology 27: 154–175.
- Yao L. & Robert SA 2011. Examining the Racial Crossover in Mortality between African American and White Older Adults: A Multilevel Survival Analysis of Race, Individual Socioeconomic Status, and Neighborhood Socioeconomic Context. Journal of Aging Research, 1–8.
- Zeng Y, & Vaupel JW 2003. Oldest-Old Mortality in China. Demographic Research, 8(7): 215–244.