Abstract
Unobserved heterogeneity in mortality risk is pervasive and consequential. Mortality deceleration—the slowing of mortality’s rise with age—has been considered an important window into heterogeneity that otherwise might be impossible to explore. This paper argues that deceleration patterns may reveal surprisingly little about the heterogeneity that putatively produces them. I show that even in a very simple model—one composed of just two subpopulations with Gompertz mortality—(1) aggregate mortality can decelerate even while a majority of the cohort is frail; (2) multiple decelerations are possible; and (3) mortality selection can produce acceleration as well as deceleration. Simulations show that these patterns are plausible in model cohorts that in the aggregate resemble cohorts in the Human Mortality Database. I argue that these results: challenge some conventional heuristics for understanding the relationship between selection and deceleration; undermine certain inferences from deceleration timing to patterns of social inequality; and imply that standard parametric models, assumed to plateau at most once, may sometimes badly misestimate deceleration timing—even by decades.
Keywords: Mortality deceleration, mortality selection, heterogeneity, methods, logistic models
Unobserved heterogeneity in mortality is pervasive and consequential. All cohorts are heterogeneous in mortality risk in ways that are often stably ordered over the life course: if one person is higher-risk than another at young ages, they will also be higher-risk at older ages, should they both live that long. This heterogeneity gives rise to mortality selection: the frailest members of a cohort disproportionately succumb to mortality, eventually leaving an intensely selected, relatively robust cohort in its place (e.g., Beard 1959, 1971; Kannisto 1992; Vaupel, Manton, and Stallard 1979; Vaupel and Yashin 1985). This selection will in general occur unequally across social groups with different patterns of heterogeneity and overall levels of mortality risk. Such unequal selection substantially complicates efforts to understand inequalities in mortality risk—including inequalities driven by observable factors—across and within groups.
The challenge posed by mortality selection for accurately modeling differences in mortality arises in part because—unlike in a linear regression context—mortality selection can bias hazard models even when the omitted sources of mortality risk are unrelated at baseline to the covariates included in the model (Rodriguez 1994). There are many examples of the distorting effects of unobserved heterogeneity caused by mortality selection. Perhaps the most well-known is the black-white mortality crossover (e.g., Berkman et al. 1989, Dupre et al. 1996, Kestenbaum 1992, Lynch et al. 2003, Manton et al. 1979, Masters 2012)—one of many documented mortality crossovers (e.g., Rogers 2002)—which confounds racial inequalities operating at old ages with the intensified selection for robustness experienced by African-American cohorts. Other examples abound. A long-running demographic controversy is whether the well-documented narrowing over age of the mortality disparities by educational status reflects a true convergence of mortality trajectories—perhaps implying a biological limit to lifespan that even healthy behaviors and other advantages cannot overcome—or simply the operation of mortality selection among the higher-mortality less educated, masking a protective effect of education that continues into old age (e.g., Robert and House 2000, Zajacova et al. 2009). Assessing demographic theories about the effects of early-life health on later-life outcomes can also be confounded by mortality selection: the cost to individuals of negative childhood conditions can be masked by the robust composition of those who survive such conditions (e.g., Costa 2012, Palloni 2006). Unobserved heterogeneity can be deeply distorting to efforts to understand group-level inequalities and individual determinants of mortality.
The significance of unobserved heterogeneity in disguising how mortality inequalities unfold over a life course stands in stark contrast to the relative dearth of methods for exploring such heterogeneity. Without comprehensive individual-level measures of the determinants of mortality, population scientists can consider only the aggregate mortality of groups, aided by either parametric assumptions about the unobserved distribution of risk within those groups (in a long research tradition beginning with Vaupel et al. [1979]) or qualitative assumptions about the distribution across groups. Simply put, when dealing with a phenomenon that is by definition unobserved, options are limited.
In this context, mortality deceleration has been considered an important window into heterogeneity that otherwise might be impossible to explore. Mortality deceleration—the slowing of mortality’s rise with age—can occur as cohorts are systematically selected by mortality to disproportionately contain those most robust to death (e.g., Beard 1959, 1971; Kannisto 1992; Thatcher et al. 1998; Vaupel et al. 1979; Vaupel and Yashin 1985). Thus, deceleration has often been considered an important indicator of substantial prior selection—and thus of heterogeneity in the original cohort. Similarly, differential deceleration patterns across groups (e.g., cohorts or race and sex groups) has been used as evidence of differences between groups’ patterns of internal heterogeneity (e.g., Horiuchi and Wilmoth 1997, 1998; Lynch and Brown 2001; Lynch et al. 2003). Indeed, in the words of Lynch et al. (2003: 462), “measuring deceleration, compression and crossover is the means by which to examine heterogeneity within and between populations.”
Demographers have, however, also recognized that deceleration patterns are no panacea for understanding heterogeneity since deceleration in mortality risk may arise biologically at the individual level, not only from mortality selection at the group level. In particular, many experimental biodemographic studies have not found heterogeneity alone to be a plausible cause of observed deceleration in insect and animal populations (e.g., Carey et al. 1992, Curtsinger et al. 1992, Drapeau et al. 2000 [but see Steinsaltz 2005], Rauser et al. 2005, Vaupel and Carey 1993), and a smaller number of studies have reached similar conclusions for human data (Mueller et al. 2011, Steinsaltz and Wachter 2006; and see reviews in Vaupel 1997, Wachter and Finch 1997). If deceleration does arise for reasons other than selection, then heterogeneity might or might not interact with individual-level decelerating mortality to influence population-level rates, but deceleration would not constitute straightforward evidence of selection (see Steinsaltz and Evans 2004 for an elaboration of the argument that deceleration patterns alone are not telling evidence for any particular model that might give rise to them).
In this paper, I show that even if deceleration stems entirely from mortality selection on unobserved heterogeneity, those deceleration patterns still may reveal little about the heterogeneity that produced them. The reason is that even an exceedingly simple binary frailty model can produce deceleration and acceleration patterns sufficiently complex so as to defy some conventional models and predictions. In particular, I show that mortality can decelerate even while most of the cohort remains frail; that mortality can decelerate twice, even with only two subpopulations; and that mortality can reaccelerate, not only because mortality selection has already run its course (a possibility discussed by Vaupel and Yashin [1985]), but also—counterintuitively—because of the continued action of selection itself. These possibilities stem from a simple mathematical result: the rate of selection is greatest, ceteris paribus, when the frail and robust are each half of the cohort. Thus, this paper explores the relationship between the level of frailty in a cohort as it ages, its rate of selection, and the deceleration and acceleration patterns produced by those two things.
These results have two additional implications. As I show, parametric mortality models, such as logistic models, may badly misestimate deceleration timing—even by decades. And standard cross-cohort comparisons of deceleration timing may misstate the direction of inequality in mortality risk.
I begin by stating more explicitly some conventional expectations about mortality deceleration that these results call into question. For concreteness, I then provide an example of the kind of counter-intuitive pattern that the analysis shows to be possible, before presenting the key analytical result that provides the intuition for the mortality patterns explored. However, the complexity of the mortality derivatives makes it difficult to assess analytically when such counter-intuitive patterns will arise, so the main body of the paper presents the results of simulation models with parameters drawn from the Human Mortality Database. Simulations show that the surprising patterns described here can be pervasive in models with realistic parameters, and that the resulting problems with parametric mortality models and certain cross-group comparisons may also be widespread. Before concluding, I argue that, since the results explored here are precluded by widely-used gamma-distributed frailty models, these results suggest a need for more direct comparison of heterogeneity models.
Mortality Deceleration and Mortality Selection
Mortality Deceleration
Mortality deceleration is the label given to a class of mortality patterns deviating from the exponential mortality of the Gompertz model, which posits that mortality accelerates at increasing speed as a cohort ages. In operationalizing deceleration, demographers have variously highlighted different degrees of deviation from exponential mortality. Some focus on what I call relative deceleration, which occurs when mortality continues to accelerate, but does so more slowly than at younger ages. This begins when the third derivative (jerk) of aggregate mortality becomes negative, or, equivalently, when the second derivative begins to decline (Rau et al. [2009] compares the alternatives and advocates this measure). Others (Bebbington et al. 2007, Lynch and Brown 2001; Lynch et al. 2003) employ what I call absolute deceleration, which occurs when mortality is no longer accelerating at all: the second derivative (acceleration) is negative, and the first derivative has begun to decline.1 In principle, cohorts may evince an even more extreme deviation from Gompertz mortality, mortality decline, which occurs when the first derivative (slope) is negative. Table 1 summarizes these measures. Here, I conceive of mortality deceleration as a process that begins with relative deceleration and may progress through absolute deceleration and even mortality decline, or may stop or reverse at any point. This paper is agnostic about which of absolute or relative deceleration, if either, is a preferable measure, and explores the properties of both. Accordingly, the crucial measures defining deceleration and reacceleration in what follows will be the signs of the derivatives of mortality.2
Table 1.
Mortality, μ(a) | |||
---|---|---|---|
Slope, μ′ (a) | Acceleration, μ″(a) | Jerk, μ″′(a) | |
Increasingly accelerating mortality | >0 | >0 | >0 |
Relative deceleration | >0 | >0 | <0 |
Absolute deceleration | >0 | <0 | |
Mortality decline | <0 |
Three Common Assumptions about Heterogeneity and Deceleration
Demographic work on mortality selection has been considerably advanced by efforts to articulate explicit intuitions about the conditions for mortality to decelerate. This paper shows that three such intuitions are wrong.
First, demographers frequently adopt the heuristic that mortality decelerates when the percent frail in the cohort reaches some low critical value. That heuristic seems to underlie some analyses of deceleration, particularly those that attempt to explicitly relate deceleration patterns to social inequality. For example, Lynch et al. (2003), which significantly advances the literature by providing one of the most explicit discussions of how characteristics of population heterogeneity affect deceleration timing, argues that:
A population with a large number of frail members relative to robust members will experience deceleration when mortality rates are higher (and potentially at a later age) than a population whose membership is equally distributed across frail and robust groups. In the former case, it simply takes longer for mortality to select out the frailer members [Lynch et al. 2003: 462; emphasis added]3
Heathcote, Puza, and Roberts (2009) makes a similar claim in a paper giving rare explicit consideration of the possibility that selection-induced deceleration may be followed by reacceleration in populations composed of several subgroups. Their paper shows that:
[T]here exist models of mixtures of Gompertz groups such that, depending on the extent of heterogeneity, there may be none, one or several age intervals of deceleration of the population hazard function interspersed with intervals of acceleration. Gompertz-like behaviour may then be resumed at extreme old age. An intuitive explanation is that deceleration occurs when the weakest group is dying out, followed by a brief assertion of Gompertz acceleration before the next weakest group dies out, and so forth. [Heathcote et al. 2009: 482; emphasis added]
Second, demographers commonly assume that mortality decelerates only once—at least if there are only two subpopulations. The assumption of a single deceleration is built into the standard parametric form used to model mortality deceleration, the logistic model (e.g., Bongaarts 2005, Kulminski et al. 2007, Rau et al. 2009, Thatcher 1999), as well as the alternative arctangent form used by Lynch and Brown (2001; Lynch et al. 2003). The expectation that multiple decelerations are precluded in a two-subpopulation model is made explicit in Heathcote et al. (2009), which proposes that, in a cohort with k heterogeneous closed subpopulations, mortality may decelerate in a maximum of k-1 intervals.4
Finally, it is common to conceptualize the derivatives of aggregate mortality as a competition between subpopulation acceleration, which leads the aggregate hazards to accelerate, and the declining frailty composition (driven by mortality selection) of the population, which leads the aggregate hazards to decelerate. Thus, reacceleration, when it is considered, is assumed to reflect the accelerating mortality of subpopulations, overwhelming the decelerating effect of selection. Mortality selection per se is assumed to produce only deceleration, never acceleration.5
In what follows, I show that each of these assumptions can fail.
An Example: High-Frailty Deceleration and Multiple Deceleration
To make concrete what follows, I begin by introducing as a running example a single simulated cohort, drawn from a class of simulated cohorts described in detail below. This cohort consists of two subcohorts, each with Gompertz mortality, with 75% of the cohort in the frail subpopulation at age 50.6
Fig. 1 displays this example cohort from ages 50–100 (by which age the frail are virtually extinct). Panel A of Fig. 1, which gives the aggregate mortality of the cohort over age, illustrates the counter-intuitive patterns at the heart of this paper. In this cohort, mortality increasingly accelerates until age 68, when the first interval of relative deceleration begins, with 66% of the cohort frail. At age 75, with 54% of the cohort frail, mortality decelerates absolutely; the second derivative remains negative until age 84 (16% frail), when a second period of increasingly accelerating mortality begins. This persists until age 91 (0.9% frail), when mortality again decelerates relatively until age 94 (0.2% frail), when the cohort enters increasingly accelerating mortality for the third and final time. Panel B shows the slope of the percent frail of the cohort. We will see that this slope—which, with its sign reversed, is the rate of frailty decline—drives much of the results to follow. In this cohort, frailty declines fastest at age 82, when the frail are 27% of the cohort. The dashed vertical line in both panels marks the point when the frail are half of the cohort, which will be shown to be a turning point in some of the selection dynamics explored in this paper. (The full set of derivatives of mortality and frailty for this cohort are shown in Online Resource 1.)
Analytical Intuition
High-Frailty Deceleration, Multiple Deceleration, and Selection-Driven Acceleration are Possible in Principle
The surprising mortality patterns shown here to be possible arise from the role played in the mortality derivatives by the rate of change in the cohort’s% frail. Specifically, it turns out that the level of frailty—the percent frail—plays two distinct roles in producing mortality acceleration and deceleration: a direct role (as already shown in Vaupel and Yashin [1985]), and an indirect role via the rate of selection. Here I show the key equations that provide the intuition; Online Resource 1 provides the full analytical results.
The model in this paper assumes that both subcohorts have Gompertz mortality, as shown in Eq. 1:
(1a) |
(1b) |
The two subcohorts share a log-slope over mortality, β. That log-slope and the baseline level of robust mortality, α, are assumed to be greater than zero. The frailty multiplier, f, which is the ratio of frail to robust mortality at any age, is assumed to be greater than 1.
Cohort mortality at any age, and its derivatives, are functions of three parameters: the mortality of the robust, μr(x) ; the mortality of the frail, μf(x) ; and the percent of the cohort that is frail (the frailty composition), π(x), as shown in Eq. 2:
(2) |
The crucial role in what follows is played by the slope of frailty composition, that is, by the rate at which the cohort is becoming less frail due to mortality selection. Equation 3, which provides the intuition for all results in this paper, shows that this slope is a function of the level of frailty composition and of the difference between frail and robust mortality:
(3) |
All terms in Eq. 3 are non-negative, and so the negative sign means that the slope is always negative: at all ages, mortality selection makes the cohort less frail. The absolute value of this expression can be considered the rate of selection. The binomial variance of the percent frail, π(x)(1 – (x)), is greatest when π(x)=.5. In other words, all else equal, the rate of frailty decline is greatest when half of the cohort is frail and half robust. Yet all else is not equal: the absolute difference between frail and robust mortality, πf(x) – πr(x), is greatest when the mortality of each subpopulation is greatest: at the oldest ages.
These two terms create the possibility that selection intensifies, and mortality decelerates, as π (x) hurtles downward toward .5; that selection attenuates, and mortality reaccelerates, as frailty declines further away from .5; and that selection reintensifies, and mortality decelerates again, at very old ages, when πf (x) πr (x) is very large (before mortality accelerates a final time with only the robust left alive). Yet the mortality derivatives are sufficiently complex that it is difficult to evaluate analytically whether to expect these patterns in cohorts with realistic parameters. To answer that question, I turn to simulations.
Simulation Results
High-Frailty Deceleration, Multiple Deceleration, and Selection-Driven Acceleration Occur Widely in Model Populations Compatible with the Human Mortality Database
Simulation Procedure
Four parameters define the mortality model: the intercept for robust mortality, α; the log-slope of mortality for both subpopulations, β; the frailty multiplier, or ratio of frail to robust mortality at any age, f; and the baseline percent frail, π0. All four parameters are unobserved in real data, since subpopulation membership is latent by assumption. Thus, in generating realistic simulations, the goal is to find parameter combinations that generate aggregate cohorts whose parameters match those of real human cohorts. To minimize the role played by assumptions about unobserved parameters and maximize the role played by real data in the selection of simulations, I generate many candidate latent subpopulation models and keep only those whose aggregate parameters are consistent with the life tables of known cohorts.
The mortality hazards are calculated analytically, in instantaneous time. To limit the complexity over four dimensions, one parameter—baseline frailty composition—is set in all simulations at .75, a high value chosen to make visible the selection dynamics when frailty is common as well as rare. The baseline age is 50, which leaves the model agnostic as to whether mortality rises during late adulthood with the same log-slope as it had earlier in life.7 The model therefore assumes that 75% of the population surviving to age 50 is frail.8 Thus, these simulations represent cohorts in which mortality advantage, rather than disadvantage, is the exceptional condition.9 The frailty multiplier is modeled at eight values, ranging in units of .5 from 1.5 to 5; the low end represents fairly modest disadvantage, while the top end is at the extreme of what we might consider plausible for human populations.10 The intercept for the robust and the log-slope for the two subpopulations are varied nearly continuously, in increments of .001—in the range [.001,.2] for α and [.001,.4] for β. In total, this produces 640,000 simulated cohorts before evaluating the resulting parameters for plausibility.11
To winnow these 640,000 simulated cohorts down to a realistic subset, I estimate a Gompertz model on each aggregated cohort,12 and keep only the ones that fall inside a parallelogram formed around the intercept and log-slope parameters estimated from the 2,352 historical European cohorts collected in the Human Mortality Database (HMD).13 The parallelogram hugs the shape of the HMD cohorts (shown visually in Online Resource 2), so that cohorts included in the final simulation universe generally not only have a Gompertz intercept and slope similar to real (but distinct) cohorts; they have a combination of intercept and slope that is similar to at least one real cohort.14 The result is a universe of 1,151 simulated cohorts analyzed in this paper.15
Simulation Results
High-Frailty and Multiple Deceleration
At what percent frail does deceleration and reacceleration occur in this universe? Figure 2 displays the frailty composition at absolute deceleration, relative deceleration as a whole, and relative deceleration restricted to cohorts that decelerate only once (each with their respective reaccelerations reflected in the bottom row). The results demonstrate the problems for the heuristics that mortality decelerates only once (one-third of the cohorts decelerate relatively twice) and only when the frail are nearly depleted. Panel A of Fig. 2 shows that absolute deceleration never corresponds, in these simulations, to the latter heuristic: absolute deceleration can occur when the frail are a majority or a minority, but never occurs here when they are less than 35% of the population. As shown in Panel B, reaccelerations following absolute deceleration, likewise, occur well before frailty depletion, when the frail are 15–34% of the cohort. Panels C and D, which show all relative decelerations, suggest that relative deceleration does sometimes occur as the heuristic would predict, with the cohort decelerating when the frail are nearly gone (in these cohorts, when frailty composition ranges between 0.5–1.2%) and reaccelerating shortly thereafter. But Panels E and F demonstrate that this pattern occurs only in the second of two decelerations. These panels are limited to the 67% of simulated cohorts with only one relative deceleration. They show that when there is a single relative deceleration, the deceleration occurs when most of the cohort is frail. In short, neither absolute nor relative deceleration corresponds to the conventional picture of a single deceleration when the frail are approaching extinction. (An analysis of where in the parameter space high-frailty and multiple decelerations occur is presented in Online Resource 2.)
Selection-Driven Acceleration
It seems intuitive to conceive of acceleration/deceleration as a tradeoff between accelerating subpopulations (producing acceleration) and declining frailty composition (producing deceleration). Yet declining frailty composition can also produce acceleration.
To underscore this point, I offer an artificial calculation as a device for isolating the role of declining frailty composition, illustrating what I call selection-driven acceleration. Starting from the example cohort given above (π0=.75, f=5, α=.002, and β=.103), imagine that we could hold subpopulation mortality and its derivatives fixed at their levels at age 81—the age when the aggregate second derivative reaches its minimum (with 31% of the cohort frail)—while the percent frail is left varying as in the actual cohort. This calculation isolates the effects of the declining percent frail from those of increasing subpopulation hazards, slopes and accelerations. Figure 3 shows the results: while mortality does not reaccelerate nearly as dramatically without the changes in the subpopulation mortality, nevertheless it does reaccelerate. In this exercise, the declining rate of selection, as frailty falls further below half, leads mortality to reaccelerate even in the absence of subpopulation-level changes. This demonstrates that mortality selection can contribute to acceleration as well as deceleration.
Further Implications
Bias in Estimated Age at Deceleration Using Common Parametric Models
Results so far suggest that, even in an exceedingly simple model, it is possible, indeed plausible, for cohorts to experience at least one period of deceleration followed by reaccelerating mortality. Yet the standard parametric forms used to model older ages—most often, logistic models (e.g., Bongaarts 2005, Kulminski et al. 2007, Rau et al. 2009, Thatcher 1999), and occasionally, very similar arctangent models (Lynch and Brown 2001; Lynch et al. 2003)—assume that mortality decelerates at most once and never reaccelerates. It turns out that this can lead such parametric forms to systematically misestimate deceleration timing.
Fig. 4 plots the deceleration timing derived from two-parameter logistic models estimated on the simulated cohorts against the actual deceleration timing of those cohorts, with the main diagonal provided as a reference line.16 If the logistic models work well, the estimated deceleration ages should fall along (or near) this main diagonal. For most decelerations, they fall very far from it.
Panel A of Fig. 4 shows the results for absolute deceleration. The logistic models badly overestimate the age of deceleration, with the overestimation ranging between 18 and 39 years (28 years on average). Panels B, C, and D show the results for relative deceleration, considering, respectively, single, first, and second relative decelerations. Thus, panel B gives actual vs. estimated deceleration age for those cohorts that decelerate only once. Panels C and D each give the same outcome measure—deceleration timing estimated from logistic models on cohorts that decelerate twice—spread over two different regions on the horizontal axis, reflecting the cohort’s two deceleration points. These panels show that the logistic models fit poorly the deceleration patterns for first and single relative decelerations, and fit well second decelerations. Single deceleration ages are overestimated by 5 to 27 years (16 on average), and first decelerations by 17 to 28 years (22 on average). In contrast, the logistic models underestimate the age at (relatively rare) second relative decelerations by between 1 and 6 years (underestimating by only 3 on average).
Most troublingly, perhaps, the logistic models falsely detect deceleration with alarming frequency. Absolute deceleration is detected in all cohorts that decelerate only relatively. Most strikingly, relative and absolute deceleration are predicted in all cohorts that do not decelerate at all. The estimated age of relative deceleration for these cohorts ranges from 83 to 100 (mean 90), and absolute deceleration from 99 to 117 (mean 107).
These results are especially problematic for two reasons. First, the falsely detected decelerations are found at very similar ages to those identified in previous empirical research on the age at deceleration. Second, the magnitude of the error found in the estimation of age at deceleration—up to several decades—dwarfs the size of differences in deceleration timing interpreted substantively in the empirical literature, which are sometimes only a few years.17
It is routine in the demographic literature for logistic or similar parametric models to be estimated on mortality data without checking the raw data for evidence of multiple deceleration, or deceleration followed by reacceleration. This might be because the raw data are too noisy to support such investigation non-parametrically, but perhaps it is also because such deceleration patterns are not considered a serious possibility, or a serious source of model bias if they do exist. The latter two ideas are cast into doubt by these results. These results collectively suggest that it would be advisable for demographers whose primary object of study is deceleration not to rely solely on models that assume a single peak in mortality. It may be best to use these conventional parametric approaches alongside some effort to consider whether multiple deceleration, or deceleration with reacceleration, may be present in the data.
Comparing Deceleration across Cohorts
A central motivation for accurately measuring deceleration is that comparing deceleration timing across cohorts or other closed social groups may permit demographers to infer something about the differences in the cohorts’ distributions of mortality risk (Horiuchi and Wilmoth 1997, 1998; Lynch and Brown 2001; Lynch et al. 2003). This endeavor, which links the measurement of deceleration to the study of inequality and change in mortality, necessitates a qualitatively simple relationship between unobserved patterns of heterogeneity and observed patterns of deceleration. Accordingly, such reasoning was advanced considerably by Lynch et al. (2003), which articulated explicit predictions about the circumstances in which one cohort should decelerate at an older age and higher mortality level than another, and used those predictions to infer changes in mortality heterogeneity within racial groups.18 One of these predictions was quoted above: “A population with a large number of frail members relative to robust members will experience deceleration when mortality rates are higher (and potentially at a later age) than a population whose membership is equally distributed across frail and robust groups” (Lynch et al. 2003: 462). In this section, I show more explicitly how this prediction will sometimes fail.
To test this prediction using all the parameter combinations in the simulation universe, I compare deceleration timing in pairs of cohorts with fixed frailty multiplier f, robust intercept α, and log-slope β, but with baseline frailty composition π0 varying in units of .05 from .05 to .75. Table 2 reports the proportion of pairs of cohorts for which the mortality or age at deceleration is greater in the cohort with greater baseline percent frail. For absolute decelerations and single relative decelerations, the prediction fares well (albeit imperfectly) for mortality, and poorly for age. For both mortality and age, the prediction is consistently validated for first relative decelerations (when these are separated from both second and single decelerations), but consistently disproved for second (low-frailty) relative decelerations. The latter is particularly important since, as shown above, it is these second relative decelerations, where the prediction fares worst, that are most closely matched by the estimated deceleration in logistic mortality models. In short, it is not necessarily the case that ordering decelerations across cohorts—by mortality or by age—can reveal which cohort had more frail members at birth, even assuming that the cohorts otherwise share the same mortality parameters.19 (These results are extended to consider the percent frail at deceleration, and an example set of cohorts is offered to illustrate the results in this section, in Online Resource 3.)
Table 2.
Deceleration Type | Mortality | Age |
---|---|---|
Absolute | .97 | .23 |
Relative (single) | .80 | .32 |
Relative (first) | 1 | 1 |
Relative (second) | 0 | 0 |
Relative (all) | .80 | .35 |
Parameterizing Heterogeneity
The results in this paper do not readily generalize to the widely used models with gamma-distributed individual frailty and Gompertz individual hazards, since those models aggregate to a logistic (hence single-deceleration) hazard (Beard 1959, 1971). Such gamma-Gompertz models fit many data well (Steinsaltz and Wachter 2006, Missov and Finkelstein 2011) and have been very widely used (e.g., Gampe 2010, Horiuchi and Wilmoth 1998) ever since Vaupel, Manton, and Stallard (1979) pointed out their convenient properties, but whether they are appropriate for all populations or whether competing models might fit equally well is a question that should receive further attention (Steinsaltz and Evans 2004).
In particular, it is not clear which model might be preferable in cases of extreme stratification along unobserved dimensions. Gamma-Gompertz models differ from the dichotomous model used here in two respects: the heterogeneity they estimate is continuously distributed, and it is unimodal. In cases where one unobserved dimension of heterogeneity is so extreme that it swamps other sources of variation between individuals, it is not obvious a priori whether a continuous but single-peak model is a better approximation than a dichotomous one.
To the best of my knowledge, all previous models either capture continuous individual variation at the cost of imposing a unimodal distribution conditional on observed covariates (as in gamma-Gompertz models), or capture the clumping of individual variation that may result from categorical inequalities, such as racial inequality, at the expense of continuous individual variation (in discrete models, such as the model used here). The results presented here suggest that it may be useful to more directly compare discrete and gamma-Gompertz models, particularly in populations that may have unmeasured extreme, categorical stratification, since the models produce such divergent deceleration patterns. Further innovations in modeling heterogeneity, such as mixtures of gamma distributions—which could combine the potential virtues of continuous variation and multiple peaks—should also be explored in future research, including research into the deceleration and acceleration patterns such models may produce.
Since gamma-Gompertz models preclude the deceleration and reacceleration patterns described here, in principle, these theoretical results suggest a test of those models, by looking for multiple decelerations or reaccelerations in empirical data (analogously to Horiuchi and Coale’s [1990] classic work using higher moments to distinguish models generating similar estimated means). In practice, such a test would require very high quality data at old ages, since deceleration would need to be assessed nonparametrically.
Conclusion
Unobserved heterogeneity is a barrier to accurately modeling individual-level risk and group-level inequalities in many domains of mortality research, and options for investigating it are relatively limited. Mortality deceleration has been considered by many a promising avenue for assessing such heterogeneity. This paper has shown that drawing conclusions about heterogeneity from mortality deceleration is problematic for reasons that have not previously been articulated, by demonstrating three unexpected facts about mortality deceleration, which together have three broader implications for demographers. It has shown that even within a single cohort composed of just two subpopulations with proportional Gompertz hazards:
Mortality can decelerate even while a majority of the cohort is frail (high-frailty deceleration).
Mortality can then reaccelerate while the frail remain a non-negligible part of the cohort. This occurs because the rate of selection is greatest when half the population is frail, so that—counter-intuitively—selection of the frail out of the cohort can cause acceleration, not only deceleration, as the frailty composition dips below half (selection-driven acceleration).
Mortality can then decelerate a second time as the frailty composition dips further below half (multiple deceleration), before finally reaccelerating as the robust become such a large part of the cohort that their acceleration dominates over the negligible selection that remains possible.
These facts have three important implications. First, the first two facts challenge a conventional heuristic that has anchored important intuitions in previous demographic work, namely: the heuristic that mortality decelerates only as the frailty composition is ‘nearly’ depleted, and reaccelerates only as the frailty composition is ‘nearly all’ depleted. This paper shows that, while this pattern does occur, it is not the only—or even the main—pattern of deceleration and reacceleration possible in this simple mortality setting. The contribution of mortality selection to deceleration and reacceleration is more complex than has been previously articulated in the literature.
Second, the second and third facts suggest that conventional parameterizations of old-age mortality may lead analysts astray. Parametric forms used to identify the timing of deceleration, such as logistic (Bongaarts 2005, Kulminski et al. 2007, Rau et al. 2009, Thatcher 1999) or arctangent (Lynch and Brown 2001, Lynch et al. 2003) forms, assume a single mortality plateau. Not only will such parametric forms miss reacceleration and multiple deceleration when they occur; when such patterns occur, these parametric forms may misestimate—by decades—the timing of any deceleration point, as they average observations whose derivatives are significantly more complex than the forms assume. This is particularly problematic for purposes that compare deceleration timing across cohorts—and such comparisons are the central way that deceleration timing bears on inequality, within and across cohorts.
Finally, the three facts together qualify the link between deceleration patterns and inequality in one additional way. It turns out that—in contrast to an earlier prediction (Lynch et al. 2003) used to link deceleration patterns to changing heterogeneity among blacks and whites in the United States—all else equal, a cohort with greater baseline frailty composition can sometimes decelerate at lower mortality and younger ages than one with fewer frail members at baseline.
What is perhaps most startling is not only that such counter-intuitive patterns are possible, but that they are possible even in an exceedingly simple mortality model. Reality is bound to be more complex, and more complicated models may or may not create even less predictable deceleration dynamics. These results call for caution in modeling and interpreting mortality deceleration. More broadly, the results in this paper should urge demographers to deepen our theoretical understanding of the surprisingly complex ways that patterns of acceleration and deceleration arise from changing cohort composition. These results highlight the dangers of relying on intuitions about deceleration. They suggest a greater need for formal modeling of deceleration dynamics, and in particular, explicitly comparative modeling that matches the kinds of inferences about heterogeneity and inequality for which deceleration patterns have been used as evidence.
Acknowledgments
This research was supported by a graduate research fellowship from the National Science Foundation, a dissertation fellowship from the Ford Foundation, and core grants to the Center for Demography and Ecology (R24 HD047873) and Center for Demography of Health and Aging (P30 AG017266) at the University of Wisconsin-Madison. The author thanks, for helpful feedback, Felix Elwert, James Montgomery, Jenny Conrad, Michal Engelman, Duncan Gillespie, Jeffrey Grigg, Paul Hanselman, Anna Haskins, Michelle Niemann, Jenna Nobles, and Sarah Zureick-Brown, along with excellent reviewers and the Editor, and, for technical assistance, Russell Dimond, David Siegel, and the Social Science Computing Cooperative at the University of Wisconsin-Madison.
Footnotes
Lynch and Brown (2001) use the term absolute deceleration as I use it, but use relative deceleration to refer to the Lifetable Aging Rate (LAR), discussed below. They do not discuss what I call relative deceleration, which entered the demographic literature more recently, with Rau et al. (2009).
The major alternative to the derivatives of mortality, in conceptualizing deceleration, is the slope of the natural log of mortality, dubbed the Lifetable Aging Rate (LAR) by Horiuchi (e.g., 1997; Horiuchi and Coale 1990; Horiuchi and Wilmoth 1997). Its chief disadvantage for this paper is that, because the LAR is relative to the overall level of mortality, mortality acceleration/deceleration as measured with the LAR is sensitive to the level of the age-invariant component of mortality (Vaupel and Zhang 2010). In contrast, the mortality derivatives are functions only of the derivatives of cohort frailty composition and of subpopulation mortality (as shown in Online Resource 1). Using the derivatives of mortality in its own scale therefore allows us to more cleanly focus on the contribution of mortality selection, that is, of the declining composition of frail members of the cohort.
Similarly, Lynch and Brown (2001) describes the general result that high-mortality populations decelerate at younger ages like this:
The heterogeneity hypothesis of Horiuchi and Wilmoth (1998) suggests that the age at which deceleration begins should increase over time. The rationale for this prediction is that, as a population becomes more homogeneously robust, the frailer members of the population live longer. Hence their mortality patterns are more similar to that of the most robust subpopulation. This implies a later age before mortality rates come to be governed by the more robust subpopulation, and hence an older age at which deceleration begins. [Lynch and Brown 2001: 81; emphasis added]
The results in the present paper do not directly speak to this proposal because Heathcote et al.’s model assigns heterogeneous slopes to the subpopulations, whereas the model presented here assumes proportional hazards. This paper shows that in the proportional hazards setting, generally considered a more restrictive assumption, even cohorts with only two, not three, closed subpopulations can experience two successive decelerations.
Demographic intuitions on this point may be influenced by a result presented in perhaps the most influential paper introducing mortality selection to a wide demographic audience, Vaupel and Yashin’s (1985) “Heterogeneity’s Ruses.” Vaupel and Yashin (1985: 177) write:
The sudden decline in the observed hazard rate is produced by the rapid extinction of the frailer subcohort. Until the point of decline, the frailer subcohort experiences death rates that are relatively low. Then, due to the exponential increase in the force of mortality, the death rates become sufficiently large that within a few years almost all of the frailer subcohort dies. The observed hazard rate declines to the level of the hazard rate for the more robust subcohort. Since this hazard rate is increasing, the observed hazard rate then starts to increase as well: the observed hazard rate now equals the hazard rate for the more robust subcohort because only members of the more robust subcohort are still alive.
Vaupel and Yashin are describing a cohort whose mortality increases, decreases, then increases, rather than the acceleration and deceleration of such a cohort. But the vivid imagery of decline precipitated by the rapid extinction of the frail, and then rising with the mortality of the robust, may have been naturally extended by analysts from hazard slopes to higher derivatives. As we will see, the analogue of Vaupel and Yashin’s slope pattern in the third derivative of the hazard is one form of deceleration and acceleration that can occur, but only one form—and not at all the most common form in the model cohorts to be considered here.
I give the full parameters of this example, and justify their reasonableness, in Footnote 15, after I describe the simulations.
The mortality derivatives are evaluated up to age 150, by which point the frail are extinct in all cohorts, to ensure that no periods of deceleration or reacceleration are censored. However, parametric (Gompertz and logistic) models, used for specific purposes described below, are estimated on ages 50–100 to ensure comparability between the models for real and simulated cohorts (since real data do not extend to age 150).
One might be concerned that it is impossible for a cohort to be 75% frail at age 50 with reasonably-valued Gompertz subpopulations because too many frail will have died by age 50. It turns out that this is not the case. Were the subpopulation intercept and slope parameters constant from birth, this would correspond to a proportion frail at birth in the range of .750 to 1 in the final universe of simulated cohorts, with a mean value of .887 (calculations omitted; available upon request). A proportion frail of 1 is incompatible with the assumption of two subpopulations. The 37 cohorts that generate that result, given the assumption of constant lifetime subpopulation mortality parameters, are the cohorts with the lowest β (slope) and highest α (intercept) values in the simulation universe. Excluding them does not appreciably change results.
Such populations are easily imagined; for example, Lynch et al.’s (2003) study of African-Americans born 1870–1972 hypothesizes that this population, due to its extreme deprivation, was nearly homogeneously frail. Another example might be mortality data that excludes certain dimensions of extreme social stratification. For example, mortality data from apartheid-era South Africa, if not stratified by race, could be conceived of as an aggregation of a large, high-mortality Black subpopulation and a smaller, advantaged White subpopulation, as well as an intermediate Coloured subpopulation. Since frail and robust are relative categories, relevant examples are ones in which the best dichotomization of mortality risk puts most of the population into the higher-mortality group, but not necessarily ones in which most people are “frail” in some absolute sense.
Some demographic theory on mortality compression suggests that such populations—where longevity relative to one’s cohort is the exception rather than the rule—are likely to be disadvantaged populations, insofar as modern health advances have more dramatically altered mortality by raising much of the population to a higher standard length of life than by allowing the most advantaged to live ever longer (e.g., Brown et al. 2012). Thus, it may be among relatively disadvantaged populations that multiple deceleration and high-frailty deceleration may occur. (Insofar as such populations often are the least well documented empirically, it may be especially difficult to amass the data required to circumvent the parametric assumptions shown in this paper to sometimes be deeply distorting.)
On the other hand, recent work examining cross-period and cross-cohort mortality variation at a variety of ages shows that, while mortality advances reduce variation from birth, such advances may increase variation at older ages, in part because with reduced early-life mortality, more frail cohort members live to old age (Engelman et al. 2010). Thus, even if advantaged populations have fewer frail members from birth than disadvantaged populations, they may have as many or more frail members at the elderly ages in which deceleration may occur.
In short, demographic theory does not preclude models with high frailty composition at early-old ages, such as the models explored in this paper, for either disadvantaged or advantaged populations.
For a rough-and-ready sense of what extreme mortality differentiation looks like, consider sex differences in Russian mortality. Russian cohorts born 1872–1980 have an age-specific ratio of the male to female annual mortality rate ranging between .77 to 4.87, with a mean (weighted by total exposure) of 2.85. The sex ratio is increasing over time; for cohorts born beginning in 1950, the mean weighted ratio is 3.01, and when limited to ages 50–100, as in the simulations, the ratio for those modern cohorts is 3.19 (author’s calculations from Human Mortality Database data).
MATLAB code is available from the author on request.
Estimating Gompertz models (and, later, logistic models) on the simulated cohorts requires estimating discrete survivorship at each age so that the parametric estimation can be weighted by survivorship, as in real data on individuals. These discrete survivorships are estimated from the mortality functions using standard life table methods that assume constant mortality within each age interval (Preston et al. 2001: 46–47). To make palatable this assumption, which violates the assumption of Gompertz subpopulation mortality, I use age increments of only four days.
Human Mortality Database. University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany). Available at http://www.mortality.org or http://www.humanmortality.de (data downloaded on August 18, 2011). I use all cohort (vs. period) data included in the HMD.
The constraint that simulated cohorts resemble a real combination of intercept and slope is relevant because of the well-known negative correlation across cohorts between those two parameters, first noted by Strehler and Mildvan (1960) and still of great demographic interest (e.g., Finkelstein 2012; Zheng, Yang, and Land 2011).
The method used here produces a set of simulated cohorts whose parameters are similar to those of real cohorts because the bulk of the real data fall into a parallelogram-like shape. This is particularly true for simulated cohorts close to relatively recent real cohorts, where the data are less sparse. All patterns discussed in this paper occur across the range of Gompertz intercept and slope values in the HMD universe (also shown visually in Online Resource 2), including among cohorts that fall amid dense clusters of real data.
Returning now to the example cohort presented in Fig. 1 and Table 2: that cohort, defined by the parameters f=5, α=.002, and β=.103, has Gompertz intercept .009 and slope .081. It was chosen arbitrarily from among those cohorts exhibiting multiple relative decelerations whose aggregate parameters fell in a dense cluster of HMD cohorts, born in Sweden and Denmark in the mid-19th Century and in England, Wales, and Scotland in the late 19th Century.
Deceleration timing for logistic models is defined in the same way as for the nonparametric true hazards—that is, when the second or third derivative become negative—using formulas for those derivatives taken from Rau et al. (2009).
The empirical literature finds deceleration at very similar ages to those identified—sometimes erroneously—here, and reports changes in deceleration timing that are often much smaller than the potential two-decade error identified here. For example, using logistic models, Rau et al. (2009) find relative deceleration among English and Welsh women at age 93, and absolute deceleration at age 103, and Bebbington et al. (2007) find absolute deceleration among Canadian men at age 92 and women at age 96.5. It stands to reason that similar problems might occur using arctangent models due to their similarity to logistic models; using arctangent models, Lynch and Brown (2001) find absolute deceleration among white women in the U.S. at ages ranging 95–96 and white men at ages 93–95, from 1968–1992; and Lynch et al. (2003) find absolute deceleration for U.S. whites at ages ranging 93–95 and for blacks at ages 92–96 (in unadjusted data) or 101–104 (in data adjusted for potential age misreporting), from 1970–1992.
Horiuchi and Wilmoth [1998] makes a similar contribution for deceleration patterns across causes of death.
Moreover, in practice, many comparative analyses will compare the mortality of cohorts that differ in their frailty distribution at whatever age is taken as baseline. This is because population scientists often wish to compare the mortality of more advantaged and less advantaged groups to one another, and cross-national analyses show that even populations with similar life expectancy may differ considerably in their degree of heterogeneity (Edwards and Tuljapurkar 2005). The results in this paper suggest that deceleration may occur at several different points in the process of shifting from a relatively frail to an almost entirely robust surviving cohort. When the cohorts also had very different heterogeneity distributions to begin with, inferences from deceleration patterns to the patterns of heterogeneity within each cohort may be particularly problematic.
References
- Beard RE. Notes on some mathematical mortality models. In: Wolstenholme GEW, O’Connor M, editors. The Lifespan of Animals. Boston: Little, Brown; 1959. pp. 302–311. [Google Scholar]
- Beard RE. Some aspects of theories of mortality, cause of death analysis, forecasting and stochastic processes. In: Brass W, editor. Biological Aspects of Demography. London: Taylor & Francis; 1971. pp. 57–68. [Google Scholar]
- Bebbington M, Lai C-D, Zitikis R. Modeling human mortality using mixtures of bathtub shaped failure distributions. Journal of Theoretical Biology. 2007;245(3):528–538. doi: 10.1016/j.jtbi.2006.11.011. [DOI] [PubMed] [Google Scholar]
- Berkman L, Singer B, Manton K. Black/white differences in health status and mortality among the elderly. Demography. 1989;26(4):661–678. [PubMed] [Google Scholar]
- Bongaarts J. Long-range trends in adult mortality: Models and projection methods. Demography. 2005;42(1):23–49. doi: 10.1353/dem.2005.0003. [DOI] [PubMed] [Google Scholar]
- Brown DC, Hayward M, Montez JK, Hidajat MM, Hummer RA, Chiu CT. The Significance of Education for Mortality Compression in the United States. Demography. 2012;49(3):819–40. doi: 10.1007/s13524-012-0104-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carey JR, Liedo P, Orozco D, Vaupel JW. Slowing of mortality rates at older ages in large Medfly cohorts. Science. 1992;258:457–461. doi: 10.1126/science.1411540. [DOI] [PubMed] [Google Scholar]
- Costa DL. Scarring and mortality selection among Civil War POWs: A long-term mortality, morbidity, and socioeconomic follow-up. Demography. 2012;49(4):1185–1206. doi: 10.1007/s13524-012-0125-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Curtsinger JW, Fukui HH, Townsend DR, Vaupel JW. Demography of genotypes: Failure of the limited life-span paradigm in Drosophilia-Melanogaster. Science. 1992;258:461–463. doi: 10.1126/science.1411541. [DOI] [PubMed] [Google Scholar]
- Drapeau MD, Gass EK, Simison MD, Mueller LD, Rose MR. Testing the heterogeneity theory of late-life mortality plateaus by using cohorts of Drosophila melanogaster. Experimental Gerontology. 2000;35:71–84. doi: 10.1016/s0531-5565(99)00082-0. [DOI] [PubMed] [Google Scholar]
- Dupre ME, Franzese AT, Parrado EA. Religious attendance and mortality: Implications for the black-white mortality crossover. Demography. 2006;43:141–164. doi: 10.1353/dem.2006.0004. [DOI] [PubMed] [Google Scholar]
- Edwards R, Tuljapurkar S. Inequality in lifespans and a new perspective on mortality convergence across industrialized countries. Population and Development Review. 2005;31(4):645–674. [Google Scholar]
- Engelman M, Canudas-Romo V, Agree E. The implications of increased survivorship for mortality variation in aging populations. Population and Development Review. 2010;36(3):511–539. doi: 10.1111/j.1728-4457.2010.00344.x. [DOI] [PubMed] [Google Scholar]
- Finkelstein M. Discussing the Strehler-Mildvan model of mortality. Demographic Research. 2012;26(9):191–206. [Google Scholar]
- Fukui HH, Xiu L, Curtsinger JW. Slowing of age-specific mortality rates in Drosophila melanogaster. Experimental Gerontology. 1993;28(6):585–99. doi: 10.1016/0531-5565(93)90048-i. [DOI] [PubMed] [Google Scholar]
- Gampe J. Human mortality beyond age 110. In: Maier H, Gampe J, Jeune B, Robine J-M, Vaupel J, editors. Supercentenarians (Demographic Research Monographs 7) Berlin: Springer; 2010. pp. 219–230. [Google Scholar]
- Gompertz B. On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies. Philosophical transactions of the Royal Society of London. 1825;115:513–583. doi: 10.1098/rstb.2014.0379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heathcote CR, Puza BD, Roberts SP. The use of aggregate data to estimate Gompertz-type old-age mortality in heterogeneous populations. Australian & New Zealand Journal of Statistics. 2009;51:481–497. [Google Scholar]
- Horiuchi S. Postmenopausal acceleration of age-related mortality increase. Journal of Gerontology: Biological Sciences. 1997;52A:B78–B92. doi: 10.1093/gerona/52a.1.b78. [DOI] [PubMed] [Google Scholar]
- Horiuchi S, Coale AJ. Age patterns of mortality for older women: An analysis using the age-specific rate of mortality change with age. Mathematical Population Studies. 1990;2(4):245–267. doi: 10.1080/08898489009525312. [DOI] [PubMed] [Google Scholar]
- Horiuchi S, Wilmoth JR. Age patterns of the life table aging rate for major causes of death in Japan, 1951–1990. Journals of Gerontology Series a-Biological Sciences and Medical Sciences. 1997;52:B67–B77. doi: 10.1093/gerona/52a.1.b67. [DOI] [PubMed] [Google Scholar]
- Horiuchi S, Wilmoth JR. Deceleration in the age pattern of mortality at older ages. Demography. 1998;35:391–412. [PubMed] [Google Scholar]
- Kannisto V. Frailty and survival. Genus. 1992;47(3–4):101–118. [PubMed] [Google Scholar]
- Kannisto V. Measuring the compression of mortality. Demographic Research. 2000;3(6) doi: 10.4054/DemRes.2000.3.5. [DOI] [PubMed] [Google Scholar]
- Kestenbaum B. A description of the extreme aged population based on improved Medicare enrollment data. Demography. 1992;29:565–580. [PubMed] [Google Scholar]
- Kulminski AM, Ukraintseva SV, Akushevich IV, Arbeev KG, Yashin AI. Cumulative health deficiencies as a characteristic of long life. Journal of the American Geriatrics Society. 2007;55(6):935–940. doi: 10.1111/j.1532-5415.2007.01155.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lynch SM, Brown JS. Reconsidering mortality compression and deceleration: An alternative model of mortality rates. Demography. 2001;38:79–95. doi: 10.1353/dem.2001.0007. [DOI] [PubMed] [Google Scholar]
- Lynch SM, Brown JS, Harmsen KG. Black-white differences in mortality compression and deceleration and the mortality crossover reconsidered. Research on Aging. 2003;25:456–483. [Google Scholar]
- Masters RK. Uncrossing the U.S. black-white mortality crossover: The role of cohort forces in life course mortality risk. Demography. 2012;49(3):773–796. doi: 10.1007/s13524-012-0107-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Missov TI, Finkelstein MS. Admissible mixing distributions for a general class of mixture survival models with known asymptotics. Theoretical Population Biology. 2011;80(1):64–70. doi: 10.1016/j.tpb.2011.05.001. [DOI] [PubMed] [Google Scholar]
- Mueller LD, Rauser CL, Rose MR. Does aging stop? Oxford: Oxford University Press; 2011. [Google Scholar]
- Olshansky SJ. On the biodemography of aging: A review essay. Population and Development Review. 1998;24:381–393. [Google Scholar]
- Palloni A. Reproducing inequalities: Luck, wallets, and the enduring effects of childhood health. Demography. 2006;43:587–615. doi: 10.1353/dem.2006.0036. [DOI] [PubMed] [Google Scholar]
- Preston SH, Heuveline P, Guillot M. Demography: Measuring and modeling population processes. New York: Wiley-Blackwell; 2001. [Google Scholar]
- Rau R, Muszynska M, Baudisch A. At what age does mortality start to decelerate?. Paper presented at the Population Association of America; Detroit. 2009. [Google Scholar]
- Rauser CL, Abdel-Aal Y, Sheih JA, Suen CW, Mueller LD, Rose MR. Lifelong heterogeneity in fecundity is insufficient to explain late-life fecundity plateaus in Drosophila melanogaster. Experimental Gerontology. 2005;40(8–9):660–670. doi: 10.1016/j.exger.2005.06.006. [DOI] [PubMed] [Google Scholar]
- Robert SA, House JS. Socioeconomic inequalities in health: An enduring sociological problem. In: Bird CE, Conrad P, Fremont AM, editors. Handbook of Medical Sociology. 5. Upper Saddle River: Prentice Hall; 2000. pp. 79–97. [Google Scholar]
- Rodriguez G. Statistical issues in the analysis of reproductive histories using hazard models. Annals of the New York Academy of Sciences. 1994;709:266–279. doi: 10.1111/j.1749-6632.1994.tb30415.x. [DOI] [PubMed] [Google Scholar]
- Rogers RG. Mortality differentials in a diverse society. In: Denton NA, Tolnay S, editors. American Diversity: A demographic challenge for the twenty-first century. Albany: SUNY Press; 2002. pp. 129–154. [Google Scholar]
- Steinsaltz D. Re-evaluating a test of the heterogeneity explanation for mortality plateaus. Experimental Gerontology. 2005;40(1–2):101–113. doi: 10.1016/j.exger.2004.11.010. [DOI] [PubMed] [Google Scholar]
- Steinsaltz D, Evans SN. Markov mortality models: Implications of quasistationarity and initial distributions. Theoretical Population Biology. 2004;65(4):319–337. doi: 10.1016/j.tpb.2003.10.007. [DOI] [PubMed] [Google Scholar]
- Steinsaltz DR, Wachter KW. Understanding mortality rate deceleration and heterogeneity. Mathematical Population Studies. 2006;13(1):19–37. [Google Scholar]
- Strehler BL, Mildvan AS. General theory of mortality and aging. Science. 1960;132(3428):14–21. doi: 10.1126/science.132.3418.14. [DOI] [PubMed] [Google Scholar]
- Thatcher AR, Kannisto V, Vaupel JW. The force of mortality at ages 80 to 120. Odense, Denmark: Odense University Press; 1998. [Google Scholar]
- Vaupel JW. Trajectories of mortality at advanced ages. In: Wachter KW, Finch CE, editors. Between Zeus and the salmon: The biodemography of longevity. Washington, D.C: National Academy Press; 1997. pp. 17–37. [PubMed] [Google Scholar]
- Vaupel JW, Carey JR. Compositional interpretations of medfly mortality. Science. 1993;260:1666–1667. doi: 10.1126/science.8503016. [DOI] [PubMed] [Google Scholar]
- Vaupel JW, Gampe J. Supercentenarians and the theory of heterogeneity. Paper presented at the International Population Conference; Marrakech, Morocco. 2009. [Google Scholar]
- Vaupel JW, Manton KG, Stallard E. Impact of heterogeneity in individual frailty on the dynamics of mortality. Demography. 1979;16:439–454. [PubMed] [Google Scholar]
- Vaupel JW, Yashin AI. Heterogeneity’s ruses: Some surprising effects of selection on population dynamics. American Statistician. 1985;39:176–185. [PubMed] [Google Scholar]
- Vaupel JW, Zhang Z. Attrition in heterogeneous cohorts. Demographic Research. 2010;23(26):737–748. [Google Scholar]
- Wachter KW, Finch CE, editors. Between Zeus and the salmon: The biodemography of longevity. Washington, D.C: National Academy Press; 1997. [PubMed] [Google Scholar]
- Zajacova A, Goldman N, Rodriguez G. Unobserved heterogeneity can confound the effect of education on mortality. Mathematical Population Studies. 2009;16(2):153–173. doi: 10.1080/08898480902790528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng H, Yang Y, Land KC. Heterogeneity in the Strehler-Mildvan general theory of mortality and aging. Demography. 2011;48:267–90. doi: 10.1007/s13524-011-0013-8. [DOI] [PubMed] [Google Scholar]