In a stationary population, the average lifespan of the living is a length-biased life expectancy

Elizabeth Wrigley-Field; Dennis Feehan

doi:10.1215/00703370-9639692

. Author manuscript; available in PMC: 2023 Feb 1.

Published in final edited form as: Demography. 2022 Feb 1;59(1):207–220. doi: 10.1215/00703370-9639692

In a stationary population, the average lifespan of the living is a length-biased life expectancy

Elizabeth Wrigley-Field ^*, Dennis Feehan ^†

PMCID: PMC8810607 NIHMSID: NIHMS1748464 PMID: 34918737

Abstract

What is the average lifespan in a stationary population viewed at a single moment in time? Even though periods and cohorts are identical in a stationary population, we show that the answer is not life expectancy, but a length-biased version of life expectancy. That is, the distribution of lifespans of the people alive at a single moment is a self-weighted distribution of cohort lifespans, such that longer lifespans have proportionally greater representation. One implication is that, if death rates are unchanging, the average lifespan of the current population always exceeds period life expectancy. This result connects stationary population lifespan measures to a well-developed body of statistical results; provides new intuition for established demographic results; generates new insights into the relationship between periods, cohorts, and prevalent cohorts; and offers a framework for thinking about mortality selection more broadly than the concept of demographic frailty.

How long, on average, will the people who are currently alive live? It turns out that, in a stationary population, this “average lifespan of the living” (ALL) can be interpreted as a length-biased life expectancy—meaning, a transformation of life expectancy in which lifespans are re-weighted by their own values. We will show that this has several important implications—such as that period life expectancy is always less than the average lifespan of the people alive in that period—and provides intuition for a number of previous results about stationary populations.

Length bias, also known as size bias or a self-weighted distribution, occurs when units are observed in proportion to their size (e.g., Correa and Wolfson 1999, Patil 2005, Lann and Falk 2005). Demographers are likely to be most familiar with length-biased sampling in the context of Preston’s (1976) analysis of the family sizes of children vs. women. A survey that asks gestational parents how many children they have will receive one report per parent, while a survey that asks children how many children are in their family will receive one report from each one-child family, but six reports from each six-child family, and so forth (Bytheway 1974, Preston 1976, Patil and Rao 1978, Ruggles 2012, Song and Mare 2014, Song 2020). Similarly, a length-biased—or self-weighted—lifespan distribution is a distribution of lifespans that is weighted by lifespan, such that the longer-lived are more likely to be observed, in proportion to their longer lifespans. We will show that, in a stationary population, the average lifespan of the living (ALL) is such a length-biased lifespan. In what follows, we formally prove this result; provide an intuitive explanation that also clarifies relationships between the lifespans of periods, cohorts, and prevalent cohorts; offer several empirical illustrations; and discuss how this result provides intuitions and implications for other demographic results and frameworks.

1. Proof

a. Background: Length Bias

Consider a non-negative real-valued random variable $Y \geq 0$ —lifespans or family sizes, for example—with probability density function $f (y)$ and mean $E [Y] > 0$ . The length-biased distribution arising from $f (y)$ has the density:

f_{l b} (y) = \frac{y f (y)}{E [Y]}

(1)

and the mean of this length-biased distribution is:

E_{l b} [Y] = \frac{E [Y^{2}]}{E [Y]}

(2)

This length-biased mean can be re-expressed, by using the definition of the variance $σ_{Y}^{2} = E [Y^{2}] - {E [Y]}^{2}$ , as a function of the squared coefficient of variation, ${c v}_{Y}^{2} = \frac{σ_{Y}^{2}}{{E [Y]}^{2}}$ :

E_{l b} [Y] = \frac{σ_{Y}^{2} + E {[Y]}^{2}}{E [Y]} = E [Y] + \frac{σ_{Y}^{2}}{E [Y]} = E [Y] (1 + c v_{Y}^{2})

(3)

These equations describing length-biased samples are well-known in the sampling statistics literatures (e.g., Cochran 1977: 249–255, deCarvalho 2016).

b. Proof: Length-Biased Life Expectancy

We will show that, in a stationary population—one with constant birth and death rates, zero net migration at any age, and zero population growth— the living population’s distribution of age at death, denoted X, is a length-biased sample of the cohort distribution of X. Denote the number of survivors at any age $x l (x)$ , such that the cohort survivorship is $l (x) / l (0)$ and the population at any moment in time has size $\int_{0}^{ω} l (x) d x$ where $ω$ is the oldest attainable age. Let the number of deaths at each age be denoted $d (x)$ . In a stationary population, at each instant the number of births equals the number of deaths, so $l (0) = \int_{0}^{ω} d (x) d x$ . Thus, the density of ages at death is given by $f (x) = d (x) / l (0)$ . The average lifespan (life expectancy at birth) of this cohort, $e (0)$ , is:

e (0) = \frac{\int_{0}^{ω} l (x) d x}{l (0)} = \frac{\int_{0}^{ω} x d (x) d x}{\int_{0}^{ω} d (x) d x}

(4)

(where $\int_{0}^{ω} l (x) d x = \int_{0}^{ω} x d (x) d x$ because both express the total lifespan of the cohort by alternatively summing the incremental lifetime lived at each age, or the full lifespan of those dying at each age).

In this population, the lifespan distribution of the living, $f_{l} (x)$ , is a length-biased transformation of the lifespan distribution of each cohort (or period) $f (x)$ :

f_{l} (x) = \frac{x d (x)}{\int_{0}^{ω} l (x) d x} = \frac{x f (x) / l (0)}{e (0) / l (0)} = \frac{x f (x)}{E_{f (x)} [X]} .

(5)

The first line of Eq. (5) reflects that, among the people currently alive, the number of people who will die at exact age x is the product of the per-cohort deaths at age x, $d (x)$ , and the x cohorts that have future deaths at age x; and the density that will die at exact age x is the ratio of those $x d (x)$ deaths to the size of the full living population, which is $\int_{0}^{ω} l (x) d x$ . Life expectancy, e(0), is the expected age at death in the cohort, denoted $E_{f (x)} [X]$ (with the $f (x)$ subscript to underscore that this expectation applies to the cohort, not the living population). Since Eq. (5) instantiates Eq. (1), Eq. (5) establishes that the distribution of lifespans in the living population is a length-biased sample of the lifespans in the stationary population’s cohort or period.

Similarly, Eq. (5), along with Eq. (2) and (3), implies that the average lifespan of the living is:

A L L = \frac{\int_{0}^{ω} x^{2} d (x) d x}{\int_{0}^{ω} x d (x) d x} = \frac{E_{f (x)} [X^{2}]}{E_{f (x)} [X]} = e (0) + \frac{σ_{x}^{2}}{e (0)}

(6)

An alternative derivation of these expressions for the ALL, not relying on Eq. (2) and (3) but derived directly from the life table functions, is given in Appendix 1.¹

Length bias is not the only perspective in which to view these results, but in the remainder of this article, we use the length-biased interpretation of the ALL as a length-biased sample to develop new insights and connect these results to other concepts in demography and other fields.

2. Intuition

The results in Eq. 5 and 6 connect lifespan demography to well-developed areas of statistics and help to clarify other results about stationary populations, discussed below. The intuition for this result is simple. By definition, the living population dies at an age older than its current age. This has the consequence that older ages of death are disproportionately likely among the living population, compared to its original birth cohorts. This selective sample of lifespans is illustrated in Figure 1, which shows a Lexis diagram in which the living population is defined by a vertical line and their future deaths are only those deaths included in the triangle on the right of this line. In epidemiology, the upper triangle in Figure 1—representing the deaths that contribute to the lifespan of the living—is called a prevalent cohort (e.g., Törner et al. 2011). The two horizontal bands reflect two example ages at death, illustrating that older ages at death will be experienced by more currently-living cohorts than younger ages at death will. This concordance between age at death and the number of cohorts that can still reach that age at death produces length bias.

Figure 1. — The average lifespan of the living (ALL) is the average of the lifespans that end in the upper right triangle. These deaths represent the future deaths of the living population. The deaths in the left triangle reflect the past deaths of living cohorts. The horizontal bands illustrate the unequal number of cohorts that contribute deaths at different ages at death (e.g., 40 cohorts for age 40 and 60 cohorts for age 60).

The results illustrated here offer new perspective on the relationship between cohorts, periods, and prevalent cohorts. Cohort life expectancy takes everyone who starts their lives at a single moment in time and follows that unselected sample across time. Period perspectives take everyone alive in a moment, regardless of when they began, but stay within that single moment in time. Prevalent cohorts, whose lifespan is the ALL, are distinct from both the cohort and the period perspectives. Like periods, they first select people on their longevity but then, like cohorts, follow them through their own lifespans. Thus, the length bias arises because the ALL is asking a cohort question about a period slice of the data.

An alternative formulation underscores what is unique about prevalent cohorts. Cohort life expectancy and period mean age at death are the average lifespans of a birth cohort or “death cohort” (e.g., Riffe, Schöley, and Villavicencio 2017), respectively. Thus, cohort and period lifespans each constrain birth or death time (respectively) to occur at a single instant, neither earlier nor later, while leaving the time of the other vital event (death or birth) unconstrained. In a stationary population, these alternative constraints generate the same distribution. Prevalent cohorts, in contrast, impose a one-sided constraint on each of birth time and death time: the ALL summarizes the lifespans of people who were born before time $t$ and die after $t$ . This produces the triangle shape in the Lexis diagram because, for young ages at death, there are very few combinations of birth and death cohort that meet this constraint, but for old ages at death, there are very many combinations of birth and death cohort that meet this constraint.

A practical application of this result is that, if age-specific death rates are unchanging over time, the lifespan of the population alive in a period will necessarily be greater than the period life expectancy.² More broadly, other contexts that define a ‘cohort’ through cross-sectional membership in some category will produce similarly length-biased estimates of the duration of category membership. For example, epidemiological research designs that begin by selecting people with some incurable medical condition at a particular time point, and then follow those people as a cohort to see how long they live—a prevalent cohort study design—generate length-biased lifespan estimates analogous to the bias in the ALL (Zelen and Feinleib 1969). The results here underscore that demographers can conceptualize living populations and their relationship to underlying birth cohorts with the same analytical tools that epidemiologists use to represent prevalent cohort samples and their relationship to disease incidence (e.g., Alho 1992, Addona et al. 2009, Carone et al. 2012, Keiding et al. 2019). Finally, a conceptual implication—recalling the familiar family size example—is that lifespans cluster time the way that families cluster children.

3. Empirical illustration: Life expectancy, variation in the age at death, and the ALL

The expression for the ALL given in the last line of Eq. 6 generates useful insights into the relationship between the ALL, life expectancy, and variance in the age at death. It implies that the ALL is always greater than life expectancy (except in the special case that everyone dies at the same age, in which case the ALL equals life expectancy). More generally, Eq. 6 shows that, holding life expectancy fixed, a population with more variable ages at death will have an older ALL that is more divergent from life expectancy. Conversely, holding the variation in the ages at death fixed, a population with a larger life expectancy will have a smaller ALL that is more similar to life expectancy.

The relationships between life expectancy, variation in the age at death, and the ALL are illustrated by calculating the ALL implied by model life tables.³ As a simple illustration, Table 1 summarizes four such life tables, which form two pairs. The first pair are two life tables with identical life expectancy (60 years) but divergent variance (represented in the table as the standard deviation, for ease of interpretation). The one with the smaller variance (the UN’s male Far East Asian life table) has an ALL that is 8 years above its life expectancy, while the one with the larger variance (the Coale-Demeny female South life table) has an ALL that is more than 13 years above its life expectancy. The second pair are two life tables with nearly-identical variance but radically different life expectancy. In this case, the life table with the larger life expectancy (the UN’s female Latin American life table at life expectancy 67) generates an ALL that is less than 10 years above life expectancy, while the low-lifespan table (Coale and Demeny’s male North life table at life expectancy 21) generates an ALL that is more than 30 years above life expectancy. In this last population, the combination of relatively large variance and very small life expectancy generate an extreme discrepancy between the lifespan of the population alive in cross-section and the lifespan of a cohort.

Table 1.

Average lifespan of the living and its divergence from life expectancy for four model life tables. The first two life tables share a life expectancy; the second two share a variance (reported here in terms of the standard deviation) in the age at death.

Life Table Family & Gender	Life Expectancy	Standard Deviation of the Age at Death	Average Lifespan of the Living (ALL)	ALL – life expectancy divergence
UN Far East Asian, Male	60	22.14	68.17	8.17
Coale & Demeny South, Female	60	28.81	73.84	13.84
UN Latin, Female	67	25.43	76.65	9.65
Coale & Demeny North, Male	21	25.43	51.74	30.74

Open in a new tab

Figure 2 illustrates how this divergence between life expectancy and the ALL occurs by showing the age at death distribution for the living population (hollow bars) and for birth cohorts (filled bars) implied by these four model life tables. As is well-known, deaths at young ages have an outsized effect on life expectancy. Figure 2 shows that these deaths are dramatically underrepresented in the living population’s lifespans. For example, someone who dies at 1 month old would need to appear in the living population at age less than 1 month, and the population contains few people in this narrow age band. More generally, the distribution of ages at death is systematically pulled upward among the living, compared to the cohorts from which they spring.

Figure 3 takes a more global view of these relationships across the full set of UN Extended Model Life Tables. (The life tables marked with Xs are the four example life tables just discussed.) Panels A and B show that, at low levels of life expectancy, the ALL diverges widely from life expectancy, while at high levels, it diverges very little. This relationship between the ALL and life expectancy is not only driven directly by the level of life expectancy, but also by life expectancy’s historical relationship with variance in the age at death. Panel C illustrates that historical relationship for these model life tables. This relationship between life expectancy and variation in the age at death can be roughly periodized into three phases. First, the earliest increases in life expectancy (away from the levels experienced by early human societies) may also have increased variation in the age at death, relative to a context in which most deaths were during childhood. Second, in the past few centuries, large increases in life expectancy were driven by declining mortality at young ages when such deaths were less common than in early human societies, and these changes tended to substantially reduce variation in the age at death. Third, in high-longevity populations, more recent increases in longevity are concentrated at older ages and may tend to reduce variation in the age at death only a small amount, and sometimes increase it (Engelman et. al. 2010, Tuljapurkar 2010). Since the ALL will diverge from life expectancy the most when the variance is high and life expectancy is low (Eq. 6), these relationships help to explain why, as Panels A and B of Fig. 3 showed, the ALL is maximally divergent from life expectancy when life expectancy is roughly in the age range 20–40, and converges rapidly toward life expectancy once life expectancy reaches roughly age 50. Panel D of Fig. 3 shows the resulting relationship between the variance in the age at death and the ALL. Even though, fixing life expectancy, a larger variance in the age at death means a larger divergence of the ALL from life expectancy, the historical relationship between life expectancy and variance complicates the overall relationship between variance and the ALL. As life expectancy increases across these model life tables, the ALL’s divergence from life expectancy travels along the arcs shown in Panel D, from the uppermost values (representing low life expectancy contexts) then out to the right, then down toward the bottom left.

A few examples help to calibrate expectations about how much longer the living population will live, compared to birth cohorts. As Figure 3 implies, even in stationary populations, this additional lifespan can differ dramatically across populations. In a comprehensive review of hunter-gatherer societies, Gurven and Kaplan (2007) suggest that life expectancy at birth in such societies ranges from roughly 21–37; the model life tables in that range of life expectancy estimate average lifespans of the living that are 18–35 years older than life expectancy. On the other hand, for low-mortality populations, the divergence is much smaller. Recent U.S. life expectancy values are 76 for men and 81 for women. Model life tables at those values generate ALLs that are, respectively, 2.9–5.7 years and 2.1–4.3 years older than life expectancy. For hypothetical populations with life expectancy of 95 or older, in these models, the ALL diverges from life expectancy by less than one year.

4. The ALL, the average age, and the average remaining life

The last line of Eq. 6, which represented the ALL as a function of life expectancy and the variance in the cohort age at death, also provides a core intuition for a well-known, but somewhat inscrutable, formula for the mean age of a stationary population, denoted A. The formula, given by Ryder (1975: 8) and Preston et al. (2001: 112), is:

A = \frac{e (0) + \frac{σ_{X}^{2}}{e (0)}}{2}

(7)

Thus, from Eq. (6) and Eq. (7), ALL=2A: in a stationary population, the average lifespan of the living is twice the average age.

One way to understand this formula is to see that it follows from two properties of stationary populations. One is the property shown here: that the average lifespan of a stationary population’s members alive in a snapshot is a length-biased life expectancy. The second distinctive feature of stationary populations is that the distribution of lifetimes lived so far—that is, age—equals the distribution of lifetimes still to come—that is, remaining life expectancy (Müller et al. 2004, 2007; Vaupel 2009; Villavicencio and Riffe 2016).⁴ This property can be considered a form of time symmetry in stationary populations, implying that in a stationary population observed at a specific moment in time, a randomly selected individual is equally likely to be observed at any point in their lives, and on average is observed halfway through their lifespan (Kim and Aron 1989, Goldstein 2009). Thus, the average age of the population and the average remaining lifespan are each half of the ALL. In combination, these two properties imply that the average age and average remaining lifespan in a stationary population each equal one half of the length-biased life expectancy, generating Eq. 7.⁵

This perspective similarly provides intuition for a proof given by Finkelstein (2008: 268) that a stationary population with a larger life expectancy than a second stationary population need not also have a larger average remaining lifespan: thus, it is possible for $e_{1} (0) > e_{2} (0)$ while $A_{1} < A_{2}$ (where $A$ , which we used to denote mean age, also equals mean remaining lifespan). Finkelstein’s clever proof, which involves making a succession of changes to a survival curve, does not (to our mind) provide a clear intuition as to the mechanism by which a cohort with a larger life expectancy can have a smaller remaining lifespan. Given that average remaining lifespan equals average age in a stationary population, Eq. 6 and Eq. 7 clarify that this possibility will come to pass if the variation in the age at death in the second cohort is large enough to offset its smaller life expectancy. In that circumstance (and only that circumstance), a randomly-selected member of this second cohort at a single moment in time will have a longer lifespan than a randomly-selected member of the reference cohort, even though a randomly-selected member at birth will have a shorter lifespan.

5. Connections to other demographic concepts

a. Mortality Selection Without Frailty

By highlighting the inherent bias in lifespan measures that begin with a population defined by a period, the length-biased lifespan in Eq. (5) also gives a somewhat different perspective on mortality selection than demographers’ and biostatisticians’ traditional perspective based on frailty modeling. Frailty is a construct representing systematic and sustained individual differences in longevity (Manton et al. 1979, Vaupel and Yashin 1985, Vaupel and Missov 2014). It implies that, if the frailer people who die young did instead live to old age, their old-age mortality would be higher than that of the robust people who actually do survive to old age. In other words, the concept of frailty implies that there are stable differences in individuals’ expected longevity, and, therefore, differences in actual longevity select the population by continually removing those with the lowest expected longevity (Feehan and Wrigley-Field 2021: 369–371.).

Yet recent theoretical and empirical investigations (Steiner and Tuljapurkar 2012; Caswell 2014; Hartemink, Missov, and Caswell 2017) suggest that the vast bulk of heterogeneity in lifespans may be due to stochastic differences in individual luck, rather than stable frailty. The results here highlight the importance of lifespan variability in generating mortality selection, whether or not there is any such frailty in the population. Indeed, no concept of frailty is needed to generate any of the results in this article. Imagine that everyone in the population has exactly the same age-specific hazards and lifespans differ purely by chance. Even in this situation, the ALL reflects a length-biasing selection on individual differences in actual longevity—even though those differences in actual longevity do not reflect differences in individuals’ expected longevity. Viewing the population in a cross-section intrinsically means viewing individuals in proportion to their lifespan. The cross-section selects cohort members, not on frailty (expected longevity), but on actual longevity.

b. Longevity of the living population vs. longevity of living cohorts: Comparison with the Cross-Sectional Average Lifespan (CAL)

The ALL is part of “a family of mortality indicators that make use of cohort information but refer to only one period” (Guillot and Payne 2019: 418). One member of that family deserves special mention because, like the ALL, it uses cohort mortality to describe the longevity of cohorts alive during a particular period. That measure is the cross-sectional average lifespan (CAL), proposed by Brouard (1986) and developed by Guillot (2003). A comparison between the ALL and CAL is informative about both measures.

The CAL at time $t$ is defined in terms of the cohort-specific survivorship $p_{c} (x, t - x) = l_{c} (x) / l_{c} (0)$ , the survivorship of cohort $c$ at age $x$ , as:

C A L (t) = \int_{0}^{ω} p_{c} (x, t - x) d x

Thus, “ $C A L (t)$ is the sum of proportions of survivors among the various cohorts present in the population at time $t$ ” (Guillot 2003: 42). We note that this implies that the CAL summarizes lifespans using only the deaths included in the left-hand triangle of our Fig. 1 (past deaths of living cohorts), in contrast to the ALL, which uses only the deaths included in the right-hand triangle (future deaths of living people).⁶

This key contrast between the CAL and the ALL—whether past or future deaths of living cohorts comprise the lifespans that the measure summarizes—also means that the two measures describe different population units: the ALL describes the (future) longevity of living people, while the CAL describes the (past) longevity of living cohorts. That relationship, in the CAL, of living people to their current cohort survivorships—reflecting the mortality that such people did not succumb to—may or may not proxy meaningful aspects of their own lives. In many contexts, being a rare survivor of a cohort with high past mortality, compared to a survivor of a cohort with lesser mortality, might capture something quite meaningful about the survivor (such as the intensity of the disease exposures each may have endured). On the other hand, in highly unequal and segregated contexts, high past mortality concentrated in a particular (disadvantaged) subpopulation might not be a meaningful descriptor of the experiences of survivors drawn largely from a different (advantaged) subpopulation within the same national population. In contrast to most other longevity measures, the ALL describes living populations directly, rather than the cohorts into which they were born.

6. Conclusion

We have shown that, in a stationary population, the lifespans of the living population form a length-biased sample of the cohort (and period) lifespan distribution. This insight connects demographic lifespan measures with a well-developed body of statistics (e.g., Cochran 1977: 249–255, deCarvalho 2016), including many epidemiological applications (e.g., Hill et al. 2003, Asgharian et al. 2006, Törner et al. 2011), and provides new intuition for existing demographic results. It implies that, when death rates are unchanging, the average lifespan of the living always exceeds period life expectancy, and exceeds it by more years when variation in the age at death is large and life expectancy is low.

These results are relevant whenever the population of interest is the population that currently experiences a state, rather than the population that ever experiences a state. Here, we considered the state of being alive, comparing the lifespan of the living to the lifespan of a cohort. The same results would apply when characterizing the expected length of current marriages (Alho 2016), disease durations among those who currently have a disease (Zelen and Feinleib 1969), or spells of incarceration (Patterson and Preston 2008) or unemployment (Beach and Kaliski 1983) among those currently incarcerated or unemployed. Each of these applications involves what we termed “asking a cohort question about a period slice of the data,” introducing length bias.

ACKNOWLEDGEMENTS

The authors gratefully acknowledge support from the Eunice Kennedy Shriver Institute for Child Health and Human Development via the Minnesota Population Center (P2C HD041023) and Berkeley Population Center (P2C HD073964); from the National Institute of Aging via the Life Course Center for the Demography and Economics of Aging (P30 AG066613); and from the Fesler-Lampert Chair in Aging Studies at the University of Minnesota; as well as helpful comments from Felix Elwert, Michelle Niemann, James Vaupel, and several anonymous reviewers.

Appendix 1. Alternative proof that the average lifespan of the living (ALL) is a length-biased life expectancy.

Here we present an alternative proof that the ALL is a length-biased life expectancy, to complement the proof given in Equation 6 and provide additional intuition for the result. The proof begins by representing the ALL as the total lifespan of everyone alive, divided by the number of people alive:

A L L = \frac{\int_{0}^{ω} \int_{y}^{ω} x d (x) d x d y}{\int_{0}^{ω} l (x) d x} = \frac{\int_{0}^{ω} x d (x) \int_{0}^{x} d y d x}{\int_{0}^{ω} l (x) d x} = \frac{\int_{0}^{ω} x^{2} d (x) d x}{\int_{0}^{ω} l (x) d x} = \frac{\int_{0}^{ω} x^{2} d (x) d x}{\int_{0}^{ω} \int_{x}^{ω} d (y) d y d x} = \frac{\int_{0}^{ω} x^{2} d (x) d x}{\int_{0}^{ω} d (y) \int_{0}^{y} d x d y} = \frac{\int_{0}^{ω} x^{2} d (x) d x}{\int_{0}^{ω} x d (x) d x} = \frac{l (0) \int_{0}^{ω} x^{2} f (x) d x}{l (0) \int_{0}^{ω} x f (x) d x} = \frac{E_{f (x)} [X^{2}]}{E_{f (x)} [X]} .

(S1)

The expression for the total lifespan of the living, in the numerator of the right-hand side of the first line of Eq. S1, may appear inscrutable. It represents the total lifespan $x$ of the $d (x)$ people who die at age $x$ within each cohort whose current age is $y \leq x$ . The total lifespan of the living therefore equals $\int_{0}^{ω} \int_{y}^{ω} x d (x) d x d y$ . From there, the proof employs “two standard strategems in formal demography” (Vaupel 2010: 1144), translating the number of people currently living ( $\int_{0}^{ω} l (x) d x$ ) to the number of future deaths ( $\int_{0}^{ω} \int_{x}^{ω} d (y) d y d x$ ), and switching the order of integration.

Footnotes

Other derivations are also possible. For example, our Equation (6) can be recovered from a decomposition of life expectancy provided by Cohen (2015), which divides life expectancy into three components defined by any arbitrary age x: the lifespan lived by those who die before age x, the first x years lived by those living beyond x, and the remaining life years lived beyond x, each weighted by the proportion of the original birth cohort that experiences them. The ALL in a stationary population can then be understood as the survivorship-weighted sum, over all ages, of Cohen’s second and third decomposition components, representing the past and future lifespans of those who survive to each successive age. Here, our goal is simply to provide a single lens through which some key demographic lifespan measures can be related to one another.

This implication assumes that everybody does not die at the same age, as we explain below.

UN Extended Model Life Tables were downloaded from https://www.un.org/en/development/desa/population/publications/mortality/model-life-tables.asp on October 22, 2020. Specifically, we used the complete (non-abridged) life tables in one-year age increments.

⁴

Specifically, Eq. (7) draws on the fact that the average age in a stationary population equals the average expected lifespan. This result was shown in a demographic context by Kim and Aron (1989) and Goldstein (2009), with elaborated historical context in Goldstein (2012). It was broadened to the more general claim about stationary populations referenced here—that the distribution of ages equals the distribution of past lifespans—in Vaupel (2009) and is generally considered to have been independently discovered in work by Brouard (1989) [which the authors are unable to read in its original French] and Carey (Müller et al. 2004, 2007) and subsequently (Villavicencio and Riffe 2016) dubbed the Carey-Brouard inequality. An alternative proof of the broader claim is presented in Rao and Carey (2015). The details of the Rao and Carey proof are controversial (Villavicencio and Riffe 2016, Rao 2021) but the claim is not. Finally, exploration of these time symmetries has been extended to other measures in stationary populations (Riffe 2015) and modified for stable populations (Vaupel and Villavicencio 2018). The intuition we provide here, based on length bias, is only one way of gaining insight into the mean age of a stationary population based on the symmetry of past and future lifespans.

⁵

Ryder (1975: 8–11) discusses implications of Eq. (7) for the relationship between life expectancy, the coefficient in the variation in the age at death, and the mean age of a stationary population. His discussion is an intellectual forerunner to our discussion in Section 3, above.

⁶

This restriction to past deaths of living cohorts generates one of the key practical advantages of the CAL as a period longevity indicator: while it is based on genuine cohort, rather than synthetic cohort, mortality (unlike period life expectancy), it requires no data on the future (unlike cohort life expectancy or the ALL).

REFERENCES

Addona V, Asgharian M, & Wolfson DB (2009). On the incidence-prevalence relation and length-biased sampling. Canadian Journal of Statistics 37(2): 206–218. [Google Scholar]
Akman O. (2004). On lifetime estimation in the presence of length-biased sampling plan. International Journal of Reliability, Quality and Safety Engineering 11(2): 187–194. [Google Scholar]
Alho JM (1992). Prevalence, incidence, and duration in general stable populations. Biometrics 48(2): 587–592. [PubMed] [Google Scholar]
Asgharian M, Wolfson DB, & Zhang X. (2006). Checking stationarity of the incidence rate using prevalent cohort survival data. Statistics in Medicine 25: 1751–1767. [DOI] [PubMed] [Google Scholar]
Brouard N. (1986). Structure et dynamique des populations. La pyramide des annees a vivre, aspects nationaux et exemples regionaux. Espaces, Populations, Societes 2(14–15): 157–168. [Google Scholar]
Brouard N. (1989). Mouvements et Modèles de Population. Yaoundé, Cameroun: Institut de Formation et de Recherche Démographiques. [Google Scholar]
Bytheway B. (1974). A statistical trap associated with family size. Journal of Biosocial Science 6: 67–72. [DOI] [PubMed] [Google Scholar]
Carone M, Asgharian M, & Wang M-C (2012). Nonparametric incidence estimation from prevalent cohort survival data. Biometrika 99(3): 599–613. [DOI] [PMC free article] [PubMed] [Google Scholar]
Caswell H. (2014). A matrix approach to the statistics of longevity in heterogeneous frailty models. Demographic Research 31: 553–592. doi: 10.4054/DemRes.2014.31.19 [DOI] [Google Scholar]
Cochran WG (1977). Sampling Techniques (Third edition). Wiley. [Google Scholar]
Cohen JE (2015). Markov’s Inequality and Chebyshev’s Inequality for tail probabilities: A sharper image. The American Statistician 69:1, 5–7, DOI: 10.1080/00031305.2014.975842. [DOI] [Google Scholar]
Correa JA & Wolfson DB (1999). Length-bias: some characterizations and applications. Journal of Statistical Computation and Simulation 64(3): 209–219. doi: 10.1080/00949659908811977 [DOI] [Google Scholar]
de Carvalho M. (2016). Mean, What do You Mean? The American Statistician 70(3):270–274. doi: 10.1080/00031305.2016.1148632 [DOI] [Google Scholar]
Engelman M, Canudas-Romo V, and Agree EM (2010). The implications of increased survivorship for mortality variation in aging populations. Population and Development Review 36(3): 511–539. [DOI] [PubMed] [Google Scholar]
Feehan D. & Wrigley-Field E. (2021). How do populations aggregate? Demographic Research 44(15): 363–378. doi: 10.4054/DemRes.2021.44.15 [DOI] [PMC free article] [PubMed] [Google Scholar]
Finkelstein M. (2008). Failure Rate Modelling for Reliability and Risk. Springer. [Google Scholar]
Goldstein JR (2009). Life lived equals life left in stationary populations. Demographic Research 20(2): 3–6. doi: 10.4054/DemRes.2009.20.2 [DOI] [Google Scholar]
Goldstein JR (2012). Historical Addendum to ‘Life lived equals life left in stationary populations’. Demographic Research 26: 167–172. [Google Scholar]
Giullot M. (2003). The Cross-Sectional Average Length of Life (CAL): A Cross-Sectional Mortality MeasureThat Reflects the Experience of Cohorts. Population Studies 57(1): 41–54. [Google Scholar]
Guillot M. & Payne CF (2019). Tracking progress in mean longevity: The Lagged Cohort Life Expectancy (LCLE) approach. Population Studies 73(3): 405–421. [DOI] [PubMed] [Google Scholar]
Hartemink N, Missov TI, and Caswell H. (2017). Stochasticity, heterogeneity, and variance in longevity in human populations. Theoretical Population Biology 114: 107–116. doi: 10.1016/j.tpb.2017.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
Hill G, Connelly J, Hébert R, Lindsay J, & Millar W. (2003). Neyman’s bias revisited. Journal of Clinical Epidemiology 56: 293–296. [DOI] [PubMed] [Google Scholar]
Keiding N, Albertsen KL, Rytgaard HC, Sørensen AL (2019). Prevalent cohort studies and unobserved heterogeneity. Lifetime Data Analysis 25: 712–738. [DOI] [PubMed] [Google Scholar]
Kim YJ and Aron JL (1989). On the equality of average age and average expectation of remaining life in a stationary population. SIAM Review 31(1): 110–113. [Google Scholar]
Lann A. and Falk R. (2005). A closer look at a relatively neglected mean. Teaching Statistics 27(3): 76–80. [Google Scholar]
Lann A. and Falk R. (2006). Tell me the method, I’ll give you the mean. The American Statistician 60(4): 322–327. [Google Scholar]
Madsen RW (1981). Making students aware of bias. Teaching Statistics 3(1): 2–5. [Google Scholar]
Müller H, Wang JL, Carey JR, Caswell-Chen E, Chen C, Papadoupoulos N, and Yao F. (2004). Demographic window to aging in the wild: Constructing life tables and estimating survival functions from marked individuals of unknown age. Aging Cell 3(3): 125–131. doi: 10.1111/j.1474-9728.2004.00096.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Müller H, Wang J-L, Yu W, Delaigle A, and Carey J. (2007). Survival in the wild via residual demography. Theoretical Population Biology 72(4): 513–522. doi: 10.1016/j.tpb.2007.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Patil GP (2005). Weighted distributions. In Armitage P. and Colton T. Encyclopedia of Biostatistics, 2^nd ed. Wiley and Sons. Pp. 5725–5729. doi: 10.1002/0470011815.b2a15176 [DOI] [Google Scholar]
Patil GP and Rao CR (1978). Weighted Distributions and Size-Biased Sampling with Applications to Wildlife Populations and Human Families. Biometrics 34(2): 179–189. [Google Scholar]
Preston SH (1976). Family sizes of children and family sizes of women. Demography 13(1): 105–114. [PubMed] [Google Scholar]
Preston SH, Heuveline P, and Guillot M. (2001). Demography: Measuring and Modeling Population Processes. Blackwell. [Google Scholar]
Rao ASRS, and Carey JR (2015). Generalization of Carey’s equality and a theorem on stationary population. Journal of Mathematical Biology 71(3): 583–594. doi: 10.1007/s00285-014-0831-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
Rao ASRS (2021). Clarifications to “Generalization of Carey’s equality and a theorem on stationary population.” Unpublished manuscript available online at https://drive.google.com/file/d/1w61qu9yDbopbChYLE8Q9yemw-sztLFLO/view?usp=sharing. Downloaded on May 19, 2021.
Riffe T. (2015). The force of mortality by life lived is the force of increment by life left in stationary populations. Demographic Research 32: 827–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
Riffe T, Schöley J, & Villavicencio F. (2017). A unified framework of demographic time. Genus 73:7, 1–24. doi: 10.1186/s41118-017-0024-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
Ruggles S. (2012). The future of historical family demography. Annual Review of Sociology 38: 18.1–18.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
Song X. and Mare R. (2014). Prospective versus retrospective approaches to the study of intergenerational mobility. Sociological Methods and Research 1–30. doi: 10.1177/0049124114554460 [DOI] [PMC free article] [PubMed] [Google Scholar]
Song X. (2021). Song, Xi. 2021. Multigenerational Social Mobility: A Demographic Approach. Sociological Methodology, 51(1):1–43. doi: 10.1177/0081175020973054 [DOI] [PMC free article] [PubMed] [Google Scholar]
Steiner UK and Tuljapurkar S. (2012). Neutral theory for life histories and individual variability in fitness components. Proceedings of the National Academy of Sciences 109(12): 4684–4689. doi: 10.1073/pnas.1018096109 [DOI] [PMC free article] [PubMed] [Google Scholar]
Törner A, Dickman P, Duberg A-S, Kristinsson S, Landgren O, Björkholm M, and Svensson Å (2011). A method to visualize and adjust for selection bias in prevalent cohort studies. American Journal of Epidemiology 174(8): 969–976. [DOI] [PubMed] [Google Scholar]
Tuljapurkar S. (2010). The final inequality: Variance in the age at death. In Shoven JB, ed., Demography and the Economy, pp. 209–221. University of Chicago Press. [Google Scholar]
Vaupel JW (2009). Life lived and left: Carey’s equality. Demographic Research 20(3): 7–10. doi: 10.4054/DemRes.2009.20.3 [DOI] [Google Scholar]
Vaupel JW and Villavicencio F. (2018). Life lived and left: Estimating age-specific survival in stable populations with unknown ages. Demographic Research 37: 991–1008. [Google Scholar]
Villavicencio F. and Riffe T. (2016). Symmetries between life lived and left in finite stationary populations. Demographic Research 35: 381–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zelen M. and Feinleib M. (1989). On the theory of screening for chronic diseases. Biometrika 56: 601–614. [Google Scholar]

Reference to Appendix 1

Vaupel JW (2010). Total incremental change with age equals average lifetime change. Demographic Research 22(36): 1143–1148. DOI: 10.4054/DemRes.2010.22.36 [DOI] [Google Scholar]

[R1] Addona V, Asgharian M, & Wolfson DB (2009). On the incidence-prevalence relation and length-biased sampling. Canadian Journal of Statistics 37(2): 206–218. [Google Scholar]

[R2] Akman O. (2004). On lifetime estimation in the presence of length-biased sampling plan. International Journal of Reliability, Quality and Safety Engineering 11(2): 187–194. [Google Scholar]

[R3] Alho JM (1992). Prevalence, incidence, and duration in general stable populations. Biometrics 48(2): 587–592. [PubMed] [Google Scholar]

[R4] Asgharian M, Wolfson DB, & Zhang X. (2006). Checking stationarity of the incidence rate using prevalent cohort survival data. Statistics in Medicine 25: 1751–1767. [DOI] [PubMed] [Google Scholar]

[R5] Brouard N. (1986). Structure et dynamique des populations. La pyramide des annees a vivre, aspects nationaux et exemples regionaux. Espaces, Populations, Societes 2(14–15): 157–168. [Google Scholar]

[R6] Brouard N. (1989). Mouvements et Modèles de Population. Yaoundé, Cameroun: Institut de Formation et de Recherche Démographiques. [Google Scholar]

[R7] Bytheway B. (1974). A statistical trap associated with family size. Journal of Biosocial Science 6: 67–72. [DOI] [PubMed] [Google Scholar]

[R8] Carone M, Asgharian M, & Wang M-C (2012). Nonparametric incidence estimation from prevalent cohort survival data. Biometrika 99(3): 599–613. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Caswell H. (2014). A matrix approach to the statistics of longevity in heterogeneous frailty models. Demographic Research 31: 553–592. doi: 10.4054/DemRes.2014.31.19 [DOI] [Google Scholar]

[R10] Cochran WG (1977). Sampling Techniques (Third edition). Wiley. [Google Scholar]

[R11] Cohen JE (2015). Markov’s Inequality and Chebyshev’s Inequality for tail probabilities: A sharper image. The American Statistician 69:1, 5–7, DOI: 10.1080/00031305.2014.975842. [DOI] [Google Scholar]

[R12] Correa JA & Wolfson DB (1999). Length-bias: some characterizations and applications. Journal of Statistical Computation and Simulation 64(3): 209–219. doi: 10.1080/00949659908811977 [DOI] [Google Scholar]

[R13] de Carvalho M. (2016). Mean, What do You Mean? The American Statistician 70(3):270–274. doi: 10.1080/00031305.2016.1148632 [DOI] [Google Scholar]

[R14] Engelman M, Canudas-Romo V, and Agree EM (2010). The implications of increased survivorship for mortality variation in aging populations. Population and Development Review 36(3): 511–539. [DOI] [PubMed] [Google Scholar]

[R15] Feehan D. & Wrigley-Field E. (2021). How do populations aggregate? Demographic Research 44(15): 363–378. doi: 10.4054/DemRes.2021.44.15 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Finkelstein M. (2008). Failure Rate Modelling for Reliability and Risk. Springer. [Google Scholar]

[R17] Goldstein JR (2009). Life lived equals life left in stationary populations. Demographic Research 20(2): 3–6. doi: 10.4054/DemRes.2009.20.2 [DOI] [Google Scholar]

[R18] Goldstein JR (2012). Historical Addendum to ‘Life lived equals life left in stationary populations’. Demographic Research 26: 167–172. [Google Scholar]

[R19] Giullot M. (2003). The Cross-Sectional Average Length of Life (CAL): A Cross-Sectional Mortality MeasureThat Reflects the Experience of Cohorts. Population Studies 57(1): 41–54. [Google Scholar]

[R20] Guillot M. & Payne CF (2019). Tracking progress in mean longevity: The Lagged Cohort Life Expectancy (LCLE) approach. Population Studies 73(3): 405–421. [DOI] [PubMed] [Google Scholar]

[R21] Hartemink N, Missov TI, and Caswell H. (2017). Stochasticity, heterogeneity, and variance in longevity in human populations. Theoretical Population Biology 114: 107–116. doi: 10.1016/j.tpb.2017.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Hill G, Connelly J, Hébert R, Lindsay J, & Millar W. (2003). Neyman’s bias revisited. Journal of Clinical Epidemiology 56: 293–296. [DOI] [PubMed] [Google Scholar]

[R23] Keiding N, Albertsen KL, Rytgaard HC, Sørensen AL (2019). Prevalent cohort studies and unobserved heterogeneity. Lifetime Data Analysis 25: 712–738. [DOI] [PubMed] [Google Scholar]

[R24] Kim YJ and Aron JL (1989). On the equality of average age and average expectation of remaining life in a stationary population. SIAM Review 31(1): 110–113. [Google Scholar]

[R25] Lann A. and Falk R. (2005). A closer look at a relatively neglected mean. Teaching Statistics 27(3): 76–80. [Google Scholar]

[R26] Lann A. and Falk R. (2006). Tell me the method, I’ll give you the mean. The American Statistician 60(4): 322–327. [Google Scholar]

[R27] Madsen RW (1981). Making students aware of bias. Teaching Statistics 3(1): 2–5. [Google Scholar]

[R28] Müller H, Wang JL, Carey JR, Caswell-Chen E, Chen C, Papadoupoulos N, and Yao F. (2004). Demographic window to aging in the wild: Constructing life tables and estimating survival functions from marked individuals of unknown age. Aging Cell 3(3): 125–131. doi: 10.1111/j.1474-9728.2004.00096.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Müller H, Wang J-L, Yu W, Delaigle A, and Carey J. (2007). Survival in the wild via residual demography. Theoretical Population Biology 72(4): 513–522. doi: 10.1016/j.tpb.2007.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] Patil GP (2005). Weighted distributions. In Armitage P. and Colton T. Encyclopedia of Biostatistics, 2^nd ed. Wiley and Sons. Pp. 5725–5729. doi: 10.1002/0470011815.b2a15176 [DOI] [Google Scholar]

[R31] Patil GP and Rao CR (1978). Weighted Distributions and Size-Biased Sampling with Applications to Wildlife Populations and Human Families. Biometrics 34(2): 179–189. [Google Scholar]

[R32] Preston SH (1976). Family sizes of children and family sizes of women. Demography 13(1): 105–114. [PubMed] [Google Scholar]

[R33] Preston SH, Heuveline P, and Guillot M. (2001). Demography: Measuring and Modeling Population Processes. Blackwell. [Google Scholar]

[R34] Rao ASRS, and Carey JR (2015). Generalization of Carey’s equality and a theorem on stationary population. Journal of Mathematical Biology 71(3): 583–594. doi: 10.1007/s00285-014-0831-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] Rao ASRS (2021). Clarifications to “Generalization of Carey’s equality and a theorem on stationary population.” Unpublished manuscript available online at https://drive.google.com/file/d/1w61qu9yDbopbChYLE8Q9yemw-sztLFLO/view?usp=sharing. Downloaded on May 19, 2021.

[R36] Riffe T. (2015). The force of mortality by life lived is the force of increment by life left in stationary populations. Demographic Research 32: 827–834. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] Riffe T, Schöley J, & Villavicencio F. (2017). A unified framework of demographic time. Genus 73:7, 1–24. doi: 10.1186/s41118-017-0024-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] Ruggles S. (2012). The future of historical family demography. Annual Review of Sociology 38: 18.1–18.19. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] Song X. and Mare R. (2014). Prospective versus retrospective approaches to the study of intergenerational mobility. Sociological Methods and Research 1–30. doi: 10.1177/0049124114554460 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] Song X. (2021). Song, Xi. 2021. Multigenerational Social Mobility: A Demographic Approach. Sociological Methodology, 51(1):1–43. doi: 10.1177/0081175020973054 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] Steiner UK and Tuljapurkar S. (2012). Neutral theory for life histories and individual variability in fitness components. Proceedings of the National Academy of Sciences 109(12): 4684–4689. doi: 10.1073/pnas.1018096109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] Törner A, Dickman P, Duberg A-S, Kristinsson S, Landgren O, Björkholm M, and Svensson Å (2011). A method to visualize and adjust for selection bias in prevalent cohort studies. American Journal of Epidemiology 174(8): 969–976. [DOI] [PubMed] [Google Scholar]

[R43] Tuljapurkar S. (2010). The final inequality: Variance in the age at death. In Shoven JB, ed., Demography and the Economy, pp. 209–221. University of Chicago Press. [Google Scholar]

[R44] Vaupel JW (2009). Life lived and left: Carey’s equality. Demographic Research 20(3): 7–10. doi: 10.4054/DemRes.2009.20.3 [DOI] [Google Scholar]

[R45] Vaupel JW and Villavicencio F. (2018). Life lived and left: Estimating age-specific survival in stable populations with unknown ages. Demographic Research 37: 991–1008. [Google Scholar]

[R46] Villavicencio F. and Riffe T. (2016). Symmetries between life lived and left in finite stationary populations. Demographic Research 35: 381–398. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] Zelen M. and Feinleib M. (1989). On the theory of screening for chronic diseases. Biometrika 56: 601–614. [Google Scholar]

PERMALINK

In a stationary population, the average lifespan of the living is a length-biased life expectancy

Elizabeth Wrigley-Field

Dennis Feehan

Abstract