Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Aug 12.
Published in final edited form as: J Am Stat Assoc. 2007 Fall;102(480):1199–1211. doi: 10.1198/016214507000000040

On the Estimation of Disability-Free Life Expectancy: Sullivan’ Method and Its Extension

Kosuke Imai 1,2, Samir Soneji 1,2
PMCID: PMC4533834  NIHMSID: NIHMS507506  PMID: 26279593

Abstract

A rapidly aging population, such as the United States today, is characterized by the increased prevalence of chronic impairment. Robust estimation of disability-free life expectancy (DFLE), or healthy life expectancy, is essential for examining whether additional years of life are spent in good health and whether life expectancy is increasing faster than the decline of disability rates. Over 30 years since its publication, Sullivan’s method remains the most widely used method to estimate DFLE. Therefore, it is surprising to note that Sullivan did not provide any formal justification of his method. Debates in the literature have centered around the properties of Sullivan’s method and have yielded conflicting results regarding the assumptions required for Sullivan’s method. In this article we establish a statistical foundation of Sullivan’s method. We prove that, under stationarity assumptions, Sullivan’s estimator is unbiased and consistent. This resolves the debate in the literature, which has generally concluded that additional assumptions are necessary. We also show that the standard variance estimator is consistent and approximately unbiased. Finally, we demonstrate that Sullivan’s method can be extended to estimate DFLE without stationarity assumptions. Such an extension is possible whenever a cohort life table and either consecutive cross-sectional disability surveys or a longitudinal survey are available. Our empirical analysis of the 1907 and 1912 U.S. birth cohorts suggests that while mortality rates remain approximately stationary, disability rates decline during this time period.

Keywords: Aggregate data, Demography, Healthy life expectancy, Life tables, Morbidity, Mortality, Stationarity

1. INTRODUCTION

A rapidly aging population, such as the United States today, is characterized by the increased prevalence of chronic impairment, which is especially pronounced among the elderly. Robust estimation of disability-free life expectancy (DFLE), or healthy life expectancy, is essential for examining whether additional years of life are spent in good health and whether life expectancy is increasing faster than the decline of disability rates. DFLE represents the expected number of years of remaining disability-free life a member of the life table cohort would experience if cohort age-specific rates of mortality and disability prevailed throughout his/her lifetime.

In his seminal article, Sullivan (1971) developed a method for combining mortality and morbidity rates into a single summary measure of a population’s health status. Over 30 years after its publication, Sullivan’s method remains the most widely used method among applied researchers and government officials for estimating DFLE. The key idea of the method is to combine the period life table, which is the main method to calculate life expectancy, with the age-specific disability prevalence estimated from cross-sectional survey data. In particular, Sullivan’s method simply partitions the total number of person-year lived, which is obtained from the period life table, into the disability and disability-free life expectancy based on the proportion disabled, which is in turn measured from the cross-sectional disability survey.

Sullivan’s method is of prime methodological importance in the ongoing exploration of morbidity (e.g., Crimmins, Saito, and Ingeneri 1989, 1997). It has been used extensively to estimate DFLE in various populations (e.g., Iburg, Bronnum-Hansen, and Bjerregaard 2001) as well as differences in DFLE by socioeconomic status (e.g., Sihvonen, Kunst, Lahelma, Valkonen, and Mackenbach 1998; Molla, Madans, and Wagener 2004), educational levels (e.g., Minicuci 2005), occupational groups (e.g., Bronnum-Hansen 2000; Cambois, Robine, and Hayward 2001), and between time periods (e.g., Crimmins et al. 1989; Bronnum-Hansen, Andersen, Kjoller, and Rasmussen 2004; Graham, Blakely, Davis, Sporle, and Pearce 2004). The method has also been used to estimate the burden of disease from chronic conditions such as diabetes (e.g., Manuel and Schultz 2004) and the contribution of specific diseases to educational disparities in DFLE (e.g., Nusselder et al. 2005). Nusselder and Looman (2004) used Sullivan’s method to analyze the contribution of various causes of death and disability to differences healthy life expectancy among populations and over time. Murray and Lopez (1996) compared the disability-adjusted life expectancy, a quantity closely related to DFLE, across world regions using Sullivan’s method as part of their Global Burden of Disease Study. In addition to academic researchers, a number of governments and international health organizations employ Sullivan’s method. The U.S. National Center for Health Statistics uses Sullivan’s method as part of the Healthy People 2010 Study (Molla, Madans, Wagener, and Crimmins 2003). The Australian Institute of Health and Welfare used Sullivan’s method in its 1996 Burden of Disease Study (Mathers, Vos, Stevenson, and Begg 2001). The World Health Organization also used the method to estimate disability-adjusted life expectancy for 191 member states (Musgrove et al. 2000).

Robust estimation of DFLE is also vital to the theoretical understanding of morbidity. The existing research on DFLE of various populations and time periods has often reached contradictory conclusions regarding the competing nature of mortality and morbidity. For example, Gruenberg (1977) and Kramer (1980) argued that the decline in mortality rates only reflects a decline in the mortality rate of chronic diseases rather than a decline in their incidence. Greater life expectancy will result in more severe chronic diseases. Fries (1980), on the other hand, argued for the compression of morbidity. If the onset of the chronic condition can be postponed and adult life expectancy is relatively constant, morbidity will be compressed into a shorter period of time. Moreover, Manton (1982) offered an alternative theory of dynamic equilibrium in which the decline in mortality leads to an increase in the prevalence of milder chronic diseases. Therefore, the accurate estimation of DFLE is essential for the empirical evaluation of these competing theories.

Given the importance and popularity of Sullivan’s method, it is surprising to note that Sullivan (1971) did not provide any formal justification of his method. Debates in the literature have centered around the properties of Sullivan’s method and have yielded conflicting results regarding the assumptions required for Sullivan’s method to provide a valid estimate of DFLE (e.g., Rogers, Rogers, and Belanger 1990; Mathers 1991; Barendregt, Bonneux, and Van der Maas 1994, 1995; Van de Water, Boshuizen, Perenboom, Mathers, and Robine 1995). In this article, we establish a statistical foundation of Sullivan’s method. After defining DFLE and reviewing the period life table method in Section 2, we prove in Section 3 that under stationarity assumptions, Sullivan’s estimator is consistent and unbiased and that the standard variance estimator is consistent and approximately unbiased. Our theoretical results imply that Sullivan’s method requires no additional assumption other than stationarity assumptions. This resolves the debate in the literature, which has generally concluded that additional assumptions are necessary (e.g., Newman 1988; Palloni, Guillen, Monteverde, Ayuso, and White 2005). Our theoretical results appear to be consistent with the existing simulation and empirical studies that investigate the performance of Sullivan’s method under various conditions (e.g., Barendregt et al. 1994; Mathers and Robine 1997; Lievre, Brouard, and Heathcote 2003).

In Section 4 we demonstrate that Sullivan’s method can be extended to estimate DFLE without stationarity assumptions. Such an extension is possible whenever a cohort life table and either consecutive cross-sectional disability surveys or a longitudinal survey are available. The multistate life table method is a popular alternative in the literature (e.g., Rogers, Rogers, and Branch 1989a; Robine et al. 1995), but it requires longitudinal data as well as a number of theoretical and practical assumptions. In contrast, our extension of Sullivan’s method allows for unbiased and consistent estimation of DFLE without adhering to the assumptions and data requirement of the multistate life table method. The computer code that implements the methods proposed in this article is available at the first author’s website. In Section 5 we apply our proposed methodology to estimate DFLE for the 1907 and 1912 U.S. birth cohorts using mortality data from U.S. Vital Statistics and disability data from the U.S. Medicare Current Beneficiary Survey. Our empirical analysis suggests that while mortality rates remained approximately stationary, disability rates may have declined during this time period. Finally, in Section 6 we present conclusions.

2. DISABILITY–FREE LIFE EXPECTANCY AND PERIOD LIFE TABLE

Sullivan’s method utilizes the mortality data from a period life table and the disability prevalence data from a crosssectional survey. In this section, we introduce the standard notation of the period life table used in the field of demography (e.g., Chiang 1984; Preston, Heuveline, and Guillot 2001) and define the estimand (i.e., DFLE).

2.1 Theoretical Definitions of Life Expectancy and DFLE

Theoretically, mortality for a cohort (either real or hypothetical) can be considered as a continuous-time process, which is determined solely by the hazard function, μ(x,y), representing the instantaneous rate of death at age x ∈ [0,∞) for a cohort born at time y. Given the hazard function, one can derive life expectancy at age x for this cohort, denoted by e(x, y), which represents the expected remaining life of an individual at age x who is born at time y. Let l(0, y) represent the total number alive at age 0 for this cohort. Then, the number of survivors at age x is given by

l(x,y)=l(0,y)exp[0xμ(t,y)dt]. (1)

If l(0, y) is set to 1, as we do for the remainder of this article, l(x, y) corresponds to the survival function of this cohort. Given the survival function l(x, y), life expectancy at age x can be written as

e(x,y)=1l(x,y)xl(t,y)dt. (2)

Similarly, one can define DFLE, denoted by eDF(x, y), which represents the expected remaining disability-free (DF) life of an individual age x born at time y. Let π(x, y) be the proportion disabled at exact age x for the cohort born at time y. In other words, π(x, y) represents the conditional probability that an individual of this cohort is disabled at age x given that he/she survived up to age x. Because the proportion of survivors who are disability-free at age x is equal to [1 − π(x, y)]l(x, y), DFLE is given by

eDF(x,y)=1l(x,y)x[1π(t,y)]l(t,y)dt. (3)

2.2 Calculation of Life Expectancy From a Period Life Table

Although the theoretical definition of life expectancy is given within the continuous-time framework, the data are typically recorded in a discrete form. A period life table is a common source of discrete data and is often analyzed in order to approximate the continuous-time mortality process. Sullivan’s method also requires the use of a period life table. A main purpose of a period life table is to calculate the life expectancy of a hypothetical cohort that experiences the currently observed cross-sectional mortality rates.

Let 𝚨 be a set of the starting ages for the age intervals of a period life table. We use ω to denote the starting age of the oldest age interval. Let nx represent the width (years) of an age interval starting at age x ∈ 𝚨. Typically, the width of age intervals is the same for all but the oldest age interval [ω, ∞), that is nx = n for all x ∈ 𝚨 \{ω} and nω = ∞. When n = 1, a period life table is said to be unabridged, whereas it is called abridged if n > 1. In this article we allow for a more general setting where each age interval may have a different width.

A period life table is created by first observing the midinterval population, denoted by nxPx, and the total number of deaths, denoted by nxDx, for each interval [x, x + nx). Then, the observed mortality rate for each interval, denoted by nxMx, is calculated as nxDx/nxPx. Keeping with the standard demographic notation, we use prescripts to indicate the width of the interval under consideration. A period life table relies on the following stationarity assumptions of the population (e.g., Chiang 1984; Preston et al. 2001):

  1. The age-specific hazard rate is constant over time, that is, μ(x, y) = μ(x) for all y.

  2. The birth rate is constant over time.

  3. The net migration rates are 0 at all ages.

The assumptions imply that the survival function is also constant over time, that is, l(x, y) = l(x), and that the crude death rate, that is, x𝒜nxDxx𝒜nxPx, equals the crude birth rate, that is, Bx𝒜nxPx where B is the total number of births to members of the population in the period. Therefore, the total size of the hypothetical cohort is assumed to remain constant over time. Another important consequence of stationarity assumptions is that the age distribution of the hypothetical cohort in any given interval, [x, x + nx), is constant over time and is proportional to the survival function. Formally, for all s ∈ [x, x + nx), the age distribution is defined by the following density function:

l(s)xx+nxl(t)dt. (4)

For example, a common departure from stationarity occurs in many developing countries today, where annual births have been growing relative to deaths. As we see in Section 5, a violation of the stationarity assumptions is also possible in developed countries where the death rates are declining due to the advance of medical technologies.

Because nxPx and nxDx are directly obtained from the Census data and vital statistics, they are typically large. Thus, in the literature, the sampling variability about the mortality rate of the hypothetical cohort, denoted by nxmx, is considered to be small and, hence, typically ignored. That is, nxMx is assumed to equal nx mx, which is given by

nxmx=xx+nxl(t)μ(t)dtxx+nxl(t)dt (5)

for all x𝚨. Nevertheless, we later show how our results apply to the case where nx mx is consistently estimated by nx Mx (see Sec. 3.4).

Furthermore, it can be shown that the conditional probability of death within an interval [x, x + nx) given that an individual of the hypothetical cohort survived up to age x, which is denoted by nx qx, is equal to nx nx mx/[1 + (nxnxax)nx mx], where nxax represents the average person-years lived in a given interval [x, x + n) among those who are alive at age x but die within the interval. The values of nxax are obtained from complete life tables and used in subsequent calculations as a known quantity (e.g., Molla, Wagener, and Madans 2001; Preston et al. 2001).

Within this framework, the total number of person-years lived in an interval, [x, x + nx), is given by

nxLx=nxlx+nx+lxnxqxnxax, (6)

where the members of the lx+nx proportion who survive the entire interval each contribute nx years, and the members of the lx nxqx proportion who die in the interval contribute nxax years, on average. Finally, life expectancy at age x is equal to the total number of person-years for subsequent age intervals:

ex=1lxi𝒜xniLi, (7)

where 𝚨x = {i𝚨 : xi}. Under stationarity assumptions for the unbounded last age interval [ω, ∞), life expectancy at age ω is equal to the inverse of the death rate, that is, eω=mω1. The equality follows from the fact that all those alive at age ω must die in the interval, that is, = 1.

We now show that under the stationarity assumptions discussed previously, ex, which is the life expectancy calculated from a period life table in (7), equals e(x), which is the theoretical definition of life expectancy given in (2). Although in common demographic notation, l(x) is used in continuous notation and lx in discrete, both refer to the proportion alive at exact age x and, hence, are numerically identical. Given the hazard function, μ(x), the conditional probability of death for an age interval, [x, x + nx), is equal to the number of deaths in an age interval divided by the proportion alive at the beginning of the age interval:

nxqx=xx+nxl(t)μ(t)dtl(x). (8)

Next, the average number of years lived in an interval among those who die in the interval is equal to the total number of person-years lived among those who will die divided by the proportion who will die in the interval:

nxax=xx+nxl(t)μ(t)(tx)dtxx+nxl(t)μ(t)dt. (9)

Substituting (8) and (9) into (6) and integrating it by parts yield

nxLx=xx+nxl(t)dt. (10)

Therefore, it follows that ex equals e(x).

Table 1 shows the 1999 U.S. unabridged period life table for selected ages (Arias 2002). In keeping with common demographic notation, the left prescript, n = 1, is not written. The radix, l0, is set at 1 so that lx represents the survival probability. At age 20 years, 98.6% of the hypothetical life table cohort remains. From age 20 to ω = 85, the remaining 98.6% of the cohort will live i=2085Li=54.9 person-years. Hence, a 20-year-old member of the hypothetical cohort will live, on average, 55.9 years given he/she experiences the prevailing period age-specific conditional probabilities of death. For the last age group, a85 = e85 because everyone who is alive at age ω = 85 dies within the last interval.

Table 1.

The 1999 U.S. period life table and life expectancy for selected ages

Age lx qx ax Lx ex
20 .986 .001 .506 .986 55.851
25 .982 .001 .500 .981 51.101
30 .977 .001 .495 .976 46.338
35 .971 .001 .500 .970 41.597
40 .963 .002 .500 .962 36.916
45 .952 .003 .500 .950 32.323
50 .935 .004 .500 .933 27.853
55 .911 .007 .499 .908 23.516
60 .875 .011 .501 .870 19.391
65 .820 .016 .500 .813 15.499
70 .743 .025 .500 .733 11.839
75 .638 .038 .500 .626 8.356
80 .505 .059 .500 .490 4.873
85+ .345 1.000 .951 .328 .951

NOTE: The unabridged period life table (nx = 1 for all x𝚨 \ {85}) is created from the conditional probability of death, qx, and the average person-years lived in the age interval by those dying in the interval, ax. lx is the proportion of survivors at age x, whereas Lx represents the total number of person-years lived within the age interval [x, x + 1) for those who were alive at age x. The last age interval is [85, ∞). The final column gives the life expectancy ex at each age.

3. STATISTICAL FOUNDATION OF SULLIVAN’ METHOD

Unlike life expectancy, DFLE cannot be estimated from a period life table alone without obtaining additional information about disability prevalence. Sullivan (1971) proposed a measure of DFLE by combining mortality information from a period life table and disability information from a cross-sectional disability survey. However, he did not offer any formal justification of his method. In this section, we provide a statistical foundation of Sullivan’s method by deriving the assumptions under which Sullivan’s method yields a valid estimate of DFLE. We also discuss how our theoretical findings relate to the previous investigations of Sullivan’s method in the literature.

3.1 Sullivan’s Method

Sullivan’s method estimates DFLE by partitioning the person-years lived in a given age interval into the proportion with and without disability. Formally, Sullivan’s estimator is defined by

e^xDF=1lxi𝒜x(1niπ^i)niLi, (11)

where niπ^i is the sample fraction of the disabled survey respondents within the age interval [i, i + ni). If simple random sampling is used for the disability survey, niπ^i is given by

niπ^i=1niNij=1niNiYij(tij), (12)

where niNi represents the total number of survey respondents in the age interval, [i, i + ni) and Yij(tij) is the disability indicator variable for the jth respondent of that interval whose age is tij ∈ [i, i + ni) at the time of survey. Depending on one’s sampling scheme, niπ^i may be computed as a weighted average with appropriate sampling weights. In this article, for notational simplicity, we assume simple random sampling, but all the results can be easily generalized to other sampling schemes. Note that in the original article Sullivan (1971) proposed and applied an invalid estimator of disability prevalence, which is different from the estimator of (12) used by subsequent researchers (see Sec. 3.3).

In the literature, the standard way to obtain the variance of Sullivan’s estimator is to assume that the total number of the disabled within each age interval, x, x + nx, follows an independent binomial process with a constant probability, which is estimated by nxπ^x (e.g., Mathers 1991; Montpellier 1997; Molla et al. 2001). Given this distributional assumption, the variance of Sullivan’s estimator can be estimated by

σ^xDF=1lx2i𝒜xniπ^i(1niπ^i)niLi2niNi, (13)

where the sampling weights can be incorporated by calculating niπ^i as a weighted average.

We illustrate Sullivan’s method with the 1999 U.S. period life table of Table 1. We estimate the disability prevalence from the 1999 National Health Interview Survey (NHIS) and the 1999 National Nursing Home Survey (NNHS), both of which are conducted by U.S. Department of Health and Human Services. The NHIS is a multipurpose health survey conducted by the National Center for Health Statistics and is the principal source of information on the health of the civilian, noninstitutionalized population of the United States, which included a sample of 97,059 in 1999. The NNHS is a survey of the residents of nursing homes and related care facilities in the United States also conducted by the NCHS with 8,215 observations in 1999. The use of the two surveys gives a complete picture of disability prevalence among the noninstitutionalized and institutionalized populations.

The second column of Table 2 shows the estimated disability prevalence in each interval, π^x, where the sampling weights are incorporated so that respondents from the two surveys are appropriately weighted. Following the literature (e.g., Crimmins et al. 1997; Crimmins and Saito 2001; Molla et al. 2004), a respondent was considered disabled if he/she responded affirmatively to the following question: “Because of a physical, mental, or emotional problem, do you need the help of other persons with personal care needs, such as…?” where “…” represents various activities of daily living (ADL). ADL includes bathing and showering, dressing, eating, getting in/out of bed or chair, using the toilet, and getting around in home. The third column of Table 2 presents the point estimates of DFLE based on Sullivan’s method, while the next four columns show their 95% confidence intervals. The “nxmx known” 95% confidence intervals (fourth and fifth columns) are based on the standard variance in (13). The “nxmx estimated” 95% confidence intervals (sixth and seventh columns) account for additional variability in the estimation of nxmx based on the large-sample variance of êDF in (A.6) and is discussed further in Section 3.4.

Table 2.

Estimated DFLE based on Sullivan’s method for selected ages of the 1999 U.S. population

Age Estimated
disability
prevalence
Estimated
DFLE
nxmx known
95% CI
nxmx estimated
95% CI
Lower Upper Lower Upper
20 .01 54.47 54.40 54.53 54.37 54.56
25 .00 49.73 49.67 49.79 49.64 49.82
30 .00 44.98 44.91 45.04 44.89 45.07
35 .00 40.25 40.19 40.32 40.16 40.34
40 .01 35.59 35.53 35.66 35.51 35.68
45 .01 31.02 30.96 31.09 30.94 31.10
50 .01 26.58 26.52 26.64 26.50 26.66
55 .02 22.26 22.19 22.32 22.18 22.34
60 .01 18.17 18.11 18.24 18.10 18.25
65 .03 14.31 14.24 14.37 14.24 14.38
70 .05 10.70 10.64 10.77 10.64 10.77
75 .08 7.30 7.24 7.37 7.24 7.37
80 .15 4.05 3.99 4.11 3.99 4.11
85 .20 .77 .75 .78 .74 .79

NOTE: The first column shows the estimated disability prevalence using the sample (weighted) averages from the 1999 National Health Interview Survey and the 1999 National Nursing Home Survey. The definition of disability is based on activities of daily living. The remaining five columns present the point estimates of DFLE and their 95% confidence intervals using Sullivan’s method, without and with accounting for the sampling variability in the estimation of nxmx.

3.2 Assumptions and Statistical Properties of Sullivan’s Method

Sullivan’s method inherits three stationarity assumptions discussed in Section 2.2 because it relies on a period life table. The following proposition shows that the stationarity of age-specific disability prevalence is the only additional assumption required for Sullivan’s estimator to be unbiased and consistent for DFLE. The stationarity assumption about the disability prevalence is needed for the exact same reason as the other stationarity assumptions. It guarantees that cross-sectional data from different cohorts can be used to infer the age-specific disability prevalence of a hypothetical cohort.

Proposition 1

Suppose that three stationary assumptions of period life tables hold. In addition, suppose that the age-specific disability prevalence is constant over time, that is, π(x, y = π(x) for all y. Then, Sullivan’s method estimates DFLE without bias, that is, E(e^xDF)=eDF(x), and is also consistent, that is e^xDFpexDF for exDF for all x ∈ 𝚨.

A proof is given in the Appendix. The result does not depend on the interval widths and, hence, applies to both abridged and unabridged period life tables. It also does not require researchers to know the exact age of survey respondents, so long as one knows the age interval to which they belong.

Next, we show that under the same stationarity assumptions, the standard variance estimator of (13) is consistent and approximately unbiased. Furthermore, these properties do not depend on the assumption of the constant disability prevalence in each interval, which is unlikely to hold when the age interval is wide, as in many applications of Sullivan’s method to abridged period life tables. Indeed, no additional assumption is required for the functional form of π(x).

Proposition 2

Under the four stationarity assumptions of Proposition 1, the variance of Sullivan’s estimator is given by

σxDF=1lx2i𝒜xnipi(1nipi)niLi2niNi,

where niPi=ii+niπ(t)l(t)dtii+nil(t)dt. The standard variance estimator of (13) is consistent and approximately unbiased for σxDF for all x𝚨.

A proof is given in the Appendix. As is the case of Proposition 1, the statistical validity of Sullivan’s method does not depend on the way the age is partitioned into intervals. The proof of Proposition 2 can also be used to derive the following alternative variance estimator that is both (exactly) unbiased and consistent:

1lx2i𝒜xniLi2j=1niNi(Yij(tij)niπ^i)2niNi(niNi1). (14)

3.3 Discussion

Proposition 1 shows that four stationarity assumptions are sufficient to establish the unbiasedness and consistency of Sullivan’s estimator, whereas Proposition 2 shows that, under these assumptions, the standard variance estimator is consistent and approximately unbiased. Because three out of four stationarity assumptions are needed for any analysis based on period life tables, the only additional assumption required is the stationarity of disability prevalence. This assumption may be reasonable once the stationarity of mortality rates is invoked if morbidity and mortality are closely related in a given population. However, it is also possible that mortality rates remain approximately stationary while disability rates depart from stationarity (see Sec. 5).

In his original article, Sullivan (1971) briefly touched on the requirement of mortality and disability stationarity, which corresponds to the assumptions in Propositions 1 and 2,

This expectation of disability [DFLE] can be interpreted as the number of years of disability a member of the life table cohort would experience if current agespecific rates of mortality and disability prevailed throughout the cohort’s lifetime (p. 351).

Sullivan did not, however, give a formal proof of this stationarity requirement. Our results establish both finite-sample and large-sample statistical properties of Sullivan’s method and, consequently, provide a formal justification of his claim.

Furthermore, Sullivan incorrectly suggested the use of the following estimator of disability prevalence, which was also used in his own application:

nxπ^x=1nxNxj=1nxNxWij(tij)365, (15)

where Wij(tij) is the self-reported number of days the jth respondent of the ith age interval spent in disability during the previous year. The proof of Proposition 1 in the Appendix implies that the equality, xx+nxW(t)365l(t)dt=xx+nxπ(t)l(t)dt, must hold for each x𝚨 in order for nxπ^x given in (15) to estimate DFLE without bias, where W(t) represents the population average days spent in disability during the past year at age t. The equality is unlikely to hold because π(t) measures the disability prevalence at age t, while W(t) corresponds to the disability prevalence over the one-year period. Fortunately, with the exception of few studies (e.g., Newman 1988), most applications of Sullivan’s method used the valid disability prevalence estimator of (12) rather than that of (15).

Since the publication of Sullivan (1971), the literature has generated controversies over Sullivan’s method. For example, Rogers et al. (1990) argued that Sullivan’s method underestimates DFLE because of bias in the estimation of disability prevalence. Yet, Mathers (1991) argued the observed bias actually results from a nonstationary population that has not reached equilibrium associated with the given transition rates. Another controversy has taken place over the question of whether Sullivan’s method allows for transitions from disabled status to healthy status (e.g., Barendregt et al. 1994, 1995; Van de Water et al. 1995; Laditka and Hayward 2003). However, the arguments presented in these studies are informal and are often based on simulation studies rather than theoretical results.

The lack of formal results has prompted some theoretical investigations about the required assumptions of Sullivan’s method. Nevertheless, these studies yielded conflicting results on what, if any, additional assumptions are required for Sullivan’s method. Some have argued that even with a stationary population, Sullivan’s method requires additional strong assumptions about the probability of the transitions between healthy and disabled status. For example, Newman (1988) argued that if the probability of transition from disabled state to healthy state (i.e., the “recovery” probability) is large, Sullivan’s method provides a consistent estimator of DFLE. Conversely, Palloni et al. (2005) maintained that Sullivan’s method assumes this recovery probability to be negligible and further argued that Sullivan’s method requires another assumption about the homogeneity in the mortality risks of the disabled and healthy populations.

In contrast, Propositions 1 and 2 imply that Sullivan’s method does not make any assumption about the hazard function and disability prevalence rate other than that they must be stationary. They also show that Sullivan’s method does not make any assumption regarding the homogeneity of mortality risk between the healthy and disabled populations. Note that the stationarity of transition probabilities implies that of prevalence rates, but the latter do not necessarily imply the former. For example, although perhaps rare, it is possible that more disabled people are dying at a certain age over (calendar) time but more people are transitioning into the disabled state over time such that the proportion of the disabled among those who are alive remains constant.

In addition to formal investigations, numerous simulation and empirical studies have been conducted to examine the performance of Sullivan’s method under various conditions. These studies in general confirm our theoretical results. For example, Mathers and Robine (1997) found that, under stationarity assumptions, Sullivan’s method provides a consistent estimator of DFLE. A number of studies also suggest that Sullivan’s method performs poorly when the assumption of stationarity is grossly violated, especially with respect to disability prevalence (e.g., Barendregt et al. 1994; Mathers and Robine 1997; Lievre et al. 2003).

3.4 Incorporating Additional Uncertainty About nxmx

As briefly mentioned in Section 2.2, the standard use of Sullivan’s method assumes stationarity and ignores the sampling variability about nxmx by simply setting nxmx = nxMx. Here, we discuss how Propositions 1 and 2 extend to the situation where nxmx is unknown but consistently estimated by nxMx. First, although e^xDF is no longer an unbiased estimate of eDF(x), the former is still consistent for the latter because l(x) can be consistently estimated by lx and, hence, xx+nxl(t)dt can also be consistently estimated by nxLx.

Second, because the standard variance estimator of (13) ignores this additional source of uncertainty, it generally leads to underestimation of the true variance. In the Appendix we derive the large-sample variance of e^xDF, which incorporates the uncertainty about nxmx, and show that this variance can be consistently estimated. The sixth and seventh columns of Table 2 present the 95% confidence intervals based on the large-sample variance of (A.6). These confidence intervals are slightly wider than the confidence intervals based on the standard variance estimator of (13). For example, at age 50, allowing for additional variability about the estimation of nxmx, the 95% confidence widens by .04 years from (26.52, 26.64) to (26.50, 26.66).

4. ESTIMATION OF DFLE WITHOUT STATIONARITY ASSUMPTIONS

The assumption of stationary mortality and disability required for Sullivan’s method may be tenuous, especially in developed countries over the 20th century where mortality rates for the oldest ages have declined. Therefore, it is often of interest to estimate DFLE without stationarity assumptions. A popular approach in the literature has been the multistate life table method, which models the transition probabilities among the healthy state, disabled state, and death (e.g., Rogers et al. 1989b, 1990). This approach, however, requires a large-scale longitudinal disability survey, which is rarely available. Moreover, as discussed later, the multistate life table method makes assumptions about the transition probabilities that are often strong and untestable.

In this section we show that if DFLE is the quantity of interest, Sullivan’s method can be used to estimate DFLE without stationarity and other assumptions by using a cohort life table. The unbiased estimation of DFLE is also possible with consecutive cross-sectional disability surveys, which are often easier to obtain than longitudinal data.

4.1 Cohort Life Table

A cohort life table describes the mortality experience of a real cohort of individuals from birth of the first to death of the last member of the group (e.g., Chiang 1984). An important advantage of cohort life tables over period life tables is that the three assumptions of stationarity discussed in Section 2.2 are not invoked. A main purpose of cohort life tables is to calculate the life expectancy of a real birth cohort using cohort-specific birth and mortality rates for each age. Cohort life tables are created by first observing the midinterval population of the cohort born in year y, denoted by nxPx,y, and the total number of deaths in this cohort, denoted by nxDx,y, for each interval [x, x + nx). Other quantities such as lx,y and nxLx,y are defined analogously as done for period life tables. Calculation of cohort life expectancy also follows the methods identical to the ones described in Section 2.2.

There are two common ways to close a cohort life table. First, an ideal way is to observe the birth cohort until the last member dies. If a birth cohort cannot be observed until the last member dies, as is often the case, then the last age interval, [ω, ∞) is unbounded, and an assumption must be made about the hazard function within the last age interval. For example, Horiuchi and Coale (1982) derived an expression for eω by assuming that the proportion of the last age group relative to the overall population remains constant. Another commonly invoked assumption is the stationarity of mortality in the last age group, which will yield negligible error if the proportion of the birth cohort alive at age ω is sufficiently small (e.g., Wilmoth, Andreev, Jdanov, and Glei 2005).

4.2 Estimation of Cohort DFLE Using Sullivan’s Method

Next, we show that cohort DFLE can be estimated using Sullivan’s method. Specifically, Sullivan’s estimator of DFLE for the cohort born in year y is defined by

e^x,yDF=1lx,yi𝒜x(1niπ^i,y)niLi,y, (16)

where niπ^i,y is the sample fraction of the disabled survey respondents within the age interval [i, i + ni) for the cohort born in year y. Thus, nxπ^x,y can be computed for each x either from consecutive cross-sectional surveys, which follow the cohort born in year y, or from a longitudinal survey, which follows the same individuals of that cohort over time. Proposition 1 applies directly to Sullivan’s estimator of (16) except that stationarity assumptions are no longer necessary. The variance of this estimator can be calculated in the exact same way as before, and Proposition 2 also holds without stationarity assumptions. The use of Census data and vital statistics implies that sampling variability about nxmx can be ignored because a cohort life table directly summarizes the cohort of interest rather than a random sample from a hypothetical cohort. Thus, Sullivan’s method, if applied to a cohort life table and either consecutive cross-national disability surveys or longitudinal data, requires no assumption.

When consecutive cross-sectional surveys are available, it is possible to model π(x, y) as a function of y by assuming that the disability prevalence does not experience a sudden change of large magnitude across different cohorts. One may then estimate π(x, y), for example, using the generalized additive models (GAMs) (Hastie and Tibshirani 1990) or a random-walk model similar to the approach used by Lee and Carter (1992). Borrowing the information across cohorts in this way may increase the efficiency of estimation (see also Sec. 5).

So far, we have assumed that disability surveys cover all the age intervals. However, it is possible that the starting age of the oldest age group surveyed for disability prevalence, denoted by ω* ∈ 𝚨, is less than the starting age of the last age interval for the cohort life table, that is, ω* < ω. In this case, we can bound DFLE by considering the maximum and minimum values of the contribution of disability-free person-years within the last age intervals that are not covered by disability surveys. Because disability prevalence is bounded between 0 and 1, the bounds for disability-free person-years for these intervals are given by

0ω+nω[1π(t,y)]l(t,y)dtLω+nω,y. (17)

The preceding bounds do not impose any assumption, but may not be informative in practice.

In order to further narrow the bounds, we entertain a monotonicity assumption regarding the nature of disability for older ages. In particular, we may assume that the disability prevalence of a given birth cohort in the last interval of interest, [ω* + nω*, ∞), is greater than or equal to the average disability prevalence of the previous age interval, [ω*, ω* + nω*). Formally, we assume

π(t,y)E[π(s,y)]=ωω+nωπ(s,y)l(s,y)dsωω+nωl(s,y)ds, (18)

for all t ∈ [ω* + nω*, ∞). Then, the new bounds for disability free person-years in the last interval are given by

0ω+nω[1π(t,y)]l(t,y)dt{1E[π(s,y)]}Lω+nω,y, (19)

which are more informative than those in (17). Indeed, the new upper bound is almost always more informative. The proof of Proposition 1 in Appendix 1 shows that nωπ^ω,y is an unbiased and consistent estimator of E[π(s, y)], and, hence, the new upper bound can be estimated without bias and consistently.

Similarly, if disability surveys do not cover earlier age intervals, one can obtain the bounds of DFLE using the monotonicity assumption. Suppose that disability surveys start at age x and we wish to bound the total number of disability-free years for the preceding age interval, [xnx−, x), with some nx− > 0, where nx− indicates the length of the interval ending at age x. The bounds are given by

{1E[π(s,y)]}nxLxnxnxx[1π(t,y)]l(t,y)dtnxLxnx,y, (20)

where s ∈ [x, x + nx) and the expectation is taken with respect to s and over the region, [x, x + nx). We can use nxπ^x,y to estimate the lower bound without bias and consistently.

4.3 Confidence Intervals for the Bounds

When the bounds in (17) do not involve nxπ^x,y, they can be estimated without sampling variability. This implies that the upper (lower) confidence bound for the bounds of the DFLE equals the usual upper (lower) confidence band separately obtained for the upper (lower) bound of the DFLE based on its estimated variance. The resulting confidence interval covers the true bounds with exact (finite sample) probability.

Under the monotonicity assumption, the estimated upper bound in (19) and the lower bound in (20) have sampling variability. Thus, if we estimate the confidence interval for the bounds of DFLE in the same way as before, the coverage probability of the resulting confidence intervals can be greater than its nominal level, yielding wider confidence intervals than necessary. Formally, let BL and BU be true lower and upper bounds of DFLE. Then, applying the Bonferroni inequality, we see that Pr([BL,BU][B^αL,B^αU])Pr(BLB^αL)+Pr(BUB^αU)1=1α, where B^αU and B^αL represent the estimated lower and upper (1 − α) confidence intervals and are found such that Pr(BLB^αL)=Pr(BUB^αU)=1α2.

It is possible to obtain the balanced confidence interval for the bounds with asymptotically exact coverage probability (e.g., Cheng and Small 2006). Beran (1988) provided such a method based on the bootstrap method. To apply the method, we choose c~αL=F^L1[F^1(1α)] and c~αU=F^U1[F^1(1α)], where F^L and F^U are the empirical distribution functions of B~LB^L and B^UB~U, and F^ is the empirical distribution functions of max{F^L(B~LB^L),F^U(B^UB~U)}. The resulting confidence interval, [B^Lc~αL,B^U+c~αU], asymptotically covers the true bounds by the fixed probability 1 − α. Moreover, these confidence intervals are balanced in a sense that they treat upper and lower bounds fairly; that is, Pr(B^Lc~αLBL)=Pr(B^U+c~αUBU) hold asymptotically. In contrast, the bootstrap confidence intervals proposed by Horowitz and Manski (2000) have asymptotically exact coverage probability but are not balanced.

4.4 Comparison With the Multistate Life Table Method

The multistate life table method is another popular approach to estimating the DFLE in the literature. Here, we compare this alternative method with the proposed extension of Sullivan’s method described previously. The multistate life table method models transitions among different states over age by assuming the continuous-time first-order Markov process (e.g., Land and Rogers 1982; Schoen 1988). Newman (1988) and Rogers et al. (1989b) were among the first to apply the multistate life table method to estimate DFLE. These authors modeled the transitions of individuals of a specific cohort among nonabsorbing states (e.g., disabled and disability-free) and an absorbing state (e.g., death). The assumption of the first-order Markov process implies that all individuals of the life table cohort who are found in a given state at the same age will have the same transition probabilities regardless of their previous paths. Some researchers have raised a concern that this assumption may be tenuous because past history of disability is likely to affect the probability of future disability (e.g., Nour and Suchindran 1983; Laditka and Wolf 1998).

The multistate life table method is similar to the period life table method discussed in Section 2.2, but is based on transition probabilities, τ(ij)(x, x + t), which represent the probability that a person in state i at age x is in state j at exact age x + t for t > 0. The estimation of these transition probabilities requires the availability of longitudinal data. Ordinarily, researchers estimate transition probabilities using either sample fractions (e.g., Rogers et al. 1990; Crimmins, Hayward, and Saito 1994) or parametric models (e.g., Mathers and Robine 1997). Given τ(ij)(x, x + t), one can recursively define the proportion of survivors at age x who are in state i, which we denote by lx(i), as lx+t(i)=lx(i)+jiτ(ji)(x,x+t)lx(j)jiτ(ij)(x,x+t)lx(i) for t < 0.

Next, let nxLx(i) represent the number of person-years spent in state i in a given age interval, [x, x + nx), that is, formally, nxLx(i)=xx+nxl(i)(t)dt. To estimate this quantity, researchers must make assumptions about the average number of personyears spent in each state for the interval given that a person starts in state j at age x and ends up in state k at age x + nx for all j and k (e.g., Land and Rogers 1982; Schoen 1988). There are four common methods to estimate this quantity within a given age interval. They are based on the assumption that within each age interval, the survival functions are linear (e.g., Schoen 1975; Mathers 1991; Crimmins et al. 1994), quadratic (e.g., Schoen 1979), exponential (e.g., Krishnamoorthy 1979), or cubic (e.g., Schoen and Nelson 1974; Schoen and Urton 1979). Finally, the expected number of remaining years spent in state i can be computed by ex(i)=j𝒜xnjLj(i)lx, where lx represents the survival function evaluated at age x as before.

In addition to these theoretical assumptions, typical applications of the multistate life table method invoke several other assumptions. First, the stationarity of age-specific mortality rates is often assumed because of limited observation of a birth cohort (e.g., Rogers et al. 1989b, 1990; Crimmins et al. 1994; Albarran, Ayuso, Guillen, and Monteverde 2005). While Mathers and Robine (1997) did not assume the stationarity of mortality rates, the lack of historical records forced the same researchers to make the stationarity assumption about disability rates.

Second, because of limited data availability, the vast majority of studies estimate transition probabilities from just a handful of panels from longitudinal disability surveys (e.g., Newman 1988; Rogers et al. 1989a, b, 1990; Crimmins et al. 1994; Albarran et al. 2005). Consequently, these studies assume the stationarity of transition probabilities beyond the period covered by longitudinal surveys. Even in the United States, few nationally representative and long-term longitudinal studies of disability exist, due to the inherent difficulty in following the same cohort of individuals during a long period of time. A notable exception is the National Long Term Care Survey, but its panel waves are five years apart. In contrast, nationally representative cross-sectional studies are conducted every year in the United States, including the National Health Interview Survey, the American Community Survey, the Medicare Current Beneficiary Survey, and the National Health and Nutrition Examination Survey. As shown in Section 5, Sullivan’s method can exploit the availability of these large-scale consecutive crosssectional disability surveys and estimate DFLE without stationarity and other assumptions.

Finally, Davis, Heathcote, and O’Neil (2001) described the estimation of cohort DFLE using the notation of multistate life tables. However, as the authors correctly pointed out, it is not possible to estimate transition probabilities from consecutive cross-sectional surveys. Instead, Davis et al. (2001) proposed a method that is similar to the one described in Section 4.2 by estimating the marginal probabilities of various states of health and death. Unlike the method proposed in this article, however, the approach of Davis et al. (2001) requires numerical integration and the calculation of standard errors is more complex.

In sum, the multistate life table method provides valuable information about transitions among different states and, hence, allows researchers to conduct a richer analysis of mortality and morbidity than Sullivan’s method. However, the method also requires a number of assumptions about transition probabilities and the functional form of hazard function, none of which is necessary for Sullivan’s method. Numerous studies investigate the performance of the multistate life table method when its required assumptions are violated (Hoem and Jensen 1982; Nour and Suchindran 1984; Liu, Liang, Jow-Ching, and Whitelaw 1997; Manton and Land 2000; Schoen 2001; Yi, Danan, and Land 2004). If DFLE is the quantity of interest, Sullivan’s method yields a valid estimate of DFLE with minimal assumptions and data requirement. If quantities other than DFLE are of interest, on the other hand, the multistate life table method may be useful.

5. AN EMPIRICAL ANALYSIS OF THE 1907 AND 1912 U.S. BIRTH COHORTS

In this section we apply our extension of Sullivan’s method to the 1907 and 1912 U.S. birth cohorts. We also compare cohort DFLE estimated from the 1907 birth cohort with period DFLE estimated from the 1991 to 2002 U.S. populations. Our analysis is based on the mortality rates of the 1907 and 1912 birth cohorts and the cross-sectional mortality rates from 1991 to 2002, all of which are obtained from the Human Mortality Database and are based on annual U.S. vital statistics. The data were downloaded on April 1, 2006, from http://www.mortality.org, the website maintained by the University of California, Berkeley, and the Max Planck Institute for Demographic Research.

We estimate the disability prevalence using consecutive cross-sectional surveys. In particular, we use the 1991 Medicare Current Beneficiary Survey (MCBS) [available through the Inter-university Consortium for Political and Social Research (ICPSR)], 1992 and 1993 MCBS Access to Care (available through the ICPSR), and 1994 to 2002 MCBS Cost and Use (available through the U.S. Department of Health and Human Services). The MCBS is a continuous, multipurpose survey of a representative national sample of the Medicare population, which includes both the noninstutitionalized and the institutionalized populations, and is conducted by the Centers for Medicare and Medicaid Services. Medicare is the largest health insurance program in the United States, which covered over 95% of the U.S. population age 65 and older between the years 1991 and 2002 (DeNavas-Walt, Proctor, and Hill Lee 2005). Thus, the use of this survey gives us a complete picture of disability status for the overall U.S. population for each year. We use the MCBS rather than the NHIS and NNHS discussed in Section 3.1 because while the NHIS is administered annually from 1991 to 2002, the NNHS was only administered in 1995, 1997, and 1999 during the period of interest.

As in Section 3.1, a respondent was considered disabled if he or she reported at least one activity of daily living limitation. In all the analyses presented in this section, the survey weights are incorporated so that respondents are appropriately weighted according to their population size. We estimate disability prevalence for the 1907 (1912) birth cohort from age 81 (76) to 90 (90) using the first MCBS survey in 1991 to the 2002 MCBS. We estimate DFLE for ages 81 (76) to 83 (78) using the monotonicity assumption and estimate the bounds shown in (20). For the sake of a comparison between the 1907 and 1912 birth cohorts, we close both cohort life tables by assuming a stationary population at age ω = 90 and beyond. We use only the first 7 years of mortality and disability (1991–1997) for the 1907 birth cohort, while all 12 years of the data are used for the 1912 birth cohort.

The upper left (right) panel of Figure 1 shows the estimated bounds of DFLE for the 1907 (1912) birth cohort along with life expectancy. Based on the mortality experience of the 1907 birth cohort, for example, individuals who are alive at age 85 lived, on average, 7.20 years, and they spent between 1.30 and 3.06 years without disability. The lower left (right) panel of the figure shows another quantity of interest, the estimated bounds of the proportion of remaining life spent disability-free, e^xDFex for the 1907 (1912) birth cohort. For example, at age 85, members of the 1907 birth cohort are expected to spend between 18.0% and 42.5% of their remaining life without disability. The estimated proportion decreases gradually with age. For both DFLE and the proportion, the 95% balanced confidence intervals are estimated using the bootstrap procedure alluded to in Section 4.3 with 10,000 replications.

Figure 1.

Figure 1

Estimated bounds of disability-free life expectancy (DFLE) and proportion of life spent disability-free for the 1907 and 1912 U.S. birth cohorts. The upper left (right) panel shows the estimated bounds of DFLE along with life expectancy for the 1907 (1912) birth cohort from age 81 (76) to 90 (90). The lower panels show the estimated bounds of the proportion of remaining life spent disability-free. The 95% confidence intervals are shown as dashed lines. For the 1907 (1912) birth cohort, DFLE and proportion of remaining life spent disability-free are calculated using the monotonicity assumption for ages from 81 (76) to 83 (78).

The comparison of the two birth cohorts shows that while life expectancy increased slightly over the two birth cohorts, the estimated proportion of remaining life spent disability-free does not show clear differences between the two cohorts due to the wide confidence intervals. For example, life expectancy at age 85 increases by .28 years between the two birth cohorts. Yet, the 95% balanced confidence intervals of DFLE overlap significantly, that is, (1.30, 3.06) for the 1907 birth cohort and (1.76, 3.82) for the 1912 birth cohort. Consequently, the 95% balanced confidence interval of the proportion of remaining life spent disability-free for the 1907 birth cohort (.18, .42) overlaps with that of the 1912 birth cohort (.24, .51). The wide confidence intervals are in part due to the fact that the mortality and disability data are available only up to 90 years of age for the 1912 birth cohort (i.e., year 2002). If the additional years of data become available in the future, the bounds may become substantially narrower and exhibit statistically significant differences between the two cohorts.

As described in Section 4.2, we also use a model-based adjustment and estimate π(x,y) as a smooth function of y. In particular, we modeled π(x,y) using the GAM with binomial family and logistic link. The estimated DFLE based on GAM is between .05 (.10) and .30 (.29) years smaller for the 1907 (1912) birth cohort than those based on the observed disability prevalence. The 95% balanced confidence intervals of DFLE are also slightly narrower for both birth cohorts using GAM. For example, at age 93 for the 1907 birth cohorts, the confidence interval for DFLE was (.18, 1.62) using sample weighted averages and (.19, 1.51) using GAM.

For the purpose of comparison, we also estimate life expectancy, DFLE, and the proportion of remaining life spent disability-free for the 1991 hypothetical period cohort using the standard Sullivan method and compare the results with those of the 1907 birth cohort. In this case, we use the full mortality and disability information available, 1991 to 2002, corresponding to ages 84 to 95. We begin our analysis at age 84, the age of the 1907 birth cohort in 1991, and examine the differences between the period and cohort estimates for subsequent ages. The upper panel of Figure 2 plots the difference between the 1991 period and 1907 birth cohort life expectancy, that is, 1991 period estimate minus 1907 cohort estimate. Age-specific life expectancy is nearly identical for the birth cohort than for the hypothetical period cohort from age 84 to 95, which indicates virtually stationary age-specific mortality rates.

Figure 2.

Figure 2

Comparison between the 1991 period and 1907 U.S. birth cohorts. The upper, middle, and lower panels compare the difference in life expectancy, DFLE, and the proportion or remaining life spent disability-free (i.e., 1991 period minus 1907 birth cohort). The estimated bounds of the differences are shown. The 95% balanced confidence intervals are calculated from the bootstrap method and shown as dashed lines.

The middle panel plots the estimated bounds for the same differences for DFLE. The 1907 birth cohort DFLE is significantly higher than the 1991 period cohort DFLE from age 84 (the age of the 1907 birth cohort in 1991) to 88 as seen from the fact that the 95% confidence intervals for the bounds of the differences do not contain 0 in this range. Given the near stationarity of mortality rates, this nonstationarity of DFLE is possible only with the nonstationarity of age-specific disability rates. The lower panel plot shows that for the proportion of remaining life spent disability-free is also significantly higher for the birth cohort than the hypothetical period cohort from age 84 to 88. The evidence shows that while mortality rates remain approximately stationary, disability rates may have declined during this time period.

Indeed, as shown in Figure 3, mortality rates remained nearly stationary while disability rates did not between the 1991 period and 1907 birth cohorts. The upper panel plots 1991 period and 1907 birth cohort mortality rates for ages 81 to 95. Mortality rates for this age range are approximately stationary as shown by the equality of period and cohort rates. As shown in the lower panel of Figure 3, however, age-specific disability rates are nonstationary and decrease over time. The disability rate for each age of the hypothetical cohort in the 1991 period is uniformly greater than the corresponding disability rate experienced by the birth cohort of 1907.

Figure 3.

Figure 3

Comparison of the 1991 period cohort with the 1907 birth cohort. The left panel compares the 1991 period cohort mortality rate (vertical axis) with the 1907 birth cohort mortality rate (horizontal axis) from age 81 to 95. The right panel compares the 1991 period disability prevalence (vertical axis) with the 1907 birth cohort disability prevalence (horizontal axis) from age 84 to 95. The 45° solid line represents equality.

6. CONCLUDING REMARKS

Robust estimation of DFLE is vital to testing the competing theories of morbidity and mortality. Over the last 30 years, Sullivan’s method has been the most widely used method for estimating DFLE. Academic researchers and government officials use this method because of the relative ease of obtaining mortality data from a period life table and disability data from a cross-sectional survey. Yet, the statistical properties of Sullivan’s method have not been understood. In this article we establish a statistical foundation of Sullivan’s method by proving that, under stationarity assumptions, Sullivan’s estimator is unbiased and consistent and the standard variance estimator is also consistent and approximately unbiased. Our results resolve methodological debates in the literature, which generally concluded that Sullivan’s method requires additional assumptions.

Finally, we show how to extend Sullivan’s method to cohort life tables so that the valid estimation of DFLE is possible without stationarity assumptions. When compared with the multistate life table method, which is popular among applied researchers, this extension of Sullivan’s method avoids the strong assumptions about transition probabilities and can be applied even with consecutive cross-sectional disability surveys, which are often easier to obtain than a longitudinal survey. While the multistate life table method allows for a richer analysis of morbidity and mortality, if DFLE is the quantity of interest, Sullivan’s method is easier to implement and can estimate DFLE with fewer assumptions and less stringent data requirement. Our empirical analysis of the 1907 and 1912 U.S. birth cohorts suggests that while mortality rates remain approximately stationary, disability rates may have declined during this time period.

Acknowledgments

We thank Andrew Gelman, Kenneth Land, Scott Lynch, Daniel Rabinowitz, Germán Rodríguez, Burton Singer, Elizabeth Stuart, James Trussell, and seminar participants at Columbia University (Departments of Political Science and Statistics), University of Pennsylvania (Department of Statistics), and Princeton University (Office of Population Research) for helpful comments. Comments from two anonymous reviewers and the associate editor also significantly improved this article. Bobray Bordelon of Princeton University Library and Tami Swenson of Research Data Assistance Center at the University of Minnesota helped us acquire the data analyzed in this article. Finally, we acknowledge financial support from the National Science Foundation (SES-0550873), Princeton University Library, and Princeton University Committee on Research in the Humanities and Social Sciences.

APPENDIX: PROOFS

A.1. PROOF OF PROPOSITION 1

Under the assumptions, DFLE may be expressed as

eDF(x)=1l(x)i𝒜x[niLiii+niπ(t)l(t)dt], (A.1)

where the equality follows from the fact that, under the stationarity assumptions, (10) holds for all x𝚨. Moreover, under the stationarity assumptions, the age distribution is given by (4) and is proportional to the probability that an individual of exact age x being sampled. Therefore, for any age interval, [x, x + nx), if t ∈ [x, x + nx), then we have

E(nxπ^x)=E[π(t)]=xx+nxπ(t)l(t)dtxx+nxl(t)dt, (A.2)

where the first equality follows from (12) and E[Y(t)|t] = π(t). Equally (A.2) implies nxLxE(nxπ^x)=xx+nxπ(t)l(t)dt for any age interval, [x, x + nx). Together with (A.1), we have the desired result:

E(e^xDF)=1lxi𝒜x[1E(niπ^i)]niLi=eDF(x). (A.3)

To prove the consistency, observe that the variance of Y(t) is bounded from above by 1 because Y(t) is binary. Therefore, as nxNx goes to ∞, the variance of nxπ^x goes to 0, which together with its unbiasedness implies that nxπ^x converges in probability to E[π(t) for any age [x, x + nx). Thus, Sullivan’s estimator is consistent for exDF.

A.2. PROOF OF PROPOSITION 2

We first derive the variance of Sullivan’s estimator. Given any age interval, [x, x + nx), we can write the variance of Y(t) as follows:

var[Y(t)]=E[Y(t)]{E[Y(t)]}2=nxpx(1nxpx), (A.4)

where the first equality follows from the fact that Y(t) is binary and the second equality follows from E[Y(t)] = E[π(t)], the expression of which is given in (A.2). Together with the definition of Sullivan’s estimator in (11), this implies that the variance equals σxDF.

To prove the consistency of σ^xDF, note that the proof of Proposition 1 shows that, under the stationarity assumptions, for any age interval, [x, x + nx), nxpx can be estimated by nxπ^x consistently. Thus, it follows that σ^xDF in (13) is a consistent estimator of σxDF. To prove the approximate unbiasedness of σ^xDF, we note that

E[niπ^i(niπ^i)]=E[1niNij=1niNi(Yij(tij)niπ^i)2] (A.5)

for any age interval, [i, i + ni), where the equality follows from the fact that Yij(tij) is binary. Then, the right side of this equation approximates nxPx(1−nxPx) = var[Y(t)] if and only if niNiniNi − 1.

A.3. LARGE–SAMPLE VARIANCE OF e^xDF WHEN nxmx IS ESTIMATED

Because nxLx = nxaxlx + (nxnxax)lx+nx, we rewrite e^xDF as

e^xDF=(1nxπ^x)nxax+(1π^ω)aωp^xω+i𝒜x+nx\{ω}(1niπ^i)cip^xi,

where ci = (ni−ni−ai−) + niai is a constant and i− indexes the age interval ending at age i and njxi = li/lx. Recall that xi = ∏j∈𝚨x \{i} njj, where njj = 1 − njj. Because nxMx is a consistent estimate of nxMx is a consistent estimate of nxmx, xi also converges to pxi in probability. Then, following Chiang (1984, chap. 4), we expand the variance of exDF around E(nxπ^x) and pxi, which is true value of xi, using Taylor’s theorem to obtain the large-sample variance:

Var(e^xDF)=nxax2Var(nxπ^x)+Var((1π^ω)aωp^xω+i𝒜x+nx\{ω}(1niπ^i)cip^xi)nxax2Var(nxπ^x)+i𝒜x\{ω}pxi2[{1E(niπ^i+ni)}(niniai)+ei+niDF]2Var(nip^i)+(aωpxω)2var(π^ω)+i𝒜x+nx\{ω}(cipxi)2Var(niπ^i), (A.6)

where

Var(nip^i)=[E(niq^i)]2[1E(niq^i)]niDi

and

Var(niπ^i)=E(niπ^i)[1E(niπ^i)]niNi2.

Note that the independence of nxπ^x across age groups and between nxπ^x and x′ for any x as well as the independence of x across age groups are assumed. (These independence assumptions are reasonable if the cross-sectional disability survey is conducted separately from the construction of the period life table, as is often the case.) Finally, the large-sample variance in (A.6) can be consistently estimated by replacing E(nxπ^x) and pxi with their consistent estimates, that is, nxπ^x and xi, respectively.

Footnotes

A replication archive for this article is publicly available as Imai and Soneji (2007). The archive containx the computer code that implements the proposed method as well as the data set analyzed in Section 5. An earlier version of this article was presented at the 2006 Population Association of America Meeting, 2006 Joint Statistical Meetings, and 2006 Réseau Espérance de vie en Santé Annual Meeting.

REFERENCES

  1. Albarran I, Ayuso M, Guillen M, Monteverde M. A Multiple State Model for Disability Using the Decomposition of Death Probabilities and Cross-Sectional Data. Communications in Statistics, Part A—Theory and Methods. 2005;34:2063–2075. [Google Scholar]
  2. Arias E. United States Life Table, 2002. National Vital Statistics Reports. 2002;53:6. [PubMed] [Google Scholar]
  3. Barendregt J, Bonneux L, Van der Maas P. Health Expectancy: An Indicator for Change? Journal of Epidemiology and Community Health. 1994;48:482–487. doi: 10.1136/jech.48.5.482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barendregt J, Bonneux L, Van der Maas P. Health Expectancy: An Indicator for Change? Journal of Epidemiology and Community Health. 1995;49:330–332. doi: 10.1136/jech.48.5.482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Beran R. Balanced Simultaneous Confidence Sets. Journal of the American Statistical Association. 1988;83:679–686. [Google Scholar]
  6. Bronnum-Hansen H. Socioeconomic Differences in Health Expectancy in Denmark. Scandinavian Journal of Public Health. 2000;28:194–199. [PubMed] [Google Scholar]
  7. Bronnum-Hansen H, Andersen O, Kjoller M, Rasmussen NK. Social Gradient in Life Expectancy and Health Expectancy in Denmark. Sozial-Und Praventivmedizin. 2004;49:36–41. doi: 10.1007/s00038-003-3003-9. [DOI] [PubMed] [Google Scholar]
  8. Cambois E, Robine J, Hayward M. Social Inequalities in Disability-Free Life Expectancy in the French Male Population, 1980–1991. Demography. 2001;38:513–524. doi: 10.1353/dem.2001.0033. [DOI] [PubMed] [Google Scholar]
  9. Cheng J, Small D. Bounds on Causal Effects in Three-Arm Trials With Noncompliance. (Ser. B).Journal of the Royal Statistical Society. 2006;68:815–836. [Google Scholar]
  10. Chiang CL. The Life Table and Its Applications. Robert E. Krieger; Malabar, FL: 1984. [Google Scholar]
  11. Crimmins E, Hayward M, Saito Y. Changing Mortality and Morbidity Rates and the Health Status and Life Expectancy of the Older Population. Demography. 1994;31:159–175. [PubMed] [Google Scholar]
  12. Crimmins E, Saito Y. Trends in Healthy Life Expectancy in the United States, 1970–1990: Gender, Racial, and Educational Differences. Social Science and Medicine. 2001;52:1629–1641. doi: 10.1016/s0277-9536(00)00273-2. [DOI] [PubMed] [Google Scholar]
  13. Crimmins E, Saito Y, Ingeneri D. Changes in Life Expectancy and Disability-Free Life Expectancy in the United States. Population and Development Review. 1989;15:235–267. [Google Scholar]
  14. Crimmins E, Saito Y, Ingeneri D. Trends in Disability-Free Life Expectancy in the United States, 1970–1990. Population and Development Review. 1997;23:555–572. [Google Scholar]
  15. Davis BA, Heathcote CR, O’Neill TJ. Estimating Cohort Health Expectancies From Cross-Sectional Surveys of Disability. Statistics in Medicine. 2001;20:1097–1111. doi: 10.1002/sim.724. [DOI] [PubMed] [Google Scholar]
  16. DeNavas-Walt C, Proctor B, Hill Lee C. Technical Report Current Population Reports P60-229, U.S. Census Bureau. U.S. Government Printing Office; Washington, DC: 2005. Income, Poverty, and Health Insurance Coverage in the United States: 2004. [Google Scholar]
  17. Fries J. Aging, Natural Death, and the Compression of Morbidity. New England Journal of Medicine. 1980;303:130–135. doi: 10.1056/NEJM198007173030304. [DOI] [PubMed] [Google Scholar]
  18. Graham P, Blakely T, Davis P, Sporle A, Pearce N. Compression, Expansion, or Dynamic Equilibrium? The Evolution of Health Expectancy in New Zealand. Journal of Epidemiology and Community Health. 2004;58:659–666. doi: 10.1136/jech.2003.014910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gruenberg E. The Failures of Success. Milbank Memorial Foundation Quarterly/Health and Society. 1977;55:3–24. [PubMed] [Google Scholar]
  20. Hastie TJ, Tibshirani R. Generalized Additive Models. Chapman&Hall; London: 1990. [Google Scholar]
  21. Hoem JM, Jensen UF. Multistate Life Table Methodology: A Probabilist Critique. In: Land K, Rogers A, editors. Multidimensional Mathematical Demography. Academic Press; New York: 1982. pp. 155–264. [Google Scholar]
  22. Horiuchi S, Coale A. A Simple Equation for Estimating the Expectation of Life at Old Ages. Population Studies. 1982;36:317–326. doi: 10.1080/00324728.1982.10409034. [DOI] [PubMed] [Google Scholar]
  23. Horowitz J, Manski C. Nonparametric Analysis of Randomized Experiments With Missing Covariate and Outcome Data. Journal of the American Statistical Association. 2000;95:77–84. [Google Scholar]
  24. Iburg D, Bronnum-Hansen B, Bjerregaard T. Health Expectancy in Greenland. Scandinavian Journal of Public Health. 2001;29:5–12. doi: 10.1177/14034948010290010501. [DOI] [PubMed] [Google Scholar]
  25. Imai K, Soneji S. hdl:1902.1/GWRFQUYNBZ. Henry A. Murray Research Archive [distributor (DDI)]; 2007. Replication Dataset for ’On the Estimation of Disability-Free Life Expectancy: Sullivan’s Method and Its Extension,’.http://id.thedata.org/hdl%3A1902.1%2FGWRFQUYNBZ [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kramer M. The Rising Pandemic of Mental Disorders and Associated Chronic Diseases and Disabilities. Acta Psychiatrica Scandinavica. 1980;62:282–297. [Google Scholar]
  27. Krishnamoorthy S. Classical Approach to Increment–Decrement Life Tables: An Application to the Study of Marital Status of United States Females, 1970. Mathematical Biosciences. 1979;44:139–154. [Google Scholar]
  28. Laditka S, Wolf D. New Methods for Analyzing Active Life Expectancy. Journal of Aging and Health. 1998;10:214–241. [Google Scholar]
  29. Laditka SB, Hayward MD. The Evolution of Demographic Methods to Calculate Health Expectancies. In: Robine JM, Jagger C, Mathers CD, Crimmins EM, Suzman RM, editors. Determining Health Expectancies. Wiley; New York: 2003. pp. 221–234. [Google Scholar]
  30. Land K, Rogers A. Multidimensional Mathematical Demography. Academic Press; New York: 1982. [Google Scholar]
  31. Lee RD, Carter LR. Modeling and Forecasting U.S. Mortality. Journal of the American Statistical Association. 1992;87:659–671. [Google Scholar]
  32. Lievre A, Brouard N, Heathcote C. The Estimation of Health Expectancies From Cross-Longitudinal Surveys. Mathematical Population Studies. 2003;10:211–248. [Google Scholar]
  33. Liu X, Liang J, Jow-Ching E, Whitelaw N. Modeling Multidimensional Transitions in Health Care. Sociological Methods and Research. 1997;25:284–317. [Google Scholar]
  34. Manton K. Changing Concepts of Morbidity and Mortality in the Elderly Population. Milbank Memorial Fund Quarterly/Health and Society. 1982;60:183–244. [PubMed] [Google Scholar]
  35. Manton K, Land K. Active Life Expectancy Estimates for the U.S. Elderly Population: A Multidimensional Continuous-Mixture Model of Functional Change Applied to Completed Cohorts, 1982–1996. Demography. 2000;37:253–265. [PubMed] [Google Scholar]
  36. Manuel D, Schultz S. Health-Related Quality of Life and Health-Adjusted Life Expectancy of People With Diabetes in Ontario, Canada, 1996–1997. Diabetes Care. 2004;27:407–414. doi: 10.2337/diacare.27.2.407. [DOI] [PubMed] [Google Scholar]
  37. Mathers C. Health Expectancies in Australia 1981 and 1988. Australian Government Publishing Service; Canberra: 1991. [Google Scholar]
  38. Mathers C, Robine J. How Good Is Sullivan’s Method for Monitoring Changes in Population Health Expectancies. Journal of Epidemiology and Community Health. 1997;51:80–86. doi: 10.1136/jech.51.1.80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Mathers C, Vos T, Stevenson C, Begg S. The Burden of Disease and Injury in Australia. Bulletin of the World Health Organization. 2001;79:1076–1084. [PMC free article] [PubMed] [Google Scholar]
  40. Minicuci N. Influence of Level of Education on Disability Free Life Expectancy by Sex: The ILSA Study. Experimental Gerontology. 2005;40:997–1003. doi: 10.1016/j.exger.2005.08.011. [DOI] [PubMed] [Google Scholar]
  41. Molla M, Madans J, Wagener D. Differentials in Adult Mortality and Activity Limitation by Years of Education in the United States at the End of the 1990s. Population and Development Review. 2004;30:625–646. [Google Scholar]
  42. Molla M, Madans J, Wagener D, Crimmins E. National Center for Health Statistics; Hyattsville, MD: 2003. Summary Measures of Population Health Report of Findings on Methodologic and Data Issues. technical report. [Google Scholar]
  43. Molla M, Wagener D, Madans J. Healthy People Statistical Notes. National Center for Health Statistics; Hyattsville, MD: 2001. Summary Measures of Population Health: Methods for Calculating Healthy Life Expectancy. Technical Report 21. [DOI] [PubMed] [Google Scholar]
  44. Montpellier J. Reseau Esperance de Vie en Sante (REVES) 1997. Health Expectancy Calculation by the Sullivan Method: A Practical Guide. technical report. [Google Scholar]
  45. Murray C, Lopez A. The Global Burden of Disease: A Comprehensive Assessment of Mortality and Disability From Diseases, Injuries, and Risk Factors in 1990 and Projected to 2020. Harvard University Press; Cambridge, MA: 1996. [Google Scholar]
  46. Musgrove P, Creese A, Preker A, Baeza C, Anell A, Prentice T. Health Systems: Improving Performance. World Health Organization; Geneva: 2000. World Health Report 2000. [Google Scholar]
  47. Newman S. A Markov Process Interpretation of Sullivan’s Index of Morbidity and Mortality. Statistics in Medicine. 1988;7:787–794. doi: 10.1002/sim.4780070708. [DOI] [PubMed] [Google Scholar]
  48. Nour E-S, Suchindran C. A General Formulation of the Life Table. Mathematical Biosciences. 1983;63:241–252. [Google Scholar]
  49. Nour E-S, Suchindran C, Willekens FJ, et al. The Construction of Multi-State Life Tables. Population Studies. 1984;38:325–328. Comment on. [Google Scholar]
  50. Nusselder W, Looman C. Decomposition of Differences in Health Expectancy by Cause. Demography. 2004;41:315–334. doi: 10.1353/dem.2004.0017. [DOI] [PubMed] [Google Scholar]
  51. Nusselder W, Looman C, Mackenbach J, Huisman M, Van Oyer H, Deboosere P, Kunst A. The Contribution of Specific Diseases to Educational Disparities in Disability-Free Life Expectancy. American Journal of Public Health. 2005;95:2035–2041. doi: 10.2105/AJPH.2004.054700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Palloni A, Guillen M, Monteverde M, Ayuso M, White R. University of Wisconsin–Madison, Center for Demography and Health of Aging; 2005. Evaluation of Errors in Cross-Sectional Estimates of Life Expectancy in Disability. technical report. [Google Scholar]
  53. Preston S, Heuveline P, Guillot M. Demography: Measuring and Modeling Population Processes. Blackwell; Malden, MA: 2001. [Google Scholar]
  54. Robine JM, Romieu I, Cambois E, Van de Water H, Boshuizen H, Jagger C. Network on Health Expectancy and the Disability Process (REVES); 1995. Global Assessment in Positive Health. Technical Report 196. [Google Scholar]
  55. Rogers A, Rogers R, Belanger A. Longer Life but Worse Health? Measurement and Dynamics. The Gerontologist. 1990;30:640–649. doi: 10.1093/geront/30.5.640. [DOI] [PubMed] [Google Scholar]
  56. Rogers A, Rogers R, Branch L. A Multistate Analysis of Active Life Expectancy. Public Health Reports. 1989a;104:222–226. [PMC free article] [PubMed] [Google Scholar]
  57. Rogers R, Rogers A, Belanger A. Active Life Among the Elderly in the United States: Multistate Life-Table Estimates and Population Projections. The Milbank Quarterly. 1989b;67:370–411. [PubMed] [Google Scholar]
  58. Schoen R. Constructing Increment–Decrement Life Tables. Demography. 1975;41:3–28. [PubMed] [Google Scholar]
  59. Schoen R. Calculating Increment–Decrement Life Tables by Estimating Mean Durations at Transfers From Observed Rates. Mathematical Biosciences. 1979;47:255–269. [Google Scholar]
  60. Schoen R. Modeling Multigroup Populations. Plenum; New York: 1988. [Google Scholar]
  61. Schoen R. Toward a General Model for Populations With Changing Rates. Demographic Research. 2001; 4:163–184. [Google Scholar]
  62. Schoen R, Nelson V. Marriage, Divorce, and Mortality: A Life Table Analysis. Demography. 1974;11:267–290. doi: 10.2307/2060563. [DOI] [PubMed] [Google Scholar]
  63. Schoen R, Urton W. National Central Bureau of Statistics; 1979. Marital Status Life Tables for Sweden. Technical Report Urval Number 10. [Google Scholar]
  64. Sihvonen A-P, Kunst A, Lahelma E, Valkonen T, Mackenbach J. Socioeconomic Inequalities in Health Expectancy in Finland and Norway in the Late 1980s. Social Science in Medicine. 1998;47:303–315. doi: 10.1016/s0277-9536(98)00020-3. [DOI] [PubMed] [Google Scholar]
  65. Sullivan D. A Single Index of Mortality and Morbidity. HSMHA Health Reports. 1971;86:347–354. [PMC free article] [PubMed] [Google Scholar]
  66. Van de Water H, Boshuizen H, Perenboom R, Mathers C, Robine J-M. Health Expectancy: An Indicator for Change? Journal of Epidemiology and Community Health. 1995;49:330–332. doi: 10.1136/jech.49.3.330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Wilmoth J, Andreev K, Jdanov D, Glei D. University of California at Berkeley and Max Planck Institute for Demographic Research; 2005. Methods Protocol for the Human Mortality Database. technical report. [Google Scholar]
  68. Yi Z, Danan G, Land K. A New Method for Correcting Underestimation of Disabled Life Expectancy and an Application to the Chinese Oldest-Old. Demography. 2004;41:335–361. doi: 10.1353/dem.2004.0018. [DOI] [PubMed] [Google Scholar]

RESOURCES