Introduction
Disease burden is often encapsulated in mortality rates. However, given the connection between age and death, mortality rates typically are most influenced by deaths among older adults. Alternative epidemiologic measures of mortality burden have been proposed that explicitly weight deaths at younger ages more heavily [1–3]. One such measure is years of potential life lost [4] (YPLL), which has been used in diverse contexts to quantify premature mortality burden [5,6]. YPLL for an individual fatality i is defined to be an upper reference age —typically close to a broadly applicable life expectancy—minus age at death ai if the difference is positive and zero otherwise.
(1) |
Mortality data are often summarized as death counts within age intervals because of privacy issues, such as national death counts from the coronavirus disease 2019 (COVID-19) pandemic [7], precluding exact calculation of YPLL values. In such settings, the standard approach is to operationally replace each individual’s age at death with its interval midpoint for the purpose of calculating aggregate YPLL, a method that implicitly assumes that ages at death within intervals are uniformly distributed [8]. However, no attempt is typically made in applied epidemiologic studies using this “midpoint method” to quantify the uncertainty in YPLL-based estimates because of the administrative interval censoring of ages at death [9–12]. To address this limitation, we propose a straightforward and easily implementable Monte Carlo (MC) simulation procedure that yields point estimates of YPLL-based quantities with associated uncertainty quantification consistent with the assumption of uniformly distributed ages at death within intervals.
Proposed Monte Carlo simulation procedure
Suppose we wish to estimate a scalar quantity θ that is a function F of individual YPLL values. Let d1, d2, …, dK denote the death counts in K mutually exclusive, collectively exhaustive, and chronologically ordered age intervals (L1 = 0, U1), (L2, U2)…,(LK, UK = ∞), respectively, from the study population. The proposed procedure simulates plausible ages at death for each individual at each MC iteration, and an estimate of θ is obtained from their corresponding YPLL values. The collection of MC estimates of θ is then summarized as an overall point and interval estimate of θ, reflecting the uncertainty in the true ages at death. Observe that choosing such that obviates the need to simulate ages at death from (LK, ∞). The proposed procedure can be described as follows:
- Let {(L1, U1), (L2, U2),…, (Lr, Ur)} denote the maximal set of intervals such that L1 < L2 < ⋯ < Lr < , and r < K. For each death i in interval (Lj, Uj), j = 1,2,…,r, independently simulate an age at death aij from a continuous uniform distribution corresponding to the interval:
(2) - Compute a point estimate of θ from the simulated ages:
(3) - Repeat steps 1 and 2 to obtain point estimates of θ, denoted . The overall point estimate of θ is given by the following equation:
(4)
Furthermore, a (1 –α) × 100% interval estimate of θ is given by the and quantiles of :
(5) |
where denotes the q ∈ (0, 1) quantile of .
The proposed procedure and the midpoint method yield estimates of total YPLL equal in expectation, but challenges can arise when using the midpoint method to estimate age-standardized YPLL rates. The midpoint method requires the age intervals in the mortality data to be aligned with those in the study and standard population—which may require combining age intervals—to calculate age-specific YPLL rates in the study population that are then applied to the standard population. If analyses are based on a coarser representation of the mortality data, study population, or standard population in order to align age intervals, it is expected that resulting estimates will be less accurate. If the age intervals cannot be aligned by combining age intervals, it is not straightforward how to proceed with the midpoint method. A key advantage of the proposed procedure is its flexibility; because simulated ages are continuous, their corresponding YPLL values can be aggregated with respect to an arbitrary collection of age intervals for which age-specific YPLL rates can be calculated and subsequently applied to the standard population.
Illustrative data example
We demonstrate the proposed procedure on COVID-19 death counts by age and sex from the Directorate General of Health in Portugal [13], calculating a point and 95% interval estimate of the age-adjusted male-to-female YPLL rate ratio (RR). The cumulative death counts by age and sex as of July 30, 2020, are presented in the July 31, 2020, daily Directorate General of Health epidemiologic report (pg. 4), included as Supplementary File 1. Although the number of male (n = 867) and female (n = 868) deaths are almost equal, a greater concentration of male deaths occurs at earlier ages, which will be reflected in YPLL calculations.
We use 2020 estimates of the sex-stratified age distribution of Portugal from PopulationPyramid.net [14] (summarized in 5-year intervals up to age 100 years) to calculate sex- and age-specific YPLL rates, and we use their combined overall age distribution as the standard population (see Supplementary File 2 for the corresponding data). Direct age adjustment [15] is used to calculate the male and female age-adjusted YPLL rates, and their quotient is the age-adjusted male-to-female YPLL RR. Consistent with the Organisation for Economic Co-operation and Development Health Statistics standard [16], we use an upper reference age of , and we perform MC iterations.
Figure 1 displays a boxplot of the 1000 MC estimates of the age-adjusted male-to-female YPLL RR along with the midpoint method estimate for comparison. The overall point estimate of the age-adjusted male-to-female YPLL RR is 1.77 with an associated 95% interval estimate of (1.65, 1.90), meaning that after accounting for differences in the male and female population age distributions, males experienced COVID-19-attributable YPLL before age 70 at a rate estimated to be 77% higher than females with a 95% interval estimate of the true relative increase of 65%—90%. The midpoint method, which requires the Portugal population age intervals to be combined into 10-year intervals, yields an estimate of 1.78 with no uncertainty quantification. R code implementing this reproducible illustrative data example is provided as Supplementary File 3.
Conclusions
In summary, we developed a straightforward and easily implementable MC simulation procedure to quantify the uncertainty of YPLL-based estimates from mortality data summarized as death counts within age intervals. We demonstrated the procedure on COVID-19 mortality data from Portugal, calculating a point and 95% interval estimate for the age-adjusted male-to-female YPLL RR with respect to cumulative deaths as of July 30, 2020. We encourage future applied epidemiologic research quantifying the mortality burden of public health problems in terms of YPLL to quantify the uncertainty of their estimates.
Supplementary Material
Footnotes
Supplementary data
Supplementary data associated with this article can be found in the online version at https://doi.org/10.1016/j.annepidem.2020.11.002.
References
- [1].Wise RP, Livengood JR, Berkelman RL, Goodman RA. Methodological alternatives for measuring premature mortality. Am J Prev Med 1988;4(5):268–73. [PubMed] [Google Scholar]
- [2].Bonneux L How to measure the burden of mortality? J Epidemiol Community Health 2002;56(2):128–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Devleesschauwer B, Havelaar AH, De Noordhout CM, Haagsma JA, Praet N, Dorny P, et al. Calculating disability-adjusted life years to quantify burden of disease. Int J Public Health 2014;59(3):565–9. [DOI] [PubMed] [Google Scholar]
- [4].Gardner JW, Sanborn JS. Years of potential life lost (YPLL) — What does it measure? Epidemiology 1990:322–9. [DOI] [PubMed] [Google Scholar]
- [5].Wong MD, Chung AK, Boscardin WJ, Li M, Hsieh HJ, Ettner SL, et al. The contribution of specific causes of death to sex differences in mortality. Public Health Rep 2006;121(6):746–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Darke S, Marel C, Mills KL, Ross J, Slade T, Tessson M. Years of potential life lost amongst heroin users in the Australian Treatment Outcome Study cohort, 2001–2015. Drug Alcohol Depend 2016;162:206–10. [DOI] [PubMed] [Google Scholar]
- [7].Covid-19 Mexico. https://datos.covid-19.conacyt.mx/. [Accessed 31 July 2020].
- [8].Romeder J, McWhinnie J. Potential years of life lost between ages 1 and 70: an indicator of premature mortality for health planning. Int J Epidemiol 1977;6(2):143–51. [DOI] [PubMed] [Google Scholar]
- [9].Arca M, Di Orio F, Forastiere F, Tasco C, Perucci C. Years of potential life lost (YPLL) before age 65 in Italy. Am J Public Health 1988;78(9):1202–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Curran JW, Morgan WM, Hardy AM, Jaffe HW, Darrow WW, Dowdle WR. The epidemiology of AIDS: current status and future prospects. Science 1985;229(4720):1352–7. [DOI] [PubMed] [Google Scholar]
- [11].Albert VA, Koh HK, Geller AC, Miller DR, Prout MN, Lew RA. Years of potential life lost: another indicator of the impact of cutaneous malignant melanoma on society. J Am Acad Dermatol 1990;23(2):308–10. [DOI] [PubMed] [Google Scholar]
- [12].Mitra AK, Payton M, Kabir N, Whitehead A, Ragland KN, Brown A. Potential years of life lost due to COVID-19 in the United States, Italy, and Germany: an old formula with newer ideas. Int J Environ Res Public Health 2020;17(12): 4392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].de Situação Relatório. https://covid19.min-saude.pt/relatorio-de-situacao/. [Accessed 31 July 2020].
- [14].PopulationPyramid.net. Population pyramids of the world from 1950 to 2100. https://www.populationpyramid.net/. [Accessed 31 July 2020].
- [15].Celentano DD, Szklo M. Gordis Epidemiology. Elsevier; 2019. [Google Scholar]
- [16].Potential years of life lost. https://data.oecd.org/healthstat/potential-years-of-life-lost.htm. [Accessed 31 July 2020].
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.