Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Sep 4;14:20592. doi: 10.1038/s41598-024-71444-w

Modelling the age distribution of longevity leaders

Csaba Kiss 1, László Németh 2,3,, Bálint Vető 1,4
PMCID: PMC11375213  PMID: 39232045

Abstract

Human longevity leaders with remarkably long lifespan play a crucial role in the advancement of longevity research. In this paper, we propose a stochastic model to describe the evolution of the age of the oldest person in the world by a Markov process, in which we assume that the births of the individuals follow a Poisson process with increasing intensity, lifespans of individuals are independent and can be characterized by a gamma–Gompertz distribution with time-dependent parameters. We utilize a dataset of the world’s oldest person title holders since 1955, and we compute the maximum likelihood estimate for the parameters iteratively by numerical integration. Based on our preliminary estimates, the model provides a good fit to the data and shows that the age of the oldest person alive increases over time in the future. The estimated parameters enable us to describe the distribution of the age of the record holder process at a future time point.

Keywords: Human longevity, World’s oldest person, Stochastic model, Gamma–Gompertz lifespan distribution

Subject terms: Applied mathematics, Statistics

Introduction

Exceptionally long human lifespans are one of the cornerstones of demography and mortality research. Studying the group of record holders may reveal not only the underlying mortality mechanism of a population but also potentially shed some light on the future developments of human longevity. As life expectancy increases1 with deaths shifting to older ages, the distribution of deaths at the oldest-old ages2,3 gains more interest of demographers, actuaries and decision makers of numerous disciplines.

The pattern of adult human mortality has already been described4 but there is still a debate about the exact distribution of deaths at adult ages. More details on the necessity of a correctly specified model for the underlying mortality process and its impact on further research are discussed in5,6. Numerous publications review the ongoing discussion on the existence of a mortality plateau712, and the levelling-off of adult human death rates at the oldest ages is supported by the findings in1315 while others cast some doubt on this observation1618. If there is a mortality plateau then the distribution of deaths at the oldest-old ages must be gamma-Gompertz and human lifespan can increase further without any maximum19. Further studies discuss the existence of a limit to human lifespan with more focus on the extreme value distribution aspect of the deaths at the oldest-old ages for various populations and different lifetime distributions2024. These models can be helpful in determining the plausibility of longevity leaders as well.

We contribute to this discussion by proposing a stochastic model to describe the evolution of the age of the world’s oldest person. Based on our estimates the model provides a good fit to the titleholder data since 1955, collected by the Gerontology Research Group25. With the model results, it is possible to predict the age of the oldest person in the world in the future. When should we expect to see the next Jeanne Calment, the supercentenarian with the longest human lifespan ever documented? Will her record ever be surpassed? Our results provide a prediction for the age distribution of the record holder in the coming decades to answer these questions.

Results

Our model describes the evolution of the age of the oldest living person under the following assumptions. We assume that the births of individuals follow a Poisson process with time-dependent intensity26. The lifespans of individuals in the population are independent and their distribution may depend on the date of birth. Then the age of the record holder in the population evolves in time as a Markov process with explicit transition probabilities. As the first main result of this paper, we explicitly compute the distribution of the age of the record holder for any given birth rate parameter and lifespan distribution. The detailed mathematical description and the properties of the general model are described in section "Mathematical model for the age of the oldest person".

We apply our general result to the case which approximates the human birth rate over the world and the human lifespan. We specify the intensity function of the Poisson process of births to have an exponential growth in time. The underlying force of mortality is chosen so that it follows an extension of the Gompertz mortality model4 and the lifespan distribution of individuals is given by the gamma–Gompertz (ΓG) distribution with time-dependent parameters. This distribution adequately captures the slowing down of senescence mortality at the oldest old ages. Given the growth parameters of the birth rate, we fit the model parameters to the statistics of the oldest person titleholder data using maximum likelihood method. The optimal parameters of the model fit well to the data. It shows in particular that the age of the oldest person alive increases over time, and it will most likely increase further in the future. We compute the expected value and a confidence interval for the age of the world’s oldest person using the fitted model parameters for each year between 1955 and 2019 shown by the green curves on Fig. 1. The detailed discussion of the model specification, likelihood calculations as well as the parameter fitting are given in section "Model specification and parameter fitting". Section "Methods" contains calculations related to the gamma–Gompertz–Makeham generalization of the gamma–Gompertz distribution.

Figure 1.

Figure 1

Blue curve: age of the oldest person in the world since 1955 with two outliers indicated (Jeanne Calment: JC and Sarah Knauss: SK). Green curves: the model-estimated mean age of the oldest person (solid line) and the standard deviation (distance from the dashed lines) with the estimation based on the full dataset. Red curves: mean age and standard deviation of the oldest person estimated using the data between 1955 and 1988.

Our results enable us to predict the age distribution of the world’s oldest person at future time points. We compute the probability density of the age of the world’s oldest person in different years not only in the past but also in the future. These densities are shown in Fig. 2. When comparing the age distribution of the oldest person in the world in different years to the age of Jeanne Calment at her death, we find that on Jan 1st 2060 we can expect that the age of the world’s oldest person will exceed her age with probability around 0.5. This also means that with high probability her age record will already be broken by that time.

Figure 2.

Figure 2

The probability densities of the age of the oldest person in the world in the following years: 1960 (red), 1980 (green), 2000 (blue), 2020 (pink), 2040 (orange), 2060 (magenta), 2080 (yellow) and 2100 (black).

In Fig. 1 two extreme outliers with unexpectedly long lifetimes can be observed. Jeanne Calment died at the age of 122.45 years in 1997, and Sarah Knauss died at the age 119.27 in 2000. In our model, the probability of observing an age greater than or equal to their actual age at the time of their death is 0.000286 for Calment and 0.0116 for Knauss. See details in section "Computation of the oldest person’s age and of the reign length".

The fact that Calment and Knauss are outliers among the oldest old in the world became even more evident when we performed a backtesting of our model. We estimated the parameters based on the data on the world’s oldest person between 1955 and 1988 where the ending date is the time when Calment became the world’s oldest person. The model-estimated mean and confidence interval of the world’s oldest person using the full data and the partial dataset (before Calment) are shown in Fig. 1 by green and red, respectively. The estimate using the data until 1988 is less reliable after 2000 which is shown by the fact that the observed data is out of the confidence interval in the majority of the time after 2000. When we compare the two confidence intervals, we can conclude that, based on the data before 1988, Calment and Knauss already had extremely high ages at their deaths. Adding the remaining data set, the estimated mean age of the world’s oldest person becomes lower. Hence we can conclude that now we can consider the ages of the two outliers between 1988 and 2000 at their death to be more extreme than based on the information available in 1988.

The other important observable is the reign length of a record holder. The numerical value of the expected reign length with our estimated model parameters is 1.195 in 1955 and it is 1.188 in 2019. The empirical value of the reign length is 1.008 which is not much less than the model-based estimate.

Our approach to studying the age of the oldest old is completely novel because it takes into account jointly the age and the time of birth of individuals. Although the age and the reign length of the world’s oldest person depend in a complex non-linear way on the total lifespan and the time of birth of supercentenarians, we compute explicitly the probability distribution of the age of the oldest person. Hence, the performance of our predictions cannot be directly compared to previous results in the literature because in the usual approach, the oldest person in each cohort is considered separately, and it is not relevant whether this person was ever the oldest in the population, see e.g. the extreme value method in20. Our model contributes to the mathematical understanding of the evolution of the oldest individual, which is the extra benefit compared to a prediction using the trend in the data, e.g. in linear regression. In this way, we not only observe but also prove mathematically that the dynamics of the birth process and that of the lifespan distribution which we consider in this paper necessarily imply the increase of the expected age of the world’s oldest person.

Mathematical model for the age of the oldest person

In this section, we provide the mathematical definition of a general model for the age of the world’s oldest person, where the births of individuals follow a Poisson process and their lifespans are independent. Under the assumptions of the general model, the age of the record holder in the population evolves in time as a Markov process with explicit transition probabilities. In sections "Model description and two-dimensional representation"–"Exact distribution of the oldest person’s age process", we describe the exact distribution of the age of the record holder in this generality for any given birth rate parameter and lifespan distribution using the two-dimensional representation of the age process of the oldest person. In the time-homogeneous case with constant birth rate and identical lifespan distributions the reign length distribution of a record holder is computed in sections "Homogeneous model"–"Reign length distribution". We explain the role of the entry age parameter in section "Entry age parameter".

Model description and two-dimensional representation

The model is formally defined as follows. Let λ(t) be the birth rate parameter which depends on time and let Ft and ft be a family of cumulative distribution functions and density functions corresponding to non-negative random variables which are also time-dependent. We assume that individuals are born according to a Poisson point process at rate λ(t) and that the lifespan of an individual born at time t is given by Ft so that lifespans are independent for different individuals.

Let Yt denote the age of the oldest person in the population at time t. The process (Yt:tR) is Markovian. The Markov property holds because at any time t the history of the process (Ys:st) provides information about the lifetime of individuals born before the current record holder while any transition of (Ys:st) depends only on the lifetime of the current record holder and of those born after them.

The evolution of the Markov process Yt is the following. It has a deterministic linear growth with slope 1 due to the ageing of the current record holder. This happens until the death of the record holder. Additionally, given that Yt-=limstYs=y for some t with y>0, the process has a downward jump at time t at rate ft(y)/(1-Ft(y)) which is the hazard rate of the distribution Ft at y. This corresponds to the possibility that the record holder dies at time t which happens at rate ft(y)/(1-Ft(y)). The conditional distribution of the jump is given by

P(Yt<x|Yt-=y,Yt<y)=exp-xyλ(t-u)(1-Ft-u(u))du 1

for all x>0. The jump distribution in (1) has an absolutely continuous part supported on [0, y] with density

jy,t(x)=exp-xyλ(t-u)(1-Ft-u(u))duλ(t-x)1-Ft-x(x) 2

and a point mass at 0 with probability

ay,t=exp-0yλ(t-u)(1-Ft-u(u))du. 3

As we shall see in the relevant parameter regime the probability ay,t of the point mass at 0 is negligible. The transition formula in (1) can be proven using the description below.

We introduce a two-dimensional representation of the process Yt as follows. Let Λ={(ti,xi):iI} be a marked Poisson process in R×R+ where {ti:iI} forms a Poisson point process on R with intensity λ(t) and xi0 is sampled independently for each iI according to the distribution Fti. The point (ti,xi) represents an individual born at time ti with lifespan xi for all iI, that is, the individual i is alive in the time interval [ti,ti+xi) and their age at time t is t-ti if t[ti,ti+xi). Hence the marked Poisson process Λ contains all relevant information about the age statistics of the population at any time. In particular the age of the oldest person Yt can be expressed in terms of Λ as

Yt=max(t-ti)1{t[ti,ti+xi)}:iI. 4

where the indicator 1{t[ti,ti+xi)} is 1 exactly if the ith person is alive at time t.

The transition distribution formula (1) can be seen using the two-dimensional representation as follows. Given that the current record holder dies at time t at age y the event {Yt<x} means that nobody with age between x and y can be alive at time t. This event can be equivalently characterized in terms of the Poisson process of birth at rate λ(·) thinned by the probability that the person is still alive at time t. Indeed the event {Yt<x} can be expressed as a Poisson process of intensity at time t-u given by λ(t-u)(1-Ft-u(u)) for u[x,y] not having any point in the time interval [t-y,t-x]. This probability appears exactly on the right-hand side of (1). In other words, for any u[x,y] people are born at time t-u at rate λ(t-u). On the other hand, the probability for a person born at time t-u to be alive at time t (that is, at age u) is 1-Ft-u(u). See Fig. 3 for illustration.

Figure 3.

Figure 3

Two-dimensional representation of the event {Yt<x} conditionally given {Yt-=y,Yt<y}: the marked Poisson process does not have any point in the grey region, see (1).

Exact distribution of the oldest person’s age process

We assume that all birth events are already sampled on (-,t] together with the corresponding lifespans. Then the distribution of Yt can be computed explicitly for all tR using the two-dimensional representation. For all tR the density

ht(x)=exp-xλ(t-u)(1-Ft-u(u))duλ(t-x)(1-Ft-x(x)) 5

for all x>0 and the point mass at 0

mt=exp-0λ(t-u)(1-Ft-u(u))du 6

characterize the distribution of Yt which can be seen as follows. We mention that the point mass at 0 is negligible in the application.

Similarly to the proof of the transition formula in (1) the event {Yt<x} for any x>0 is the same as the event that nobody with age at least x is alive at time t. We express this event in terms of the Poisson process of birth at rate λ(·) thinned by the probability that the person is still alive at time t. The event {Yt<x} means that a Poisson process of intensity at time t-u given by λ(t-u)(1-Ft-u(u)) for ux does not have any point in (-,t-x] yielding

P(Yt<x)=exp-xλ(t-u)(1-Ft-u(u))du. 7

In other words for any ux individuals are born at time t-u at rate λ(t-u). A person born at time t-u is alive at time t at age u with probability 1-Ft-u(u). (5)–(6) follow by differentiation in (7) and by taking the x0 limit. See Fig. 4 for illustration.

Figure 4.

Figure 4

Two-dimensional representation of the event {Yt<x}: the marked Poisson process does not have any point in the grey region, see (7).

Homogeneous model

The exact computation of the reign length distribution (see section "Reign length distribution") can only be performed in a special case of our general model described in section "Model description and two-dimensional representation". We introduce this special case as the homogeneous model where individuals are born at the times of a Poisson process of constant rate λ=λ(t)>0 for all t. The lifespan of individuals are independent and identically distributed with a fixed density f=ft and cumulative distribution function F=Ft for all t which does not depend on time.

In the homogeneous model the jump distribution given in (2)–(3) simplifies to

jy(x)=jy,t(x)=exp-λxy(1-F(u))duλ(1-F(x)),ay=ay,t=exp-λ0y(1-F(u))du. 8

The distribution of Yt does not depend on time in this case hence it is a stationary distribution as well. The formulas for the density of Yt and point mass at 0 reduce in the homogeneous case to

h(x)=exp-λx(1-F(u))duλ(1-F(x)),m=exp-λ0(1-F(u))du 9

where the integral 0(1-F(u))du is equal to the expected lifespan.

The equilibrium condition for the homogeneous density h can be written as

h(x)+h(x)f(x)1-F(x)-xh(y)f(y)1-F(y)jy(x)dy=0. 10

After differentiation and using the fact that

ddxjy(x)=jy(x)λ(1-F(x))-f(x)1-F(x) 11

one can derive from (10) the second order differential equation

h(x)+2f(x)1-F(x)-λ(1-F(x))h(x)+2f(x)2(1-F(x))2+f(x)1-F(x)h(x)=0. 12

The point mass m at 0 satisfies

mλ=0h(y)f(y)1-F(y)jy(0)dy. 13

The peaks process

In the homogeneous model the sequence of peaks in Yt forms a discrete time Markov chain. By peak we mean a local maximum of Yt with value being equal to the lifespan of the last record holder. Each time the oldest person dies the process Yt has a peak with a downward jump following it. Let Zn denote the age of record holders at which they die which are the values of the peaks of the process Yt. The sequence Zn forms a discrete time Markov chain. The Markov property follows by the fact that ages at death of previous record holders only give information on people born before the current record holder but transitions depend on the lifespan of the current record holder and that of people born after them.

The stationary density of Zn is given by

z(x)=f(x)exp-λx(1-F(u))du0f(y)exp-λy(1-F(u))dudy. 14

The formula can be seen as follows. To have a record holder who dies at age x there has to be a person who has lifespan x which gives the factor f(x) in the numerator on the right-hand side of (14). The exponential factor is by the two-dimensional representation equal to the probability that no people born before the record holder who just died can be alive at the time the record holder dies. The denominator on the right-hand side of (14) makes z(x) a probability density function.

The density of Zn can also be characterized by the following description. It satisfies the integral equation

z(x)=0z(w)0min(x,y)jw(y)f(x)1-F(y)dydw+0z(w)awf(x)dw 15

which comes from the possible transitions of the peak process as follows. If the previous record holder had a total lifetime w[0,) then at the death the process Yt jumps down to some value y at rate jw(y) or to 0 with probability aw. The density of the age at which a person dies who becomes a record holder at age y is f(x)/(1-F(y)). From the integral equation in (15) one can derive the second order differential equation for the function g(x)=z(s)/f(x) given by

g(x)-λ(1-F(x))g(x)+λf(x)g(x)=0 16

which is satisfied by g(x)=cexp-λx(1-F(u))du in accordance with (14).

Reign length distribution

In the homogeneous model, let Wn denote the reign length of the nth record holder, that is, the time length for which this person is the oldest person of the population. The density of the random reign length is given by

r(w)=0h(y)f(y+w)dy+m0wf(z)λe-λ(w-z)f(z)dz0h(y)(1-F(y))dy+m. 17

The density formula in (17) can be derived based on the stationary density of the peaks process given by (14) as follows.

It holds for the density of the reign length that

r(w)=0z(x)0xjx(y)f(y+w)1-F(y)dy+ax0wf(z)λe-λ(w-z)dzdx 18

based on the decomposition with respect to the previous value of the peaks process Zn. The integral 0wf(z)λe-λ(w-z)dz is the density of the convolution of the density f with an independent exponential distribution of parameter λ. On the right-hand side of (18), one can use the definitions of the density z given by (14) and the homogeneous jump distribution jx and ax given by (8). Then in the numerator after the exchange of the order of integrations in the first term and by using the formula for the stationary distribution given by (9) one gets the numerator of (17). In the denominator one can use the equality

0f(y)exp-λy(1-F(u))dudy=0h(y)(1-F(y))dy+m. 19

which follows by integration by parts.

Note also that the density is not equal to the remaining reign length of Yt under the stationary distribution because it would involve the integral 0h(y)f(y+w)/(1-F(y))dy in place of the first term in the numerator on the right-hand side of (17).

Entry age parameter

Next we introduce another parameter which we call the entry age and we denote it by E. As opposed to our original model we consider individuals as being born at age E at a modified birth rate and with a modified lifespan distribution. As a result we obtain a model to the age of the world’s oldest person all values of the entry age parameter E0 and we can fit the model parameters with different values of the entry age.

For any value E0 of the entry age we denote by λE(t) the rate at which people reach the age E at time t, that is,

λE(t)=λ(t-E)(1-Ft-E(E)) 20

because the new birth process of rate λE(t) is obtained by an inhomogeneous thinning of the original Poisson process of the birth events. The lifespan distribution of those born at time t with age E becomes the remaining lifetime distribution at age E. The modified cumulative distribution function and density are given by

FtE(x)=Ft-E(x+E)-Ft-E(E)1-Ft-E(E),ftE(x)=ft-E(x+E)1-Ft-E(E). 21

Model specification and parameter fitting

In section "Model specification: birth rate parameter and lifespan distribution" we specify the general model introduced and discussed in section "Mathematical model for the age of the oldest person", that is, we assume that the birth rate parameter increases exponentially in time and that the lifespan distribution is given by the gamma–Gompertz–Makeham distribution with time-dependent parameters. We provide the details of the computation of the likelihood as a function of the model parameters in section "Likelihood calculations". We show the way to maximize the likelihood and how the optimal parameters can be found using the Nelder–Mead method in section "Likelihood maximization". With these values of the parameters, the age of the world’s oldest person and the reign length of the record holder can be computed as described in section "Computation of the oldest person’s age and of the reign length".

Model specification: birth rate parameter and lifespan distribution

For the rest of the paper we specifiy our general model described in section "Mathematical model for the age of the oldest person" to the following choice of the birth rate parameter and of the lifespan distribution. We choose the value of the entry age to be E=0,30,60 and we fit the model parameters for all three values of E separately.

First we specify the intensity function of the Poisson process of births with an exponential growth in time. For any of the three values of the entry age E we assume that the birth rate at age E is given by

λE(t)=CEeκEt 22

where the numerical values of the parameters CE and κE are obtained by linear regression of the logarithmic data of newborns, people at age 30 and 60 published by the United Nations since 1950. We extrapolate the linear regression backwards in time and we use the numerical values shown in Table 1.

Table 1.

Birth rate parameters in (22) for different values of the entry age E.

E=0 E=30 E=60
CE 6270 4.680·10-9 3.249·10-11
κE 0.004987 0.01876 0.02085

We assume that the underlying force of mortality is chosen so that the lifespan distribution of individuals follows the gamma–Gompertz (ΓG) distribution with cumulative distribution function and density

Fa,b,γΓG(x)=1-1+aγb(ebx-1)-1/γ,fa,b,γΓG(x)=aebx1+aγb(ebx-1)-1-1γ 23

for x0 where a,b,γ are positive parameters. We mention that the gamma–Gompertz–Makeham (ΓGM) distribution differs from the ΓG distribution by the presence of a non-negative extrinsic mortality parameter c which appears as an additive term in the force of mortality. See (29) for the definition of the ΓGM distribution. In our model, we exclude the extrinsic mortality for the following two reasons. Since the extrinsic mortality becomes irrelevant at high ages and we aim to model the front-end of the death distribution at the oldest-old ages, we do not expect to obtain a reliable estimate on the extrinsic mortality using the data about the world’s oldest person. On the other hand, as explained later in section "Likelihood maximization", the likelihood maximization provides unrealistic lifespan distributions even for the ΓG model if one tries to optimize in all the parameters at the same time. In order for the algorithm to result in a distribution close to the actual human lifespan distribution, the number of model parameters had to be decreased.

For our model we suppose that in the ΓG lifespan distribution, parameters b=bE, the rate of aging and γ=γE, the magnitude of heterogeneity are constants over time and that they only depend on the value of the entry age parameter E. The parameter a, the initial level of mortality at the entry age for individuals born at time t, depends on time given by the exponentially decreasing function

aE(t)=KEe-αE(t-2000) 24

where the exponent αE and the constant KE only depends on the entry age E. The reason for subtracting 2000 in (24) is only technical, the numerical values of the parameters do not become tiny with this definition.

In the model with entry age E, we assume that the birth rate λE(t) is given by (22) and we fit the gamma–Gompetz distribution with parameters bE,γE and aE(t) given by (24) for the modified distribution function FtE(x) and density ftE(x) in (21). This means that we search for the best fitting values of the parameters αE,KE,bE,γE which results in an approximation of the remaining lifetime distribution at the age E.

Likelihood calculations

The aim of the maximum likelihood method is to give an estimate to the parameters αE, KE, bE and γE for E=0,30,60 by finding those values for which the likelihood of the full sample is the largest. The sample is obtained from the the historical data on the world’s oldest person available in25. We transform this information into a list of triples of the form (ti,yi,zi) for i=1,,n where ti is the ith time in the sample when the oldest person dies at age yi and the new record holder has age zi at time ti. Then the data has to satisfy the consistency relation ti-zi=ti+1-yi+1 since the two sides express the date of birth of the same person.

In the model with entry age E, the likelihood of the ith data point (ti,yi,zi) given the previous data point is equal to

fti-yi+EE(yi-E)1-Fti-yi+EE(zi-1-E)jyi,tiE(zi)=fti-yi+EE(yi-E)1-Fti-yi+EE(zi-1-E)exp-ziiλE(ti-u+E)(1-Fti-u+EE(u-E))duλE(ti-zi+E)(1-Fti-zi+EE(zi-E)) 25

for all i=2,3,,n except for i=1 in which case the 1-Fy1-t1+EE(z0-E) factor in the denominator is missing. In (25) above we use the transition probabilities of the model with entry age E given by

jy,tE(x)=exp-xyλE(t-u+E)(1-Ft-u+EE(u-E))duλE(t-x+E)(1-Ft-x+EE(x-E)) 26

as a generalization of (2). The explanation of the left-hand side of (25) is that the person died at time ti at age yi had age E at time ti-yi+E. The previous data point ensures that this person has already reached age zi-1 hence we condition their lifetime distribution on this fact. The transition probabilities in (26) are obtained similarly to (2) with the difference that a person at age u with u[x,y] at time t had age E at time t-u+E.

Note that when computing the likelihood of the full data by multiplying the right-hand side of (25) for different values of i the consistency relation of the data implies that the factor 1-Fti-zi+E(zi-E) of the ith term cancels with the factor 1-Fti+1-yi+1+E(zi-E) coming from the (i+1)st term. Hence the log-likelihood of the full sample is given by

l(α,K,b,γ)=i=1nlogfti-yi+EE(yi-E)-ziiλE(ti-u+E)(1-Fti-u+EE(u-E))du+logλE(ti-zi+E)+log(1-Ftn-zn-EE(zn+E))=i=1nlogfKe-α(ti-yi+E-2000),b,γΓG(yi-E)-ziiCeκ(ti-u+E)1-FKe-α(ti-u+E-2000),b,γΓG(u-E)du+log1-FKe-α(tn-zn+E-2000),b,γΓG(zn-E)+nlogC+i=1nκ(ti-zi+E). 27

where we suppress the dependence of the parameters α,K,b,γ on the entry age. Note that the last two terms do not depend on the parameters α,K,b,γ hence we can omit these terms in the maximization of the log-likelihood.

Likelihood maximization

We implemented the calculation of the log-likelihood function l(α,K,b,γ) given by (27) in Python. We used numerical integration to obtain the integrals on the right-hand side of (27). We mention that the general integral formula in (30) could not be used because the parameter a of the gamma–Gompertz distribution in the integrand depends on the integration variable on the right-hand side of (27).

In order to maximize the value of the log-likelihood function l(α,K,b,γ) we applied the Nelder–Mead method27 which is already implemented in Python. We mention that initially we used the gamma–Gompertz–Makeham distribution as lifespan distribution, see (29) for the definition, which contains the extra parameter c to be fitted but it turned out that the number of model parameters has to be reduced. The behaviour of the optimization algorithm in the five parameters α,K,b,c,γ using the gamma–Gompretz–Makeham model was very similar the case of four parameters α,K,c,γ in the gamma–Gompertz model. Running the optimization in the full set of parameters (α,K,b,c,γ in the gamma–Gompertz–Makeham model or α,K,b,γ in the gamma–Gompertz model), it turned out that after a few rounds the parameter K started to decrease dramatically and reached values below 10-10. The resulting lifespan distribution seemed very unrealistic with almost no mortality before the age of 100. This happened for all values of the entry age E=0,30,60.

We explain this phenomenon by the fact that historical data about the oldest person in the world only gives information about the behaviour of the lifespan distribution between the ages 107 and 123. The simple optimization in the four parameters α,K,b,γ simultaneously yields an excellent fit for the tail decay of the lifespan distribution with the historical data but the result may be very far from the actual human lifespan. This would limit the practical relevance of our results.

The mathematical reason for the fact that the four-parameter optimization does not result in a satisfactory approximation to the human lifespan distribution is the following. In these cases, the optimization procedure diverges to those regimes of the parameter space R+4 where the corresponding gamma–Gompertz distribution is degenerate. One can prevent reaching these unrealistic combinations of parameters by reducing the amount of freedom in the optimization. Hence we specify some of the parameters a priori and we perform the optimization in the remaining ones so that it provides a good fit to the data on the age of the oldest old as well as a realistic lifespan distribution.

We believe that the most robust of the four parameters of the ΓG model is b which is the exponent in the time dependence of the mortality rate. By setting the rate of aging b=0.09 the algorithm gives the optimal triple α,K,γ with the best likelihood which is very stable under changing the initial values of these parameters. The running time is also very short.

The Nelder–Mead algorithm, being a numerical maximization method, heavily relies on the tolerance parameter, which determines the minimal improvement required for the algorithm to continue running. If this parameter is set too high, the algorithm might stop before reaching the optimum. Conversely, if set too low, the algorithm might take excessively long to converge. To address this, we drew inspiration from dynamic learning rate algorithms used in neural network training and developed the following meta-algorithm.

First, we run the Nelder–Mead optimization. Based on the improvement from the starting point, we dynamically adjust the tolerance factor, similar to how learning rates are modified during neural network training. We then run the optimization again, recalibrating the tolerance factor based on the observed improvement, and repeat the process. This iterative adjustment allows us to get closer to the optimum, a hypothesis supported by our practical experience with this meta-algorithm. Following this meta-algorithm, only a few calls of the Nelder–Mead method is enough to reach the optimum. The Python codes for the likelihood calculations as well as the Nelder–Mead optimization implemented to this problem are available in28.

The numerical values of the resulting parameters for the three choices of the entry age are shown in Table 2. The survival probability functions with the parameters given in Table 2 for individuals born in 2000 corresponding to the entry age E=0,30,60 are shown on Fig. 5 as a function of the age. We also computed the optimal values of the parameters α,K,γ for other values of the rate of aging b as a sensitivity analysis. The resulting parameter values for the choices b=0.11, b=0.13 and b=0.15 are shown in Table 3.

Table 2.

The optimal parameters obtained for b=0.09 and for various values of the entry age E=0,30,60.

E=0 E=30 E=60
αE 0.01277 0.01124 0.01110
KE 0.00002951 0.0005950 0.01208
b 0.09 0.09 0.09
γE 0.08596 0.08061 0.08026
l(αE,KE,0.09,0,γE) -119.64 -117.68 -117.27

Figure 5.

Figure 5

Survival probabilities as a function of the age for different values of the entry age: E=0 in red, E=30 in black, E=60 in blue.

Table 3.

The optimal parameters for b=0.11, b=0.13 and b=0.15.

E=0 E=30 E=60
αE 0.01561 0.01376 0.01352
KE 3.728·10-6 0.0001495 0.005901
b 0.11 0.11 0.11
γE 0.1160 0.1117 0.1110
l -119.48 -117.51 -117.09
E=0 E=30 E=60
αE 0.01845 0.01628 0.01596
KE 4.530·10-7 3.613·10-5 0.002784
b 0.13 0.13 0.13
γE 0.1441 0.14065 0.1399
l -119.37 -117.40 -116.98
E=0 E=30 E=60
αE 0.02129 0.01880 0.01842
KE 5.362·10-8 8.500·10-6 0.001281
b 0.15 0.15 0.15
γE 0.1708 0.1681 0.1674
l -119.31 -117.34 -116.91

We mention as an alternative approach that scaling the parameters could enhance the optimization process, but this requires prior knowledge of the range within which the parameters vary. This range could be determined through our iterative application of the Nelder–Mead algorithm.

Computation of the oldest person’s age and of the reign length

We observe that the model with the parameters in Table 2 fits well to the titleholder data. We focus on two statistics of the process in order to support this observation about the comparison: the age of the world’s oldest person and the reign length of the record holder. In the case of both statistics exact formulas are only available for the homogeneous model introduced in section "Homogeneous model" where the birth rate is constant as well as the lifespan distribution does not depend on time. Hence we apply an approximation where the error is negligible compared to the difference from the statistics computed using the data.

In the general model the distribution of the age of the world’s oldest person at time t is given by the density ht(x) in (5) and by the point mass mt at 0 in (6). For the numerical computations, we ignore the point mass mt which is below the round-off error in the numerical results. The difficulty in computing the mean age of the oldest person at time t is that parameter a of the gamma–Gompertz–Makeham distribution function Ft-u in the exponent of (5) also depends on the integration variable u.

In our approximation we fix the value of the parameter a of the ΓG distribution in ht(x) in (5) to a value which is equal to a0(t-d) in (24) with some delay d. The delay d is chosen so that the mean age of the oldest person computed using a0(t-d) as parameter a for all times in the ΓG distribution function in (5) is equal to the same value d. For a given t, this value of d can be obtained as the fixed point of the contraction map

d0xe-xλ(t-u)(1-Ft-d(u))duλ(t-x)(1-Ft-d(x))dx 28

which provides a reasonable approximation for the mean age of the world’s oldest person. For the comparison with the data and for the prediction, we use the model with entry age E=0. Hence in (28), the function λ is given in (22) with E=0 and Ft-d is the ΓG distribution function with parameters given by the E=0 values in Table 2 and with a=a0(t-d) in (24). This approximation is reasonable because the distribution of the age of the oldest person is highly concentrated.

The fixed point of the map in (28) as the expected age of the world’s oldest person can be found in a few steps of iterations. We show the result on Fig. 1. We applied 10 iterations using the fitted model parameters for each year between 1955 and 2019. By computing the standard deviation of the age of the oldest person as well we obtain the mean and a confidence interval for the age.

The predictions for the age distribution of the world’s oldest person in the future shown on Fig. 2. We obtained them by using the exact age distribution formula in section "Exact distribution of the oldest person’s age process" along with the numerical values of the parameters C,κ,α,K,γ given in Tables 1 and 2 for entry age E=0.

In our model, the distribution function of the age of the world’s oldest person is given in (7) where the ΓG distribution function can be substituted with the estimated parameter values at any time. In this way, the probability of observing an age greater than or equal to Calment’s or Knauss’ actual age at the time of their death can be computed exactly. The numerical values are 0.000286 for Calment and it is 0.0116 for Knauss.

The backtesting mentioned in the Results section is performed as follows. We estimated the best parameter values with entry age 0 based on the reduced data on the world’s oldest person between 1955 and 1988 where the ending date is the time when Calment became the world’s oldest person. The resulting parameters α=0.01516,K=0.00002064,γ=0.08413 are numerically not very far from the optimal parameters in Table 2 but the difference is more visible on Fig. 1. The figure shows the model-based mean age and confidence interval for the age of the world’s oldest person computed using the full data as well as the data until 1988.

For the reign length of record holders, we again used the expected age at a given time obtained as the fixed point of the iteration in (28). The numerical value of the expected reign length obtained from the iteration is 1.195 in 1955 and it is 1.188 in 2019. The empirical value of the reign length is 1.008 computed from the data by dividing the total length of the time interval between 1955 and 2019 by the number of record holders.

Methods

In this section, we provide supplementary information related to the main result of this paper. We perform explicit computations with the gamma–Gompertz–Makeham model and we express the integral of the survival function in terms of a hypergeometric function.

The cumulative distribution function and the density of the gamma–Gompertz–Makeham (ΓGM) distribution are given by

Fa,b,c,γΓGM(x)=1-e-cx1+aγb(ebx-1)1/γ,fa,b,c,γΓGM(x)=e-cx1+aγb(ebx-1)1+1γc(b-aγ)+a(b+cγ)ebxb 29

for x0 where a,b,c,γ are positive parameters. The positivity of parameters implies the finiteness of all moments and, in particular, the convergence of the integral of the survival function x(1-Fa,b,c,γΓGM(u))du. In the homogeneous model, the integral of the survival function appears in the density of the distribution of Yt in (9) and in the stationary density of the peaks process in (14). We show below that in the gamma–Gompertz–Makeham model the integral of the survival function can be computed explicitly and it is given by

x1-Fa,b,c,γΓGM(u)du=baγ1/γe-(c+bγ)xbγ+c2F11γ,1γ+cb;1+1γ+cb;aγ-baγe-bx 30

where 2F1(a,b;c;z) is the hypergeometric function. See 15.1.1 in29 for the definition and properties.

We prove (30) based on the following integral representation 15.3.1 in29 of the hypergeometric function

2F1(a,b,c,z)=Γ(c)Γ(b)Γ(c-b)01tb-1(1-t)c-b-1(1-tz)-adt 31

which holds whenever Re(c)>Re(b)>0. First we prove an identity for complex parameters α,β,δ which satisfy Re(α+β)>0 and we compute

xe-βu(1+δeu)αdu=1δαxe-(α+β)u(1+e-u/δ)αdu=e-(α+β)xδα01yα+β-11+e-xy/δ-αdy=e-(α+β)x(α+β)δα·2F1α,α+β;1+α+β;-e-xδ 32

where we applied a change of variables y=ex-u in the second equality above and we applied the hypergeometric identity (31) in the last equality with a=α, b=α+β, c=1+α+β, z=-e-x/δ together with the observation that with these values of the parameters the prefactor of the integral on the right-hand side of (31) simplifies to α+β. Note that the condition Re(c)>Re(b)>0 for (31) to hold is satisfied by our assumption Re(α+β)>0 which also makes the integrals in (32) convergent.

Next we show (30) using (32) as follows. We write

x1-Fa,b,c,γΓGM(u)du=11-aγb1/γxe-cu1+aγb-aγebu1/γdu=11-aγb1/γbbxe-cv/b1+aγb-aγev1/γdv=11-aγb1/γbe-1γ+cbbx1γ+cbaγb-aγ1/γ2F11γ,1γ+cb;1+1γ+cb;aγ-baγe-bx 33

where we applied the change of variables v=bu in the second equality above and we used (32) with α=1/γ, β=c/b, δ=aγ/(b-aγ) and with x replaced by bx. The right-hand side of (33) simplifies to that of (30).

Acknowledgements

We thank Katalin Kovács for some useful advice at an early stage of the project which led to this collaboration. The work of Cs. Kiss and B. Vető was supported by the NKFI (National Research, Development and Innovation Office) grant FK142124. B. Vető is also grateful for the support of the NKFI grant KKP144059 “Fractal geometry and applications” and for the Bolyai Research Scholarship of the Hungarian Academy of Sciences. L. Németh was supported by MaRDI, funded by the Deutsche Forschungsgemeinschaft (DFG), project number 460135501, NFDI 29/1 “MaRDI – Mathematische Forschungsdateninitiative.

Author contributions

B.V. initiated the research. Cs.K. and B.V. derived the model, prepared the scripts and figures, and carried out the estimations. Cs.K., L.N. and B.V. analyzed the results and wrote the manuscript.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data availibility

The titleholder data are freely available at https://grg.org/Adams/C.HTM

Competing interest

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Oeppen, J. & Vaupel, J. W. Broken limits to life expectancy. Science296, 1029–1031 (2002). 10.1126/science.1069675 [DOI] [PubMed] [Google Scholar]
  • 2.Canudas-Romo, V. Three measures of longevity: Time trends and record values. Demography47, 299–312 (2010). 10.1353/dem.0.0098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Vaupel, J. W., Villavicencio, F. & Bergeron-Boucher, M.-P. Demographic perspectives on the rise of longevity. Proc. Natl. Acad. Sci.118, e2019536118 (2021). 10.1073/pnas.2019536118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gompertz, B. On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies. Philos. Trans. R. Soc. Lond.115, 513–583 (1825). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Németh, L. & Missov, T. I. Adequate life-expectancy reconstruction for adult human mortality data. Plos One13, e0198485 (2018). 10.1371/journal.pone.0198485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Missov, T. I., Németh, L. & Dańko, M. J. How much can we trust life tables? sensitivity of mortality measures to right-censoring treatment. Palgrave Commun.2, 1–10 (2016). 10.1057/palcomms.2015.49 [DOI] [Google Scholar]
  • 7.Maier, H., Jeune, B. & Vaupel, J. W. Exceptional Lifespans (Springer, 2021).
  • 8.Dang, L. H. K. et al. The question of the human mortality plateau. Demogr. Res.48, 321–338 (2023). 10.4054/DemRes.2023.48.11 [DOI] [Google Scholar]
  • 9.Vijg, J. & Le Bourg, E. Aging and the Inevitable Limit to Human Life Span. Gerontology63, 432–434 (2017). 10.1159/000477210 [DOI] [PubMed] [Google Scholar]
  • 10.Alvarez, J.-A., Villavicencio, F., Strozza, C. & Camarda, C. G. Regularities in human mortality after age 105. PloS One16, e0253940 (2021). 10.1371/journal.pone.0253940 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Belzile, L. R., Davison, A. C., Gampe, J., Rootzén, H. & Zholud, D. Is there a cap on longevity? A statistical review. Ann. Rev. Stat. Appl.9, 21–45 (2022). 10.1146/annurev-statistics-040120-025426 [DOI] [Google Scholar]
  • 12.Rootzén, H. & Zholud, D. Human life is unlimited-but short. Extremes20, 713–728 (2017). 10.1007/s10687-017-0305-5 [DOI] [Google Scholar]
  • 13.Barbi, E., Lagona, F., Marsili, M., Vaupel, J. W. & Wachter, K. W. The plateau of human mortality: Demography of longevity pioneers. Science360, 1459–1461 (2018). 10.1126/science.aat3119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Modig, K., Andersson, T., Vaupel, J., Rau, R. & Ahlbom, A. How long do centenarians survive? Life expectancy and maximum lifespan. J. Internal Med.282, 156–163 (2017). 10.1111/joim.12627 [DOI] [PubMed] [Google Scholar]
  • 15.Wilmoth, J. R. & Robine, J.-M. The world trend in maximum life span. Popul. Dev. Rev.29, 239–257 (2003). [Google Scholar]
  • 16.Gavrilov, L. & Gavrilova, N. Mortality measurement at advanced ages: A study of the social security administration death master file. North Am. Actuarial J.15, 442–447 (2011). 10.1080/10920277.2011.10597629 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Newman, S. J. Errors as a primary cause of late-life mortality deceleration and plateaus. PLoS Biol.16, e2006776 (2018). 10.1371/journal.pbio.2006776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Camarda, C. G. The curse of the plateau. measuring confidence in human mortality estimates at extreme ages. Theor. Popul. Biol.144, 24–36 (2022). [DOI] [PubMed]
  • 19.Missov, T. I. & Vaupel, J. W. Mortality implications of mortality plateaus. Siam Rev.57, 61–70 (2015).
  • 20.Gbari, S., Poulain, M., Dal, L. & Denuit, M. Extreme value analysis of mortality at the oldest ages: A case study based on individual ages at death. North Am. Actuar. J.21, 397–416 (2017). 10.1080/10920277.2017.1301260 [DOI] [Google Scholar]
  • 21.Hanayama, N. & Sibuya, M. Estimating the upper limit of lifetime probability distribution, based on data of Japanese centenarians. J. Gerontol. Ser. A: Biomed. Sci. Med. Sci.71, 1014–1021 (2016). 10.1093/gerona/glv113 [DOI] [PubMed] [Google Scholar]
  • 22.Einmahl, J. J., Einmahl, J. H. & de Haan, L. Limits to human life span through extreme value theory. J. Am. Stat. Assoc.114, 1075–1080 (2019). 10.1080/01621459.2018.1537912 [DOI] [Google Scholar]
  • 23.Li, J. & Liu, J. A modified extreme value perspective on best-performance life expectancy. J. Popul. Res.37, 345–375 (2020). 10.1007/s12546-020-09248-8 [DOI] [Google Scholar]
  • 24.Milholland, B. Jeanne Calment, actuarial paradoxography and the limit to human lifespan. Rejuvenat. Res.23, 17–18 (2020). 10.1089/rej.2019.2272 [DOI] [PubMed] [Google Scholar]
  • 25.Group, G. R. World’s oldest person titleholders since 1955. https://grg.org/Adams/C.HTM (2018). [Online; Accessed June 14, 2023].
  • 26.Brillinger, D. R. A biometrics invited paper with discussion: the natural variability of vital rates and associated statistics. Biometrics42, 693–734 (1986). 10.2307/2530689 [DOI] [PubMed] [Google Scholar]
  • 27.Nelder, J. A. & Mead, R. A Simplex Method for Function Minimization. Comput. J.7, 308–313 (1965). 10.1093/comjnl/7.4.308 [DOI] [Google Scholar]
  • 28.Kiss, C. Modelling the age of the oldest person in the world. https://github.com/csabi0312/modelling-the-age-of-the-oldest-person-in-the-world (2024).
  • 29.Abramowitz, M. & Stegun, I. A. Pocketbook of Mathematical Functions (Verlag Harri Deutsch, Thun-Frankfurt, 1984). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The titleholder data are freely available at https://grg.org/Adams/C.HTM


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES