Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Aug 1.
Published in final edited form as: Math Biosci. 2006 Dec 5;208(2):538–551. doi: 10.1016/j.mbs.2006.11.006

STOCHASTIC MODEL FOR ANALYSIS OF LONGITUDINAL DATA ON AGING AND MORTALITY

Anatoli I Yashin 1, Konstantin G Arbeev 1, Igor Akushevich 1, Aliaksandr Kulminski 1, Lucy Akushevich 1, Svetlana V Ukraintseva 1
PMCID: PMC2084381  NIHMSID: NIHMS14623  PMID: 17300818

Abstract

Aging-related changes in a human organism follow dynamic regularities, which contribute to the observed age patterns of incidence and mortality curves. An organism’s “optimal” (normal) physiological state changes with age, affecting the values of risks of disease and death. The resistance to stresses, as well as adaptive capacity, declines with age. An exposure to improper environment results in persisting deviation of individuals’ physiological (and biological) indices from their normal state (due to allostatic adaptation), which, in turn, increases chances of disease and death. Despite numerous studies investigating these effects, there is no conceptual framework, which would allow for putting all these findings together, and analyze longitudinal data taking all these dynamic connections into account. In this paper we suggest such a framework, using a new version of stochastic process model of aging and mortality. Using this model, we elaborated a statistical method for analyses of longitudinal data on aging, health and longevity and tested it using different simulated data sets. The results show that the model may characterize complicated interplay among different components of aging-related changes in humans and that the model parameters are identifiable from the data.

Keywords: stochastic process model of aging and mortality, longitudinal data analysis, allostatic load, homeostenosis, stresses resistance, physiological norms

1. Introduction

Current studies of mortality rates deal not only with analyses of available life span data. A substantial part of this research is going on within an analytical framework aiming to derive observed features of mortality curves using respective theoretical concepts. These studies include evolutionary biology theory of senescence [1], mutation-selection balance [2], reliability theory [3,4], and economic theory [5], among others. Such abstract analyses became possible because of remarkable regularities revealed in the shape of the mortality curve: the decline in the childhood, the exponential increase in the adult ages and the tendency to deceleration and leveling off at the oldest old ages. The studies intend to explain these regularities as natural phenomena resulting from the postulates of the respective theories.

Researchers studying aging/health/longevity still argue about regularities of aging-related deterioration in health and well-being status in humans. The lack of consensus in this area delays development of comprehensive models and theory. The revealed findings and regularities of the aging process remain largely disconnected and the high potential of data collected in many longitudinal studies remains underused. The typical situation is that only portions of available data are currently used and most of them are analyzed separately using formal statistical methods.

Such methods, however, largely ignore current knowledge and theory about aging in the process of data analyses. Meanwhile, it is clear that the progress in developing such modeling concepts would make combining distinct subsets of large-scale longitudinal data possible. The joint analysis of such data would allow for systemic addressing the issues on comprehensive interplay between different aging-related changes in a human organism. Ultimately this approach would help making a complete picture out of available mosaic details, findings, and regularities of aging-related changes in humans.

Overcoming these methodological problems is the matter of time. Development of several interesting concepts capable of capturing fundamental features of aging-related changes is now underway. They are related to the notion of allostatic load [6], the decline in adaptive capacity (homeostenosis) [7,8], the decline in resistance to stresses [9], the aging-related physiological norms, and heterogeneity in longitudinal data.

The formal approaches to the joint analysis of longitudinal and time-to-event data have been also developed during the last decades (see recent reviews in [1012]). Joint longitudinal-survival models are the models that describe the joint behavior of the process generating the observed measurements and the survival (generally, time-to-event) process. The first, “longitudinal,” process is observed at respective times of measurement and the second, “survival,” process generates (possibly censored) event times that depend on the observations of the longitudinal process. The frequently used approach for a univariate case assumes that the longitudinal data follow a linear mixed-effects model [13] and that the hazard depends both on the random effects and other time-independent covariates through a Cox proportional hazard relationship [1416]. Xu and Zeger [17] extended the model using the generalized linear model for the longitudinal process to allow for continuous or discrete covariates. Wang and Taylor [18] included a stochastic (an integrated Ornstein-Uhlenbeck) process into the model of longitudinal data to allow for random fluctuations of individual measurements around the population average. In Henderson et al. [19], a latent bivariate Gaussian process is introduced as a time-dependent variable in a proportional hazard model. Multivariate generalizations of such methods and the estimation procedures have been suggested recently [2025].

The models mentioned so far use the Cox proportional hazards to characterize the relationship between the longitudinal and survival data. There are, however, many cases where the proportionality assumption fails. For such situations other models need to be used. Tseng et al. [26] used an accelerated failure time survival model as an alternative to the Cox model with longitudinal covariates following a linear mixed-effects model with measurement errors. Song and Huang [27] used a joint longitudinal-survival model with an additive hazard, where time-dependent covariates measured with errors are added to the baseline hazard.

An important class of models for analyses of longitudinal data is based on a biologically-motivated assumption of a quadratic hazard which is justified by J- or U-shapes of hazards considered as functions of risk factors observed in epidemiological studies [28]. These models were developed and intensively used in the studies of longitudinal data [2933]. The advantageous feature of this approach is that it allows for incorporation of the new insights and ideas appearing in the course of research on aging.

In this paper we propose a new model of health, mortality, and aging, which will further develop this biologically-motivated approach by including all four major concepts of aging known to date, i.e., the notions of the age-dependent physiological norms, allostatic load, adaptive capacity and resistance to stress, and investigate the potential for the model application to the analysis of longitudinal data.

The reminder of the paper is organized as follows. Section 2 presents the model and outlines the estimation procedure. Section 3 describes the results of a simulation study checking the estimation procedure and the model performance. The last section summarizes the results and discusses perspectives of further research in this area.

2. Model

2.1. General description

The arguments discussed above allow us to formulate requirements for the model capable of connecting different aspects of aging and explore respective links in the analyses of longitudinal data. It is clear that it should be a dynamic model capable of describing random differences between individual trajectories of physiological or other indices. The values of such indices have to affect health or mortality risks. The model should be capable of describing the J-, or U-shape of the risk considered as a function of risk factors. The age trajectory in physiological space, for which the minimum value of the risk function is reached, will characterize the age-dependent physiological norm. Persistent deviations from the norm will characterize effects of allostatic adaptation and the magnitudes of such deviations for each physiological index will be associated with components of allostatic load. The narrowing of the U-shape of the risk function with age will characterize the decline in stress resistance. The dynamic model must include a feedback mechanism with coefficients of homeostatic regulation. The age-related changes in these coefficients will characterize the decline in adaptive capacity. To meet these requirements, we suggest the stochastic process model for continuously changing risk factors in the form of stochastic differential equation:

dYt=a(t)(Yt-f1(t))dt+b(t)dWt,Y0. (1)

Here Yt (t is age) is a k-dimensional stochastic process describing continuously changing vector of risk factors (e.g., physiological characteristics), which approximates a human organisms’ functional state, Wt is a vector Wiener process with independent components, which describes exogenous challenges affecting these covariates. The process Wt is independent of the initial vector Y0 with normally distributed components. Note that such a model preserves the Gaussian property: in the case of initial Gaussian distribution for Y0, the distribution of Yt among survivors is also Gaussian ([30,31,34]). Thus, our assumption on a Gaussian initial distribution defines the structure of the entire process Yt. The strength of disturbances of Wt is characterized by a matrix of diffusion coefficients b(t).

The vector-function f1(t) (having the same dimension as a vector Yt) describes a functional state that organisms subject to allostasis [35] are forced to follow by the process of adaptive regulation at age t. This trajectory reflects aging-related changes in the organism’s functioning due to the average effect of a complicated interplay among the ontogenetic program, senescence, and environmental stresses exceeding limits of the homeostatic regulation in human organisms.

The matrix a(t) characterizes the rate of adaptive regulation. Specifically, the elements of matrix a(t) regulate age trajectories of the components of the physiological state approximated by the vector Yt, i.e., the elements of matrix a(t) characterize the rate of the adaptive response for any deviation of a physiological index from the state f1(t) which an organism tends to follow. The important feature of aging – the decline in adaptive capacity – has never been measured directly in longitudinal studies of aging, health and longevity. The use of matrix a(t) in our model allows for evaluating this effect. For example, in a simplified one-dimensional case, in the absence of random disturbances in equation (1) (b(t) = 0, for all t), and constant negative a(t) = a for all t, the parameter a is the coefficient of negative feedback in the equation for Yt, which keeps the trajectory of Yt close to f1(t). When f1(t) = f1 is constant for all t, the value of Yt asymptotically approaches f1. The higher is the absolute value of a, the closer is Yt to f1, and the faster Yt tends to f1 in case of disturbances. That is why the value a(t) characterizes adaptive capacity. When the absolute value of coefficient a(t) declines with age, more time is needed for the trajectory of Yt to approach f1(t) at old ages compared to younger ages.

The process Yt can be stopped randomly at time T. The conditional distribution of T given trajectories of Yu, 0 ≤ut is completely characterized by the conditional hazard μ(t, Yt), which is assumed to be:

μ(t,Yt)=μ0(t)+(Yt-f(t))Q(t)(Yt-f(t)). (2)

Here μ0(t) is the background hazard characterizing the mortality rate which would remain if a vector of covariates Yt follows the optimal trajectory coinciding with f(t). An asterisk in (2) and in formulas below denotes the transposition of respective vectors or matrices. Matrix Q(t) is a non-negative-definite symmetric matrix of respective dimension (k × k). Note that we assume column vectors throughout the text, thus the transposition of vectors in (2) is needed to produce the respective scalar quadratic term in the hazard rate. The one-dimensional version of (2) is

μ(t,Yt)=μ0(t)+μ1(t)(Yt-f(t))2, (3)

where μ1(t) is some non-negative function of age t.

The vector-function f(t) in (2) (or, correspondingly, the scalar function f(t) in (3)) is introduced to explicitly characterize age-related changes in the “optimal” physiological state corresponding to the minimum of hazard at a given age. It has a meaning of the age-dependent norm for a given functional state. It may differ from f1(t) since the process of allostatic adaptation does not necessarily results in the optimal physiological state. Thus, the difference between f1(t) and f(t) provides the measure of the allostatic load.

2.2. Estimation procedure

The model can be estimated using the maximum likelihood method. The survival function associated with the life span distribution is P(T>t)=exp(-0tμ¯(u)du), where the marginal (unconditional) hazard μ¯(u) has the form (see [32,36,37]):

μ¯(u)=μ0(u)+(m(u)-f(u))Q(u)(m(u)-f(u))+Tr(Q(u)γ(u)). (4)

Here Tr denotes the trace of a matrix and m(u) and γ(u) satisfy the following system of ordinary nonlinear differential equations:

dm(t)/dt=a(t)(m(t)-f1(t))-2γ(t)Q(t)(m(t)-f(t)),m(0),dγ(t)/dt=a(t)γ(t)+γ(t)a(t)+b(t)b(t)-2γ(t)Q(t)γ(t),γ(0). (5)

Here m(0) and γ(0) are the mean and the variance/covariance matrix of the normal distribution of initial vector Y0. Note that in such a model the distribution of Yt among survivors is also Gaussian at any age t ([30,31,34]). The mean and the variance/covariance matrix of this distribution at age t are given by m(t) and γ(t), respectively.

Let the sequence yt0ii,yt1ii,...,ytniii,τi represent the results of measurements of the process Yt and the life span (which may be censored) related to the ith individual. The likelihood function for N individuals is (see [32,36,37]):

i=1Nj=0ni(τi)(2π)-/2kγi(tji-)-/21exp{-12(ytjii-mi(tji-))γi(tji-)-1(ytjii-mi(tji-))}×μ¯i(τi)δiexp{-0τiμ¯i(u)du}. (6)

Here δi is a censoring indicator, mi (t) and γi (t) satisfy equations (5) at the intervals [t0i,t1i);[t1i,t2i);...;[tni-1i,tnii);[tnii,τi) with the initial conditions yt0ii,yt1ii,...,ytniii, respectively. Here mi(tji-)=limttjimi(t), and γi(tji-)=limttjiγi(t), and tnii is the age of the latest measurement of a functional state before death/censoring at τi. Maximization of this likelihood function generates parameter estimates that characterize the dynamics of stochastic process Yt describing trajectories of physiological aging. Note that the observed values yt0ii,yt1ii,...,ytniii are used as initial conditions for differential equations (5) at the beginning of subsequent intervals between the observation times. Therefore, the individual trajectories of mi (t) and γi (t) differ for different individuals. Consequently, the estimates of the chances of death for individuals having different observed values of the respective covariates also differ.

Note that, as mentioned above, the conditional distribution of Yt among survivors is Gaussian, whereas most of physiological indices that can be represented by Yt are positive. Thus, appropriate restrictions on parameters of respective functions defining the dynamics of Yt are necessary in the estimation procedure. In addition to this, restrictions are necessary for functions in the mortality risk (2). These restrictions should reasonably impose constraints on parameters of: a) an initial distribution Y0 (to ensure a negligible probability of negative values); b) functions f1(t) and f(t) (to ensure non-negative values for each age); c) matrix a(t) (to ensure that the feedback coefficient in (1) does not become too small and the trajectories of Yt tend to f1(t)); d) the background hazard μ0(t) (to ensure non-negative values for each age and a non-decreasing age pattern); and e) matrix Q(t) (to ensure that the matrix is non-negative definite for each age).

3. Simulation study

We performed a simulation study to check performance of the model in one-dimensional case. In computer simulations, we used a discrete-time version of the model (1), (3). We assumed that the background mortality in (3) is the Gompertz hazard, μ0(t)=aμ0ebμ0(t-tmin), where tmin = 30. The trajectories f1(t) and f(t) are approximated by linear functions, f1(t) = af1 + bf1 (ttmin), f(t) = af + bf (ttmin). The functions a(t) and b(t) are assumed constant, a (t) = ay, b (t) = σ1. The initial distribution of Yt0 is normal with the mean f1 (t0) and the variance σ02. We considered the case of non-symmetric dependency of mortality on deviations of values of the process Yt from the trajectory f(t), assuming that μ1(t)={μ11(t),if Ytf(t)μ12(t),if Yt>f(t), where μ1j (t) = aμ1j + bμ1j (ttmin) j = 1, 2. Parameters to be estimated in this model are: aμ0, bμ0, aμ11, bμ11, aμ12, bμ12, aY, σ0, σ1, af1, bf1, af, and bf. The age at entry into the study was simulated as a discrete random variable uniformly distributed over the interval from 30 to 60 years. The interval between observations of Yt equals 2 years. The number of observations (surveys) is 25. This structure resembles the Framingham Heart Study (FHS) data [38]. The simulated values of parameters were taken similar to those obtained in our preliminary analyses of data on diastolic blood pressure for females in the FHS. We simulated 100 data sets with 2500 individuals in each data set (which is approximately equal to the number of females in the FHS data) and estimated the discrete model for different data sets using the MATLAB’s optimization toolbox [39,40]. Mean values, standard deviations and 95% ranges of the estimated parameters are presented in Table 1.

Table 1.

Means, standard deviations (st. dev.) and 95% lower (LCL) and upper (UCL) limits of parameter estimates in 100 simulated data sets

aμ0 · 104 bμ0 aμ11· 104 bμ11 · 104 aμ12 · 104 bμ12 · 104 aY σ0 σ1 af1 bf1 af bf
Mean 1.052 0.109 0.968 0.012 0.495 0.0012 −0.140 12.980 6.005 85.014 −0.200 77.125 −0.065
St. dev. 0.224 0.004 0.307 0.010 0.094 0.0034 0.002 0.173 0.020 0.246 0.008 1.302 0.044
LCL 0.678 0.103 0.379 −0.006 0.325 −0.0054 −0.143 12.604 5.958 84.577 −0.219 74.603 −0.157
UCL 1.501 0.117 1.569 0.031 0.669 0.0066 −0.136 13.281 6.043 85.558 −0.186 79.489 0.008
True values 1 0.11 1 0.01 0.5 0.001 0.14 13 6 85 0.2 77 0.06

The estimated age trajectories of ln μ0(t), μ11(t), μ12(t), f1(t), f(t) and the initial distribution of Yt0 (p(Yt0)) for 100 simulated data sets are shown in Fig. 1.

Fig. 1.

Fig. 1

Estimated trajectories (grey lines) of lnμ0(t) (left top panel), μ11(t) (right top panel), μ12(t) (left middle panel), f1(t) (right middle panel), f (t) (left bottom panel) and p (Yt0) (right bottom panel) for 100 simulated data sets. True trajectories are shown as black lines.

The table and the figure show that the parameters related to the dynamics of Yt(aY, σ0, σ1, af1, bf1) and the background mortality (aμ0, bμ0) are estimated better than those related to the optimal trajectory f(t) (af ,bf). For example, the standard deviations of af and bf are about five times larger than those of af1 and bf1. As the result, the trajectory of f1(t) is estimated better than that of f(t). Nevertheless, the means of all parameters are close to the true values. Fig. 2 illustrates the estimated (mean for 100 data sets) and true mortality (μ(t, Yt)) and relative risk (RR(t, Yt)) over age (t) and values of Yt.

Fig. 2.

Fig. 2

Estimated (mean for 100 data sets) and true mortality (μ(t, Yt)) and relative risk (RR(t, Yt)). Thick lines denote the “optimal” age trajectory of a physiological index (f(t)).

The figure shows that both estimated and true parameters specify two key features of dependence of mortality and relative risk on age and values of Yt: the narrowing of U-shape of mortality and the decrease in the relative risk with age.

4. Discussion

Many researchers involved in population studies of aging and longevity continue considering mortality curves, as an important source of information about the rate of individual aging. In particular, the slope of the logarithm of the mortality curve is often associated with the aging rate at the adult and old ages. Such interpretations may be misleading: the changes in the slope and in many other features of mortality curve may occur because of many other reasons, which have nothing to do with the aging process [41]. Data on many such factors dynamically affecting chances of survival are collected in longitudinal data. Moreover, these data contain information about various aspects of senescence process and can be used for evaluating respective effects. That is why in situations characterized by the wide spectrum of longitudinal data and findings on aging, making speculations about the meaning of different properties of the mortality curve and ignoring the presence of other relevant information about aging-related changes affecting survival chances would be methodologically incorrect.

The urgent need for modifying a traditional experimental paradigm that associates the features of individual aging with properties of the age pattern of the mortality curve without developing an appropriate biological background has been also emphasized by Manton and Yashin [33]. The urgency stems from the lack of balance between data and theory currently existing in the area of research on aging: the abundance of the data and findings about aging and the development of aging-related disorders in humans and laboratory animals on the one hand and the weakness of methodological and theoretical concepts guiding the collection and analysis of data on the other hand. It became clear that working with data collected in human longitudinal studies of aging and longevity requires new models capable of not only describing mortality linked with longitudinally measured physiological or health indices. These models must have the ability to describe connections and evaluate joint effects of senescence, ontogenetic program, and environmental stresses in aging-related changes in health/well-being/survival characteristics measured in longitudinal studies.

An important attempt to connect the Gompertz model of mortality rate with the model describing longitudinal data has been performed using the quadratic hazard model of human mortality and aging (see [33] and references therein). The conditional mortality rate in this model is represented in the form

μ(t,Yt)=YtQYteθt, (7)

where Yt=(1,Yt) is an extended vector of covariates Yt, t is age, Q is an extended (constant) matrix, and θ is the Gompertz’s parameter [33].

In applications of this model to longitudinal data, the estimated value of the parameter θ has always been smaller than the respective parameter in the Gompertz’s model describing the total mortality rate evaluated for the same data. The reduction of the Gompertz’s growth parameter estimated in the presence of observed covariates has been interpreted as an effect of measurements: the new (reduced) value of parameter θ characterized the component of aging-related increase in the mortality rate remaining unexplained in this scheme of observations (i.e., which occurred due to unobserved processes). The difference between the old and the new estimates of θ characterized explanatory power of the observed covariates. To make this point clearer, we provide an equivalent formulation of the original one-dimensional quadratic hazard model:

μ(t,Yt)=(μ0+μ1(Yt-c)2)eθt=μ0eθt+μ1eθt(Yt-c)2, (8)

where μ0, μ1, and c are constants. The first term has been interpreted as a part of the Gompertz’s mortality remained after effects of observed covariates have been taken into account (the second term). Equation (8) clearly shows implicit assumptions used in the formulation of the original quadratic hazard model – a competing risks model of the hazard rate with two mortality components. Specifically, the first component is an exponentially increasing function of age. The second risk is a product of an exponentially increasing function of age and a quadratic function of observed covariates. The important limitation of this model is that the exponential multipliers in both components of risk function are the same. The second limitation is that the minimum of the second (quadratic hazard) term is reached at the constant level of observed covariates.

The model proposed in this paper is free of these two basic limitations. This can be seen in its one-dimensional formulation where the quadratic hazard is given by (3). First, our model assumes that the minimum value of the risk function can change over age. This is a realistic assumption since in epidemiologic and medical practice specialists often operate with the notion of age-dependent “norm”, i.e., values of the physiological indices that are “optimal” for a given age. The “optimal” age trajectory of physiological (or, more broadly, functional) state, f(t) (i.e., the trajectory for which the risk of death takes its minimum value), is explicitly included in the model description (compare equations (3) and (8)). Importance of this extension is that it allows one to statistically test hypotheses on an optimal physiological trajectory and, thus, rigorously justify respective age-dependent physiological “norms” of physiological state corresponding to the minimum of the hazard. The model also allows for evaluating the “price” for deviations from this norm in terms of mortality increase at different ages.

Second, our model distinguishes between the contribution of an additive term μ0 (t) and the multiplier of the quadratic hazard μ1 (t). Assuming differences between these functions allows for a completely new interpretation of these coefficients, which clarify the contribution of measured indices to mortality and evaluate the effects of senescence on behavior of risk functions. The function μ0(t) can be considered as a component of competing risks associated with death from factors other than those involved in the quadratic term (i.e., unmeasured factors). The risk μ0(t) is supposed to be smaller than the total mortality calculated when observed covariates are ignored. Therefore, μ0(t) characterizes mortality remaining after all observed covariates follow the “optimal” trajectory and its interpretation remains similar to that used in the original model. Thus, this model allows us to evaluate a potential decline in the mortality rate (or an increase in the life expectancy) which would happen when the risk associated with all observed covariates is eradicated.

The term μ1(t) captures changes in the mortality risk associated with deviations of the observed index Yt from its normal age trajectory. It shows how the shape of the risk function changes with age. Since the quadratic hazard captures the U- (or J-) shape of the mortality risk considered as a function of risk factors, the increasing pattern of μ1(t) indicates how the branches of respective U-shaped risk function are getting steeper with age. This means that the range of tolerant deviations of the resultant risk factor from its “optimal” value is getting narrower with age, which, in turn, is an indicator of decline in stress resistance with age.

Evaluating such behavior of risk functions in human data is extremely important to capture the connections between senescence, longevity, and stress-resistance. Although many aspects of such connections have been investigated in experimental animal studies [9], they were not adequately addressed for humans. Thus, the rate of increase in μ1(t) (not the slope of the logarithm of the mortality curve) may characterize the rate of senescence. Thus, the important methodological finding is that the model is transformed to the form where effects of senescence on survival, longevity and disease development may be evaluated from longitudinal data.

The modified model includes the earlier version (8) as a particular case. The similarity between μ0(t) and μ1(t) can be easily tested using the likelihood ratio test. Changes in μ1(t) with age indicate age-related narrowing of the U-shape of mortality risks (considered as function of Yt), which is interpreted as a decline in stress resistance with age. One should expect that a similar decline in resistance to stresses represented by deviations of other (unobserved) risk factors from their norms also takes place, and contributes to an increase in the baseline hazard μ0(t) Thus, both functions μ1(t) and μ0(t) reflect effects of senescence process but capture its different aspects.

Incorporating functions f1(t), f(t) and matrix a(t) into the extended model allows one to test various hypotheses about dynamics of aging-related changes in a human organism. For example, one can test whether the age trajectory of physiological state, which an organism is forced to follow by the process of allostasis, f1(t), coincides with the “optimal” trajectory corresponding to the minimal mortality at respective ages, f(t). The rejection of respective null-hypothesis would indicate that the processes of compensatory adaptation and remodeling do not tend to minimize the mortality risk. A testable hypothesis is also the one that the observed mean age-trajectories of the covariates in a population coincide with the age-dependent norm f(t). Estimating the model with fixed and non-fixed a(t), one can test the hypothesis about aging-related changes in the “adaptive capacity” of a human organism, i.e., how the rate of the adaptive response for any deviation of physiological indices from the “prescribed” state f1(t) changes with age. Specifications of different forms of the matrix Q(t) allow for an analysis of relationships between covariates and evaluation of their joint effects on the mortality risk.

Contrary to the traditional association of the aging rate with the slope of the logarithm of the mortality curve, it seems more appropriate to relate the rate of decline in stress resistance (associated with the function μ1(t) with this process. This function, however, characterizes the aspects of senescence associated with a selected risk factor. More generally, the increasing role of senescence in the mortality risk could be captured by the widening pattern of the U-shape of the relative risk, which would indicate a faster increase in μ0(t) compared to μ1(t). Our analysis shows that both characteristics can be evaluated from the longitudinal data. The performed analysis shows that the proposed model can be effectively used for evaluating features of the aging-related changes from data collected in longitudinal studies.

It is important to note that functions f(t) and f1(t) introduced in the model are rough approximations of the reality. The physiological “norm” f(t), or the “optimal” age trajectory of physiological indices corresponds to the minimal value of the mortality risk at respective ages. The model assumes that all individuals have the same “optimal” age trajectory of physiological indices that depend solely on age. In some situations, this assumption looks oversimplified. For example, it is quite likely that the notion of “norm” may differ for individuals carrying different alleles or genotypes, or having different histories of life events (e.g., diseases, environmental exposures), etc. The lack of appropriate individual data is the only reason for considering f(t) as a “common” function for all individuals.

Similar considerations also take place for the function f1(t) defined as the age trajectory of physiological state which an organism is forced to follow by the process of allostasis. This function is also likely to be more personalized and depend on various genetic and non-genetic factors. However, empirical evaluation of such trajectories is limited by the absence of respective data.

These observations indicate the need for developing a more general methodology, which could incorporate individualized notions of “norms” and the adaptive response. For example, the trajectories of f(t) and f1(t) can be modeled as stochastic processes. Elaboration of respective models, specification of required data, and development of appropriate estimation procedures are beyond the scope of this paper.

The properties of the process Yt depend on the assumption about the initial value Y0. The model preserves the Gaussian property: in the case of initial Gaussian distribution for Y0, the distribution of Yt among survivors is also Gaussian. Note that such a distribution allows for negative values of respective variables with a positive probability, which is not appropriate for most physiological indices. In modeling practice, however, this property of Gaussian models does not limit the analyses. One can always consider a Gaussian approximation as a first description of the observed dynamic phenomenon. The advantage of such a model, however, is that it describes the probability distribution in terms of the first two moments, and, in this sense, it may serve as a two-moment approximation for many other distributions resulted from a non-Gaussian initial value or nonlinear effects in the physiological response.

Acknowledgments

The work on this paper was supported by the NIH/NIA grants 1R01 AG028259-01, 1RO1-AG-027019-01 and 5PO1-AG-008761-16. The authors thank two anonymous reviewers for valuable comments.

References

  • 1.Charlesworth B. Patterns of age-specific means and genetic variances of mortality rates predicted by the mutation-accumulation theory of ageing. J Theor Biol. 2001;210:47–65. doi: 10.1006/jtbi.2001.2296. [DOI] [PubMed] [Google Scholar]
  • 2.Steinsaltz D, Evans SN, Wachter K. A generalized model of mutation-selection balance with applications to aging. Advances in Applied Mathematics. 2005;35(1):16–33. [Google Scholar]
  • 3.Gavrilov LA, Gavrilova NS. Reliability theory of aging and longevity. In: Masoro EJ, Austad SN, editors. Handbook of the biology of aging. sixth. Academic Press; San Diego, CA, USA: 2006. pp. 3–42. [Google Scholar]
  • 4.Gavrilov LA, Gavrilova NS. The reliability theory of aging and longevity. J Theor Biol. 2001;213(4):527–545. doi: 10.1006/jtbi.2001.2430. [DOI] [PubMed] [Google Scholar]
  • 5.Lee RD. Rethinking the evolutionary theory of aging: transfers, not births, shape senescence in social species. Proc Natl Acad Sci USA. 2003;100(16):9637–9642. doi: 10.1073/pnas.1530303100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Seeman TE, McEwen BS, Rowe JW, Singer BH. Allostatic load as a marker of cumulative biological risk: MacArthur studies of successful aging. Proc Natl Acad Sci USA. 2001;98:4770–4775. doi: 10.1073/pnas.081072698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Troncale JA. The aging process. Physiologic changes and pharmacologic implications. Postgrad Med. 1996;99(5):111–114. 120–122. [PubMed] [Google Scholar]
  • 8.Lund J, Tedesco P, Duke K, Wang J, Kim SK, Johnson TE. Transcriptional profile of aging in C. elegans. Curr Biol. 2002;12(18):1566–1573. doi: 10.1016/s0960-9822(02)01146-6. [DOI] [PubMed] [Google Scholar]
  • 9.Semenchenko G, Khazaeli A, Curtsinger JW, Yashin A. Stress resistance declines with age: analysis of data from a survival experiment with Drosophila melanogaster. Biogerontology. 2004;5(1):17–30. doi: 10.1023/b:bgen.0000017681.46326.9e. [DOI] [PubMed] [Google Scholar]
  • 10.Tsiatis AA, Davidian MA. Joint modeling of longitudinal and time-to-event data: an overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]
  • 11.Yu M, Law NJ, Taylor JMG, Sandler HM. Joint longitudinal-survival-cure models and their application to prostate cancer. Statistica Sinica. 2004;14:835–862. [Google Scholar]
  • 12.Troxel AB. Techniques for incorporating longitudinal measurements into analyses of survival data from clinical trials. Stat Methods Med Res. 2002;11:237–245. doi: 10.1191/0962280202sm285ra. [DOI] [PubMed] [Google Scholar]
  • 13.Laird N, Ware J. Random-effects models for longitudinal data. Biometrics. 1982;38:963–974. [PubMed] [Google Scholar]
  • 14.Tsiatis A, De Gruttola V, Wulfsohn M. Modeling the relationship of survival to longitudinal data measured with error: Applications to survival and CD4 counts in patients with AIDS. J Am Stat Assoc. 1995;90:27–37. [Google Scholar]
  • 15.Faucett CL, Thomas DC. Simultaneously modeling censored survival data and repeatedly measured covariates: a Gibbs sampling approach. Stat Med. 1996;15:1663–1686. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1663::AID-SIM294>3.0.CO;2-1. [DOI] [PubMed] [Google Scholar]
  • 16.Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53:330–339. [PubMed] [Google Scholar]
  • 17.Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and time to events. Applied Statistics. 2001;50:375–388. [Google Scholar]
  • 18.Wang Y, Taylor J. Jointly modeling longitudinal and event time data with application to acquired immunodeficiency syndrome. J Am Stat Assoc. 2001;96:895–905. [Google Scholar]
  • 19.Henderson R, Diggle P, Dobson A. Joint modeling of measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]
  • 20.Song X, Davidian M, Tsiatis A. An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics. 2002;3:511–528. doi: 10.1093/biostatistics/3.4.511. [DOI] [PubMed] [Google Scholar]
  • 21.Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59:221–228. doi: 10.1111/1541-0420.00028. [DOI] [PubMed] [Google Scholar]
  • 22.Ha ID, Park T, Lee Y. Joint modeling of repeated measures and survival time data. Biometrical Journal. 2003;45:647–658. [Google Scholar]
  • 23.Ibrahim JG, Chen MH, Sinha D. Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine studies. Statistica Sinica. 2004;14:863–883. [Google Scholar]
  • 24.Ratcliffe SJ, Guo W, Ten Have TR. Joint modeling of longitudinal and survival data via a common frailty. Biometrics. 2004;60:892–899. doi: 10.1111/j.0006-341X.2004.00244.x. [DOI] [PubMed] [Google Scholar]
  • 25.Brown ER, Ibrahim JG, DeGruttola V. A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics. 2005;61:64–73. doi: 10.1111/j.0006-341X.2005.030929.x. [DOI] [PubMed] [Google Scholar]
  • 26.Tseng YK, Hsieh F, Wang JL. Joint modeling of accelerated failure time and longitudinal data. Biometrika. 2005;92:587–603. [Google Scholar]
  • 27.Song X, Huang Y. A corrected pseudo-score approach for additive hazards model with longitudinal covariates measured with error. Lifetime Data Anal. 2006;12:97–110. doi: 10.1007/s10985-005-7222-7. [DOI] [PubMed] [Google Scholar]
  • 28.Witteman JCM, Grobbee DE, Valkenburg HA, van Hemert AM, Stijnen T, Burger H, Hofman A. J-shaped relation between change in diastolic blood pressure and progression of aortic atherosclerosis. Lancet. 1994;343:504–507. doi: 10.1016/s0140-6736(94)91459-1. [DOI] [PubMed] [Google Scholar]
  • 29.Woodbury MA, Manton KG. A random walk model of human mortality and aging. Theor Popul Biol. 1977;11:37–48. doi: 10.1016/0040-5809(77)90005-3. [DOI] [PubMed] [Google Scholar]
  • 30.Yashin AI. Dynamics in survival analysis: conditional Gaussian property vs. Cameron-Martin formula. In: Krylov NV, Lipster RSh, Novikov AA, editors. Statistics and control of stochastic processes. Springer; New York: 1985. pp. 446–475. [Google Scholar]
  • 31.Yashin AI, Manton KG, Vaupel JW. Mortality and aging in heterogeneous populations: a stochastic process model with observed and unobserved variables. Theor Popul Biol. 1985;27:159–175. doi: 10.1016/0040-5809(85)90008-5. [DOI] [PubMed] [Google Scholar]
  • 32.Yashin AI, Manton KG. Effects of unobserved and partially observed covariate processes on system failure: a review of models and estimation strategies. Statistical Science. 1997;12(1):20–34. [Google Scholar]
  • 33.Manton KG, Yashin AI. Mechanisms of aging and mortality: searches for new paradigms. Monographs on Population Aging, 7. Odense University Press; Odense, Denmark: 2000. [Google Scholar]
  • 34.Yashin AI. Conditional Gaussian estimation of dynamic systems under jumping observations. Automation and Remote Control. 1980;5:618–626. (Translated from Russian) [Google Scholar]
  • 35.McEwen BS, Wingfield JC. The concept of allostasis in biology and biomedicine. Horm Behav. 2003;43:2–15. doi: 10.1016/s0018-506x(02)00024-7. [DOI] [PubMed] [Google Scholar]
  • 36.Yashin AI, Manton KG, Stallard E. Evaluating the effects of observed and unobserved diffusion processes in survival analysis of longitudinal data. Mathematical Modelling. 1986;7:1353–1363. [Google Scholar]
  • 37.Yashin AI, Manton KG, Stallard E. Dependent competing risks: a stochastic process model. J Math Biol. 1986;24:119–140. doi: 10.1007/BF00275995. [DOI] [PubMed] [Google Scholar]
  • 38.Dawber T, Meadors G, Moore F. Epidemiological approaches to heart disease: the Framingham Study. Am J Public Health. 1951;41:279–286. doi: 10.2105/ajph.41.3.279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Math Works, Inc. MATLAB: the language of technical computing. Programming. Version 7. Natick, MA: The Math Works, Inc; 2004. [Google Scholar]
  • 40.Math Works, Inc. Optimization toolbox for use with MATLAB. User’s guide. Version 3. Natick, MA: The Math Works, Inc; 2004. [Google Scholar]
  • 41.Yashin A, Ukraintseva S, Boiko S, Arbeev K. Individual aging and mortality rate: how are they related? Soc Biol. 2002;49:206–217. doi: 10.1080/19485565.2002.9989059. [DOI] [PubMed] [Google Scholar]

RESOURCES