Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Aug 31.
Published in final edited form as: Stat Methods Med Res. 2020 May 18;29(11):3179–3191. doi: 10.1177/0962280220921553

Dynamic prediction of competing risk events using landmark sub-distribution hazard model with multiple longitudinal biomarkers

Cai Wu 1,2, Liang Li 1, Ruosha Li 2
PMCID: PMC10469606  NIHMSID: NIHMS1927085  PMID: 32419611

Abstract

The cause-specific cumulative incidence function (CIF) quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted CIF by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propose a framework for dynamic prediction of end stage renal disease using multivariate longitudinal biomarkers, accounting for the competing risk of death. The proposed framework extends the local estimation based landmark survival modeling to competing risks data, and implies that a distinct sub-distribution hazard regression model is defined at each biomarker measurement time. The model parameters, prediction horizon, longitudinal history and at-risk population are allowed to vary over the landmark time. When the measurement times of biomarkers are irregularly spaced, the predictor variable may not be observed at the time of prediction. Local polynomial is used to estimate the model parameters without explicitly imputing the predictor or modeling its longitudinal trajectory. The proposed model leads to simple interpretation of the regression coefficients and closed-form calculation of the predicted CIF. The estimation and prediction can be implemented through standard statistical software with tractable computation. We conducted simulations to evaluate the performance of the estimation procedure and predictive accuracy. The methodology is illustrated with data from the African American Study of Kidney Disease and Hypertension.

Keywords: Competing risks, dynamic prediction, Fine-Gray model, landmark analysis, longitudinal biomarkers, prediction model

1. Introduction

Patients with chronic kidney disease (CKD) are at increased risk of end stage renal disease (ESRD). Accurate prediction of the timing is of great importance in clinical research and practice to facilitate preparation for renal replacement therapy and individualize clinical decisions.1 The typical ESRD risk equations are “static” prediction models in the sense that they are developed from survival regression models that relate the predictors at an earlier time point, such as baseline, to the time of ESRD.2,3 Longitudinal data of those biomarkers between baseline and the terminal event are often available and potentially informative to the disease progression, but they are not used in prediction model development.

In statistical literature, the prediction of the risk of clinical events using longitudinal data is often referred to as dynamic prediction in the sense that the prediction can be updated with accumulating longitiudinal data. Important fundamental work has been published in the last decade.49 There are a number of challenges when this methodology is applied to the prediction of ESRD among CKD population. First, CKD patients have increased chance of mortality before reaching ESRD. Proper adjustment for competing risk is often needed in CKD studies.10. Second, previous literature has identified a large number of risk factors, including multiple biomarkers that are known to be causally associated with ESRD.1 Put in the longitudinal context, it requires that the dynamic prediction model should accommodate multiple biomarkers with tractable computation. Some biomarkers, such as the estimated GFR, have diverse nonlinear progression trajectories.11,12 This feature could add to the complexity of the statistical analysis if modeling subject-specific longitudinal trajectories is needed. Third, it may take many years before a CKD patient reaches ESRD or death. The strength of association between biomarkers and the disease outcome may vary over time, leading to time-varying effects. Fourth, it is common that patients do not always follow a pre-specified clinical visits schedule. Even if the visit times are non-informative in the sense that they are not related to the health condition of the patients, the irregularly spaced and unsynchronized biomarker measurement times pose a challenge to the development of dynamic prediction model, as elucidated below.

On the topic of dynamic prediction with competing risks data, Bayesian Markov chain Monte Carlo methods have been developed to model the joint distribution of longitudinal data and competing risks with shared random effects.13,14 However, when a large number of random effects are needed to accommodate multiple and possibly nonlinear longitudinal trajectories per subject, fitting the joint model is computationally infeasible.15 An alternative, computationally simpler dynamic prediction approach called landmark modeling4 has been applied to competing risk problems.1620 These methods adapted to the dynamic prediction setup some existing competing risk regression models, such as cause-specific hazard model17, pseudo-observations18, multi-state model16, and the Fine-Gray subdistribution hazard model19,20. The Fine-Gray model21 imposes a parsimonious relationship between predictors and the cumulative incidence function (CIF) of ESRD without a separate model for death, which is difficult to establish due to the heterogeneity in the causes of death. Based on this consideration, we extended our previous work22 to a landmark sub-distribution hazard model with the following flexibilities: the baseline sub-distribution hazard function is a bivariate function of both the landmark time and the survival time without pre-specified functional structure; all the model parameters are estimated nonparametrically in a coherent local polynomial estimation framework. In a related recent work, Liu et al19,20 extended the idea of “Super Cox model”4 to a landmark sub-distribution hazard model for competing risk data. Motivated by the specific needs of CKD research, our proposed methodology in this paper is also different from the aforementioned literature in how the landmark times are selected and how the irregularly spaced longitudinal data are handled. The typical landmark approach involves pre-specifying a number of landmark time points distributed over the follow-up period, creating a landmark dataset at each landmark time point that consists of at-risk subjects and their predictor variables and time-to-event, and fitting the model to the stacked landmark datasets.4,17,19,20. The predictor variable is not always measured at the landmark times due to irregularly spaced and unsynchronized measurement times. Imputing the unknown value by the last or closest measurement is difficult to apply because that measurement could be years apart and because the progression of CKD includes both chronic periods, when biomarkers change slowly, and acute episodes, when biomarkers change more quickly. Our proposed method does not require pre-specification of the landmark times, which is important given that there is currently no guideline in the literature on how to set the number and locations of the landmark times. It can also accommodate the irregularly spaced observational times without explicit imputation.

This paper is organized as follows. In section 2, we introduce the dynamic prediction model for competing risk data. In section 3, we provide model estimation procedures for the quantities of interest. In section 4 we propose estimators of predictive accuracy measures within the dynamic competing risk context. In section 5, we conduct numerical studies to evaluate the performance of the proposed methodology. The application to the AASK data is presented in section 6. Discussion and future work are presented in section 7.

2. The landmark dynamic prediction model for competing risks data

2.1. The notation and data structure

Let Ti and Ci be the time to event and censoring for subject i, and ϵi{1,,K} be the K causes of the event. We observe the follow-up time T˜i=minTi,Ci, the censoring indicator Δi=1TiCi, and the observed event type Δiϵi{0,,K}. Without loss of generality, we assume K=2 throughout this paper. In the context of the data application, event 1 denotes ESRD, the clinical event of interest, and event 2 denotes death, the competing event. Let the ni×q matrix Yi=Yi1,Yi2,,Yiq denote the repeated measured covariates for subject i, i.e. q covariates with each measured ni times. The matrix notation Yi covers both time-dependent (longitudinal) and time-independent (baseline) covariates. Each column of this matrix includes the ni repeated measurements of that variable at time points ti=ti1,ti2,tiniT, where all tij<T˜i. We use the scalar variable Yimj to denote the j-th repeated measure j=1,2,,ni of the m-th covariate (m=1,2,,q) from the i-th subject (i=1,2,,n). Hence Yim=Yim1,Yim2,,YimniT. For ease of presentation, we will also express Yimj=Yimtij, i.e., subject i’s m-th covariate measured at time tij is the value of the trajectory function Yim(.) at that time.We assume that the distribution of ti is non-informative in the sense that it is independent of Yi and Ti. We assume independent censoring in the sense that C is conditionally independent of T and ϵ given the baseline covariates. We observe independent and identically distributed training data 𝒟n=T˜i,Δiϵi,Yi,ti,i=1,,n, from which the dynamic prediction model is to be developed. Our interest is to estimate the probability of event 1 (e.g., ESRD) in the next τ1 years (called prediction horizon) for a generic individual, indexed by subscript o, given the covariates and survival up to time s:PTos,s+τ1,ϵo=1T˜o>s,Yo(s)¯, where Yo(s)¯=toj,Yo1j,Yo2j,,Yoqj1j no and tojs denotes the covariate history prior to s for subject o.

In the following, we describe the construction of landmark dataset. We use notation T(s)=T-s to denote the residual lifetime when a generic subject is at risk at time s, and Y(s)¯ to denote covariate history up to s, as defined above. For subject i in the development dataset, we define Tij=Ti-tij and Cij=Ci-tij as the subject-specific residual times to event and censoring, starting from tij. For prediction up to the horizon τ1, we can artificially censor the residual times at τ1, i.e., we observe T˜ij=minTij,Cij,τ1 and the event indicator δ˜ij=1TijminCij,τ1×ϵi, where 1(⋅) is the indicator function. The artificial censoring helps to reduce the chance of misspecifying certain model assumptions4. Throughout this paper, we focus on modeling the relationship between T(s) and Y(s)¯ at any landmark time s. In the model development dataset, T(s) and Y(s)¯ are observed at the longitudinal measurement times tiji=1,2,,n,j=1,2,,ni, leading to the observed outcome and predictor pairs T˜ij,δ˜ij,Y˜itij. Here the notation Y˜itij denotes the vector of predictors at time tij for the corresponding residual lifetime; this vector includes features of the longitudinal history Y(s)¯ that are selected as predictors. These features can be viewed as functions of Y(s)¯. Therefore, we also call tij as the landmark time as they are the starting time of the residual lifetime outcome. When making a prediction, the new prediction is often made when new measurements become available, i.e., at a new tij of that subject. From this perspective, the prediction time is also called a landmark time.

2.2. Sub-distribution hazard model with baseline covariates

We first briefly review the sub-distribution hazard (SDH) model for competing risks21 with baseline covariates X. For prediction, the quantity of interest is the cumulative incidence function (CIF) at time t* given X:π1t*;X=PTt*,ϵ=1X. Under Fine and Gray’s formulation, this CIF is formulated as

P(Tt*,ϵ=1X)=1exp(0t*λ1(tX)dt) (1)

with λ1(tX)=λ10(t)expαTX, where λ10(t) can be any non-negative function of time t>0 and α is a real vector. Fine and Gray further developed an interpretation for the λ1(tX) function. They showed that it can be interpreted as a sub-distribution hazard, in the sense that λ1(t;X)=limΔt01ΔtP(tTt+Δt,ϵ=1{Tt}{Ttϵ1},X)=-dlog1-π1(t;X)dt. Such a definition can be viewed as the hazard function for an improper random variable 1(ϵ=1)×T+1(ϵ1)×. This interpretation also helps the development of an estimation procedure that is analogous to that of the Cox model.

As far as prediction is of concern, characterizing the bilateral relationship between time to event T (ϵ=1) and the covariate vector X is all that is needed. The sub-distribution hazard value at a time t (t>0) is not of direct relevance to this prediction. The sub-distribution hazard function λ1(tX) serves as the internal machinery that helps the estimation of model (1). This is a key observation that motivates the working model in the next subsection.

2.3. Landmark proportional sub-distribution hazard model

By extending model (1) to the context of dynamic prediction, we propose the following landmark proportional SDH model at landmark time s:

P(T(s)t*,ϵ=1Y(s)¯,T>s)=1exp(0t*λ1(tY˜(s),s)dt). (2)

As the notation on the left hand side of (2) suggests, at any given landmark time s, this model is specified for those subjects still at risk (T>s) at that time. If we treat the given s as a new baseline, then this model is equivalent to a model (1) specified for the residual life time T(s) among the at-risk subjects at time s, given the history Y(s)¯. Since there are in theory infinitely many landmark time s, model (2) is formulated under the working assumption that these models hold simultaneously. For a specific landmark dataset T˜ij,δ˜ij,Y˜itij,tij;i=1,,n,j=1,,ni, this model implies that the bilateral relationship between each residual time to event T˜ij,δ˜ij and the corresponding “baseline” predictor variables Y˜itij satisfies model (2) at the corresponding landmark time s=tij. To fit this model, we define a working SDH function as:

λ1(t*Y˜i(tij),tij)=λ10(t*,tij)exp(βT(tij)Y˜i(tij)),t*(0,τ1], (3)

where λ10t*,tij is a bivariate smooth baseline SDH function, defined on the scale of the residual lifetime t*0,τ1 and landmark time tij. The time-varying coefficients β(.) are assumed to besmooth functions to allow the association to vary with the landmark time. For the bilateral relationship between T˜ij,δ˜ij and Y˜itij, the corresponding value of the coefficient function is βtij.

Our landmark dataset construction resembles that in the partly conditional model7, which resets the follow-up time scale at each measurement time. From this perspective, the basic idea of the proposed methodology is more closely related to that model than the “super Cox model” type approaches5,19,20. However, besides the accommodation of competing risk outcome, another difference between our approach and Zheng and Heagerty’s7 is that the time-varying coefficients are functions of the landmark time tij instead of the derived follow-up time t*. Therefore it differs from the usual time-varying coefficient model in survival analysis that is commonly used to deal with non-proportional hazards23. With the artificial censoring at τ1, the covariate effect is more likely to be constant over t*0,τ1 (but still vary with tij) and the proportional sub-distribution assumption is more likely to hold.24.

Model (3) is called a “working” sub-distribution hazard function because it is used to facilitate the model fitting using the estimating equations developed by Fine and Gray21. While it implies that a subject’s residual sub-distribution hazard at landmark time s is λ1t*=0Y(s)¯,s, it does not imply that this subject’s sub-distribution hazard at time s+t*t*>0 given the history Y(s)¯, is still given by (3). The hazard at time s+t* depends on Ys+t*¯. In general, the hazard at time s+t* conditional on Y(s)¯ depends on both the hazard at time s+t* conditional on Ys+t*¯ and the conditional distribution of the paths of longitudinal predictors Y˜uus,s+t* given Y(s)¯. This is elucidated by the concept of consistency25. Like other landmark (or partly conditional) models, the proposed model has not been proven as a consistent prediction model. However, this working model can still be a useful prediction tool as long as (2) provides a good approximation to the bilateral relationship between T˜ij,δ˜ij and Y˜itij.

3. Model estimation and dynamic prediction of the CIF

For estimation, we extend the kernel approach in Li et al22 to the competing risk context and formalize the idea of borrowing information from lagging covariates26,27. Assume that β. has a continuous second derivative in a neighborhood of s, by local linear approximation, βtijβ(s)+β(s)tij-s for subject-specific time points tij around s. The landmark dataset m are clustered multivariate time-to-event data with competing events, where the ni records from the same subject are correlated. For clustered competing risk data28, we define the counting process for event 1 as Nijt*=1tijTitij+t*,δ˜ij=1 and the at-risk process Rijt*=1-Nijt*-=1Ti>tij+t*+1tijTitij+t*,δ˜ij1. Based on a local “working independence” partial likelihood function28, for any given landmark point s, we can estimate the parameters β(s) using a kernel-weighted estimation equation, by borrowing biomarker measurements from the neighboring time points, tij(s-h,s+h):

i=1nj=1niKh(tijs)0t*wij(t){Z˜ij(1,tijs)Z¯(β(s),t)}dNij(t). (4)

In the above expression, K() is a kernel function with bounded support on [-1,1];Kh(x)=h-1K(x/h) and h is the bandwidth; Z˜ij1,tij-s=Y˜itij1,tij-s with denoting the Kronecker product. In addition, we have the notations Z¯(β(s),t)=Sˆ(1)(β(s),t)Sˆ(0)(β(s),t), and

S^(r)(β(s),t)=n1i=1nm=1niKh(tims)wim(t)Rim(t)×Z˜im(1,tims)r×exp(bT(s)Z˜im(1,tims)),

where b(s)=b0(s)T,b1(s)TT is used to estimate β(s) and β(s) respectively. Z˜0=1 and Z˜1=Z˜. The wij(t) in (4) denotes the inverse probability censoring weight for competing events21:

wij(t)=1(CijTijt)G(ts)G(Tijts),

where denotes the minimum of the two values and G(s) denotes the distribution of the residual censoring time at landmark s. This distribution is estimated by a kernel-weighted Kaplan-Meier estimator:

G^(ts)=ζΩ,ζt{1imKh(tims)1(C˜im=ζ,δ˜im=0)imKh(tims)1(C˜imζ)},

where Ω is the set of distinct observed censoring times C˜im. The coefficient β(s) is estimated at each landmark s by βˆ(s)=bˆ0(s). The variance of βˆ(s) can be estimated by bootstrap, which involves randomly sampling n subjects from the original dataset with replacement, estimating the point estimator from each randomly sampled bootstrap dataset, and calculating the sample variance of the point estimators from all bootstrap datasets29. Once we obtain the estimates of β(s), the baseline cumulative SDH function at time s can be estimated by plugging in βˆ(s):

Λ˜10(t*,s)=1ni=1nj=1niKh(tijs)0t*1S^(0)(β^(s),t)w^ij(t)dNij(t).

The conditional CIF for any future subject o can be estimated as

π^1(t*s,Yo(s)¯)=P^(s<Tos+t*,ϵo=1T^o>s,Yo(s)¯)=1exp(Λ˜10(t*,s)×exp(β^T(s)Y^o(s))).

4. Quantifying the dynamic predictive accuracy

In this section, we study two predictive accuracy measures, the time-dependent receiver operating characteristic (ROC) curve, in particular the area under the ROC curve (AUC); and the Brier score (BS). In the dynamic prediction framework, the time-dependent predictive accuracy measures are functions of two time scales, the landmark time s and the prediction horizon τ1. The following procedure for estimating sensitivity, specificity, and BS were modified from the non-parametric kernel-weighted approach of Wu and Li30 for competing risk data.

4.1. The dynamic time-dependent ROC curve and AUC

At any landmark time s, we want to evaluate how well the risk score, i.e., the estimated CIF, discriminates between subjects with the event of interest in the window s,s+τ1 versus those without. For any at-risk subject at time s who experiences the main event within the time interval s,s+τ1, that occurrence is defined as a case: D+s,τ1=i:s<Tis+τ1,ϵi=1. When a subject is event-free at s+τ1, that occurrence is defined as a control: D-s,τ1=i:Ti>s+τ1. An alternative definition for a control is to use the complementary set D+s,τ1=i:s<Tis+τ1,ϵi1Ti>s+τ1, including subjects who experience a competing event within the time interval s,s+τ1 or remain event-free at s+τ1. To illustrate the ideas, we present the estimators for the former in this subsection. A similar extension can be made for the latter. For simplicity, we use the notation Uτ1s to denote the individual predicted CIF (i.e., the risk score). Given a threshold value c(0,1), the time-dependent sensitivity and specificity functions are defined as Sec,s,τ1=PUτ1s>cD+s,τ1 and Spc,s,τ1=PUτ1scD-s,τ1. The estimators of sensitivity and specificity are

Se^(c,s,τ1)=isW^1idyn1(Ui(τ1s)>c)isW^1idynSp^(c,s,τ1)=is(1k=1KW^kidyn)1(Ui(τ1s)c)is(1k=1KW^kidyn),

where W1idyn=PTi(s)0,τ1,ϵi=1T˜i(s),δi,Ui=1δ˜ij=0F1τ1Ui,s-F1T˜i(s)Ui,sST˜i(s)Ui,s+1δ˜ij=1,Ti(s)=Ti-s,T˜i(s)=T˜i-s and Ui is short for Uiτ1s. Rs is risk set within the neighborhood of s which includes the most recent record at tij for each subject ii:T˜is,tij-stij-s,j=1,2,,ni,tij(s-h,s+h).F1xUi,s=PTi(s)x,ϵi=1Ui,s and SxUi,s=PTi(s)xUi,s.

For estimating the conditional probability weight W1idyn, we treat the at-risk data set at landmark s as the new baseline data set. The time-dependent ROC curve is a plot of sensitivity Sec,s,τ1 over 1-specificity 1-Spc,s,τ1, i.e., for x[0,1],ROC^x,s,τ1=Se^Sp^-11-x,s,τ1,s,τ1. The AUC is estimated as AUC^s,τ1=01ROC^x,s,τ1dx.

4.2. The dynamic time-dependent Brier score

The time-dependent BS under the dynamic competing risk framework is defined as BSτ1,s=E1s<Ts+τ1,ϵ=1-Uτ1sT>s2, where 1() is the indicator function. Applying the weight W1idyn, the BS can be estimated as

BS^(τ1,s)=1nsi=1ns(W^1idyn×(1Ui(τ1s))2+(1W^1idyn)×(0Ui(τ1s))2),

where ns is the number of subjects at risk at landmark time s.

The AUC and BS assess different aspects of the predictive model. AUC evaluates the discrimination between a case and a control, and BS quantifies the deviance of the predicted probability from the observed data. A model with perfect discrimination will have AUC=1, while AUC close to 0.5 indicates poor discrimination that resembles a random guess. BS is a prediction error metric, with smaller values indicating better prediction.

5. Simulation

The simulation in this section mainly evaluates the prediction accuracy of the proposed model. A separate simulation, which evaluates the estimation of model parameters and bandwidth selection under the assumptions of the working model, is presented in Section B of the online supplementary materials. Similar to other studies evaluating the prediction accuracy of landmark models31, we simulated longitudinal and competing risks data from a joint frailty model with shared random effects32. Details of the data generation process are described in Section A of the online supplementary materials. The data generating model included a baseline covariate and three longitudinal biomarkers. We considered two scenarios: (S1) the longitudinal biomarkers are non-informative for survival in the sense that their effects on both time-to-event outcome are zero, and (S2) the longitudinal biomarkers are informative in the sense that they have non-zero regression coefficients on both time-to-event outcomes. The incremental contribution of the longitudinal biomarkers to the prediction accuracy is expected to be zero under S1 and non-zero under S2.

Table 1 presents the predictive accuracy of the proposed model under both S1 and S2. The full model (M1) includes both the longitudinal biomarkers and the baseline covariate; the null model (M0) only includes the baseline covariate. Since the data were simulated from a joint frailty model32, the proposed landmark SDH model worked under misspecification. However, regardless of whether the data generating model matches the fitting model, the predictive performance can always be evaluated. We considered both discrimination and calibration measures in Table 1. For discrimination, we reported the true positive (TP) fraction and false positive (FP) fraction at a given threshold value, and the AUC as a global discrimination summary. For calibration, we used the Brier score. The predictive accuracy measures were evaluated at three landmark times s=3,5,7 with prediction horizon τ1=1,3. For each simulation, the proposed model was fit to a simulated training data set and the predictive accuracy measures were calculated from another simulated validation dataset from the same distribution. When all the longitudinal biomarkers are non-informative, the predictive accuracy measures of the full model and the null model are very similar. When the three longitudinal biomarkers are informative, including the longitudinal biomarkers in the prediction model substantially improves both discrimination and calibration.

Table 1.

The means (EST) and empirical standard deviations (ESD) of estimated predictive accuracy metrics comparing the full model with longitudinal biomarkers M1 and the null model with only the baseline covariate but without longitudinal biomarkers M0 in the simulation. Prediction horizon τ1=3. S1: non-informative longitudinal biomarkers. S2: informative longitudinal biomarkers. AUC: area under the ROC curve comparing the group experiencing the event of interest with those who experienced competing events or event-free. TP(c): true positive fraction at threshold c.FP(c): false positive fraction at threshold c. BS: Brier score. Sample size n=500.

AUC TP (0.25) FP (0.25) BS
M1 M0 M1 M0 M1 M0 M1 M0
S1 s=1 EST 0.703 0.707 0.514 0.512 0.272 0.290 0.161 0.165
ESD 0.031 0.030 0.088 0.209 0.064 0.173 0.011 0.012
s=3 EST 0.691 0.698 0.726 0.675 0.529 0.499 0.199 0.203
ESD 0.034 0.032 0.092 0.223 0.099 0.237 0.011 0.015
s=5 EST 0.660 0.676 0.751 0.705 0.614 0.594 0.211 0.223
ESD 0.050 0.049 0.125 0.285 0.133 0.309 0.014 0.027
S2 s=1 EST 0.894 0.611 0.595 0.176 0.091 0.126 0.082 0.126
ESD 0.031 0.061 0.117 0.217 0.027 0.175 0.034 0.077
s=3 EST 0.882 0.578 0.738 0.477 0.240 0.452 0.142 0.205
ESD 0.030 0.059 0.132 0.345 0.122 0.342 0.064 0.070
s=5 EST 0.884 0.531 0.831 0.553 0.350 0.552 0.162 0.242
ESD 0.030 0.058 0.082 0.400 0.113 0.392 0.035 0.052

Under S1, the estimated regression parameters of the proposed sub-distributional hazard model are close to zero for the longitudinal biomarkers (first row of supplemental Figure 1). In contrast, the estimated regression parameters under S2 demonstrate that the effects of the three longitudinal biomarkers are notably different from zero and in the right direction (second row of supplemental Figure 1).

We conducted additional simulation to compare the predicted and true conditional risks at individual level. The true conditional risk of an individual (indexed by subscript o, at landmark time s, conditional on biomarker history Y(s)¯, and with prediction horizon τ1) is defined as PTos,s+τ1,ϵo=1T˜o>s,Yo(s)¯ Since the true conditional risk varies by subject, landmark time and history, and does not have a tractable analytical expression, we calculated it empirically at 9 representative landmark time by history combinations (supplementary Table 1) as follows. We simulated data using the procedure in Section A of online supplementary materials but with one informative longitudinal biomarker Y2 (the effect of Y1,Y3, and X on the time-to-event outcome were all set to zero). The true CIF for the event of interest in the next τ1=3 years were obtained empirically as the proportion of subjects with that event within s,s+τ1] give survival up to time s, among those with nearly identical Y2. For illustration, we chose three target Y2 values: m=0,2 and 4 in supplementary Table 1. Subjects with marker values within ±0.05 of the target value were counted in the denominator of the proportion calculation. To ensure that there were enough subjects in the denominator, we simulated a very large dataset (n=1,000,000). We restricted to the case of a single informative biomarker Y2 because it is less feasible to match subjects on multiple biomarkers. The conditional CIFs for hypothetical subjects with marker value 0, 2, 4 at landmark times 1,3,5 were estimated from 500 Monte Carlo repetitions. The average estimated CIF (EST), empirical standard deviation (ESD), percent bias (% Bias) and mean squared error (MSE) are presented in supplementary Table 1. Being a working model, the proposed semi-parametric landmark SDH model worked under mis-specification in this simulation. However, the result suggests that the estimated CIF has little bias and the MSE is low. It may indicate that the proposed model was flexible enough to provide approximate the data well at multiple landmark times, despite that the simulated data do not exactly satisfy the working assumptions. Unlike the simulation results in Table 1, the results in supplementary Table 1 pertain to the quality of predictions at individual level.

The proposed landmark SDH model is a working model and it is not yet clear whether there exists a joint distribution of longitudinal and competing risk data such that the model holds at all landmark times. This is a well known difficulty with landmark dynamic prediction models in general4,22 and it is not specific to our landmark model, though limited progress has been made in problems without competing risks7,33,34. Due to this difficulty, researchers often evaluate the numerical performance of the landmark models using data simulated from the joint model with shared random effects31,35. This is also the simulation strategy that we chose. When the data generation model and the analysis model do not match, the estimated model parameters are difficult to evaluate and interpret, but prediction accuracy can still be assessed. We have designed scenarios S1 and S2 to demonstrate that incorporating informative longitudinal biomarkers improves predictive accuracy. This qualitative conclusion is unlikely to be invalidated by the magnitude of model misspecification. We designed three longitudinal biomarkers with complicated trajectories to demonstrate that the proposed model works in situations where joint modeling approaches may be difficult to apply.

6. Application to the AASK data

The AASK study (African American Study of Kidney Disease and Hypertension) is an NIH funded longitudinal cohort study.3638 It started as a multicenter clinical trial where participants were randomly assigned in a 3 × 2 factorial design to three antihypertensive drugs (ramipril, amlodipine, metoprolol) and two levels of blood pressure control (mean arterial pressure 92 or 102 – 107Hg). 36 The trial included 1,094 African Americans of age 18 to 70 years who were diagnosed with hypertensive renal disease and had baseline eGFRs between 20-65mL/min/1.73m2.37 After the trial phase ended, the study continued to follow the trial participants in a cohort phase to monitor their long-term progression.38 Altogether the follow-up period was up to 12 years, with follow-up visits every 6 months. By the end of the study, 318 (29%) individuals developed ESRD, the event of research interest, and 176 (16%) died before ESRD. The median time to ESRD was 4.3 years and the median time to death was 5.2 years. We chose clinically relevant prediction horizons of τ1=1 or 3 years and illustrated the dynamic prediction at years 3, 5, and 7. The key longitudinal biomarker is eGFR (estimated Glomerular Filteration Rate). Our previous publication demonstrated that this biomarker have diverse and possibly nonlinear individual progression patterns11. In addition, some CKD patients experienced acute kidney injury (AKI) during the follow-up, which may cause substantial short term variation in the eGFR (e.g., see Figure 2 of Li et al22). The number of repeated measurements for eGFR ranged from 3 to 30, with over 50% of individuals providing 17 or more measurements. In addition to the current value of eGFR at a clinical visit, we derived the rate of change in eGFR (linear eGFR slope) during the history window of τ2=3 years, because the eGFR slope is often used by clinicians to characterize the speed of progression in CKD39. The estimation of eGFR slope followed the approach in our recent paper22. Additional biomarkers included longitudinal measurements of serum albumin (Alb), urine protein to creatinine ratio (UP/Cr), serum phosphorus (Phos) and urine potassium (Upot). These biomarkers were considered because they have known biological association with disease progression and have been used in other CKD risk equations1. Also included in the prediction model were age at the time of prediction and an indicator of any hospitalization during the previous year.

Figure 2.

Figure 2.

Individual risk predictions for three selected subjects: subject 1 was censored (dotted vertical green line), subject 2 had ESRD (dotted vertical red line) and subject 3 died (dotted vertical black line). Three biomarkers are plotted over time: “G” is eGFR ml/min/1.73m2, “R” is log-urine protein-to-creatinine ratio (g/g), and “A” is albumin (g/dL). The connected red dots are predicted probabilities of ESRD within a horizon of τ1=3 years. The gray vertical bars represent episodes of hospitalization, with the two vertical borders being admission and discharge dates. The connected black dots are the predicted probability of death within τ1=3 years. The y-axis to the left is the scale of eGFR, and the y-axis to the right is the scale of predicted probabilities (0 to 1). The other two biomarkers, log-UP/CR and albumin, are re-scaled to be displayed in the same plot with eGFR but their respective scales are not shown. The dynamic predicted probabilities of ESRD are calculated using the dynamic SDH model with four predictors: eGFR, eGFR slope in the past three years, log-UP/CR and phosphorus. The dynamic predicted probabilities of death are calculated using the dynamic SDH model with four predictors: current age, serum albumin, any hospitalization within the past year, and log urine potassium.

For the competing events of ESRD and death, we fit landmark SDH models separately using the same set of candidate predictors (supplementary Figure 2). The eGFR, its rate of change, and log UP/Cr were significantly associated with time to ESRD but not with time to death. In contrast, age, Alb and hospitalization are risk factors related to death. This indicates that the progression to ESRD and death may be related to different pathological processes, which justifies the proposal of modeling the competing events separately rather than as a composite outcome. After removal of the non-significant covariates, the final model for ESRD included eGFR, eGFR.slope, log UP/Cr and Phos, and the final model for death included age, Alb, log-Upot and hospitalization (online Figure 3). We conducted bandwidth selection using 5-fold cross-validation. Predictive accuracy metrics were evaluated in the cross-validation dataset, and they were robust to different bandwidths (up to 3 digits after the decimal point). Therefore, we used the bandwidth of h=1.5 in the final model, which provided a relatively smooth curve for the log-SDH ratio curve. The surface plots of the CIF for ESRD and death are illustrated in Figure 1.

Figure 3.

Figure 3.

Individual dynamic predicted CIF for the three selected subjects in Figure 2. Each row in the panel represents one subject, the three columns are the predictions made at landmark years s=3,5,7. The prediction is made at the vertical blue dashed lines. The predicted CIFs up to τ1=3 years are plotted for the event of ESRD (red curve) and death (black curve). Symbols in the figure are similar to Figure 2.

Figure 1.

Figure 1.

Estimated surface of the cumulative incidence function over the landmark time and prediction horizon. This shows an examplary population with age = 55, eGFR=45ml/min/1.73m2, eGFR. slope=0, UP/Cr=0.3g/g, albumin=4g/dL, and hospitalization within the past year.

Figure 2 presents the longitudinal profiles and individual dynamic predictions from three AASK subjects: subject 1 was event-free by the end of the study, subject 2 experienced ESRD after 7.5 years, and subject 3 died after 9.7 years. We demonstrated the biomarker values with real-time predicted 3-year probabilities of ESRD and death. The risk prediction was dynamically updated at each new clinical visit. Subject 1 demonstrated stable disease. Subject 2 demonstrated persistent decline in eGFR and notable increase in proteinuria (log-UP/Cr), which led to drastic increase in the risk of ESRD after year 5. In contrast, the risk of death for subject 2 increased moderately, which may be explained by the Alb level and hospitalization around year 7. For subject 3, the relatively stable eGFR and log-Up/Cr also stablized the subject’s susceptibility to ESRD, but the frequent hospitalization and decreasing Alb level were associated with increased risk of death, possibly due to other co-morbidity. We did not estimate the model after year 8 because the number of observed clinical events was relatively small near the end of the follow-up.

Figure 3 presents the profiles of the same three patients but with their dynamic CIFs (up to τ1=3 years) at landmark times s=3,5,7 years. For subject 1, the predicted CIFs for both ESRD and death were flat. In contrast, the predicted CIF of ESRD for subject 2 started to increase after year 5 and the increase became very prominent by year 7. This was likely caused by a combination of deteriorated renal function (eGFR) and proteinuria (log-UP/Cr). This patient reached ESRD shortly after year 7. The predicted CIF of ESRD for subject 3 stayed flat, but the CIF of death increased at year 7, after frequent hospitalization. This subject eventually died at year 9.6 without ESRD.

In Table 2, we summarized the predictive accuracy of the landmark SDH models for prediction horizons τ1=1,3 at three landmark years s=3,5,7. The model for ESRD achieved good discrimination with AUCs between 0.93–0.96. When we used the cutoff value of 0.05, the sensitivity (TP) and specificity (1-FP) could be controlled within 0.80–0.90 range under all scenarios. The prediction accuracy metrics were similar with different prediction horizons. In contrast, the model for death discriminated no better than a random guess, resulting in AUCs around 0.5; the prediction errors were also at least twice as large as those from predicting ESRD. More importantly, the AUCs from the proposed model improved in comparison with previous studies where AUCs were around 0.8 and always less than 0.922,31. One possible explanation is that these previous studies treat “time to ESRD or death” as a composite outcome. This introduces noise and diminishes the predictive accuracy because all-cause death is difficult to predict with the selected biomarkers, which are prognostically specific to renal disease. The ROC curves for predicting ESRD were plotted in supplementary Figure 4.

Table 2.

Measures of predictive accuracy from the landmark SDH model for ESRD and death in the analysis of AASK data. The estimates were obtained at three landmark years, s=3,5,7, with prediction horizon τ1=1,3 years. AUC: area under the ROC curve. TP(c): true positive rate at threshold c;FP(c): false positive rate at threshold c; thresholds c are selected to be 0.05 for ESRD and 0.01 for death. BS: Brier score.

AUC TP(c) FP(c) BS
ESRD Death ESRD Death ESRD Death ESRD Death
τ1=1 s=3 0.957 0.545 0.925 0.568 0.075 0.539 0.024 0.191
s=5 0.925 0.547 0.885 0.585 0.100 0.596 0.026 0.043
s=7 0.965 0.584 0.832 0.692 0.099 0.675 0.021 0.035
τ1=3 s=3 0.944 0.558 0.876 0.468 0.119 0.378 0.054 0.119
s=5 0.943 0.520 0.852 0.498 0.140 0.549 0.052 0.093
s=7 0.957 0.492 0.863 0.343 0.131 0.405 0.048 0.110

7. Discussion

For CKD patients, estimating the time to ESRD is crucial for the timely treatment management. Dynamic prediction is an attractive tool for this purpose, because it is adaptive to the changing health condition and prognostic history of the patient. It enables real-time monitoring of patient risk. In this paper, we develop novel methodology for dynamic prediction of ESRD among the CKD patients, and we overcome a number of analytical hurdles, including competing events of death, irregularly spaced clinical visit times, multiple biomarkers with complicated longitudinal trajectories, time-varying at-risk population, and time-varying covariate-outcome association. Our proposed methodology is flexible, because the model parameters are estimated semi-parametrically. Hence, it can effectively mitigate the risk of model misspecification. This feature is very important for dynamic prediction models, because, as explained in Section 5, the landmark dynamic prediction model is a working model and needs to provide adequate approximation to the data at all landmark times. Another advantage of the proposed methodology is that it is computationally simple, and it can be implemented through standard statistical software for competing risks analysis, regardless of how many longitudinal biomarkers are included as predictors. In this paper, the estimation process was accomplished with the available R function coxph() after translating the competing events into a counting process40. We believe that the simplicity in computation makes the proposed methodology attractive for various practical situations, including applications with large datasets, a large number of biomarkers with complicated longitudinal trajectories, and other longitudinal prognostic information that cannot be easily modeled at an individual-level (e.g., hospitalization episodes and medication history).

Our kernel-based estimation approach relies on the assumption that the clinical visit times are noninformative. Future work is needed to study dynamic prediction when the frequency of clinical visits is related to the health condition of the patients. The predictors in our proposed model framework include pre-specified features extracted from the data history. Automatic extraction of predictive features from the longitudinal history is another topic that we will pursue in future research.

In this paper, we consider death as a competing risk to ESRD. Although death can also be observed after ESRD, a semi-competing risk analysis is not justified in this context and hence is not pursued. Our study population is CKD. Once a CKD patient reaches ESRD, that patient either becomes a kidney graft recipient or a dialysis patient, and the health condition, care, etiology and prognosis are fundamentally different. GFR, an important biomarker and predictor used in this paper for CKD, is no longer scientifically defined after ESRD.

Supplementary Material

supplementary PDF

Acknowledgements

The authors declare no potential conflicts of interest with respect to the research, authorship and publication of this article. This research was supported by grants from the U.S. National Institutes of Health (P30CA016672, U01DK103225, R01DK118079).

Footnotes

Supplementary Materials

Supplementary materials, including the R code to implement the proposed method, are available for this article on the journal’s supplementary materials website.

References

  • 1.Tangri N, Stevens LA, Griffith J et al. A predictive model for progression of chronic kidney disease to kidney failure. Journal of the American Medical Association 2011; 305(15): 1553–1559. [DOI] [PubMed] [Google Scholar]
  • 2.Echouffo-Tcheugui JB and Kengne AP. Risk models to predict chronic kidney disease and its progression: a systematic review. PLoS medicine 2012; 9(11): e1001344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Greene T and Li L. From static to dynamic risk prediction: time is everything. American Journal of Kidney Diseases 2017; 69(4): 492–494. [DOI] [PubMed] [Google Scholar]
  • 4.van Houwelingen HC and Putter H. Dynamic Prediction in Clinical Survival Analysis. Chapman & Hall/CRC Press, 2011. [Google Scholar]
  • 5.Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics 2007; 34(1): 70–85. [Google Scholar]
  • 6.van Houwelingen HC and Putter H. Dynamic predicting by landmarking as an alternative for multi-state modeling: an application to acute lymphoid leukemia data. Lifetime Data Analysis 2008; 14(4): 447–463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zheng Y and Heagerty PJ. Partly conditional survival models for longitudinal data. Biometrics 2005; 61(2): 379–391. [DOI] [PubMed] [Google Scholar]
  • 8.Rizopoulos D Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data. Biometrics 2011; 67(3): 819–829. [DOI] [PubMed] [Google Scholar]
  • 9.Proust-Lima C and Taylor JM. Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment psa: a joint modeling approach. Biostatistics 2009; 10(3): 535–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Noordzij M, Leffondré K, van Stralen KJ et al. When do we need competing risks methods for survival analysis in nephrology? Nephrology Dialysis Transplantation 2013; 28(11): 2670–2677. [DOI] [PubMed] [Google Scholar]
  • 11.Li L, Astor BC, Lewis J et al. Longitudinal progression trajectory of gfr among patients with ckd. American Journal of Kidney Diseases 2012; 59(4): 504–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Li L, Chang A, Rostand SG et al. A within-patient analysis for time-varying risk factors of ckd progression. Journal of the American Society of Nephrology 2013; : ASN-2013050464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rué M, Andrinopoulou ER, Alvares D et al. Bayesian joint modeling of bivariate longitudinal and competing risks data: an application to study patient-ventilator asynchronies in critical care patients. Biometrical Journal 2017; 59(6): 1184–1203. [DOI] [PubMed] [Google Scholar]
  • 14.Andrinopoulou ER, Rizopoulos D, Takkenberg JJ et al. Combined dynamic predictions using joint models of two longitudinal outcomes and competing risk data. Statistical methods in medical research 2017; 26(4): 1787–1801. [DOI] [PubMed] [Google Scholar]
  • 15.Hickey GL, Philipson P, Jorgensen A et al. Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issues. BMC Medical Research Methodology 2016; 16(1): 117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Cortese G and Andersen P. Competing risks and time-dependent covariates. Biometrical Journal 2010; 52(1): 138–158. [DOI] [PubMed] [Google Scholar]
  • 17.Nicolaie M, Houwelingen J, Witte T et al. Dynamic prediction by landmarking in competing risks. Statistics in Medicine 2012; 32(12): 2031–2047. [DOI] [PubMed] [Google Scholar]
  • 18.Nicolaie M, van Houwelingen J, de Witte T et al. Dynamic pseudo-observations: a robust approach to dynamic prediction in competing risks. Biometrics 2013; 69(4): 1043–1052. [DOI] [PubMed] [Google Scholar]
  • 19.Liu Q Dynamic prediction models for data with competing risks. PhD Thesis, University of Pittsburgh, 2014. [Google Scholar]
  • 20.Liu Q, Tang G, Costantino JP et al. Landmark proportional subdistribution hazards models for dynamic prediction of cumulative incidence functions, 2019. arXiv:1904.09002. [Google Scholar]
  • 21.Fine JP and Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association 1999; 94(446): 496–509. [Google Scholar]
  • 22.Li L, Luo S, Hu B et al. Dynamic prediction of renal failure using longitudinal biomarkers in a cohort study of chronic kidney disease. Statistics in Biosciences 2017; 9(2): 357–378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Cai Z and Sun Y. Local linear estimation for time-dependent coefficients in cox’s regression models. Scandinavian Journal of Statistics 2003; 30(1): 93–111. [Google Scholar]
  • 24.Liu Q, Tang G, Costantino JP et al. Robust prediction of the cumulative incidence function under non-proportional subdistribution hazards. Canadian Journal of Statistics 2016; 44(2): 127–141. [Google Scholar]
  • 25.Jewell N and Neilsen J. A framework for consistent prediction rules based on markers. Biometrika 1993; 80(1): 153–164. [Google Scholar]
  • 26.Andersen PK and Liestøl K. Attenuation caused by infrequently updated covariates in survival analysis. Biostatistics 2003; 4(4): 633–649. [DOI] [PubMed] [Google Scholar]
  • 27.Cao H, Churpek MM, Zeng D et al. Analysis of the proportional hazards model with sparse longitudinal covariates. Journal of the American Statistical Association 2015; 110(511): 1187–1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhou B, Fine J, Latouche A et al. Competing risks regression for clustered data. Biostatistics 2012; 13(3): 371–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Efron B and Tibshirani R. An Introduction to the Bootstrap. Chapman & Hall/CRC Press, 1993. [Google Scholar]
  • 30.Wu C and Li L. Quantifying and estimating the predictive accuracy for censored time-to-event data with competing risks. Statistics in Medicine 2018; 37(21): 3106–3124. [DOI] [PubMed] [Google Scholar]
  • 31.Maziarz M, Heagerty P, Cai T et al. On longitudinal prediction with time-to-event outcome: comparison of modeling options. Biometrics 2017; 73(1): 83–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Elashoff RM, Li G and Li N. A joint model for longitudinal measurements and survival data in the presence of multiple failure types. Biometrics 2008; 64(3): 762–771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zhu Y, Li L and Huang X. On the landmark survival model for dynamic prediction of event occurrence using longitudinal data. In Y Z and DG C (eds.) New Frontiers of Biostatistics and Bioinformatics, chapter 19. Springer-Verlag, 2018. pp. 387–401. [Google Scholar]
  • 34.Zhu Y, Li L and Huang X. Landmark linear transformation model for dynamic prediction with application to a longitudinal cohort study of chronic disease. Journal of the Royal Statistical Society, Series C 2019; 68(3): 771–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Huang X, Yan F, Ning J et al. A two-stage approach for dynamic prediction of time-to-event distributions. Statistics in Medicine 2016; 35(13): 2167–2182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gassman JJ, Greene T, Wright JT et al. Design and statistical aspects of the african american study of kidney disease and hypertension (aask). Journal of the American Society of Nephrology 2003; 14(suppl 2): S154–S165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wright JT Jr, Bakris G, Greene T et al. Effect of blood pressure lowering and antihypertensive drug class on progression of hypertensive kidney disease: results from the aask trial. Journal of the American Medical Association 2002; 288(19): 2421–2431. [DOI] [PubMed] [Google Scholar]
  • 38.Appel L, Middleton J, Miller E et al. The rationale and design of the aask cohort study. Journal of the American Society of Nephrology 2003; 14(7 Suppl 2): S166–72. [DOI] [PubMed] [Google Scholar]
  • 39.Schluchter M, Greene T and Beck G. Analysis of change in the presence of informative censoring: application to a longitudinal clinical trial of progressive renal disease. Statistics in Medicine 2001; 20(7): 989–1007. [DOI] [PubMed] [Google Scholar]
  • 40.Geskus RB. Cause-specific cumulative incidence estimation and the fine and gray model under both left truncation and right censoring. Biometrics 2011; 67(1): 39–49. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplementary PDF

RESOURCES