Abstract
The cause-specific cumulative incidence function (CIF) quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted CIF by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propose a framework for dynamic prediction of end stage renal disease using multivariate longitudinal biomarkers, accounting for the competing risk of death. The proposed framework extends the local estimation based landmark survival modeling to competing risks data, and implies that a distinct sub-distribution hazard regression model is defined at each biomarker measurement time. The model parameters, prediction horizon, longitudinal history and at-risk population are allowed to vary over the landmark time. When the measurement times of biomarkers are irregularly spaced, the predictor variable may not be observed at the time of prediction. Local polynomial is used to estimate the model parameters without explicitly imputing the predictor or modeling its longitudinal trajectory. The proposed model leads to simple interpretation of the regression coefficients and closed-form calculation of the predicted CIF. The estimation and prediction can be implemented through standard statistical software with tractable computation. We conducted simulations to evaluate the performance of the estimation procedure and predictive accuracy. The methodology is illustrated with data from the African American Study of Kidney Disease and Hypertension.
Keywords: Competing risks, dynamic prediction, Fine-Gray model, landmark analysis, longitudinal biomarkers, prediction model
1. Introduction
Patients with chronic kidney disease (CKD) are at increased risk of end stage renal disease (ESRD). Accurate prediction of the timing is of great importance in clinical research and practice to facilitate preparation for renal replacement therapy and individualize clinical decisions.1 The typical ESRD risk equations are “static” prediction models in the sense that they are developed from survival regression models that relate the predictors at an earlier time point, such as baseline, to the time of ESRD.2,3 Longitudinal data of those biomarkers between baseline and the terminal event are often available and potentially informative to the disease progression, but they are not used in prediction model development.
In statistical literature, the prediction of the risk of clinical events using longitudinal data is often referred to as dynamic prediction in the sense that the prediction can be updated with accumulating longitiudinal data. Important fundamental work has been published in the last decade.4–9 There are a number of challenges when this methodology is applied to the prediction of ESRD among CKD population. First, CKD patients have increased chance of mortality before reaching ESRD. Proper adjustment for competing risk is often needed in CKD studies.10. Second, previous literature has identified a large number of risk factors, including multiple biomarkers that are known to be causally associated with ESRD.1 Put in the longitudinal context, it requires that the dynamic prediction model should accommodate multiple biomarkers with tractable computation. Some biomarkers, such as the estimated GFR, have diverse nonlinear progression trajectories.11,12 This feature could add to the complexity of the statistical analysis if modeling subject-specific longitudinal trajectories is needed. Third, it may take many years before a CKD patient reaches ESRD or death. The strength of association between biomarkers and the disease outcome may vary over time, leading to time-varying effects. Fourth, it is common that patients do not always follow a pre-specified clinical visits schedule. Even if the visit times are non-informative in the sense that they are not related to the health condition of the patients, the irregularly spaced and unsynchronized biomarker measurement times pose a challenge to the development of dynamic prediction model, as elucidated below.
On the topic of dynamic prediction with competing risks data, Bayesian Markov chain Monte Carlo methods have been developed to model the joint distribution of longitudinal data and competing risks with shared random effects.13,14 However, when a large number of random effects are needed to accommodate multiple and possibly nonlinear longitudinal trajectories per subject, fitting the joint model is computationally infeasible.15 An alternative, computationally simpler dynamic prediction approach called landmark modeling4 has been applied to competing risk problems.16–20 These methods adapted to the dynamic prediction setup some existing competing risk regression models, such as cause-specific hazard model17, pseudo-observations18, multi-state model16, and the Fine-Gray subdistribution hazard model19,20. The Fine-Gray model21 imposes a parsimonious relationship between predictors and the cumulative incidence function (CIF) of ESRD without a separate model for death, which is difficult to establish due to the heterogeneity in the causes of death. Based on this consideration, we extended our previous work22 to a landmark sub-distribution hazard model with the following flexibilities: the baseline sub-distribution hazard function is a bivariate function of both the landmark time and the survival time without pre-specified functional structure; all the model parameters are estimated nonparametrically in a coherent local polynomial estimation framework. In a related recent work, Liu et al19,20 extended the idea of “Super Cox model”4 to a landmark sub-distribution hazard model for competing risk data. Motivated by the specific needs of CKD research, our proposed methodology in this paper is also different from the aforementioned literature in how the landmark times are selected and how the irregularly spaced longitudinal data are handled. The typical landmark approach involves pre-specifying a number of landmark time points distributed over the follow-up period, creating a landmark dataset at each landmark time point that consists of at-risk subjects and their predictor variables and time-to-event, and fitting the model to the stacked landmark datasets.4,17,19,20. The predictor variable is not always measured at the landmark times due to irregularly spaced and unsynchronized measurement times. Imputing the unknown value by the last or closest measurement is difficult to apply because that measurement could be years apart and because the progression of CKD includes both chronic periods, when biomarkers change slowly, and acute episodes, when biomarkers change more quickly. Our proposed method does not require pre-specification of the landmark times, which is important given that there is currently no guideline in the literature on how to set the number and locations of the landmark times. It can also accommodate the irregularly spaced observational times without explicit imputation.
This paper is organized as follows. In section 2, we introduce the dynamic prediction model for competing risk data. In section 3, we provide model estimation procedures for the quantities of interest. In section 4 we propose estimators of predictive accuracy measures within the dynamic competing risk context. In section 5, we conduct numerical studies to evaluate the performance of the proposed methodology. The application to the AASK data is presented in section 6. Discussion and future work are presented in section 7.
2. The landmark dynamic prediction model for competing risks data
2.1. The notation and data structure
Let and be the time to event and censoring for subject , and be the causes of the event. We observe the follow-up time , the censoring indicator , and the observed event type . Without loss of generality, we assume throughout this paper. In the context of the data application, event 1 denotes ESRD, the clinical event of interest, and event 2 denotes death, the competing event. Let the matrix denote the repeated measured covariates for subject , i.e. covariates with each measured times. The matrix notation covers both time-dependent (longitudinal) and time-independent (baseline) covariates. Each column of this matrix includes the repeated measurements of that variable at time points , where all . We use the scalar variable to denote the -th repeated measure of the -th covariate from the -th subject . Hence . For ease of presentation, we will also express , i.e., subject ’s -th covariate measured at time is the value of the trajectory function at that time.We assume that the distribution of is non-informative in the sense that it is independent of and . We assume independent censoring in the sense that is conditionally independent of and given the baseline covariates. We observe independent and identically distributed training data , from which the dynamic prediction model is to be developed. Our interest is to estimate the probability of event 1 (e.g., ESRD) in the next years (called prediction horizon) for a generic individual, indexed by subscript , given the covariates and survival up to time , where and denotes the covariate history prior to for subject .
In the following, we describe the construction of landmark dataset. We use notation to denote the residual lifetime when a generic subject is at risk at time , and to denote covariate history up to , as defined above. For subject in the development dataset, we define and as the subject-specific residual times to event and censoring, starting from . For prediction up to the horizon , we can artificially censor the residual times at , i.e., we observe and the event indicator , where 1(⋅) is the indicator function. The artificial censoring helps to reduce the chance of misspecifying certain model assumptions4. Throughout this paper, we focus on modeling the relationship between and at any landmark time . In the model development dataset, and are observed at the longitudinal measurement times , leading to the observed outcome and predictor pairs . Here the notation denotes the vector of predictors at time for the corresponding residual lifetime; this vector includes features of the longitudinal history that are selected as predictors. These features can be viewed as functions of . Therefore, we also call as the landmark time as they are the starting time of the residual lifetime outcome. When making a prediction, the new prediction is often made when new measurements become available, i.e., at a new of that subject. From this perspective, the prediction time is also called a landmark time.
2.2. Sub-distribution hazard model with baseline covariates
We first briefly review the sub-distribution hazard (SDH) model for competing risks21 with baseline covariates . For prediction, the quantity of interest is the cumulative incidence function (CIF) at time given . Under Fine and Gray’s formulation, this CIF is formulated as
(1) |
with , where can be any non-negative function of time and is a real vector. Fine and Gray further developed an interpretation for the function. They showed that it can be interpreted as a sub-distribution hazard, in the sense that . Such a definition can be viewed as the hazard function for an improper random variable . This interpretation also helps the development of an estimation procedure that is analogous to that of the Cox model.
As far as prediction is of concern, characterizing the bilateral relationship between time to event and the covariate vector is all that is needed. The sub-distribution hazard value at a time is not of direct relevance to this prediction. The sub-distribution hazard function serves as the internal machinery that helps the estimation of model (1). This is a key observation that motivates the working model in the next subsection.
2.3. Landmark proportional sub-distribution hazard model
By extending model (1) to the context of dynamic prediction, we propose the following landmark proportional SDH model at landmark time :
(2) |
As the notation on the left hand side of (2) suggests, at any given landmark time , this model is specified for those subjects still at risk at that time. If we treat the given as a new baseline, then this model is equivalent to a model (1) specified for the residual life time among the at-risk subjects at time , given the history . Since there are in theory infinitely many landmark time , model (2) is formulated under the working assumption that these models hold simultaneously. For a specific landmark dataset , this model implies that the bilateral relationship between each residual time to event and the corresponding “baseline” predictor variables satisfies model (2) at the corresponding landmark time . To fit this model, we define a working SDH function as:
(3) |
where is a bivariate smooth baseline SDH function, defined on the scale of the residual lifetime and landmark time . The time-varying coefficients are assumed to besmooth functions to allow the association to vary with the landmark time. For the bilateral relationship between and , the corresponding value of the coefficient function is .
Our landmark dataset construction resembles that in the partly conditional model7, which resets the follow-up time scale at each measurement time. From this perspective, the basic idea of the proposed methodology is more closely related to that model than the “super Cox model” type approaches5,19,20. However, besides the accommodation of competing risk outcome, another difference between our approach and Zheng and Heagerty’s7 is that the time-varying coefficients are functions of the landmark time instead of the derived follow-up time . Therefore it differs from the usual time-varying coefficient model in survival analysis that is commonly used to deal with non-proportional hazards23. With the artificial censoring at , the covariate effect is more likely to be constant over (but still vary with ) and the proportional sub-distribution assumption is more likely to hold.24.
Model (3) is called a “working” sub-distribution hazard function because it is used to facilitate the model fitting using the estimating equations developed by Fine and Gray21. While it implies that a subject’s residual sub-distribution hazard at landmark time is , it does not imply that this subject’s sub-distribution hazard at time given the history , is still given by (3). The hazard at time depends on . In general, the hazard at time conditional on depends on both the hazard at time conditional on and the conditional distribution of the paths of longitudinal predictors given . This is elucidated by the concept of consistency25. Like other landmark (or partly conditional) models, the proposed model has not been proven as a consistent prediction model. However, this working model can still be a useful prediction tool as long as (2) provides a good approximation to the bilateral relationship between and .
3. Model estimation and dynamic prediction of the CIF
For estimation, we extend the kernel approach in Li et al22 to the competing risk context and formalize the idea of borrowing information from lagging covariates26,27. Assume that has a continuous second derivative in a neighborhood of , by local linear approximation, for subject-specific time points around . The landmark dataset are clustered multivariate time-to-event data with competing events, where the records from the same subject are correlated. For clustered competing risk data28, we define the counting process for event 1 as and the at-risk process . Based on a local “working independence” partial likelihood function28, for any given landmark point , we can estimate the parameters using a kernel-weighted estimation equation, by borrowing biomarker measurements from the neighboring time points, :
(4) |
In the above expression, is a kernel function with bounded support on and is the bandwidth; with denoting the Kronecker product. In addition, we have the notations , and
where is used to estimate and respectively. and . The in (4) denotes the inverse probability censoring weight for competing events21:
where denotes the minimum of the two values and denotes the distribution of the residual censoring time at landmark . This distribution is estimated by a kernel-weighted Kaplan-Meier estimator:
where is the set of distinct observed censoring times . The coefficient is estimated at each landmark by . The variance of can be estimated by bootstrap, which involves randomly sampling subjects from the original dataset with replacement, estimating the point estimator from each randomly sampled bootstrap dataset, and calculating the sample variance of the point estimators from all bootstrap datasets29. Once we obtain the estimates of , the baseline cumulative SDH function at time can be estimated by plugging in :
The conditional CIF for any future subject can be estimated as
4. Quantifying the dynamic predictive accuracy
In this section, we study two predictive accuracy measures, the time-dependent receiver operating characteristic (ROC) curve, in particular the area under the ROC curve (AUC); and the Brier score (BS). In the dynamic prediction framework, the time-dependent predictive accuracy measures are functions of two time scales, the landmark time and the prediction horizon . The following procedure for estimating sensitivity, specificity, and BS were modified from the non-parametric kernel-weighted approach of Wu and Li30 for competing risk data.
4.1. The dynamic time-dependent ROC curve and AUC
At any landmark time , we want to evaluate how well the risk score, i.e., the estimated CIF, discriminates between subjects with the event of interest in the window versus those without. For any at-risk subject at time who experiences the main event within the time interval , that occurrence is defined as a case: . When a subject is event-free at , that occurrence is defined as a control: . An alternative definition for a control is to use the complementary set , including subjects who experience a competing event within the time interval or remain event-free at . To illustrate the ideas, we present the estimators for the former in this subsection. A similar extension can be made for the latter. For simplicity, we use the notation to denote the individual predicted CIF (i.e., the risk score). Given a threshold value , the time-dependent sensitivity and specificity functions are defined as and . The estimators of sensitivity and specificity are
where and is short for . is risk set within the neighborhood of which includes the most recent record at for each subject and .
For estimating the conditional probability weight , we treat the at-risk data set at landmark as the new baseline data set. The time-dependent ROC curve is a plot of sensitivity over 1-specificity , i.e., for . The AUC is estimated as .
4.2. The dynamic time-dependent Brier score
The time-dependent BS under the dynamic competing risk framework is defined as , where is the indicator function. Applying the weight , the BS can be estimated as
where is the number of subjects at risk at landmark time .
The AUC and BS assess different aspects of the predictive model. AUC evaluates the discrimination between a case and a control, and BS quantifies the deviance of the predicted probability from the observed data. A model with perfect discrimination will have , while AUC close to 0.5 indicates poor discrimination that resembles a random guess. BS is a prediction error metric, with smaller values indicating better prediction.
5. Simulation
The simulation in this section mainly evaluates the prediction accuracy of the proposed model. A separate simulation, which evaluates the estimation of model parameters and bandwidth selection under the assumptions of the working model, is presented in Section B of the online supplementary materials. Similar to other studies evaluating the prediction accuracy of landmark models31, we simulated longitudinal and competing risks data from a joint frailty model with shared random effects32. Details of the data generation process are described in Section A of the online supplementary materials. The data generating model included a baseline covariate and three longitudinal biomarkers. We considered two scenarios: (S1) the longitudinal biomarkers are non-informative for survival in the sense that their effects on both time-to-event outcome are zero, and (S2) the longitudinal biomarkers are informative in the sense that they have non-zero regression coefficients on both time-to-event outcomes. The incremental contribution of the longitudinal biomarkers to the prediction accuracy is expected to be zero under S1 and non-zero under S2.
Table 1 presents the predictive accuracy of the proposed model under both S1 and S2. The full model (M1) includes both the longitudinal biomarkers and the baseline covariate; the null model (M0) only includes the baseline covariate. Since the data were simulated from a joint frailty model32, the proposed landmark SDH model worked under misspecification. However, regardless of whether the data generating model matches the fitting model, the predictive performance can always be evaluated. We considered both discrimination and calibration measures in Table 1. For discrimination, we reported the true positive (TP) fraction and false positive (FP) fraction at a given threshold value, and the AUC as a global discrimination summary. For calibration, we used the Brier score. The predictive accuracy measures were evaluated at three landmark times with prediction horizon . For each simulation, the proposed model was fit to a simulated training data set and the predictive accuracy measures were calculated from another simulated validation dataset from the same distribution. When all the longitudinal biomarkers are non-informative, the predictive accuracy measures of the full model and the null model are very similar. When the three longitudinal biomarkers are informative, including the longitudinal biomarkers in the prediction model substantially improves both discrimination and calibration.
Table 1.
AUC | TP (0.25) | FP (0.25) | BS | |||||||
---|---|---|---|---|---|---|---|---|---|---|
S1 | EST | 0.703 | 0.707 | 0.514 | 0.512 | 0.272 | 0.290 | 0.161 | 0.165 | |
ESD | 0.031 | 0.030 | 0.088 | 0.209 | 0.064 | 0.173 | 0.011 | 0.012 | ||
EST | 0.691 | 0.698 | 0.726 | 0.675 | 0.529 | 0.499 | 0.199 | 0.203 | ||
ESD | 0.034 | 0.032 | 0.092 | 0.223 | 0.099 | 0.237 | 0.011 | 0.015 | ||
EST | 0.660 | 0.676 | 0.751 | 0.705 | 0.614 | 0.594 | 0.211 | 0.223 | ||
ESD | 0.050 | 0.049 | 0.125 | 0.285 | 0.133 | 0.309 | 0.014 | 0.027 | ||
S2 | EST | 0.894 | 0.611 | 0.595 | 0.176 | 0.091 | 0.126 | 0.082 | 0.126 | |
ESD | 0.031 | 0.061 | 0.117 | 0.217 | 0.027 | 0.175 | 0.034 | 0.077 | ||
EST | 0.882 | 0.578 | 0.738 | 0.477 | 0.240 | 0.452 | 0.142 | 0.205 | ||
ESD | 0.030 | 0.059 | 0.132 | 0.345 | 0.122 | 0.342 | 0.064 | 0.070 | ||
EST | 0.884 | 0.531 | 0.831 | 0.553 | 0.350 | 0.552 | 0.162 | 0.242 | ||
ESD | 0.030 | 0.058 | 0.082 | 0.400 | 0.113 | 0.392 | 0.035 | 0.052 |
Under S1, the estimated regression parameters of the proposed sub-distributional hazard model are close to zero for the longitudinal biomarkers (first row of supplemental Figure 1). In contrast, the estimated regression parameters under S2 demonstrate that the effects of the three longitudinal biomarkers are notably different from zero and in the right direction (second row of supplemental Figure 1).
We conducted additional simulation to compare the predicted and true conditional risks at individual level. The true conditional risk of an individual (indexed by subscript , at landmark time , conditional on biomarker history , and with prediction horizon ) is defined as Since the true conditional risk varies by subject, landmark time and history, and does not have a tractable analytical expression, we calculated it empirically at 9 representative landmark time by history combinations (supplementary Table 1) as follows. We simulated data using the procedure in Section A of online supplementary materials but with one informative longitudinal biomarker (the effect of , and on the time-to-event outcome were all set to zero). The true CIF for the event of interest in the next years were obtained empirically as the proportion of subjects with that event within ] give survival up to time , among those with nearly identical . For illustration, we chose three target values: and 4 in supplementary Table 1. Subjects with marker values within ±0.05 of the target value were counted in the denominator of the proportion calculation. To ensure that there were enough subjects in the denominator, we simulated a very large dataset . We restricted to the case of a single informative biomarker because it is less feasible to match subjects on multiple biomarkers. The conditional CIFs for hypothetical subjects with marker value 0, 2, 4 at landmark times 1,3,5 were estimated from 500 Monte Carlo repetitions. The average estimated CIF (EST), empirical standard deviation (ESD), percent bias (% Bias) and mean squared error (MSE) are presented in supplementary Table 1. Being a working model, the proposed semi-parametric landmark SDH model worked under mis-specification in this simulation. However, the result suggests that the estimated CIF has little bias and the MSE is low. It may indicate that the proposed model was flexible enough to provide approximate the data well at multiple landmark times, despite that the simulated data do not exactly satisfy the working assumptions. Unlike the simulation results in Table 1, the results in supplementary Table 1 pertain to the quality of predictions at individual level.
The proposed landmark SDH model is a working model and it is not yet clear whether there exists a joint distribution of longitudinal and competing risk data such that the model holds at all landmark times. This is a well known difficulty with landmark dynamic prediction models in general4,22 and it is not specific to our landmark model, though limited progress has been made in problems without competing risks7,33,34. Due to this difficulty, researchers often evaluate the numerical performance of the landmark models using data simulated from the joint model with shared random effects31,35. This is also the simulation strategy that we chose. When the data generation model and the analysis model do not match, the estimated model parameters are difficult to evaluate and interpret, but prediction accuracy can still be assessed. We have designed scenarios S1 and S2 to demonstrate that incorporating informative longitudinal biomarkers improves predictive accuracy. This qualitative conclusion is unlikely to be invalidated by the magnitude of model misspecification. We designed three longitudinal biomarkers with complicated trajectories to demonstrate that the proposed model works in situations where joint modeling approaches may be difficult to apply.
6. Application to the AASK data
The AASK study (African American Study of Kidney Disease and Hypertension) is an NIH funded longitudinal cohort study.36–38 It started as a multicenter clinical trial where participants were randomly assigned in a 3 × 2 factorial design to three antihypertensive drugs (ramipril, amlodipine, metoprolol) and two levels of blood pressure control (mean arterial pressure or 102 – 107Hg). 36 The trial included 1,094 African Americans of age 18 to 70 years who were diagnosed with hypertensive renal disease and had baseline eGFRs between 37 After the trial phase ended, the study continued to follow the trial participants in a cohort phase to monitor their long-term progression.38 Altogether the follow-up period was up to 12 years, with follow-up visits every 6 months. By the end of the study, 318 (29%) individuals developed ESRD, the event of research interest, and 176 (16%) died before ESRD. The median time to ESRD was 4.3 years and the median time to death was 5.2 years. We chose clinically relevant prediction horizons of or 3 years and illustrated the dynamic prediction at years 3, 5, and 7. The key longitudinal biomarker is eGFR (estimated Glomerular Filteration Rate). Our previous publication demonstrated that this biomarker have diverse and possibly nonlinear individual progression patterns11. In addition, some CKD patients experienced acute kidney injury (AKI) during the follow-up, which may cause substantial short term variation in the eGFR (e.g., see Figure 2 of Li et al22). The number of repeated measurements for eGFR ranged from 3 to 30, with over 50% of individuals providing 17 or more measurements. In addition to the current value of eGFR at a clinical visit, we derived the rate of change in eGFR (linear eGFR slope) during the history window of years, because the eGFR slope is often used by clinicians to characterize the speed of progression in CKD39. The estimation of eGFR slope followed the approach in our recent paper22. Additional biomarkers included longitudinal measurements of serum albumin (Alb), urine protein to creatinine ratio (UP/Cr), serum phosphorus (Phos) and urine potassium (Upot). These biomarkers were considered because they have known biological association with disease progression and have been used in other CKD risk equations1. Also included in the prediction model were age at the time of prediction and an indicator of any hospitalization during the previous year.
For the competing events of ESRD and death, we fit landmark SDH models separately using the same set of candidate predictors (supplementary Figure 2). The eGFR, its rate of change, and log UP/Cr were significantly associated with time to ESRD but not with time to death. In contrast, age, Alb and hospitalization are risk factors related to death. This indicates that the progression to ESRD and death may be related to different pathological processes, which justifies the proposal of modeling the competing events separately rather than as a composite outcome. After removal of the non-significant covariates, the final model for ESRD included eGFR, eGFR.slope, log UP/Cr and Phos, and the final model for death included age, Alb, log-Upot and hospitalization (online Figure 3). We conducted bandwidth selection using 5-fold cross-validation. Predictive accuracy metrics were evaluated in the cross-validation dataset, and they were robust to different bandwidths (up to 3 digits after the decimal point). Therefore, we used the bandwidth of in the final model, which provided a relatively smooth curve for the log-SDH ratio curve. The surface plots of the CIF for ESRD and death are illustrated in Figure 1.
Figure 2 presents the longitudinal profiles and individual dynamic predictions from three AASK subjects: subject 1 was event-free by the end of the study, subject 2 experienced ESRD after 7.5 years, and subject 3 died after 9.7 years. We demonstrated the biomarker values with real-time predicted 3-year probabilities of ESRD and death. The risk prediction was dynamically updated at each new clinical visit. Subject 1 demonstrated stable disease. Subject 2 demonstrated persistent decline in eGFR and notable increase in proteinuria (log-UP/Cr), which led to drastic increase in the risk of ESRD after year 5. In contrast, the risk of death for subject 2 increased moderately, which may be explained by the Alb level and hospitalization around year 7. For subject 3, the relatively stable eGFR and log-Up/Cr also stablized the subject’s susceptibility to ESRD, but the frequent hospitalization and decreasing Alb level were associated with increased risk of death, possibly due to other co-morbidity. We did not estimate the model after year 8 because the number of observed clinical events was relatively small near the end of the follow-up.
Figure 3 presents the profiles of the same three patients but with their dynamic CIFs (up to years) at landmark times years. For subject 1, the predicted CIFs for both ESRD and death were flat. In contrast, the predicted CIF of ESRD for subject 2 started to increase after year 5 and the increase became very prominent by year 7. This was likely caused by a combination of deteriorated renal function (eGFR) and proteinuria (log-UP/Cr). This patient reached ESRD shortly after year 7. The predicted CIF of ESRD for subject 3 stayed flat, but the CIF of death increased at year 7, after frequent hospitalization. This subject eventually died at year 9.6 without ESRD.
In Table 2, we summarized the predictive accuracy of the landmark SDH models for prediction horizons at three landmark years . The model for ESRD achieved good discrimination with AUCs between 0.93–0.96. When we used the cutoff value of 0.05, the sensitivity (TP) and specificity (1-FP) could be controlled within 0.80–0.90 range under all scenarios. The prediction accuracy metrics were similar with different prediction horizons. In contrast, the model for death discriminated no better than a random guess, resulting in AUCs around 0.5; the prediction errors were also at least twice as large as those from predicting ESRD. More importantly, the AUCs from the proposed model improved in comparison with previous studies where AUCs were around 0.8 and always less than 0.922,31. One possible explanation is that these previous studies treat “time to ESRD or death” as a composite outcome. This introduces noise and diminishes the predictive accuracy because all-cause death is difficult to predict with the selected biomarkers, which are prognostically specific to renal disease. The ROC curves for predicting ESRD were plotted in supplementary Figure 4.
Table 2.
AUC | TP(c) | FP(c) | BS | ||||||
---|---|---|---|---|---|---|---|---|---|
ESRD | Death | ESRD | Death | ESRD | Death | ESRD | Death | ||
0.957 | 0.545 | 0.925 | 0.568 | 0.075 | 0.539 | 0.024 | 0.191 | ||
0.925 | 0.547 | 0.885 | 0.585 | 0.100 | 0.596 | 0.026 | 0.043 | ||
0.965 | 0.584 | 0.832 | 0.692 | 0.099 | 0.675 | 0.021 | 0.035 | ||
0.944 | 0.558 | 0.876 | 0.468 | 0.119 | 0.378 | 0.054 | 0.119 | ||
0.943 | 0.520 | 0.852 | 0.498 | 0.140 | 0.549 | 0.052 | 0.093 | ||
0.957 | 0.492 | 0.863 | 0.343 | 0.131 | 0.405 | 0.048 | 0.110 |
7. Discussion
For CKD patients, estimating the time to ESRD is crucial for the timely treatment management. Dynamic prediction is an attractive tool for this purpose, because it is adaptive to the changing health condition and prognostic history of the patient. It enables real-time monitoring of patient risk. In this paper, we develop novel methodology for dynamic prediction of ESRD among the CKD patients, and we overcome a number of analytical hurdles, including competing events of death, irregularly spaced clinical visit times, multiple biomarkers with complicated longitudinal trajectories, time-varying at-risk population, and time-varying covariate-outcome association. Our proposed methodology is flexible, because the model parameters are estimated semi-parametrically. Hence, it can effectively mitigate the risk of model misspecification. This feature is very important for dynamic prediction models, because, as explained in Section 5, the landmark dynamic prediction model is a working model and needs to provide adequate approximation to the data at all landmark times. Another advantage of the proposed methodology is that it is computationally simple, and it can be implemented through standard statistical software for competing risks analysis, regardless of how many longitudinal biomarkers are included as predictors. In this paper, the estimation process was accomplished with the available R function coxph() after translating the competing events into a counting process40. We believe that the simplicity in computation makes the proposed methodology attractive for various practical situations, including applications with large datasets, a large number of biomarkers with complicated longitudinal trajectories, and other longitudinal prognostic information that cannot be easily modeled at an individual-level (e.g., hospitalization episodes and medication history).
Our kernel-based estimation approach relies on the assumption that the clinical visit times are noninformative. Future work is needed to study dynamic prediction when the frequency of clinical visits is related to the health condition of the patients. The predictors in our proposed model framework include pre-specified features extracted from the data history. Automatic extraction of predictive features from the longitudinal history is another topic that we will pursue in future research.
In this paper, we consider death as a competing risk to ESRD. Although death can also be observed after ESRD, a semi-competing risk analysis is not justified in this context and hence is not pursued. Our study population is CKD. Once a CKD patient reaches ESRD, that patient either becomes a kidney graft recipient or a dialysis patient, and the health condition, care, etiology and prognosis are fundamentally different. GFR, an important biomarker and predictor used in this paper for CKD, is no longer scientifically defined after ESRD.
Supplementary Material
Acknowledgements
The authors declare no potential conflicts of interest with respect to the research, authorship and publication of this article. This research was supported by grants from the U.S. National Institutes of Health (P30CA016672, U01DK103225, R01DK118079).
Footnotes
Supplementary Materials
Supplementary materials, including the R code to implement the proposed method, are available for this article on the journal’s supplementary materials website.
References
- 1.Tangri N, Stevens LA, Griffith J et al. A predictive model for progression of chronic kidney disease to kidney failure. Journal of the American Medical Association 2011; 305(15): 1553–1559. [DOI] [PubMed] [Google Scholar]
- 2.Echouffo-Tcheugui JB and Kengne AP. Risk models to predict chronic kidney disease and its progression: a systematic review. PLoS medicine 2012; 9(11): e1001344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Greene T and Li L. From static to dynamic risk prediction: time is everything. American Journal of Kidney Diseases 2017; 69(4): 492–494. [DOI] [PubMed] [Google Scholar]
- 4.van Houwelingen HC and Putter H. Dynamic Prediction in Clinical Survival Analysis. Chapman & Hall/CRC Press, 2011. [Google Scholar]
- 5.Van Houwelingen HC. Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics 2007; 34(1): 70–85. [Google Scholar]
- 6.van Houwelingen HC and Putter H. Dynamic predicting by landmarking as an alternative for multi-state modeling: an application to acute lymphoid leukemia data. Lifetime Data Analysis 2008; 14(4): 447–463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zheng Y and Heagerty PJ. Partly conditional survival models for longitudinal data. Biometrics 2005; 61(2): 379–391. [DOI] [PubMed] [Google Scholar]
- 8.Rizopoulos D Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data. Biometrics 2011; 67(3): 819–829. [DOI] [PubMed] [Google Scholar]
- 9.Proust-Lima C and Taylor JM. Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment psa: a joint modeling approach. Biostatistics 2009; 10(3): 535–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Noordzij M, Leffondré K, van Stralen KJ et al. When do we need competing risks methods for survival analysis in nephrology? Nephrology Dialysis Transplantation 2013; 28(11): 2670–2677. [DOI] [PubMed] [Google Scholar]
- 11.Li L, Astor BC, Lewis J et al. Longitudinal progression trajectory of gfr among patients with ckd. American Journal of Kidney Diseases 2012; 59(4): 504–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Li L, Chang A, Rostand SG et al. A within-patient analysis for time-varying risk factors of ckd progression. Journal of the American Society of Nephrology 2013; : ASN-2013050464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rué M, Andrinopoulou ER, Alvares D et al. Bayesian joint modeling of bivariate longitudinal and competing risks data: an application to study patient-ventilator asynchronies in critical care patients. Biometrical Journal 2017; 59(6): 1184–1203. [DOI] [PubMed] [Google Scholar]
- 14.Andrinopoulou ER, Rizopoulos D, Takkenberg JJ et al. Combined dynamic predictions using joint models of two longitudinal outcomes and competing risk data. Statistical methods in medical research 2017; 26(4): 1787–1801. [DOI] [PubMed] [Google Scholar]
- 15.Hickey GL, Philipson P, Jorgensen A et al. Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issues. BMC Medical Research Methodology 2016; 16(1): 117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cortese G and Andersen P. Competing risks and time-dependent covariates. Biometrical Journal 2010; 52(1): 138–158. [DOI] [PubMed] [Google Scholar]
- 17.Nicolaie M, Houwelingen J, Witte T et al. Dynamic prediction by landmarking in competing risks. Statistics in Medicine 2012; 32(12): 2031–2047. [DOI] [PubMed] [Google Scholar]
- 18.Nicolaie M, van Houwelingen J, de Witte T et al. Dynamic pseudo-observations: a robust approach to dynamic prediction in competing risks. Biometrics 2013; 69(4): 1043–1052. [DOI] [PubMed] [Google Scholar]
- 19.Liu Q Dynamic prediction models for data with competing risks. PhD Thesis, University of Pittsburgh, 2014. [Google Scholar]
- 20.Liu Q, Tang G, Costantino JP et al. Landmark proportional subdistribution hazards models for dynamic prediction of cumulative incidence functions, 2019. arXiv:1904.09002. [Google Scholar]
- 21.Fine JP and Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association 1999; 94(446): 496–509. [Google Scholar]
- 22.Li L, Luo S, Hu B et al. Dynamic prediction of renal failure using longitudinal biomarkers in a cohort study of chronic kidney disease. Statistics in Biosciences 2017; 9(2): 357–378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Cai Z and Sun Y. Local linear estimation for time-dependent coefficients in cox’s regression models. Scandinavian Journal of Statistics 2003; 30(1): 93–111. [Google Scholar]
- 24.Liu Q, Tang G, Costantino JP et al. Robust prediction of the cumulative incidence function under non-proportional subdistribution hazards. Canadian Journal of Statistics 2016; 44(2): 127–141. [Google Scholar]
- 25.Jewell N and Neilsen J. A framework for consistent prediction rules based on markers. Biometrika 1993; 80(1): 153–164. [Google Scholar]
- 26.Andersen PK and Liestøl K. Attenuation caused by infrequently updated covariates in survival analysis. Biostatistics 2003; 4(4): 633–649. [DOI] [PubMed] [Google Scholar]
- 27.Cao H, Churpek MM, Zeng D et al. Analysis of the proportional hazards model with sparse longitudinal covariates. Journal of the American Statistical Association 2015; 110(511): 1187–1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhou B, Fine J, Latouche A et al. Competing risks regression for clustered data. Biostatistics 2012; 13(3): 371–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Efron B and Tibshirani R. An Introduction to the Bootstrap. Chapman & Hall/CRC Press, 1993. [Google Scholar]
- 30.Wu C and Li L. Quantifying and estimating the predictive accuracy for censored time-to-event data with competing risks. Statistics in Medicine 2018; 37(21): 3106–3124. [DOI] [PubMed] [Google Scholar]
- 31.Maziarz M, Heagerty P, Cai T et al. On longitudinal prediction with time-to-event outcome: comparison of modeling options. Biometrics 2017; 73(1): 83–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Elashoff RM, Li G and Li N. A joint model for longitudinal measurements and survival data in the presence of multiple failure types. Biometrics 2008; 64(3): 762–771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhu Y, Li L and Huang X. On the landmark survival model for dynamic prediction of event occurrence using longitudinal data. In Y Z and DG C (eds.) New Frontiers of Biostatistics and Bioinformatics, chapter 19. Springer-Verlag, 2018. pp. 387–401. [Google Scholar]
- 34.Zhu Y, Li L and Huang X. Landmark linear transformation model for dynamic prediction with application to a longitudinal cohort study of chronic disease. Journal of the Royal Statistical Society, Series C 2019; 68(3): 771–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Huang X, Yan F, Ning J et al. A two-stage approach for dynamic prediction of time-to-event distributions. Statistics in Medicine 2016; 35(13): 2167–2182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gassman JJ, Greene T, Wright JT et al. Design and statistical aspects of the african american study of kidney disease and hypertension (aask). Journal of the American Society of Nephrology 2003; 14(suppl 2): S154–S165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wright JT Jr, Bakris G, Greene T et al. Effect of blood pressure lowering and antihypertensive drug class on progression of hypertensive kidney disease: results from the aask trial. Journal of the American Medical Association 2002; 288(19): 2421–2431. [DOI] [PubMed] [Google Scholar]
- 38.Appel L, Middleton J, Miller E et al. The rationale and design of the aask cohort study. Journal of the American Society of Nephrology 2003; 14(7 Suppl 2): S166–72. [DOI] [PubMed] [Google Scholar]
- 39.Schluchter M, Greene T and Beck G. Analysis of change in the presence of informative censoring: application to a longitudinal clinical trial of progressive renal disease. Statistics in Medicine 2001; 20(7): 989–1007. [DOI] [PubMed] [Google Scholar]
- 40.Geskus RB. Cause-specific cumulative incidence estimation and the fine and gray model under both left truncation and right censoring. Biometrics 2011; 67(1): 39–49. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.