Summary
In recent years, a wide range of markers have become available as potential tools to predict risk or progression of disease. In addition to such biological and genetic markers, short term outcome information may be useful in predicting long term disease outcomes. When such information is available, it would be desirable to combine this along with predictive markers to improve the prediction of long term survival. Most existing methods for incorporating censored short term event information in predicting long term survival focus on modeling the disease process and are derived under restrictive parametric models in a multi-state survival setting. When such model assumptions fail to hold, the resulting prediction of long term outcomes may be invalid or inaccurate. When there is only a single discrete baseline covariate, a fully non-parametric estimation procedure to incorporate short term event time information has been previously proposed. However, such an approach is not feasible for settings with one or more continuous covariates due to the curse of dimensionality. In this paper, we propose to incorporate short term event time information along with multiple covariates collected up to a landmark point via a flexible varying-coefficient model. To evaluate and compare the prediction performance of the resulting landmark prediction rule, we use robust non-parametric procedures which do not require the correct specification of the proposed varying coefficient model. Simulation studies suggest that the proposed procedures perform well in finite samples. We illustrate them here using a dataset of post-dialysis patients with end-stage renal disease.
Keywords: Landmark Prediction, Risk Prediction, Survival Time, Varying Coefficient Model
1 Introduction
In the quest to provide accurate prognosis of disease for patients it has become evident that a wide range of biological and genetic markers hold great potential in improving prediction. The prediction potential of these markers has led to a more personalized approach to individual patient care (Bauer et al., 2007; Wilson et al., 1998; Folsom et al., 2006). Incorporating short term outcomes in addition to these markers may further improve prediction accuracy. For example, the extent to which change in oncological status after surgery can predict long term survival for breast cancer patients was examined in Hatteville et al. (2002). Additionally, treatment response in breast cancer patients may provide useful information for prediction of future events (Osborne, 1998). Hirschtick et al. (1995) demonstrated an association between occurrence of bacterial pneumonia and death among HIV-positive patients. Recently, the association between acute graft versus host disease and survival of acute leukemia patients following bone marrow or stem cell transplantation was investigated in Cortese & Andersen (2010) and Lee et al. (2002). Among end-stage renal disease patients, the relationship between an early hospitalization and risk of death has been examined by Collins et al. (2009) and USRDS (2010).
In the aforementioned examples, the long term outcome of interest is often time to a terminal event e.g. death and the short term outcome is often time to a non-terminal event. This setting is referred to as a semi-competing risk setting since the occurrence of the terminal event would censor the non-terminal event but not vice versa. Throughout, we let T and T denote event times for the short term and long term events, respectively. Most existing methods for analyzing semi-competing risk data focus on estimation of the cause-specific hazard functions and/or the joint distribution of events under semi-parametric models (Fine et al., 2001; Siannis et al., 2007; Jiang et al., 2005). The joint distribution estimators can potentially be used to make predictions about T given information on T. Methods developed specifically to incorporate T information and baseline predictors for the prediction of T have concentrated primarily on semi-parametric models in a multi-state framework (Putter et al., 2006; Kay, 1986; Klein et al., 1994; Datta et al., 2000; Hatteville et al., 2002; Cortese & Andersen, 2010). For example, Klein et al. (1994) use a standard Cox regression analysis to incorporate information on whether or not the intermediate event has occurred by a certain time. Hatteville et al. (2002) extend this method to include non-proportional hazards and propose estimation procedures. Recently, Van Houwelingen & Putter (2008) proposed a landmark prediction procedure to incorporate the status of the short term event by a landmark point t0.
These methods provide useful tools to leverage information about T to improve the prediction of T. However, these simple semi-parametric models may not capture the relationship among the two outcomes and the covariates in practice. As an example, T may be highly associated with unknown subtypes of disease and the effect of observed covariates on T may vary with the disease subtype. Commonly used proportional hazards models may be inadequate for data from such a mixture of populations. When there is only a single discrete baseline covariate, a fully non-parametric procedure was proposed in Parast et al. (2011) to estimate the conditional distribution of the residual life from a landmark time t0, T − t0, given both T and the covariate among subjects who survive past t0. However, such an approach is not feasible in settings with one or more continuous covariates due to the curse of dimensionality (Robins & Ritov, 1997). To overcome such limitations, we extend the landmark Cox model of Van Houwelingen & Putter (2008) and propose a flexible varying-coefficient working model to incorporate T information along with a vector of covariates that may consist of information collected up to t0. The varying-coefficient model can potentially approximate the underlying disease process better compared to commonly used semi-parametric models. Landmarking along with an inverse probability weighting (IPW) approach (Rotnitzky & Robins, 2005; Robins & Rotnitzky, 1992) enables us to overcome both non-informative dropout and informative censoring issues for T due to competing risks. In contrast to Parast et al. (2011), our proposed approach can also incorporate longitudinal predictor information collected up to t0, such as repeated biomarker measurements and the frequency of complications or adverse events up to t0.
An important follow up step in developing methods for risk prediction is to objectively assess its predictive performance. Due to the complexity of the disease process as mentioned above, it is important to estimate the prediction accuracy measures non-parametrically without requiring that the fitted statistical models hold. To evaluate the performance of the proposed landmark prediction rule, we make inference about the accuracy measures using robust non-parametric procedures. We provide inference procedures for comparing prediction rules and resampling based procedures for interval estimation. Results from our simulation studies suggest that our proposed procedures work well in finite samples and the flexibility of our varying coefficient model could lead to a substantial gain in accuracy compared to a landmark Cox model based approach. For illustration, we apply our procedures to a dataset of post-dialysis patients with end-stage renal disease and develop a landmark risk model for kidney transplant free survival incorporating time to first hospitalization as well as several available predictors.
2 Estimation
2·1 Notations and Settings
We consider the semi-competing risk setting where T is a terminal event and can only be censored by the administrative censoring, C, not T, while T may be censored by both T and C. For example, T may represent time to a nonfatal stroke or hospitalization and T may represent time to death. Considering the potential difference in the underlying disease process, patients who experience the short term outcome may have substantially different clinical outcomes from the general patient population. Therefore, it may be important to incorporate information on the short term outcome in the prediction of a long term outcome. In this paper, we focus on predicting the risk of T ≤ t0 + τ among subjects whose T is greater than a pre-defined landmark point t0 for any given τ. We denote this subset of subjects as ΩT>t0 = {T > t0}. For example, if t0 is a 1 year check-up appointment from the time of surgery and T is death, we are interested in predicting future mortality among patients who have survived at least 1 year after surgery. Our goal is two-fold: 1) to accurately assess the probability of survival to time t0 + τ among ΩT>t0 using information on T and a vector of covariates Z and 2) to quantify how much prediction accuracy was improved by incorporating T information compared to using only covariates Z. When short term outcome information is used, it is essential that the prediction only use information on T up to the landmark point t0. Note that though we condition on patients surviving up to t0, these patients may still experience both T and T after t0.
Our potential data for analysis is composed of n independent and identically distributed random vectors {Di = (Xi, δi, Xi, δi, )′, i = 1, …, n}, where X = min(T, C), X = min(T, C), δ = I(T ≤ C), δ = I(T ≤ C), and indicates all available collected covariate information up to t0. We let Z represent the vector version of . It is important to note that Xi is not always observable due to competing risks, however, given our interest in the subset, ΩT>t0, the information concerning Xi required for our proposed estimation procedure is always observable. For details see Section 2·3. We assume that C is independent of (T, T, Z) with a survival function G(·) and P(X > t0 + τ, X > t0)P(T ≤ t0) > 0. Without specifying the true model for the conditional survival, such an independent censoring assumption is often required to enable valid estimation of the prediction performance measures (defined in Section 2·4) due to the curse of dimensionality (Robins & Ritov, 1997). When censoring does depend on Z, one may relax such assumptions by imposing a semi-parametric model on C given Z, see Section 6.
2·2 Landmark Risk Prediction for the Subpopulation ΩT>t0
2·2.1 Landmark Prediction with Z only
For comparison, we are first interested in developing landmark prediction rules based on Z only, which would enable us to assess the incremental value of T in prediction. To estimate , we consider a working time-specific generalized linear model (GLM)
(2·1) |
where g is a known, strictly increasing link function and φt0 is a vector of unknown regression coefficients. Model (2·1) allows the effect of Z to vary over t0 which is attractive in settings where markers predictive of short term risk may not be predictive of long term risk. In the presence of censoring, φt0 may be estimated by modifying the IPW estimator proposed in Uno et al. (2007) to incorporate landmarking. Specifically, we estimate φt0 as φ̂t0, the solution to:
(2·2) |
where
and Ĝ(·) is the Kaplan-Meier estimator of G(·). Note that Ŵi = 0 for all subjects with Xi > t0. By using this weight, we essentially use the subgroup with X > t0 as an unbiased random sample for the subpopulation ΩT>t0 since P(T ≤ t0 + τ | X > t0) = P(T ≤ t0 + τ | T > t0) under independent censoring.
2·3 Landmark Prediction with Z and T
We now consider developing landmark prediction rules to estimate the probability that T ≤ t0 + τ given Z and the information on T up to t0 for subjects in ΩT>t0,
where
(2·3) |
(2·4) |
as in Parast et al. (2011). The quantity Pt0,τ,ts,z pertains to subjects with T ≤ t0 while Pt0,τ,z pertains to those with T > t0. The conditional risk Pt0,τ,Z can also be approximated via a time-specific GLM similar to (2·1), , and the unknown model parameter βt0 can be estimated as β̂t0, the solution to:
(2·5) |
Here, we now use the subgroup with {X > t0, T > t0} as an unbiased random sample for the subpopulation {T > t0, T > t0} since P(T ≤ t0 + τ, | X > t0, X > t0) = P(T ≤ t0 + τ, | T > t0, T > t0) under independent censoring. Now, to approximate Pt0,τ,tsz, we consider a varying coefficient model (Hastie & Tibshirani, 1993; Fan & Zhang, 1999; Fan & Zhang, 2008):
(2·6) |
where g is a known, strictly increasing link function and βts is vector of unknown functions of ts. This model allows the effect of Z on the risk to vary over the short term outcome. To estimate βts in the presence of censoring, we propose to obtain β̂ts as the solution to the following IPW kernel smoothed estimating equation:
(2·7) |
where Kh(x) = h−1K(x/h), K(·) is a known smooth symmetric kernel density function with a bounded support, ψ: [0, ∞) → (−∞, ∞) is a given transformation function and the bandwidth h > 0 is assumed to be O(n−v), for 1/5 < v < 1/2. To potentially improve estimation one can implement an appropriate transformation ψ(·) such as log(·)(Wand et al., 1991; Park et al., 1997). Combining β̂T and β̂t0, we can now estimate as
Note that the observed data have sufficient information on T and T for the estimation of (2·3) and (2·4). The conditional risk (2·3) only involves the subgroup with T ≤ t0 < T and since Xi > t0 implies Ci > t0, when Xi > t0 and Xi ≤ t0, then Ti = Xi and I(Ti ≤ t0) is observable. The probability in (2·4) involves the subgroup with min(T, T) > t0 and can be obtained based on the subset of patients with min(X, X) > t0 which is always observable.
It is known that the choice of the smoothing parameter h when estimating βts is critical as in any non-parametric functional estimation problem. To obtain an appropriate h we use a standard K-fold cross-validation procedure to obtain hopt, the minimizer of an appropriately weighted mean squared error estimate. It is common practice to undersmooth, due to the difficulty in estimating the bias, and obtain an estimator whose variance dominates bias. For example, to obtain the final bandwidth for the estimation of βts, one can choose h = hoptn−c0, for some c0 ∈ (0, 3/10). Estimation is then performed over a region of ts defined as [ψ−1(ρl + h), ψ−1(ρr − h)] where (ρl, ρr) is a subset of ψ(ts). In all numerical examples, (ρl, ρr) is taken to be the (0.02, 0.98) percentiles of the observed X values.
2·4 Information gained by incorporating short-term outcome
To investigate the incremental value in predicting I(T ≤ t0 + τ) gained from incorporating the short-term outcome T information, we compare the predicted individual risk for subjects using (T, Z) with that using Z alone. A wide range of accuracy measures have been proposed in the literature to quantify and compare the predictive performance of risk models. Interesting discussions on this topic can be found in Heagerty & Zheng (2005), Cook (2007), Gerds et al. (2008), Pencina et al. (2008), Kerr & Pepe (2011). For illustration purposes, we focus on two commonly used measures, the Brier score (Brier, 1950; Gerds & Schumacher, 2006; Lawless & Yuan, 2010) and the area under the Receiver Operating Characteristic (ROC) (AUC) curve (Swets & Pickett, 1982; Heagerty & Zheng, 2005). The Brier score and the AUC of a risk function p(T, Z) in the context of landmark prediction can be defined as BSt0,τ= E{[I(Ti ≤ t0 + τ) − p(Ti, Zi)]2|Ti > t0} and AUCt0,τ = P{p(Ti, Zi) > p(Tj, Zj) | t0 < Ti ≤ t0 + τ, Tj > t0 + τ}. Similar to Gerds & Schumacher (2006), Uno et al. (2007) and Parast et al. (2011), for each prediction model, these measures can be estimated empirically using an IPW approach. Specifically, if the jth model estimates the conditional risk for the ith subject as , then
(2·8) |
(2·9) |
consistently estimate BSt0,τ and AUCt0,τ for the corresponding risk model, respectively. Based on P̂(2)(Ti, Zi) and P̂(1)(Zi), the incremental value of T can be quantified by and . In addition, one may quantify the incremental value of T by the additional proportion of variation explained by model (2) relative to model (1), defined as . When the sample size is not large, it is important to correct for over-fitting bias via procedures such as cross-validation.
3 Inference
To make inference about βts, it can be shown using the weak convergence of to a zero-mean Gaussian process (Kalbfleisch & Prentice, 2002), that the uncertainty of Ĝ(·) in Ŵi is of order , which is negligible compared to the standard non-parametric rate of {(nh)−1 log(n)}1/2. Therefore, the proposed estimator is asymptotically equivalent to the estimator obtained by replacing Ŵi with Wi. Using similar arguments as given by Bickel & Rosenblatt (1973), Dabrowska (1989) and Cai et al. (2010), one can show that β̂ts → βts in probability as n → ∞ regardless of whether (2·6) is correctly specified, where βts is the unique solution to the estimating equation, . In addition, one can show that for any given ts ≤ t0 and h = Op(n−v) with v ∈ (1/5, 1/2), for some fixed function ξ. Thus, it follows from similar arguments as given in Bickel & Rosenblatt (1973) that and B̂(ts) converges in distribution to a zero-mean normal random variable for any given ts.
To construct a confidence interval (CI) for βts in practice, we employ a perturbation-resampling method (Park & Wei, 2003; Cai et al., 2005; Tian et al., 2007) to approximate the distributions of the proposed estimators. Let { } be n × B independent copies of a positive random variable V from a known distribution with mean and variance equal to one. Then one may obtain perturbed estimates of βts, , either as the solution to
or explicitly as , where (β) = −∂Ŝts(β)/∂β,
and Ĝ(b) is the Kaplan-Meier estimator of G(·) with weights . Note that, although the variation in Ĝ is negligible asymptotically for assessing the variation of β̂ts, we find that accounting for this variation provides a better approximation in finite samples and thus perturb Ĝ with Ĝ(b) to obtain . The variance of β̂ts, , can be estimated as the empirical variance of { }. Such a perturbation procedure also enables us to construct 100(1 − α)% simultaneous confidence bands for {βts, 0 < ts0< ts ≤ t0} based on {β̂ts ± ζασ̂βts}, where ζα is the upper 100(1 − α)th percentile of { } and ts0 is selected such that P(T < ts0) > 0.
For {β̂t0, φ̂t0}, it can be shown using similar arguments as given by Uno et al. (2007) and Parast & Cai (2010) that they converge in probability to {βt0, φt0}, the respective solutions to and . Further more, { } converges jointly in distribution to a multivariate normal. Similar perturbation procedures can be used to construct CIs for βt0 and φt0. For the accuracy measure estimates, it is not difficult to justify their root-n consistency and normality using the uniform consistency of the risk estimates and functional central limit theorem Pollard (1990) along with modifications of the arguments given in Zheng et al. (2008). We note that although β̂ts is estimated at a slower non-parametric rate, the estimated overall accuracy parameters remain the standard parametric rate of root-n since they are essentially integrated over the distribution of T. To overcome the bias and variance trade-off, we choose to under smooth with h = O(n−v) and v ∈ (1/4, 1/2). The CIs for the Brier score and AUC can be constructed by perturbing (2·8) and (2·9) along with all the estimated model parameters and Ĝ. For all estimators with standard convergence rate of root-n, the variation due to Ĝ is no longer negligible and hence it is important to use the perturbed censoring weight to account for the sampling variations.
4 Simulations
We conducted simulation studies to examine the finite-sample properties of the proposed estimation procedures. For simplicity, we consider a single continuous covariate Z generated from a Normal(0,4) distribution and the survival times T and T from the model:
(4·1) |
and η, μ1, μ2, , and were chosen in order to produce (i) a low-event rate setting and (ii) a moderate-event rate setting. In each setting, we considered a sample size of n=2,000 and 5,000. Throughout, the results are summarized based on 1000 simulated data sets for each setting. For illustration, we chose t0 to be year 2 and τ to be 8 years. Therefore, among patients who have not experienced the long-term event by year 2, T > 2, we wish to estimate their probability of experiencing the long term event before year 10. Throughout all of the numerical examples, we let g be the logit link function. For the estimation of P̂t0,τ,ts,z using (2·7), we let K(·) be the gaussian kernel and ψ(x) = log(x). The smoothing parameter h was obtained using the 3-fold cross-validation scheme and c0 = 1/10 and hence h = O(n−0.3). The standard error (SE) estimates were obtained based on B = 500 perturbations each.
In setting (i), the low event rate setting, the survival times were generated from (4·1) with η = 0.4, μ1 = 1.8, μ2 = 3.3, , and . Under these conditions, P(T ≤ t0 | T > t0) = 0.20 and P(T > t0) = 0.98 and P(T ≤ t0 + τ |T > t0) = 0.13. We consider the case when C was generated from a Weibull distribution with shape = 1.6 and scale = 20. With this censoring pattern, 30% of observations are censored for T, 80% are censored for T and P(T ≤ t0, δ = 1 | X > t0) = 0.18 and P(X > t0) = 0.96 and P(T ≤ t0 + τ, δ = 1|X > t0) = 0.11. Estimates of βts in Pt0,τ,ts,z for t0 = 2 and τ = 8 and n = 2, 000 and 5, 000 are shown for multiple values of ts in Table 1(i). Values of ts in the table were chosen as the 20th, 40th, 60th, and 80th percentiles of T < t0. The SEs estimated from the perturbation resampling method are close to the empirical SEs with higher SEs in the case of a lower sample size as expected. In addition, the SEs for both intercept and slope estimates tend to be higher for smaller values of ts possibly due to smoothing when there are fewer events in this time period as is typically seen in non-parametric estimation. All estimates have slight but negligible bias and the empirical coverage levels are close to their nominal level.
Table 1.
(i) low event rate setting | ||||||||
---|---|---|---|---|---|---|---|---|
| ||||||||
n=2000 | ||||||||
| ||||||||
Intercept | Slope | |||||||
| ||||||||
ts | t0.20 | t0.40 | t0.60 | t0.80 | t0.20 | t0.40 | t0.60 | t0.80 |
| ||||||||
Bias | 5.6 | 3.2 | 1.8 | 0.7 | −2.0 | −0.1 | 1.7 | 3.6 |
ASE | 39.8 | 33.3 | 29.0 | 26.8 | 14.5 | 13.2 | 12.6 | 12.6 |
ESE | 41.6 | 35.2 | 29.8 | 26.7 | 15.5 | 14.7 | 13.3 | 12.6 |
Coverage | 93.6 | 93.5 | 94.3 | 94.5 | 93.8 | 92.7 | 92.5 | 93.4 |
| ||||||||
n=5000 | ||||||||
| ||||||||
Intercept | Slope | |||||||
| ||||||||
ts | t0.20 | t0.40 | t0.60 | t0.80 | t0.20 | t0.40 | t0.60 | t0.80 |
| ||||||||
Bias | 1.9 | 0.8 | 0.8 | −0.2 | −0.7 | 0.8 | 1.8 | 3.5 |
ASE | 33.7 | 27.4 | 23.0 | 20.2 | 11.8 | 10.5 | 9.8 | 9.5 |
ESE | 35.4 | 28.3 | 22.6 | 20.2 | 12.6 | 10.9 | 9.8 | 10.0 |
Coverage | 93.9 | 94.0 | 95.6 | 94.5 | 92.9 | 93.1 | 94.4 | 91.1 |
| ||||||||
(ii) moderate event rate setting | ||||||||
| ||||||||
n=2000 | ||||||||
| ||||||||
Intercept | Slope | |||||||
| ||||||||
ts | t0.20 | t0.40 | t0.60 | t0.80 | t0.20 | t0.40 | t0.60 | t0.80 |
Bias | −26.3 | −2.8 | 3.5 | 5.3 | 10.2 | 2.1 | −0.3 | −3.4 |
ASE | 84.7 | 50.4 | 31.6 | 21.2 | 29.0 | 19.2 | 18.1 | 19.4 |
ESE | 92.2 | 53.4 | 31.9 | 21.7 | 31.9 | 20.3 | 18.4 | 21.1 |
Coverage | 93.3 | 95.0 | 95.1 | 94.7 | 92.7 | 94.6 | 94.7 | 93.8 |
| ||||||||
n=5000 | ||||||||
| ||||||||
Intercept | Slope | |||||||
| ||||||||
ts | t0.20 | t0.40 | t0.60 | t0.80 | t0.20 | t0.40 | t0.60 | t0.80 |
| ||||||||
Bias | −3.5 | −3.1 | 4.2 | 5.7 | −1.0 | 2.3 | −0.2 | −3.6 |
ASE | 51.6 | 31.1 | 19.6 | 13.3 | 17.7 | 12.0 | 11.3 | 12.2 |
ESE | 52.0 | 33.5 | 20.4 | 13.4 | 18.2 | 12.5 | 11.9 | 12.5 |
Coverage | 95.8 | 94.0 | 93.3 | 93.4 | 94.7 | 93.5 | 94.3 | 95.0 |
In setting (ii), the moderate event rate setting, the survival times were generated from (4·1) with η = 0.7, μ1 = 1.2, μ2 = 2.3, , σ12 = 0.5, and . Under these conditions, P(T ≤ t0 | T > t0) = 0.35 and P(T > t0) = 0.95 and P(T ≤ t0 + τ |T > t0) = 0.23. We again consider the case when C was generated from a Weibull distribution with shape = 1.6 and scale = 20. With this censoring pattern, 25% of observations are censored for T, 70% are censored for T and P(T ≤ t0, δ = 1 | X > t0) = 0.35 and P(X > t0) = 0.92 and P(T ≤ t0 + τ, δ = 1|X > t0) = 0.21. We again consider estimation of βts in Pt0,τ,ts,z and results are shown in Table 1(ii). The SEs estimated from the perturbation resampling method are again close to the empirical SEs with lower SEs in this event rate setting as compared to the lower event rate setting. Similar to the results in setting (i), SEs tend to by higher for smaller values of ts. All estimates have slight bias, however the empirical coverage levels are close to their nominal level. In both the low and moderate event rate setting, the procedures appear to perform well though care should be taken when there are very few events in a specific time interval as this could lead to some unstable estimates.
In both settings we examined the value of using the additional short-term outcome information in using the procedures discussed in Section 2·4. Here we use a 3-fold cross-validation procedure to correct for over-fitting bias. Brier score and AUC estimates are shown in Tables 2 and 3. In the low event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.077 when n=2,000 and by 0.086 when n=5,000 after incorporating short-term outcome information compared to using Z information only. In the moderate event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.306 when n=2,000 and by 0.317 when n=5,000 after incorporating short-term outcome information compared to using Z information only. In the low event rate setting, including the short-term outcome information results in a 0.011 increase in AUC when n=2,000 and a 0.013 increase in AUC when n=5,000 compared to using Z information only. In the moderate event rate setting, including the short-term outcome information results in a 0.060 increase in AUC when n=2,000 and a 0.062 increase in AUC when n=5,000 compared to using Z information only. The SEs estimated from the perturbation resampling method are close to the empirical SEs with smaller SEs in the higher sample size setting as expected. Empirical coverage levels are close to their nominal level.
Table 2.
(i) low event rate setting | |||||||
---|---|---|---|---|---|---|---|
| |||||||
n=2000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
7.6 | 0.4 | 0.5 | 7.9 | 0.4 | 95.6 | |
|
8.5 | 0.4 | 0.5 | 8.6 | 0.5 | 95.2 | |
IPEV2/1 | 10.7 | 2.4 | 2.8 | 7.7 | 2.8 | 90.3 | |
| |||||||
n=5000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
7.7 | 0.3 | 0.3 | 7.8 | 0.3 | 95.5 | |
|
8.5 | 0.3 | 0.3 | 8.6 | 0.3 | 93.7 | |
IPEV2/1 | 10.4 | 1.5 | 1.7 | 8.6 | 1.6 | 95.3 | |
| |||||||
(ii) moderate event rate setting | |||||||
| |||||||
n=2000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
9.6 | 0.4 | 0.5 | 10.2 | 0.5 | 89.8 | |
|
14.6 | 0.5 | 0.5 | 14.7 | 0.5 | 94.9 | |
IPEV2/1 | 33.9 | 2.7 | 2.9 | 30.6 | 2.8 | 92.2 | |
| |||||||
n=5000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
9.8 | 0.3 | 0.3 | 10.0 | 0.3 | 93.0 | |
|
14.6 | 0.3 | 0.3 | 14.6 | 0.3 | 94.9 | |
IPEV2/1 | 32.9 | 1.7 | 1.8 | 31.7 | 1.8 | 94.6 |
Table 3.
(i) low event rate setting | |||||||
---|---|---|---|---|---|---|---|
| |||||||
n=2000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
90.0 | 0.9 | 1.0 | 89.3 | 1.0 | 91.5 | |
|
88.2 | 1.0 | 1.0 | 88.2 | 1.0 | 93.8 | |
| |||||||
n=5000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
90.0 | 0.6 | 0.6 | 89.6 | 0.6 | 93.8 | |
|
88.3 | 0.6 | 0.6 | 88.3 | 0.6 | 94.7 | |
| |||||||
(ii) moderate event rate setting | |||||||
| |||||||
n=2000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
92.0 | 0.7 | 0.8 | 91.2 | 0.8 | 92.6 | |
|
85.2 | 1.0 | 1.0 | 85.2 | 1.0 | 95.2 | |
| |||||||
n=5000 | |||||||
| |||||||
Apparent | CV (3-fold) | ||||||
| |||||||
Average | ESE | ASE | Average | ESE | Cvg | ||
| |||||||
|
91.8 | 0.4 | 0.5 | 91.5 | 0.5 | 94.0 | |
|
85.3 | 0.6 | 0.6 | 85.3 | 0.6 | 95.9 |
Additionally, we also consider the prediction accuracy of risk estimates based on a landmark Cox model as previously proposed in the literature (Van Houwelingen & Putter, 2008; Cortese & Andersen, 2010). Such a model would involve fitting a conditional proportional hazards working model for the residual life among ΩT>t0 using baseline covariates along with a variable indicating whether the short term event had occurred by the landmark time. We denote the resulting risk estimate as . To evaluate the prediction rule resulting from this landmark Cox model approach, the corresponding Brier Score and AUC may be estimated by replacing in (2·8) and (2·9) with to obtain and , respectively. Simulations results comparing the accuracy estimates from the proposed model to those from the landmark Cox model are shown in Table 4. In the low event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.063 and the AUC is 0.017 higher when n=5,000 when using the proposed approach to incorporate short-term outcome information compared to the landmark Cox model approach. In the moderate event rate setting, we observe a substantial increase in accuracy. When n = 5000, the proposed approach yields an increase in AUC of 0.057 and a 26% improvement in the proportion of variation explained.
Table 4.
(a) Brier score | ||||||
---|---|---|---|---|---|---|
| ||||||
|
|
IPEV2/cox | ||||
| ||||||
(i) low event rate | ||||||
n=2000 | 79.2 (0.42) | 83.9 (0.45) | 5.3 (7.6) | |||
n=5000 | 78.2 (0.27) | 83.6 (0.29) | 63.4 (4.8) | |||
(ii) moderate event rate | ||||||
n=2000 | 10.2 (0.44) | 13.5 (0.50) | 24.6 (4.5) | |||
n=5000 | 10.0 (0.28) | 13.5 (0.32) | 25.9 (2.7) | |||
| ||||||
(b) AUC | ||||||
| ||||||
|
|
|
||||
| ||||||
(i) low event rate | ||||||
n=2000 | 89.3 (0.93) | 87.9 (1.01) | 1.4 (1.54) | |||
n=5000 | 89.6 (0.58) | 87.9 (0.65) | 1.7 (0.93) | |||
(ii) moderate event rate | ||||||
n=2000 | 91.2 (0.69) | 85.8 (0.95) | 5.4 (1.22) | |||
n=5000 | 91.5 (0.44) | 85.8 (0.64) | 5.7 (0.79) |
5 Example
To illustrate our proposed procedures, we use a dataset of patients with end-stage renal disease (ESRD) who had initiated maintenance peritoneal or hemodialysis from the United States Renal Data System (USRDS) (USRDS, 2010). The USRDS consists of a total of 463,240 patients from 18 regional areas who initiated dialysis between 2000 and 2004. Our analysis focused on the subset of patients from one network consisting of six-states in the New England area without a previous kidney transplant. This subset consisted of 9,785 patients with individual information on time to death or kidney transplant, time to first hospitalization, and several baseline covariates measured at initiation of dialysis. Total time at risk was measured from the earliest outpatient date after initiation of dialysis until censoring at transplantation, death, loss to follow-up, or December 31, 2005. Time until first hospitalization was measured from the earliest outpatient date after initiation of dialysis to the first hospitalization date, when available.
We define T as time to first hospitalization and T as time to death or kidney transplant. Early hospitalizations are frequent in patients with ESRD and are associated with higher morbidity and risk of death (USRDS, 2007; Collins et al., 2009). Previous work has examined the incremental value of incorporating hospitalization time in predicting residual life in this dataset in the presence of a single discrete marker, erythropoietin use (Parast et al., 2011). While this marker is useful, it would be of great clinical interest to be able to incorporate information on the other clinical variables available. For our analyses, we include the following baseline covariates measured at initiation of dialysis based on results from a multivariate analysis: age, albumin, erythropoietin use, serum creatinine, and prior history of heart failure (HF), arrhythmia, chronic obstructive pulmonary disease (COPD), and cancer. There is an abundance of clinical literature examining associations of these demographic and clinical predictors with all-cause or cause-specific hospitalization and mortality (Sands et al., 2006; Barrett et al., 1997; Murphy & Parfrey, 2000). In addition, some patients are hospitalized multiple times prior to t0, which might indicate poor prognosis and hence a higher risk of death. Using our procedure, we are able to incorporate such information into prediction by adding the frequency of hospitalizations before t0 as a covariate.
There were a total of 5,480 first hospitalizations, 5,448 deaths, 111 transplantations and the 18 month hospitalization rate was about 0.57; the 18 month death or transplant rate was about 0.40. For illustration, we chose t0 to be 6 months and τ to be 1 year. Among the 7,679 patients who survived past 6 months, 37% were hospitalized within 6 months of initiating dialysis and 63% did not have a hospitalization before 6 months. We aim to make a prediction concerning 1-year survival among those who survived 6 months using the proposed procedures incorporating information on time of first hospitalization (T) and frequency of hospitalizations between baseline and t0 as well as multiple baseline covariates. For example, the probability of interest, Pt0,τ,ts,z, for a patient with T = ts = 24 days is displayed Figure 1.
Estimates of βts are shown in Table 5 with corresponding SE estimates from the perturbation-resampling method. Figure 2 displays the estimates over a range of ts for two sets of patients with T = ts ≤ t0 who differ only by history of HF. Both patients were hospitalized once before t0 and have the following same baseline covariate values: age = 76, albumin = 3.2, using erythropoietin, serum creatinine = 6.5, no prior history of arrhythmia, COPD, or cancer. However, one patient has prior history of HF (solid black thin line) while the other has no prior history of HF (solid black thick line). Using baseline covariate information only, , the predicted risk of having an event between 6 months and 1.5 years is equal to 0.21 for the patient with prior history of HF and 0.18 for the patient without prior history of HF regardless of hospitalization time. Note that is the predicted risk for all patients who survived past 6 months whereas P̂t0,τ,ts,z is for patients who survived past 6 months and were hospitalized before 6 months. Given that a hospitalization did not occur before 6 months, and 0.14 for the patient with and without prior history of HF, respectively. These results are consistent with the clinical literature which shows a clear association between morbidity upon initiation of dialysis, particularly cardiovascular conditions, and higher risk of hospitalization and death in end-stage renal disease patients (USRDS, 2010; Foley et al., 1997; Saravanan & Davidson, 2010; Cheung et al., 2004). Sudden cardiac death is the most common form of death in dialysis patients and it is has been argued that early referral of patients with co-morbid conditions would lead to planned transition from stage 4 chronic kidney disease to end-stage renal disease, perhaps reducing morbidity and mortality in the first year of dialysis treatment (Collins et al., 2009; Herzog et al., 2008). In addition, we note a peak at 60 days since initiation of dialysis which could potentially be explained by the tendency for patients to change dialysis modality within the first 90 days of dialysis treatment (Dalrymple et al., 2010).
Table 5.
ts | t0.20 | t0.40 | t0.60 | t0.80 | ||||
---|---|---|---|---|---|---|---|---|
| ||||||||
Estimate | SE | Estimate | SE | Estimate | SE | Estimate | SE | |
| ||||||||
β0 | −2.17 | 0.99 | −2.22 | 0.81 | −2.86 | 0.78 | −3.14 | 0.87 |
βalbumin | −0.27 | 0.13 | −0.29 | 0.10 | −0.22 | 0.10 | −0.15 | 0.10 |
βepo use | −0.21 | 0.15 | −0.07 | 0.12 | −0.07 | 0.12 | −0.12 | 0.13 |
βserum creatinine | −0.04 | 0.03 | −0.03 | 0.02 | −0.02 | 0.02 | −0.01 | 0.02 |
βPHx HF | 0.13 | 0.15 | 0.11 | 0.12 | 0.11 | 0.12 | 0.06 | 0.12 |
βPHx arrhythmia | 0.18 | 0.20 | 0.34 | 0.17 | 0.46 | 0.17 | 0.55 | 0.18 |
βPHx COPD | 0.46 | 0.22 | 0.32 | 0.17 | 0.27 | 0.16 | 0.30 | 0.17 |
βPHx cancer | 0.15 | 0.23 | 0.04 | 0.19 | 0.10 | 0.18 | 0.21 | 0.19 |
βage | 0.03 | 0.01 | 0.03 | 0.01 | 0.03 | 0.01 | 0.03 | 0.01 |
βhosp | 0.20 | 0.06 | 0.24 | 0.05 | 0.27 | 0.06 | 0.27 | 0.06 |
Brier score estimates with corresponding values of the increase in explained variation and AUC estimates are shown in Table 6. Incorporating intermediate hospitalization information leads to an improvement in accuracy with estimated as 0.005 (95% C.I. [0.0020, 0.0072]), IPEV2/1 estimated as 0.026 (95% CI: [0.012, 0.041]), and increase in AUC by 0.051 (95% CI: [0.034, 0.068]). This indicates a significant gain in predictive ability after incorporating short-term outcome information. We note that in many clinical applications, the increment in AUC due to the inclusion of new novel biomarkers is often as little as ~ 0.01 (e.g. Wang et al., 2006).
Table 6.
Brier Score | ||||
---|---|---|---|---|
| ||||
Apparent | CV (3-fold) | SE | ||
| ||||
|
16.5 | 17.0 | 0.3 | |
|
17.3 | 17.4 | 0.3 | |
IPEV2/1 | 4.4 | 2.6 | 3 | |
| ||||
95% C.I. for | ||||
| ||||
AUC | ||||
| ||||
Apparent | CV (3-fold) | SE | ||
| ||||
|
67.0 | 65.2 | 0.9 | |
|
60.6 | 60.1 | 0.9 | |
| ||||
95% C.I. for |
As previously mentioned, Parast et al. (2011) proposed non-parametric procedures to estimate predicted probability and evaluate the incremental value of including short term event information when there is only a single discrete covariate available. Using these procedures, there was an increase in predictive ability after incorporating hospitalization time in addition to the single discrete marker used for illustration, erythropoietin use. Since a significant advantage of the procedure proposed in this paper is the ability to accommodate one or more continuous covariates including time-varying covariate information collected up to t0, it would be of interest to quantify how much the proposed procedure gained over the Parast et al. (2011) procedure in terms of prediction accuracy. In Parast et al. (2011), a prediction rule which incorporates both intermediate hospitalization information and erythropoietin use resulted in a Brier score estimate of 0.1741 with an SE of 0.0025 and an AUC of 0.6111 with an SE of 0.0080. Compared to these results, our procedure which incorporates the frequency of hospitalizations before t0 and all the aforementioned baseline covariates leads to an improvement in prediction accuracy as demonstrated by a reduction in the Brier score of 0.0046 (95% CI: [−0.0129,0.0038 ]) and an increase in AUC of 0.0410 (95% CI: [0.0169,0.0652]).
6 Discussion
In this paper, we propose a flexible varying-coefficient model to incorporate short term event time information up to a landmark point along with baseline covariates for the prediction of long term survival. In addition, we use robust non-parametric procedures to make inference about the resulting accuracy measures and the difference in the accuracy measures between various prediction rules without requiring the correct specification of the proposed model. One may expect that including the short term event information could significantly improve the prediction accuracy when the short term event is highly correlated with the long term event of interest. Though we assumed independent censoring, the proposed procedure could accommodate marker dependent censoring by replacing Ĝ in Ŵi with Ĝz obtained as the Kaplan Meier estimator of P(C > t | Z = z) for the subgroup with Zi = z. It is important to note that under model mis-specification βts may not necessarily reflect the effect of Z on T among the subgroup in Ωt0,ts = {T > t0, T = ts}. However, it would still reflect the relative importance of Z in terms of prediction for subjects with different values of ts. Especially when the model is approximately true, one may follow the same arguments as given in Eguchi and Copas (2002) that βts could be interpreted as the best linear combination of Z in predicting I(T ≤ t0 + τ) among the subgroup Ωt0,ts.
Though we present cross-validated estimates of the proposed accuracy measures, in an effort to reduce bias from potential over-fitting, it is often appealing to validate the performance of a risk prediction model in a population independent of the study population. We additionally acquired patient information from another region of the USRDS, specifically New York state and assessed the accuracy of our proposed risk score in this dataset. All accuracy measures were estimated assuming that βts, βt0, and φt0 were fixed and known i.e. equal to β̂ts, β̂t0, and φ̂t0, estimated previously in the first region of New England states. The accuracy measures from this separate validation set are similar in magnitude to those of the cross-validated accuracy measures in the original dataset and the proposed procedure performs well in both regions. For example, after incorporating hospitalization information, the AUC increased by 0.034 (95% CI: 0.026, 0.043) indicating a gain in predictive ability.
When a single discrete marker is available for prediction, one may consider a non-parametric approach as in Parast et al. (2011). When there are few markers available, one may extend this method using multi-dimensional smoothing. However, a large sample size would be required for even two-dimensional smoothing in the presence of censoring (Dabrowska, 1987). This lack of power would be even more pronounced here due to the reduced effective sample size as the analysis is restricted to patients who survive to t0. While more parametric methods can certainly be considered, risk estimates from some model-based procedures may have poor accuracy if the model assumptions fail to hold. The proposed varying coefficient landmark prediction modeling framework also enables use to incorporate longitudinally measured covariates easily. This differs from the standard Cox model with time-varying covariates in that we do not use covariate information at time t to infer about hazard rate at t, but allow for a prediction of future events occurring long after t. We also note that our proposed method holds for any choice of t0 and τ. In practice, one may consider the proposed estimator across a range of t0 and τ as a basis for selecting an appropriate time point such that the prediction model is most accurate.
Throughout this paper we have assumed that the covariates and short term event time are measured without error. In practice, this may not be a valid assumption and if it is violated, this procedure may lead to biased estimates for the regression parameters. One may consider extensions to this procedure using measurement error models proposed in the literature (Carroll et al., 2006) to handle such situations. However, the added complexity and issues resulting from model mis-specification should be carefully considered. On the other hand, our proposed model can also be viewed as a working model aimed to approximate the underlying data generating process and to obtain the best prediction rule with the given marker values potentially measured with error. Future research on accounting for measurement error to improve prediction performance is warranted.
All analyses were performed in R. Code for implementing the proposed procedures is available upon request.
Acknowledgments
The data reported here have been supplied by the United States Renal Data System (USRDS). The interpretation and reporting of these data are the responsibility of the author(s) and in no way should be seen as an official policy or interpretation of the U.S. government. The authors are grateful to the Editor, Associate Editor, and the reviewers for their insightful comments on the article. Support for this research was provided by National Institutes of Health grants T32GM074897 (Parast), R01AI052817 (Parast), R01-GM079330 (Cai) and U54LM008748 (Cai).
Contributor Information
Layla Parast, Email: lparast@post.harvard.edu, Department of Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115, Phone: 512-731-9919.
Su-Chun Cheng, Email: scheng@jimmy.harvard.edu, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115.
Tianxi Cai, Email: tcai@hsph.harvard.edu, Department of Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115.
References
- Barrett BJ, Parfrey PS, Morgan J, Barr P, Fine A, Goldstein MB, Handa SP, Jindal KK, Kjellstrand CM, Levin A, Mandin H, Muirhead N, Richardson RMA. Prediction of early death in end-stage renal disease patients starting dialysis. American Journal of Kidney Diseases. 1997;29:214–222. doi: 10.1016/s0272-6386(97)90032-9. [DOI] [PubMed] [Google Scholar]
- Bauer KR, CTRM, Brown M, Cress RD, Parise CA, Caggiano V. Descriptive analysis of estrogen receptor (ER)-negative, progesterone receptor (PR)-negative, and HER2-negative invasive breast cancer, the so-called triple-negative phenotype: A population-based study from the California cancer registry. Cancer. 2007;109:1721–1728. doi: 10.1002/cncr.22618. [DOI] [PubMed] [Google Scholar]
- Bickel PJ, Rosenblatt M. On some global measures of the deviations of density function estimates. The Annals of Statistics. 1973;1:1071–1095. [Google Scholar]
- Brier GW. Verification of forecasts expressed in terms of probability. Monthly Weather Review. 1950;78:1–3. [Google Scholar]
- Cai T, Tian L, Uno H, Solomon SD, Wei L. Calibrating parametric subject-specific risk estimation. Biometrika. 2010;97:389–404. doi: 10.1093/biomet/asq012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai T, Tian L, Wei LJ. Semiparametric Box-Cox power transformation models for censored survival observations. Biometrika. 2005;92:619–632. [Google Scholar]
- Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu C. Measurement Error in Nonlinear Models: A Modern Perspective. London: Chapman and Hall; 2006. [Google Scholar]
- Cheung AK, Sarnak MJ, Yan G, Berkoben M, Heyka R, Kaufman A, Lewis J, Rocco M, Toto R, Windus D, et al. Cardiac diseases in maintenance hemodialysis patients: results of the HEMO Study. Kidney International. 2004;65:2380–2389. doi: 10.1111/j.1523-1755.2004.00657.x. [DOI] [PubMed] [Google Scholar]
- Collins AJ, Foley RN, Gilbertson DT, Chen SC. The state of chronic kidney disease, ESRD, and morbidity and mortality in the first year of dialysis. Clinical Journal of the American Society of Nephrology. 2009;4:S5. doi: 10.2215/CJN.05980809. [DOI] [PubMed] [Google Scholar]
- Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–935. doi: 10.1161/CIRCULATIONAHA.106.672402. [DOI] [PubMed] [Google Scholar]
- Cortese G, Andersen PK. Competing risks and time-dependent covariates. Biometrical Journal. 2010;52:138–158. doi: 10.1002/bimj.200900076. [DOI] [PubMed] [Google Scholar]
- Dabrowska DM. Non-parametric regression with censored survival time data. Scandinavian Journal of Statistics. 1987;14:181–197. [Google Scholar]
- Dabrowska DM. Uniform consistency of the kernel conditional Kaplan-Meier estimate. The Annals of Statistics. 1989;17:1157–1167. [Google Scholar]
- Dalrymple LS, Johansen KL, Chertow GM, Cheng SC, Grimes B, Gold EB, Kaysen GA. Infection-related hospitalizations in older patients with esrd. American Journal of Kidney Diseases. 2010;56:522–530. doi: 10.1053/j.ajkd.2010.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Datta S, Satten GA, Datta S. Nonparametric Estimation for the Three-Stage Irreversible Illness–Death Model. Biometrics. 2000;56:841–847. doi: 10.1111/j.0006-341x.2000.00841.x. [DOI] [PubMed] [Google Scholar]
- Fan J, Zhang W. Statistical estimation in varying coefficient models. Annals of Statistics. 1999;27:1491–1518. [Google Scholar]
- Fan J, Zhang W. Statistical methods with varying coefficient models. Statistics and its Interface. 2008;1:179. doi: 10.4310/sii.2008.v1.n1.a15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fine J, Jiang H, Chappell R. On semi-competing risks data. Biometrika. 2001;88:907–919. [Google Scholar]
- Foley R, Culleton B, Parfrey P, Harriett J, Kent G, Murray D, Barre PE. Cardiac disease in diabetic end-stage renal disease. Diabetologia. 1997;40:1307–1312. doi: 10.1007/s001250050825. [DOI] [PubMed] [Google Scholar]
- Folsom AR, Chambless LE, Ballantyne CM, Coresh J, Heiss G, Wu KK, Boerwinkle E, Mosley TH, Jr, Sorlie P, Diao G, et al. An assessment of incremental coronary risk prediction using C-reactive protein and other novel risk markers: the atherosclerosis risk in communities study. Archives of Internal Medicine. 2006;166:1368. doi: 10.1001/archinte.166.13.1368. [DOI] [PubMed] [Google Scholar]
- Gerds TA, Cai T, Schumacher M. The performance of risk prediction models. Biometrical Journal. 2008;50:457–479. doi: 10.1002/bimj.200810443. [DOI] [PubMed] [Google Scholar]
- Gerds TA, Schumacher M. Consistent estimation of the expected Brier score in general survival models with right-censored event times. Biometrical Journal. 2006;48:1029–1040. doi: 10.1002/bimj.200610301. [DOI] [PubMed] [Google Scholar]
- Hastie T, Tibshirani R. Varying-coefficient models. Journal of the Royal Statistical Society Series B (Methodological) 1993;55:757–796. [Google Scholar]
- Hatteville L, Mahe C, Hill C. Prediction of the long-term survival in breast cancer patients according to the present oncological status. Statistics in Medicine. 2002;21:2345–2354. doi: 10.1002/sim.1046. [DOI] [PubMed] [Google Scholar]
- Heagerty PJ, Zheng Y. Survival model predictive accuracy and roc curves. Biometrics. 2005;61:92–105. doi: 10.1111/j.0006-341X.2005.030814.x. [DOI] [PubMed] [Google Scholar]
- Herzog CA, Mangrum JM, Passman R. Non-coronary heart disease in dialysis patients: Sudden cardiac death and dialysis patients. Seminars in Dialysis. 2008;21:300–307. doi: 10.1111/j.1525-139X.2008.00455.x. [DOI] [PubMed] [Google Scholar]
- Hirschtick RE, Glassroth J, Jordan MC, Wilcosky TC, Wallace JM, Kvale PA, Markowitz N, Rosen MJ, Mangura BT, Hopewell PC, et al. Bacterial pneumonia in persons infected with the human immunodeficiency virus. New England Journal of Medicine. 1995;333:845. doi: 10.1056/NEJM199509283331305. [DOI] [PubMed] [Google Scholar]
- Jiang H, Fine JP, Chappell R. Semiparametric analysis of survival data with left truncation and dependent right censoring. Biometrics. 2005;61:567–575. doi: 10.1111/j.1541-0420.2005.00335.x. [DOI] [PubMed] [Google Scholar]
- Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. New York: John Wiley & Sons; 2002. [Google Scholar]
- Kay R. A Markov model for analysing cancer markers and disease states in survival studies. Biometrics. 1986;42:855–865. [PubMed] [Google Scholar]
- Kerr KF, Pepe MS. Joint modeling, covariate adjustment, and interaction: contrasting notions in risk prediction models and risk prediction performance. Epidemiology. 2011;22:805. doi: 10.1097/EDE.0b013e31823035fb. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klein JP, Keiding N, Copelan EA. Plotting summary predictions in multistate survival models: Probabilities of relapse and death in remission for bone marrow transplantation patients. Statistics in Medicine. 1994;12:2315–2332. doi: 10.1002/sim.4780122408. [DOI] [PubMed] [Google Scholar]
- Lawless JF, Yuan Y. Estimation of prediction error for survival models. Statistics in Medicine. 2010;29:262–274. doi: 10.1002/sim.3758. [DOI] [PubMed] [Google Scholar]
- Lee SJ, Klein JP, Barrett AJ, Ringden O, Antin JH, Cahn JY, Carabasi MH, Gale RP, Giralt S, Hale GA, et al. Severity of chronic graft-versus-host disease: association with treatment-related mortality and relapse. Blood. 2002;100:406–414. doi: 10.1182/blood.v100.2.406. [DOI] [PubMed] [Google Scholar]
- Murphy ST, Parfrey PS. The impact of anemia correction on cardiovascular disease in end-stage renal disease. Seminars in Nephrology. 2000;20:350. [PubMed] [Google Scholar]
- Osborne CK. Tamoxifen in the treatment of breast cancer. New England Journal of Medicine. 1998;339:1609. doi: 10.1056/NEJM199811263392207. [DOI] [PubMed] [Google Scholar]
- Parast L, Cai T. Landmark prediction of survival. Harvard University Biostatistics Working Paper Series. 2010:123. http://www.bepress.com/harvardbiostat/paper123/
- Parast L, Cheng SC, Cai T. Incorporating short-term outcome information to predict long-term survival with discrete markers. Biometrical Journal. 2011;53:294–307. doi: 10.1002/bimj.201000150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park B, Kim W, Ruppert D, Jones M, Signorini D, Kohn R. Simple transformation techniques for improved non-parametric regression. Scandinavian Journal of Statistics. 1997;24:145–163. [Google Scholar]
- Park Y, Wei LJ. Estimating subject-specific survival functions under the accelerated failure time model. Biometrika. 2003;90:717–23. [Google Scholar]
- Pencina MJ, D’Agostino RB, Sr, D’Agostino RB, Jr, Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Statistics in Medicine. 2008;27:157–172. doi: 10.1002/sim.2929. [DOI] [PubMed] [Google Scholar]
- Pollard D. Empirical Processes: Theory and Applications. Hayward, CA: Institute of Mathematical Statistics; 1990. [Google Scholar]
- Putter H, van der Hage J, de Bock GH, Elgalta R, van de Velde CJH. Estimation and Prediction in a Multi-State Model for Breast Cancer. Biometrical Journal. 2006;48:366–380. doi: 10.1002/bimj.200510218. [DOI] [PubMed] [Google Scholar]
- Robins JM, Ritov Y. Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. Statistics in Medicine. 1997;16:285–319. doi: 10.1002/(sici)1097-0258(19970215)16:3<285::aid-sim535>3.0.co;2-#. [DOI] [PubMed] [Google Scholar]
- Robins JM, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. Aids Epidemiology, Methodological Issues. 1992:297–331. [Google Scholar]
- Rotnitzky A, Robins J. Inverse probability weighted estimation in survival analysis. Encyclopedia of Biostatistics. 2005:2619–2625. [Google Scholar]
- Sands JJ, Etheredge GD, Shankar A, Graff J, Loeper J, McKendry M, Farrell R. Predicting hospitalization and mortality in end-stage renal disease (ESRD) patients using an Index of Coexisting Disease (ICED)-based risk stratification model. Disease Management. 2006;9:224–235. doi: 10.1089/dis.2006.9.224. [DOI] [PubMed] [Google Scholar]
- Saravanan P, Davidson NC. Risk Assessment for Sudden Cardiac Death in Dialysis Patients. Circulation: Arrhythmia and Electrophysiology. 2010;3:553. doi: 10.1161/CIRCEP.110.937888. [DOI] [PubMed] [Google Scholar]
- Siannis F, Farewell V, Head J. A multi-state model for joint modeling of terminal and non-terminal events with application to Whitehall II. Statistics in Medicine. 2007;26:426–442. doi: 10.1002/sim.2342. [DOI] [PubMed] [Google Scholar]
- Swets JA, Pickett RM. Evaluation of Diagnostic Systems: Methods from Signal Detection Theory. Academy press; New York: 1982. [Google Scholar]
- Tian L, Cai T, Goetghebeur E, Wei L. Model evaluation based on the sampling distribution of estimated absolute prediction error. Biometrika. 2007;94:297–311. [Google Scholar]
- Uno H, Cai T, Tian L, Wei LJ. Evaluating prediction rules for t-year survivors with censored regression models. Journal of the American Statistical Association. 2007;102:527–37. [Google Scholar]
- USRDS. US Renal Data System 2007 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases; 2007. [Google Scholar]
- USRDS. US Renal Data System 2010 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases; 2010. [Google Scholar]
- Van Houwelingen HC, Putter H. Dynamic predicting by landmarking as an alternative for multi-state modeling: an application to acute lymphoid leukemia data. Lifetime Data Analysis. 2008;14:447–463. doi: 10.1007/s10985-008-9099-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wand M, Marron J, Ruppert D. Transformations in density estimation. Journal of the American Statistical Association. 1991;86:343–353. [Google Scholar]
- Wang TJ, Gona P, Larson MG, Tofler GH, Levy D, Newton-Cheh C, Jacques PF, Rifai N, Selhub J, Robins SJ, et al. Multiple biomarkers for the prediction of first major cardiovascular events and death. New England Journal of Medicine. 2006;355:2631–2639. doi: 10.1056/NEJMoa055373. [DOI] [PubMed] [Google Scholar]
- Wilson PWF, D’Agostino RB, Levy D, Belanger AM, Silbershatz H, Kannel WB. Prediction of coronary heart disease using risk factor categories. Circulation. 1998;97:1837–1847. doi: 10.1161/01.cir.97.18.1837. [DOI] [PubMed] [Google Scholar]
- Zheng Y, Cai T, Pepe MS, Levy WC. Time-dependent predictive values of prognostic biomarkers with failure time outcome. Journal of the American Statistical Association. 2008;103:362–368. doi: 10.1198/016214507000001481. [DOI] [PMC free article] [PubMed] [Google Scholar]