Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Aug 21.
Published in final edited form as: J Am Stat Assoc. 2012 Aug 21;107(500):1492–1501. doi: 10.1080/01621459.2012.721281

Landmark Prediction of Long Term Survival Incorporating Short Term Event Time Information

Layla Parast 1, Su-Chun Cheng 2, Tianxi Cai 3
PMCID: PMC3535339  NIHMSID: NIHMS415544  PMID: 23293405

Summary

In recent years, a wide range of markers have become available as potential tools to predict risk or progression of disease. In addition to such biological and genetic markers, short term outcome information may be useful in predicting long term disease outcomes. When such information is available, it would be desirable to combine this along with predictive markers to improve the prediction of long term survival. Most existing methods for incorporating censored short term event information in predicting long term survival focus on modeling the disease process and are derived under restrictive parametric models in a multi-state survival setting. When such model assumptions fail to hold, the resulting prediction of long term outcomes may be invalid or inaccurate. When there is only a single discrete baseline covariate, a fully non-parametric estimation procedure to incorporate short term event time information has been previously proposed. However, such an approach is not feasible for settings with one or more continuous covariates due to the curse of dimensionality. In this paper, we propose to incorporate short term event time information along with multiple covariates collected up to a landmark point via a flexible varying-coefficient model. To evaluate and compare the prediction performance of the resulting landmark prediction rule, we use robust non-parametric procedures which do not require the correct specification of the proposed varying coefficient model. Simulation studies suggest that the proposed procedures perform well in finite samples. We illustrate them here using a dataset of post-dialysis patients with end-stage renal disease.

Keywords: Landmark Prediction, Risk Prediction, Survival Time, Varying Coefficient Model

1 Introduction

In the quest to provide accurate prognosis of disease for patients it has become evident that a wide range of biological and genetic markers hold great potential in improving prediction. The prediction potential of these markers has led to a more personalized approach to individual patient care (Bauer et al., 2007; Wilson et al., 1998; Folsom et al., 2006). Incorporating short term outcomes in addition to these markers may further improve prediction accuracy. For example, the extent to which change in oncological status after surgery can predict long term survival for breast cancer patients was examined in Hatteville et al. (2002). Additionally, treatment response in breast cancer patients may provide useful information for prediction of future events (Osborne, 1998). Hirschtick et al. (1995) demonstrated an association between occurrence of bacterial pneumonia and death among HIV-positive patients. Recently, the association between acute graft versus host disease and survival of acute leukemia patients following bone marrow or stem cell transplantation was investigated in Cortese & Andersen (2010) and Lee et al. (2002). Among end-stage renal disease patients, the relationship between an early hospitalization and risk of death has been examined by Collins et al. (2009) and USRDS (2010).

In the aforementioned examples, the long term outcome of interest is often time to a terminal event e.g. death and the short term outcome is often time to a non-terminal event. This setting is referred to as a semi-competing risk setting since the occurrence of the terminal event would censor the non-terminal event but not vice versa. Throughout, we let TInline graphic and TInline graphic denote event times for the short term and long term events, respectively. Most existing methods for analyzing semi-competing risk data focus on estimation of the cause-specific hazard functions and/or the joint distribution of events under semi-parametric models (Fine et al., 2001; Siannis et al., 2007; Jiang et al., 2005). The joint distribution estimators can potentially be used to make predictions about TInline graphic given information on TInline graphic. Methods developed specifically to incorporate TInline graphic information and baseline predictors for the prediction of TInline graphic have concentrated primarily on semi-parametric models in a multi-state framework (Putter et al., 2006; Kay, 1986; Klein et al., 1994; Datta et al., 2000; Hatteville et al., 2002; Cortese & Andersen, 2010). For example, Klein et al. (1994) use a standard Cox regression analysis to incorporate information on whether or not the intermediate event has occurred by a certain time. Hatteville et al. (2002) extend this method to include non-proportional hazards and propose estimation procedures. Recently, Van Houwelingen & Putter (2008) proposed a landmark prediction procedure to incorporate the status of the short term event by a landmark point t0.

These methods provide useful tools to leverage information about TInline graphic to improve the prediction of TInline graphic. However, these simple semi-parametric models may not capture the relationship among the two outcomes and the covariates in practice. As an example, TInline graphic may be highly associated with unknown subtypes of disease and the effect of observed covariates on TInline graphic may vary with the disease subtype. Commonly used proportional hazards models may be inadequate for data from such a mixture of populations. When there is only a single discrete baseline covariate, a fully non-parametric procedure was proposed in Parast et al. (2011) to estimate the conditional distribution of the residual life from a landmark time t0, TInline graphict0, given both TInline graphic and the covariate among subjects who survive past t0. However, such an approach is not feasible in settings with one or more continuous covariates due to the curse of dimensionality (Robins & Ritov, 1997). To overcome such limitations, we extend the landmark Cox model of Van Houwelingen & Putter (2008) and propose a flexible varying-coefficient working model to incorporate TInline graphic information along with a vector of covariates that may consist of information collected up to t0. The varying-coefficient model can potentially approximate the underlying disease process better compared to commonly used semi-parametric models. Landmarking along with an inverse probability weighting (IPW) approach (Rotnitzky & Robins, 2005; Robins & Rotnitzky, 1992) enables us to overcome both non-informative dropout and informative censoring issues for TInline graphic due to competing risks. In contrast to Parast et al. (2011), our proposed approach can also incorporate longitudinal predictor information collected up to t0, such as repeated biomarker measurements and the frequency of complications or adverse events up to t0.

An important follow up step in developing methods for risk prediction is to objectively assess its predictive performance. Due to the complexity of the disease process as mentioned above, it is important to estimate the prediction accuracy measures non-parametrically without requiring that the fitted statistical models hold. To evaluate the performance of the proposed landmark prediction rule, we make inference about the accuracy measures using robust non-parametric procedures. We provide inference procedures for comparing prediction rules and resampling based procedures for interval estimation. Results from our simulation studies suggest that our proposed procedures work well in finite samples and the flexibility of our varying coefficient model could lead to a substantial gain in accuracy compared to a landmark Cox model based approach. For illustration, we apply our procedures to a dataset of post-dialysis patients with end-stage renal disease and develop a landmark risk model for kidney transplant free survival incorporating time to first hospitalization as well as several available predictors.

2 Estimation

2·1 Notations and Settings

We consider the semi-competing risk setting where TInline graphic is a terminal event and can only be censored by the administrative censoring, C, not TInline graphic, while TInline graphic may be censored by both TInline graphic and C. For example, TInline graphic may represent time to a nonfatal stroke or hospitalization and TInline graphic may represent time to death. Considering the potential difference in the underlying disease process, patients who experience the short term outcome may have substantially different clinical outcomes from the general patient population. Therefore, it may be important to incorporate information on the short term outcome in the prediction of a long term outcome. In this paper, we focus on predicting the risk of TInline graphict0 + τ among subjects whose TInline graphic is greater than a pre-defined landmark point t0 for any given τ. We denote this subset of subjects as ΩTInline graphic>t0 = {TInline graphic > t0}. For example, if t0 is a 1 year check-up appointment from the time of surgery and TInline graphic is death, we are interested in predicting future mortality among patients who have survived at least 1 year after surgery. Our goal is two-fold: 1) to accurately assess the probability of survival to time t0 + τ among ΩTInline graphic>t0 using information on TInline graphic and a vector of covariates Z and 2) to quantify how much prediction accuracy was improved by incorporating TInline graphic information compared to using only covariates Z. When short term outcome information is used, it is essential that the prediction only use information on TInline graphic up to the landmark point t0. Note that though we condition on patients surviving up to t0, these patients may still experience both TInline graphic and TInline graphic after t0.

Our potential data for analysis is composed of n independent and identically distributed random vectors {Di = (XInline graphici, δInline graphici, XInline graphici, δInline graphici, Inline graphic)′, i = 1, …, n}, where XInline graphic = min(TInline graphic, C), XInline graphic = min(TInline graphic, C), δInline graphic = I(TInline graphicC), δInline graphic = I(TInline graphicC), and Inline graphic indicates all available collected covariate information up to t0. We let Z represent the vector version of Inline graphic. It is important to note that XInline graphici is not always observable due to competing risks, however, given our interest in the subset, ΩTInline graphic>t0, the information concerning XInline graphici required for our proposed estimation procedure is always observable. For details see Section 2·3. We assume that C is independent of (TInline graphic, TInline graphic, Z) with a survival function G(·) and P(XInline graphic > t0 + τ, XInline graphic > t0)P(TInline graphict0) > 0. Without specifying the true model for the conditional survival, such an independent censoring assumption is often required to enable valid estimation of the prediction performance measures (defined in Section 2·4) due to the curse of dimensionality (Robins & Ritov, 1997). When censoring does depend on Z, one may relax such assumptions by imposing a semi-parametric model on C given Z, see Section 6.

2·2 Landmark Risk Prediction for the Subpopulation ΩTInline graphic>t0

2·2.1 Landmark Prediction with Z only

For comparison, we are first interested in developing landmark prediction rules based on Z only, which would enable us to assess the incremental value of TInline graphic in prediction. To estimate Pt0,τ(1)(z)=P(TLt0+τTL>t0,Z=z), we consider a working time-specific generalized linear model (GLM)

Pt0,τ(1)(Z)=g(φt0Z), (2·1)

where g is a known, strictly increasing link function and φt0 is a vector of unknown regression coefficients. Model (2·1) allows the effect of Z to vary over t0 which is attractive in settings where markers predictive of short term risk may not be predictive of long term risk. In the presence of censoring, φt0 may be estimated by modifying the IPW estimator proposed in Uno et al. (2007) to incorporate landmarking. Specifically, we estimate φt0 as φ̂t0, the solution to:

iW^LiZi[I(XLit0+τ)-g(φt0Zi)]=0, (2·2)

where

W^Li=I(t0<XLit0+τ)δLiG^(XLi)+I(XLi>t0+τ)G^(t0+τ),

and Ĝ(·) is the Kaplan-Meier estimator of G(·). Note that ŴInline graphici = 0 for all subjects with XInline graphici > t0. By using this weight, we essentially use the subgroup with XInline graphic > t0 as an unbiased random sample for the subpopulation ΩTInline graphic>t0 since P(TInline graphict0 + τ | XInline graphic > t0) = P(TInline graphict0 + τ | TInline graphic > t0) under independent censoring.

2·3 Landmark Prediction with Z and TInline graphic

We now consider developing landmark prediction rules to estimate the probability that TInline graphict0 + τ given Z and the information on TInline graphic up to t0 for subjects in ΩTInline graphic>t0,

Pt0,τ(2)(TS,Z)=P(TLt0+τTL>t0,min(TS,t0),I(TSt0),Z)=Pt0,τ,TS,ZI(TSt0)+Pt0,τ,ZI(TS>t0)

where

Pt0,τ,ts,z=P(TLt0+τTL>t0,TS=Ts,Z=z)ifTS=tst0 (2·3)
Pt0,τ,z=P(TLt0+τTL>t0,TS>t0,Z=z)ifTS>t0 (2·4)

as in Parast et al. (2011). The quantity Pt0,τ,ts,z pertains to subjects with TInline graphict0 while Pt0,τ,z pertains to those with TInline graphic > t0. The conditional risk Pt0,τ,Z can also be approximated via a time-specific GLM similar to (2·1), Pt0,τ,Z=g(βt0Z), and the unknown model parameter βt0 can be estimated as β̂t0, the solution to:

S^(β)iI(TSi>t0)W^LiZi[I(XLit0+τ)-g(βt0Zi)]=0 (2·5)

Here, we now use the subgroup with {XInline graphic > t0, TInline graphic > t0} as an unbiased random sample for the subpopulation {TInline graphic > t0, TInline graphic > t0} since P(TInline graphict0 + τ, | XInline graphic > t0, XInline graphic > t0) = P(TInline graphict0 + τ, | TInline graphic > t0, TInline graphic > t0) under independent censoring. Now, to approximate Pt0,τ,tsz, we consider a varying coefficient model (Hastie & Tibshirani, 1993; Fan & Zhang, 1999; Fan & Zhang, 2008):

Pt0,τ,ts,Z=g(βtsZ)tst0 (2·6)

where g is a known, strictly increasing link function and βts is vector of unknown functions of ts. This model allows the effect of Z on the risk to vary over the short term outcome. To estimate βts in the presence of censoring, we propose to obtain β̂ts as the solution to the following IPW kernel smoothed estimating equation:

S^ts(β)iI(TSit0)W^LiZiKh{ψ(TSi)-ψ(ts)}[I(XLit0+τ)-g(βtsZi)]=0 (2·7)

where Kh(x) = h−1K(x/h), K(·) is a known smooth symmetric kernel density function with a bounded support, ψ: [0, ∞) → (−∞, ∞) is a given transformation function and the bandwidth h > 0 is assumed to be O(nv), for 1/5 < v < 1/2. To potentially improve estimation one can implement an appropriate transformation ψ(·) such as log(·)(Wand et al., 1991; Park et al., 1997). Combining β̂TInline graphic and β̂t0, we can now estimate Pt0,τ(2)(TS,Z) as

P^t0,τ(2)(TS,Z)=g(β^TSZ)I(TSt0)+g(β^t0Z)I(TS>t0).

Note that the observed data have sufficient information on TInline graphic and TInline graphic for the estimation of (2·3) and (2·4). The conditional risk (2·3) only involves the subgroup with TInline graphict0 < TInline graphic and since XInline graphici > t0 implies Ci > t0, when XInline graphici > t0 and XInline graphicit0, then TInline graphici = XInline graphici and I(TInline graphicit0) is observable. The probability in (2·4) involves the subgroup with min(TInline graphic, TInline graphic) > t0 and can be obtained based on the subset of patients with min(XInline graphic, XInline graphic) > t0 which is always observable.

It is known that the choice of the smoothing parameter h when estimating βts is critical as in any non-parametric functional estimation problem. To obtain an appropriate h we use a standard K-fold cross-validation procedure to obtain hopt, the minimizer of an appropriately weighted mean squared error estimate. It is common practice to undersmooth, due to the difficulty in estimating the bias, and obtain an estimator whose variance dominates bias. For example, to obtain the final bandwidth for the estimation of βts, one can choose h = hoptnc0, for some c0 ∈ (0, 3/10). Estimation is then performed over a region of ts defined as [ψ−1(ρl + h), ψ−1(ρrh)] where (ρl, ρr) is a subset of ψ(ts). In all numerical examples, (ρl, ρr) is taken to be the (0.02, 0.98) percentiles of the observed XInline graphic values.

2·4 Information gained by incorporating short-term outcome

To investigate the incremental value in predicting I(TInline graphict0 + τ) gained from incorporating the short-term outcome TInline graphic information, we compare the predicted individual risk for subjects using (TInline graphic, Z) with that using Z alone. A wide range of accuracy measures have been proposed in the literature to quantify and compare the predictive performance of risk models. Interesting discussions on this topic can be found in Heagerty & Zheng (2005), Cook (2007), Gerds et al. (2008), Pencina et al. (2008), Kerr & Pepe (2011). For illustration purposes, we focus on two commonly used measures, the Brier score (Brier, 1950; Gerds & Schumacher, 2006; Lawless & Yuan, 2010) and the area under the Receiver Operating Characteristic (ROC) (AUC) curve (Swets & Pickett, 1982; Heagerty & Zheng, 2005). The Brier score and the AUC of a risk function p(TInline graphic, Z) in the context of landmark prediction can be defined as BSt0,τ= E{[I(TInline graphicit0 + τ) − p(TInline graphici, Zi)]2|TInline graphici > t0} and AUCt0,τ = P{p(TInline graphici, Zi) > p(TInline graphicj, Zj) | t0 < TInline graphicit0 + τ, TInline graphicj > t0 + τ}. Similar to Gerds & Schumacher (2006), Uno et al. (2007) and Parast et al. (2011), for each prediction model, these measures can be estimated empirically using an IPW approach. Specifically, if the jth model estimates the conditional risk for the ith subject as P^i(j), then

BS^t0,τ(j)=iW^Li{I(t0<XLit0+τ)-P^i(j)}2iW^LiI(XLi>t0) (2·8)
AUC^t0,τ(j)=i:XLit0+τj:XLj>t0+τI(P^i(j)>P^j(j))W^LiW^Lji:XLit0+τj:XLj>t0+τW^LiW^Lj. (2·9)

consistently estimate BSt0,τ and AUCt0,τ for the corresponding risk model, respectively. Based on (2)(TInline graphici, Zi) and (1)(Zi), the incremental value of TInline graphic can be quantified by BS^t0,τ(1)-BS^t0,τ(2) and AUC^t0,τ(2)-AUC^t0,τ(1). In addition, one may quantify the incremental value of TInline graphic by the additional proportion of variation explained by model (2) relative to model (1), defined as IPEV2/1=1-BS^t0,τ(2)/BS^t0,τ(1). When the sample size is not large, it is important to correct for over-fitting bias via procedures such as cross-validation.

3 Inference

To make inference about βts, it can be shown using the weak convergence of n12{G^(t)-G(t)} to a zero-mean Gaussian process (Kalbfleisch & Prentice, 2002), that the uncertainty of Ĝ(·) in ŴInline graphici is of order n-12, which is negligible compared to the standard non-parametric rate of {(nh)−1 log(n)}1/2. Therefore, the proposed estimator is asymptotically equivalent to the estimator obtained by replacing ŴInline graphici with WInline graphici. Using similar arguments as given by Bickel & Rosenblatt (1973), Dabrowska (1989) and Cai et al. (2010), one can show that β̂tsβts in probability as n → ∞ regardless of whether (2·6) is correctly specified, where βts is the unique solution to the estimating equation, E[Zi{I(TLit0+τ)-g(βtsZi)}TLi>t0,TSi=ts]=0. In addition, one can show that for any given tst0 and h = Op(nv) with v ∈ (1/5, 1/2), B^(ts)=(nh)12(β^ts-βts)=(nh)-12i=1nKh{ψ(TSi)-ψ(ts)}ξ(Di)+op{(nh)-12} for some fixed function ξ. Thus, it follows from similar arguments as given in Bickel & Rosenblatt (1973) that suptst0B^(ts)=Op{(nh)-12log(n)12} and (ts) converges in distribution to a zero-mean normal random variable for any given ts.

To construct a confidence interval (CI) for βts in practice, we employ a perturbation-resampling method (Park & Wei, 2003; Cai et al., 2005; Tian et al., 2007) to approximate the distributions of the proposed estimators. Let { V(b)=(V1(b),,Vn(b)),b=1,B} be n × B independent copies of a positive random variable V from a known distribution with mean and variance equal to one. Then one may obtain perturbed estimates of βts, β^ts(b), either as the solution to

S^ts(b)(β)iI(TSit0)W^Li(b)Kh{ψ(TSi)-ψ(ts)}Zi[I(XLit0+τ)-g(βZi)]Vi(b)=0

or explicitly as β^ts(b)=β^ts+A^ts(β^ts)S^ts(b)(β^ts), where Inline graphic(β) = −Ŝts(β)/β,

W^Li(b)=I(t0<XLit0+τ)δLiG^(b)(XLi)+I(XLi>t0+τ)G^(b)(t0+τ)

and Ĝ(b) is the Kaplan-Meier estimator of G(·) with weights Vi(b). Note that, although the variation in Ĝ is negligible asymptotically for assessing the variation of β̂ts, we find that accounting for this variation provides a better approximation in finite samples and thus perturb Ĝ with Ĝ(b) to obtain β^ts(b). The variance of β̂ts, σ^βts2, can be estimated as the empirical variance of { β^ts(b),b=1,,B}. Such a perturbation procedure also enables us to construct 100(1 − α)% simultaneous confidence bands for {βts, 0 < ts0< tst0} based on {β̂ts ± ζασ̂βts}, where ζα is the upper 100(1 − α)th percentile of { supts0<ts<t0β^ts(b)-β^ts/σ^βts,b=1,,B} and ts0 is selected such that P(TInline graphic < ts0) > 0.

For {β̂t0, φ̂t0}, it can be shown using similar arguments as given by Uno et al. (2007) and Parast & Cai (2010) that they converge in probability to {βt0, φt0}, the respective solutions to E[Zi{I(TLit0+τ)-g(βt0Z)}TLi>t0,TSi>t0]=0 and E[Zi{I(TLit0+τ)-g(φt0Zi)}TLi>t0]=0. Further more, { n(β^t0-βt0),n(φ^t0-φt0)} converges jointly in distribution to a multivariate normal. Similar perturbation procedures can be used to construct CIs for βt0 and φt0. For the accuracy measure estimates, it is not difficult to justify their root-n consistency and normality using the uniform consistency of the risk estimates and functional central limit theorem Pollard (1990) along with modifications of the arguments given in Zheng et al. (2008). We note that although β̂ts is estimated at a slower non-parametric rate, the estimated overall accuracy parameters remain the standard parametric rate of root-n since they are essentially integrated over the distribution of TInline graphic. To overcome the bias and variance trade-off, we choose to under smooth with h = O(nv) and v ∈ (1/4, 1/2). The CIs for the Brier score and AUC can be constructed by perturbing (2·8) and (2·9) along with all the estimated model parameters and Ĝ. For all estimators with standard convergence rate of root-n, the variation due to Ĝ is no longer negligible and hence it is important to use the perturbed censoring weight W^Li(b) to account for the sampling variations.

4 Simulations

We conducted simulation studies to examine the finite-sample properties of the proposed estimation procedures. For simplicity, we consider a single continuous covariate Z generated from a Normal(0,4) distribution and the survival times TInline graphic and TInline graphic from the model:

TS=exp(-ηZi+ε1)TL=exp(-η[Zilog(TS)-log(TS)+Zi]+ε2)whereε1~N(μ1,σ12)ε2~N(μ2,σ22) (4·1)

and η, μ1, μ2, σ12, and σ22 were chosen in order to produce (i) a low-event rate setting and (ii) a moderate-event rate setting. In each setting, we considered a sample size of n=2,000 and 5,000. Throughout, the results are summarized based on 1000 simulated data sets for each setting. For illustration, we chose t0 to be year 2 and τ to be 8 years. Therefore, among patients who have not experienced the long-term event by year 2, TInline graphic > 2, we wish to estimate their probability of experiencing the long term event before year 10. Throughout all of the numerical examples, we let g be the logit link function. For the estimation of t0,τ,ts,z using (2·7), we let K(·) be the gaussian kernel and ψ(x) = log(x). The smoothing parameter h was obtained using the 3-fold cross-validation scheme and c0 = 1/10 and hence h = O(n−0.3). The standard error (SE) estimates were obtained based on B = 500 perturbations each.

In setting (i), the low event rate setting, the survival times were generated from (4·1) with η = 0.4, μ1 = 1.8, μ2 = 3.3, σ12=1.0 , and σ22=1.2. Under these conditions, P(TInline graphict0 | TInline graphic > t0) = 0.20 and P(TInline graphic > t0) = 0.98 and P(TInline graphict0 + τ |TInline graphic > t0) = 0.13. We consider the case when C was generated from a Weibull distribution with shape = 1.6 and scale = 20. With this censoring pattern, 30% of observations are censored for TInline graphic, 80% are censored for TInline graphic and P(TInline graphict0, δInline graphic = 1 | XInline graphic > t0) = 0.18 and P(XInline graphic > t0) = 0.96 and P(TInline graphict0 + τ, δInline graphic = 1|XInline graphic > t0) = 0.11. Estimates of βts in Pt0,τ,ts,z for t0 = 2 and τ = 8 and n = 2, 000 and 5, 000 are shown for multiple values of ts in Table 1(i). Values of ts in the table were chosen as the 20th, 40th, 60th, and 80th percentiles of TInline graphic < t0. The SEs estimated from the perturbation resampling method are close to the empirical SEs with higher SEs in the case of a lower sample size as expected. In addition, the SEs for both intercept and slope estimates tend to be higher for smaller values of ts possibly due to smoothing when there are fewer events in this time period as is typically seen in non-parametric estimation. All estimates have slight but negligible bias and the empirical coverage levels are close to their nominal level.

Table 1.

Bias estimates for β̂ts with corresponding averages of the SE estimates from the perturbation-resampling method (ASE), empirical SEs (ESE), and empirical 95% coverage levels (Coverage) from setting (i), the low event rate setting and (ii), the moderate event rate setting. Values of ts were chosen as the 20th, 40th, 60th, and 80th percentiles of TInline graphict0. All entries are multiplied by 100.

(i) low event rate setting

n=2000

Intercept Slope

ts t0.20 t0.40 t0.60 t0.80 t0.20 t0.40 t0.60 t0.80

Bias 5.6 3.2 1.8 0.7 −2.0 −0.1 1.7 3.6
ASE 39.8 33.3 29.0 26.8 14.5 13.2 12.6 12.6
ESE 41.6 35.2 29.8 26.7 15.5 14.7 13.3 12.6
Coverage 93.6 93.5 94.3 94.5 93.8 92.7 92.5 93.4

n=5000

Intercept Slope

ts t0.20 t0.40 t0.60 t0.80 t0.20 t0.40 t0.60 t0.80

Bias 1.9 0.8 0.8 −0.2 −0.7 0.8 1.8 3.5
ASE 33.7 27.4 23.0 20.2 11.8 10.5 9.8 9.5
ESE 35.4 28.3 22.6 20.2 12.6 10.9 9.8 10.0
Coverage 93.9 94.0 95.6 94.5 92.9 93.1 94.4 91.1

(ii) moderate event rate setting

n=2000

Intercept Slope

ts t0.20 t0.40 t0.60 t0.80 t0.20 t0.40 t0.60 t0.80
Bias −26.3 −2.8 3.5 5.3 10.2 2.1 −0.3 −3.4
ASE 84.7 50.4 31.6 21.2 29.0 19.2 18.1 19.4
ESE 92.2 53.4 31.9 21.7 31.9 20.3 18.4 21.1
Coverage 93.3 95.0 95.1 94.7 92.7 94.6 94.7 93.8

n=5000

Intercept Slope

ts t0.20 t0.40 t0.60 t0.80 t0.20 t0.40 t0.60 t0.80

Bias −3.5 −3.1 4.2 5.7 −1.0 2.3 −0.2 −3.6
ASE 51.6 31.1 19.6 13.3 17.7 12.0 11.3 12.2
ESE 52.0 33.5 20.4 13.4 18.2 12.5 11.9 12.5
Coverage 95.8 94.0 93.3 93.4 94.7 93.5 94.3 95.0

In setting (ii), the moderate event rate setting, the survival times were generated from (4·1) with η = 0.7, μ1 = 1.2, μ2 = 2.3, σ12=1.0, σ12 = 0.5, and σ22=1.3. Under these conditions, P(TInline graphict0 | TInline graphic > t0) = 0.35 and P(TInline graphic > t0) = 0.95 and P(TInline graphict0 + τ |TInline graphic > t0) = 0.23. We again consider the case when C was generated from a Weibull distribution with shape = 1.6 and scale = 20. With this censoring pattern, 25% of observations are censored for TInline graphic, 70% are censored for TInline graphic and P(TInline graphict0, δInline graphic = 1 | XInline graphic > t0) = 0.35 and P(XInline graphic > t0) = 0.92 and P(TInline graphict0 + τ, δInline graphic = 1|XInline graphic > t0) = 0.21. We again consider estimation of βts in Pt0,τ,ts,z and results are shown in Table 1(ii). The SEs estimated from the perturbation resampling method are again close to the empirical SEs with lower SEs in this event rate setting as compared to the lower event rate setting. Similar to the results in setting (i), SEs tend to by higher for smaller values of ts. All estimates have slight bias, however the empirical coverage levels are close to their nominal level. In both the low and moderate event rate setting, the procedures appear to perform well though care should be taken when there are very few events in a specific time interval as this could lead to some unstable estimates.

In both settings we examined the value of using the additional short-term outcome information in P^t0,τ(2)(TS,Z) using the procedures discussed in Section 2·4. Here we use a 3-fold cross-validation procedure to correct for over-fitting bias. Brier score and AUC estimates are shown in Tables 2 and 3. In the low event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.077 when n=2,000 and by 0.086 when n=5,000 after incorporating short-term outcome information compared to using Z information only. In the moderate event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.306 when n=2,000 and by 0.317 when n=5,000 after incorporating short-term outcome information compared to using Z information only. In the low event rate setting, including the short-term outcome information results in a 0.011 increase in AUC when n=2,000 and a 0.013 increase in AUC when n=5,000 compared to using Z information only. In the moderate event rate setting, including the short-term outcome information results in a 0.060 increase in AUC when n=2,000 and a 0.062 increase in AUC when n=5,000 compared to using Z information only. The SEs estimated from the perturbation resampling method are close to the empirical SEs with smaller SEs in the higher sample size setting as expected. Empirical coverage levels are close to their nominal level.

Table 2.

Average apparent and cross-validated (CV) estimates for the Brier score using TInline graphic, Z information ( BSt0,τ(2)) and Z information only ( BSt0,τ(1)), and the increase in proportion of explained variation(IPEV) with corresponding empirical SEs (ESE), average of the SE estimates from the perturbation-resampling method (ASE) for the apparent estimates and empirical coverage levels (Cvg) from setting (i), the low event rate setting and (ii), the moderate event rate setting. All entries are multiplied by 100.

(i) low event rate setting

n=2000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

BSt0,τ(2)
7.6 0.4 0.5 7.9 0.4 95.6
BSt0,τ(1)
8.5 0.4 0.5 8.6 0.5 95.2
IPEV2/1 10.7 2.4 2.8 7.7 2.8 90.3

n=5000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

BSt0,τ(2)
7.7 0.3 0.3 7.8 0.3 95.5
BSt0,τ(1)
8.5 0.3 0.3 8.6 0.3 93.7
IPEV2/1 10.4 1.5 1.7 8.6 1.6 95.3

(ii) moderate event rate setting

n=2000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

BSt0,τ(2)
9.6 0.4 0.5 10.2 0.5 89.8
BSt0,τ(1)
14.6 0.5 0.5 14.7 0.5 94.9
IPEV2/1 33.9 2.7 2.9 30.6 2.8 92.2

n=5000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

BSt0,τ(2)
9.8 0.3 0.3 10.0 0.3 93.0
BSt0,τ(1)
14.6 0.3 0.3 14.6 0.3 94.9
IPEV2/1 32.9 1.7 1.8 31.7 1.8 94.6

Table 3.

Average apparent and cross-validated (CV) estimates for the AUC using TInline graphic, Z information ( AUCt0,τ(2)) and Z information only ( AUCt0,τ(1)) with corresponding empirical SEs (ESE), average of the SE estimates from the perturbation-resampling method (ASE) for the apparent estimates and empirical coverage levels (Cvg) from setting (i), the low event rate setting and (ii), the moderate event rate setting. All entries are multiplied by 100.

(i) low event rate setting

n=2000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

AUCt0,τ(2)
90.0 0.9 1.0 89.3 1.0 91.5
AUCt0,τ(1)
88.2 1.0 1.0 88.2 1.0 93.8

n=5000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

AUCt0,τ(2)
90.0 0.6 0.6 89.6 0.6 93.8
AUCt0,τ(1)
88.3 0.6 0.6 88.3 0.6 94.7

(ii) moderate event rate setting

n=2000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

AUCt0,τ(2)
92.0 0.7 0.8 91.2 0.8 92.6
AUCt0,τ(1)
85.2 1.0 1.0 85.2 1.0 95.2

n=5000

Apparent CV (3-fold)

Average ESE ASE Average ESE Cvg

AUCt0,τ(2)
91.8 0.4 0.5 91.5 0.5 94.0
AUCt0,τ(1)
85.3 0.6 0.6 85.3 0.6 95.9

Additionally, we also consider the prediction accuracy of risk estimates based on a landmark Cox model as previously proposed in the literature (Van Houwelingen & Putter, 2008; Cortese & Andersen, 2010). Such a model would involve fitting a conditional proportional hazards working model for the residual life among ΩTInline graphic>t0 using baseline covariates along with a variable indicating whether the short term event had occurred by the landmark time. We denote the resulting risk estimate as P^t0,τ(cox)(TS,Zi). To evaluate the prediction rule resulting from this landmark Cox model approach, the corresponding Brier Score and AUC may be estimated by replacing P^t0,τ(2)(TSi,Zi) in (2·8) and (2·9) with P^t0,τ(cox)(TS,Zi) to obtain BS^t0,τ(2)(cox) and AUC^t0,τ(2)(cox), respectively. Simulations results comparing the accuracy estimates from the proposed model to those from the landmark Cox model are shown in Table 4. In the low event rate setting, the proportion of explained variation in predicting long-term t0 + τ survival increased by 0.063 and the AUC is 0.017 higher when n=5,000 when using the proposed approach to incorporate short-term outcome information compared to the landmark Cox model approach. In the moderate event rate setting, we observe a substantial increase in accuracy. When n = 5000, the proposed approach yields an increase in AUC of 0.057 and a 26% improvement in the proportion of variation explained.

Table 4.

Average cross-validated estimates using TInline graphic and Z information using the proposed procedure ( BSt0,τ(2),AUCt0,τ(2)) compared to using the landmark Cox model ( BSt0,τ(2)(cox),AUCt0,τ(2)(cox)) (with corresponding SE estimates). All entries are multiplied by 100.

(a) Brier score

BSt0,τ(2)
BSt0,τ(2)(cox)
IPEV2/cox

(i) low event rate
 n=2000 79.2 (0.42) 83.9 (0.45) 5.3 (7.6)
 n=5000 78.2 (0.27) 83.6 (0.29) 63.4 (4.8)
(ii) moderate event rate
 n=2000 10.2 (0.44) 13.5 (0.50) 24.6 (4.5)
 n=5000 10.0 (0.28) 13.5 (0.32) 25.9 (2.7)

(b) AUC

AUCt0,τ(2)
AUCt0,τ(2)(cox)
AUCt0,τ(2)-AUCt0,τ(2)(cox)

(i) low event rate
 n=2000 89.3 (0.93) 87.9 (1.01) 1.4 (1.54)
 n=5000 89.6 (0.58) 87.9 (0.65) 1.7 (0.93)
(ii) moderate event rate
 n=2000 91.2 (0.69) 85.8 (0.95) 5.4 (1.22)
 n=5000 91.5 (0.44) 85.8 (0.64) 5.7 (0.79)

5 Example

To illustrate our proposed procedures, we use a dataset of patients with end-stage renal disease (ESRD) who had initiated maintenance peritoneal or hemodialysis from the United States Renal Data System (USRDS) (USRDS, 2010). The USRDS consists of a total of 463,240 patients from 18 regional areas who initiated dialysis between 2000 and 2004. Our analysis focused on the subset of patients from one network consisting of six-states in the New England area without a previous kidney transplant. This subset consisted of 9,785 patients with individual information on time to death or kidney transplant, time to first hospitalization, and several baseline covariates measured at initiation of dialysis. Total time at risk was measured from the earliest outpatient date after initiation of dialysis until censoring at transplantation, death, loss to follow-up, or December 31, 2005. Time until first hospitalization was measured from the earliest outpatient date after initiation of dialysis to the first hospitalization date, when available.

We define TInline graphic as time to first hospitalization and TInline graphic as time to death or kidney transplant. Early hospitalizations are frequent in patients with ESRD and are associated with higher morbidity and risk of death (USRDS, 2007; Collins et al., 2009). Previous work has examined the incremental value of incorporating hospitalization time in predicting residual life in this dataset in the presence of a single discrete marker, erythropoietin use (Parast et al., 2011). While this marker is useful, it would be of great clinical interest to be able to incorporate information on the other clinical variables available. For our analyses, we include the following baseline covariates measured at initiation of dialysis based on results from a multivariate analysis: age, albumin, erythropoietin use, serum creatinine, and prior history of heart failure (HF), arrhythmia, chronic obstructive pulmonary disease (COPD), and cancer. There is an abundance of clinical literature examining associations of these demographic and clinical predictors with all-cause or cause-specific hospitalization and mortality (Sands et al., 2006; Barrett et al., 1997; Murphy & Parfrey, 2000). In addition, some patients are hospitalized multiple times prior to t0, which might indicate poor prognosis and hence a higher risk of death. Using our procedure, we are able to incorporate such information into prediction by adding the frequency of hospitalizations before t0 as a covariate.

There were a total of 5,480 first hospitalizations, 5,448 deaths, 111 transplantations and the 18 month hospitalization rate was about 0.57; the 18 month death or transplant rate was about 0.40. For illustration, we chose t0 to be 6 months and τ to be 1 year. Among the 7,679 patients who survived past 6 months, 37% were hospitalized within 6 months of initiating dialysis and 63% did not have a hospitalization before 6 months. We aim to make a prediction concerning 1-year survival among those who survived 6 months using the proposed procedures incorporating information on time of first hospitalization (TInline graphic) and frequency of hospitalizations between baseline and t0 as well as multiple baseline covariates. For example, the probability of interest, Pt0,τ,ts,z, for a patient with TInline graphic = ts = 24 days is displayed Figure 1.

Figure 1.

Figure 1

Given patient i is hospitalized 24 days after initiation of dialysis, TInline graphici = ts = 24, but survives past 6 months, Pt0,τ,ts,Zi is the probability of death within the shaded region when t0 = 6 months and τ = 1 year.

Estimates of βts are shown in Table 5 with corresponding SE estimates from the perturbation-resampling method. Figure 2 displays the estimates P^t0,τ(2)(ts,z)=P^t0,τ,ts,z=P^(TLt0+τTL>t0,TS=ts,Z=z)=g(β^tsz) over a range of ts for two sets of patients with TInline graphic = tst0 who differ only by history of HF. Both patients were hospitalized once before t0 and have the following same baseline covariate values: age = 76, albumin = 3.2, using erythropoietin, serum creatinine = 6.5, no prior history of arrhythmia, COPD, or cancer. However, one patient has prior history of HF (solid black thin line) while the other has no prior history of HF (solid black thick line). Using baseline covariate information only, P^t0,τ(1)(z), the predicted risk of having an event between 6 months and 1.5 years is equal to 0.21 for the patient with prior history of HF and 0.18 for the patient without prior history of HF regardless of hospitalization time. Note that P^t0,τ(1)(z) is the predicted risk for all patients who survived past 6 months whereas t0,τ,ts,z is for patients who survived past 6 months and were hospitalized before 6 months. Given that a hospitalization did not occur before 6 months, P^t0,τ(2)(ts,z)=P^t0,τ,z=0.16 and 0.14 for the patient with and without prior history of HF, respectively. These results are consistent with the clinical literature which shows a clear association between morbidity upon initiation of dialysis, particularly cardiovascular conditions, and higher risk of hospitalization and death in end-stage renal disease patients (USRDS, 2010; Foley et al., 1997; Saravanan & Davidson, 2010; Cheung et al., 2004). Sudden cardiac death is the most common form of death in dialysis patients and it is has been argued that early referral of patients with co-morbid conditions would lead to planned transition from stage 4 chronic kidney disease to end-stage renal disease, perhaps reducing morbidity and mortality in the first year of dialysis treatment (Collins et al., 2009; Herzog et al., 2008). In addition, we note a peak at 60 days since initiation of dialysis which could potentially be explained by the tendency for patients to change dialysis modality within the first 90 days of dialysis treatment (Dalrymple et al., 2010).

Table 5.

Estimates of βts = (β0, βalbumin, βepo use, βserum creatinine, βPHx HF, βPHx arrhythmia, βPHx COPD, βPHx cancer, βage, βhosp) with corresponding SE estimates from the perturbation-resampling method (SE) when t0 = 6 months and τ = 1 year in a dataset of end-stage renal disease patients. Values of ts were chosen as the 20th, 40th, 60th, and 80th percentiles of TInline graphict0. Note that PHx = “prior history of”, HF = heat failure, COPD = chronic obstructive pulmonary disease, epo = erythropoietin, hosp = number of hospitalizations before 6 months.

ts t0.20 t0.40 t0.60 t0.80

Estimate SE Estimate SE Estimate SE Estimate SE

β0 −2.17 0.99 −2.22 0.81 −2.86 0.78 −3.14 0.87
βalbumin −0.27 0.13 −0.29 0.10 −0.22 0.10 −0.15 0.10
βepo use −0.21 0.15 −0.07 0.12 −0.07 0.12 −0.12 0.13
βserum creatinine −0.04 0.03 −0.03 0.02 −0.02 0.02 −0.01 0.02
βPHx HF 0.13 0.15 0.11 0.12 0.11 0.12 0.06 0.12
βPHx arrhythmia 0.18 0.20 0.34 0.17 0.46 0.17 0.55 0.18
βPHx COPD 0.46 0.22 0.32 0.17 0.27 0.16 0.30 0.17
βPHx cancer 0.15 0.23 0.04 0.19 0.10 0.18 0.21 0.19
βage 0.03 0.01 0.03 0.01 0.03 0.01 0.03 0.01
βhosp 0.20 0.06 0.24 0.05 0.27 0.06 0.27 0.06

Figure 2.

Figure 2

For t0 = 6 months (183 days) and τ = 1 year, estimates of t0,τ,ts,z = (TInline graphict0 + τ | TInline graphic > t0, TInline graphic = ts, Z = z) over time for two patients with TInline graphic = tst0 who differ only by history of HF are shown. Both patients were hospitalized one time and have the following baseline covariate values: age = 76, albumin = 3.2, using erythropoietin, serum creatinine = 6.5, no prior history of arrhythmia, chronic obstructive pulmonary disease, or cancer, but one patient has prior history of HF (solid black thin line) while the other has no prior history of HF (solid black thick line). Note that using baseline covariate information only, (TInline graphict0 + τ | TInline graphic > t0, Z = z) is equal to 0.21 and 0.18 for the patient with prior history of HF and for the patient without prior history of heat failure, respectively.

Brier score estimates with corresponding values of the increase in explained variation and AUC estimates are shown in Table 6. Incorporating intermediate hospitalization information leads to an improvement in accuracy with BSt0,τ(1)-BSt0,τ(2) estimated as 0.005 (95% C.I. [0.0020, 0.0072]), IPEV2/1 estimated as 0.026 (95% CI: [0.012, 0.041]), and increase in AUC by 0.051 (95% CI: [0.034, 0.068]). This indicates a significant gain in predictive ability after incorporating short-term outcome information. We note that in many clinical applications, the increment in AUC due to the inclusion of new novel biomarkers is often as little as ~ 0.01 (e.g. Wang et al., 2006).

Table 6.

Apparent and cross-validated (CV) estimates and corresponding SEs from the perturbation-resampling method (SE) for Brier score and AUC using hospitalization time and baseline marker information ( BSt0,τ(2),AUCt0,τ(2)), marker information only ( BSt0,τ(1),AUCt0,τ(1)) and the increase in proportion of explained variation(IPEV) using the ESRD patient dataset with t0 = 6 months and τ = 1 year. All entries are multiplied by 100.

Brier Score

Apparent CV (3-fold) SE

BSt0,τ(2)
16.5 17.0 0.3
BSt0,τ(1)
17.3 17.4 0.3
IPEV2/1 4.4 2.6 3

95% C.I. for BSt0,τ(1)-BSt0,τ(2)=(0.20,0.72)

AUC

Apparent CV (3-fold) SE

AUCt0,τ(2)
67.0 65.2 0.9
AUCt0,τ(1)
60.6 60.1 0.9

95% C.I. for AUCt0,τ(2)-AUCt0,τ(1)=(3.4,6.8)

As previously mentioned, Parast et al. (2011) proposed non-parametric procedures to estimate predicted probability and evaluate the incremental value of including short term event information when there is only a single discrete covariate available. Using these procedures, there was an increase in predictive ability after incorporating hospitalization time in addition to the single discrete marker used for illustration, erythropoietin use. Since a significant advantage of the procedure proposed in this paper is the ability to accommodate one or more continuous covariates including time-varying covariate information collected up to t0, it would be of interest to quantify how much the proposed procedure gained over the Parast et al. (2011) procedure in terms of prediction accuracy. In Parast et al. (2011), a prediction rule which incorporates both intermediate hospitalization information and erythropoietin use resulted in a Brier score estimate of 0.1741 with an SE of 0.0025 and an AUC of 0.6111 with an SE of 0.0080. Compared to these results, our procedure which incorporates the frequency of hospitalizations before t0 and all the aforementioned baseline covariates leads to an improvement in prediction accuracy as demonstrated by a reduction in the Brier score of 0.0046 (95% CI: [−0.0129,0.0038 ]) and an increase in AUC of 0.0410 (95% CI: [0.0169,0.0652]).

6 Discussion

In this paper, we propose a flexible varying-coefficient model to incorporate short term event time information up to a landmark point along with baseline covariates for the prediction of long term survival. In addition, we use robust non-parametric procedures to make inference about the resulting accuracy measures and the difference in the accuracy measures between various prediction rules without requiring the correct specification of the proposed model. One may expect that including the short term event information could significantly improve the prediction accuracy when the short term event is highly correlated with the long term event of interest. Though we assumed independent censoring, the proposed procedure could accommodate marker dependent censoring by replacing Ĝ in ŴInline graphici with Ĝz obtained as the Kaplan Meier estimator of P(C > t | Z = z) for the subgroup with Zi = z. It is important to note that under model mis-specification βts may not necessarily reflect the effect of Z on TInline graphic among the subgroup in Ωt0,ts = {TInline graphic > t0, TInline graphic = ts}. However, it would still reflect the relative importance of Z in terms of prediction for subjects with different values of ts. Especially when the model is approximately true, one may follow the same arguments as given in Eguchi and Copas (2002) that βts could be interpreted as the best linear combination of Z in predicting I(TInline graphict0 + τ) among the subgroup Ωt0,ts.

Though we present cross-validated estimates of the proposed accuracy measures, in an effort to reduce bias from potential over-fitting, it is often appealing to validate the performance of a risk prediction model in a population independent of the study population. We additionally acquired patient information from another region of the USRDS, specifically New York state and assessed the accuracy of our proposed risk score in this dataset. All accuracy measures were estimated assuming that βts, βt0, and φt0 were fixed and known i.e. equal to β̂ts, β̂t0, and φ̂t0, estimated previously in the first region of New England states. The accuracy measures from this separate validation set are similar in magnitude to those of the cross-validated accuracy measures in the original dataset and the proposed procedure performs well in both regions. For example, after incorporating hospitalization information, the AUC increased by 0.034 (95% CI: 0.026, 0.043) indicating a gain in predictive ability.

When a single discrete marker is available for prediction, one may consider a non-parametric approach as in Parast et al. (2011). When there are few markers available, one may extend this method using multi-dimensional smoothing. However, a large sample size would be required for even two-dimensional smoothing in the presence of censoring (Dabrowska, 1987). This lack of power would be even more pronounced here due to the reduced effective sample size as the analysis is restricted to patients who survive to t0. While more parametric methods can certainly be considered, risk estimates from some model-based procedures may have poor accuracy if the model assumptions fail to hold. The proposed varying coefficient landmark prediction modeling framework also enables use to incorporate longitudinally measured covariates easily. This differs from the standard Cox model with time-varying covariates in that we do not use covariate information at time t to infer about hazard rate at t, but allow for a prediction of future events occurring long after t. We also note that our proposed method holds for any choice of t0 and τ. In practice, one may consider the proposed estimator across a range of t0 and τ as a basis for selecting an appropriate time point such that the prediction model is most accurate.

Throughout this paper we have assumed that the covariates and short term event time are measured without error. In practice, this may not be a valid assumption and if it is violated, this procedure may lead to biased estimates for the regression parameters. One may consider extensions to this procedure using measurement error models proposed in the literature (Carroll et al., 2006) to handle such situations. However, the added complexity and issues resulting from model mis-specification should be carefully considered. On the other hand, our proposed model can also be viewed as a working model aimed to approximate the underlying data generating process and to obtain the best prediction rule with the given marker values potentially measured with error. Future research on accounting for measurement error to improve prediction performance is warranted.

All analyses were performed in R. Code for implementing the proposed procedures is available upon request.

Acknowledgments

The data reported here have been supplied by the United States Renal Data System (USRDS). The interpretation and reporting of these data are the responsibility of the author(s) and in no way should be seen as an official policy or interpretation of the U.S. government. The authors are grateful to the Editor, Associate Editor, and the reviewers for their insightful comments on the article. Support for this research was provided by National Institutes of Health grants T32GM074897 (Parast), R01AI052817 (Parast), R01-GM079330 (Cai) and U54LM008748 (Cai).

Contributor Information

Layla Parast, Email: lparast@post.harvard.edu, Department of Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115, Phone: 512-731-9919.

Su-Chun Cheng, Email: scheng@jimmy.harvard.edu, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115.

Tianxi Cai, Email: tcai@hsph.harvard.edu, Department of Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115.

References

  1. Barrett BJ, Parfrey PS, Morgan J, Barr P, Fine A, Goldstein MB, Handa SP, Jindal KK, Kjellstrand CM, Levin A, Mandin H, Muirhead N, Richardson RMA. Prediction of early death in end-stage renal disease patients starting dialysis. American Journal of Kidney Diseases. 1997;29:214–222. doi: 10.1016/s0272-6386(97)90032-9. [DOI] [PubMed] [Google Scholar]
  2. Bauer KR, CTRM, Brown M, Cress RD, Parise CA, Caggiano V. Descriptive analysis of estrogen receptor (ER)-negative, progesterone receptor (PR)-negative, and HER2-negative invasive breast cancer, the so-called triple-negative phenotype: A population-based study from the California cancer registry. Cancer. 2007;109:1721–1728. doi: 10.1002/cncr.22618. [DOI] [PubMed] [Google Scholar]
  3. Bickel PJ, Rosenblatt M. On some global measures of the deviations of density function estimates. The Annals of Statistics. 1973;1:1071–1095. [Google Scholar]
  4. Brier GW. Verification of forecasts expressed in terms of probability. Monthly Weather Review. 1950;78:1–3. [Google Scholar]
  5. Cai T, Tian L, Uno H, Solomon SD, Wei L. Calibrating parametric subject-specific risk estimation. Biometrika. 2010;97:389–404. doi: 10.1093/biomet/asq012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cai T, Tian L, Wei LJ. Semiparametric Box-Cox power transformation models for censored survival observations. Biometrika. 2005;92:619–632. [Google Scholar]
  7. Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu C. Measurement Error in Nonlinear Models: A Modern Perspective. London: Chapman and Hall; 2006. [Google Scholar]
  8. Cheung AK, Sarnak MJ, Yan G, Berkoben M, Heyka R, Kaufman A, Lewis J, Rocco M, Toto R, Windus D, et al. Cardiac diseases in maintenance hemodialysis patients: results of the HEMO Study. Kidney International. 2004;65:2380–2389. doi: 10.1111/j.1523-1755.2004.00657.x. [DOI] [PubMed] [Google Scholar]
  9. Collins AJ, Foley RN, Gilbertson DT, Chen SC. The state of chronic kidney disease, ESRD, and morbidity and mortality in the first year of dialysis. Clinical Journal of the American Society of Nephrology. 2009;4:S5. doi: 10.2215/CJN.05980809. [DOI] [PubMed] [Google Scholar]
  10. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–935. doi: 10.1161/CIRCULATIONAHA.106.672402. [DOI] [PubMed] [Google Scholar]
  11. Cortese G, Andersen PK. Competing risks and time-dependent covariates. Biometrical Journal. 2010;52:138–158. doi: 10.1002/bimj.200900076. [DOI] [PubMed] [Google Scholar]
  12. Dabrowska DM. Non-parametric regression with censored survival time data. Scandinavian Journal of Statistics. 1987;14:181–197. [Google Scholar]
  13. Dabrowska DM. Uniform consistency of the kernel conditional Kaplan-Meier estimate. The Annals of Statistics. 1989;17:1157–1167. [Google Scholar]
  14. Dalrymple LS, Johansen KL, Chertow GM, Cheng SC, Grimes B, Gold EB, Kaysen GA. Infection-related hospitalizations in older patients with esrd. American Journal of Kidney Diseases. 2010;56:522–530. doi: 10.1053/j.ajkd.2010.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Datta S, Satten GA, Datta S. Nonparametric Estimation for the Three-Stage Irreversible Illness–Death Model. Biometrics. 2000;56:841–847. doi: 10.1111/j.0006-341x.2000.00841.x. [DOI] [PubMed] [Google Scholar]
  16. Fan J, Zhang W. Statistical estimation in varying coefficient models. Annals of Statistics. 1999;27:1491–1518. [Google Scholar]
  17. Fan J, Zhang W. Statistical methods with varying coefficient models. Statistics and its Interface. 2008;1:179. doi: 10.4310/sii.2008.v1.n1.a15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fine J, Jiang H, Chappell R. On semi-competing risks data. Biometrika. 2001;88:907–919. [Google Scholar]
  19. Foley R, Culleton B, Parfrey P, Harriett J, Kent G, Murray D, Barre PE. Cardiac disease in diabetic end-stage renal disease. Diabetologia. 1997;40:1307–1312. doi: 10.1007/s001250050825. [DOI] [PubMed] [Google Scholar]
  20. Folsom AR, Chambless LE, Ballantyne CM, Coresh J, Heiss G, Wu KK, Boerwinkle E, Mosley TH, Jr, Sorlie P, Diao G, et al. An assessment of incremental coronary risk prediction using C-reactive protein and other novel risk markers: the atherosclerosis risk in communities study. Archives of Internal Medicine. 2006;166:1368. doi: 10.1001/archinte.166.13.1368. [DOI] [PubMed] [Google Scholar]
  21. Gerds TA, Cai T, Schumacher M. The performance of risk prediction models. Biometrical Journal. 2008;50:457–479. doi: 10.1002/bimj.200810443. [DOI] [PubMed] [Google Scholar]
  22. Gerds TA, Schumacher M. Consistent estimation of the expected Brier score in general survival models with right-censored event times. Biometrical Journal. 2006;48:1029–1040. doi: 10.1002/bimj.200610301. [DOI] [PubMed] [Google Scholar]
  23. Hastie T, Tibshirani R. Varying-coefficient models. Journal of the Royal Statistical Society Series B (Methodological) 1993;55:757–796. [Google Scholar]
  24. Hatteville L, Mahe C, Hill C. Prediction of the long-term survival in breast cancer patients according to the present oncological status. Statistics in Medicine. 2002;21:2345–2354. doi: 10.1002/sim.1046. [DOI] [PubMed] [Google Scholar]
  25. Heagerty PJ, Zheng Y. Survival model predictive accuracy and roc curves. Biometrics. 2005;61:92–105. doi: 10.1111/j.0006-341X.2005.030814.x. [DOI] [PubMed] [Google Scholar]
  26. Herzog CA, Mangrum JM, Passman R. Non-coronary heart disease in dialysis patients: Sudden cardiac death and dialysis patients. Seminars in Dialysis. 2008;21:300–307. doi: 10.1111/j.1525-139X.2008.00455.x. [DOI] [PubMed] [Google Scholar]
  27. Hirschtick RE, Glassroth J, Jordan MC, Wilcosky TC, Wallace JM, Kvale PA, Markowitz N, Rosen MJ, Mangura BT, Hopewell PC, et al. Bacterial pneumonia in persons infected with the human immunodeficiency virus. New England Journal of Medicine. 1995;333:845. doi: 10.1056/NEJM199509283331305. [DOI] [PubMed] [Google Scholar]
  28. Jiang H, Fine JP, Chappell R. Semiparametric analysis of survival data with left truncation and dependent right censoring. Biometrics. 2005;61:567–575. doi: 10.1111/j.1541-0420.2005.00335.x. [DOI] [PubMed] [Google Scholar]
  29. Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. New York: John Wiley & Sons; 2002. [Google Scholar]
  30. Kay R. A Markov model for analysing cancer markers and disease states in survival studies. Biometrics. 1986;42:855–865. [PubMed] [Google Scholar]
  31. Kerr KF, Pepe MS. Joint modeling, covariate adjustment, and interaction: contrasting notions in risk prediction models and risk prediction performance. Epidemiology. 2011;22:805. doi: 10.1097/EDE.0b013e31823035fb. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Klein JP, Keiding N, Copelan EA. Plotting summary predictions in multistate survival models: Probabilities of relapse and death in remission for bone marrow transplantation patients. Statistics in Medicine. 1994;12:2315–2332. doi: 10.1002/sim.4780122408. [DOI] [PubMed] [Google Scholar]
  33. Lawless JF, Yuan Y. Estimation of prediction error for survival models. Statistics in Medicine. 2010;29:262–274. doi: 10.1002/sim.3758. [DOI] [PubMed] [Google Scholar]
  34. Lee SJ, Klein JP, Barrett AJ, Ringden O, Antin JH, Cahn JY, Carabasi MH, Gale RP, Giralt S, Hale GA, et al. Severity of chronic graft-versus-host disease: association with treatment-related mortality and relapse. Blood. 2002;100:406–414. doi: 10.1182/blood.v100.2.406. [DOI] [PubMed] [Google Scholar]
  35. Murphy ST, Parfrey PS. The impact of anemia correction on cardiovascular disease in end-stage renal disease. Seminars in Nephrology. 2000;20:350. [PubMed] [Google Scholar]
  36. Osborne CK. Tamoxifen in the treatment of breast cancer. New England Journal of Medicine. 1998;339:1609. doi: 10.1056/NEJM199811263392207. [DOI] [PubMed] [Google Scholar]
  37. Parast L, Cai T. Landmark prediction of survival. Harvard University Biostatistics Working Paper Series. 2010:123. http://www.bepress.com/harvardbiostat/paper123/
  38. Parast L, Cheng SC, Cai T. Incorporating short-term outcome information to predict long-term survival with discrete markers. Biometrical Journal. 2011;53:294–307. doi: 10.1002/bimj.201000150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Park B, Kim W, Ruppert D, Jones M, Signorini D, Kohn R. Simple transformation techniques for improved non-parametric regression. Scandinavian Journal of Statistics. 1997;24:145–163. [Google Scholar]
  40. Park Y, Wei LJ. Estimating subject-specific survival functions under the accelerated failure time model. Biometrika. 2003;90:717–23. [Google Scholar]
  41. Pencina MJ, D’Agostino RB, Sr, D’Agostino RB, Jr, Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Statistics in Medicine. 2008;27:157–172. doi: 10.1002/sim.2929. [DOI] [PubMed] [Google Scholar]
  42. Pollard D. Empirical Processes: Theory and Applications. Hayward, CA: Institute of Mathematical Statistics; 1990. [Google Scholar]
  43. Putter H, van der Hage J, de Bock GH, Elgalta R, van de Velde CJH. Estimation and Prediction in a Multi-State Model for Breast Cancer. Biometrical Journal. 2006;48:366–380. doi: 10.1002/bimj.200510218. [DOI] [PubMed] [Google Scholar]
  44. Robins JM, Ritov Y. Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. Statistics in Medicine. 1997;16:285–319. doi: 10.1002/(sici)1097-0258(19970215)16:3<285::aid-sim535>3.0.co;2-#. [DOI] [PubMed] [Google Scholar]
  45. Robins JM, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. Aids Epidemiology, Methodological Issues. 1992:297–331. [Google Scholar]
  46. Rotnitzky A, Robins J. Inverse probability weighted estimation in survival analysis. Encyclopedia of Biostatistics. 2005:2619–2625. [Google Scholar]
  47. Sands JJ, Etheredge GD, Shankar A, Graff J, Loeper J, McKendry M, Farrell R. Predicting hospitalization and mortality in end-stage renal disease (ESRD) patients using an Index of Coexisting Disease (ICED)-based risk stratification model. Disease Management. 2006;9:224–235. doi: 10.1089/dis.2006.9.224. [DOI] [PubMed] [Google Scholar]
  48. Saravanan P, Davidson NC. Risk Assessment for Sudden Cardiac Death in Dialysis Patients. Circulation: Arrhythmia and Electrophysiology. 2010;3:553. doi: 10.1161/CIRCEP.110.937888. [DOI] [PubMed] [Google Scholar]
  49. Siannis F, Farewell V, Head J. A multi-state model for joint modeling of terminal and non-terminal events with application to Whitehall II. Statistics in Medicine. 2007;26:426–442. doi: 10.1002/sim.2342. [DOI] [PubMed] [Google Scholar]
  50. Swets JA, Pickett RM. Evaluation of Diagnostic Systems: Methods from Signal Detection Theory. Academy press; New York: 1982. [Google Scholar]
  51. Tian L, Cai T, Goetghebeur E, Wei L. Model evaluation based on the sampling distribution of estimated absolute prediction error. Biometrika. 2007;94:297–311. [Google Scholar]
  52. Uno H, Cai T, Tian L, Wei LJ. Evaluating prediction rules for t-year survivors with censored regression models. Journal of the American Statistical Association. 2007;102:527–37. [Google Scholar]
  53. USRDS. US Renal Data System 2007 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases; 2007. [Google Scholar]
  54. USRDS. US Renal Data System 2010 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases; 2010. [Google Scholar]
  55. Van Houwelingen HC, Putter H. Dynamic predicting by landmarking as an alternative for multi-state modeling: an application to acute lymphoid leukemia data. Lifetime Data Analysis. 2008;14:447–463. doi: 10.1007/s10985-008-9099-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Wand M, Marron J, Ruppert D. Transformations in density estimation. Journal of the American Statistical Association. 1991;86:343–353. [Google Scholar]
  57. Wang TJ, Gona P, Larson MG, Tofler GH, Levy D, Newton-Cheh C, Jacques PF, Rifai N, Selhub J, Robins SJ, et al. Multiple biomarkers for the prediction of first major cardiovascular events and death. New England Journal of Medicine. 2006;355:2631–2639. doi: 10.1056/NEJMoa055373. [DOI] [PubMed] [Google Scholar]
  58. Wilson PWF, D’Agostino RB, Levy D, Belanger AM, Silbershatz H, Kannel WB. Prediction of coronary heart disease using risk factor categories. Circulation. 1998;97:1837–1847. doi: 10.1161/01.cir.97.18.1837. [DOI] [PubMed] [Google Scholar]
  59. Zheng Y, Cai T, Pepe MS, Levy WC. Time-dependent predictive values of prognostic biomarkers with failure time outcome. Journal of the American Statistical Association. 2008;103:362–368. doi: 10.1198/016214507000001481. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES