Introduction
The authors are to be congratulated for a very interesting paper. They are also be thanked for recommending reading chapter 8 of our book on dynamic prediction (van Houwelingen and Putter 2012) The data in that chapter are extensively analyzed in the unpublished PhD thesis of Mark de Bruijne. One chapter is published as de Bruijne et al. (2001a). Unfortunately the chapter using the revival process never got published. A preprint (de Bruijne and van Houwelingen 2001b) is available.
I am pleased by the introduction of the concept of “stale measurement”. It is related to the concept of “ageing covariate” in section 5.3 of van Houwelingen and Putter (2012). In de Bruijne et al. (2001a) the concept “TEL(t) = time elapsed since last observation” is introduced as a tool to adjust for the staleness of observations. It is a nice feature of the revival approach that TEL(t) is inherently taken into account.
My main interest is the predictive use of the revival process. My comments arise from this preoccupation with prognosis. My plea for robustness in van Houwelingen (2014) arises from the need to validate prognostic models in new data or by cross-validation. Robustness is needed to ensure that the models are validation-proof. In the paper, the robustness of the revival model is explored in the supplementary material, but no attempt is made to check the robustness of the prediction model. Robustness of the implied prediction model is also an important issue in Rizopoulos et al. (2014) and Rizopoulos et al. (2017).
In this commentary I will focus on four issues: visualization of the data, more insight in the information carried by the revival process, validation of the implied prediction model and an alternative for the assumption. I will use the CSL1 data and the standard model of section 6—exponential marginal survival with and revival model based on the uncensored observations—to clarify my comments.
Visualization
The two graphs in Fig. 1 are helping to get more insight in the data structure. The left panel shows the Kaplan–Meier estimates for the censoring function, the survival function with its exponential fit and the fraction still at risk. The high rate of early censoring is a bit unexpected, but its discussion is beyond the scope of this commentary. The interesting point for me is that appears to be the observation limit in this data. Only 43 patients carry information about what happens after and most of them (36) are censored. Anything said about what happens after is very speculative.
Fig. 1.
Survival etc
The right panel is an attempt to visualize how long patients are still followed up for survival after the last measurement. For each patient the difference between observed survival/censoring time and the time of the last observation can be found by the horizontal distance between the isolated dots and the dots on the line. One might wonder what happened to the patients with a wide gap between the last measurement and the survival time, but that issue is also beyond the scope of this commentary.
More insight in the information carried by the revival model
Figure 2 shows the expected value of the revival process for , presented in follow-up time t. The solid graphs show the curves for “Null Treatment” which can be seen as the expected value corrected for the additive treatment effect. The steep decrease near seems promising for the use of the revival process in dynamic prediction of survival. However, there is substantial variation in the data. The total variance computed from the three variance components in the model is 625, giving a standard deviation . The tolerance regions are shown by the dotted lines. The large variation suggest that it would not be easy to infer the future T from the data available at time t.
Fig. 2.
Insight in revival model
If we ignore the uncertainty in the regression parameters we have a model f(observations|T). Given the observation history, inference on T can be made in a very classical way by computing the log-likelihood of the data. In the main paper such a log-likelihood is shown in the left panel of Figure 5 for one specific case. To get more insight a kind of landmark analysis was carried out, in which all 278 patients still at risk at are considered. The number of preceding observations varied from 2 to 9 with mode=4 .For each individual the log-likelihood ll(T) of the standard model is computed for survival time . For each individual the location of the maximum is obtained together with a quasi . If this and is not on the boundary of the interval, the confidence region for T contains the whole interval [0,9]. A summary of the results is given in Table 1.
Table 1.
Results of a classical maximum likelihood analysis at
| Total | ||||
|---|---|---|---|---|
| 0–1 | 61 | 43 | 94 | 198 |
| 1–2 | 11 | 4 | 39 | 54 |
| 2–3 | 4 | 3 | 8 | 15 |
| 3–4 | 0 | 0 | 4 | 4 |
| 4+ | 2 | 2 | 3 | 7 |
| Total | 78 | 52 | 148 | 278 |
Figure 2 helps to understand what is going on. If the last observation is quite low, the best fitting curve would be obtained by . However, the patient is still alive at , which moves the to 2. If some observations are quite high, that would be an indication for survival beyond and will end up at the right boundary. The situation is more subtle, because of the random patient effect, but it is clear that it will not be easy to predict T at .
Prediction model
To investigate the behavior of the implied prediction model, the landmarking approach is used. That means that for a fixed time-point in the follow-up we consider all patients still at risk, obtain the predictive distribution obtained from standard model using the observations available at and compare that with the actual survival data. For the sake of robustness a horizon is fixed and it is investigated how well the (conditional) survival up to can be predicted. Table 2 shows the results for and , n = the number at risk.
Table 2.
Calibration of the standard model
| n | Dead | c | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| se | Mean | sd | se | se | Z | ||||||
| 1 | 332 | 70 | 0.79 | 0.09 | 0.32 | 0.07 | 0.90 | 0.60 | 1.03 | 0.53 | 1.96 |
| 2 | 278 | 61 | 0.83 | 0.11 | 0.33 | 0.07 | 0.37 | 0.71 | 2.13 | 0.65 | 3.30 |
| 3 | 229 | 46 | 0.75 | 0.11 | 0.33 | 0.06 | 2.53 | 0.89 | 4.27 | 0.85 | 5.03 |
| 4 | 188 | 36 | 0.69 | 0.11 | 0.32 | 0.06 | 1.52 | 0.96 | 3.38 | 0.91 | 3.73 |
| 5 | 166 | 37 | 0.86 | 0.14 | 0.33 | 0.08 | 0.46 | 0.68 | 2.22 | 0.64 | 3.44 |
| 6 | 136 | 30 | 0.94 | 0.17 | 0.33 | 0.06 | 0.96 | 1.13 | 2.58 | 1.05 | 2.45 |
| 7 | 96 | 23 | 1.04 | 0.22 | 0.33 | 0.06 | 0.64 | 1.24 | 2.19 | 1.14 | 1.92 |
First a simple calibration of the marginal survival is obtained through the model applied on the patients at risk at and administratively censored at . The table shows the estimate and it standard error. The apparent need for this correction can already be seen from Fig. 1.
Next we consider the cumulative hazard from up to as obtained from the standard model for each patient. This can be seen as a summary of the prognosis.The modeled conditional survival is
The standard deviation of gives insight into the variation in prediction between the patients in the landmark data set. The performance of the model can be checked through the exponential calibration model
The calibration of the conditional survival is perfect if and . My cautious conclusion is that the standard model is not well calibrated, but the predicted cumulative hazard might be a useful tool in landmark type models because of its significance as shown in the last column. The Weibull model might be better calibrated. but I did not check that.
Alternative for the assumption
Personally, I am very hesitant about any modeling beyond the observation limit =9. In situations where patients might be “cured”, models with do make sense. The advantage of the semi-parametric Cox model is that does not make any statement about what happens after the last observation. My suggestion for an alternative approach is to define an observation limit , to censor all patients at this limit, to make a revival model for using the uncensored data and for using all patients that survive up to and to estimate the marginal survival by Kaplan–Meier. This approach does not need any imputation. Moreover, calibration can now be based on the Cox model as well.
The coefficients in the revival model are shown in Table 3.
Table 3.
Regression coefficients for the standard model and the alternative
| Constant | Control | Prednisone | T | s | ||
|---|---|---|---|---|---|---|
| All uncensored | 63.47 | 2.49 | 13.56 | 1.74 | 2.11 | 4.66 |
| Uncensored | 63.35 | 2.28 | 13.40 | 1.56 | 1.80 | 4.58 |
| All | 75.15 | 6.55 | 18.97 | 1.47 | 1.84 | 0.176 |
Table 4 shows the findings of the landmark analysis for the alternative approach. The first observation is that the marginal Kaplan–Meier does not need any calibration because it is model free. The deviations from are due to the discrete nature of the Kaplan–Meier. The second observation is that the “prediction tool” shows more variation within and between landmark sets than in Table 2. Next, we see that the calibration through the exponential model is much better: the estimates are much closer to one than in the standard model. Finally, we see that the calibration Cox model gives virtually the same as the exponential with standard errors that are marginally smaller.
Table 4.
Calibration of the alternative model
| c | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| se | Mean | sd | se | se | se | Z | |||||
| 1 | 1.01 | 0.12 | 0.26 | 0.10 | 1.16 | 0.45 | 0.65 | 0.32 | 0.66 | 0.32 | 2.09 |
| 2 | 1.00 | 0.13 | 0.27 | 0.11 | 0.59 | 0.48 | 1.07 | 0.36 | 1.06 | 0.35 | 3.07 |
| 3 | 1.00 | 0.15 | 0.25 | 0.09 | 0.80 | 0.59 | 2.13 | 0.46 | 2.09 | 0.38 | 5.57 |
| 4 | 1.02 | 0.17 | 0.22 | 0.09 | 0.71 | 0.65 | 2.00 | 0.47 | 1.97 | 0.38 | 5.23 |
| 5 | 0.99 | 0.16 | 0.30 | 0.15 | 0.66 | 0.45 | 1.06 | 0.37 | 1.07 | 0.34 | 3.16 |
| 6 | 0.97 | 0.18 | 0.31 | 0.16 | 1.28 | 0.49 | 0.48 | 0.38 | 0.48 | 0.37 | 1.31 |
| 7 | 0.94 | 0.20 | 0.42 | 0.24 | 0.95 | 0.35 | 0.91 | 0.37 | 0.94 | 0.33 | 2.84 |
Conclusion
The revival approach can be a very useful tool for taking account of the observation time in prediction models. The comparison of different approaches to obtain prediction models using the calibration in the full data set as shown above might be optimistically biased. To avoid this optimism bias some form of cross-validation is needed, but that is beyond the scope of this commentary.
References
- de Bruijne MHJ, le Cessie S, van Houwelingen HC. On the use of Cox regression in the presence of an irregularly observed time-dependent covariate. Stat Med. 2001;20:3817–3829. doi: 10.1002/sim.1083. [DOI] [PubMed] [Google Scholar]
- de Bruijne MHJ, van Houwelingen HC (2001b) A mixture model approach to survival prediction with time-dependent covariates. Unpublished paper. For a copy, contact jcvanhouwelingen@lumc.nl
- Rizopoulos D, Hatfield LA, Carlin BP, Takkenberg JJM. Combining dynamic predictions from joint models for longitudinal and time-to-event data using Bayesian model averaging. J Am Stat Assoc. 2014;109:1385–1397. doi: 10.1080/01621459.2014.931236. [DOI] [Google Scholar]
- Rizopoulos D, Molenberghs G, Lesaffre EMEH. Dynamic predictions with time-dependent covariates in survival analysis using joint modeling and landmarking. Biom J. 2017;59:1261–1276. doi: 10.1002/bimj.201600238. [DOI] [PubMed] [Google Scholar]
- van Houwelingen HC. From model building to validation and back: a plea for robustness. Stat Med. 2014;33:5223–5238. doi: 10.1002/sim.6282. [DOI] [PubMed] [Google Scholar]
- van Houwelingen HC, Putter H. Dynamic prediction in clinical survival analysis. Monographs on statistics and applied probability 123. Boca Raton: CRC Press; 2012. [Google Scholar]


