Abstract
We provide a nonparametric estimate of τ-restricted mean survival using follow-up information beyond τwhen appropriate to improve precision. The variance accounts for correlation between follow-up windows. Both asymptotic calculations and simulation studies recommend follow-up intervals spaced approximately τ/2 apart.
Keywords: Correlated times-to-events, Follow-up intervals, Residual life, Restricted mean survival
1 Introduction
Yearly progression predictions are commonly reported for clinical longitudinal data. For example, Raghu et al. (2011) report that mild to moderate idiopathic pulmonary fibrosis (IPF) patients tend to lose 0.2 liters in forced vital capacity lung function per year. This is a valuable summary statistic that has not been sufficiently explored for censored time to event data. It would be clinically useful to be able to report that IPF patients followed for 10 years were observed to live 91% of each year, on average, given they were alive at the start of the year. This concise estimate describes how the disease will affect patients in the short term and indicates stability of the disease where appropriate. Health economists have noted that patient’s value life-years closer to the present more than those in the future, Gyrd-Hansen and Sogaard (1998).
The restricted mean residual life function (RMRL) is the expected days of life per year for those surviving at the beginning of the year and may be used to view trends in these summary statistics over time. Ghorai et al. (1982) proposed an estimator based on integrated conditional Kaplan-Meier estimates (Ghorai and Rejto, 1987) and Na and Kim (1999) proposed a smooth-spline estimator for this quantity, among others. Yu (2003) developed confidence bands for restricted mean residual life functions estimated via Nelson-Aalen estimates, Cox model hazard estimates and inverse weighted hazard estimates, calling them expected life prosper functions (ELPF). Stability of these functions suggests an opportunity for producing an overall summary statistic that is more precise.
In Section 2 we review RMRL estimation and confidence band construction. Section 3 describes the nonparametric τ-restricted mean survival estimator that combines information across different τ-length intervals of follow-up. The proposed variance described in Section 3 is based on a linearization of random components of the estimator, similar to the approach recommended by Woodruff (1971) and more recently Williams (1995). In Section 4 we consider how to choose the number of follow-up intervals useful for obtaining efficiency gains. A simulation study that assesses the performance of the proposed estimate and its variance against currently available competitors is presented in Section 5. Examples of the proposed analysis approach pertaining to IPF patients and diabetic retinopathy patients are given in Section 6. A discussion follows in Section 7.
2 τ-restricted mean residual life
2.1 Notation
For each of n patients we define observed event time, Xi = min(Ti,Ci), with failure indicator δi = I(Ti ≤ Ci), based on true failure time, Ti, and censoring time, Ci, i = 1, … , n. Calendar time, t, is measured from the start of the study. We define the residual life observed at t as Xi(t) = (Xi − t)I(Xi ≥ t) with failure indicator variable δi(t) = δiI(Xi ≥ t).
For a τ-length interval starting at calendar time, t, the τ-restricted mean residual lifetime is . Here, u denotes an internal time scale measured from calendar time t. We examine values of μ(t, τ) measured at different calendar times t ∈ {t1, …, tb}, with t1 = 0 in all that follows.
Counting process notation includes the two timescales described, t and u. For individual i, the event counting process is Ni(t, u) = I{Xi(t) ≤ u, δi(t) = 1} with at-risk process Yi(t, u) = I{Xi(t) ≥ u}. At t, the total number of events occurring no later than u is and the number at risk at u is . We require notation combining counting process quantities across calendar times, {t1, … ,tb}. For individual i and internal time u, . Combining information across follow-up intervals and patients gives us , total number of events occurring no later than u, and , total number at risk at u.
2.2 Estimation of τ-restricted mean residual life
The τ-restricted mean residual lifetime function, μ(t, τ), tracks subsequent expected lifetime during an interval of length τ given the patient has survived up to time t. Henceforth we call μ(t, τ) the RMRL function. Using notation from the previous section, a consistent nonparametric estimator of the RMRL function is
2.3 Confidence Bands
We slightly modify work from Yu (2003) for RMRL confidence band calculations applied to times {t1, … ,tb}. Plotting the RMRL function with its corresponding bands is useful in suggesting whether follow-up windows may be combined for estimation or not.
Technical development of confidence bands for {μ(t1, τ), … , μ(tb, τ)} is based on the Gaussian process and the covariance of this process at times and . The estimated covariance matrix , with estimates of cov as the (j1, j2) elements, is
Following Lin et al. (1994), the asymptotic distribution of {B(t1, τ), … , B(tb, τ)} is approximated by generating a large number of mean zero multivariate Normal samples using the observed covariance structure, . Using these samples, we calculate qα satisfying . Level 100(1 − α)% confidence band values surrounding μ(t, τ) become , calculated at times t = {t1, … ,tb}. In practice, these confidence bands perform well provided that there are at least 25 event times following tb (Yu, 2003).
3 Overall τ-restricted mean survival
The RMRL plots and associated confidence bands are a useful diagnostic tool to assist in deciding whether disease progression is stable, i.e. the RMRL is the same at each t1, … tb, or not. When the RMRL and its corresponding confidence bands do not indicate a strong trend, we develop a more efficient estimate of the expected number of days lived in the next τyears by pooling appropriate follow-up periods beginning at times t ∈ {t1, …, tb}.
3.1 Estimation
The proposed estimate of the overall τ-restricted mean survival is
| (1) |
Let . In Appendix A of Supplementary Materials, we show that (1) converges in probability to , which is the mean of the mixture distribution created from combining follow-up times across the different intervals. If the overall dataset reflects a single distribution, that is, λ(tk1 , u) = λ(tk2 , u) for 0 ≤ u ≤ τ and k1, k2 ∈ {1, …, b}, then μ*(τ) reduces to μ(t1, τ), the usual restricted mean that is typically estimated using a single follow-up period. We also suspect that in cases where the integrated survival curves within the follow-up windows are equivalent, our estimate is consistent estimator of μ(t1, τ). A rigorous proof backing up this intuition has eluded us, although special cases of distributions with this property have given μ*(τ) = μ(t1, τ).
Variance calculations in the following section acknowledge the potential mixture of hazards that might occur when combining follow-up times across intervals.
3.2 Variance of proposed estimate
The proposed variance estimate is calculated via linearizing components of via Taylor series approximations. Suppose that in the dataset of combined follow-up times we observe M events at internal times where events from the same individual during different follow-up windows are correlated; for convenience, we define . Let . In the following we temporarily submerge arguments of Fj .We define . After rewriting in term of Gm(F0, F1, …, Fm), the variance of becomes
The non-linear terms G0, G1, …, GM may be made more tractable for variance calculations via linearization based on a Taylor series expansion of Gm(F0, F1, …, Fm) about . The variance of is then
| (2a) |
| (2b) |
| (2c) |
| (2d) |
| (2e) |
Terms (2a) and (2c) are nonrandom and therefore do not contribute to the variance. The fourth term (2d) converges to zero in probability, details given in Appendix B of Supplementary Materials. Similarly all higher order terms of the Taylor series linearization (2e) converge to zero in probability and the variance reduces to , the variance of a linear sum of non-linear components F0, F1, …, FM .
Each of the non-linear Fj terms may be made more tractable for variance calculations via further linearization based on a Taylor series expansion of about the expected values of and . The variance of is then
| (3a) |
| (3b) |
| (3c) |
| (3d) |
| (3e) |
Term (3a) is a constant and therefore does not contribute to the variance. Terms (3b) and (3c) simplify to . The fourth term (3d) converges in probability to zero, details given in Appendix B of Supplementary Materials. Similarly, all higher order terms of the Taylor series linearization (3e) converge to zero in probability. Recall that . The expected value of is and the expected value of . The variance then reduces to and
Whereas event times across overlapping follow-up periods are generally not i.i.d., the making empirical variance estimates based on appropriate for this setting. In practice, the empirical variance estimator of is where . The sample estimates of are given by
In understanding design issues discussed in Section 4, it is convenient to have the asymptotic closed form variance, , where
in asymptotically equivalent stochastic integral notation. In Appendix C of Supplementary Materials we show that
4 Practical Issues
Number and spacing of follow-up windows should be chosen to increase precision of . We examine the estimator’s closed form variance σ2 in the special case where Ti ~ Exp(λ). This implies that for all k, λ(tk , u) = λW (u) = λ. The censoring distribution is Ci ~Uniform[A − A*, A], where A is the length of the study and A* is the accrual time. A patient recruited at the start of the study would be followed until time A and a patient recruited at the end of the accrual period would be followed until time A − A*. Standard probability calculations for u, v ∈ (0, τ] give
We consider τ = 1 year. The parameter λ was chosen to give a constant 1-year RMRL of 11 months, A* = 1 year, A = 3 years and n = 100. Figure 1 shows the behavior of σ2/n, the finite sample size variance, for three one-year follow-up windows with t1 = 0, t3 = 12 months and t2 varying between these values. The one-year windows starting at t1 and t3 do not overlap, so the choice of t2 examines if there is an advantage to adding a 3rd follow-up window that overlaps the other two. The plot suggests that an additional 1-year window starting in the middle of t1 = 0 and t3 = 12 months, i.e. at t2 = 6 months, reduces the variance the most.
Figure 1.

Closed form asymptotic variance of for 3 year study with τ=1 year, t1 = 0, t3 = 12 months and varying t2. Dashed line corresponds to variance of estimator constructed using two follow-up windows t1 = 0 and t3 = 12.
Next consider (1) whether additional equally spaced follow-up windows further reduce σ2/n and (2) the extent to which incorporating an additional year’s worth of follow-up information, and corresponding 1-year windows, into construction of reduces its variability. The first entry of Table 1 corresponds to the variance obtained if estimating the standard 1-year restricted mean that doesn’t use information from additional follow-up intervals. For a given tb of 12 or 24 months, increasing the number of follow-up windows improves the precision of the estimator. However, there are diminishing returns from introducing starting times more frequently than at 6-month intervals.
Table 1.
Study of follow-up window choices based on calculated variance and Asymptotic Relative Efficiency (ARE) for the special case discussed in Section 4.
| Number of Windows |
{t1, …. tb} | σ2/n | ARE | |
|---|---|---|---|---|
| tb = 0 | 1 | 0 | 0.071 | 1.00 |
|
| ||||
| 2 | 0, 12 | 0.039 | 1.82 | |
| tb = 12 | 3 | 0, 6, 12 | 0.035 | 2.03 |
| 5 | 0, 3, 6, 9, 12 | 0.035 | 2.03 | |
|
| ||||
| 3 | 0, 12, 24 | 0.030 | 2.37 | |
| tb = 24 | 5 | 0, 6, 12, 18, 24 | 0.026 | 2.73 |
| 9 | 0, 3, 6, 9, 12, 15, 18, 21, 24 | 0.025 | 2.84 | |
It seems clear from exploration of this special case that spacing of t1, …, tb should be at 6month intervals when estimating an overall 1 year restricted mean, with tb = 24. In general, the recommended number of intervals is based on available follow-up time and we propose using intervals starting from less than A − τ.
5 Simulation Study
Simulation experiments were conducted to assess finite sample size performance of . We consider whether augmenting the first observation window improved the estimator and the effect of the number of intervals on the performance of the estimator. The performance of our proposed variance estimate is compared to a variance estimate that assumes independence and the sandwich variance (formulae to calculate these are given in Appendix D of Supplementary Materials).
The simulation experiment design assumes we wish to estimate 12-month restricted mean survival in a 36-month study. We assume that 30% of the sample (n = 100) were recruited at the start of the study and were observed for the full 36 months. The remaining 70% were uniformly accrued over the first 12 months. Ti was simulated from a piecewise Weibull distribution with parameters chosen so that the 12-month restricted mean survival was 11 months at the recommended follow-up times tk ∈ {0, 6, 12, 18, 24}, k = 1, …, 5. The hazard function of the piecewise Weibull distribution is given by , where the parameters are constant within a 6 month interval. The parameters are defined as α = (1.1, 0.9, 1.1, 0.9, 1.1, 0.9) and λ = (1.25, 2.01, 1.05, 2.00, 1.26, 0.10) × 10−2.
Simulation results in Table 2 show that augmenting the data with additional follow-up time improves the precision of the estimator, with ARE ranging from 1.87 to 2.70. The bias is minimal in all cases. As expected, the case with {t1, …, t5} = {0, 6, 12, 18, 24} outperforms other scenarios for, b = 1, 2, 3. All three variance methods perform well when {t1, …, tb} produced disjoint intervals (rows 1 − 3 of Table 2). With overlapping intervals, the proposed variance gives the best coverage; independent and sandwich variances both underestimate the simulation empirical variance.
Table 2.
Study of follow-up windows and performance of variance estimators in 500 Monte Carlo simulations (n = 100) from a Piecewise Weibull Distribution
| {t1, …, tb} | Est | Emp Var |
Independent Variance |
Sandwich Variance |
Proposed Variance |
|||
|---|---|---|---|---|---|---|---|---|
|
|
||||||||
| Est | Coverage 95% CI |
Est | Coverage 95% CI |
Est | Coverage 95% CI |
|||
| 0 | 10.986 | 0.073 | 0.070 | 0.938 | 0.069 | 0.938 | 0.070 | 0.938 |
|
| ||||||||
| 0, 12 | 10.993 | 0.039 | 0.039 | 0.938 | 0.039 | 0.934 | 0.039 | 0.940 |
|
| ||||||||
| 0, 12, 24 | 10.975 | 0.030 | 0.031 | 0.954 | 0.031 | 0.952 | 0.031 | 0.952 |
|
| ||||||||
| 0, 6, 12, 18, 24 | 10.980 | 0.027 | 0.018 | 0.878 | 0.021 | 0.902 | 0.028 | 0.948 |
6 Examples
In a study by Schmidt et al. (2014), which aimed to provide better prognostic information to idiopathic pulmonary fibrosis (IPF) patients, 734 patients were identified through interstitial lung disease databases from three referral centers, the Royal Brompton and Harefield National Health Service Foundation Trust, National Jewish Health and the University of Michigan Health System, from 1981 through 2008. There is currently no effective treatment for IPF, with patients experiencing a steady average decline in lung function per year. A natural question is whether the expected number of days of life in the next year is also stable.
For each patient, calendar time begins at first pulmonary function test at their referral center within the study period. As recommended, one-year intervals with start times every six months are used. The final interval at 9.5 years, chosen to ensure at least 25 risk set deaths remaining, used follow-up through year 10.5. The RMRL in Figure 2(a) fluctuates between 321 and 350 days with more stability in the first 6 years, where there is more available data. The confidence bands suggest that it is still reasonable to report an overall point estimate that the average number of days of life lived in the next year is 333.6 (95% CI: 330.7-336.4). During the first decade of their disease, IPF patients are expected to live 91% of each year given they were alive at the start of the year.
Figure 2.
One-year restricted mean residual life function (solid) evaluated every six months. Associated CB are given (dashed).
The Early Treatment Diabetic Retinopathy Study (ETDRS) (ETDRS Research Group (1991a,b)) enrolled patients with severe diabetic retinopathy in both eyes who were taking Aspirin daily. In addition one eye of each patient was randomly assigned to early photocoagulation and the other to deferral of photocoagulation until a later time when high-risk proliferative retinopathy was detected. We focus on the 583 patients who were randomized to the deferred photocoagulation treatment group. The major endpoint of interest was time to severe vision loss, defined as visual acuity less than 5/200 at two consecutive visits.
One-year intervals with start times every six months are used. The final interval at 3.5 years, chosen to ensure at least 25 risk set deaths remaining, used follow-up through year 4.5. The RMRL in Figure 2(b) suggests a slightly declining trend, reflecting somewhat quicker eyesight deterioration over time despite initiation of therapy once patients became especially high risk. Given the narrow width of the confidence bands (< 10 days), one may argue that it is reasonable to report an overall point estimate of average days of sight per year, 362.2 days (95% CI: 361.3-363.1), which falls within the reported confidence bands through the 3.5 year period considered. In this case, it may also be instructive to include the graphic so that future potential trend may be monitored.
7 Discussion
Health economist literature emphasizes the value patients place on short-term versus long-terms outcomes. This gives strong motivation for developing better estimates of short-term survival from the patient’s point of view. In longitudinal settings, measures taken over time are easy to come by and software estimating average trends over time is widely available. In cases where measures are stable over time, it is common to estimate the average outcome across time as a summary measure. Our method of defining times-to-first event in different follow-up windows over time has a lot in common with these more standard longitudinal data structures, except that (1) our outcomes are subject to censoring and (2) the correlation structure due to potentially overlapping follow-up windows is estimable using counting process notation, without a need for parametric assumptions.
As in longitudinal data analysis, the user has the choice of describing the trend over time (RMRL function, see Section 2.2) or, for stable disease progression with no otherwise discernible time (follow-up window) effect, an overall mean can be given (overall τ-restricted mean survival, see Section 3.1). Our approach changes the way censored survival data are traditionally organized for analysis, better reflecting short-term experiences of patients throughout follow-up that health economists find more attractive. Currently, if one wants to estimate a short-term τ-restricted mean survival, one traditionally uses only the first τyears of follow-up in estimation, ignoring all subsequent follow-up information. From the longitudinal data structure perspective, this wastes available statistical information that could be put to better use.
The methodology in this manuscript is completely nonparametric and is a reasonable starting point for understanding the newly introduced data structure, practical aspects of follow-up window choice, and efficiency gains when describing overall restricted means for outcomes with stable disease progression. It is our belief that this work can form the basis for developing further methodological tools for hypothesis testing and multivariable models of short-term times-to-event.
Supplementary Material
Acknowledgments
The authors would like to thank our University of Michigan collaborators: Kevin Flaherty, Shelley Schmidt, Meilan Han, Gaelen Toews and Fernando Martinez, for the use of their data.This work was supported in part by National Institute of Health grants NIH/NHLBI RC2 HL101740 and NIH/NHLBI K24 HL111316.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Software
Software in the form of R code, together with a sample input data set and complete documentation is available on request from the corresponding author (skmurray@umich.edu).
Conflict of Interest: The authors have declared no conflict of interest.
References
- Ghorai J, Susarla A, Susarla V, Van-Ryzin J. Nonparametric Statistical Inference. Vol. 32. North-Holland; Amsterdam-New York: 1982. Nonparametric estimation of mean residual life time with censored data; pp. 269–291. volume 1 of Colloquia Mathematica Societatis. [Google Scholar]
- Ghorai JK, Rejto L. Estimation of mean residual life with censorded data under the proportional hazard model. Communications in Statistics- Theory and Methods. 1987;16:2097–2114. [Google Scholar]
- Gyrd-Hansen D, Sogaard J. Discounting life years: Whither time preference? Health Economics. 1998;7:121–127. doi: 10.1002/(sici)1099-1050(199803)7:2<121::aid-hec318>3.0.co;2-h. [DOI] [PubMed] [Google Scholar]
- Lin DY, Fleming TR, Wei LJ. Confidence bands for survival curves under the proportional hazards model. Biometrika. 1994;81:73–81. [Google Scholar]
- Na MH, Kim JJ. On inference for mean residual life. Communications in Statistics-Theory and Methods. 1999;28:2917–2833. [Google Scholar]
- Raghu G, Collard HR, Egan JJ, Martinez FJ, Behr J, Brown KK, Colby TV, Cordier JF, Flaherty KR, Lasky JA, Lynch DA, Ryu JH, Swigris JJ, Wells AU, Ancochea J, Bouros D, Carvalho C, Costabel U, Ebina M, Hansell DM, Johkoh T, Kim DS, King TE, Kondoh Y, Myers J, Muüller NL, Nicholson AG, Richeldi L, Selman M, Dudden RF, Griss BS, Protzko SL, Schuünemann HJ. An official ats/ers/jrs/alat statement: Idiopathic pulmonary fibrosis: Evidence-based guidelines for diagnosis and management. American Journal of Respiratory and Critical Care Medicine. 2011;183:788–823. doi: 10.1164/rccm.2009-040GL. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt SL, Tayob N, Han MK, Zappala C, Kervitsky D, Murray S, Wells AU, Brown KK, Martinez FJ, Flaherty KR. Predicting pulmonary fibrosis disease course from past trends in pulmonary function. Chest. 2014;145:579–585. doi: 10.1378/chest.13-0844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Williams RL. Product-limit survival funtions with correlated survival times. Lifetime Data Analysis. 1995;1:171–186. doi: 10.1007/BF00985768. [DOI] [PubMed] [Google Scholar]
- Woodruff D. A simple method for approximating the variance of a complicated estimate. Journal of the American Statistical Association. 1971;66:411–414. [Google Scholar]
- Yu J. Prosper function analysis for organ allocation: A counting process and martingale approach. University of Michigan Department of Biostatistics; Ann Arbor, MI: 2003. Ph.D. thesis. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

