Abstract
In cancer randomized controlled trials, surrogate endpoints are frequently time-to-event endpoints, subject to the competing risk from the time-to-event clinical outcome. In this context, we introduce a counterfactual-based mediation analysis for a causal assessment of surrogacy. We use a multistate model for risk prediction to account for both direct transitions towards the clinical outcome and indirect transitions through the surrogate outcome. Within the counterfactual framework, we define natural direct and indirect effects with a causal interpretation. Based on these measures, we define the proportion of the treatment effect on the clinical outcome mediated by the surrogate outcome. We estimate the proportion for both the cumulative risk and restricted mean time lost. We illustrate our approach by using 18-year follow-up data from the SPCG-4 randomized trial of radical prostatectomy for prostate cancer. We assess time to metastasis as a surrogate outcome for prostate cancer-specific mortality.
1. Introduction
Surrogate outcomes are intermediate markers of disease development or progression that can be measured earlier than clinical outcomes. Using a surrogate outcome in a randomized controlled trial has the potential to shorten follow-up duration and decrease the required sample size, thereby reducing trial cost and minimizing participant burden.[1, 2] The evaluation of surrogate outcomes has received considerable attention in the literature.[3, 4] In this context, a measure of the proportion of treatment effect on the clinical outcome explained by the surrogate outcome is a useful way to convey how good a surrogate outcome is.[5, 6, 7]
In this paper, we consider a single-trial setting and we focus on the situation where both the surrogate and clinical outcomes are time-to-event outcomes subject to right censoring. In addition, the surrogate outcome is subject to semi-competing risk from the clinical outcome. This competing risk situation is particularly frequent in cancer trials, where time to progression is considered as a surrogate for time to death.[8, 9] Our work is motivated by data from the Scandinavian Prostate Cancer Group 4 (SPCG-4) randomized trial.[10, 11] This trial enrolled 695 men with clinically detected, localized prostate cancer at 14 centers in Sweden, Finland, and Iceland between 1989–1999. Participants were randomly allocated to radical prostatectomy (n=347) or watchful waiting (n=348) and followed for development of distant metastases and death from any cause. We aim to assess time to metastasis as a surrogate outcome for (i) time to death and (ii) time to prostate cancer-specific death. Here, the development of metastases is subject to the competing risk of death. In addition, prostate cancer death is subject to the competing risk of death from other causes.
Few methods have been proposed to assess surrogacy in this setting, where both outcomes are time-to-event outcomes subject to censoring and with potential competing risks. In the setting of a single trial, Ghosh has described a semi-parametric accelerated failure time model to derive the proportion of treatment effect explained.[12] In the multiple-study framework, Burzykowski et al. proposed bivariate Copula models focusing primarily on the proportional hazards models.[13] Moreover, Ghosh has extended his prior work based on the accelerated failure time models to the meta-analytic setting, allowing for analysis in the semi-competing risks setting.[14]
An alternative approach to the evaluation of surrogate outcomes is through a counterfactual-based causal mediation analysis. This framework is appropriate for evaluation of a special subclass of surrogate outcomes that are believed to be mediators on the causal pathway between treatment and the clinical outcome. The counterfactual outcome framework enables separating the direct effect of the treatment from the indirect effect through the intermediate surrogate outcome.[15, 16, 17] The ratio of the indirect effect to the total effect is interpreted as the proportion of the treatment effect on the clinical outcome explained by the surrogate outcome. Previous works have focused on causal mediation analysis when either the surrogate or clinical outcome is a time-to-event outcome but not both.[15, 18] We propose a method based on a multistate model to derive the indirect effect and the proportion of mediation when both the surrogate and clinical outcomes are time-to-event outcomes. We derive these quantities for two measures of the treatment effect: the difference in cumulative risks and the difference in restricted mean times lost. The latter quantifies the between-group gain or loss in the mean time lost due to a specific cause over a pre-specified time interval.[19] In a competing risk setting, the restricted mean time lost is the area below the cumulative incidence function.[20, 21] In the absence of competing risks, it comes down to the area above the survival function, or one minus the restricted mean survival time.[22, 23]
In Section 2, we define the notation for our motivating example in the SPCG-4 prostate cancer trial. In Sections 3 and 4, we describe the multistate model and define the indirect effect and the proportion of mediation. In Section, 5, we introduce estimators of these metrics based on the multistate model. In Section 6, we apply the proposed method to the prostate cancer trial example. We provide R code for reproducibility and implementation on GitHub (github.com/iweir/Mediation_Surrogacy).
2. Motivating example and notations
We use data from the SPCG-4 randomized trial to assess time to metastasis as a surrogate outcome for (i) time to death and (ii) time to prostate cancer death. Let M denote the time of metastasis, T the time of death, and C the time of right-censoring. Let A be the indicator for treatment arm with values A = 1 for radical prostatectomy and A = 0 for watchful waiting. We denote by n1 and n0 the numbers of participants randomly allocated to radical prostatectomy and to watchful waiting, respectively. We denote Z the vector of covariates at randomization. We assume that (M, T) and C are conditionally independent given Z and A. We observe n0 + n1 observations (Xl, , Yl, ; Zl, Al) with l = 1,⋯, n0, n0 + 1⋯, n0 + n1 a random sample from (X, dX, Y, dY; Z, A), where X = M ∧ T ∧ C and Y = T ∧ C, with corresponding censoring indicators, and , denoting whether the time represents metastasis, death, or right-censoring. Metastasis is subject to the semi-competing risk of death. Thus, M is censored by the minimum of T and C, and not only by C.
In the case where we distinguish prostate cancer-specific death (ϵ = 1) from other causes of death (ϵ = 2), we further note T1 the time to prostate cancer death and T2 the time to death from other causes. We observe a random sample from (X, dX, Y, dY; Z, A), where X = M ∧T1 ∧T2 Ĉ, , Y = T1 ∧ T2 ∧ C, and . Prostate cancer death is the event of interest, and death from another cause is a competing event.
3. Multistate risk prediction model
We estimate the cumulative risk of all-cause death or prostate cancer death through multistate models without recovery. Let t denote the time elapsed since randomization. We let D(t) denote the occupied state at time t, where D(t) takes values in {0, 1, 2} corresponding to stable prostate cancer, metastatic prostate cancer, and death, respectively (Figure 1). All individuals start in state 0 at time t = 0 and can experience the following transitions at any time t > 0: 0 → 1, 0 → 2, and 1 → 2.
Figure 1:

Progressive illness-death and multistate models
Left: Illness-death framework in which states D(t) in {0, 1, 2} correspond to stable prostate cancer (PCa), metastatic PCa, and death.
Right: Multistate framework which states D′(t) in {0, 1, 2, 3} correspond to stable PCa, metastatic PCa, PCa-specific death, and death from other causes.
We can also distinguish between prostate cancer-specific death and death from other causes. Prostate cancer death is defined as death in the presence of metastatic disease. As a consequence, individuals who do not transition to metastasis cannot transition to prostate cancer death. We denote the different states by D′(t) taking values in {0, 1, 2, 3} corresponding to stable prostate cancer, metastatic prostate cancer, prostate cancer-specific death, and death from other causes (Figure 1). We model the following transitions: 0 → 1, 0 → 3, 1 → 2, and 1 → 3.
In the following section, we describe the non-parametric estimation of the risk function without covariates. We then describe a semi-parametric model to predict cumulative risks based on covariates at randomization and time-dependent covariates.
3.1. Non-parametric estimation
The cause-specific hazard of a transition from state j to state k at time t is defined by . The cumulative transition hazard is then defined by . We organize the cumulative hazards into the following square matrix with numbers of rows and columns equal to the number of states in the multistate model:
The diagonal entries, , indicate that no transition occurred and the individual remained in state j at time t. When using the model that distinguishes between prostate cancer-specific death and death from other causes, transition 0 → 2 is not possible, and the cumulative hazards matrix is given by:
We use H to estimate the probability of transition from state j at time s into state k at time t, Pjk(s, t) = P (D(t) = k|D(s) = j). The transition probability matrix can be calculated with the product integral by , where I is the identity matrix. The Aalen-Johansen estimator of the transition probability matrix is
| (1) |
where u takes values among all event times in (s,t], is the matrix of Nelson-Aalen estimators of the cumulative hazards, and ∏ denotes the finite product over event times.[24, 25]
When we focus on all-cause death as the clinical outcome, we estimate the cumulative risk of all-cause death between time 0 and t (occupation of state D = 2) by . When we focus on prostate cancer death as the clinical outcome, we estimate the cumulative incidence (or marginal probability) of prostate cancer death between time 0 and t (occupation of state D′ = 2) by .[26] In both cases, the functions take into account the probability of a transition through the intermediate state of metastasis.
3.2. Semi-parametric estimation with Cox regression
To assess the associations between covariates and transition hazards, we use a Cox model stratified by transition. Data are stacked such that for each participant, there is one row corresponding to each transition for which that participant is at risk. The row includes the start time, stop time, and indicator for the transition. Each stratum of the Cox model corresponds to a transition. Covariates are fixed at baseline but we let covariates have different effects on each transition hazard. The model for the hazard of transition from state j to k is given by
| (2) |
where hjk,0(t) denotes the baseline hazard function for a transition from state j to state k at time t.
We predict the transition probabilities by plugging in the cumulative hazards estimated from the Cox regression models into equation (1). In particular, for participant l, we can estimate the cumulative risk of death (or prostate cancer death) by , while accounting for a potential transition through the intermediate state of metastasis.
The βjk are estimated by maximizing the partial log-likelihood described in [24]. We used the mvna and etm R packages for non-parametric estimation and the mstate R package for the semi-parametric Cox model.[27]
We also expand the semi-parametric Cox regression model to include a time-dependent covariate, m, to consider how the time of occurrence of the surrogate outcome or the time since the occurrence of the surrogate may influence the transition to death. We update the stratified Cox model (2) and equation (1) by using:
or
We use the clock forward approach to estimate h12,0(t) and h13,0(t), in which case only individuals who have transitioned to the surrogate outcome contribute to the estimation.
4. Counterfactual-based mediation analysis
We now are interested in the causal effect of radical prostatectomy on (i) time to death and (ii) prostate cancer death. Different estimands can be used as target for inference, for example the cumulative risk of death (or cumulative incidence of prostate cancer death). We first give the counterfactual definition of the estimand. We denote by the cumulative risk of death between 0 and τ had, possibly contrary to fact, all participants been randomly allocated to treatment arm A = a but with the metastasis status they would have under allocation to A = a′.[28] Under this definition of the counterfactual estimand, the average causal effect of prostatectomy on death is . In addition, denotes the counterfactual cumulative risk of death by time τ had all participants been randomly allocated to prostatectomy but with the metastasis status they would have under watchful waiting.
We make the standard assumptions for mediation analysis.[15, 16] Namely, that there are no unmeasured confounders for the associations between (i) the randomization arm and the true outcome, (ii) the randomization arm and surrogate outcome, and (iii) the surrogate outcome and true outcome. In addition, we assume that (iv) there is no confounder for the association between the surrogate and true outcome that is an effect of the randomization arm.
In the setting of a randomized trial, the exposure (treatment arm, A) is randomized, however, the mediator experience is not. Therefore, to satisfy assumption (iii), we assume that the baseline covariates in Z control for the confounding between the mediator and outcome. In some settings, assumption (iv) can be more markedly considered by ruling out potential confounders. In cases where the assumptions do not hold, sensitivity analysis techniques can be used for the direct and indirect effects. [29, 30]
We define the indirect effect and proportion of mediation for two measures of treatment effect, the difference in cumulative risks of the outcome and the difference in restricted mean times lost. In the case of competing risks, for example when focusing on prostate cancer-specific death in the SPCG-4 data, we focus on the cumulative incidence function instead of the cumulative risk.
4.1. Difference in cumulative risks
Using the counterfactual estimands, we define the direct effect of radical prostatectomy on cumulative risk at time τ by . Similarly, the indirect effect is defined by . The total effect is the sum of the direct and indirect effects, . We then define the proportion of mediation as the ratio of the indirect effect to the total effect:
| (3) |
The direct effect captures the effect of radical prostatectomy on the cumulative risk of death (or prostate cancer-specific death) if allocation to radical prostatectomy did not induce a change in time to metastasis. The natural indirect effect captures the effect if we were to assign all participants to radical prostatectomy and compare the cumulative risk of death (or prostate cancer-specific death) if the distribution of metastasis were set to what it would have been with radical prostatectomy versus watchful waiting. The total effect provides a measure of the total causal effect of changing the treatment arm from radical prostatectomy to watchful waiting for each participant. Finally, the proportion of mediation yields the proportion of the treatment effect on risk of death (or prostate cancer-specific death) that is explained by the treatment effect on time to metastasis.[31]
Note that the proportion of mediation is not bounded on the interval [0, 1]. When the direct and indirect effects are in opposite directions, the resulting proportion of mediation falls outside of this range. In such a case, the proportion of mediation does not allow its usual interpretation as a proportion of effect explained. However, the fact that the direct and indirect effects are in opposite directions suggests that there is no or poor surrogacy.
4.2. Difference in restricted mean times lost
We also can measure the treatment effect based on the restricted mean time lost . Using the counterfactual estimands, we then define the direct effect of radical prostatectomy on the restricted mean time lost at time τ by . The indirect effect is and the total effect is . The proportion of mediation is defined by:
| (4) |
5. Estimation of mediated proportion
5.1. Difference in cumulative risks
We use the multistate models described in Section 3 to estimate the counterfactual estimands and the proportion of mediation. We first fit the model using both the experimental and control groups and we estimate the predicted cumulative risk of outcome for each individual l, . Note that these predictions account for the possibility of indirect transitions through metastasis.
To estimate F1M0, we specifically predict the cumulative risk of death among watchful waiting (A = 0) participants by using their observed covariate profile, their observed metastasis experience, but by modifying their allocation to radical prostatectomy (A = 1). This way, we estimate the risk of death had all participants been randomly allocated to prostatectomy but with the metastasis status they would have under watchful waiting, since the latter was indeed observed among participants randomized to watchful waiting. We define the estimator for by:
where is given by
and .
We insist that, to estimate , we average the predicted probabilities only over the n0 participants who were truly randomly allocated to the control arm, A = 0, but by counter-factually forcing their exposure to be (A = 1), indicating random allocation to the experimental arm. Because randomization ensures that the two treatment groups only differ with respect to treatment allocation, this method enables predicting the unobserved cumulative risk of death (or prostate cancer death) for participants randomly allocated to radical prostatectomy but with the time to metastasis they would have experienced under allocation to watchful waiting. In Appendix Table 1, we illustrate the counterfactual predicted risks of death at 18 years for an hypothetical participant profile and 8 combinations of time to metastasis and time to death.
We estimate and using the same strategy, by averaging predicted probabilities over the experimental and control arms, respectively, but without modifying the exposures of individuals (detailed formulas are provided in Appendix Section 2). We plug in the estimates of , , and in Equation (3) to estimate the proportion mediated according to the difference in cumulative risks.
5.2. Difference in restricted mean times lost
To obtain the estimate of the restricted mean time lost, we integrate the counterfactual cumulative risks or incidence functions over the time interval from [0, τ). Let us note tm the event times within this interval. We have
| (5) |
Similarly, we derive estimates of and . We estimate the proportion mediated according to the difference in restricted mean times lost using Equation (4).
5.3. Confidence intervals
We derive confidence intervals on the mediation measures by using a perturbation-resampling approach.[32, 33] For , let be a vector of independent random variables from a standard exponential distribution. Note that , where .
Let . This quantity is averaged over participants in the control group, with their observed covariate profile and metastasis experience, but by forcing the allocation to radical prostatectomy.
Similarly, we define , averaged over participants in the experimental group, and , averaged over participants in the control group, in both cases with their observed covariate profile, metastasis experience, and allocation.
We further define the direct, indirect, and total effects by
and the proportion of mediation by
We can estimate the distribution of
by the empirical distribution of
To obtain the 95% confidence interval for each mediation statistic, we calculate the 2.5th and 97.5th empirical percentiles of , , , and , respectively. We apply the same method to derive the confidence interval for the measures based on the restricted mean time lost.
As previously noted, Π1(τ) and Π2(τ) are not restricted to the closed interval [0, 1]. As a consequence, we may also find confidence interval bounds outside of [0, 1] indicating that there is no evidence that the direct and indirect effects share the same direction.
6. Results in the SPCG-4 randomized trial
We analyzed data from the SPCG-4 trial collected through 18 years of follow-up. We performed 6 analyses in total. We evaluated time to metastasis as a surrogate outcome for all-cause mortality (D = 2) and for prostate cancer-specific mortality (D′ = 2). For both cases, we fit three models: (i) a multistate model with covariates at randomization (age, prostate-specific antigen level, Gleason score, tumor stage, WHO grade, and year of diagnosis); (ii) model (i) plus time of metastasis occurrence as a time-dependent covariate; (iii) model (i) plus time spent in metastasis as a time-dependent covariate. For all analyses, we used complete-case data, with 297 radical prostatectomy participants and 285 watchful waiting participants. Appendix Table 2 shows characteristics of participants by treatment arm. Below, we present the analysis for prostate cancer-specific death based on multistate model (ii), incorporating time of metastasis occurrence as a time-dependent covariate. We report the 5 remaining analyses in Appendix Tables 3–7 and Appendix Figures 1–10.
Table 1 shows the transition-specific hazard ratios for each covariate in the multistate model. There is evidence that radical prostatectomy decreases the hazard of metastasis (HR=0.56, 95% CI from 0.40 to 0.77, p < 0.001) but no evidence of an association for the remaining three transitions. Older age is associated with increased hazard of transition from stable prostate cancer and from metastatic prostate cancer to death from other causes (HR=1.57, 95% CI from 1.34 to 1.85, p < 0.001 and HR=2.21, 95% CI from 1.28 to 3.82, p = 0.005, respectively).
Table 1:
Multistate model for the SPCG-4 trial accounting for time of metastasis occurrence
| Transition 0 → 1 | Transition 0 → 3 | Transition 1 → 2 | Transition 1 → 3 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HR | 95% CI | P-value | HR | 95% CI | P-value | HR | 95% CI | P-value | HR | 95% CI | P-value | |
| RP vs WW | 0.56 | (0.40, 0.77) | <0.001 | 1.00 | (0.76, 1.33) | 0.98 | 0.86 | (0.58, 1.27) | 0.44 | 0.47 | (0.17, 1.28) | 0.14 |
| Age, SD | 0.93 | (0.81, 1.06) | 0.28 | 1.57 | (1.34, 1.85) | <0.001 | 1.07 | (0.88, 1.29) | 0.51 | 2.21 | (1.28, 3.82) | 0.005 |
| Prostate-specific antigen, ng/mL | 1.03 | (1.02, 1.04) | <0.001 | 1.01 | (0.99, 1.02) | 0.29 | 0.99 | (0.97, 1.00) | 0.09 | 0.99 | (0.95, 1.03) | 0.71 |
| Gleason score, points (range 3–9) | 1.45 | (1.24, 1.69) | <0.001 | 1.09 | (0.95, 1.25) | 0.22 | 1.03 | (0.86, 1.24) | 0.76 | 1.02 | (0.64, 1.62) | 0.95 |
| Tumor Stage | ||||||||||||
| T2 vs. T1b | 1.01 | (0.60, 1.68) | 0.98 | 0.94 | (0.61, 1.44) | 0.76 | 0.96 | (0.52, 1.76) | 0.89 | 1.14 | (0.30, 4.40) | 0.85 |
| T1c vs. T1b | 0.44 | (0.19, 1.02) | 0.06 | 0.94 | (0.51, 1.73) | 0.84 | 1.81 | (0.62, 5.33) | 0.28 | 7.35 | (0.71, 76.50) | 0.10 |
| Year of Diagnosis | 0.90 | (0.84, 0.97) | 0.006 | 0.93 | (0.87, 0.99) | 0.03 | 0.98 | (0.90, 1.07) | 0.62 | 0.96 | (0.77, 1.18) | 0.68 |
| WHO grade 2 vs. 1 | 1.38 | (0.96, 1.99) | 0.08 | 1.00 | (0.73, 1.35) | 0.98 | 1.38 | (0.89, 2.14) | 0.15 | 1.31 | (0.47, 3.67) | 0.60 |
| Time of transition to metastatic PCa | - | - | - | - | - | - | 1.07 | (1.00, 1.16) | 0.06 | 0.92 | (0.80, 1.06) | 0.26 |
RP: radical prostatectomy; WW: watchful waiting; PCa: prostate cancer.
Transitions 0 → 1: from stable prostate cancer to metastatic prostate cancer; 0 → 3: from stable prostate cancer to other cause death; 1 → 2: from metastatic prostate cancer to prostate cancer death; 1 → 3: from metastatic prostate cancer to other cause death.
Figure 2 shows the direct, indirect, and total effects over time with perturbation-resampling 95% confidence intervals for the difference in cumulative risks and in restricted mean times lost. With the difference in cumulative risks, the indirect effect indicates a 2.5% reduction in cumulative risk of prostate cancer death (95% CI from −4.7% to −0.45%) at 18 years. Furthermore, at 18 years, 40% (95% CI from 14% to 53%) of the treatment effect on prostate cancer death is mediated by the treatment effect on development of metastases (Figure 3). With the difference in restricted mean times lost, the indirect effect suggests an improvement of 2.4 months in life-years lost to prostate cancer over 18 years of follow-up. We find that the proportion of mediation is 35% (95% CI from 5.3% to 47%) at 18 years. Both estimates of the proportion of mediation suggest that time to the development of metastases is a satisfactory surrogate for the clinical outcome of time to prostate cancer death. For both measures, we see that the proportion mediated increases over time. The confidence intervals on the indirect effect are wide. As a consequence, the confidence intervals for the proportion mediated are also wide.
Figure 2: Direct, indirect, and total effects in the SPCG-4 trial.

Top: difference in cumulative risks of prostate cancer death. Bottom: difference in restricted mean times lost to prostate cancer death. Solid lines: point estimate; dotted lines: perturbation-resampling 95% confidence intervals.
Figure 3: Proportion mediated by metastasis for the difference in cumulative risks of prostate cancer death (left) and for the difference in restricted mean times lost to prostate cancer death in the SPCG-4 trial.

7. Discussion
We have introduced a new model in the causal mediation framework for surrogate outcome evaluation when both the surrogate and clinical outcomes are time-to-event outcomes and the surrogate outcome is on the causal pathway between the exposure and the clinical outcome. The method handles censoring and competing risks. We used the framework of counterfactual-based mediation analysis, which offers a causal interpretation of surrogacy in the bivariate survival context using a single randomized trial. We defined the direct, indirect, total effect, and proportion of mediation with respect to differences in cumulative risks and differences in restricted mean times lost. In the SPCG-4 randomized trial, we found that a considerable proportion of the treatment effect on all-cause death was explained by the treatment effect on metastasis. However, we found substantial uncertainty on our estimates, as evidenced by wide confidence intervals. Our findings are consistent with results from the Intermediate Clinical Endpoints in Cancer of the Prostate (ICECaP) individual participant data meta-analysis which found metastasis-free survival a strong surrogate for overall survival for localized prostate cancer.[34]
Previous works focused on surrogate endpoint evaluation in the counterfactual mediation framework for different surrogate and clinical endpoint types. For example, Oba et al. used marginal structural models to estimate direct and indirect effects in the setting of a continuous mediator and binary outcome.[35] In their illustrative example, they assessed systolic blood pressure as a surrogate endpoint for cardiovascular, cerebrovascular, and cardiac events. They found wide 95% confidence intervals for the direct, indirect, and total effects. Hsu et al. outlined a conceptual framework for a counterfactual mediation model for time-varying exposures and time-varying surrogates.[36] Vandenberghe et al. described a counterfactual-based model for surrogate evaluation with a binary mediator and a continuous or binary true endpoint.[37] They proposed several estimators using propensity score-based models and generalized linear models. Subsequently, Vandenberghe et al. described a counterfactual-based model for evaluation of a binary surrogate endpoint for a time-to-event clinical endpoint.[38] They defined restricted maximum likelihood and locally efficient estimators for each mediation measure on the risk ratio scale. In another paper the authors introduced methods for surrogate evaluation for a continuous mediator measured repeatedly over time in the setting of a time-to-event true endpoint. They used a generalised dynamic path analysis framework to estimate the mediation measures using the hazard ratio scale.[39] Weber and Titman applied copula-based models to explore the association between progression-free and overall survival in oncology trials using the illness-death model for estimation of Kendall’s τ. [40] They found the Clayton copula to be appropriate for many oncology scenarios but acknowledged that the use of copula models in practice may be a limitation.
In our work, we used a multistate model approach which allows us to handle competing risks data and to explain the treatment effect on the clinical outcome with intermediate outcomes reflecting different parts of the disease process. Our method is unique because it is grounded in the causal mediation framework and addresses the specific setting where both the surrogate and clinical outcomes are time-to-event endpoints. To our knowledge, there are no existing methods for surrogate outcome evaluation that estimate the causal mediation measures in this bivariate time-to-event setting and therefore we did not pursue comparisons with other methods. We include the details and results of a small simulation study in Appendix Section 3 showing a comparison of the estimated mediation measures between our method and a Cox model which treats the surrogate outcome as binary, not time-to-event, for increasing proportions of censoring. Future work will connect the true values for the direct, indirect, total effects, and the proportion of mediation to the simulation parameters and will assess the statistical performance of the proposed method in a comprehensive simulation study.
The vast majority of applied work on surrogacy evaluation has focused on the hazard ratio as a measure of treatment effect.[41] Stensrud et al. have discussed how hazard ratios have a difficult causal interpretation, whereas the difference in restricted mean times lost offers a more accessible interpretation as the causal difference in life-years lost between two groups over a restricted period of time.[42] Parast et al. have also defined a model to assess surrogacy using the difference in restricted mean survival times, however it is not in the counterfactual context.[43] Our proposed method focuses on the evaluation of surrogate outcomes that are causal mediators. The mediation measures with the difference in restricted mean times lost provide a causal interpretation of treatment effect on the time scale. However, our estimation model relies on the hazard functions to be proportional over time. This condition appeared consistent with the data in our motivating example, but may be violated in other randomized trials. An alternative approach would be to use parametric multistate survival models with time-dependent effects.[44]
Conventional surrogacy evaluation methods focus on the relationship between the treatment effect on the surrogate endpoint and the treatment effect on the true endpoint. In our motivating example, our causal measures provide insight into the tumor-dependent and tumor-independent mechanisms through which treatment affects survival. Our method provides insight into the biological mechanisms through which treatment influences progression to lethal prostate cancer. This idea can be applied more broadly to other clinical applications.
In the SPCG-4 trial, radical prostatectomy showed a benefit for all-cause mortality, prostate cancer-specific death, as well as metastasis.[10] This gave ground for the application of our method to explore metastasis as a potential surrogate for all-cause mortality and prostate cancer mortality. We found that the treatment effect on metastasis explained a considerable proportion of the treatment effect on all-cause death, however the corresponding confidence interval was wide. Despite controlling for covariates at randomization, there could be residual confounding between the surrogate and clinical outcome due to the fact that our all-cause death can potentially occur at a much later time point than metastasis. We could apply our method at lagged time points, i.e. assuming that data on metastasis are truncated at an earlier time point than data on mortality. This way, one would assess the temporal relationship between the treatment effect on the mediator at an earlier time point and the treatment effect on the clinical endpoint at a later time point.
Parast et al. addressed a similar idea in which they suggest to assess whether the treatment effect for the clinical endpoint at an earlier time point is a surrogate for the treatment effect on the same clinical endpoint at a later time point.[45] However, in their method, if an individual experiences both the surrogate and the primary outcome, only the primary outcome is incorporated into the analysis. Thus the method ignores indirect paths to the primary outcome and in such cases, the surrogate event does not contribute to the proportion of mediation.
Supplementary Material
Acknowledgments
We thank Anna Bill-Axelson (Uppsala University) and the SPCG-4 study team for providing access to data. We thank Katia Bulekova (Boston University) for her help with the BU Shared Computing Cluster.
Funding
IRW was supported by the National Institute of General Medical Sciences (NIGMS) Interdisciplinary Training Grant for Biostatisticians (T32 GM074905) and the Statistical and Data Management Center of the AIDS Clinical Trials Group at Harvard University (National Institute of Allergy and Infectious Diseases UM1-AI068634). LT and JR were supported by the Boston University School of Public Health Early Career Catalyst Award.
Data Availability
Research data are not shared.
References
- [1].Molenberghs G, Buyse M, Geys H, Renard D, Burzykowski T, and Alonso A. Statistical challenges in the evaluation of surrogate endpoints in randomized trials. Controlled Clinical Trials, 23(6):607–625, 2002. [DOI] [PubMed] [Google Scholar]
- [2].Buyse M, Sargent DJ, Grothey A, Matheson A, and De Gramont A. Biomarkers and surrogate end points—the challenge of statistical validation. Nature reviews Clinical oncology, 7(6):309, 2010. [DOI] [PubMed] [Google Scholar]
- [3].Weir CJ and Walley RJ. Statistical evaluation of biomarkers as surrogate endpoints: a literature review. Statistics in medicine, 25(2):183–203, 2006. [DOI] [PubMed] [Google Scholar]
- [4].Buyse M and Molenberghs G. Criteria for the validation of surrogate endpoints in randomized experiments. Biometrics, 54(3):1014–1029, 1998. [PubMed] [Google Scholar]
- [5].Lin DY, Fleming TR, and De Gruttola V. Estimating the proportion of treatment effect explained by a surrogate marker. Statistics in medicine, 16(13):1515–1527, 1997. [DOI] [PubMed] [Google Scholar]
- [6].Wang Y and Taylor JMG. A measure of the proportion of treatment effect explained by a surrogate marker. Biometrics, 58(4):803–812, 2002. [DOI] [PubMed] [Google Scholar]
- [7].Taylor JMG, Wang Y, and Thiébaut R. Counterfactual links to the proportion of treatment effect explained by a surrogate marker. Biometrics, 61(4):1102–1111, 2005. [DOI] [PubMed] [Google Scholar]
- [8].Jackson WC, Suresh K, Tumati V, Allen SG, Dess RT, Salami SS, George A, Kaffenberger SD, Miller DC, Hearn JWD, et al. Intermediate endpoints after postprostatectomy radiotherapy: 5-year distant metastasis to predict overall survival. European Urology, 74(4):413–419, 2018. [DOI] [PubMed] [Google Scholar]
- [9].Burzykowski T, Buyse M, Piccart-Gebhart MJ, Sledge G, Carmichael J, Lück HJ, Mackey JR, Nabholtz JM, Paridaens R, Biganzoli L, et al. Evaluation of tumor response, disease control, progression-free survival, and time to progression as potential surrogate end points in metastatic breast cancer. Journal of Clinical Oncology, 2008. [DOI] [PubMed] [Google Scholar]
- [10].Bill-Axelson A, Holmberg L, Garmo H, Rider JR, Taari K, Busch C, Nordling S, Häggman M, Andersson SO, Spångberg A, et al. Radical prostatectomy or watchful waiting in early prostate cancer. New England Journal of Medicine, 370(10):932–942, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Trinquart L, Bill-Axelson A, and Rider JR. Restricted mean survival times to improve communication of evidence from cancer randomized trials and observational studies. European Urology, 76(2):137–39, 2019. [DOI] [PubMed] [Google Scholar]
- [12].Ghosh D. Semiparametric inference for surrogate endpoints with bivariate censored data. Biometrics, 64(1):149–156, 2008. [DOI] [PubMed] [Google Scholar]
- [13].Burzykowski T, Molenberghs G, Buyse M, Geys H, and Renard D. Validation of surrogate end points in multiple randomized clinical trials with failure time end points. Journal of the Royal Statistical Society: Series C (Applied Statistics), 50(4):405–422, 2001. [Google Scholar]
- [14].Ghosh D, Taylor JMG, and Sargent DJ. Meta-analysis for surrogacy: Accelerated failure time models and semicompeting risks modeling. Biometrics, 68(1):226–232, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Lange T and Hansen JV. Direct and indirect effects in a survival context. Epidemiology, pages 575–581, 2011. [DOI] [PubMed] [Google Scholar]
- [16].VanderWeele TJ. Causal mediation analysis with survival data. Epidemiology, 22(4):582, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Joffe MM and Greene T. Related causal frameworks for surrogate outcomes. Biometrics, 65(2):530–538, 2009. [DOI] [PubMed] [Google Scholar]
- [18].Wang J and Shete S. Estimation of indirect effect when the mediator is a censored variable. Statistical Methods in Medical Research, 27(10):3010–3025, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Zhao L, Claggett B, Tian L, Uno H, Pfeffer MA, Solomon SD, Trippa L, and Wei LJ. On the restricted mean survival time curve in survival analysis. Biometrics, 72(1):215–221, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Conner SC and Trinquart L. Estimation and modeling of the restricted mean time lost in the presence of competing risks. Statistics in medicine, 40(9):2177–2196, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Zhao L, Tian L, Claggett B, Pfeffer M, Kim DH, Solomon S, and Wei LJ. Estimating treatment effect with clinical interpretation from a comparative clinical trial with an end point subject to competing risks. JAMA cardiology, 3(4):357–358, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Trinquart L, Jacot J, Conner SC, and Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. Journal of Clinical Oncology, 34(15):1813–9, 2016. [DOI] [PubMed] [Google Scholar]
- [23].Conner Sarah C, Sullivan Lisa M, Benjamin Emelia J, LaValley Michael P, Galea Sandro, and Trinquart Ludovic. Adjusted restricted mean survival times in observational studies. Statistics in medicine, 38(20):3832–3860, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].De Wreede LC, Fiocco M, and Putter H. The mstate package for estimation and prediction in non-and semi-parametric multi-state and competing risks models. Computer Methods and Programs in Biomedicine, 99(3):261–274, 2010. [DOI] [PubMed] [Google Scholar]
- [25].Andersen PK, Borgan O, Gill RD, and Keiding N. Statistical models based on counting processes. Springer Science & Business Media, 1993. [Google Scholar]
- [26].Kragh Andersen Per, Abildstrom Steen Z, and Rosthøj Susanne. Competing risks as a multi-state model. Statistical methods in medical research, 11(2):203–215, 2002. [DOI] [PubMed] [Google Scholar]
- [27].Putter H, Fiocco M, and Geskus RB. Tutorial in biostatistics: competing risks and multi-state models. Statistics in Medicine, 26(11):2389–2430, 2007. [DOI] [PubMed] [Google Scholar]
- [28].Tchetgen Tchetgen Eric J. On causal mediation analysis with a survival outcome. The international journal of biostatistics, 7(1), 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].VanderWeele TJ. Bias formulas for sensitivity analysis for direct and indirect effects. Epidemiology (Cambridge, Mass.), 21(4):540, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Imai K, Keele L, and Tingley D. A general approach to causal mediation analysis. Psychological methods, 15(4):309, 2010. [DOI] [PubMed] [Google Scholar]
- [31].Pearl J. Direct and indirect effects. In Proceedings of the seventeenth conference on uncertainty in artificial intelligence, pages 411–420. Morgan Kaufmann Publishers Inc., 2001. [Google Scholar]
- [32].Gilbert PB, Wei LJ, Kosorok MR, and Clemens JD. Simultaneous inferences on the contrast of two hazard functions with censored observations. Biometrics, 58(4):773–780, 2002. [DOI] [PubMed] [Google Scholar]
- [33].Tian L, Zucker D, and Wei LJ. On the cox model with time-varying regression coefficients. Journal of the American Statistical Association, 100(469):172–183, 2005. [Google Scholar]
- [34].Xie W, Regan MM, Buyse M, Halabi S, Kantoff PW, Sartor O, Soule H, Clarke NW, Collette L, Dignam JJ, et al. Metastasis-free survival is a strong surrogate of overall survival in localized prostate cancer. Journal of Clinical Oncology, 35(27):3097, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Oba K, Sato T, Ogihara T, Saruta T, and Nakao K. How to use marginal structural models in randomized trials to estimate the natural direct and indirect effects of therapies mediated by causal intermediates. Clinical Trials, 8(3):277–287, 2011. [DOI] [PubMed] [Google Scholar]
- [36].Hsu JY, Kennedy EH, Roy JA, Stephens-Shields AJ, Small DS, and Joffe MM. Surrogate markers for time-varying treatments and outcomes. Clinical Trials, 12(4):309–316, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Vandenberghe S, Vansteelandt S, and Loeys T. Boosting the precision of mediation analyses of randomised experiments through covariate adjustment. Statistics in Medicine, 36(6):939–957, 2017. [DOI] [PubMed] [Google Scholar]
- [38].Vandenberghe S, Duchateau L, Slaets L, Bogaerts J, and Vansteelandt S. Surrogate marker analysis in cancer clinical trials through time-to-event mediation techniques. Statistical Methods in Medical Research, 27(11):3367–3385, 2018. [DOI] [PubMed] [Google Scholar]
- [39].Vansteelandt S, Linder M, Vandenberghe S, Steen J, and Madsen J. Mediation analysis of time-to-event endpoints accounting for repeatedly measured mediators subject to time-varying confounding. Statistics in medicine, 38(24):4828–4840, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Weber EM and Titman AC. Quantifying the association between progression-free survival and overall survival in oncology trials using kendall’s τ. Statistics in medicine, 38(5):703–719, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Xie W, Halabi S, Tierney JF, Sydes MR, Collette L, Dignam JJ, Buyse M, Sweeney CJ, and Regan MM. A systematic review and recommendation for reporting of surrogate endpoint evaluation using meta-analyses. JNCI Cancer Spectrum, 3(1):pkz002, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Stensrud MJ, Aalen JM, Aalen OO, and Valberg M. Limitations of hazard ratios in clinical trials. European Heart Journal, 40(17):1378–1383, 2018. [DOI] [PubMed] [Google Scholar]
- [43].Parast L, Cai T, and Tian L. Evaluating surrogate marker information using censored data. Statistics in Medicine, 36(11):1767–1782, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Crowther Michael J and Lambert Paul C. Parametric multistate survival models: Flexible modelling allowing transition-specific distributions with application to estimating clinically useful measures of effect differences. Statistics in medicine, 36(29):4719–4742, 2017. [DOI] [PubMed] [Google Scholar]
- [45].Parast L, Tian L, and Cai T. Assessing the value of a censored surrogate outcome. Lifetime Data Analysis, pages 1–21, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Research data are not shared.
