Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2019 Dec 10;22(3):504–521. doi: 10.1093/biostatistics/kxz049

A Gaussian copula approach for dynamic prediction of survival with a longitudinal biomarker

Krithika Suresh 1,, Jeremy M G Taylor 2, Alexander Tsodikov 2
PMCID: PMC8561844  PMID: 31820798

Summary

Dynamic prediction uses patient information collected during follow-up to produce individualized survival predictions at given time points beyond treatment or diagnosis. This allows clinicians to obtain updated predictions of a patient’s prognosis that can be used in making personalized treatment decisions. Two commonly used approaches for dynamic prediction are landmarking and joint modeling. Landmarking does not constitute a comprehensive probability model, and joint modeling often requires strong distributional assumptions and computationally intensive methods for estimation. We introduce an alternative approximate approach for dynamic prediction that aims to overcome the limitations of both methods while achieving good predictive performance. We separately specify the marker and failure time distributions conditional on surviving up to a prediction time of interest and use standard variable selection and goodness-of-fit techniques to identify the best-fitting models. Taking advantage of its analytic tractability and easy two-stage estimation, we use a Gaussian copula to link the marginal distributions smoothly at each prediction time with an association function. With simulation studies, we examine the proposed method’s performance. We illustrate its use for dynamic prediction in an application to predicting death for heart valve transplant patients using longitudinal left ventricular mass index information.

Keywords: Dynamic prediction, Gaussian copula, Joint modeling, Landmarking, Longitudinal data, Survival analysis

1. Introduction

Personalized medicine focuses on tailoring a treatment to an individual based on an their particular risk. For survival outcomes, an estimate of this risk is traditionally obtained from a prediction model employed at some baseline time, such as diagnosis, and only uses information up to that point. However, there is increased interest in predicting the conditional survival of patients beyond baseline. During follow-up, new information such as updated biomarker measurements may become available for the patient. To obtain accurate, individualized survival predictions during follow-up, prediction models must incorporate this patient information that can change over time, and thus produce dynamic predictions. These predictions can then be used by clinicians to monitor a patient’s prognosis during follow-up to implement additional therapies or modify their screening schedule.

The statistical task is to develop a technique that produces survival predictions at baseline, and incorporates additional marker information to obtain updated predictions for patients that are still alive at future time points. Thus, dynamic prediction methods require incorporating time-dependent marker information, Inline graphic, into a model for the failure time, Inline graphic, to obtain the conditional distribution Inline graphic, where Inline graphic is the marker information available up to time Inline graphic. The dynamic prediction of experiencing the event of interest in the next Inline graphic interval given survival up to time Inline graphic and up-to-date marker information is then the conditional survival probability Inline graphic. Two common approaches for obtaining these dynamic predictions are joint modeling and landmarking.

1.1. Dynamic prediction with joint modeling and landmarking

Joint modeling specifies a model for the marker process, Inline graphic, and a model for the failure time that links it to the marker process, Inline graphic, e.g., a survival model with hazard Inline graphic (Wulfsohn and Tsiatis, 1997;Henderson and others, 2000;Wang and Taylor, 2001). From these two models, the joint distribution Inline graphic can be derived. Joint modeling produces a valid prediction function from which we can obtain consistent conditional survival predictions that have a defined, meaningful relationship with predictions obtained from the model at other time points (Jewell and Nielsen, 1993), thus making it suited for dynamic prediction ( Rizopoulos, 2011; Taylor and others, 2013; Rizopoulos and others, 2017). The dynamic predictions obtained from joint modeling at time Inline graphic for surviving up to time Inline graphic involve integrating the conditional hazard Inline graphic from Inline graphic to Inline graphic, which requires knowledge of the distribution of future values of the marker process beyond the current measurement Inline graphic. Thus, utilizing joint modeling for prediction requires the full specification of the marker process, which involves making specific distributional assumptions. In addition, the marker model may be difficult to estimate when there are sparse longitudinal measurements, and misspecification of this model can result in biased predictions (Rizopoulos and others, 2008b). A practical disadvantage of this method is that it can require computationally intensive methods for both estimation and the calculation of dynamic predictions (Rizopoulos, 2011).

Landmarking requires directly specifying a survival model for Inline graphic by looking at the empirical failure time distribution at fixed time points, Inline graphic, conditional on being alive at Inline graphic and having marker value Inline graphic (van Houwelingen, 2007;Zheng and Heagerty, 2005;Gong and Schaubel, 2013). Thus, at each Inline graphic, we obtain the best-fitting model for Inline graphic using information from individuals alive at Inline graphic and their last available longitudinal marker measurement, Inline graphic. Estimation of this empirical distribution is accomplished by using a Cox regression to model the hazard Inline graphic, where the baseline hazard and covariate effects can be restricted to vary smoothly with Inline graphic. The dynamic survival predictions can be directly computed as Inline graphic. The advantages of this method are that it avoids having to specify the distribution of the marker process and can be easily implemented in standard software. A disadvantage of landmarking is the numerous decisions required by the method. To conduct estimation, landmarking requires prespecifying the prediction times of interest, referred to as landmark times. For simple landmark models, computing dynamic predictions is restricted to these time points. Since a model for the marker process is not specified, to perform estimation landmarking also requires selecting an imputation method for marker values at landmark times at which individuals do not have observations. As well, the landmarking approach does not satisfy the consistency criteria described in Jewell and Nielsen (1993) since it directly models the conditional hazard Inline graphic and does not derive it from the joint distribution of failure time and marker processes, as in joint modeling. In previous work (Suresh and others, 2017), we demonstrated that under a binary marker process, landmarking results in a theoretically incorrect model; however, with increased flexibility it provides a sufficient approximation to a joint model.

The advantage of using an ad hoc approach, such as landmarking, for dynamic prediction is that it is a simpler method that does not require assumptions about the marker distribution and does not impose a computational burden on estimation or calculating predicted probabilities. However, it lacks the consistency of a valid prediction function that is offered by a joint model. Thus, we propose an alternative approximate approach for dynamic prediction that aims to combine the advantages and mitigate the disadvantages of landmarking and joint modeling, while maintaining good predictive performance.

1.2. Copula approach for dynamic prediction

Copulas are multivariate cumulative distribution functions with uniform marginals, and by Sklar’s theorem they provide a convenient approach to link marginals to construct a joint distribution (Nelsen, 1999). We propose an approximate method for dynamic prediction that requires specifying the marginal models Inline graphic and Inline graphic for individuals alive at time Inline graphic, and then uses a bivariate Gaussian copula to model the joint distribution of Inline graphic and Inline graphic conditional on being alive at Inline graphic, Inline graphic. From this joint distribution, we can directly compute the dynamic predictions. The copula allows us to specify the marginal distributions of the marker data and time-to-event process and then model their association separately. It is a flexible way of specifying this association since there is no restriction on the marginal distributions, which do not have to be specified parametrically.

As with landmarking, this method does not produce a comprehensive probability model; however, we maintain a greater level of consistency in our predictions by specifying a single model for Inline graphic, and then deriving the model for Inline graphic, which will be consistently defined for all Inline graphic. Unlike joint modeling, we do not require a flexible specification of the marker process using random effects that can lead to complex estimation. Instead, we specify the marginal distribution of the longitudinal data at each Inline graphic, allowing the mean and variance of the distribution to change smoothly with Inline graphic. We use two-stage estimation to first estimate the parameters from the marginal models, and then hold them fixed in the joint likelihood to estimate the association parameters. Estimation is conducted using likelihood-based methods, which allow for standard methods of model checking and validation.

Researchers have previously used copulas to facilitate joint modeling and dynamic prediction. Rizopoulos and others (2008a),Rizopoulos and others (2008b) proposed a reparameterization of a shared random effects model by using a copula to model the joint distribution of the random effects of the marker process and the frailty term of the survival process. In our formulation, we avoid the complexity of random effects estimation by using the copula to directly model the association between simple, but flexible models, for the marginal distributions of the survival and marker data. Ganjali and Baghfalaki (2015) uses a multivariate copula to obtain a fully specified joint model for Inline graphic and Inline graphic measured at fixed time points Inline graphic, given by Inline graphic. This approach uses an individual’s entire longitudinal marker history to make predictions but does not easily accommodate measurement times that vary by individual. Emura and others (2018) use a joint-frailty copula model to specify the joint distribution between two survival outcomes, time-to-death and time-to-tumor progression. From this joint model, they derive the dynamic prediction conditional on the individual’s tumor progression status. Similarly, we obtain the dynamic prediction by conditioning on an individual’s marker value. In comparison, only one of our outcomes is time-to-event, thus for convenience of derivation we do not use a survival copula in our model specification. Unlike these approaches, our model formulation does not specify a joint model for Inline graphic and Inline graphic, but rather models the cross-sectional distribution between the survival and marker data at landmark times, allowing that relationship to change smoothly over time.

In summary, we aim to describe a new method for dynamic prediction using a novel Gaussian copula approach. In Section 2, we introduce the model and discuss a two-stage approach for estimation in the situation of a continuous marker. Using a simulation study, we explore the performance of our method in Section 3. In Section 4, we illustrate the use of our method to obtain dynamic predictions of survival for heart valve replacement patients with longitudinal measurements of heart function. Section 5 is a discussion of the advantages and limitations of our method and future directions.

2. Method

Our proposed method for dynamic prediction specifies the marginal distributions of the marker data and the survival outcome and uses a copula to model the association between the two outcomes over time. The intuition behind this approach is that we can specify a model for each of the marginals that imposes fewer assumptions on the marginal distributions and for which we can assess goodness-of-fit, and then model their correlation using a copula with a time-varying association structure.

2.1. Copula model and estimation

Let Inline graphic denote the observed data, where Inline graphic is the true event time, Inline graphic is the censoring time, Inline graphic is the observed event time, Inline graphic is the censoring indicator, Inline graphic is the baseline covariate vector, and Inline graphic is the Inline graphic longitudinal marker vector, with element Inline graphic denoting the marker value at time Inline graphic, for subject Inline graphic. We assume non-informative censoring, i.e., conditional on baseline covariates any additional dropout process is not related to the event of interest or the longitudinal process. We discuss dependent censoring in Section A of the supplementary material available at Biostatistics online.

The dynamic prediction of interest is the subject-specific predicted probability of experiencing the survival event in the time interval Inline graphic], Inline graphic, given that a new subject Inline graphic has survived up to time Inline graphic, i.e.,

graphic file with name mt064.gif (2.1)

where Inline graphic are the set of longitudinal measurements recorded up to time Inline graphic. In practice, this prediction may depend only on the scalar marker measurement at time Inline graphic, Inline graphic.

We are interested in specifying the marginal distributions of Inline graphic, the time-to-event outcome, and Inline graphic, the longitudinal marker data, for each landmark time Inline graphic. Thus, we restrict the models for Inline graphic and Inline graphic to be conditional on the patient being alive at time Inline graphic, and are specifically interested in modeling the conditional survival time Inline graphic and the cross-sectional marker data at Inline graphic, Inline graphic, denoted by Inline graphic and Inline graphic, respectively.

A Gaussian copula is then used to link the survival time distribution and the marker data distribution at all landmark time points, allowing us to compute the dynamic predictions from an overall model.

We consider the situation of a continuous marker process. Let Inline graphic and Inline graphic be the marginal distributions of the time-to-event outcome Inline graphic and the marker data Inline graphic, respectively, conditional on the individual being alive at time Inline graphic. Both of the marginals can be conditional on baseline covariates Inline graphic, i.e., Inline graphic and Inline graphic; however, we shall omit them from the following model specification for brevity. The joint distribution Inline graphic is then defined using a Gaussian copula as

graphic file with name mt089.gif (2.2)

where Inline graphic is the standard normal distribution, Inline graphic is the standard bivariate normal distribution, and Inline graphic is the correlation, which is specified as a smooth function of landmark time and possibly baseline covariates Inline graphic. The joint density is given by

graphic file with name mt094.gif

where Inline graphic and Inline graphic, and Inline graphic and Inline graphic are the marginal densities of Inline graphic and Inline graphic, respectively. This is the likelihood contribution of individuals who at time Inline graphic are alive and have observed marker value Inline graphic, and at time Inline graphic have an observed event. For individuals who are alive at time Inline graphic, but are censored at time Inline graphic, the joint density is given by

graphic file with name mt106.gif

Let Inline graphic be the parameter vector containing the respective marginal parameters Inline graphic and Inline graphic of Inline graphic and Inline graphic, and association parameters Inline graphic. The likelihood contribution for individual Inline graphic at measurement time Inline graphic is then

graphic file with name mt115.gif (2.3)

where Inline graphic is the time at which individual Inline graphic has the event or was censored (i.e., last observed time). Assuming working independence between measurements at different time points for each individual, we construct the following pseudo-likelihood function

graphic file with name mt118.gif

Due to the number of parameters associated with both the marginal models and the association structure, direct maximization of this pseudo-likelihood may still be computationally difficult. Thus, we further simplify estimation by using the method of inference functions for margins (IFM), where the marginal parameters, Inline graphic and Inline graphic, are first estimated from the marginal models and then held fixed in the maximization of the pseudo-likelihood over the association parameters Inline graphic (Joe and Xu, 1996). Specifically,

  • (a) The marginal parameter estimates, Inline graphic and Inline graphic, are obtained separately from their respective marginal models using an appropriate estimation method (e.g., maximum partial likelihood estimation for a Cox survival model, least squares if using linear regression for the marker model). Estimation of Inline graphic uses data Inline graphic and estimation of Inline graphic uses data Inline graphic.

  • (b) The function Inline graphic is maximized over Inline graphic to get Inline graphic using Nelder–Mead optimization.

  • (c) Inline graphic is the IFM estimate.

The standard errors for the marginal survival model can be obtained in the same way as with a standard Cox or parametric survival model (Andersen and Gill, 1982). Robust standard errors for the marginal marker model are computed using a sandwich estimator (Zeger and Liang, 1986). Following arguments presented in existing literature (Joe and Xu, 1996;Prenen and others, 2017;Song, 2007), we describe two-stage parametric variance estimation for the association parameters in Section B of the supplementary material available at Biostatistics online, with the extension to semiparametric variation following from arguments presented in Prenen and others (2017) and Spiekerman and Lin (1998). The analytic standard errors of the association parameter vector are complicated since they account for variability from the estimates from the marginal models. Thus, in practice the standard errors are computed using a resampling scheme, such as jackknife (Joe and Xu, 1996) or bootstrapping (Efron and Tibshirani, 1993).

Using the parameter estimates Inline graphic obtained from the IFM method, we can compute the dynamic prediction of interest from (2.1) as,

graphic file with name mt133.gif (2.4)

where Inline graphic, Inline graphic, and Inline graphic. Note that we compute this prediction conditioning on the scalar value of the marker at time Inline graphic, Inline graphic. For the situation where the marker is not observed at the prediction time, we instead use Inline graphic in our prediction, where Inline graphic is the time at which the marker was last observed. That is, we compute the quantile of the marker distribution at the time at which the marker was observed and carry it forward to the prediction time of interest, rather than carrying the marker value to a new time at which the marker distribution is different.

2.2. Modeling copula components

In choosing models for the components of the copula, we consider simple, flexible, but possibly misspecified models that serve as a good approximation to the true distribution. This allows us to avoid making strong modeling assumptions and allows for easy estimation that can be readily implemented in standard software.

2.2.1. Modeling the marker data

Instead of specifying a mixed effects model for the evolution of the marker data over time, as is the case with joint modeling, we specify the distribution of the longitudinal data at each landmark time Inline graphic, where this model has a mean and variance that can be specified as a function of landmark time Inline graphic and baseline covariates Inline graphic. We define the model Inline graphic, where Inline graphic is a vector of regression coefficients, Inline graphic is a function of landmark time, baseline covariates, and regression coefficients, and Inline graphic is an error term that is independently distributed. We consider a linear regression for Inline graphic, where we model Inline graphic as a smooth parametric function of landmark time and Inline graphic, where Inline graphic is also a smooth function of Inline graphic and Inline graphic is a vector of regression coefficients. The marginal parameters of Inline graphic are then Inline graphic.

2.2.2. Modeling the failure time data

We model the time-to-event outcome, Inline graphic, and then compute the conditional survival Inline graphic. We propose modeling the event time using a Cox model that can be extended to accommodate non-proportional hazards or additional flexibility, Inline graphic, where Inline graphic is the baseline hazard, Inline graphic is a vector of regression coefficients, and Inline graphic is a function of baseline covariates, regression coefficients, and possibly time, to allow for non-proportional hazards or time-varying covariate effects. The marginal distribution of the time-to-event outcome is then given by Inline graphic and the corresponding parameters to be estimated, Inline graphic, are the parameter vector Inline graphic and the cumulative baseline hazard Inline graphic.

2.2.3. Modeling the association

Once we define the marginals Inline graphic and Inline graphic, we can use the Gaussian copula defined in (2.2) to describe the joint distribution at landmark time Inline graphic. To model the association between the two marginals, we reparameterize Inline graphic using Fisher’s z-transformation Inline graphic. We define Inline graphic as a function of landmark time Inline graphic, baseline covariates Inline graphic, and regression coefficients Inline graphic. Thus, from the association function we can evaluate the extent of the correlation between the time-to-event outcome and the marker process, and how that relationship changes with landmark time.

2.2.4. Choosing the copula

We consider a Gaussian copula due to its tractable nature, and easy implementation in standard software. However, there are other choices of copulas that have differing strengths of dependence in the distribution tails. The Student’s t copula is symmetric and has an additional parameter for degrees of freedom that controls the strength of the tail dependence. This copula is both upper- and lower-tail dependent, which allows for joint extreme events and can be beneficial if we expect our distribution to have heavy tails. The Clayton and Gumbel copulas are Archimedean copulas that are lower-tail and upper-tail dependent, respectively. We present our method under these alternative copulas in Section C of the supplementary material available at Biostatistics online and consider the performance of the Student’s t copula in our simulation study.

3. Simulation study

We conduct a simulation study to assess the predictive performance of our proposed method in comparison to the existing methods of joint modeling and landmarking.

3.1. Performance comparison metrics

We evaluate the discrimination and calibration of dynamic predictions computed under the different methods for the interval Inline graphic using dynamic analogs of weighted area under the curve (AUC) and Brier score (BS), which account for censoring. We denote these measures as Inline graphic and Inline graphic, respectively, and use the following definitions presented in Blanche and others (2015),

graphic file with name mt178.gif

where Inline graphic and Inline graphic is the dynamic prediction of interest given in (2.1). The inverse probability of censoring weight estimators are then

graphic file with name mt181.gif

where Inline graphic, Inline graphic, and weights to account for censoring are given by Inline graphic, where Inline graphic is the Kaplan–Meier estimator of survival function for the censoring time at Inline graphic.

Since BS depends on the cumulative incidence of death in Inline graphic, we use a standardized version that produces an Inline graphic-type measure that compares how well the predictions perform relative to predictions from a null model given by the Kaplan–Meier estimate, Inline graphic, which does not take into account subject-specific information. We denote this scaled measure as Inline graphic. To make comparisons between the different models, we compute the best-attainable AUC and Inline graphic using the predicted probabilities from the true models, denoted Inline graphic and Inline graphic, respectively. We then report the relative measures Inline graphic and Inline graphic for each of the models, where values close to 0 indicate better performance. To ensure that our method is not consistently predicting higher or lower than the true probabilities, we also evaluate calibration using the root mean squared prediction errors (RMSPEs) between the true conditional survival probabilities, Inline graphic, and the predictions obtained from each of the different models considered, given by

graphic file with name mt197.gif

Five hundred simulations of 1000 subjects were run for each scenario. A random sample of 500 subjects were selected to create a training data set, to which the models were fit. The performance metrics were then computed for predictions from the remaining 500 subjects who compose the validation data set, and averaged across the simulations. The data sets included in this average are those for which all methods converged.

3.2. Simulation set-up

We simulate patients who have been followed for a period of 10 years, for whom longitudinal biomarker measurements are available at baseline. We simulate marker measurement under two patterns of observation: (i) the marker process is observed every year for 9 years following baseline and (ii) the value of the marker is observed at inspection times occurring according to a Poisson process with rate 0.5 and 1. We simulate a binary baseline covariate Inline graphic that has 50% prevalence. We generate the longitudinal marker measurements using the following linear mixed effects model

graphic file with name mt199.gif

where Inline graphic and Inline graphic. To generate the survival times, we use the following joint model

graphic file with name mt202.gif (3.1)

with Inline graphic as the Weibull baseline hazard. Since Inline graphic describes how the survival process is affected by the longitudinal biomarker, we consider different levels of correlation between the two processes. The noisiness of the marker process is described by Inline graphic, thus we simulate under both a noisy and a more stable marker process. We also consider a missing at random (MAR) scenario where the marker value at each time point is missing with probability 0.2 and 0.1 for those with Inline graphic and Inline graphic, respectively.

To assess the robustness of the proposed copula method to more general situations, such as non-linear marker trajectories and complex dependencies, we also simulate under alternative joint model generating mechanisms. We generate the longitudinal marker measurements using the following linear mixed effects model with a non-linear function of time,

graphic file with name mt208.gif

where Inline graphic is the B-spline basis for a cubic spline with boundary knots at 0 and 10 years and two internal knots at Inline graphic 3 and 6 years, Inline graphic and Inline graphic. For the joint models, when generating the data, we consider two types of association structure between the survival and marker processes: (i) where the functional form depends on only the marker value as given in (3.5) or (ii) depends both on the marker value and the slope at Inline graphic,

graphic file with name mt214.gif

The parameters used in the various submodels to generate the data are given in Section E of the supplementary material available at Biostatistics online, and the simulation scenarios are described in Table E.1 of the supplementary material available at Biostatistics online. We generate right-censoring from a Uniform(0,10) distribution. Under the various scenarios considered, we vary the value of Inline graphic to achieve censoring rates of 45–60%. We consider landmark times Inline graphic and present results for predicting failure within a window of Inline graphic years beyond the prediction time.

3.2.1. Models used for dynamic prediction

The joint, landmark, and copula models fit to the marker data are shown in Table 1. We consider a shared random effects model for the joint models, where (JM1) has a linear marker submodel and survival submodel with the marker value, (JM2) has a cubic spline marker submodel and survival submodel with the marker value, and (JM3) has a cubic spline marker submodel and survival model with both the marker value and slope. We fit (JM1) in Scenarios 1–4, (JM2) in Scenario 5, and (JM3) in Scenario 6. The landmark models considered are the super model (LM1) and the extended super model (LM2) (van Houwelingen, 2007). For additional flexibility, we also include an interaction between the marker value and the baseline covariates in the (LMInt*) models. For (LM1) and (LMInt1), we create a landmark data set for each prediction time of interest with administrative censoring at the prediction horizon, and stack them to form a super data set to which we fit the models. In the irregular inspection time scenarios, for (LM1) and (LMInt1) the marker value at each landmark time was imputed using last observation carried forward (LOCF). For (LM2) and (LMInt2), we used a longitudinal data set, which does not require imputation of the marker values for estimation. Details for the estimation and prediction of the joint and landmark models are given in Section D of the supplementary material available at Biostatistics online.

Table 1.

Summary of models fit in the simulation study

Class Model Label
Joint Models Inline graphic (JM1)
  Inline graphic  
  Inline graphic  
  Inline graphic (JM2)
  Inline graphic  
  Inline graphic  
  Inline graphic  
  Inline graphic (JM3)
  Inline graphic  
  Inline graphic  
  Inline graphic  
Landmark Models Inline graphic (LM1)
  Inline graphic (LMInt1)
  Inline graphic (LM2)
  Inline graphic (LMInt2)
Copula Models Inline graphic : Gaussian copula  
  Inline graphic  
  Inline graphic  
  Inline graphic  
  Inline graphic ; Inline graphic modeled nonparametrically (CC1)
  Inline graphic ; Inline graphic modeled as Weibull hazard (CW1)
  Inline graphic : Gaussian copula  
  Inline graphic  
  Inline graphic  
  Inline graphic  
  Inline graphic ; Inline graphic modeled nonparametrically (CC2)
  Inline graphic ; Inline graphic modeled as Weibull hazard (CW2)
  Inline graphic : Student’s t (df=4)  
  Inline graphic  
  Inline graphic  
  Inline graphic  
  Inline graphic ; Inline graphic modeled nonparametrically (CC3)
  Inline graphic ; Inline graphic modeled as Weibull hazard (CW3)

Inline graphic , Inline graphic is a B-spline basis for a cubic spline with boundary knots at 0 and 10 years.

To identify the modeling structure for the copula components, we examine diagnostic plots and test goodness-of-fit. We fit a population-averaged model to the longitudinal data, and from the loess curve plotted to the marker trajectories, we identify that a B-spline for landmark time with an interaction with the baseline covariate is the best-fitting function for the marginal mean. Also, we allow the variance of the population-averaged model to increase with time. We use this structure for the marginal marker model in all of the copula models considered. We model the failure time data parametrically (W: Weibull) or semiparametrically (C: Cox) and include the effect of the baseline covariate Inline graphic. In (C*1), the association component is a function of time, the baseline covariate, and their interaction. In models (C*2), the association component is described more flexibly using a B-spline for landmark time. Finally, (C*3) has the same marginal model components and association as (C*1), which are instead joined using a Student’s t copula with 4 degrees of freedom to identify whether the heavier tails of this copula provide a better fit to the data.

3.3. Results

We present the simulation results in Section E, Tables E.2–E.19, and Figures E.2–E.8 of the supplementary material available at Biostatistics online. In Figure 1, we compare the performance of all three methods under Scenario 1a (Inline graphic, Inline graphic, inspection rate = 0.5). The copula model with semiparametric hazard (CC1) performs slightly better than the parametric version (CW1) at earlier landmark times, with lower Inline graphic and RMSPE but similar Inline graphicAUC. The copula models outperform the landmark model (LM2) across all metric and have better performance than the (LM1) model at earlier prediction times where the data are less sparse. The copula models perform similarly to the joint model (JM1) at earlier landmark times; however, the joint model performs better than the approximate methods at later time points.

Fig. 1.

Fig. 1.

Simulation estimates for Scenario 1a (Inline graphic, Inline graphic, inspection rate 0.5) for Inline graphicAUC (top-left) and Inline graphic (top-right), and RMSPE for Inline graphic (bottom-left) and Inline graphic (bottom-right) for predicted probability Inline graphic from copula models (CC1), (CW2), joint model (JM1), and landmark models (LM1), (LM2). Lower values indicate better predictive performance.

We compare the performance of the different semiparametric copula models (Figure E.2 of the supplementary material available at Biostatistics online), noting that a comparison of the parametric copula models exhibits similar relationships. The copula models considered have the same AUC. In comparing (CC1) and (CC2), we find that changing the association structure results in similar performance. This suggests that choosing a flexible form for the association function is sufficient as long as well-fitting models are chosen for the marginal marker and failure time distributions. The (CC3) models that use the Student’s t copula have similar performance to the (CC1) and (CC2) model, with slightly better performance in scenarios with more frequent measurement times.

As the inspection rate increases (Scenarios *a vs. *b; Figure E.3 of the supplementary material available at Biostatistics online), the RMSPE tended to decrease for all the models, with the copula models still performing better than (LM2) and (LMInt2), and marginally better or similar to (LM1) and (LMInt1) at earlier landmark times. With a fixed inspection time (Scenarios *c; Figure E.4 of the supplementary material available at Biostatistics online), the copula and landmark models have similar AUC. Model (LM1) has lower RMSPE than the copula models at later inspection times; however, it has comparable Inline graphicAUC and Inline graphic. The parametric copula model (CW1) has worsening performance at the later landmark times. The copula models outperform (LM2) and (LMInt2), which have a smaller improvement in RMSPE and Inline graphic compared to the other models for all landmark times. Under the MAR assumption (Scenarios *d; Figure E.5 of the supplementary material available at Biostatistics online), the RMSPE increases for all the models; however, the relative performance between the models remains the same.

As the association between the marker process and the hazard (Inline graphic) decreases (Scenario 1 vs. 2, and 3 vs. 4; Figure E.6 of the supplementary material available at Biostatistics online), the semiparametric copula models have better predictive performance than the landmarking models. With increasing measurement error (Inline graphic) (Scenarios 1 vs. 3, and 2 vs. 4; Figure E.7 of the supplementary material available at Biostatistics online), the RMSPE for all the models increases. The semiparametric copula models have similar RMSPE and BS as the (LM*1) models but higher AUC, and the copula models perform better across all metrics than the (LM*2) models.

We evaluate the robustness of the copula method to capture dependence in more general situations. In Scenario 5, we generate data under a non-linear marker trajectory. The performance of the joint model (JM2) is superior to the other methods in terms of higher AUC, but the RMSPE and BS is similar to the copula and (LM*1) models at earlier landmark times (Figure 2). The (LM*2) models have poor performance across all of the metrics compared to the other models. The Student’s t copula models (C*3) have worse performance than the Gaussian copula models (C*1) and (C*2), which perform similarly. In Scenario 6, we simulate under a non-linear marker trajectory and a complex dependency, by including both the value of the marker and its slope in the dependence between the longitudinal and survival submodels, and find similar relative relationships in model performance (Figure E.8). Under both Scenario 5 and 6 (Tables E.18, E.19 of the supplementary material available at Biostatistics online), the joint models from which the data were generated, (JM2) and (JM3), respectively, had better performance than the landmarking or copula models, indicating the need to incorporate additional flexibility into these approaches when dealing with complex dependencies.

Fig. 2.

Fig. 2.

Simulation estimates for Scenario 5 (non-linear marker trajectory, inspection rate 1) for Inline graphicAUC (top-left) and Inline graphic (top-right), and RMSPE for Inline graphic (bottom-left) and Inline graphic (bottom-right) for predicted probability Inline graphic from copula models (CC1), (CW1), joint model (JM2), and landmark models (LM1), (LM2). Lower values indicate better predictive performance.

In general, the copula method has good performance across all of the metrics considered. It has consistently better performance than the landmark model with baseline hazard as a function of landmark time, and outperforms the landmark model with stratified hazards when there are irregular measurement times. Under a linear marker trajectory and simple dependency, at earlier prediction times, the copula model performs similarly to the joint model from which the data is generated. It also appears to be robust to the choice of association function if the marginal models are well chosen, and is able to maintain good prediction with varying levels of measurement error and association between the marker and survival process.

4. Application to Aortic Heart Valve Study

To demonstrate the ability of the copula method to produce dynamic predictions, we use data from an observational study that followed 248 patients who received an aortic valve replacement with the aim of comparing the efficacy of two artificial heart valves: homograft or stentless (Lim and others, 2008;Philipson and others, 2017). Longitudinal measurements of the left ventricular mass index (LVMI) were collected after surgery (baseline time), with an average of 3.7 and a maximum of 10 measurements per patient. Long-term buildup of left ventricular muscle mass can result in a fatal heart attack, thus there is interest in using a patient’s changing LVMI to predict their future risk of death in a prediction window of 3 years. The baseline covariate information used in the models considered were: type of implanted aortic prosthesis (homograft: 53%, stentless: 47%); age (median: 68; interquartile range: 59–75); and gender (male: 71%, female: 29%).

Figure F.1a of the supplementary material available at Biostatistics online depicts the survival curves by stent type and demonstrates a significantly higher survival probability for those who received the stentless valve compared to those who received the homograft valve. We examine the fit of a Cox model to the failure time data and find no violation of the proportional hazards assumption for any of the baseline covariates. Figure F.1b of the supplementary material available at Biostatistics online depicts the longitudinal log(LVMI) observations per patient and from the loess curves we see that there is a decrease in mean log(LVMI) in the first year, after which it appears to be increasing with time. Thus, we consider a non-linear marker trajectory, with possible interactions between the covariates and time. Selecting the best-fitting model using backwards selection with Akaike information criterion (AIC), we identify the population-averaged model for Inline graphic as the main effects model with a quadratic spline effect for landmark time and constant variance Inline graphic. We considered more flexible forms for the association function including interactions and splines, but found that the results are similar to simpler forms. Thus, we fit the following Gaussian copula model: Inline graphic, Inline graphic, Inline graphic, where Inline graphic is a vector of the baseline covariates age, valve type, and gender, Inline graphic is modeled nonparametrically, and Inline graphic is a B-spline basis for a quadratic spline with boundary knots at 0 and 11 years and an internal knot at Inline graphic year. The parameter estimates and bootstrapped standard errors for the copula model are given in Table F.1 of the supplementary material available at Biostatistics online. From the marginal model for Inline graphic, we find that age has a significant positive effect on time to death, while the effect of gender and stent type were not significant. From the marginal model for Inline graphic, females have a lower average log(LVMI) than males, and those with the homograft valve have a higher average log(LVMI) than those with the stent valve. There was a significant quadratic spline effect for time on average log(LVMI). The association between the risk of death and increased log(LVMI) is negative indicating decreased time to death and did not change with landmark time or baseline covariates. In Figure 3, we depict the predicted survival probabilities and the 95% bootstrap prediction intervals from the copula model for two patients in the data set as their marker value changes. Individual A is a younger male, who received the stentless valve, and has lower log(LVMI) that is increasing over time. Individual B is older, received the homograft valve, and has a steady, but higher log(LVMI) with a sudden increase at the last measurement. Since Individual A is a relatively low-risk patient, we see that their predicted probability of death is low, with risk of death increasing as their log(LVMI) increases. Individual B is at higher risk due to their increased age at baseline, and their risk of death in the next 3 years increases greatly after their log(LVMI) spikes suddenly at 4 years, and they eventually die at 5.4 years from baseline. These predicted probability plots can be used by clinicians to monitor a patient’s prognosis following valve replacement to identify if the patient’s changing log(LVMI) is putting them at high risk for future death and further interventions must be implemented.

Fig. 3.

Fig. 3.

Predicted survival probabilities and 95% bootstrap prediction interval for risk of death within 3 years from the copula model for two patients in the heart valve data set. Individual A (top) is male, 59 years old at baseline, received the stentless valve, and does not die before the end of the study. Individual B (bottom) is male, 78 years old at baseline, received the homograft valve and died at 5.4 years after baseline. The dashed line indicates time at which the most recent marker measurement (black dot) was recorded. The dotted line indicates time of death.

We also apply the joint modeling and landmarking approaches to the data for comparison. We fit a shared random effects model with the same quadratic spline for time in the longitudinal submodel as in the copula model and a random intercept and slope. More flexible spline functions were considered for the random effects specification, but resulted in convergence issues likely due to the limited sample size. To allow for irregular prediction times, we consider the landmark extended super model. The details of these models are given in Section F of the supplementary material available at Biostatistics online. We compare performance by computing AUC and BS using 5-fold cross-validation, repeated and averaged over 500 iterations. From Table 2, we see that the models perform similarly. The copula and joint model have similar AUC across landmark times and the joint model has slightly lower BS. Landmarking performs slightly worse in terms of AUC and BS than the other two methods. In Figures F.2 and F.3 of the supplementary material available at Biostatistics online, we compare the bootstrap prediction intervals from the joint and landmark models for the individuals presented in Figure 3, and find that the joint model has the narrowest intervals while the landmark model has slightly wider intervals than the copula model.

Table 2.

Cross-validated AUC and BS for the considered models in the aortic heart valve analysis at landmark times 0.5–3.5 years with a prediction horizon of 3 years

    0.5 1.5 2.5 3.5
AUC Joint Model 0.639 0.706 0.769 0.865
Landmark 0.635 0.707 0.761 0.860
Copula 0.637 0.711 0.769 0.870
BS Joint Model 0.092 0.111 0.116 0.110
Landmark 0.097 0.114 0.122 0.116
Copula 0.095 0.113 0.121 0.113

5. Discussion

Dynamic models that incorporate the effect of time-dependent covariates on survival are essential for making personalized clinical decisions about an individual’s care. While there are two popular statistical methods for dynamic prediction, landmarking and joint modeling, they both have limitations that we address by presenting an alternative approximate method that has useful advantages and good predictive performance.

We propose the use of a copula-based approach for incorporating longitudinally collected marker information in predicting an individual’s future survival. First, we specify from a well-established class of models the marginal distributions of the marker (e.g., linear regression, generalized linear model) and the failure time (e.g., Cox, cure model). This allows us to apply standard goodness-of-fit tests and variable selection techniques to identify the best-fitting marginal models. Second, we define the joint distribution of the survival time and marker conditional on being alive at a particular prediction time. With this formulation, we can easily derive the dynamic prediction of interest. We present our modeling framework for a continuous marker process and demonstrate its performance using a simulation study and a data application.

There are several advantages of our approach over the existing joint model and landmarking methods. In comparison to landmarking, the copula model does not require the creation of a landmark data set, instead only using marker information available at measurement times. This allows us to avoid prespecifying landmark times and imputing unobserved marker values at these times, which can introduce bias. We also do not need to fix a time horizon for prediction and can obtain predictions for a new patient at any continuous time point beyond baseline. Since the marginal model for time-to-event is internally consistent for all landmark times, we maintain a greater level of consistency in our predictions than landmarking. As in joint modeling, we specify a model for the marker; however, since we are modeling the population-averaged trajectory, rather than allowing for individual-specific random effects, we are able to specify a simpler model than a shared random effects or frailty model that can require complex and computationally intensive estimation. In principle, it is easier to check goodness-of-fit for marginal models; thus, we are able to minimize bias at this stage. Table F.3 of the supplementary material available at Biostatistics online presents a comparison of the different methods, listing some pros and cons of each approach.

A limitation of our proposed method is that it relies heavily on the availability of data at prediction times of interest for it to properly model the joint distribution between the marker and failure time. As well, with the different models for the marginals and the association, there are several parameters to be estimated. The two-stage approach results in large standard errors for the association parameters, due to the estimation variability of the marginal model parameters. In the simulation study, we found that the copula models’ performance was similar when comparing flexible association functions. Thus, rather than performing variable selection for the association parameters, we can consider choosing a flexible-enough association function or using cross-validation techniques to select an appropriate association structure. Finally, the use of a Gaussian copula is applicable only when linking two continuous outcomes, a survival time and a continuous marker value. Thus, to describe the association between a binary marker (e.g., the occurrence of an intermediate event) and a time-to-event outcome, our model specification must be adjusted using methods such as those introduced by Song and others (2009) and de Leon and Wu (2011). The predictive performance can be compared to Emura and others (2018) that uses a copula to incorporate time to the intermediate event rather than just the indication of experiencing the event.

We compare our copula models to standard landmark models and supermodels. In both approaches, we impute the marker value at landmark times using LOCF (of the marker value or the quantile). We can consider alternative imputation methods, such as fitting a mixed model for Inline graphic as in Ferrer and others (2019) that can increase the computational burden but improve marker prediction. In the simulation study, we explore the performance of the copula model under more complex longitudinal trajectories and dependence structures. As in Ferrer and others (2019), we find that the performance of misspecified models that did not account for these additional complexities were poor compared to the true joint models. Thus, extensions under the copula framework are required to improve predictive performance under more complicated settings. For example, to achieve a more accurate prediction, it might be of interest to incorporate an individual’s longitudinal marker history Inline graphic, where Inline graphic represents the individual’s history of longitudinal marker values up to time Inline graphic. Joint modeling handles this naturally. Rizopoulos and others (2017) and Ferrer and others (2019) extended the landmark model specification to consider parameterizations that incorporate the marker relationship with survival in various ways, such as the slope of the marker trajectory, Inline graphic. The copula model can also potentially be extended to directly incorporate a summary of the marker history up to time Inline graphic, by modeling a different aspect of the marginal marker distribution instead of the marginal marker mean. For example, we could have instead considered a marginal model for the change in the marker value from baseline, Inline graphic. The ability to model the marginals and their distributions separately allows for more complicated, better-fitting models to be considered for the marginals, but still maintains simple estimation using the two-stage method.

Using a copula framework provides the potential for several extensions to more complicated data structures. We can consider a multivariate Gaussian copula to accommodate multiple longitudinal markers or marker summaries. Thus, rather than specifying a full joint model for the different marker processes, it is easier to consider modeling their marginal behavior and using a flexible form for the copula association function to model their joint distributions. This model structure can also help identify the size and direction of the correlation between the various longitudinal markers. Such an approach can greatly increase the dimension of the parameter space, so care should be taken to choose parsimonious models for the marginal components and simpler association functions for the resulting correlation matrix.

While joint modeling and landmarking are popular in current literature, our copula-based approach provides an alternative method for dynamic prediction that has good predictive performance and easy estimation. By choosing more flexible and complicated models for the marginals, we could potentially further decrease the bias introduced by fitting a misspecified model. Future work will focus on extending the copula framework for dynamic prediction to address more complex data forms and applications.

6. Software

The R code is available on Github at http://github.com/ksuresh17/copula-dyn-pred.

Acknowledgments

Conflict of Interest: None declared.

Supplementary material

Supplementary material is available online at http://biostatistics.oxfordjournals.org.

Funding

National Institutes of Health (CA129102), in part.

References

  1. Andersen,  P. K. and Gill, R. D. (1982). Cox’s regression model for counting processes: a large sample study. The Annals of Statistics 10, 1100–1120. [Google Scholar]
  2. Blanche,  P., Proust-Lima, C., Loubère, L., Berr, C., Dartigues, J.-F. and Jacqmin-Gadda, H. (2015). Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time-to-event in presence of censoring and competing risks. Biometrics 71, 102–113. [DOI] [PubMed] [Google Scholar]
  3. de Leon,  A. R. and Wu, B. (2011). Copula-based regression models for a bivariate mixed discrete and continuous outcome. Statistics in Medicine 30, 175–185. [DOI] [PubMed] [Google Scholar]
  4. Efron,  B. and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. New York: Chapman and Hall. [Google Scholar]
  5. Emura,  T., Nakatochi, M., Matsui, S., Michimae, H. and Rondeau, V. (2018). Personalized dynamic prediction of death according to tumour progression and high-dimensional genetic factors: meta-analysis with a joint model. Statistical Methods in Medical Research 27, 2842–2858. [DOI] [PubMed] [Google Scholar]
  6. Ferrer,  L., Putter, H. and Proust-Lima, C. (2019). Individual dynamic predictions using landmarking and joint modelling: Validation of estimators and robustness assessment. Statistical Methods in Medical Research 28, 3649–3666. [DOI] [PubMed] [Google Scholar]
  7. Ganjali,  M. and Baghfalaki, T. (2015). A copula approach to joint modeling of longitudinal measurements and survival times using Monte Carlo expectation-maximization with application to AIDS studies. Journal of Biopharmaceutical Statistics 25, 1077–1099. [DOI] [PubMed] [Google Scholar]
  8. Gong,  Q. and Schaubel, D. E. (2013). Partly conditional estimation of the effect of a time-dependent factor in the presence of dependent censoring. Biometrics 69, 338–347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Henderson,  R., Diggle, P. and Dobson, A. (2000). Joint modelling of longitudinal measurements and event time data. Biostatistics 1, 465–480. [DOI] [PubMed] [Google Scholar]
  10. Jewell,  N. P. and Nielsen, J. P. (1993). A framework for consistent prediction rules based on markers. Biometrika 80, 153–164. [Google Scholar]
  11. Joe,  H. and Xu, J. J. (1996). The estimation method of inference functions for margins for multivariate models. Technical Report, University of British Columbia. [Google Scholar]
  12. Lim,  E., Ali, A., Theodorou, P., Sousa, I., Ashrafian, H., Chamageorgakis, T., Duncan, A., Henein, M., Diggle, P. and Pepper, J. (2008). Longitudinal study of the profile and predictors of left ventricular mass regression after stentless aortic valve replacement. The Annals of Thoracic Surgery 85, 2026–2029. [DOI] [PubMed] [Google Scholar]
  13. Nelsen,  R. B. (1999). An Introduction to Copulas. New York: Springer Series in Statistics. [Google Scholar]
  14. Philipson,  P., Sousa, I., Diggle, P. J., Williamson, P., Kolamunnage-Dona, R., Henderson, R. and Hickey, G. L. (2017). joineR: Joint Modelling of Repeated Measurements and Time-to-Event Data. Available from: http://cran.r-project.org.
  15. Prenen,  L., Braekers, R. and Duchateau, L. (2017). Extending the Archimedean copula methodology to model multivariate survival data grouped in clusters of variable size. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79, 483–505. [Google Scholar]
  16. Rizopoulos,  D. (2011). Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data. Biometrics 67, 819–829. [DOI] [PubMed] [Google Scholar]
  17. Rizopoulos,  D.s, Molenberghs, G. and Lesaffre, E.M.E.H. (2017). Dynamic predictions with time-dependent covariates in survival analysis using joint modeling and landmarking. Biometrical Journal 59, 1261–1276. [DOI] [PubMed] [Google Scholar]
  18. Rizopoulos,  D., Verbeke, G., Lesaffre, E. and Vanrenterghem, Y. (2008a). A two-part joint model for the analysis of survival and longitudinal binary data with excess zeros. Biometrics 64, 611–619. [DOI] [PubMed] [Google Scholar]
  19. Rizopoulos,  D., Verbeke, G. and Molenberghs, G. (2008b). Shared parameter models under random effects misspecification. Biometrika 95, 63–74. [Google Scholar]
  20. Song,  P. X.-K. (2007). Correlated Data Analysis: Modeling, Analytics, and Applications. New York: Springer Series in Statistics. [Google Scholar]
  21. Song,  P. X.-K., Li, M. and Yuan, Y. (2009). Joint regression analysis of correlated data using Gaussian copulas. Biometrics 65, 60–68. [DOI] [PubMed] [Google Scholar]
  22. Spiekerman,  C. F. and Lin, D. Y. (1998). Marginal regression models for multivariate failure time data. Journal of the American Statistical Association 93, 1164–1175. [Google Scholar]
  23. Suresh,  K., Taylor, J. M. G., Spratt, D. E., Daignault, S. and Tsodikov, A. (2017). Comparison of joint modeling and landmarking for dynamic prediction under an illness-death model. Biometrical Journal 59, 1277–1300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Taylor,  J. M. G., Park, Y., Ankerst, D. P., Proust-Lima, C., Williams, S., Kestin, L., Bae, K., Pickles, T. and Sandler, H. (2013). Real-time individual predictions of prostate cancer recurrence using joint models. Biometrics 69, 206–213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. van Houwelingen,  H. C. (2007). Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics 34, 70–85. [Google Scholar]
  26. Wang,  Y. and Taylor, J. M. G. (2001). Jointly modeling longitudinal and event time data with application to acquired immunodeficiency syndrome. Journal of the American Statistical Association 96, 895–905. [Google Scholar]
  27. Wulfsohn,  M. S. and Tsiatis, A. A. (1997). A joint model for survival and longitudinal data measured with error. Biometrics 33, 330–339. [PubMed] [Google Scholar]
  28. Zeger,  S. L. and Liang, K.-Y. (1986). Longitudinal data analysis for discrete and continuous outcomes. Biometrics 42, 121–130. [PubMed] [Google Scholar]
  29. Zheng,  Y. and Heagerty, P. J. (2005). Partly conditional survival models for longitudinal data. Biometrics 61, 379–391. [DOI] [PubMed] [Google Scholar]

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES