Abstract
In observational/field studies, competing risks and left-truncation may co-exist, yielding ‘left-truncated competing risks’ settings. Under the assumption of independent competing risks, parametric estimation methods were developed for left-truncated competing risks data. However, competing risks may be dependent in real applications. In this paper, we propose a Bayesian estimator for both independent competing risks and copula-based dependent competing risks models under left-truncation. The simulations show that the Bayesian estimator for the copula-based dependent risks model yields the desired performance when competing risks are dependent. We also comprehensively explore the choice of the prior distributions (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy) and hyperparameters via simulations. Finally, two real datasets are analyzed to demonstrate the proposed estimators.
KEYWORDS: Bayesian estimation, competing risk, copula, Weibull distribution, survival analysis, truncation
1. Introduction
Competing risks often arise in observational or field studies, where survival/failure time of interest may not be ascertained due to the incidence of other events, known as competing risks. Three different models were available to analyze competing risks data; the first and the most classical one is the latent failure time model [9], the second one is the cause-specific hazard model [35], and the third one is the subdistribution hazard model [19]. The latent failure time model explicitly specifies a multivariate distribution on latent failure times under the independence model [5,27], multivariate dependence models [4,17,30] or copula models [16,48,49]. The cause-specific and subdistribution models do not explicitly specify the independence/dependence structures for competing risks. All these competing risks models are widely applied to reliability engineering [17,30,50] and medicine [1,2,11,38]. While the choice of the models depends on the goal of the research and application fields, we will focus on the latent failure time model.
Besides competing risks, ‘left-truncation’ arises in observational/field studies. Subjects who have already experienced the first milestone (e.g. onset of disease) before starting a study may be excluded from the data, which is known as left-truncation in survival analysis [25]. It is well known that ignorance of left-truncation causes a systematic bias toward favorable survival [25]. To analyze such left-truncated data in observational/field studies, Hong et al. [23] divided the data into two parts: a truncated part (e.g. times before the starting date of study) and an untruncated part (e.g. times after the starting date of study), and then introduced a combined likelihood function consisting of the two parts. Recently, this likelihood-based approach for left-truncated observational/field data (with right-censoring) received considerable attentions: see [3,12,14,24,29,32,36,43–46], and references therein.
Competing risks and left-truncation may co-exist, yielding a ‘left-truncated competing risks’ setting. Kundu et al. [27] first considered this setting under observational/field left-truncated data by extending the likelihood approach of Hong et al. [23] to introduce competing risks. Under the ‘independent’ competing risks model, Kundu et al. [27] developed their inference methods based on left-truncated competing risks data. However, competing event times may be dependent in real applications. For example, in many cancer studies focusing on time to progression (TTP) and overall survival (OS), strong correlation is observed between TTP and OS [8,15,20,28]. Therefore, the assumption of the independent risks may lead to biases due to model misspecification. Accordingly, some authors suggested specific models for left-truncated competing risks, such as the Marshall–Olkin bivariate Rayleigh model [46], and the Marshall–Olkin bivariate Weibull model [43,44]. These specific bivariate models are not the standard models, though they are good starting points for fitting dependent competing risks.
Copula-based competing risks models provide a flexible approach to incorporate the standard parametric models, such as the Weibull model. To the best of our knowledge, copula-based competing risks models with left-truncation are limited to the nonparametric models of Uña-Álvarez and Veraverbeke [10] and the parametric models of Michimae and Emura [31]. However, a problem remains: it is difficult to estimate a copula parameter. Uña-Álvarez and Veraverbeke [10] resorted to an assumed copula. Also, Michimae and Emura [31] pointed out the low estimation accuracy for parameter estimates when the copula parameter is unknown. Bayes estimators may offer a solution to this problem.
In this paper, we propose a Bayesian estimator for both the independent competing risks model and copula-based dependent competing risks models under left-truncation. There are two main contributions: First, we improve the estimation accuracy of the copula model of Michimae and Emura [31] by developing Bayesian estimators. Second, we extend the Bayesian estimator of Kundu et al. [27] that were restricted to Gamma priors and independent risks. We implement five prior distributions (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy) and dependent risks. Our simulations show that the Bayesian estimators for the copula-based dependent risks model yield superior performance than these existing methods. Finally, two real datasets are analyzed for illustration. Thus, our paper is the comprehensive extension and improvement of the previous papers, Kundu et al. [27] and Michimae and Emura [31].
The structure of the paper is as follows. Section 2 briefly introduces the left-truncated competing risks setting. Section 3 proposes two Bayesian estimators under the independence model and copula-based model, respectively. Five different prior distributions are also suggested in this section. Section 4 presents Monte Carlo simulation results, comparing the two Bayesian estimators and the five prior distributions. Section 5 analyzes two datasets. Section 6 concludes with discussions.
2. Backgrounds
We review left-truncated competing risks data that were introduced by Kundu et al. [27].
Figure 1 shows a schematic description for an observational clinical study with follow-up. The study begins at a calendar time point, where subjects who experience disease onsets are recruited for further follow-up. During the study, a recruited subject after the starting date may experience one of two failure events, Event 1 (the event of interest: Case 1) and Event 2 (the competing risk: Case 2). Subjects who experienced the disease onset and who failed (by either Event 1 or Event 2) before the recruitment time were not observable, leading to truncation of subjects (‘Not observed’). However, we can observe the subjects if they did not fail before the recruitment time (Cases 4–6). They may experience one of two failure events, Event 1 (Case 4) and Event 2 (Case 5) during the study. As the follow-up time is often limited, the recruited subjects may be censored at the end of the study without experiencing any event (Cases 3 and 6). This leads to left-truncated competing risks data with right-censoring.
Figure 1.
Left-truncated competing risks (Events 1 and 2), and onset of disease ( ) from a hypothetical observational clinical study. Any event occurring in the first duration (D1) is not observable. An event is observable only within the second duration (D2). T and indicate observed event times and truncation times.
Observed data from n subjects consist of , where , , , and will be defined below. Let be continuous failure times (measured from the time of disease onset) for Event 1 and Event 2, respectively, for a subject . Also, let be random right-censored time, which is assumed to be independent of . What we observe is an event time defined as a realized value of the first occurring event time . We also observe an event type , namely, for Event 1, for Event 2, or for censoring. We also observe a ‘left-truncation time’ (see dotted lines in Figure 1) for a subject who experienced the disease onset before starting the study. For such truncated subjects, we account for the truncation constraint since a subject has survived long enough. We observe a truncation indicator defined as
3. Proposed Bayesian methods
First, the Bayesian estimators for the independent competing risks model will be proposed using different prior distributions from the independent risks model of Kundu et al. [27]. Second, the copula-based Bayesian estimators will be proposed as a general dependent case.
3.1. Bayesian estimators for independent competing risks model
Let be the density of and be the density of ; the corresponding survival functions are and , where , is the vector of parameters. The likelihood of an individual sample is divided into the following cases (see Figure 1) according to the observed events and truncation status (Kundu et al. [27]):
where is a set of censored observations, and I1 and I2 are sets of failure observations due to Event 1 and Event 2, respectively. That is,
Combining the likelihoods of Cases 1–6, the likelihood function is given as follows:
Under the Weibull survival function , Kundu et al. [27] used the following likelihood function for estimating the parameters:
Kundu et al. [27] suggested using Gamma prior distributions for the two Weibull parameters; , , and , where all the Gamma parameters are positive. Under the Gamma priors, the joint prior is .
Since the choice of the gamma priors is somewhat ad-hoc, we also considered different priors not considered in Kundu et al. [27]. Specifically, we considered the Inverse-Gamma prior, the Uniform prior, half Normal prior and half Cauchy prior in addition to the Gamma prior. In the five priors, hyperparameters were chosen for the prior distributions to be uninformative. Table S1 gives the details of the prior specifications, including the hyperparameter values of the prior distributions.
Given the observed data and prior distributions, the joint posterior density becomes
Given the observed data, one obtains a Bayesian estimator defined as the posterior median under a given prior. We will call the estimator described above as the independent Bayesian estimator.
3.2. Bayesian estimators for copula-based dependent competing risks model
A copula gives a dependence structure in a bivariate distribution. Let and be cumulative distribution functions (CDFs). Then, a bivariate CDF can be defined as , where is a copula [34,13].
In this paper, we consider bivariate latent failure times whose marginal survival functions are and . Instead of the bivariate CDF, we adopt a bivariate survival function for dependent competing risks [16] as follows;
As in Escarela and Carriere [16], we separately parametrize the copula and marginal survival functions. For the copula, we adopt the bivariate Clayton copula due to its simplicity and popularity, defined by
where is a dependence parameter.
As in the independent competing risks model, the likelihood of an individual sample is divided into the following cases (see Figure 1) according to the observed events and truncation status:
Here, , , and for the case of the Weibull model.
Combining the likelihoods of Cases 1–6, the copula-based likelihood function is given as follows:
For the marginal survival functions, we set Weibull distributions . The Weibull distribution is perhaps the most standard and convenient parametric model for analyzing biomedical survival data [37] and reliability engineering data [17,47]. Nonetheless, the Weibull distribution can be flexibly changed to other distributions under the copula model. Following Michimae and Emura [31] who dealt with left-truncation and copula-based competing risks, the corresponding likelihood function is
where , , and .
Similar to the independent risks model, we propose five priors (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy) as shown in Table S1. Under the copula-based dependent risks model, the joint posterior density is
We set the improper Uniform prior distribution for the copula parameter . Given the observed data, one obtains a Bayesian estimator as the posterior median with respect to the posterior density. We will call it the copula-based Bayesian estimator.
3.3. Computation of proposed Bayesian estimators
Let be a prior density function for unknown parameters . Irrespective of the independence or dependence model, the posterior density is
The posterior median is the Bayes estimator, where for the independent risks model while for the copula-based dependent risks model. The forms of are as stated above. Here, the two main issues are the choice of the prior density and computation methods for .
For computing with complex posterior distributions such as , analytical formulas for the posterior medians through conjugacy may not exist. A simulation-based approach to compute the posterior medians through MCMC (Markov Chain Monte Carlo) algorithms is valid even in such a case. The Stan software [41,40] uses the No-U-Turn Sampler [22], an improved type of MCMC algorithm known as the Hamiltonian Monte Carlo (HMC), which can efficiently sample from complex posterior distributions with correlated parameters. We adopt the posterior median as the point estimate, and the posterior 2.5% and 97.5% points as the 95% credible interval (95% CI).
We adopted the Stan software to compute the posterior distribution by inputting the formulas of and . We set the number of Markov chains to 4, the number of iterations for each chain to 4000, and the number of worm-up iterations per chain to 2000 for both the simulation (Section 4) and data analysis (Section 5). We checked ‘Rhat’ criterion in Stan, known as the Gelman-Rubin convergence diagnostic [21] for the successful convergence of the MCMC algorithm. As a rule of thumb, we employed for all parameters, which indicates that four chains have converged.
The issues of choosing the prior densities will be explored in the following sections. Some conjugate gamma prior densities were explored in the Bayesian estimator by Kundu et al. [27] under the independent risks model with the ‘common shape parameter’. However, their explanations for different shape parameters (uncommon parameters) were not sufficient and unclear. Since we do not assume the common shape parameter, we need to explore this issue under the independent risks model.
4. Simulation study
We conducted simulation studies to assess the performance of the proposed estimators (the independent Bayesian estimator and copula-based Bayesian estimator). The main purpose is to compare the independent Bayesian estimator and copula-based Bayesian estimator when the true model has independent competing risks or dependent competing risks. Another purpose is to see how different priors (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy) influence the performance of the Bayesian estimators. Supplementary Materials report another simulation study comparing the Bayes estimator with the maximum likelihood estimator, which confirm the superiority of the Bayes estimators.
4.1. Simulation designs
Table S2 shows eight scenarios (Scenarios 1–8) with different configurations of parameters.
For Scenarios 1–4, we set the first duration and the second duration (see Figure 1 for the definitions of and ). For Scenarios 5–8, we set and . In all the scenarios, we set the sample size n = 100 or 500. The truncation percentage was set to be 50% (50 truncated subjects occur in , and 50 untruncated subjects occur in for n = 100). We sampled a birth time for a truncated sample ; we also sampled a birth time for an untruncated sample . For , the truncation time was set by . For , we do not define . Censoring time was defined as .
In the case of independent risks, we sampled two independent latent failure times, and , under Weibull distributions with the shape parameter and scale parameter . In the case of dependent competing risks, we sampled two dependent Weibull failure times, and , form the Clayton copula model with parameter (Kendall’s tau = 0.5). We considered the common parameters across events: for Scenarios 1–2, for Scenarios 3–4, and for Scenarios 5–6, corresponding to constant, decreasing and increasing hazards, respectively. We also considered the case of the varying parameters across events: and for Scenarios 7 and 8.
Following the left-truncated competing risks setting, we used the first occurring event time , the event type , and the truncation indicator as observations to calculate the Weibull parameter estimators as well as the copula parameter estimator (for the case of the copula model) in 10 different ways: five independent Bayesian (five priors) and five copula-based Bayesian (five priors) estimators with the 95% CIs. Also, we calculated the estimators of the survival probabilities, , for .
1000 Monte Carlo repetitions (for n = 100) were performed to evaluate the performance of the estimators based on the median absolute errors (MAEs), and coverage probabilities of the 95%CI. For n = 100, the computing time ranged from 10,683 s (Scenario 4 for the independent model using the uniform prior) to 253,396 s (Scenario 1 for the copula-based model using the inverse-Gamma prior). For n = 500, we reduced the number of Monte Carlo repetitions from 1000 to 100; the computing time ranged from 4942 s (Scenario 2 for the independent model using the uniform prior) to 151,641 s (Scenario 2 for the copula-based model using the uniform prior).
4.2. Simulation results
In all the scenarios (Scenarios 1–8), the MAEs of all the estimators tend to decrease when the sample sizes increase (Tables S3–S10) and when the model is correctly specified. However, the comparative performance in terms of the MAEs depend largely on the true models (independent vs. dependent) and priors (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy).
4.2.1. Weibull parameters and survival probabilities
When the true model is the independent risks model (Scenarios 1, 3, 5 and 7), the independent Bayesian estimator showed all very small MAEs for the Weibull parameters (Tables S3–S6) and the survival probabilities (Tables S7–S10). This is a natural conclusion since the true model is correctly specified. Also, the MAEs are similar across different prior densities (Tables S3–S10). In summary, the independent Bayesian estimator gives very accurate estimates irrespective of the prior choices as long as the independent risks model is correct. However, this independent estimator exhibits unacceptable biases and inflated MAEs when the true model is the copula-based dependent risks model.
When the true model is the copula-based dependent risks model (Scenarios 2, 4, 6 and 8), the independent Bayesian estimator yielded large MAEs due to biases in estimation. In contrast, the copula-based Bayesian estimator shows smaller MAEs compared to the independent Bayesian estimator. Among the different priors (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy), the Uniform, the half Normal and the half Cauchy priors showed better performance than the Gamma and the Inverse-Gamma priors, especially in Scenarios 2, 4 and 6. However, in Scenario 8, there was no best estimator in terms of MAEs (Tables S6 and S10). Therefore, we conclude that the Bayes estimator with the improper Uniform, the half Normal or the half Cauchy prior provides the most accurate estimate for unknown Weibull parameters and survival probabilities.
4.2.2. Copula parameters
When the true model is the independent risks model (Scenarios 1, 3, 5 and 7), the Gamma and the Inverse-Gamma priors showed better performance than the Uniform, the half Normal and the half Cauchy priors, especially in Scenarios 1, 3 and 5. However, in the large sample size of Scenario 7, there was no better estimator in terms of MAEs (Table S6).
Contrary to the results, when the true model is the copula-based dependent risks model (Scenarios 2, 4, 6 and 8), the Uniform, the half Normal and the half Cauchy priors showed better performance than the Gamma and the Inverse-Gamma priors, in Scenarios 2, 4 and 6. However, in Scenario 8, the result was reversed. Therefore, we conclude that the best choice of the prior depends on the situations and cannot be uniquely determined for unknown copula parameters.
The conclusions from our simulations are clear-cut except for estimating copula parameters. When the true model has dependent risks, the copula-based Bayesian estimator with the Uniform, the half Normal and the half Cauchy priors showed better performance. When the true model has independent risks, the independent Bayes estimator showed similar performance. Therefore, we suggest the copula-based Bayesian estimator with the Uniform, the half Normal and the half Cauchy priors when the true competing risks are dependent. The independent estimators are recommended only when one can assure the assumption of the independent competing risks.
5. Data analysis
Two real data were analyzed to illustrate the proposed methods.
5.1. Analysis of electric power transformers data
We used the left-truncated competing risks dataset available in Appendix of Kundu et al. [27]. For convenience, the dataset is given in Supplementary Materials (reproduced from Appendix of Kundu et al. [27]). The dataset consists of n = 100 samples, which mimics 100 electric power transformers assumed to be installed from 1960 to 2008. The starting year of the study was assumed to be at 1980, and therefore the transformers installed between 1960 and 1980 were regarded as left-truncated ones. Accordingly, the 30 transformers were left-truncated , and other 70 transformers were untruncated . The 14 transformers were assumed to experience Event 1 , 33 transformers Event 2 , 53 transformers censoring . The dataset contains event times , truncation times , censoring indicators , and truncation indicators . As in Kundu et al. [27], we divided event times (in years) by 100 to scale the parameter estimates.
Our goal is to see how the proposed estimators produce different results from the benchmark estimates reported by Kundu et al. [27] who analyzed the same dataset under the Weibull distribution with independent competing risks. They calculated the independent Bayes estimates under the gamma prior. The last row of Table 1 quotes their estimates as the benchmark.
Table 1.
Estimated parameters (95%CI in parenthesis).
| Event 1 | Event 2 | Copula | |||
|---|---|---|---|---|---|
| Fitted method | λ1 (95%CI) | α1 (95%CI) | λ2 (95%CI) | α2 (95%CI) | ϕ (95%CI) |
| indep. Bayes | 6.18 | 2.74 | 14.96 | 2.75 | – |
| (Gamma prior) | (1.58, 27.60) | (1.71, 4.01) | (6.07, 39.02) | (2.04, 3.55) | |
| indep. Bayes | 6.13 | 2.74 | 15.02 | 2.76 | – |
| (Inverse-Gamma prior) | (1.51, 29.50) | (1.67, 4.05) | (5.91, 40.72) | (2.04, 3.58) | |
| indep. Bayes | 12.86 | 3.33 | 20.33 | 2.99 | – |
| (Uniform prior) | (3.10, 60.50) | (2.18, 4.69) | (7.88, 56.19) | (2.24, 3.85) | |
| indep. Bayes | 12.67 | 3.31 | 20.03 | 2.98 | – |
| (half Normal prior) | (2.79, 60.80) | (2.12, 4.68) | (7.71, 54.16) | (2.23, 3.82) | |
| indep. Bayes | 6.06 | 2.74 | 12.85 | 2.64 | – |
| (half Cauchy prior) | (1.93, 21.15) | (1.84, 3.81) | (5.29, 33.06) | (1.96, 3.40) | |
| copula. Bayes | 16.51 | 3.16 | 18.00 | 2.81 | 1.03 |
| (Gamma prior) | (1.50, 80.45) | (1.71, 4.47) | (6.46, 46.43) | (2.09, 3.60) | (−0.58, 19.78) |
| copula. Bayes | 16.45 | 3.16 | 17.69 | 2.81 | 0.97 |
| (Inverse-Gamma prior) | (1.49, 81.67) | (1.73, 4.48) | (6.09, 48.74) | (2.07, 3.63) | (−0.60, 21.39) |
| copula. Bayes | 49.02 | 3.88 | 31.58 | 3.19 | 2.93 |
| (Uniform prior) | (9.06, 167.42) | (2.82, 5.14) | (11.66, 79.2) | (2.46, 3.98) | (0.11, 27.47) |
| copula. Bayes | 48.26 | 3.86 | 30.28 | 3.18 | 2.89 |
| (half Normal prior) | (7.65, 156.51) | (2.70, 5.09) | (12.33, 75.18) | (2.48, 3.97) | (0.10, 23.19) |
| copula. Bayes | 9.11 | 2.91 | 13.96 | 2.66 | 0.48 |
| (half Cauchy prior) | (1.86, 44.72) | (1.88, 4.05) | (5.58, 35.64) | (2.00, 3.44) | (−0.60, 12.92) |
| indep. Bayes | 8.53 | 2.78 | 17.08 | 2.77 | – |
| quoted from Kundu et al. [27]* | (0.58, 22.79) | (1.65, 4.01) | (4.08, 34.85) | (1.99, 3.52) | |
*Highest posterior density credible interval is shown.
Table 1 shows the Bayesian estimates for the Weibull parameters and the copula parameter . The estimates of the independent Bayesian estimates by the Gamma and the inverse-Gamma priors are somewhat close to the same estimates reported by Kundu et al. [27]. However, the independent Bayes estimates under the uniform and the half Normal priors, and the copula-based estimates, especially under the uniform and the half normal priors, shows remarkably different results from those under the Gamma prior (Table 1 and Figure 2). This indicates the possibility of having different estimates by different model assumptions.
Figure 2.
Weibull parameter estimates (95% CI in parenthesis) for Event 1 (E1) and Event 2 (E2) and the estimates of the hazard functions (95% CI in shaded regions) using the left-truncated competing risks dataset provided by Kundu et al. [27].
A particularly notable result is that the estimates of copula parameter under the uniform and the half-normal priors and its lower limit of the 95% CI are greater than 0, which may indicate the positive correlation between Event 1 and Event 2.
5.2. Advanced lung cancer and idiopathic interstitial pneumonias data
We analyzed an advanced lung cancer and idiopathic interstitial pneumonias dataset, which was originally reported by Miyamoto et al. [33]. The study focused on the survival of patients with advanced-stage lung cancer, who received chemotherapy. The primary endpoint of interest is time at the occurrence of acute exacerbation (AE) of idiopathic interstitial pneumonia (IIP). The data was collected from consecutive patients from January 2012 to December 2013.
We set event times by:
T1: the time from the 1st line treatment to the date of death.
T2: time from the 1st line treatment to the occurrence date of the AE of IIP.
The starting date of the study was set at January 2013, and therefore the patients started the 1st line of therapy before the staring time have truncation times (truncated samples). However, the patients experienced any failure events before the starting date were completely missed out of the sampling protocol. Some of the patients were censored at December 2013. In short, the 572 patients were considered in this analysis, of which the 205 were truncated samples and the 367 were untruncated samples . Also, the 175 patients died , the 61 patients experienced the AE of IIP , and the 336 patients were censored . We scaled the failure times (in days) by dividing them by 1000 to avoid many decimal places in parameter estimates.
Table 2 compares the results of the estimates. The estimates by the independent Bayes estimators showed values close to each other (Table 2 and Figure 3). However, the estimates by the copula-based estimators take somewhat different values to each other. Especially, the copula-based Bayesian estimates of copula parameter by both the Uniform and half Normal priors considerably differ from other corresponding copula-based estimates (Table S8 and Figure 3), although both of the lower limits of the 95% CI are less than 0. It is likely that time to death and time to acute exacerbation are positively correlated because acute exacerbation in patients with IIP is recognized as a lethal event. Therefore, chemotherapy-related acute exacerbation may be a direct mortality cause in these patients [39,26]. Thus, it would be advantageous to adopt the copula-based Bayesian estimator, considering the superior performance of simulation results under dependent competing risk events.
Table 2.
Estimated parameters (95%CI in parenthesis).
| Death | AE* | ||||
|---|---|---|---|---|---|
| Fitted method | λ1 (95%CI) | α1 (95%CI) | λ2 (95%CI) | α2 (95%CI) | Copula ϕ (95%CI) |
| indep. Bayes | 2.58 | 1.32 | 0.80 | 0.51 | – |
| (Gamma prior) | (2.05, 3.21) | (1.16, 1.48) | (0.64, 1.00) | (0.36, 0.71) | |
| indep. Bayes | 2.58 | 1.32 | 0.80 | 0.51 | – |
| (Inverse-Gamma prior) | (2.05, 3.21) | (1.16, 1.48) | (0.64, 1.00) | (0.36, 0.71) | |
| indep. Bayes | 2.62 | 1.33 | 0.83 | 0.53 | – |
| (Uniform prior) | (2.10, 3.31) | (1.17, 1.51) | (0.66, 1.03) | (0.38, 0.74) | |
| indep. Bayes | 2.62 | 1.33 | 0.83 | 0.53 | – |
| (half Normal prior) | (2.10, 3.31) | (1.17, 1.51) | (0.66, 1.03) | (0.38, 0.74) | |
| indep. Bayes | 2.62 | 1.33 | 0.83 | 0.53 | – |
| (half Cauchy prior) | (2.08, 3.26) | (1.17, 1.49) | (0.65, 1.02) | (0.38, 0.73) | |
| copula. Bayes | 2.82 | 1.34 | 0.65 | 0.89 | 0.57 |
| (Gamma prior) | (2.07, 3.61) | (1.18, 1.51) | (0.36, 1.31) | (0.65, 1.18) | (−0.48, 2.48) |
| copula. Bayes | 2.86 | 1.33 | 0.69 | 0.92 | 0.71 |
| (Inverse-Gamma prior) | (2.05, 3.65) | (1.13, 1.51) | (0.37, 3.02) | (0.66, 1.29) | (−0.49, 54.58) |
| copula. Bayes | 3.02 | 1.32 | 0.98 | 1.05 | 1.49 |
| (Uniform prior) | (2.29, 3.74) | (1.11, 1.51) | (0.45, 3.45) | (0.74, 1.37) | (−0.17, 103.71) |
| copula. Bayes | 2.99 | 1.32 | 0.96 | 1.05 | 1.41 |
| (half Normal prior) | (2.26, 3.73) | (1.11, 1.51) | (0.44, 3.31) | (0.74, 1.35) | (−0.23, 71.79) |
| copula. Bayes | 2.97 | 1.34 | 0.83 | 0.99 | 1.09 |
| (half Cauchy prior) | (2.25, 3.74) | (1.13, 1.51) | (0.43, 3.17) | (0.71, 1.33) | (−0.21, 55.66) |
*Acute exacerbation of idiopathic interstitial pneumonia (IIP).
Figure 3.
Weibull parameter estimates (95%CI in parenthesis) for death and AE (Acute Exacerbation) using the lung cancer data.
6. Discussion
This paper proposes a copula-based Bayesian estimator for dependent competing risks data subject to left-truncation. With two dependent competing event times, the simulation results confirm that the proposed estimators were highly reliable and better than the independent Bayesian estimator (Tables S3–S10). Among the five non-informative priors (Gamma, Inverse-Gamma, Uniform, half Normal and half Cauchy), the Uniform, the half Normal and the half Cauchy priors give more accurate results in Weibull parameter estimates than the Gamma and Inverse-Gamma prior. However, this is not the case for the copula parameter estimates. On the other hand, the performance of the independent risks model does not depend on priors. However, the independence is a quite strong assumption since competing risks are often positively dependent in real applications (Section 5.2). Therefore, we generally recommend the copula-based Bayesian estimator.
We formulate the likelihood functions separately for the independent risks (Section 3.1) and dependent risks by the Clayton copula (Section 3.2). There are two reasons for this separate formulation. First, the Clayton copula does not strictly include the independence copula as a special case. The Clayton copula approaches to the independence copula when the parameter is close to zero, but it does not exactly equal to the independence copula. Second, the treatment of the unknown parameter in any parametric copula is crucial in the present paper. Hence, the Clayton copula model differs from the independence copula since the former produces an unknown parameter in the likelihood function.
We take up a few important issues to be addressed as future works. First, the present failure time distribution is restricted to the Weibull model. It is possible to extend the Weibull model to other parametric models, such as the log-normal, gamma, Lomax, and Gompertz models [5,14,31,32,36]. Second, it is interesting to estimate the effects of covariates on times to failures [23]. Third, the joint distribution of latent failure times is constructed by the Clayton copula only: many other copulas [34,13] could be tried, and a data-driven copula selection criterion could be considered. Finally, our implicit assumption of independent left-truncation could be modeled by copulas if the assumption is rejected by the quasi-independence tests [6,7,18,42]. The resultant new model makes the likelihood more complex by two copulas: one is for competing risks and the other for dependent truncation.
Supplementary Material
Acknowledgements
The authors thank two referees for their valuable suggestions that improved the paper.
Funding Statement
This work was financially supported by JSPS KAKENHI [JP21K12127, JP22K11948, 22K11948].
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- 1.Al-Wahsh H., Tangri N., Quinn R., Liu P., Ferguson T., Fiocco M., Lam N.N., Tonelli M., and Ravani P., Accounting for the competing risk of death to predict kidney failure in adults with stage 4 chronic kidney disease. JAMA Network Open 4 (2021), pp. e219225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Austin P.C., Lee D.S., and Fine J.P., Introduction to the analysis of survival data in the presence of competing risks. Circulation 133 (2016), pp. 601–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Balakrishnan N., and Mitra D., Some further issues concerning likelihood inference for left truncated and right censored lognormal data. Commun. Stat. Simul. Comput. 43 (2014), pp. 400–416. [Google Scholar]
- 4.Basu A.P., and Ghosh J.K., Identifiability of the multinormal and other distributions under competing risks model. J. Multivar. Anal. 8 (1978), pp. 413–429. [Google Scholar]
- 5.Bhattacharya S., Pradhan B., and Kundu D., Analysis of hybrid censored competing risks data. J. Theor. Appl. Stat. 48 (2014), pp. 1138–1154. [Google Scholar]
- 6.Chiou S.H., Austin M.D., Qian J., and Betensky R.A., Transformation model estimation of survival under dependent truncation and independent censoring. Stat. Methods Med. Res. 28 (2019), pp. 3785–3798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chiou S.H., Qian J., Mormino E., and Betensky R.A., Permutation tests for general dependent truncation. Comput. Stat. Data Anal. 128 (2018), pp. 308–324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chirila C., Odom D., Devercelli G., Khan S., Sherif B.N., Kaye J.A., Molnár I., and Sherrill B., Meta-analysis of the association between progression-free survival and overall survival in metastatic colorectal cancer. Int. J. Colorectal Dis. 27 (2012), pp. 623–634. [DOI] [PubMed] [Google Scholar]
- 9.Cox D.R., The analysis of exponentially distributed lifetimes with two types of failures. J. R. Stat. Soc. Ser. B 21 (1959), pp. 411–421. [Google Scholar]
- 10.de Uña-Álvareza J., and Noël Veraverbekeb N., Copula-graphic estimation with left-truncated and right-censored data. Statistics 54 (2017), pp. 387–403. [Google Scholar]
- 11.Dignam J.J., Zhang Q., and Kocherginsky M., The use and interpretation of competing risks regression models. Clin. Cancer Res. 18 (2012), pp. 2301–2308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Dörre A., Bayesian estimation of a lifetime distribution under double truncation caused by time-restricted data collection. Stat. Pap. 61 (2020), pp. 945–965. [Google Scholar]
- 13.Durante F., and Sempi C., Principles of Copula Theory, Vol. 474, CRC Press, Boca Raton, FL, 2016. [Google Scholar]
- 14.Emura T., and Michimae H., Left-truncated and right-censored field failure data: Review of parametric analysis for reliability. Qual. Reliab. Eng. Int. 38 (2022), pp. 3919–3934. [Google Scholar]
- 15.Emura T., Nakatochi M., Murotani K., and Rondeau V., A joint frailty-copula model between tumour progression and death for meta-analysis. Stat. Methods Med. Res. 26 (2017), pp. 2649–2666. [DOI] [PubMed] [Google Scholar]
- 16.Escarela G., and Carriere J.F., Fitting competing risks with an assumed copula. Stat. Methods Med. Res. 12 (2003), pp. 333–349. [DOI] [PubMed] [Google Scholar]
- 17.Fan T.H., Wang Y.F., and Ju S.K., A competing risks model with multiply censored reliability data under multivariate Weibull distributions. IEEE Trans. Reliab. 68 (2019), pp. 462–475. [Google Scholar]
- 18.Fernández T., Xu W., Ditzhaus M., and Gretton A., A kernel test for quasi-independence. Adv. Neural Inf. Process Syst. 33 (2020), pp. 15326–15337. [Google Scholar]
- 19.Fine J.P., and Gray R.J., A proportional hazards model for the subdistribution of a competing risk. J. Am. Stat. Assoc. 94 (1999), pp. 496–509. [Google Scholar]
- 20.Forsythe A., Chandiwana D., Barth J., Thabane M., and Baeck J., Tremblay, progression-free survival/time to progression as a potential surrogate for overall survival in HR+, HER2 – metastatic breast cancer. Breast Cancer Targets Ther. 10 (2018), pp. 69–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gelman A., and Rubin D.B., Inference from iterative simulation using multiple sequences. Stat. Sci. 7 (1992), pp. 457–511. [Google Scholar]
- 22.Hoffman M.D., and Gelman A., The no-U-turn sampler: Adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 15 (2014), pp. 1593–1623. [Google Scholar]
- 23.Hong Y., Meeker W.Q., and McCalley J.D., Prediction of remaining life of power transformers based on left truncated and right censored lifetime data. Ann. Appl. Stat. 3 (2009), pp. 857–879. [Google Scholar]
- 24.Jiang W., Ye Z., and Zhao X., Reliability estimation from left-truncated and right-censored data using splines. Stat. Sin. 30 (2020), pp. 845–875. [Google Scholar]
- 25.Klein J.P., and Moeschberger M.L., Survival Analysis: Techniques for Censored and Truncated Data, Springer, New York, 2003. [Google Scholar]
- 26.Kondoh Y., Taniguchi H., Ebina M., Azuma A., Ogura T., Taguchi Y., Suga M., Takahashi H., Nakata K., Sugiyama Y., Kudoh S., and Nukiwa T., Risk factors for acute exacerbation of idiopathic pulmonary fibrosis-extended analysis of pirfenidone trial in Japan. Respir. Investig. 53 (2015), pp. 271–278. [DOI] [PubMed] [Google Scholar]
- 27.Kundu D., Mitra D., and Ganguly A., Analysis of left truncated and right censored competing risks data. Comp. Stat. Data Anal. 108 (2017), pp. 12–26. [Google Scholar]
- 28.Lee D.W., Jang M.J., Lee K.H., Cho E.J., Lee J.H., Yu S.J., Kim Y.J., Yoon J.H., Kim T.Y., Han W., Oh D.Y., Lm S.A., and Kim T.Y., TTP as a surrogate endpoint in advanced hepatocellular carcinoma treated with molecular targeted therapy: Meta-analysis of randomised controlled trials. Br. J. Cancer 115 (2016), pp. 1201–1205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li Q., Li D., Huang B., Jiang Z., Mingcheng E., and Ma J., Failure analysis for truncated and fully censored lifetime data with a hierarchical grid algorithm. IEEE. Access 8 (2020), pp. 34468–34480. [Google Scholar]
- 30.Liu X., Planning of accelerated life tests with dependent failure modes based on a gamma frailty model. Technometrics 54 (2012), pp. 398–409. [Google Scholar]
- 31.Michimae H., and Emura T., Likelihood inference for copula models based on left-truncated and competing risks data from field studies. Mathematics 10 (2022), pp. 2163. [Google Scholar]
- 32.Mitra D., Kundu D., and Balakrishnan N., Likelihood analysis and stochastic EM algorithm for left truncated right censored data and associated model selection from the Lehmann family of life distributions. Jpn. J. Stat. Data Sci. 4 (2021), pp. 1019–1048. [Google Scholar]
- 33.Miyamoto A., Michimae H., Nakahara Y., Akagawa K., Nakagawa K., Minegishi Y., Ogura T., Hontsu S., Date H., Takahashi K., Homma S., and Kishi K., Chemotherapy versus best supportive care in advanced lung cancer and idiopathic interstitial pneumonias: A retrospective multicentre cohort study. Respir. Investig. 37 (2023), pp. 284–295. doi: 10.1016/j.resinv.2022.10.009. [DOI] [PubMed] [Google Scholar]
- 34.Nelsen R.B., An Introduction to Copulas, 2nd ed., Springer, New York, 2006. [Google Scholar]
- 35.Prentice R.L., Kalbfleish J.D., Peterson Jr A.V., Flurnoy N., Farewell V.T., and Breslow N.E., The analysis of failure times in presence of competing risks. Biometrics 34 (1978), pp. 541–554. [PubMed] [Google Scholar]
- 36.Ranjan R., Sen R., and Upadhyay S.K., Bayes analysis of some important lifetime models using MCMC based approaches when the observations are left truncated and right censored. Reliab. Eng. Syst. Saf. 214 (2021), pp. 107747. [Google Scholar]
- 37.Schneider S., Demarqui F.N., Colosimo E.A., and Mayrink V.D., An approach to model clustered survival data with dependent censoring. Biom. J. 62 (2020), pp. 157–174. [DOI] [PubMed] [Google Scholar]
- 38.Schuster N.A., Hoogendijka E.O., Koka A.A.L., Twiska J.W.R., and Heymansa M.W., Ignoring competing events in the analysis of survival data may lead to biased results: A nonmathematical illustration of competing risk analysis. J. Clin. Epidemiol. 122 (2020), pp. 42–48. [DOI] [PubMed] [Google Scholar]
- 39.Song J.W., Hong S.B., Lim C.M., Koh Y., and Kim D.S., Acute exacerbation of idiopathic pulmonary fibrosis: Incidence, risk factors and outcome. Eur. Respir. J. 37 (2011), pp. 356–363. [DOI] [PubMed] [Google Scholar]
- 40.Stan Development Team , RStan: The R Interface to Stan. R package Version 2.17.3, 2018. Available at http://mc-stan.org.
- 41.Stan Development Team , Stan Modeling Language Users Guide and Reference Manual, 2017. Available at: https://mc-stan.org.
- 42.Tamura T., Ikegami M., Kanemasa Y., Yomota M., Furusawa A., Otani R., Saita C., Yonese I., Onishi T., Kobayashi H., and Akiyama T., Selection bias due to delayed comprehensive genomic profiling in Japan. Cancer Sci. 114 (2023), pp. 1015–1025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wang L., Lio Y., Tripathi Y.M., Dey S., and Zhang F., Inference of dependent left-truncated and right-censored competing risks data from a general bivariate class of inverse exponentiated distributions. Statistics 56 (2022), pp. 347–374. [Google Scholar]
- 44.Wang L., Tripathi Y.M., Dey S., Zhang C., and Wu K., Analysis of dependent left-truncated and right-censored competing risks data with partially observed failure causes. Math. Comput. Simul. 194 (2022), pp. 285–307. [Google Scholar]
- 45.Weißbach R., and Dörre A., Retrospective sampling of survival data based on a Poisson birth process: Conditional maximum likelihood. Statistics 56 (2022), pp. 844–866. [Google Scholar]
- 46.Wu K., Wang L., Yan L., and Lio Y., Statistical inference of left truncated and right censored data from Marshall–Olkin bivariate Rayleigh distribution. Mathematics 9 (2021), pp. 2703. [Google Scholar]
- 47.Wu M., Shi Y., and Zhang C., Statistical analysis of dependent competing risks model in accelerated life testing under progressively hybrid censoring using copula function. Commun. Stat. Simul. Comput. 46 (2017), pp. 4004–4017. [Google Scholar]
- 48.Zheng M., and Klein J.P., A self-consistent estimator of marginal survival functions based on dependent competing risk data and an assumed copula. Commun. Stat. Theory Methods 23 (1994), pp. 2299–2311. [Google Scholar]
- 49.Zheng M., and Klein J.P., Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika 82 (1995), pp. 127–138. [Google Scholar]
- 50.Zhou Y., Lu Z., Shi Y., and Cheng K., The copula-based method for statistical analysis of step-stress accelerated life test with dependent competing failure modes. Proc. Inst. Mech. Eng. O J. Risk Reliab. 233 (2019), pp. 401–418. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





