Exploring Causal Effects of Hormone- and Radio-Treatments in an Observational Study of Breast Cancer Using Copula-Based Semi-Competing Risks Models

Tonghui Yu; Mengjiao Peng; Yifan Cui; Elynn Chen; Chixiang Chen

doi:10.1002/sim.70131

. Author manuscript; available in PMC: 2025 Sep 16.

Published in final edited form as: Stat Med. 2025 Jun;44(13-14):e70131. doi: 10.1002/sim.70131

Exploring Causal Effects of Hormone- and Radio-Treatments in an Observational Study of Breast Cancer Using Copula-Based Semi-Competing Risks Models

Tonghui Yu ¹, Mengjiao Peng ², Yifan Cui ³, Elynn Chen ⁴, Chixiang Chen ^5,⁶

PMCID: PMC12434651 NIHMSID: NIHMS2100360 PMID: 40462407

Abstract

Breast cancer patients may experience relapse or death after surgery during the follow-up period, leading to dependent censoring of relapse. This phenomenon, known as semi-competing risk, imposes challenges in analyzing treatment effects on breast cancer and necessitates advanced statistical tools for unbiased analysis. Despite progress in estimation and inference within semi-competing risks regression, its application to causal inference is still in its early stages. This article aims to propose a frequentist and semi-parametric framework based on copula models that can facilitate valid causal inference, net quantity estimation and interpretation, and sensitivity analysis for unmeasured factors under right-censored semi-competing risks data. We also propose novel procedures to enhance parameter estimation and its applicability in practice. After that, we apply the proposed framework to a breast cancer study and detect the time-varying causal effects of hormone- and radio-treatments on patients’ relapse and overall survival. Moreover, extensive numerical evaluations demonstrate the method’s feasibility, highlighting minimal estimation bias and reliable statistical inference.

Keywords: principle stratification, semi-competing risks, semiparametric copula models, sensitivity analysis, treatment effects

1 |. Introduction

1.1 |. Semi-Competing Risks Data

In clinical and biomedical studies, time-to-event endpoints often play a pivotal role, offering a tangible measure of the impact and effectiveness of medical interventions over time [1]. However, the analysis of time-to-event endpoints becomes intricate in the presence of semi-competing risks, a scenario frequently observed in various observational/clinical studies [2]. This paper focuses on studying the effects of hormone and radiotherapies in the treatment of breast cancer patients. While these two therapies are widely adopted in clinical practice to improve survival for patients with breast cancer, it remains unclear whether the combination of hormone and radiotherapies is more beneficial to patients after surgery than using hormone therapy alone [3,4]. To unbiasedly address this question while accounting for different time-to-event endpoints during breast cancer progression, we aim to investigate both relapse time (RFS, that is, the time from the initial diagnosis to the time of tumor loco-regional or distant relapse) and overall survival (OS) and develop a valid causal inference tool to assess the effects of hormone and radiotherapies on these semi-competing risks.

Formally, semi-competing risks survival times encompass both the time to a non-terminal event (e.g., RFS) and the time to a terminal event (e.g., OS) from the study entry, denoted by $T_{1}$ and $T_{2}$ , respectively. The pair $(T_{1}, T_{2})$ can be conceptualized as a bivariate failure time, where $T_{2}$ acts as a censoring variable for $T_{1}$ , but the reverse is not true- $T_{1}$ does not censor $T_{2}$ . Unlike the classic competing risk setting [5, 6] focusing on the first event and ignoring the information after the non-terminal event, the two endpoints in the semi-competing risk setting focus on both outcomes in a sequential manner and interplay between these outcomes [7]. In our application, we are particularly interested in the time to cancer relapse and time to death, where the latter event will truncate the former event, but the reverse is not true. Both events are highly valued by patients and their caregivers. Understanding how factors influence both events and how two events interact would inform clinical decisions and promote dynamic predictions on the terminal event [8]. The above motivation underscores the necessity for specialized statistical methods that appropriately account for the interplay between these two distinct time-to-event processes in the analysis of semi-competing risks.

1.2 |. Semi-Parametric Regression With Semi-Competing Risks

Semi-parametric regressions within the framework of semi-competing risks are of importance and can be divided into two distinct categories: Crude quantities and net quantities [7]. Concerning crude quantities, the sub-distribution approach and multi-state approach (specifically the so-called illness-death approach) are two kinds of typical means to handle survival data subject to dependently censored data. The sub-distribution approach [5, 9] evaluates the probability of each event occurring as the first one. Accordingly, it treats semi-competing risks akin to conventional competing risks. The illness-death approach [10] encompasses two cause-specific hazard models, quantifying the impact of covariates on the occurrence of either the terminal or non-terminal event, along with a Markov model that characterizes the progression from the non-terminal to the terminal event. However, both approaches focus on observable quantities rather than the marginal distributions of specific event times. According to the bounds established by Peterson [11], the marginal distribution of $T_{1}$ is bounded from below by the sub-distribution function of the non-terminal event and from above by the distribution of the first event occurring. Such wide bounds suggest that predicting the non-terminal event time based solely on cause-specific hazards in the illness-death model could lead to biases in the case of heavily dependent censoring. Moreover, such shared frailty models encounter challenges in effectively describing the negative correlation between the two event times [7].

The net-quantity-based approaches concern primarily the “net” effects of potential covariates on the marginal distribution of non-terminal and terminal event times. The marginal distribution in the absence of external influences, that is, hypothesizing the removal of the terminal event, may hold more direct clinical implications [7]. Marginal methodologies include joint models specified through copulas [12–14] or two separate marginal models augmented with artificial censoring techniques [15, 16]. The latter approaches concentrate on estimating regression coefficients in marginal models of $T_{1}$ and $T_{2}$ . They, however, fail to provide insights into the dependence between the non-terminal and terminal events. Additionally, the marginal distribution of the non-terminal event time faces an issue of unidentifiability without making substantial additional assumptions. Therefore, copula-based models may offer a more meaningful framework, given their capacity to simultaneously explore marginal distributions and quantify the association between the two event times [17–20].

1.3 |. Causal Inference Under Semi-Competing Risks

Several efforts have been made for causal interpretation under survival data with competing risks in the literature [6, 21, 22]. The average treatment effects on the event of interest in the competing risk setting [6] is constructed based on “marginal” risk; however, it puts aside the causal effects on both events when they occur sequentially. In practice, treatment can influence both non-terminal and terminal events, and these two events are inter-connected in complex ways. Jointly modeling these dual events provides a more accurate representation of this complexity [23] However, the application of semi-competing risks to causal inference is still in its early stages. Our focus in this article is to study the treatment effect on dual survival outcomes, that is, relapse and death times. A notable challenge arises when comparing outcomes between treated and untreated groups after treatment initiation, where the studied subjects may experience death before reaching the target event, and the death rates in the treated and control groups may provide informative insights. Consequently, the conventional average causal effect becomes neither well-defined nor easily interpretable in a causal context, except under additional assumptions [23, 24]. Robins [25] showed the identifiability of average causal effect when censoring time is always observed, and without distinguishing between censoring due to death or other reasons.

The existing causal estimands tailored to semi-competing risks data can be broadly categorized into several classes. In terms of methodological approaches, bayesian methods include works such as [26, 27], while frequentist methods include [23, 28–31]. In terms of causal frameworks, mediation analysis and Survivor Average Causal Effect (SACE) are two active topics in recent years. The mediation analysis, which regards the time to relapse as a time-varying mediator and aims to estimate direct and indirect causal effects, has been studied in References [28–31]. On the other hand, SACE quantifies the causal effect within the stratum of individuals who would have survived under both treatment values [32], and has been explored in References [23, 26, 27] for semi-competing risks data. Among these, Comment et al. [26] and Xu et al. [27] focused on causal effects for time-varying populations and estimated SACE via Bayesian methods. Under fixed population partitions, Nevo and Gorfine [23] further investigated its non-parametric partial identifiability and proposed a sensitivity analysis using frailty-based illness-death models.

Despite advancements, the frailty-based illness-death approach restricts its applicability to datasets where the association between non-terminal and terminal events is strictly non-negative. Notably, flipping the sign of association patterns can lead to invalid causal inferences for semi-competing risk data. This limitation motivates the integration of copula-based models into causal inference frameworks for semi-competing risks. As outlined earlier, copula-based models offer advantages in accommodating both negative and positive associations between the two event times, estimating net quantities, and supporting clinically meaningful interpretations. In this article, we propose and evaluate a copula-based causal inference framework for semi-competing risks, an area that remains understudied in existing literature, and apply it to analyze our breast cancer data. The goal is to enable valid causal interpretation, estimate net causal effects, and perform sensitivity analyses for unmeasured confounding in the context of semi-competing risks. Compared to frailty-based illness-death models [23], the proposed method offers greater flexibility in modeling diverse associations between events, facilitates more intuitive interpretation of covariate effects, and demonstrates improved robustness to model misspecification in numerical studies.

The rest of this article is organized as follows. Section 2 introduces basic notations for semi-competing risks data. Section 3 introduces the causal framework. Section 4 summarizes estimation methods and provides a sensitivity analysis for unmeasured factors. Section 5 describes the analysis for breast cancer data. Section 6 displays extensive simulation studies to illustrate the proposed causal framework. Section 7 provides discussions and future extensions.

2 |. Copula-Based Models for Bivariate Survival Function

In line with Sklar’s theorem [33], we establish a functional relationship between the bivariate survival function of $(T_{1}, T_{2})$ and their respective marginal distributions as follows:

\Pr (T_{1} \geq t_{1}, T_{2} \geq t_{2} ∣ Z) = C \{S_{1} (t_{1} ∣ Z), S_{2} (t_{2} ∣ Z); α\}, 0 \leq t_{1} \leq t_{2}

(1)

where $S_{k} (t ∣ Z)$ is the survival functions of $T_{k}$ conditional on the covariate vector $Z$ , for $k = 1, 2$ , and $C {u, v; α}$ denotes a specified copula function modeling inter-dependence between two conditional survival distributions with an unknown association parameter $α$ . This copula-based joint distribution is defined within the upper wedge where $T_{1} \leq T_{2}$ . In contrast to models based solely on observable quantities (i.e., whichever event occurs first), the marginal models for non-terminal and terminal events-detailed in later sections-allow for the evaluation of net covariate effects. Specifically, they estimate the effect on the non-terminal event time $T_{1}$ , independent of or adjusted for the influence of the terminal event $T_{2}$ . The “net” covariate effects hold notable clinical implications from the causal perspective in biomedical studies [16, 34]. In this article, we consider the Archimedean copula, which can be expressed as

C {u, v; α} = ϕ_{α}^{- 1} \{ϕ_{α} (u) + ϕ_{α} (v)\}, 0 \leq u, v \leq 1

(2)

where the generator function $ϕ_{α} : [0, 1] \to [0, \infty)$ is continuous and strictly decreasing from $ϕ_{α} (0) > 0$ to $ϕ_{α} (1) = 0$ . Consequently, Kendall’s $τ$ , a widely recognized metric for assessing the association between the two survival times, can be expressed as [35]

τ = 4 \int_{0}^{1} ϕ_{α} (u) / ϕ_{α}^{'} (u) d u + 1

(3)

Upon the above models (1–3), we consider a sample of $n$ individuals. Let $\{T_{i 1}, T_{i 2}, C_{i}, Z_{i}\}$ , be $n$ independent copies of $\{T_{1}, T_{2}, C, Z\}$ , where $C_{i}$ is censoring time such that $T_{i 1}$ and $T_{i 2}$ are observed as $(X_{i}, Y_{i}, δ_{i 1}, δ_{i 2})$ . Here, $X_{i} = \min (T_{i 1}, T_{i 2}, C_{i})$ and $Y_{i} = \min (T_{i 2}, C_{i})$ , $δ_{i 1} = I (X_{i} = T_{i 1})$ and $δ_{i 2} = I (Y_{i} = T_{i 2})$ .

The implementation of copula-based estimation techniques poses challenges due to their intricate nature. These approaches are often perceived as sophisticated and not extensively explored, particularly in the context of semi-competing risk causal inference. The remainder of this article aims to fill the gap and investigate the feasibility and potential advantages of applying the net-quantity-based semi-parametric framework to the field of causal inference. We begin by delving into the causal parameter specific to the context of semi-competing risks data. The estimation of regression parameters is elaborated upon in Section 4.

3 |. Treatment Effects

3.1 |. Observed Data and Potential Outcomes

In clinical and biomedical research, investigators are often interested in evaluating the causal impact of treatments on both slowing the advancement of disease and extending overall survival following an intermediate event. To address these questions, we employ the potential outcomes framework [36, 37] to define causal effects. The data is then distinguished as observed data and potential outcomes.

Observed data.

In addition to $(T_{i 1}, T_{i 2})$ and their observations $(X_{i}, Y_{i}, δ_{i 1}, δ_{i 2})$ defined in the previous section, there exists a treatment variable $A_{i}$ , indicating whether individual $i$ receive a treatment or not. It is assumed that $Z_{i}$ , are pre-treatment covariates unaffected by the treatment assignment. Consequently, the observed data is denoted as $O_{i} = \{X_{i}, Y_{i}, δ_{i 1}, δ_{i 2}, A_{i}, Z_{i}\}$ , $i = 1, \dots, n$ .

Potential outcomes.

Before defining the potential outcomes, we first take into account the stable unit treatment value assumption (SUTVA) [38]:

Assumption 1.

There is no interference between patients, and each treatment value corresponds to a single and consistent outcome, without the existence of multiple variations.

Let $(T_{i 1} (a), T_{i 2} (a))$ be the potential time-to-event outcomes when the individual $i$ is subjected to treatment $A = a$ and $a = 0$ or $1$ . This establishes pairs of potential outcomes $\{(T_{i 1} (0), T_{i 2} (0)), (T_{i 1} (1), T_{i 2} (1))\}$ for each individual. The causal effect pertains to the comparison between potential outcomes for either $T_{1}$ or $T_{2}$ within the same group of subjects, each under the influence of two competing treatments. We notice here that, within the framework of causal inference, at most one of the potential outcomes can be observed for each individual, while the others remain unobserved. The SUTVA suggests that the observed event times $T_{1}$ and $T_{2}$ can be represented as linear combinations of the potential outcomes under different treatments, specifically $T_{1} = T_{1} (1) A + T_{1} (0) (1 - A)$ and $T_{2} = T_{2} (1) A + T_{2} (0) (1 - A)$ . Similarly, in the presence of censoring, denoted by $C$ , the underlying potential outcomes $(T_{i 1} (a), T_{i 2} (a))$ could be observed in the form of $(X_{i} (a), Y_{i} (a))$ accompanied by the corresponding censoring indicators $(δ_{i 1} (a), δ_{i 2} (a))$ . For further defining causal estimands, we adopt the no-unmeasured confounding assumption, which is often imposed in the literature:

Assumption 2.

(i) (Conditional randomization) $A ⊥ (T_{1} (a)$ , $T_{2} (a), C) ∣ Z$ and (ii) (Non-informative censoring) $C ⊥ (T_{1} (a), T_{2} (a)) ∣ Z$ .

We notice here that Assumption 2ii is less restrictive than it initially appears. Given the fact that the non-terminal event time is subject to either terminal event time or random censoring $C$ , as a result, informative censoring is applied to the non-terminal event, while non-informative censoring is applied to the terminal event.

3.2 |. Causal Estimands

An appropriate causal estimand should involve a valid causal interpretation, ensuring comparable individual profiles between treated and control groups throughout the time course [23]. One popular approach to achieving this is by employing the concept of principal stratification [32, 39–41]. Specifically, to account for censoring, the stratum-specific survivor causal effect (SCE) of treatment on the terminal event time is defined by the comparison of potential outcomes of $T_{2}$ among individuals who would have survived regardless of treatment exposure [42]. However, in the semi-competing risks setting, the presence of a non-terminal event and the sequential ordering of the two events complicate causal inference. The treatment may delay the occurrence of the non-terminal event, which in turn postpones the terminal event. Therefore, it is more appropriate to evaluate causal effects on $T_{2}$ while accounting for the non-terminal event time. Additionally, $E (T_{1} (1) - T_{1} (0))$ is not well-defined in the semi-competing risks framework, as some individuals may die without experiencing the non-terminal event. To account for dependence between the two event times, as described by Nevo and Gorfine [23], the entire population can be categorized into four strata: (1) the “always-diseased” stratum, where individuals experience cancer relapse before death regardless of treatment $(T_{1} (0) \leq T_{2} (0), T_{1} (1) \leq T_{2} (1))$ ; (2) the “never-diseased” stratum, where individuals never experience cancer relapse under either treatment condition $(T_{1} (0) > T_{2} (0), T_{1} (1) > T_{2} (1))$ ; (3) the “harmed”, where treatment $(a = 1)$ accelerates cancer relapse $(T_{1} (0) > T_{2} (0), T_{1} (1) \leq T_{2} (1))$ ; and (4) the “protected”, where the treatment $(a = 1)$ prevents cancer relapse $(T_{1} (0) \leq T_{2} (0), T_{1} (1) > T_{2} (1))$ . The opposite effects in the latter two strata complicate result interpretation and may not align with the clinical relevance of treatment decisions. Therefore, excluding the “harmed” and “protected” strata renders the remaining causal estimands identifiable, interpretable, and applicable to real-world decision-making [23]. Given the considerable interest in the survival probability of non-terminal events in clinical studies, we primarily focus on the following causal estimand:

AD-SC E_{1} (t) = \Pr (T_{1} (1) \geq t ∣ T_{1} (0) \leq T_{2} (0), T_{1} (1) \leq T_{2} (1)) - \Pr (T_{1} (0) \geq t ∣ T_{1} (0) \leq T_{2} (0), T_{1} (1) \leq T_{2} (1))

(4)

Herein, we study the time-varying SCE estimands in Equation (4) rather than its integration version (or restricted survival mean) defined in Nevo and Gorfine [23] because estimators of survival means may lead to inevitably inaccurate and unsatisfactory results caused by point predictions in many finite-sample numerical examples [43]. The goal of this article is to explore varying (dynamic) treatment effects over time. In addition, since the terminal event could occur after or before the non-terminal event, we consider two types of stratum-specific SCE for the terminal event:

AD - {SCE}_{2} (t) = \Pr (T_{2} (1) \geq t ∣ T_{1} (0) \leq T_{2} (0), T_{1} (1) \leq T_{2} (1)) - \Pr (T_{2} (0) \geq t ∣ T_{1} (0) \leq T_{2} (0), T_{1} (1) \leq T_{2} (1))

(5)

ND - {SCE}_{2} (t) = \Pr (T_{2} (1) \geq t ∣ T_{1} (0) > T_{2} (0), T_{1} (1) > T_{2} (1)) - \Pr (T_{2} (0) \geq t ∣ T_{1} (0) > T_{2} (0), T_{1} (1) > T_{2} (1))

(6)

We remark here that the estimands above are subject to the joint distribution of event times from both worlds, that is, $a = 0, 1$ , which cannot be observed simultaneously. To facilitate non-parametric identification, we introduce a combined covariate vector $Z$ to describe the alterations in the occurrences of both event times, and impose the following assumption.

Assumption 3.

The cross-world dependence is captured by pre-treatment covariates $Z$ in the sense that $(T_{1} (0), T_{2} (0)) ⊥ (T_{1} (1), T_{2} (1)) ∣ Z$ .

We notice here that the cross-world conditional independence in Assumption 3 is strong and untestable in practice [44–46]. More discussion about cross-world unmeasured factors and a weaker assumption can be found in Section 4.2. Under Assumptions 1-3, we establish the identification of causal estimands of our interest in Proposition 1, with proof provided in the Supplementary Section 1. The above estimands (4–6) can alternatively be defined across observable strata, such as ${X (0) \leq Y (0), X (1) \leq Y (1)}$ instead of the “AD” stratum, and ${X (0) \geq Y (0), X (1) \geq Y (1)}$ instead of the “ND” stratum, which can be verified that these two types of definitions are equivalent under non-informative censoring in Assumption 2.

Proposition 1 (Non-parametric identification).

Under Assumptions 1 - 3, the stratum-specific survivor average causal effects in Equations (4–6) are identified by

E_{Z} [(S_{k ∣ T_{1} \leq T_{2}, A = 1, Z} (t) AD - {SCE}_{k} (t) = \frac{- S_{k ∣ T_{1} \leq T_{2}, A = 0, Z} (t)) Π_{A = 0, Z} Π_{A = 1, Z}]}{E_{Z} [Π_{A = 0, Z} Π_{A = 1, Z}]}, k = 1, 2

(7)

and

E_{Z} [(S_{2 ∣ T_{1} \geq T_{2}, A = 1, Z} (t) ND - SC E_{2} (t) = \frac{- S_{2 ∣ T_{1} \geq T_{2}, A = 0, Z} (t)) {\tilde{Π}}_{A = 0, Z} {\tilde{Π}}_{A = 1, Z}]}{E_{Z} [{\tilde{Π}}_{A = 0, Z} {\tilde{Π}}_{A = 1, Z}]}, t > 0

(8)

where $S_{k ∣ Q} (t) = \Pr (T_{k} \geq t ∣ Q)$ , $Π (Q) = \Pr (T_{1} \leq T_{2} ∣ Q)$ , $\tilde{Π} (Q) = \Pr (T_{1} \geq T_{2} ∣ Q)$ for any event $Q$ .

4 |. Identification and Estimation of Principal Causal Effects

Proposition 1 justifies the validity and identifiability of the proposed estimands in general. In this section, we detail how identifiability is maintained when applying copula-based techniques. The estimation of copula-based semicompeting risks regression has been extensively explored by existing literature. Nevertheless, adapting this framework to address causal inference involves significant adjustments and remains inadequately studied, where the literature either exhibits bias in cases of heavily dependent censoring or encounters challenges in effectively portraying the negative correlation between the two event times [7,11]. Herein, we provide an improved procedure to estimate parameters in the copula-based causal model.

4.1 |. Identification Based on Stringent Assumption

We first consider a stringent assumption as follows:

Assumption 4.

Conditional on covariates $Z$ , the joint distribution function of $(T_{1} (a), T_{2} (a))$ follows a copula model

D (t_{1}, t_{2} ∣ a, Z) = \Pr (T_{1} \geq t_{1}, T_{2} \geq t_{2} ∣ A = a, Z) = C^{(a)} \{S_{1} (t_{1} ∣ a, Z), S_{2} (t_{2} ∣ a, Z); α^{(a)}\}

(9)

where $0 \leq t_{1} \leq t_{2}, S_{k} (t ∣ a, Z)$ is the survival functions of $T_{k} (a) (k = 1, 2)$ given treatment and covariates $Z$ ; $C^{(a)} {u, v; α}$ is a pre-specified copula function with $α^{(a)}$ being a constant parameter describing extra association between event times under the treatment status $a$ .

It is noteworthy that Assumption 4 permits the utilization of various copula structures corresponding to different treatment assignments. Consequently, the proposed model exhibits greater flexibility compared to parametric frailty models found in the existing literature [23]. A weaker substitution of Assumption 4 will be discussed in Section 4.2.

We will now present the identification formulas within the framework of copulas. To ease presentation, we define $C_{1}^{(a)} (u, v, α) = \partial C^{(a)} (u, v; α) / \partial u$ , $C_{2}^{(a)} (u, v, α) = \partial C^{(a)} (u, v; α) / \partial v$ , $D {s, t ∣ a, Z} = C^{(a)} \{S_{1} (s ∣ a, Z), S_{2} (t ∣ a, Z); α^{(a)}\}$ , $D_{k} {s, t ∣ a, Z} = C_{k}^{(a)} \{S_{1} (s ∣ a, Z), S_{2} (t ∣ a, Z); α^{(a)}\}$ , $k = 1, 2$ . Then, under Assumptions 1-4, by simple derivation, the quantities in Proposition 1 can be rewritten as

S_{1 ∣ T_{1} \leq T_{2}, A = a, Z} (t) = \frac{\int_{t}^{\infty} D_{1} (s, s ∣ a, Z) d S_{1} (s ∣ a, Z)}{\int_{0}^{\infty} D_{1} (s, s ∣ a, Z) d S_{1} (s ∣ a, Z)}

(10)

S_{2 ∣ T_{1} \leq T_{2}, A = a, Z} (t) = \frac{S_{2} (t ∣ a, Z) + \int_{t}^{\infty} D_{2} (s, s ∣ a, Z) d S_{2} (s ∣ a, Z)}{1 + \int_{0}^{\infty} D_{2} (s, s ∣ a, Z) d S_{2} (s ∣ a, Z)}

(11)

S_{2 ∣ T_{1} > T_{2}, A = a, Z} (t) = \frac{\int_{t}^{\infty} D_{2} (s, s ∣ a, Z) d S_{2} (s ∣ a, Z)}{\int_{0}^{\infty} D_{2} (s, s ∣ a, Z) d S_{2} (s ∣ a, Z)}

(12)

The denominators in Equations (10–12) are the explicit expressions of $Π_{Q}$ and ${\tilde{Π}}_{Q}$ for event $Q$ . Quantities (10–12) use observable data in the whole upper wedge of $(T_{1}, T_{2})$ and thus can be estimable.

Plugging estimates of $α^{(a)}$ , $S_{1} (\cdot ∣ a, Z)$ , and $S_{2} (\cdot ∣ a, Z)$ assisted by methods in Peng and Fine [12] and Zhu et al. [47], estimates of causal effects (4–6) can be obtained from Equations (7, 8), (10–12). To facilitate the estimation, we limit $C^{(a)} {\cdot}$ to the Archimedean copula class, and for illustration, we use (but not limited to) the proportional hazards models as in Equation (13) to marginally fit $T_{1}$ and $T_{2}$ , respectively

λ_{k} (t ∣ Z, A = a) = λ_{0 k}^{(a)} (t) \exp (β_{k}^{(a) T} Z)

(13)

where $λ_{k} (t ∣ Z, A = a)$ is the conditional hazard function of $T_{k} (a)$ , $k = 1, 2$ , given the covariates $Z$ and treatment $A = a$ ; $β_{k}^{(a)}$ is a vector of unknown regression coefficients, and $λ_{0 k}^{(a)}$ is an unspecified baseline hazard function. Let $Λ_{0 k}^{(a)} (t) = \int_{0}^{t} λ_{0 k}^{(a)} (s) d s$ be the baseline cumulative hazard function. And the conditional survival functions are in the forms of $S_{k} (t ∣ Z, A) = \exp [- Λ_{0 k}^{(a)} (t) \exp (β_{k}^{(a) T} Z)]$ . We denote $C_{21} (u, v, α) = C_{12} (u, v, α) = \partial^{2} C (u, v; α) / \partial u \partial v$ and $D_{j k} \{X_{i}, Y_{i}; α^{(a)}\} = C_{j k} \{S_{1} (X_{i} ∣ Z_{i}, a, γ_{i}), S_{2} (Y_{i} ∣ Z_{i}, a, γ_{i}); α^{(a)}\}$ , $j$ , $k = 1, 2$ . Under models (9) and (13), the log-likelihood function concerning the parameters to be estimated given observed data $O = \{X_{i}, Y_{i}, δ_{i 1}, δ_{i 2}, A_{i}, Z_{i} : i = 1, \dots, n\}$ can be written as

\begin{array}{l} l (α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)}) \\ = \sum_{i = 1}^{n} (1 - δ_{i 1}) (1 - δ_{i 2}) I (A_{i} = a) \log [D \{X_{i}, Y_{i}; α^{(A_{i})}\}] \\ + δ_{i 1} δ_{i 2} I (A_{i} = a) \log [D_{12} \{X_{i}, Y_{i}; α^{(A_{i})}\}] \\ + δ_{i 1} (1 - δ_{i 2}) I (A_{i} = a) \log [D_{1} \{X_{i}, Y_{i}; α^{(A_{i})}\}] \\ + (1 - δ_{i 1}) δ_{i 2} I (A_{i} = a) \log [D_{2} \{X_{i}, Y_{i}; α^{(A_{i})}\}] \\ + δ_{i 1} I (A_{i} = a) \log [S_{1} (X_{i} ∣ Z_{i}, A_{i}) λ_{01}^{(A_{i})} (X_{i}) \exp (β_{1}^{(A_{i}) T} Z_{i})] \\ + δ_{i 2} I (A_{i} = a) \log [S_{2} (Y_{i} ∣ Z_{i}, A_{i}) λ_{02}^{(A_{i})} (Y_{i}) \exp (β_{2}^{(A_{i}) T} Z_{i})] \end{array}

These parameters can be estimated by adapting the non-parametric maximum likelihood estimation (NPMLE) [14, 48]. However, the objective function of the aforementioned NPMLE method is only locally convex in a small neighborhood of the truth and thus sensitive to initial estimates. Poor initialization will lead to biases in the final estimation or even algorithm convergence issues. Accordingly, we improve the above NPMLE method by substituting initial values with estimates derived from Peng et al. [14] and Zhu et al. [47], which have proven effective in simulation studies. Such adaptation and integration can effectively reduce the number of iterations and computation time as well as estimation biases. It is worth noting that the proposed estimation is not limited to the time-independent Cox proportional hazards model aforementioned. We can easily extend it to other model forms, including the proportional odds model and varying-coefficient models [14, 47].

4.2 |. Sensitivity Analysis

While Assumptions 3 and 4 could be valid in some applications, they may not hold—and are generally not testable—in broader contexts, as cross-world data cannot be simultaneously observed for any individual. In practice, we recommend incorporating expert knowledge to guide the identification of crucial covariates while concurrently assessing the sensitivity of estimates to unmeasured factors [23, 24]. To this end, we propose the copula-based sensitivity analysis. To capture the remaining dependency attributed to unmeasured factors shared across two worlds, we introduce a frailty $γ$ and a weaker restriction than Assumptions 3 and 4.

Assumption 5.

There exists an unmeasured time-unvarying frailty variable $γ > 0$ such that the cross-world dependence is captured by pre-treatment covariates $Z$ together with $γ$ in the sense that $(T_{1} (0), T_{2} (0)) ⊥ (T_{1} (1), T_{2} (1)) ∣ (γ, Z)$ .

Assumption 6.

The time-unvarying frailty variable $γ > 0$ , from a distribution with mean 1 and variance $σ$ , satisfies:

The frailty variable operates multiplicatively on conditional cumulative hazard function of $T_{k} (a)$ , that is, $Λ_{k} (t ∣ A = a, γ, Z) = γ {\tilde{Λ}}_{k} (t ∣ A = a, Z)$ , where $Λ_{k} (t ∣ A = a, γ, Z)$ is the conditional cumulative hazard function of $T_{k} (a)$ given treatment $a$ and $(γ, Z)$ , and ${\tilde{Λ}}_{k} (t ∣ A = a, Z)$ is irrelevant to $γ$ .
The conditional joint distribution function of $(T_{1} (a), T_{2} (a))$ , given $(γ, Z)$ , follows a copula model $D (t_{1}, t_{2} ∣ a, γ, Z) = C^{(a)} \{S_{1} (t_{1} ∣ a, γ, Z), S_{2} (t_{2} ∣ a, γ, Z); α^{(a)}\}$ where $S_{k} (t ∣ a, γ, Z) = \exp [- Λ_{k} (t ∣ a, γ, Z)]$ is the survival functions of $T_{k} (a) (k = 1, 2)$ given treatment and $(γ, Z)$ , the definitions of $α^{(a)}$ and $C^{(a)} {u, v; α}$ are provided in Assumption 4.

The frailty variable in Assumption 5 is commonly used in the survival analysis literature to explain unobserved heterogeneity due to unmeasured covariates or other sources of natural variation. It is also widely applied in many causal inference literature to account for the cross-world dependence [24, 49, 50], a plausible remedy when the cross-world conditional independence in Assumption 3 is violated. This assumption also implies their marginal hazard ratios at time t on treatment $A = 1$ vs. control $A = 0$ for the same patient population with covariates $Z$ and unmeasured factor $γ$ who would survive that time, no matter what treatment, which is related to the so-called conditional causal hazard ratio [50]. We stress in Assumption 6ii that the unmeasured factor has impacts on marginal distributions except for the global association between two event times under each counterfactual arm. We refer readers to the discussion section for more general identification assumptions and possible extensions.

Based on the weaker Assumptions 5 and 6, and coupled with Assumptions 1 and 2, the causal estimands in Equations (4–6) are identifiable with detailed formulas in Supplementary Section 2. It is important to note that the distribution of frailty $γ$ may not be estimable in general. As a result, we evaluate the effect of unmeasured factors by pre-specifying values of the unidentifiable parameter $σ$ . The expectation with respect to frailty as Equations (A.2-A.4) in the Supporting Information can then be numerically calculated by Monte Carlo integration techniques. Note that incorporating a frailty, even with a pre-specified value of $σ$ , is non-trivial and necessitates more intricate estimation, thus warranting further methodological development. Suppose that the marginal regressions of $T_{k} (a)$ on $Z$ , $k = 1, 2$ , follow the shared-frailty Cox proportional hazards models

λ_{k} (t ∣ Z, A = a, γ) = γ λ_{0 k}^{(a)} (t) \exp (β_{k}^{(a) T} Z)

(14)

where $λ_{k} (t ∣ Z, A = a, γ)$ is the conditional hazard function of $T_{k} (a)$ given the covariates $Z$ and treatment a; the definitions of $β_{k}^{(a)}$ and $λ_{0 k}^{(a)}$ are same as model (13); and $γ$ is the frailty defined in Assumption 6. The log-likelihood function regarding to the parameters $(α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)})$ given observed data $O$ can be expressed by

\begin{array}{l} l_{o} (α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)}; σ) \\ = \sum_{i = 1}^{n} \log \{\int \exp [l_{i} (α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)})] f_{γ} (γ_{i} ∣ σ) d γ_{i}\} \end{array}

(15)

where $f_{γ} (γ ∣ σ)$ is the density function of $γ$ with prespecified variance parameter $σ$ , and

\begin{array}{l} l_{i} (α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)}) \\ = (1 - δ_{i 1}) (1 - δ_{i 2}) \log [D \{X_{i}, Y_{i}; α^{(A_{i})}\}] \\ + δ_{i 1} δ_{i 2} \log [D_{12} \{X_{i}, Y_{i}; α^{(A_{i})}\} S_{1} (X_{i} ∣ Z_{i}, A_{i}, γ_{i}) \\ \times λ_{01}^{(A_{i})} (X_{i}) γ_{i} \exp (β_{1}^{(A_{i}) T} Z_{i})] \\ + δ_{i 1} δ_{i 2} \log [S_{2} (Y_{i} ∣ Z_{i}, A_{i}, γ_{i}) λ_{02}^{(A_{i})} (Y_{i}) γ_{i} \exp (β_{2}^{(A_{i}) T} Z_{i})] \\ + δ_{i 1} (1 - δ_{i 2}) \log [D_{1} \{X_{i}, Y_{i}; α^{(A_{i})}\} S_{1} (X_{i} ∣ Z_{i}, A_{i}, γ_{i}) \\ \times λ_{01}^{(A_{i})} (X_{i}) γ_{i} \exp (β_{1}^{(A_{i}) T} Z_{i})] \\ + (1 - δ_{i 1}) δ_{i 2} \log [D_{2} \{X_{i}, Y_{i}; α^{(A_{i})}\} S_{2} (Y_{i} ∣ Z_{i}, A_{i}, γ_{i}) \\ \times λ_{02}^{(A_{i})} (Y_{i}) γ_{i} \exp (β_{2}^{(A_{i}) T} Z_{i})] \end{array}

The integral in Equation (15) poses analytical intractability, making a direct evaluation of the likelihood function challenging. A prevalent approach involves approximating it and subsequently utilizing the approximated likelihood to conduct inference about the model parameters. We modify the MCEM algorithm [14] to accommodate our causal framework. To reduce computation complexity, we start from a small MCMC sample size and increase it as computation progresses to approximate the likelihood above to derive estimates of $(α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)})$ . We refer readers to Section 3 of the Supporting Information for more implementation details. Unlike Peng et al. [14]’s regression model, frailty variance is not estimable and should be prespecified in our potential outcomes framework, given that the two-world outcomes cannot be simultaneously observed. Nevertheless, it is noteworthy that their theoretical results remain applicable to our setting. In addition, from Equation (15), the proposed method is based upon (but not limited to) gamma-frailty for illustration. The proposed method can be extended to other general survival models and frailty distributions as well.

4.3 |. Statistical Inference for Causal Parameters

Utilizing the estimators derived from Sections 4.1 and 4.2, we proceed to estimate the causal effects via principal stratification in the context of semi-competing risks data. Specifically, the forms of estimators for baseline cumulative hazard functions $Λ_{01}^{(a)}$ and $Λ_{02}^{(a)}$ ) are provided in Equations (A.5-A.8) of the Supporting Information. When the conditional expectations of frailty in Equations (A.5) and (A.6) are excluded, these estimators reduce to estimators for $Λ_{01}^{(a)}$ and $Λ_{02}^{(a)}$ in the semiparametric model without frailty. Estimators of association parameter $α^{(a)}$ ) and regression coefficients $(β_{1}^{(a)}, β_{2}^{(a)}) (a = 0, 1)$ are obtained by maximizing the log-likelihood functions referred in Sections 4.1 and 4.2. It can be seen that the estimations for regression parameters in the two worlds are separate under the semiparametric model without frailty, whereas these parameters are jointly estimated under the model with frailty. If there is no unmeasured confounding, plugging estimators for $(α^{(a)}, Λ_{01}^{(a)}, Λ_{02}^{(a)}, β_{1}^{(a)}, β_{2}^{(a)})$ , $a = 0, 1$ , obtained from Section 4.1, which are consistent and asymptotically normal [12, 14, 47], into (10–12) and replacing $S_{k ∣ T_{1} \leq T_{2}, A = a, Z}$ and $S_{2 ∣ T_{1} \geq T_{2}, A = a, Z}$ in Equations (4–6) with its empirical counterpart yields consistent and asymptotically normal estimators for the three SCEs including ND-SCE₂(t), AD-SCE₁(t), and AD-SCE₂(t) with each time point $t$ . The asymptotic properties for the causal parameters follow from the continuous mapping theorem and the delta method. The same asymptotic properties are applied to the proposed method for sensitivity analysis. These estimators share the same asymptotic properties as those discussed in the literature [14], though their asymptotic variance can be more complex and lacks an explicit form. Thus, the classical bootstrap method based on resampling with replacement is more suggested to be used in practice to construct confidence intervals for both regression and causal parameters. Our simulation studies have further demonstrated the utility and effectiveness of the bootstrap method.

5 |. Application to a Breast Cancer Study

Hormone therapy or neoadjuvant endocrine therapy is often used in clinical practice for breast cancer patients as an adjuvant treatment following surgery. Despite its effectiveness, there is considerable debate surrounding the decision of whether to combine it with chemotherapy or radiotherapy or to pursue separate therapy after surgery [3]. Our objective was to apply our developed methods to evaluate the impact of hormone therapy and compare it with the combination of hormone therapy and radiotherapy in breast cancer patients who have undergone mastectomy surgery.

Data.

We applied our methods to analyze the data from the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) cohort [51, 52]. The response variable of our interest was the pair of time to relapse (RFS, $T_{1}$ ) and overall survival (OS, $T_{2}$ ) outcomes measured in months (i.e., semi-competing risks). Due to loss to follow-up and dependent censoring, the overall censoring rates for RFS and OS were 55% and 34%, respectively. Covariates used in our analysis included age at diagnosis (21–96 years old), ER status (ER, 1=positive, 0=negative), PR status (PR, 1=positive, 0=negative), HER2 status (HER2,1=positive, 0=negative), inferred menopausal state (MENO, 1=post-menopausal, 0=pre-menopausal), whether the number of lymph nodes examined positive bigger than 0 or not (NODE, 1= yes, 0=no), Nottingham prognostic index (NPI), and tumor size (SIZE). After data processing, there were 1114 patients taking mastectomies with complete records for covariates, RFS, and OS. In the following analyses, we used the Frank copula to describe the joint distribution of RFS and OS.

Primary analysis.

We conducted two primary analyses: (1) We first quantified the causal effect of hormone treatment alone $(a = 1)$ in comparison to no treatment $(a = 0)$ using the proposed causal framework. To eliminate the influence and interaction of other treatments, we specifically focused on a subset of patients who did not receive chemotherapy or radiotherapy, leaving 611 patients (59.4% patients with hormone treatment alone vs. 40.6% without any treatment) in this analysis. The estimation of regression coefficients is summarized in Table 1. We dropped the variables PR and HER2 in this analysis, considering that the estimated coefficients for PR and HER2 status were very small in this limited sample. Figure 1 displays the curves of causal parameters varying over time under two principal strata (AD, ND). (2) We continued to study the causal effect of the combination of hormone and radio treatment $(a = 1)$ in comparison to the use of hormone treatment alone $(a = 0)$ . Thus, there were 588 patients in this subgroup (38.3% patients with hormone and radiotherapy, vs. 61.7% with hormone therapy alone). Supplementary Table S.4 summarizes the effects of covariates on the marginal survival distribution for each outcome. Figure 2 displays causal estimands under the AD or ND stratum. Similar to the analysis above, we dropped the variable ER status due to very small estimated coefficients (Supplementary Table S.4).

TABLE 1 |.

Estimation of regression coefficients and Kendall’s tau $τ^{(a)}$ for the breast cancer study under different specified frailty variance $σ$ , where $a = 0$ and $1$ represent no treatment and hormone treatment, respectively.

		$RFS (T_{1})$						$OS (T_{2})$
$σ$	$τ^{(a)}$	Age	ER	Meno	Node	NPI	size	Age	ER	Meno	Node	NPI	Size
a=0 (no treatment)
0	0.574^**	0.005	0.457	−0.057	0.613	0.228 ^*	0.008	0.056^**	−0.021	−0.732^**	0.909^**	0.175 ^*	0.011
0.5	0.566^**	0.000	0.403	0.033	0.696 ^*	0.248 ^*	0.006	0.055	−0.003	−0.705^**	0.981^**	0.253^**	0.013
1	0.438^**	−0.002	0.388	−0.025	0.762	0.319^**	0.005	0.054	−0.083	−0.730^*	1.090^**	0.283^**	0.012
1.5	0.548^**	0.000	0.334	−0.034	0.678^*	0.266^*	0.008	0.054	−0.012	−0.710^**	0.927^**	0.213^*	0.011
10	0.498^**	−0.005	0.369	−0.037	0.743	0.252^*	0.005	0.051	−0.011	−0.749^*	0.964^*	0.197	0.011
a=1 (hormone treatment)
0	0.675^**	0.016	0.008	−0.081	0.500^**	0.179^*	0.021^**	0.044^**	−0.071	−0.027	0.270	0.188^**	0.017^**
0.5	0.655^**	0.010	−0.037	−0.134	0.515^**	0.281^*	0.025^*	0.045	−0.097	−0.005	0.337^*	0.236^**	0.017^**
1	0.505^**	0.004	−0.082	−0.124	0.559^*	0.289^**	0.024^**	0.042	−0.104	0.093	0.312	0.239	0.019^**
1.5	0.652^**	0.007	−0.082	−0.152	0.558^**	0.273^*	0.025	0.044	−0.129	0.008	0.355^*	0.249^**	0.019^**
10	0.584^**	0.004	−0.056	−0.151	0.563^**	0.270^**	0.024	0.041	−0.111	0.000	0.329^*	0.226^**	0.016^**

Open in a new tab

Note:

^{* and **}

indicate significance computed by 100 Bootstrap replicates at levels 0.1 and 0.05, respectively.

FIGURE 1 | — Causal effects of hormone treatment alone compared to no treatment use in breast cancer. The three plots in the top row present the estimated ND-SCE₂, AD-SCE1, and AD-SCE₂ with different specified frailty variance $σ$ (sensitivity analysis). The three plots in the bottom row present the estimated causal parameters along with their 90% confidence interval under pre-specified $σ = 0$ (i.e., no unmeasured confounding).

FIGURE 2 | — Causal effects of hormone- and radio-treatment compared to hormone therapy alone in breast cancer. The three plots in the top row present the estimated ND-SCE₂, AD-SCE₁, and AD-SCE₂ with different specified frailty variance $σ$ (sensitivity analysis). The three plots in the bottom row present the estimated causal parameters along with their 90% confidence interval under pre-specified $σ = 0$ (i.e., no unmeasured confounding).

Sensitivity analysis.

To more comprehensively evaluate the robustness of our findings to unmeasured confounding, we conducted a sensitivity analysis by following the strategy in Sections 4.2, 4.3, and varying the pre-specified $σ$ values. We also compared our method to the Kaplan-Meier analysis based upon the samples matched by propensity scores (PS) using baseline variables as covariates in the PS model [53], a widely used method in clinical studies. Figure 3 summarizes the results.

FIGURE 3 | — Kaplan-Meier survival curves obtained using propensity score matching method. (a) and (c) are KM survival curves of RFS by ignoring dependent censoring. (b) and (d) are KM survival curves of OS for breast cancer patients.

Result interpretation.

Results from the Copula regression model (Table 1, Supplementary Table S.4) indicated that there was a strongly positive association between RFS and OS, even after adjusting for covariates. This highlights the importance of modeling dependence between semi-parametric risks. These tables also indicated that the impacts of covariates (such as Her2, Node, Size) on RFS $(T_{1})$ and OS $(T_{2})$ varied between the two scenarios (i.e., $a = 1$ vs. $a = 0$ ). However, sensitivity analyses, involving variations in the pre-specified $σ$ values, suggested that the frailty variance may primarily affect the association between $T_{1}$ and $T_{2}$ and has a lesser influence on the significance of covariates. Specifically, in the first primary analysis, only the NPI variable consistently showed a positive association with $T_{1}$ for patients with $a = 0$ , while the variables Node, NPI, and Size were positively associated with $T_{1}$ for patients with $a = 1$ . In the second primary analysis, the variables Node, NPI, and Size were positively associated with $T_{1}$ for patients with $a = 0$ , while HER2, Size, and Meno had an impact for patients with $a = 1$ , where the first two showed a positive association and the last one showed a negative association.
Results from causal estimand estimation: By assuming no unmeasured confounding (i.e., $σ = 0$ ), using Hormone therapy alone compared to no treatment had a positive impact and was significantly beneficial to those patients in the “ND” stratum at some duration after surgery. But it may not have significant effects on both RFS and OS for patients in the “AD” stratum with long-time treatment (Figure 1). These imply that using Hormone therapy alone could be good for patients’ overall survival in the “ND” stratum, compared to no treatment use. Sensitivity analysis with varied $σ$ also supported the above conclusion (Figure 1). On the other hand, we detected a weak effect by comparing combined therapy with Hormone therapy alone (Figure 2) in the “AD” stratum at some duration after surgery. This implies that using the combined therapy might be even better for patients’ overall survival in the “AD” stratum, compared to using Hormone therapy alone. More discussions about AD/ND stratum are provided in Section 7.
In the Kaplan-Meier analysis under PS-matched cohorts (Figure 3), we observed that not undergoing any treatment resulted in a significantly higher survival rate compared to hormone treatment, and the combined treatment did not exhibit a clear effect on OS, contrary to expectations in clinical practice. This finding supports the necessity of the principal stratification analysis and demonstrates that the proposed causal framework for terminal event times provides more conservative and arguably more reliable results than conventional survival analysis. As for the KM survival curves of RFS, the combined treatment exhibited a significant effect on postponing the occurrence of relapse. This discrepancy could be attributed to the Kaplan-Meier analysis, even after PS matching, ignoring the dependence structure between RFS and OS, and failing to account for potential differences over principal strata. Consequently, we place more emphasis on and advocate for the use of our proposed methods, which offer a more valid causal comparison and yield more reasonable final results.

We also applied the semi-parametric estimator from the Nevo-Gorfine illness-death model [23] to our breast cancer dataset, with results presented in Figure S.9 in the Supporting Information. Their method suggested that hormone treatment alone reduced the survival rate for patients in the “ND” (never-diseased) stratum at certain durations post-surgery compared to no treatment. In the “AD” (always-diseased) stratum, hormone therapy showed a significantly positive impact on delaying relapse and death. Furthermore, combined treatment showed a positive effect on OS in the “ND” stratum but negative effects on both RFS and OS in the “AD” stratum. However, these findings do not align well with well-recognized clinical knowledge and practice. Notably, radiotherapy, a more aggressive treatment, is typically reserved for high-risk patients, while hormone therapy, being gentler, is preferred for patients with lower relapse risk [4, 54]. Generally speaking, for patients expected to die without relapse, hormone therapy alone is often sufficient, avoiding the need for aggressive radiotherapy; for patients at high risk of relapse, radiotherapy is recommended to eliminate residual tumors and prevent cancer progression. However, the results produced using the illness-death model deviate from these clinical guidelines, suggesting treatment strategies that contradict standard practice. In contrast, our proposed estimator produces results that align more closely with clinical expectations. These also highlight the robustness of our method in real-world practice.

6 |. Simulation Studies

The data generation mechanisms are classified into two categories: The “without frailty” class (where the cross-world dependence is exclusively captured by all observed covariates) and the “with frailty” class (where the cross-world dependence is not only attributed to observed covariates but also to latent and unobservable frailty variables). We evaluated the proposed methods based on four simulation settings (Ex1-Ex4 cases), where case Ex2 contains more covariates than case Ex1; case Ex3 is used to examine the robustness of our causal framework when treatment allocation is affected by baseline covariates as indicated by Assumption 2; and case Ex4 is to evaluate our method in the presence of a frailty describing two-world dependency. Due to the page limit, we refer readers to Supplementary Section 4 for more technical details.

Under each scenario, we generated 200 Monte Carlo replicates with sample size n = 500 or 1000. In all scenarios, we first estimated regression coefficients, baseline cumulative hazard functions, and the association between non-terminal and terminal times for each arm. We provided a comprehensive evaluation of the estimation results of regression coefficients and association parameters in terms of mean biases, Monte Carlo standard deviations (MCSD), and asymptotic standard errors (ASE) across 200 simulation runs. We only present the results from case Ex1. The other three cases showed similar patterns, thus omitted in this manuscript. We then estimated stratum-specific survivor causal effects (4–6) and evaluated their performance under cases Ex1-Ex4 varying with time over a grid of 30 time points which is determined by the sample order statistics of observed $T_{1}$ . We used the bootstrap method with 100 replicates to estimate the standard errors of the estimators for both causal parameters and regression coefficients.

Under Scenario Ex1 with low/high censoring setting, the biases and MCSDs of estimated regression coefficients appear to be negligible and diminish as the sample size increases for both Kendall’s tau values (0.3 and 0.6) (Supplementary Table S.2). The estimated standard errors are close to the empirical standard deviation of point estimates of regression coefficients. The estimated baseline cumulative hazard functions (depicted in red) for the first 50 simulated samples surround the truth in blue (Supplementary Figure S.1). In Figures 4 and 5, the mean estimated causal parameters (red curves) plus/minus 1.96 times empirical standard deviation (pink vertical bars) encapsulate the truth (depicted in black dashed curves), and discrepancies between ASE and MCSD at 20 time points seem to be negligible and fluctuate around zero. In Table 2, the biases of estimated causal parameters at specific time points are small and their coverage probabilities (CP) are reasonable but fluctuate slightly around the nominal level 0.95, which is expected regarding the complex nature of causal estimand estimation.

FIGURE 4 | — Estimation of causal parameters under the case Ex1 with $τ = 0.3$ and n = 1000. The three plots in the top row present the mean estimated causal parameters (represented by red curves) plus/minus 1.96 times the empirical standard deviation, encapsulating the true values (depicted in black dashed curves). The three plots in the bottom row present the difference between the Bootstrap-based ASE and the MCSD, fluctuating around zero. From left to right, these plots correspond to ND-SCE₂, AD-SCE₁, and AD-SCE₂, respectively.

FIGURE 5 | — Estimation of causal parameters under the case Ex1 with $τ = 0.6$ and n = 1000. The three plots in the top row present the mean estimated causal parameters (represented by red curves) plus/minus 1.96 times the empirical standard deviation, encapsulating the true values (depicted in black dashed curves). The three plots in the bottom row present the difference between the Bootstrap-based ASE and the MCSD, fluctuating around zero. From left to right, these plots correspond to ND-SCE₂, AD-SCE1, and AD-SCE₂, respectively.

TABLE 2 |.

Estimation of causal parameters (ND-SCE₂, AD-SCE₁ and AD-SCE₂) at time points t = 3,6 in the case Ex1 with low censoring. The true values of causal parameters, the mean biases, and the coverage probabilities (CP) of their estimates are shown in this table.

	ND-SCE₂			AD-SCE₁			AD-SCE₂
Time	Truth	Bias	95%CP	Truth	Bias	95%CP	Truth	Bias	95%CP
$τ = 0.3$ , $n = 500$
3	0.32	−0.002	0.96	0.38	0.003	1.00	0.26	−0.004	0.94
6	0.26	0.013	0.96	0.17	0.009	0.94	0.27	−0.005	0.96
$τ = 0.3$ , $n = 1000$
3	0.32	0.005	0.94	0.38	−0.008	0.92	0.26	−0.004	1.00
6	0.26	0.004	0.92	0.17	0.002	0.90	0.27	−0.006	0.94
$τ = 0.6$ , $n = 500$
3	0.30	0.018	0.98	0.37	−0.026	0.98	0.28	−0.010	0.98
6	0.31	−0.055	0.94	0.18	−0.017	1.00	0.25	0.028	0.86
$τ = 0.6$ , $n = 1000$
3	0.30	0.020	0.96	0.37	−0.031	0.98	0.28	−0.011	1.00
6	0.31	−0.015	0.94	0.18	−0.010	0.98	0.25	0.016	0.92

Open in a new tab

In the presence of more covariates as in case Ex2, estimators of causal parameters still perform well in terms of negligible estimation bias (Supplementary Figure S.2). Under case Ex3, it can be seen from Figure S.3 in the Supporting Information that the proposed estimator shows satisfactory performance except for a slight deviation shown in the ND stratum. This anomaly is understandable in light of the high-censoring environment, where fewer than 8% of subjects belong to the ND stratum and are observed to have the $T_{2}$ event. In case Ex4 with a frailty describing two-world dependence, the proposed MCEM algorithm in Section 4 yields asymptotically unbiased estimation for regression coefficients (Supplementary Table S.3) as well as causal parameters (Figure 6). The results presented in the manuscript are based on n = 1000, Kendall’s $τ = 0.6$ , and $σ = 0.2$ or 0.4. More simulation evaluations based upon different combinations of sample size $n$ , Kendall’s $τ$ , and $σ$ can be found in Supplementary Figures S.4-S.6. Similar patterns are observed over all the settings, demonstrating the validity of our new method.

FIGURE 6 | — Estimation of causal parameters (ND-SCE₂, AD-SCE₁, and AD-SCE₂) in the case Ex4 (evaluating the method for sensitivity analysis) with n = 1000, $τ = 0.6$ , and $σ = 0.2 / 0.4$ .

We further evaluated the sensitivity of our method to copula misspecification and compared its performance with the semiparametric approach proposed by [23] under various data-generating scenarios. Specifically, when the data were generated using the Clayton copula, the gamma-frailty illness-death model by [23] was correctly specified, as the Clayton copula structure aligns to some extent with a gamma frailty model [10, 55]. In contrast, data generated from the Frank copula led to misspecification in the frailty model. We conducted simulations under Case Ex4 using either the Clayton or Frank copula, with $σ = 0.2$ governing the cross-world dependency, to assess the sensitivity of both the proposed method and the Nevo-Gorfine method under model misspecification. Results are presented in Figures S.7 and S.8 of the Supporting Information. When both models are correctly specified (Figure S.7a), the two methods yield comparable results. When the Nevo-Gorfine method is correctly specified but the proposed method is misspecified (Figure S.7b), the proposed method shows slightly inferior yet acceptable performance. However, when the proposed method is correctly specified and the Nevo-Gorfine method is misspecified (Figure S.8a), the Nevo-Gorfine method appears more sensitive to model misspecification compared to the earlier case. Finally, when both models are misspecified (Figure S.8b), the proposed method demonstrates greater robustness-particularly in estimating the terminal event time in the “ND” stratum.

7 |. Discussion

This article establishes a frequentist framework using copula models for causal interpretation, net quantity estimation, and sensitivity analysis for unmeasured factors under right-censored semi-competing risks data. We have demonstrated the non-parametric identification and justified the feasibility and utility of the proposed estimation. The advantage of the proposed method over existing methods, including Nevo and Gorfine [23], is its capacity to assess net covariate effects and dependency between non-terminal and terminal event times under each counterfactual arm. Additionally, by leveraging the copula structure and a cross-world frailty variable, the proposed causal framework offers enhanced flexibility for practical applications and demonstrates more robust performance against model misspecification in numerical studies, compared to the illness-death based approach [23].

In the observational study of breast cancer, by applying our method, we detected that hormone therapy could be beneficial to patients with a low risk of relapse compared to no treatment use, and that the combined treatment will be beneficial to those with a higher risk of relapse, compared to hormone therapy alone. These findings imply a more tailored intervention strategy and suggest intermediate intervention after assessing the risk of relapse could be much more beneficial to patients’ overall survival, which to some extent aligns with previous clinical research [56] and provides more insight into personalized medicine for breast cancer patients to improve their post-surgery outcomes.

For future method development, we introduce various extensions and pose open questions for readers to explore and enhance the current framework: (1) Copula model selection. Hsieh et al. [57] provided the hypothesis test in terms of whether a copula model fits the data. Specifically, for each covariate group, the function $G_{a} (t_{1}, t_{2}) = \Pr (T_{1} \geq t_{1}, T_{2} \geq t_{2} ∣ δ_{1} = 1, δ_{2} = 1, A = a)$ is identifiable non-parametrically in the upper wedge as

{\tilde{G}}_{a} (t_{1}, t_{2}) = \sum_{i = 1}^{n} I (X_{i} \geq t_{1}, Y_{i} \geq t_{2}, δ_{i 1} = 1, δ_{i 2} = 1, A_{i} = a) / \sum_{i = 1}^{n} I (δ_{i 1} = 1, δ_{i 2} = 1, A_{i} = a)

The goodness-of-fit test can be thus constructed by comparing ${\tilde{G}}_{a} (t_{1}, t_{2})$ and ${\hat{G}}_{a} (t_{1}, t_{2})$ based on two copula models based on some distance measure [57]. (2) This paper considers a simple copula model with homogeneous association for illustration. The extensions of such a model include time-dependent association [12] and covariate-dependent association [58]. (3) Another extension is to study multiple treatments involving considering the effects of various interventions on survival outcomes subject to semi-competing risks, while addressing potential confounding factors. This extension often requires methodologies like the generalized propensity score, instrumental variables, or causal mediation analysis, adapted to handle multiple treatment groups. (4) Recurrent event data offers a valuable opportunity for methodological extension. One potential direction is to replace the non-terminal event time in models (1) and (9) of the manuscript with the recurrence gap time. By adopting population partitions such as “always recurrent” and “never recurrent,” analogous to those used in this study, the focus shifts to paired outcomes-namely, the time to first recurrence and time to death-thereby reducing the problem to a semi-competing risks framework. (5) How to appropriately identify the latent AD and ND sub-populations is unclear in practice. This motivates our future work on defining a proper discrimination measure under the semi-competing risks setting. Such an approach would serve as a useful tool for clinical audiences. All of the above merits further investigation in future studies.

Supplementary Material

Supp Material

NIHMS2100360-supplement-Supp_Material.pdf^{(1MB, pdf)}

Supporting Information

Additional supporting information can be found online in the Supporting Information section. Data S1. The Supporting Information contains technical proofs of propositions referenced in Section 3, the maximum likelihood estimation algorithm referenced in Section 4, and additional numerical results referenced in Sections 5 and 6.

Acknowledgments

C. Chen’s work was supported by National Institutes of Health grants: R01AG089377 and P30AG028747.

Funding:

This work was supported by the National Institutes of Health (Grant No. R01AG089377, P30AG028747).

Footnotes

Conflicts of Interest

The authors declare no conflicts of interest.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

References

1.Cohen S, Cummings J, Knox S, Potashman M, and Harrison J, “Clinical Trial Endpoints and Their Clinical Meaningfulness in Early Stages of Alzheimer’s Disease,” Journal of Prevention of Alzheimer’s Disease 9, no. 3(2022): 507–522. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Mao L. and Kim K, “Statistical Models for Composite Endpoints of Death and Nonfatal Events: A Review,” Statistics in Biopharmaceutical Research 13, no. 3 (2021): 260–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Mayer IA and Arteaga CL, “PIK3CA Activating Mutations: A Discordant Role in Early Versus Advanced Hormone-Dependent Estrogen Receptor-Positive Breast Cancer?,” Journal of Clinical Oncology 32, no. 27 (2014): 2932–2934. [DOI] [PubMed] [Google Scholar]
4.McDuff SG and Blitzblau RC, “Optimizing Adjuvant Treatment Recommendations for Older Women With Biologically Favorable Breast Cancer: Short-Course Radiation or Long-Course Endocrine Therapy?,” Current Oncology 30, no. 1 (2022): 392–400. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Fine JP and Gray RJ, “A Proportional Hazards Model for the Subdistribution of a Competing Risk,” Journal of the American Statistical Association 94 (1999): 496–509. [Google Scholar]
6.Young JG, Stensrud MJ, Tchetgen EJT, and Hernán MA, “A Causal Framework for Classical Statistical Estimands in Failure-Time Settings With Competing Events,” Statistics in Medicine 39 (2020): 1199–1236. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Varadhan R, Xue QL, and Bandeen-Roche K, “Semicompeting Risks in Aging Research: Methods, Issues and Needs,” Lifetime Data Analysis 20, no. 4 (2014): 538–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Li R. and Cheng Y, “Flexible Association Modelling and Prediction With Semi-Competing Risks Data,” Canadian Journal of Statistics 44, no. 3(2016): 361–374. [Google Scholar]
9.Li R. and Peng L, “Varying Coefficient Subdistribution Regression for Left-Truncated Semi-Competing Risks Data,” Journal of Multivariate Analysis 131 (2014): 65–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Xu J, Kabfleisch JD, and Tai B, “Statistical Analysis of Illness Death Processes and Semi-Competing Risks Data,” Biometrics 66 (2010): 716–725. [DOI] [PubMed] [Google Scholar]
11.Peterson AV, “Bounds for a Joint Distribution Function With Fixed Sub-Distribution Functions: Application to Competing Risks,” Proceedings of the National Academy of Sciences of the United States of America 73, no. 1 (1976): 11–13, 10.1073/pnas.73.1.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Peng L. and Fine JP, “Regression Modeling of Semicompeting Risks Data,” Biometrics 63, no. 1 (2007): 96–108. [DOI] [PubMed] [Google Scholar]
13.Chen YH, “Maximum Likelihood Analysis of Semicompeting Risks Data With Semiparametric Regression Models,” Lifetime Data Analysis 18 (2012): 36–57. [DOI] [PubMed] [Google Scholar]
14.Peng M, Xiang L, and Wang S, “Semiparametric Regression Analysis of Clustered Survival Data With Semi-Competing Risks,” Computational Statistics and Data Analysis 124 (2018): 53–70. [Google Scholar]
15.Peng L. and Fine JP, “Rank Estimation of Accelerated Lifetime Models With Dependent Censoring,” Journal of the American Statistical Association 101, no. 475 (2006): 1085–1093. [Google Scholar]
16.Ding AA, Shi G, Wang W, and Hsieh JJ, “Marginal Regression Analysis for Semi-Competing Risks Data Under Dependent Censoring,” Scandinavian Journal of Statistics 36, no. 3 (2009): 481–500. [Google Scholar]
17.Emura T, Sofeu CL, and Rondeau V, “Conditional Copula Models for Correlated Survival Endpoints: Individual Patient Data Meta-Analysis of Randomized Controlled Trials,” Statistical Methods in Medical Research 30, no. 12 (2021): 2634–2650. [DOI] [PubMed] [Google Scholar]
18.Sun T, Liang W, Zhang G, Yi D, Ding Y, and Zhang L, “Penalised Semi-Parametric Copula Method for Semi-Competing Risks Data: Application to Hip Fracture in Elderly,” Journal of the Royal Statistical Society. Series C, Applied Statistics 73, no. 1 (2024): qlad093. [Google Scholar]
19.Wei Y, Wojtyś M, Sorrell L, and Rowe P, “Bivariate Copula Regression Models for Semi-Competing Risks,” Statistical Methods in Medical Research 32, no. 10 (2023): 1902–1918. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Yu JC and Huang YT, “Unified Semicompeting Risks Analysis of Hepatitis Natural History Through Mediation Modeling,” Statistics in Medicine 42, no. 24 (2023): 4301–4318. [DOI] [PubMed] [Google Scholar]
21.Stensrud MJ, Hernán MA, Tchetgen Tchetgen EJ, Robins JM, Didelez V, and Young JG, “A Generalized Theory of Separable Effects in Competing Event Settings,” Lifetime Data Analysis 27, no. 4 (2021): 588–631. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Stensrud MJ, Young JG, Didelez V, Robins JM, and Hernán MA, “Separable Effects for Causal Inference in the Presence of Competing Events,” Journal of the American Statistical Association 117, no. 537 (2022): 175–183. [Google Scholar]
23.Nevo D. and Gorfine M, “Causal Inference for Semi-Competing Risks Data,” Biostatistics 23, no. 4 (2022): 1115–1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Axelrod R. and Nevo D, “A Sensitivity Analysis Approach for the Causal Hazard Ratio in Randomized and Observational Studies,” Biometrics 79, no. 3 (2022): 2743–2756. [DOI] [PubMed] [Google Scholar]
25.Robins JM, “An Analytic Method for Randomized Trials With Informative Censoring: Part 1,” Lifetime Data Analysis 1 (1995): 241–254. [DOI] [PubMed] [Google Scholar]
26.Comment L, Mealli F, Haneuse S, and Zigler C, “Survivor average causal effects for continuous time: a principal stratification approach to causal inference with semicompeting risks. arXiv preprint arXiv:1902.09304,” 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Xu Y, Scharfstein D, Müller P, and Daniels M, “A Bayesian Nonparametric Approach for Evaluating the Causal Effect of Treatment in Randomized Trials With Semi-Competing Risks,” Biostatistics 23, no. 1 (2022): 34–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Huang YT, “Causal Mediation of Semicompeting Risks,” Biometrics 77, no. 4 (2021): 1143–1154. [DOI] [PubMed] [Google Scholar]
29.Deng Y, Wang Y, Zhan X, and Zhou XH, “Separable Pathway Effects of Semi-Competing Risks via Multi-State Models. arXiv preprint arXiv:2306.15947,” 2023. [Google Scholar]
30.Deng Y, Wang Y, and Zhou XH, “Direct and Indirect Treatment Effects in the Presence of Semi-Competing Risks. arXiv preprint arXiv:2309.01721,” 2024. [DOI] [PubMed] [Google Scholar]
31.Breum MS, Munch A, Gerds1 TA, and Martinussen T, “Estimation of Separable Direct and Indirect Effects in a Continuousx02011;Time illnessx02011;Death Model,” Lifetime Data Analysis 30 (2024): 143–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Frangakis CE and Rubin DB, “Principal Stratification in Causal Inference,” Biometrics 58, no. 1 (2002): 21–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Sklar A, “Fonctions de Repartition a n Dimensions et Leurs Marges,” Publications de L’institut Statistique de L’université de Paris 8 (1959): 229–231. [Google Scholar]
34.Chen YH, “Semiparametric Marginal Regression Analysis for Dependent Competing Risks Under an Assumed Copula,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 72, no. 2 (2010): 235–251, 10.1111/j.1467-9868.2009.00734.x. [DOI] [Google Scholar]
35.Lakhal L, Rivest LP, and Abdous B, “Estimating Survival and Association in a Semicompeting Risks Model,” Biometrics 64, no. 1 (2008): 180–188. [DOI] [PubMed] [Google Scholar]
36.Neyman J, “Sur les applications de la théorie des probabilités aux experiences agricoles: Essai des principes,” Roczniki Nauk Rolniczych 10, no. 1 (1923): 1–51. [Google Scholar]
37.Rubin DB, “Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies,” Journal of Education & Psychology 66, no. 5 (1974): 688–701. [Google Scholar]
38.Rubin DB, “Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies,” Statistical Science 5, no. 4 (1990): 472–480. [Google Scholar]
39.Rubin DB, “Causal Inference Through Potential Outcomes and Principal Stratification: Application to Studies With “Censoring” due to Death,” Statistical Science 21, no. 3 (2006): 299–309, 10.1214/088342306000000114. [DOI] [Google Scholar]
40.Ding P, Geng Z, Yan W, and Zhou XH, “Identifiability and Estimation of Causal Effects by Principal Stratification With Outcomes Truncated by Death,” Journal of the American Statistical Association 106, no. 496 (2011): 1578–1591. [Google Scholar]
41.Dai JY, Gilbert PB, and Mâsse BR, “Partially Hidden Markov Model for Time-Varying Principal Stratification in HIV Prevention Trials,” Journal of the American Statistical Association 107, no. 497 (2012): 52–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Tchetgen Tchetgen EJ, “Identification and Estimation of Survivor Average Causal Effects,” Statistics in Medicine 33, no. 21 (2014): 3601–3628. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Graf E, Schmoor C, Sauerbrei W, and Schumacher M, “Assessment and Comparison of Prognostic Classification Schemes for Survival Data,” Statistics inMedicine 18, no. 17-18 (1999): 2529–2545. [DOI] [PubMed] [Google Scholar]
44.Richardson TS and Robins JM, “Single World Intervention Graphs (SWIGs): A Unification of the counterfactual and graphical approaches to causality,” Center for the Statistics and the Social Sciences, University of Washington Series. Working Paper 2013 128, no. 30 (2013). [Google Scholar]
45.Robins JM, Richardson TS, and Shpitser I, “An Interventionist Approach to Mediation Analysis,” in Probabilistic and Causal Inference: The Works of Judea Pearl (ACM Books, 2022), 713–764. [Google Scholar]
46.Stensrud MJ, Young JG, and Martinussen T, “Discussion on Causal Mediation of Semicompeting Risks by Yen-Tsung Huang,” Biometrics 77, no. 4 (2021): 1160–1164. [DOI] [PubMed] [Google Scholar]
47.Zhu H, Lan Y, Ning J, and Shen Y, “Semiparametric Copula-Based Regression Modeling of Semi-Competing Risks Data,” Communications in Statistics - Theory and Methods 51, no. 22 (2022): 7830–7845. [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Peng M. and Xiang L, “Joint Regression Analysis for Survival Data in the Presence of Two Sets of Semi-Competing Risks,” Biometrical Journal 61, no. 6 (2019): 1402–1416. [DOI] [PubMed] [Google Scholar]
49.Aalen OO, Cook RJ, and Roysland K, “Does Cox Analysis of a Randomized Survival Study Yield a Causal Treatment Effect?,” Lifetime Data Analysis 21 (2015): 579–593. [DOI] [PubMed] [Google Scholar]
50.Martinussen T, Vansteelandt S, and Andersen PK, “Subtleties in the Interpretation of Hazard Contrasts,” Lifetime Data Analysis 26 (2020): 833–855. [DOI] [PubMed] [Google Scholar]
51.Curtis C, Shah SP, Chin SF, et al. , “The Genomic and Transcriptomic Architecture of 2,000 Breast Tumours Reveals Novel Subgroups,” Nature 486, no. 7403 (2012): 346–352, 10.1038/nature10983. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Pereira B, Chin SF, Rueda OM, et al. , “The Somatic Mutation Profiles of 2,433 Breast Cancers Refine Their Genomic and Transcriptomic Landscapes,” Nature Communications 7, no. 1 (2016): 11479. [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Austin PC, “The Use of Propensity Score Methods With Survival or Time-To-Event Outcomes: Reporting Measures of Effect Similar to Those Used in Randomized Experiments,” Statistics inMedicine 33, no. 7 (2014): 1242–1258. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Chadha M, White J, Swain S, et al. , “Optimal Adjuvant Therapy in Older (Ge; 70 Years of Age) Women With Low-Risk Early-Stage Breast Cancer,” npj Breast Cancer 9, no. 1 (2023): 99. [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Goethals K, Janssen P, and Duchateau L, “Frailty Models and Copulas: Similarities and Differences,” Journal of Applied Statistics 35, no. 9 (2008): 1071–1079. [Google Scholar]
56.Sledge GW, Mamounas EP, Hortobagyi GN, Burstein HJ, Goodwin PJ, and Wolff AC, “Past, Presentand Future Challenges in Breast Cancer Treatment,” Journal of Clinical Oncology 32, no. 19 (2014): 1979–1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Hsieh JJ, Wang W, and Ding AA, “Regression Analysis Based on Semicompeting Risks Data,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 70, no. 1 (2008): 3–20, 10.1111/j.1467-9868.2007.00621.x. [DOI] [Google Scholar]
58.Li R. and Peng L, “Quantile Regression Adjusting for Dependent Censoring From Semicompeting Risks,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 77, no. 1 (2015): 107–130. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Material

NIHMS2100360-supplement-Supp_Material.pdf^{(1MB, pdf)}

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

[R1] 1.Cohen S, Cummings J, Knox S, Potashman M, and Harrison J, “Clinical Trial Endpoints and Their Clinical Meaningfulness in Early Stages of Alzheimer’s Disease,” Journal of Prevention of Alzheimer’s Disease 9, no. 3(2022): 507–522. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Mao L. and Kim K, “Statistical Models for Composite Endpoints of Death and Nonfatal Events: A Review,” Statistics in Biopharmaceutical Research 13, no. 3 (2021): 260–269. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Mayer IA and Arteaga CL, “PIK3CA Activating Mutations: A Discordant Role in Early Versus Advanced Hormone-Dependent Estrogen Receptor-Positive Breast Cancer?,” Journal of Clinical Oncology 32, no. 27 (2014): 2932–2934. [DOI] [PubMed] [Google Scholar]

[R4] 4.McDuff SG and Blitzblau RC, “Optimizing Adjuvant Treatment Recommendations for Older Women With Biologically Favorable Breast Cancer: Short-Course Radiation or Long-Course Endocrine Therapy?,” Current Oncology 30, no. 1 (2022): 392–400. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Fine JP and Gray RJ, “A Proportional Hazards Model for the Subdistribution of a Competing Risk,” Journal of the American Statistical Association 94 (1999): 496–509. [Google Scholar]

[R6] 6.Young JG, Stensrud MJ, Tchetgen EJT, and Hernán MA, “A Causal Framework for Classical Statistical Estimands in Failure-Time Settings With Competing Events,” Statistics in Medicine 39 (2020): 1199–1236. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Varadhan R, Xue QL, and Bandeen-Roche K, “Semicompeting Risks in Aging Research: Methods, Issues and Needs,” Lifetime Data Analysis 20, no. 4 (2014): 538–562. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Li R. and Cheng Y, “Flexible Association Modelling and Prediction With Semi-Competing Risks Data,” Canadian Journal of Statistics 44, no. 3(2016): 361–374. [Google Scholar]

[R9] 9.Li R. and Peng L, “Varying Coefficient Subdistribution Regression for Left-Truncated Semi-Competing Risks Data,” Journal of Multivariate Analysis 131 (2014): 65–78. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Xu J, Kabfleisch JD, and Tai B, “Statistical Analysis of Illness Death Processes and Semi-Competing Risks Data,” Biometrics 66 (2010): 716–725. [DOI] [PubMed] [Google Scholar]

[R11] 11.Peterson AV, “Bounds for a Joint Distribution Function With Fixed Sub-Distribution Functions: Application to Competing Risks,” Proceedings of the National Academy of Sciences of the United States of America 73, no. 1 (1976): 11–13, 10.1073/pnas.73.1.11. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Peng L. and Fine JP, “Regression Modeling of Semicompeting Risks Data,” Biometrics 63, no. 1 (2007): 96–108. [DOI] [PubMed] [Google Scholar]

[R13] 13.Chen YH, “Maximum Likelihood Analysis of Semicompeting Risks Data With Semiparametric Regression Models,” Lifetime Data Analysis 18 (2012): 36–57. [DOI] [PubMed] [Google Scholar]

[R14] 14.Peng M, Xiang L, and Wang S, “Semiparametric Regression Analysis of Clustered Survival Data With Semi-Competing Risks,” Computational Statistics and Data Analysis 124 (2018): 53–70. [Google Scholar]

[R15] 15.Peng L. and Fine JP, “Rank Estimation of Accelerated Lifetime Models With Dependent Censoring,” Journal of the American Statistical Association 101, no. 475 (2006): 1085–1093. [Google Scholar]

[R16] 16.Ding AA, Shi G, Wang W, and Hsieh JJ, “Marginal Regression Analysis for Semi-Competing Risks Data Under Dependent Censoring,” Scandinavian Journal of Statistics 36, no. 3 (2009): 481–500. [Google Scholar]

[R17] 17.Emura T, Sofeu CL, and Rondeau V, “Conditional Copula Models for Correlated Survival Endpoints: Individual Patient Data Meta-Analysis of Randomized Controlled Trials,” Statistical Methods in Medical Research 30, no. 12 (2021): 2634–2650. [DOI] [PubMed] [Google Scholar]

[R18] 18.Sun T, Liang W, Zhang G, Yi D, Ding Y, and Zhang L, “Penalised Semi-Parametric Copula Method for Semi-Competing Risks Data: Application to Hip Fracture in Elderly,” Journal of the Royal Statistical Society. Series C, Applied Statistics 73, no. 1 (2024): qlad093. [Google Scholar]

[R19] 19.Wei Y, Wojtyś M, Sorrell L, and Rowe P, “Bivariate Copula Regression Models for Semi-Competing Risks,” Statistical Methods in Medical Research 32, no. 10 (2023): 1902–1918. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Yu JC and Huang YT, “Unified Semicompeting Risks Analysis of Hepatitis Natural History Through Mediation Modeling,” Statistics in Medicine 42, no. 24 (2023): 4301–4318. [DOI] [PubMed] [Google Scholar]

[R21] 21.Stensrud MJ, Hernán MA, Tchetgen Tchetgen EJ, Robins JM, Didelez V, and Young JG, “A Generalized Theory of Separable Effects in Competing Event Settings,” Lifetime Data Analysis 27, no. 4 (2021): 588–631. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Stensrud MJ, Young JG, Didelez V, Robins JM, and Hernán MA, “Separable Effects for Causal Inference in the Presence of Competing Events,” Journal of the American Statistical Association 117, no. 537 (2022): 175–183. [Google Scholar]

[R23] 23.Nevo D. and Gorfine M, “Causal Inference for Semi-Competing Risks Data,” Biostatistics 23, no. 4 (2022): 1115–1132. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Axelrod R. and Nevo D, “A Sensitivity Analysis Approach for the Causal Hazard Ratio in Randomized and Observational Studies,” Biometrics 79, no. 3 (2022): 2743–2756. [DOI] [PubMed] [Google Scholar]

[R25] 25.Robins JM, “An Analytic Method for Randomized Trials With Informative Censoring: Part 1,” Lifetime Data Analysis 1 (1995): 241–254. [DOI] [PubMed] [Google Scholar]

[R26] 26.Comment L, Mealli F, Haneuse S, and Zigler C, “Survivor average causal effects for continuous time: a principal stratification approach to causal inference with semicompeting risks. arXiv preprint arXiv:1902.09304,” 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Xu Y, Scharfstein D, Müller P, and Daniels M, “A Bayesian Nonparametric Approach for Evaluating the Causal Effect of Treatment in Randomized Trials With Semi-Competing Risks,” Biostatistics 23, no. 1 (2022): 34–49. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Huang YT, “Causal Mediation of Semicompeting Risks,” Biometrics 77, no. 4 (2021): 1143–1154. [DOI] [PubMed] [Google Scholar]

[R29] 29.Deng Y, Wang Y, Zhan X, and Zhou XH, “Separable Pathway Effects of Semi-Competing Risks via Multi-State Models. arXiv preprint arXiv:2306.15947,” 2023. [Google Scholar]

[R30] 30.Deng Y, Wang Y, and Zhou XH, “Direct and Indirect Treatment Effects in the Presence of Semi-Competing Risks. arXiv preprint arXiv:2309.01721,” 2024. [DOI] [PubMed] [Google Scholar]

[R31] 31.Breum MS, Munch A, Gerds1 TA, and Martinussen T, “Estimation of Separable Direct and Indirect Effects in a Continuousx02011;Time illnessx02011;Death Model,” Lifetime Data Analysis 30 (2024): 143–180. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Frangakis CE and Rubin DB, “Principal Stratification in Causal Inference,” Biometrics 58, no. 1 (2002): 21–29. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Sklar A, “Fonctions de Repartition a n Dimensions et Leurs Marges,” Publications de L’institut Statistique de L’université de Paris 8 (1959): 229–231. [Google Scholar]

[R34] 34.Chen YH, “Semiparametric Marginal Regression Analysis for Dependent Competing Risks Under an Assumed Copula,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 72, no. 2 (2010): 235–251, 10.1111/j.1467-9868.2009.00734.x. [DOI] [Google Scholar]

[R35] 35.Lakhal L, Rivest LP, and Abdous B, “Estimating Survival and Association in a Semicompeting Risks Model,” Biometrics 64, no. 1 (2008): 180–188. [DOI] [PubMed] [Google Scholar]

[R36] 36.Neyman J, “Sur les applications de la théorie des probabilités aux experiences agricoles: Essai des principes,” Roczniki Nauk Rolniczych 10, no. 1 (1923): 1–51. [Google Scholar]

[R37] 37.Rubin DB, “Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies,” Journal of Education & Psychology 66, no. 5 (1974): 688–701. [Google Scholar]

[R38] 38.Rubin DB, “Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies,” Statistical Science 5, no. 4 (1990): 472–480. [Google Scholar]

[R39] 39.Rubin DB, “Causal Inference Through Potential Outcomes and Principal Stratification: Application to Studies With “Censoring” due to Death,” Statistical Science 21, no. 3 (2006): 299–309, 10.1214/088342306000000114. [DOI] [Google Scholar]

[R40] 40.Ding P, Geng Z, Yan W, and Zhou XH, “Identifiability and Estimation of Causal Effects by Principal Stratification With Outcomes Truncated by Death,” Journal of the American Statistical Association 106, no. 496 (2011): 1578–1591. [Google Scholar]

[R41] 41.Dai JY, Gilbert PB, and Mâsse BR, “Partially Hidden Markov Model for Time-Varying Principal Stratification in HIV Prevention Trials,” Journal of the American Statistical Association 107, no. 497 (2012): 52–65. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] 42.Tchetgen Tchetgen EJ, “Identification and Estimation of Survivor Average Causal Effects,” Statistics in Medicine 33, no. 21 (2014): 3601–3628. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] 43.Graf E, Schmoor C, Sauerbrei W, and Schumacher M, “Assessment and Comparison of Prognostic Classification Schemes for Survival Data,” Statistics inMedicine 18, no. 17-18 (1999): 2529–2545. [DOI] [PubMed] [Google Scholar]

[R44] 44.Richardson TS and Robins JM, “Single World Intervention Graphs (SWIGs): A Unification of the counterfactual and graphical approaches to causality,” Center for the Statistics and the Social Sciences, University of Washington Series. Working Paper 2013 128, no. 30 (2013). [Google Scholar]

[R45] 45.Robins JM, Richardson TS, and Shpitser I, “An Interventionist Approach to Mediation Analysis,” in Probabilistic and Causal Inference: The Works of Judea Pearl (ACM Books, 2022), 713–764. [Google Scholar]

[R46] 46.Stensrud MJ, Young JG, and Martinussen T, “Discussion on Causal Mediation of Semicompeting Risks by Yen-Tsung Huang,” Biometrics 77, no. 4 (2021): 1160–1164. [DOI] [PubMed] [Google Scholar]

[R47] 47.Zhu H, Lan Y, Ning J, and Shen Y, “Semiparametric Copula-Based Regression Modeling of Semi-Competing Risks Data,” Communications in Statistics - Theory and Methods 51, no. 22 (2022): 7830–7845. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] 48.Peng M. and Xiang L, “Joint Regression Analysis for Survival Data in the Presence of Two Sets of Semi-Competing Risks,” Biometrical Journal 61, no. 6 (2019): 1402–1416. [DOI] [PubMed] [Google Scholar]

[R49] 49.Aalen OO, Cook RJ, and Roysland K, “Does Cox Analysis of a Randomized Survival Study Yield a Causal Treatment Effect?,” Lifetime Data Analysis 21 (2015): 579–593. [DOI] [PubMed] [Google Scholar]

[R50] 50.Martinussen T, Vansteelandt S, and Andersen PK, “Subtleties in the Interpretation of Hazard Contrasts,” Lifetime Data Analysis 26 (2020): 833–855. [DOI] [PubMed] [Google Scholar]

[R51] 51.Curtis C, Shah SP, Chin SF, et al. , “The Genomic and Transcriptomic Architecture of 2,000 Breast Tumours Reveals Novel Subgroups,” Nature 486, no. 7403 (2012): 346–352, 10.1038/nature10983. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] 52.Pereira B, Chin SF, Rueda OM, et al. , “The Somatic Mutation Profiles of 2,433 Breast Cancers Refine Their Genomic and Transcriptomic Landscapes,” Nature Communications 7, no. 1 (2016): 11479. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] 53.Austin PC, “The Use of Propensity Score Methods With Survival or Time-To-Event Outcomes: Reporting Measures of Effect Similar to Those Used in Randomized Experiments,” Statistics inMedicine 33, no. 7 (2014): 1242–1258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] 54.Chadha M, White J, Swain S, et al. , “Optimal Adjuvant Therapy in Older (Ge; 70 Years of Age) Women With Low-Risk Early-Stage Breast Cancer,” npj Breast Cancer 9, no. 1 (2023): 99. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R55] 55.Goethals K, Janssen P, and Duchateau L, “Frailty Models and Copulas: Similarities and Differences,” Journal of Applied Statistics 35, no. 9 (2008): 1071–1079. [Google Scholar]

[R56] 56.Sledge GW, Mamounas EP, Hortobagyi GN, Burstein HJ, Goodwin PJ, and Wolff AC, “Past, Presentand Future Challenges in Breast Cancer Treatment,” Journal of Clinical Oncology 32, no. 19 (2014): 1979–1986. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R57] 57.Hsieh JJ, Wang W, and Ding AA, “Regression Analysis Based on Semicompeting Risks Data,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 70, no. 1 (2008): 3–20, 10.1111/j.1467-9868.2007.00621.x. [DOI] [Google Scholar]

[R58] 58.Li R. and Peng L, “Quantile Regression Adjusting for Dependent Censoring From Semicompeting Risks,” Journal of the Royal Statistical Society. Series B, Statistical Methodology 77, no. 1 (2015): 107–130. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Exploring Causal Effects of Hormone- and Radio-Treatments in an Observational Study of Breast Cancer Using Copula-Based Semi-Competing Risks Models

Tonghui Yu

Mengjiao Peng

Yifan Cui

Elynn Chen

Chixiang Chen

Abstract

1 |. Introduction

1.1 |. Semi-Competing Risks Data

1.2 |. Semi-Parametric Regression With Semi-Competing Risks

1.3 |. Causal Inference Under Semi-Competing Risks

2 |. Copula-Based Models for Bivariate Survival Function

3 |. Treatment Effects

3.1 |. Observed Data and Potential Outcomes

Observed data.

Potential outcomes.

Assumption 1.

Assumption 2.

3.2 |. Causal Estimands

Assumption 3.

Proposition 1 (Non-parametric identification).

4 |. Identification and Estimation of Principal Causal Effects

4.1 |. Identification Based on Stringent Assumption

Assumption 4.

4.2 |. Sensitivity Analysis

Assumption 5.

Assumption 6.

4.3 |. Statistical Inference for Causal Parameters

5 |. Application to a Breast Cancer Study

Data.

Primary analysis.

TABLE 1 |.

FIGURE 1 |.

FIGURE 2 |.

Sensitivity analysis.

FIGURE 3 |.

Result interpretation.

6 |. Simulation Studies

FIGURE 4 |.

FIGURE 5 |.

TABLE 2 |.

FIGURE 6 |.

7 |. Discussion

Supplementary Material

Acknowledgments

Funding:

Footnotes

Data Availability Statement

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases