Published in final edited form as: Commun Stat Simul Comput. 2020 Feb 7;51(7):3851–3867. doi: 10.1080/03610918.2020.1722838

Simulating survival data with predefined censoring rates under a mixture of non-informative right censoring schemes

Fei Wan 1
PMCID: PMC12922656  NIHMSID: NIHMS2136681  PMID: 41727356

Abstract

Simulation studies have been routinely used to validate the performance of statistical methods for censored survival data under various scenarios. Our previous work proposed an integrated approach for simulating right-censored survival data for proportional hazards models given a set of arbitrarily distributed baseline covariates and predefined censoring rates. However, that work assumed that all study subjects are enrolled at the same time and that there is no study ending time. We extend the previous work to accommodate the more realistic scenario in which study subjects are enrolled at a constant rate during an enrollment period and are then followed until one of the following events occurs: (a) the event of interest (e.g., death or occurrence of disease); (b) the end of the study period; (c) early withdrawal due to a random censoring event, whichever comes first. To demonstrate the application of the proposed approach in practice, we generated censored survival data and assessed the impact of several factors (the magnitude of confounding, the size of the treatment effect, the sine distance between coefficient vectors of confounders in the treatment and outcome models, and the censoring rate) on the potential bias of propensity score matching estimators in estimating conditional and marginal hazards ratios.

Keywords: Survival analysis, administrative censoring, non-informative right censoring, proportional hazards model, propensity score matching, bias

1. Introduction

Clinical researchers often encounter time to event outcomes in medical research. In a typical cohort study, patients enter the study during a fixed length enrollment period and are then followed to record their time until an event of interest occurs (i.e., death, occurrence of cancer, etc.), or until a censoring event occurs. Right censoring is the most common censoring mechanism in clinical studies, in which the individual’s time to event is greater than his or her censoring time.

There are two common types of right censoring in clinical research. Consider, for example, a randomized trial designed to compare the effects of two competing therapies on cancer patients' survival. Patients are accrued over a fixed enrollment period (e.g., 2 years) and then followed for a fixed period of time (e.g., 3 years), so the whole study is completed in 5 years. The first type is administrative censoring: the event is observed only if it occurs before the pre-specified study ending time. Because of time or cost considerations, the investigator terminates the study or reports the results before all subjects realize their events. The second type is censoring due to competing events or loss to follow-up before the study ends. Some study subjects may experience competing events that cause them to be removed from the study, or they may move away from the study location for reasons unrelated to the event of interest. In either case, their events of interest are not observable.

Simulation studies have been routinely used to validate the performance of proportional hazards (PH) models and other statistical methods for survival outcomes in the presence of censoring (Aalen, Cook, and Roysland 2015; Wu 2017). Our previous work (Wan 2017) presented an integrated approach for simulating survival data for PH models by simultaneously incorporating a baseline hazard function with a known distribution, a known censoring distribution, and a set of baseline covariates of arbitrary distributions. That approach numerically determines the value of the censoring parameter in a specified censoring distribution to achieve a predefined censoring proportion in the simulated survival data. Its limitations are that all subjects enter the study at the same time and there is no study ending time point. In this study, we improve the previous framework to accommodate the more realistic scenario in which study subjects enter the study at varying time points during a fixed enrollment period and are subject to both random right censoring during the study period and administrative censoring when the study ends.

This article is organized as follows. In Sec. 2, we lay out the notation, assumptions, and models for a typical cohort study with a survival outcome of interest. In Sec. 3, we present the general approach for simulating censored survival data with baseline covariates at a predetermined censoring rate, allowing for study subjects to enter the study at a constant rate during the enrollment period and to be subject to random censoring events during follow-up and administrative censoring when the study ends. In Sec. 4, we design cohort studies to verify that the censoring rates in the simulated survival data are close to their nominal levels. To demonstrate the usefulness of the proposed method in practice, in Sec. 5 we design a complex simulation study to assess the bias of propensity score (PS) matching estimators for survival outcomes at different censoring rates. We conclude in Sec. 6.

2. Notation, assumptions, and models

Suppose that in a cohort study (Figure 1) we enroll a total of n patients. The calendar starting and ending times of the study are predetermined by the investigator. We define l to be the total calendar time of the study from start to end, and a to be the length of the accrual period. Thus, every patient has a minimal follow-up period of length l − a. Each subject enters the study randomly, at a constant rate, during the enrollment period [0, a], and is then followed until one of the following occurs: (i) the event of interest; (ii) the end of the study; (iii) early withdrawal due to a random censoring event, whichever happens first.

Figure 1.

The study subjects are enrolled in the period [0, a]. The follow-up period runs from the entry time point E_i to the study ending time point l.

We let X_{i,j} denote the jth baseline covariate of the ith subject and X_i = (X_{i,1}, X_{i,2}, …, X_{i,p}) denote the 1×p vector of baseline covariates for the ith subject, where i = 1, 2, …, n and j = 1, 2, …, p. Next, let T_i represent the time to the event of interest for the ith subject and t denote a realized value of the event time. The event times of the sample units are assumed to be independently and identically distributed (i.i.d.). The hazard function for subject i is given by the following multiplicative risk model

h(t | X_i) = h_0(t) exp(X_i β),  t > 0, (1)

where h_0(t) is a nonnegative baseline hazard function and β = (β_1, β_2, …, β_p)^T is the corresponding p×1 vector of regression coefficients. The covariate component in model (1), exp(X_i β), characterizes how the covariates influence the hazard function.

We assume h_0(t) is from a Weibull(α, ν) distribution with density function

f(t) = (α t^{α−1} / ν^α) exp(−(t/ν)^α),  t > 0,

where α > 0 is the shape parameter and ν > 0 is the scale parameter of the distribution. Thus, we have

h_0(t) = (α / ν^α) t^{α−1}.

It follows that the hazard function h(t | X_i) can be specified as

h(t | X_i) = (α / ν^α) t^{α−1} exp(X_i β) = (α / (ν exp(−X_i β/α))^α) t^{α−1} = (α / λ_i^α) t^{α−1},  t > 0,

where λ_i = exp(X̃_i β̃) > 0, X̃_i = (1, X_i), and β̃ = ((log ν^α)/α, −β^T/α)^T. Thus, the event time for individual i given a set of baseline covariates follows a Weibull(α, λ_i) distribution. The survival function of T_i given X_i is

S(t | X_i) = exp(−(t/λ_i)^α). (2)

In realistic settings, baseline covariates usually come from different distributions (normal, Bernoulli, Poisson, etc.). It is difficult to derive the joint probability density function f(x) of the high-dimensional covariate vector X_i, either analytically or numerically. Instead of working with X_i and f(x) directly, it is much simpler to work with the single variable λ_i and its density function f_λ(λ).
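Because T_i | X_i ~ Weibull(α, λ_i), data generation ultimately only requires λ_i. The following Python sketch (illustrative parameter values; the paper's supplemental code is in R) computes λ_i from mixed covariates and draws event times by inverse-transform sampling from S(t | X_i) = exp(−(t/λ_i)^α):

```python
import numpy as np
from math import gamma

rng = np.random.default_rng(0)
n = 100_000
alpha, nu = 1.5, 2.0                               # Weibull shape and scale (illustrative)
beta = np.array([0.2, -0.2, 0.1, -0.1]) * alpha    # illustrative covariate coefficients

# Baseline covariates from arbitrary distributions
X = np.column_stack([rng.normal(0, 1, n),
                     rng.uniform(0, 1, n),
                     rng.binomial(1, 0.5, n),
                     rng.poisson(5, n)])

# lambda_i = nu * exp(-X_i beta / alpha), so that T_i | X_i ~ Weibull(alpha, lambda_i)
lam = nu * np.exp(-(X @ beta) / alpha)

# Inverse-transform sampling: if U ~ Uniform(0, 1), then T = lam * (-log U)^(1/alpha)
U = rng.uniform(size=n)
T = lam * (-np.log(U)) ** (1 / alpha)

# Sanity check: E[T | lambda] = lambda * Gamma(1 + 1/alpha)
expected_mean = lam.mean() * gamma(1 + 1 / alpha)
```

A quick check that the empirical survival fraction at any t matches the average of S(t | X_i) over subjects confirms the conditional Weibull structure.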

Because of the constant enrollment rate assumption, the time to entry for patient i, denoted by Ei, is

E_i ~ Uniform(0, a),

and the time from the entry to the study’s ending time point for individual i is defined as follows:

C_{1i} = l − E_i.

We can interpret C_{1i} as the time to the administrative censoring event. It follows that

C_{1i} ~ Uniform(l − a, l).

Let f_{c1}(c1) denote the probability density function of C_{1i}. That is,

f_{c1}(c1) = 1/a for l − a ≤ c1 ≤ l, and 0 otherwise.

Let C_{ri} denote the time to the occurrence of a random right censoring event (e.g., loss to follow-up, or censoring due to death from causes unrelated to the event of interest). We assume that the censoring times of all sample units are i.i.d. Let f_{cr}(cr | θ) denote the density function of C_{ri}, where θ is a censoring parameter. Common choices of censoring distribution include the uniform distribution Uniform(0, θ) and the Weibull distribution Weibull(k, θ).

We let Y_i = min(T_i, C_{ri}, C_{1i}) be the observed follow-up time and δ_i = I(T_i < min(C_{ri}, C_{1i})) be the censoring indicator, with δ_i = 1 if individual i experiences the event of interest and δ_i = 0 if this individual is censored, either by a random right censoring event during the study period or by administrative censoring. The administrative censoring time C_{1i}, the random right censoring time C_{ri}, and the event time T_i are assumed to be mutually independent; that is, the censoring is non-informative. The important notation is summarized in the nomenclature at the end of this paper.
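These definitions map directly to code. A minimal numpy sketch (all parameter values illustrative) of assembling the observed data (Y_i, δ_i) from simulated T_i, C_{ri}, and C_{1i}:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 50_000
l, a = 12.0, 2.0                  # total study length and accrual period (illustrative)
alpha_t, alpha_c = 1.5, 1.2       # Weibull shapes for event and censoring times
lam, theta = 3.0, 5.0             # illustrative scale parameters

E = rng.uniform(0, a, n)                 # entry times, E_i ~ Uniform(0, a)
C1 = l - E                               # administrative censoring, C_1i ~ Uniform(l - a, l)
Cr = theta * rng.weibull(alpha_c, n)     # random right-censoring times
T = lam * rng.weibull(alpha_t, n)        # event times

Y = np.minimum.reduce([T, Cr, C1])             # observed follow-up time
delta = (T < np.minimum(Cr, C1)).astype(int)   # 1 if the event is observed
```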

3. The general framework

We aim to generate censored survival data in which the relationship between baseline covariates and the time to event outcome can be described by a proportional hazards (PH) model. In particular, the proportion of censored subjects in this data needs to be equal to the pre-determined nominal level. For this purpose, we follow a general procedure consisting of the following three steps: (1) derive the conditional censoring probability for each individual given a set of baseline covariates and censoring parameter; (2) derive the censoring rate function for the study population by marginalizing baseline covariates out from the conditional censoring probability function. This censoring rate function is a function of censoring parameter only; and (3) set the censoring rate function equal to a given value of censoring proportion and solve the equation for the corresponding value of censoring parameter.

3.1. Derivation of the individual censoring probability P(δ_i = 0 | λ_i, θ)

An individual's event of interest is observed if his or her event time is less than both censoring times. This comprises two scenarios: C_{ri} can occur before or after C_{1i}. Thus, the event of interest is observed when T_i < C_{ri} < C_{1i} or T_i < C_{1i} < C_{ri}, where T_i > 0, C_{ri} > 0, and l − a ≤ C_{1i} ≤ l. We let P(δ_i = 1 | X_i, θ) denote the conditional probability of having an event for the ith individual given the baseline covariates X_i and the censoring parameter θ, during a study of length l after this individual enters the study within the enrollment period a. For simplicity, we work with λ_i instead of X_i. Equivalently, we have

P(δ_i = 1 | λ_i, θ) = P(0 < T_i < C_{ri}, 0 < C_{ri} < C_{1i}, l − a < C_{1i} < l) + P(0 < T_i < C_{1i}, C_{1i} < C_{ri}, l − a < C_{1i} < l)
= ∫_{l−a}^{l} ∫_0^{c1} ∫_0^{cr} f_{c1}(c1 | a) f_{cr}(cr | θ) f(t | X_i) dt dcr dc1 + ∫_{l−a}^{l} ∫_{c1}^{∞} ∫_0^{c1} f_{cr}(cr | θ) f_{c1}(c1 | a) f(t | X_i) dt dcr dc1
= ∫_{l−a}^{l} ∫_0^{c1} (1/a) f_{cr}(cr | θ) [1 − exp(−(cr/λ_i)^α)] dcr dc1 + ∫_{l−a}^{l} ∫_{c1}^{∞} [1 − exp(−(c1/λ_i)^α)] (1/a) f_{cr}(cr | θ) dcr dc1. (3)

We need to specify f_{cr}(cr | θ) in Eq. (3). Common distributions for C_{ri} include the Weibull and uniform distributions.

  • Scenario 1: Weibull censoring time. When the censoring time C_{ri} ~ Weibull(k, θ), we have

f_{cr}(cr | k, θ) = (k cr^{k−1} / θ^k) exp(−(cr/θ)^k),

where k > 0 is the shape parameter and θ > 0 is the scale parameter of the distribution. Study subjects are censored at varying rates during the study period after they enter the study. The conditional probability of experiencing an event of interest for individual i is:

P(δ_i = 1 | λ_i, θ) = ∫_{l−a}^{l} ∫_0^{c1} (1/a) (k cr^{k−1}/θ^k) exp(−(cr/θ)^k) [1 − exp(−(cr/λ_i)^α)] dcr dc1 + ∫_{l−a}^{l} ∫_{c1}^{∞} [1 − exp(−(c1/λ_i)^α)] (1/a) (k cr^{k−1}/θ^k) exp(−(cr/θ)^k) dcr dc1
= ∫_{l−a}^{l} ∫_0^{c1} (1/a) (k cr^{k−1}/θ^k) exp(−(cr/θ)^k) [1 − exp(−(cr/λ_i)^α)] dcr dc1 + ∫_{l−a}^{l} [1 − exp(−(c1/λ_i)^α)] (1/a) exp(−(c1/θ)^k) dc1.
  • Scenario 2: Uniform censoring time. When C_{ri} ~ Uniform(0, θ), the density function of C_{ri} is

f_{cr}(cr | θ) = 1/θ,  0 < cr < θ,

and study subjects are censored at a constant rate during the time interval (0, θ). However, we have to determine θ in two different ways for a given censoring rate, depending on whether θ is larger than l − a. When θ is less than l − a, C_{ri} is always less than C_{1i}, so censoring by administrative censoring is impossible. Then, the conditional probability of having an event for individual i is simply (Wan 2017)

P(δ_i = 1 | λ_i, θ) = 1 − (λ_i / (αθ)) γ(1/α, (θ/λ_i)^α), (4)

where the lower incomplete gamma function γ(k, x) is

γ(k, x) = ∫_0^x t^{k−1} e^{−t} dt.

When θ is greater than l − a, the conditional probability of having an event for individual i based on Eq. (3) is

P(δ_i = 1 | λ_i, θ) = ∫_{l−a}^{l} ∫_0^{c1} (1/(aθ)) [1 − exp(−(cr/λ_i)^α)] dcr dc1 + ∫_{l−a}^{l} ∫_{c1}^{θ} [1 − exp(−(c1/λ_i)^α)] (1/(aθ)) dcr dc1
= (1/(2aθ)) [l² − (l − a)²] − (1/(aθ)) ∫_{l−a}^{l} (λ_i/α) γ(1/α, (c1/λ_i)^α) dc1 + ∫_{l−a}^{l} (1/(aθ)) [1 − exp(−(c1/λ_i)^α)] (θ − c1) dc1.

The conditional probability of being censored for individual i given λ_i is

P(δ_i = 0 | λ_i, θ) = 1 − P(δ_i = 1 | λ_i, θ). (5)

To derive the censoring rate in the entire study population, the individual-specific component λ_i needs to be integrated out of P(δ_i = 0 | λ_i, θ).
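The probability in Eq. (3) is straightforward to evaluate numerically. The sketch below (Python with scipy rather than the paper's R; parameter values are illustrative) computes P(δ_i = 1 | λ_i, θ) under Weibull censoring, using the survival function P(C_{ri} > c1) = exp(−(c1/θ)^k) for the second term, and checks the quadrature result against a Monte Carlo estimate:

```python
import numpy as np
from scipy.integrate import dblquad, quad

# Illustrative parameter values
l, a = 12.0, 2.0              # study length and accrual period
alpha_t, alpha_c = 1.5, 1.2   # event-time and censoring Weibull shapes
lam, theta = 2.0, 3.0         # event-time and censoring scales

def f_cr(cr):
    """Weibull(alpha_c, theta) censoring density."""
    return (alpha_c / theta) * (cr / theta) ** (alpha_c - 1) * np.exp(-(cr / theta) ** alpha_c)

def F_T(t):
    """Event-time CDF, 1 - S(t | lambda_i)."""
    return 1 - np.exp(-(t / lam) ** alpha_t)

# First term of Eq. (3): T < Cr < C1 (dblquad integrates f(cr, c1) over cr in [0, c1])
term1 = dblquad(lambda cr, c1: (1 / a) * f_cr(cr) * F_T(cr),
                l - a, l, 0, lambda c1: c1)[0]
# Second term: T < C1 < Cr, with the inner Cr-integral done analytically
term2 = quad(lambda c1: (1 / a) * np.exp(-(c1 / theta) ** alpha_c) * F_T(c1),
             l - a, l)[0]
p_event = term1 + term2

# Monte Carlo check of the same probability
rng = np.random.default_rng(2)
n = 400_000
C1 = l - rng.uniform(0, a, n)
Cr = theta * rng.weibull(alpha_c, n)
T = lam * rng.weibull(alpha_t, n)
p_mc = (T < np.minimum(Cr, C1)).mean()
```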

3.2. Derivation of the censoring rate P(δ_i = 0 | θ)

We obtain the censoring rate in the study population by taking the expectation of P(δ_i = 0 | λ_i, θ) with respect to λ_i:

P(δ_i = 0 | θ) = E_{λ_i}[P(δ_i = 0 | λ_i, θ)] = ∫_D P(δ_i = 0 | u, θ) f_λ(u) du, (6)

where D denotes the domain of λi. It is difficult to derive the exact probability density function of λi analytically. However, we can estimate the density function fλλ using the kernel density estimation (KDE) method. KDE is a non-parametric method to estimate the probability density function of a random variable for a given dataset.

Suppose we have a univariate i.i.d. random sample λ_1, λ_2, …, λ_n drawn from some unknown probability density function f_λ(λ), and we are interested in the shape of this density function. A kernel density estimator of f_λ(λ) is

f̂_λ(λ) = (1/(nh)) Σ_{i=1}^{n} K((λ − λ_i)/h),

where h > 0 is a smoothing parameter called the bandwidth and K is the kernel function; (1/h) K((λ − λ_i)/h) is called the scaled kernel.

The kernel function is a symmetric, non-negative, real-valued function that integrates to 1. In this context the kernel acts as a weighting function, weighting each data point λ_i according to its distance from λ. A common choice is the Gaussian kernel,

K(λ) = (1/√(2π)) e^{−λ²/2}.

The kernel estimator f̂_λ(λ) is a biased estimator of the density f_λ(λ). A large value of h results in larger bias and smaller variance; a smaller h results in smaller bias and larger variance. There is always a tradeoff between the bias of the kernel density estimator and its variance when choosing the bandwidth. A recommended rule of thumb for bandwidth selection (Venables and Ripley 2002) is

ĥ = 1.06 min(σ̂, IQR/1.34) n^{−1/5},

where IQR is the interquartile range, computed as the difference between the 75th and 25th percentiles, and σ̂ is the sample standard deviation.
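A direct implementation of this estimator with the rule-of-thumb bandwidth takes only a few lines; the Python sketch below is comparable to what R's density() produces with the bw.nrd bandwidth rule (the sample and grid are illustrative):

```python
import numpy as np

def kde_gaussian(data, grid):
    """Gaussian-kernel density estimate with h = 1.06 * min(sd, IQR/1.34) * n^(-1/5)."""
    n = data.size
    q75, q25 = np.percentile(data, [75, 25])
    h = 1.06 * min(data.std(ddof=1), (q75 - q25) / 1.34) * n ** (-0.2)
    # f_hat(x) = (1/(n h)) * sum_i K((x - x_i) / h), with K the standard normal density
    z = (grid[:, None] - data[None, :]) / h
    return np.exp(-0.5 * z ** 2).sum(axis=1) / (n * h * np.sqrt(2 * np.pi))

rng = np.random.default_rng(3)
sample = rng.normal(0, 1, 5000)
grid = np.linspace(-5, 5, 401)
f_hat = kde_gaussian(sample, grid)
area = f_hat.sum() * (grid[1] - grid[0])   # estimated density should integrate to about 1
```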

3.3. Numerical solution of the censoring parameter θ for a predefined proportion

We set up a function γ(θ | p) based on Eq. (6):

γ(θ | p) = P(δ_i = 0 | θ) − p = ∫_0^{+∞} P(δ_i = 0 | u, θ) f_λ(u) du − p. (7)

For each combination of individual censoring probability P(δ_i = 0 | λ_i, θ) and density function f_λ(u), we solve γ(θ | p) = 0 for the θ that yields the desired censoring proportion p. This equation cannot be solved explicitly. Instead, we use numerical integration to compute the integral in Eq. (7) and then use the Brent-Dekker root-finding algorithm, implemented in the R function uniroot, to find the solution for θ.
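The three steps can be illustrated end to end in the simplest setting, uniform censoring with θ < l − a, where Eq. (4) applies and administrative censoring never binds. In this Python sketch (the λ distribution is illustrative, and the marginalization in Eq. (6) is done by averaging over a simulated λ sample rather than by KDE plus quadrature), a Brent-type root finder recovers the θ that achieves a 30% censoring rate, which is then verified by simulation:

```python
import numpy as np
from scipy.optimize import brentq
from scipy.special import gammainc, gamma

alpha = 1.5                                           # Weibull shape of the event time
rng = np.random.default_rng(4)
lam = 2.0 * np.exp(-0.3 * rng.normal(size=50_000))    # simulated lambda_i sample (illustrative)

def censoring_rate(theta):
    # P(delta=0 | lambda, theta) = (lambda/(alpha*theta)) * incgamma(1/alpha, (theta/lambda)^alpha)
    # for Uniform(0, theta) censoring; scipy's gammainc is regularized, hence the Gamma factor.
    g = gammainc(1 / alpha, (theta / lam) ** alpha) * gamma(1 / alpha)
    return np.mean(lam / (alpha * theta) * g)

p = 0.30                                              # target censoring proportion
theta_hat = brentq(lambda t: censoring_rate(t) - p, 0.01, 100.0)

# Verify the achieved censoring rate by simulation
T = lam * rng.weibull(alpha, lam.size)
C = rng.uniform(0, theta_hat, lam.size)
observed_rate = (C < T).mean()
```

The censoring rate is monotone in θ (near 1 for tiny θ, near 0 for huge θ), so the bracket [0.01, 100] is safe for this λ distribution.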

4. Case study

In this section, we design a hypothetical cohort study with survival outcomes to validate the proposed algorithm. Suppose a 10-year cohort study enrolls patients in the first two years, and each patient is followed until one of the following events occurs: (i) the event of interest; (ii) a competing censoring event or loss to follow-up; or (iii) the end of the study period, whichever comes first. The minimal and maximal follow-up times are thus 8 and 10 years, respectively. Details of the simulation include:

  1. The entry time for each subject follows a Uniform(0, 2) distribution. Four independent baseline variables were generated for each subject: X1 ~ Normal(0, 1), X2 ~ Uniform(0, 1), X3 ~ Bernoulli(0.5), and X4 ~ Poisson(5).

  2. We assume the baseline hazard function of the event time is from Weibull(α, ν), i.e., h_0(t) = (α/ν^α) t^{α−1}. The shape parameter α was set at 0.5, 1, 1.5 to represent decreasing, constant, and increasing hazards, respectively. The scale parameter ν was set at 2, so that β_0 = log ν^α = 0.347, 0.693, 1.040, respectively. The regression coefficients for X_i were set as (β_1, β_2, β_3, β_4) = (0.2, −0.2, 0.1, −0.1) × α. The event time for individual i was generated from Weibull(α, λ_i) with λ_i = exp((β_0 − X_i β)/α), so that the underlying adjusted hazard function follows the form of the PH model defined by Eq. (1).

  3. The administrative censoring time C_{1i} ~ Uniform(8, 12). The time to random censoring events C_{ri} was generated under two scenarios: (i) study subjects were censored at higher rates at later time points, i.e., C_{ri} ~ Weibull(1.2, θ); (ii) subjects were censored at a constant rate, i.e., C_{ri} ~ Uniform(0, θ). We searched for the θ that yields total censoring rates of 30%, 50%, and 70% in the simulated data.

For each scenario, we solved for θ at the given censoring rate and used it to simulate 1000 datasets of sample size n = 10000 each. We then averaged the 1000 sample censoring rates to estimate the true censoring proportion.

It is worth noting that we need to check the range of λ in the simulated data in order to compute the numerical integration in Eq. (7) precisely; e.g., when α = 0.5, λ ∈ (0, 10), and when α = 1.5, λ ∈ (0, 16) (sample R code is in the appendix). The results are reported in Table 1. When the censoring distribution is uniform, the censoring parameter θ was determined using the two different approaches laid out in Sec. 3, depending on whether the value of θ is larger or smaller than the length of follow-up for the specified censoring rate. The sample censoring proportions based on the computed censoring parameters are very close to their nominal levels.

Table 1.

A comparison of sample censoring rates and nominal censoring rates.

Survival time	Censoring time	θ	Nominal censoring proportion	Sample censoring proportion
Weibull(0.5, λ)	Uniform(0, θ)	15.329	30%	30.04%
		4.094	50%	50.05%
		1.021	70%	70.04%
	Weibull(1.2, θ)	9.333	30%	29.98%
		2.444	50%	49.99%
		0.594	70%	70.04%
Weibull(1, λ)	Uniform(0, θ)	31.343	30%	30.05%
		9.126	50%	50.09%
		4.256	70%	70.08%
	Weibull(1.2, θ)	7.465	30%	29.90%
		3.307	50%	49.98%
		1.464	70%	69.98%
Weibull(1.5, λ)	Uniform(0, θ)	10.881	30%	30.03%
		5.937	50%	49.97%
		3.363	70%	70.01%
	Weibull(1.2, θ)	7.220	30%	29.94%
		3.636	50%	50.01%
		1.896	70%	70.03%

5. Application: bias of propensity score matching estimator

In this section, we design a complex simulation study to assess the impact of various factors (the size of the treatment effect, the similarity between the coefficients of the confounding variables in the treatment and outcome models, the magnitude of confounding, the censoring rate, etc.) on the potential bias of propensity score matching estimators for estimating conditional and marginal hazard ratios. We need to generate censored survival data with pre-specified censoring rates under hundreds of different scenarios, and it is impossible to do so by tuning the censoring parameter manually; with our approach, however, this can be done easily. Through this demonstration, we show the usefulness of our approach in real simulation studies. Propensity score matching is a very common tool that researchers use to design observational studies that mimic randomized trials. However, some statistical methods commonly used in randomized trials or conventional matched designs may not work in a PS matched design as we would expect. Specifically, whether these methods estimate marginal or conditional treatment effects remains unclear among applied researchers. A conditional treatment effect is the average effect of treatment on the individual; a marginal treatment effect is the average effect of treatment on the population.

Austin (2013) and Austin et al. (2007) examined the following three methods: (1) an unadjusted Cox PH model including the binary treatment indicator only (the "naive" Cox model); (2) an unadjusted Cox PH model with a robust sandwich variance estimator (the "robust" Cox model); (3) an unadjusted Cox PH model stratified on the matched pairs. These studies showed that the naive Cox model is biased in estimating the conditional hazards ratio in non-censored survival data. Both the naive and robust methods appear to be consistent in estimating the marginal hazards ratio, but the stratified approach is biased. We design an extensive simulation study to re-assess the consistency of these three methods in estimating conditional and marginal hazards ratios in censored survival data. We also compare three variance estimators for the PS matching estimator of the marginal hazards ratio (the empirical variance estimator, the model-based variance estimator, and the robust sandwich estimator) to assess whether the robust sandwich estimator adequately captures the true variability of the PS matching estimator.

Suppose an 8-year-long prospective observational cohort study enrolls patients during a two-year period at a constant enrollment rate. Once entered into the study, each patient is followed until the event of interest occurs or until the end of the study. Each patient receives either treatment (D = 1) or control (D = 0) upon enrollment. The likelihood of receiving treatment depends on the four baseline characteristics X = (X1, X2, X3, X4) as follows:

logit P(D = 1 | X) = α_0 + Σ_{j=1}^{4} α_{1,j} X_j, (8)

where α_1 = (α_{1,1}, α_{1,2}, α_{1,3}, α_{1,4}) are the regression coefficients for X. The time to event outcome for each patient is also determined by X via the following PH model:

h(t | D, X) = h_0(t) exp(β_0 + β_1 D + Σ_{j=1}^{4} β_{2,j} X_j), (9)

where e^{β_1} is interpreted as the conditional hazards ratio, and β_2 = (β_{2,1}, β_{2,2}, β_{2,3}, β_{2,4}) is the vector of regression coefficients for X. The marginal hazards ratio is defined by

δ = log S_1(t) / log S_0(t),

where S_1(t) = P(T_1 > t) and T_1 is the potential event time when a patient receives treatment (D = 1), and S_0(t) = P(T_0 > t) where T_0 is the potential event time when a patient receives control (D = 0).

We used the same algorithm as in our previous studies (Wan and Mitra 2018; Wan, Small, and Mitra 2018; Wan 2019) to generate the simulation data:

  1. To generate β_2 = (β_{2,1}, β_{2,2}, β_{2,3}, β_{2,4}), the elements of the coefficient vector were first sampled randomly from {1, 2, 3, …, 9} and the vector was then normalized to unit length; the sign of each element was assigned by a Bernoulli(p = 0.5) draw. β_2 equals k times this normalized vector, where k determines the magnitude of confounding and was set to 0.3 and 1.2, representing low and high levels of confounding. We repeated the same procedure to generate α_1 = (α_{1,1}, α_{1,2}, α_{1,3}, α_{1,4}), but α_1 was set to 1 times its normalized vector.

  2. For each pair of β_2 and α_1, confounding variables X1, X2 ~ Bernoulli(p = 0.5) and X3, X4 ~ N(0, 1) were generated independently with sample size n = 2000. We standardized X1 and X2 to zero mean and unit standard deviation. The treatment variable D was generated from the treatment model (8), with the intercept α_0 set to −1.5 so that roughly 20% of simulated subjects received treatment. Next, the time to event T was generated using Eq. (9) with the baseline hazard from Weibull(1.5, 2). β_1 was set at {0, 0.3}, representing null and non-null treatment effects. The marginal hazards ratio δ was computed using the approach of Austin (2013). To generate censored scenarios, the administrative censoring time C_{1i} ~ Uniform(8, 12) and the random censoring time C_{ri} ~ Weibull(1.2, θ). We searched for the θ that yields total censoring rates of 30% and 50% in the simulated data.

  3. In each simulated dataset, we estimated the PS for every subject using a logistic regression model. A nearest-neighbor matching algorithm was used to match each treated subject with a control subject on the logit of the PS, without replacement, using a caliper of width 0.005. In the matched samples, we performed the following analyses: (1) the "naive" Cox model; (2) the "robust" Cox model; and (3) the stratified Cox model. This simulation process was repeated 1000 times. The treatment-effect estimates and the model-based standard errors for each method were computed in each simulated dataset.

  4. We sampled 10 coefficient pairs from each of five distance intervals {[0, 0.2], (0.2, 0.4], (0.4, 0.6], (0.6, 0.8], (0.8, 1]}. In summary, we examine the magnitude of confounding (k = 0.3, 1.2), the size of the conditional treatment effect (β_1 = 0, 0.3), the censoring level (0%, 30%, 50%), the distance intervals (5 levels), and 10 coefficient pairs in each interval. We repeated steps 1–3 for each of the 300 scenarios. It is worth noting that the censoring parameter θ needs to be computed for each of the 300 scenarios, which makes manual selection very difficult.
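For reference, the coefficient-vector generation in step 1 and a sine dissimilarity between the normalized treatment- and outcome-model coefficient vectors can be sketched as follows (Python; defining the metric as sin = sqrt(1 − cos²) of the angle between the two vectors is our reading of the paper's distance, so treat it as an assumption):

```python
import numpy as np

rng = np.random.default_rng(5)

def random_unit_coefs(p=4):
    """Step 1: magnitudes sampled from {1, ..., 9}, random signs, then normalized to unit length."""
    v = rng.integers(1, 10, size=p) * rng.choice([-1.0, 1.0], size=p)
    return v / np.linalg.norm(v)

def sine_distance(u, v):
    """Sine dissimilarity between coefficient directions: sqrt(1 - cos^2)."""
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return np.sqrt(max(0.0, 1.0 - cos ** 2))

beta2 = 0.3 * random_unit_coefs()    # outcome-model confounder coefficients, k = 0.3
alpha1 = 1.0 * random_unit_coefs()   # treatment-model confounder coefficients
d = sine_distance(beta2, alpha1)
```

The distance is 0 for proportional coefficient vectors and 1 for orthogonal ones, matching the five binning intervals above.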

As shown in Figure 2, the naive, robust, and stratified Cox models are all unbiased when the null hypothesis is true; the estimates center around zero in all scenarios. However, when there is a true treatment effect (Figure 3), all three models are biased in estimating the conditional hazards ratios. These biases increased with larger distance metrics and larger confounding effects. It is also worth noting that censored data were associated with smaller biases. Figure 4 reveals the pattern of bias in estimating marginal hazards ratios. In contrast, larger dissimilarity between the αs and βs was associated with smaller bias. The naive and robust Cox models were less biased than the stratified Cox model, larger confounding was associated with larger bias, and censored data tended to be associated with larger bias. We observe from Figure 5 that the robust variance estimates were closer to the empirical variance estimates than the model-based variance estimates were. This confirms the necessity of using robust variance methods in PS analyses.

Figure 2.

The simulation results under H0: β_1 = 0. The x-axis denotes the sine dissimilarity metric intervals; 5 denotes the most dissimilar interval (0.8, 1] and 1 denotes the interval [0, 0.2]. The y-axis denotes the estimates from the naive/robust Cox and stratified models. Red denotes a confounding effect of 0.3 and green a confounding effect of 1.2.

Figure 3.

The simulation results under Ha: β_1 = 0.3. The x-axis denotes the sine dissimilarity metric intervals; 5 denotes the most dissimilar interval (0.8, 1] and 1 denotes the interval [0, 0.2]. The y-axis denotes the estimates from the naive/robust Cox and stratified models. Red denotes a confounding effect of 0.3 and green a confounding effect of 1.2.

Figure 4.

The x-axis denotes the sine dissimilarity metric intervals; 5 denotes the most dissimilar interval (0.8, 1] and 1 denotes the interval [0, 0.2]. The y-axis denotes the difference between the averaged estimates and the log marginal hazards ratios. Red denotes a confounding effect of 0.3 and green a confounding effect of 1.2.

Figure 5.

X-axis denotes the absolute difference between the averaged robust variance estimates and the averaged empirical variance estimates. Y-axis denotes the absolute difference between the averaged model-based variance estimates and the averaged empirical variance estimates.

6. Conclusion

Simulation studies have long been used to assess the properties of statistical methods for time-to-event outcomes, and the censoring rate is often one factor under investigation. This article extends our general framework for simulating censored survival data to the more realistic setting in which study subjects are enrolled randomly during an accrual period and are then followed until one of the following occurs: the event of interest, early withdrawal due to a random censoring event, or the end of the study, whichever comes first. The approach relies on numerical integration and a root-finding algorithm to compute the value of the censoring parameter, so we avoid imposing specific distributional forms on the covariates. As demonstrated in our simulation study of PS matching estimators, the approach is particularly useful when censored survival data must be simulated for many different scenarios, because manual selection of the censoring parameter is then difficult. For simplicity and practicality, we made a constant-enrollment assumption, which could be violated in practice; if a reasonable distribution for the enrollment rate can be specified, the algorithm can be modified accordingly to accommodate non-constant enrollment settings. Our improved approach could be an important tool for the design of simulation studies with survival outcomes.

Supplementary Material

supplementalRcode

Nomenclature

X_i

the 1×p vector of baseline covariates for the ith subject

T_i

the time to the event of interest for the ith subject

E_i

the time to entry for patient i

C_{1i}

the time to the administrative censoring event

C_{ri}

the time to the occurrence of a random right censoring event

Y_i

the observed follow-up time

δ_i

the censoring indicator

α, ν

the shape and scale parameters of the Weibull distribution

β

the p×1 vector of regression coefficients in the proportional hazards model

λ

the exponential of the linear predictor in the proportional hazards model

A Sample R code


library("pracma")

## Draw a large sample of lambda_i = exp((beta0 - X_i beta)/alpha) (Sec. 4, alpha = 1.5)
n <- 1000000
u1 <- rnorm(n, 0, 1)
u2 <- runif(n, 0, 1)
u3 <- rbinom(n, 1, 0.5)
u4 <- rpois(n, 5)

alpha.t <- 1.5   # shape of the event-time (Weibull) distribution
alpha.c <- 1.2   # shape of the Weibull censoring distribution

a1 <- 0.2 * alpha.t
a2 <- -0.2 * alpha.t
a3 <- 0.1 * alpha.t
a4 <- -0.1 * alpha.t
a0 <- -1.040     # -log(nu^alpha) with nu = 2 and alpha = 1.5

k <- -(a0 + a1 * u1 + a2 * u2 + a3 * u3 + a4 * u4) / alpha.t
lambda <- exp(k)

### kernel smoothing estimate of f_lambda ###
dens <- density(lambda, n = 1500, bw = 0.01, from = 0, to = 16, na.rm = TRUE)
y.loess <- loess(y ~ x, data = data.frame(x = dens$x, y = dens$y), span = 0.1)

### nonparametric density estimate for f_lambda(lambda) ###
density.fun.lambda <- function(x) {
  predict(y.loess, newdata = data.frame(x = x))
}

### subject to a mixture of administrative and random right censoring ###

## integrand of the first term in Eq. (3): T < Cr < C1
f.fun1 <- function(c1, cr, arg2) {
  theta <- arg2[1]; lambda.i <- arg2[2]
  alpha.c <- arg2[3]; alpha.t <- arg2[4]; a <- arg2[5]
  (1 / a) * dweibull(cr, alpha.c, theta) * (1 - exp(-(cr / lambda.i)^alpha.t))
}

## integrand of the second term in Eq. (3): T < C1 < Cr,
## with the inner Cr-integral P(Cr > c1) = exp(-(c1/theta)^alpha.c) done analytically
f.fun2 <- function(c1, arg2) {
  theta <- arg2[1]; lambda.i <- arg2[2]
  alpha.c <- arg2[3]; alpha.t <- arg2[4]; a <- arg2[5]
  (1 / a) * exp(-(c1 / theta)^alpha.c) * (1 - exp(-(c1 / lambda.i)^alpha.t))
}

## censoring rate at theta minus the target proportion p, Eq. (7)
censor.prop <- function(theta, arg1) {
  p <- arg1[1]
  cen.P <- integrate(function(u) {
    sapply(u, function(u) {
      arg2 <- c(theta, u, arg1[-1])
      prob.i <- 1 - (integral2(f.fun1, c1Min1, c1Max1, crMin1, crMax1,
                               arg2 = arg2)$Q +
                     integral(f.fun2, c1Min2, c1Max2, arg2 = arg2,
                              reltol = 1e-10))
      prob.i * density.fun.lambda(u)
    })
  }, 0, 16)$value
  cen.P - p
}

p <- 0.3   # target censoring proportion
l <- 12    # total study length
a <- 2     # accrual period
crMin1 <- 0; crMax1 <- function(c1) c1
c1Min1 <- l - a; c1Max1 <- l
c1Min2 <- l - a; c1Max2 <- l

arg1 <- c(p, alpha.c, alpha.t, a)

### censoring parameter ###
theta <- uniroot(censor.prop, arg1 = arg1, c(0.01, 100), tol = 1e-8)$root

References

  1. Aalen OO, Cook RJ, and Roysland K. 2015. Does Cox analysis of a randomized survival study yield a causal treatment effect? Lifetime Data Analysis 21 (4):579–93. doi: 10.1007/s10985-015-9335-y.
  2. Austin PC. 2013. The performance of different propensity score methods for estimating marginal hazards ratios. Statistics in Medicine 32 (16):2837–49. doi: 10.1002/sim.5705.
  3. Austin PC, Grootendorst P, Normand SL, and Anderson GM. 2007. Conditioning on the propensity score can result in biased estimation of common measures of treatment effect: A Monte Carlo study. Statistics in Medicine 26 (4):754–68. doi: 10.1002/sim.2618.
  4. Venables WN, and Ripley BD. 2002. Modern applied statistics with S. 4th ed. New York, NY: Springer.
  5. Wan F. 2017. Simulating survival data with predefined censoring rates for proportional hazards models. Statistics in Medicine 36 (5):838–54. doi: 10.1002/sim.7178.
  6. Wan F. 2019. Matched or unmatched analyses with propensity-score-matched data? Statistics in Medicine 38 (2):289–300. doi: 10.1002/sim.7976.
  7. Wan F, and Mitra N. 2018. An evaluation of bias in propensity score adjusted non-linear regression models. Statistical Methods in Medical Research 27 (3):846–62. doi: 10.1177/0962280216643739.
  8. Wan F, Small D, and Mitra N. 2018. A general approach to evaluating the bias of 2-stage instrumental variable estimators for proportional hazards models. Statistics in Medicine 37 (12):1997–2015. doi: 10.1002/sim.7636.
  9. Wu JR. 2017. Single-arm phase II survival trial design under the proportional hazards model. Statistics in Biopharmaceutical Research 9 (1):25–34. doi: 10.1080/19466315.2016.1174147.
