Propensity Score Estimation in the Presence of Length-biased Sampling: A Nonparametric Adjustment Approach

Ashkan Ertefaie; Masoud Asgharian; David Stephens

doi:10.1002/sta4.46

. Author manuscript; available in PMC: 2015 Jan 1.

Published in final edited form as: Stat. 2014 Mar 27;3(1):83–94. doi: 10.1002/sta4.46

Propensity Score Estimation in the Presence of Length-biased Sampling: A Nonparametric Adjustment Approach

Ashkan Ertefaie ^a,^*, Masoud Asgharian ^b, David Stephens ^b

PMCID: PMC4142657 NIHMSID: NIHMS575212 PMID: 25170178

Abstract

The pervasive use of prevalent cohort studies on disease duration increasingly calls for an appropriate methodology to account for the biases that invariably accompany samples formed by such data. It is well-known, for example, that subjects with shorter lifetime are less likely to be present in such studies. Moreover, certain covariate values could be preferentially selected into the sample, being linked to the long-term survivors. The existing methodology for estimating the propensity score using data collected on prevalent cases requires the correct conditional survival/hazard function given the treatment and covariates. This requirement can be alleviated if the disease under study has stationary incidence, the so-called stationarity assumption. We propose a nonparametric adjustment technique based on a weighted estimating equation for estimating the propensity score which does not require modeling the conditional survival/hazard function when the stationarity assumption holds. The estimator’s large-sample properties are established and its small-sample behavior is studied via simulation. The estimated propensity score is utilized to estimate the survival curves.

Keywords: Propensity score, Length-biased sampling, Causal inference, Survival curve

1. Introduction

Survival or failure time data typically comprise an initiating event, say onset of a disease, and a terminating event, say death. In an ideal situation, recruited subjects have not experienced the initiating event, the so-called incident cases. These cases are then followed to a terminating event or censoring, say loss to follow-up. In many practical situations, however, recruiting incident cases is infeasible due to logistic or other constraints. In such circumstances, subjects who have experienced the initiating event prior to the start of the study, so-called prevalent cases, are recruited. It is well known that these cases tend to have a longer survival time, and hence form a biased sample from the target population. This bias is termed length bias when the initiating events are generated by a stationary Poisson process (Cox & Lewis, 1966; Zelen & Feinlein, 1969), the so-colled stationarity assumption.

Studies on length-biased sampling can be traced as far back as Wicksell (1925) and his corpuscle problem. The phenomenon was later noticed by Fisher (1934) in his article on methods of ascertainment. Neyman (1955) discussed length-biased sampling further and coined the term incidence-prevalence bias. Cox (1969) studied length-biased sampling in industrial applications, while Zelen & Feinlein (1969) observed the same bias in screening tests for disease prevalence (Asgharian et al., 2002; Asgharian & Wolfson, 2005). More recently, Shen et al. (2009), Qin & Shen (2010) and Ning et al. (2010) have studied the analysis of covariates under biased sampling.

In observational studies, treatment is assigned to the experimental units without randomization. Thus, in each treatment group, the covariate distributions may be imbalanced which may lead to bias in estimating the treatment effect if the covariate imbalance is not properly taken into account (Cochran & Rubin, 1973; Rubin, 1973). The propensity score is a tool that is widely used in causal inference to adjust for this source of bias (Robins et al., 2000; Hernán et al., 2000). Rosenbaum & Rubin (1983) define the propensity score for a binary treatment D as p(D = 1|X) where X is a vector of measured covariates. They show that under some assumptions, treatment is independent of the covariates inside each propensity score stratum (the balancing property of the propensity score).

In cases where the sample is not representative of the population, naive propensity score estimation will not in general have the balancing property. Cheng & Wang (2012) develop a method that consistently estimates the parameters of the propensity score from prevalent survival data. They also present a method that can be used in a special case of length-biased sampling. Their method requires correct specification of the conditional hazard model given the treatment and covariates. We refer to their estimator as CW in the sequel.

Our goal is to develop a method that estimates the propensity score using a weighted logistic regression where weights are estimated nonparametrically. Our estimating equation is designed specifically for length-biased data, i.e., for a disease with stationary incidence. Unlike the method proposed by CW, our method does not require any model specification for the conditional failure time given the exposure and the covariates. We also generalize a nonparametric survival curve estimation method introduced by Huang & Qin (2011) to accommodate confounding as well as length-biased sampling.

2. Length-biased sampling

In this section, we introduce concepts and notations necessary to formulate problems involving length-biased sampling. We adopt the common modeling framework for prevalent cohort studies. We assume that affected individuals in the study population develop the condition of interest (onset) according to some stochastic mechanism at random points in calendar time, and undergo a terminal event (failure) at some subsequent time point that is also determined by a stochastic mechanism. Individuals enter into the study at some census time, and are followed up until the terminal or censoring event.

2.1. Notations

Let T^pop be the time measured from the onset to failure time in the target population with an absolutely continuous distribution F and density f. Also, let D^pop and X^pop be the binary treatment variable and the vector of covariates, respectively. Let T be the same measured time for observed subjects with distribution F_LB. The variables with superscript pop represent the population variables; variables without pop denote the observed truncated variables. It is well known that if the onset times are generated by a stationary Poisson process, then

F_{L B} (t) = \frac{\int_{0}^{t} s d F (s)}{\int_{0}^{\infty} s d F (s)} = \frac{1}{μ} \int_{0}^{t} s d F (s) and f_{L B} (t) = \frac{t f (t)}{μ},

(1)

where f_LB is the density function of F_LB and μ is the mean survival time under F. The observed event time, T, can be written A + R, when A is the time from the onset of the disease to the recruitment time, and R, the residual life time, is the time from recruitment to the event, also called backward and forward recurrence times, respectively. When individuals are also subject to right-censoring, the observed survival time is Y = A + min(R, C), where C is a censoring time measured from the recruitment to the loss to follow-up; for all subjects, both A and min(R, C) are observed. The censoring indicator is denoted by δ (δ = 1 indicating failure). The sample consists of (y_i, a_i, δ_i, d_i, x_i) for n independent subjects. The following diagram illustrates the different random quantities introduced in this Section.

graphic file with name nihms575212f2.jpg

Throughout the paper, we assume that the following assumptions hold:

A1. The variable (T^pop, D^pop, X^pop) is independent of the calendar time of the onset of the disease.
A2. The disease has stationary incidence, i.e., the disease incidence occurs at a constant rate.
A3. The censoring time C is independent of (A, R, D, X).

2.2. Potential outcomes

We use the counterfactual or potential outcome framework to define the causal effect of interest. Potential outcome models are introduced by Neyman (1990) and Rubin (1978) for time independent treatment. We define the counterfactual values (A(d), R(d), Y(d)) corresponding to the backward, forward recurrence times, and observed survival time, respectively, had – possibly contrary to fact – the treatment taken the value d. Also, let T^pop(d) denote the counterfactual response if contrary to the fact that all the individuals would have received the treatment D = d, and let D denote the treatment received. The observed response, T^pop, is defined as DT^pop (1) + (1 − D)T^pop (0).

We make the following identifiability assumptions to link the counterfactual outcome and the observed data (Robins, 1994, 1997):

A4. Consistency: Potential outcome for a treatment corresponds to the actual outcome if assigned to that treatment.
A5. No unmeasured confounding: Given the observed covariates X, the counterfactual outcome Y (d) is independent of the assignment of treatment.
A6. Positivity: Let p_D|X (d|x) be the conditional probability of receiving treatment d given X = x. For each treatment d and for each possible value x, p_D|X (d|x) > 0.

3. Propensity score estimation under length-biased sampling

Assuming a logit model for the propensity score in the target population, we have

π (x, α) = p (D^{pop} = 1 | X^{pop} = x) = \frac{exp (α x)}{1 + exp (α x)} .

(2)

where α is a p × 1 vector of parameters. The vector of covariates X may include a column of 1s. It can be shown that under assumption A2, we have

p_{L B} (D = 1 | X = x) = \frac{μ_{1} (x, θ) p (D^{pop} = 1 | X^{pop} = x)}{π (x, α) μ_{1} (x, θ) + (1 - π (x, α)) μ_{0} (x, θ)},

(3)

where $μ_{d} (x, θ) = \int_{0}^{\infty} p (T^{pop} (d) \geq a | X = x, θ) d a$ for d = 0, 1 is the conditional counterfactual mean failure time if treated at D = d (Bergeron et al. (2008)). Note that under assumptions A5 – 7, p(T^pop(d) ≥ a|, X = x, θ) = p(T^pop ≥ a|D = d, X = x, θ) where θ parametrizes the conditional density of T^pop.

Assuming the proportional hazard model, i.e., λ_T^pop (u|D^pop = d, X^pop = x) = λ₀(u)e^γd+βx, Cheng & Wang (2012) show that the parameter of the propensity score can be consistently estimated using the logistic regression but adjusted for the ‘offset’ term log(α̂(x; Λ̂, γ̂, β̂)) as the intercept where $α̂ (x; Λ̂, γ̂, β̂) = \frac{\sum_{i = 1}^{n} exp [- Λ̂ (a_{i}) exp (γ̂ + β̂ x)}{\sum_{i = 1}^{n} exp [- Λ̂ (a_{i}) exp (β̂ x)]}$ . The cumulative hazard function Λ is estimated using the Breslow estimator. The consistency of the parameters of the propensity score in the CW method relies on the correct specification of the conditional hazard model given the treatment and covariates.

When the initiating event of the duration variable has stationary incidence, it is possible to devise a robust method for estimating the propensity score that does not require knowledge of the conditional hazard model. See among others Wolfson et al. (2001) and De Uña-álvarez (2004) for examples of such duration variables in medical and labor force studies, respectively.

Let f (t|d, x, θ) be the unbiased conditional density of survival times given the covariates and treatment. Then, under assumptions A1 and A2, the joint density of (A, T) given (D, X) is $f (t | d, x, θ) / \int_{0}^{\infty} u f (u | d, x, θ) d u I (t > a > 0)$ as shown in Asgharian et al. (2006). Assumption A3 is used to show that

p (Y \in (t, t + d t), A \in (a, a + d a), δ = 1 | d, x, θ) = \frac{f (t | d, x, θ) S_{c} (t - a) d t d a}{μ_{d} (x, θ)},

where S_c (․) is the survival function for the residual censoring variable C, respectively. By integrating the above equation over 0 < a < t, we have

p (Y \in (t, t + d t), δ = 1 | d, x, θ) = \frac{f (t | d, x, θ) w (t) d t}{μ_{d} (x, θ)},

(4)

where $w (t) = \int_{0}^{t} S_{C} (s) d s$ (Shen et al., 2009; Qin & Shen, 2010).

We construct an unbiased estimating equation for estimating the parameters of the propensity score using the weighted logistic regression where weights are estimated nonparametrically. Let F (d|x) be the unbiased conditional distribution of the treatment given the covariates. Then

𝔼 [δ \frac{(D - π (x, α))}{w (Y)} | X = x] = 𝔼 [𝔼 {δ \frac{(d - π (x, α))}{w (Y)} | D = d, X = x}] = \int (d - π (x, α)) \int \frac{f (y | x, d, θ) w (y)}{w (y) μ_{d} (x, θ)} d y \times \frac{μ_{d} (x, θ) d F (d | x)}{μ (x, α, θ)} = \frac{1}{μ (x, α, θ)} \int (d - π (x, α)) d F (d | x) = 0 .

The second equality follows from equations (4), and (3). The last equality holds since f (y|x, d) is a proper density and (2). An unbiased estimating equation for α is therefore

U (α) = \sum_{i = 1}^{n} U_{i} (α) = \sum_{i = 1}^{n} δ_{i} x_{i}^{⊤} \frac{(d_{i} - π (x_{i}, α))}{ŵ (y_{i})} = 0,

(5)

where $ŵ (y) = \int_{0}^{y} Ŝ_{C} (s) d s$ and Ŝ_C is the Kaplan-Meier estimator of the survivor function of the residual censoring variable C.

The following theorem presents the asymptotic properties of the estimators obtained by (5) in the presence of length-biased sampling when w(․) is replaced by its estimated value.

Theorem 1

Let α̂ be an estimator obtained by (5). Then under conditions C1 – C6 and assumptions A1 – A6, α̂ → α₀ in probability as n → ∞. Moreover,

\sqrt{n} (α̂ - α) \overset{d}{\to} 𝒩 (0, η (α)),

where η(α) is given in the Appendix.

Proof

See the Appendix.

A consistent plug-in estimator of η(α) is presented in the Appendix.

Note that the censored individuals contribute to this estimating equation through Ŝ_C as well as the uncensored ones. Let $M_{i C} (t) = 𝟙 (Y_{i} - A_{i} < t, δ_{i} = 0) - \int_{0}^{t} 𝟙 (min (Y_{i} - A_{i}, C_{i}) > u) d Λ_{C} (u)$ , with Λ_C(․) be the cumulative hazard function of the censoring variable. As a part of the proof of Theorem 1, we show that as n → ∞,

\frac{1}{n} \sum_{i = 1}^{n} U_{i} (α) = \frac{1}{n} \sum_{i = 1}^{n} [δ_{i} x_{i}^{⊤} \frac{d_{i} - π (x_{i}, α)}{w (y_{i})} + \int_{0}^{s} \frac{v (t) d M_{i C} (t)}{S_{C} (t) S_{R} (t)}] + o_{p} (n^{- 1 / 2}), s = sup [t : p (C > t) > 0],

(6)

where S_R(u) is the survival function of the residual life time and the function v (t) is defined in the Appendix. The second part of the summation in the RHS of (6) is often referred to as the augmentation element (Rotnitzky & Robins, 2005).

4. Survival Curve Estimation

Various methods have been proposed to adjust for length-biased sampling, including: the truncation product-limit estimator (Wang et al., 1986) and the maximum pseudo-partial likelihood estimator (Luo & Tsai, 2009). Here, we generalize the method introduced by Huang & Qin (2011) which incorporates the information in the marginal distribution of the truncation time from disease onset to recruitment time. The bias induced by confounding can be adjusted by creating a pseudo-population using the inverse probability of being in the group that the individuals actually belong to (Nieto & Coresh, 1996; Xie & Liu, 2005; Cole & Hernán, 2004).

Our goal is to estimate the counterfactual survival function S_d(y), where S_d(y) = 𝔼[I(T^pop(d) > t)]. Under assumptions A4–6, the function S_d(y) can be identified using the observed data as follows

S_{d} (t) = 𝔼 [I (T^{pop} (d) > t)] = 𝔼 [\frac{I (D = d)}{p (D = d | X)} I (T^{pop} > t)] .

Following Huang & Qin (2011), S_d(t) can be estimated by

{S̃}_{d} (t) = \prod_{u \in [0, t]} [1 - d {Λ̂}_{d} (u)],

where

d {Λ̂}_{i d} = \int_{0}^{t} \frac{d Ñ_{d} (t)}{{R̃}_{d} (t)}

with

Ñ_{d} (t) = 1 / n \sum_{i = 1}^{n} \frac{I (d_{i} = d)}{p (d_{i} = d | x_{i})} δ_{i} I (y_{i} \leq t) {R̃}_{d} (t) = 1 / n \sum_{i = 1}^{n} \frac{I (d_{i} = d)}{p (d_{i} = d | x_{i})} I (y_{i} \geq t) + {S̃}_{d A} (t) .

Also, the product-limit estimator S̃_dA (t) is

{S̃}_{d A} (t) = \prod_{u \in [0, t]} [1 - \frac{d {Q̃}_{d} (u)}{{K̃}_{d} (u)}],

where

{Q̃}_{d} (u) = 1 / n \sum_{i = 1}^{n} \frac{I (d_{i} = d)}{p (d_{i} = d | x_{i})} [I (a_{i} \leq t) + δ_{i} I (y_{i} - a_{i} \leq t)] {K̃}_{d} (t) = 1 / n \sum_{i = 1}^{n} \frac{I (d_{i} = d)}{p (d_{i} = d | x_{i})} [I (a_{i} \geq t) + I (y_{i} - a_{i} \geq t)] .

We utilize our proposed estimating equation (5) to estimate the propensity score and replace p(d_i = d|x_i) with p(d_i = d|x_i,α̂).

5. Simulation Studies

In this Section, we describe a simulation study to examine the performance of the proposed propensity score estimator. Our simulation consists of 500 datasets of sizes 500 and 5000. The censoring variable C is generated from a uniform distribution in the interval (0, τ) where the parameter τ is set such that it results in a desired censoring proportion.

To create length-biased samples, we generate a variable A from a uniform distribution (0, ρ) and ignore those whose generated unbiased failure time is less than A.

We generated the unbiased failure times from the following hazard model h(t|d, x) = 0.2 exp{d − 0.5x₁ + 0.5x₂ + 0.5dx₁ − 0.5dx₂}, where $D ~ Bernoulli (\frac{exp {- 0.1 + 1 x_{1} - 1 x_{2}}}{1 + exp {- 0.1 + 1 x_{1} - 1 x_{2}}})$ with X₁ and X₂ distributed according to N(0, σ = 0.5). We estimate the parameters of the propensity score using CW and the proposed method and compare the results with the true values. We assume three different censoring proportions 10%, 20% and 30%.

We estimate the parameters of the propensity score using four different estimators: α̂ is the estimator obtained by the proposed method; α̂_w and ${α̂}_{w}^{m}$ are the estimator obtained by the CW method when the hazard model is correctly and incorrectly specified, respectively, and α̂_Un is obtained by a naive method that does not adjust for the length-biased sampling. In ${α̂}_{w}^{m}$ , we assume that the interaction between the treatment D and the covariate X₂ has been ignored in the fitted hazard model.

Table 1 summarizes the estimated propensity score parameters and their standard errors. Our simulation results confirm that the proposed estimating equation (5) adjusts the length-biased sampling. The standard errors, however, are larger than the one obtained by the CW method, which is the price we pay for relaxing the modeling assumption of the hazard model. As we expected, CW estimator is highly sensitive to model misspecification even when just one of the interaction terms is ignored. Specifically, when the interaction term between the treatment and variable X₂ is omitted in the fitted hazard model, the estimated coefficient corresponding to X₂ in the propensity score model is biased. In general, if variables in the study are correlated, then missing one variable in the hazard model may cause bias in the estimation of other variables in the propensity score model as well.

Table 1.

Simulation: Propensity score parameter estimation. α̂: Estimated parameters using the proposed method. α̂_w: Estimated parameters using the CW method. ${α̂}_{w}^{m}$ : Estimated parameters using the CW method when the hazard model is misspecified. α̂_Un: Estimated parameters when unadjusted for the length-biased sampling. α=(−0.1,1,−1).

Method

Bias

S.D.

Bias

S.D.

10 % Cens.

n = 500

n = 5000

α̂

(0.01,0.04,0.03)

(0.21,0.43,0.45)

(0.00,0.01,0.01)

(0.09,0.18,0.19)

α̂_w

(0.08,0.02,0.01)

(0.17,0.28,0.28)

(0.01,0.00,0.01)

(0.06,0.10,0.10)

{α̂}_{w}^{m}

(0.03,0.02,0.49)

(0.17,0.29,0.23)

(0.08,0.03,0.48)

(0.06,0.09,0.08)

α̂_Un

(0.10,0.50,0.50)

(0.11,0.21,0.22)

(0.10,0.51,0.50)

(0.04,0.07,0.08)

20 % Cens.

n = 500

n = 5000

α̂

(0.01,0.05,0.05)

(0.22,0.42,0.43)

(0.02,0.03,0.03)

(0.09,0.18,0.20)

α̂_w

(0.02,0.01,0.01)

(0.17,0.29,0.27)

(0.02,0.01,0.01)

(0.06,0.10,0.10)

{α̂}_{w}^{m}

(0.01,0.02,0.47)

(0.16,0.29,0.21)

(0.02,0.05,0.46)

(0.06,0.10,0.08)

α̂_Un

(0.11,0.49,0.50)

(0.10,0.22,0.20)

(0.10,0.51,0.50)

(0.04,0.07,0.08)

30 % Cens.

n = 500

n = 5000

α̂

(0.04,0.08,0.09)

(0.22,0.44,0.44)

(0.03,0.06,0.07)

(0.10,0.19,0.20)

α̂_w

(0.07,0.00,0.02)

(0.17,0.29,0.29)

(0.08,0.03,0.02)

(0.06,0.10,0.11)

{α̂}_{w}^{m}

(0.07,0.02,0.45)

(0.17,0.29,0.21)

(0.03,0.06,0.45)

(0.06,0.10,0.08)

α̂_Un

(0.11,0.49,0.50)

(0.10,0.22,0.20)

(0.10,0.51,0.50)

(0.04,0.07,0.08)

Open in a new tab

5.1. Survival Curve Estimation

Here, we compare our proposed method of estimating the survival curves if treated and untreated with two naive approaches. The naive method (NV1) estimates the propensity score without considering the length-biased sampling and the second naive method (NV2) ignores both the confounding and the length-biased sampling. We generated the unbiased failure times from a Weibull distribution with shape parameter 10 and scale parameter h₂(t|d, x) = 0.2 exp{3d + 2.5x₁ − 2x₂ − 2dx₁ + 1.1dx₂}, where $D ~ Bernoulli (\frac{exp {2 x_{1} - 2 x_{2} - 2 x_{1} x_{2}}}{1 + exp {2 x_{1} - 2 x_{2} - 2 x_{1} x_{2}}})$ with X₁ and X₂ are N(2, σ = 0.3). We report the result based on the censoring proportion 30%.

The survival curves for the treated and untreated groups are presented in Figure 1. The light and dark grey shaded areas are the survival curves based on the 500 datasets for treated and untreated individuals, respectively. The dark solid line is the true survival curve. As the plots show, the true survival curve lies entirely in the shaded area provided by our proposed estimator while ignoring either or both of the length-biased sampling and/or confounding result in a biased estimator.

Simulation: The estimated survival curves using the proposed (Prop.) and two naive estimators. The light and dark shaded areas are the treated and untreated survival curves, respectively. The solid line represents the true curve. NV1 is when the propensity score is naively estimated without considering the length-biased sampling. NV2 is when we ignore both the confounding and the length-biased sampling.

6. Discussion

We present a weighted estimating equation to estimate the parameters of the propensity score from right-censored length-biased samples. In many cases, recruiting prevalent cases is more efficient. However, it is well known that in these cases subjects with longer survival time have a greater chance to be selected. This may affect the distribution of the observed covariates (Bergeron et al., 2008). For example, if treated subjects tend to live longer, then these subjects will be over represented in the observed sample. As such, if the propensity score is fitted without adjusting for this source of bias, it will be skewed to the left. Recently, Cheng & Wang (2012) proposed a method to adjust for the length-biased sampling which requires the correctly specified conditional survival function given the treatment and covariates. This modeling assumption may limit the application of this approach, particularly, when an investigator is interested in estimating the marginal causal effect.

In our proposed method, we estimate the weights nonparametrically. Thus, unlike the existing methods, it does not require any modeling assumptions for the conditional hazard function given the treatment and covariates. Our method produces an unbiased estimator but the standard errors are larger than the method proposed by Cheng & Wang (2012).

Generalizing a nonparametric survival curve estimation method introduced by Huang & Qin (2011), we derive a method for estimating the counterfactual survival curves in the presence of length-biased sampling. The bias induced by confounding is adjusted by creating a pseudo-population using the inverse probability of being in the group to which the individuals actually belong. The treatment assignment probabilities are estimated using the proposed estimating equation.

We confined our attention to the stationary case; the methodology presented in this manuscript can, however, be extended to any other left-truncation cases as long as the left truncation distribution is known (Luo & Tsai, 2009).

Acknowledgement

This research was supported in part by NIDA grant P50 DA010075. The second and third authors acknowledge the support of Discovery Grants from the Natural Sciences and Engineering Research Council (NSERC) of Canada.

Appendix

In this section, we present the assumptions and proofs of the main result. The following conditions are required for establishing Theorem 1:

C.1 X is a p vector of bounded covariates, not contained in a (p − 1) dimensional hyperplane.
C.2 sup[t : p(R > t) > 0] ≥ sup[t : p(C > t) > 0]=s and p(δ = 1) > 0.
C.3 $\int_{0}^{s} [{(\int_{t}^{s} S_{C} (v) d v)}^{2} / (S_{C}^{2} (t) S_{R} (t))] d S_{C} (t) < \infty$ .
C.4 det $𝔼 [{δ X^{⊤} \frac{D - π (X, α)}{w (Y)}}^{\otimes 2}] < \infty$ .
C.5 $Λ = 𝔼 [δ X^{⊤} \frac{\partial π (X, α)}{\partial α}]$ is nonsingular.
C.6 det $[\int_{0}^{s} v^{2} (t) / (S_{C}^{2} (t) S_{R} (t)) d S_{C} (t)] < \infty$ where $v (t) = 𝔼 [\frac{δ 𝟙 (Y > t) X^{⊤} [D - π (X, α))] \int_{t}^{Y} S_{C} (v) d v}{w^{2} (Y)}]$ ,

C.2 is an identifiability condition (Wang, 1991), C.3–C.6 are required to obtain an estimator with a finite variance.

Proof of Theorem 1. The stochastic process M_C(s) has mean zero,

𝔼 [M_{C} (s)] = 𝔼 [𝟙 (C < Y - A < s)] - \int_{0}^{s} 𝔼 [𝟙 (Y - A > u) . 𝟙 (C > u)] d Λ_{C} (u) = \int_{0}^{s} S_{C} (u) λ_{C} (u) S_{R} (u) d u - \int_{0}^{s} S_{C} (u) S_{R} (u) d Λ_{C} (u) = 0 .

Using the strong consistency of ŵ(y) to w(y) (Pepe & Fleming, 1991), we have

\frac{1}{ŵ (Y)} = \frac{1}{w (Y)} [1 + \frac{w (Y) - ŵ (Y)}{w (Y)}] + o_{p} (1) .

Thus

Ũ (α) = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} U_{i} (α) = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} δ_{i} x_{i}^{⊤} \frac{d_{i} - π (x_{i}, α)}{ŵ (y_{i})} = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} δ_{i} x_{i}^{⊤} \frac{d_{i} - π (x_{i}, α)}{w (y_{i})} [1 + \frac{w (y_{i}) - ŵ (y_{i})}{w (y_{i})}] + o_{p} (1), = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} [δ_{i} X_{i} \frac{D_{i} - π (X_{i}, α)}{w (Y_{i})} + \int_{0}^{s} \frac{v̂ (t) d M_{C} (t)}{S_{C} (t) S_{R} (t)}],

(7)

where

v̂ (t) = \frac{1}{n} \sum_{i = 1}^{n} [\frac{δ_{i} I (Y_{i} > t) X_{i} [D_{i} - π (X_{i}, α))] \int_{t}^{Y_{i}} S_{C} (v) d v}{w^{2} (Y_{i})}] .

The last equality (7) follows from the martingale integral representation $\sqrt{n} (ŵ (Y) - w (Y))$ (Shen et al., 2009; Qin & Shen, 2010).

Using the standard Taylor expansion, we can derive the asymptotic variance of the estimator as follows

η (α) = Λ' Σ^{- 1} Λ,

where

\begin{matrix} Σ = 𝔼 [Ũ (α) Ũ (α)] & = 𝔼 [{δ X^{⊤} \frac{D - π (X, α)}{w (Y)}}^{\otimes 2} {1 + \frac{w (Y) - ŵ (Y)}{w (Y)}}^{2}] \\ = 𝔼 [{δ X^{⊤} \frac{D - π (X, α)}{w (Y)} + \int_{0}^{s} \frac{v (t) d M_{C} (t)}{S_{C} (t) S_{R} (t)}}^{\otimes 2}] \\ Λ & = 𝔼 [\frac{\partial U_{i} (α)}{\partial α}] = 𝔼 [\frac{δ}{w (Y)} X^{⊤} \frac{\partial π (X, α)}{\partial α}] \end{matrix}

where $v (t) = 𝔼 [\frac{δ 𝟙 (Y > t) X^{⊤} [D - π (X, α))] \int_{t}^{Y} S_{C} (v) d v}{w^{2} (Y)}]$ Let ℙ_n be the empirical average. The components of the variance-covariance matrix η(α) can be consistently estimated by

Σ̂ = ℙ_{n} [{δ X^{⊤} \frac{D - π (X, α)}{ŵ (Y)} + \int_{0}^{s} \frac{v̂ (t) d {M̂}_{C} (t)}{Ŝ_{C} (t) Ŝ_{R} (t)}}^{\otimes 2}], Λ̂ = ℙ_{n} [\frac{\partial U_{i} (α)}{\partial α}] = ℙ_{n} [\frac{δ}{ŵ (Y)} X^{⊤} \frac{\partial π (X, α)}{\partial α}] .

Also, the stochastic process M_C(s) can be estimated by replacing the Λ_C(․) by its estimate, Λ̂_C(․).

References

Asgharian M, M’Lan CE, Wolfson DB. Length-biased sampling with right censoring. Journal of the American Statistical Association. 2002;97(457):201–209. [Google Scholar]
Asgharian M, Wolfson DB. Asymptotic behavior of the unconditional NPMLE of the length-biased survivor function from right censored prevalent cohort data. The Annals of Statistics. 2005;33(5):2109–2131. [Google Scholar]
Asgharian M, Wolfson DB, Zhang X. Checking stationarity of the incidence rate using prevalent cohort survival data. Statistics in Medicine. 2006;25(10):1751–1767. doi: 10.1002/sim.2326. [DOI] [PubMed] [Google Scholar]
Bergeron PJ, Asgharian M, Wolfson DB. Covariate bias induced by length-biased sampling of failure times. Journal of the American Statistical Association. 2008;103(482):737–742. [Google Scholar]
Cheng Y, Wang M. Estimating propensity scores and causal survival functions using prevalent survival data. Biometrics. 2012;68:707–716. doi: 10.1111/j.1541-0420.2012.01754.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cochran W, Rubin D. Controlling bias in observational studies: A review. Sankhyā: The Indian Journal of Statistics, Series A. 1973:417–446. [Google Scholar]
Cole SR, Hernán MA. Adjusted survival curves with inverse probability weights. Computer methods and programs in biomedicine. 2004;75(1):45–49. doi: 10.1016/j.cmpb.2003.10.004. [DOI] [PubMed] [Google Scholar]
Cox DR. Some Sampling Problems in Technology. New York: Wiley Interscience, in New Developments in Survey Sampling; 1969. pp. 506–527. [Google Scholar]
Cox DR, Lewis P. The statistical analysis of series of events. John Wiley and Sons; 1966. [Google Scholar]
De Uña-álvarez J. Nonparametric estimation under length-biased sampling and type I censoring: a moment based approach. Annals of the Institute of Statistical Mathematics. 2004;56(4):667–681. [Google Scholar]
Fisher RA. The effect of methods of ascertainment upon the estimation of frequencies. Annals of Human Genetics. 1934;6(1):13–25. [Google Scholar]
Hernán M, Brumback B, Robins J. Marginal structural models to estimate the causal effect of zidovudine on the survival of hiv-positive men. Epidemiology. 2000;11(5):561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]
Huang CY, Qin J. Nonparametric estimation for length-biased and right-censored data. Biometrika. 2011;98(1):177. doi: 10.1093/biomet/asq069. [DOI] [PMC free article] [PubMed] [Google Scholar]
Luo X, Tsai W. Nonparametric estimation for right-censored length-biased data: a pseudo-partial likelihood approach. Biometrika. 2009;96(4):873–886. [Google Scholar]
Neyman J. Statistics–servant of all science. Science. 1955;122(3166):401–406. doi: 10.1126/science.122.3166.401. [DOI] [PubMed] [Google Scholar]
Neyman J. On the application of probability theory to agricultural experiments. essay on principles. section 9. Translation of excerpts by D. Dabrowska and T. Speed. Statistical Science. 1990;6:462–47. [Google Scholar]
Nieto FJ, Coresh J. Adjusting survival curves for confounders: a review and a new method. American Journal of Epidemiology. 1996;143(10):1059. doi: 10.1093/oxfordjournals.aje.a008670. [DOI] [PubMed] [Google Scholar]
Ning J, Qin J, Shen Y. Non-parametric tests for right-censored data with biased sampling. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2010;72(5):609–630. doi: 10.1111/j.1467-9868.2010.00742.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pepe MS, Fleming TR. Weighted Kaplan-Meier statistics: Large sample and optimality considerations. Journal of the Royal Statistical Society. Series B (Methodological) 1991;53(2):341–352. [Google Scholar]
Qin J, Shen Y. Statistical methods for analyzing right-censored length-biased data under Cox model. Biometrics. 2010;66(2):382–392. doi: 10.1111/j.1541-0420.2009.01287.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Robins J. Causal inference from complex longitudinal data. Latent variable modeling and applications to causality. 1997:69–117. [Google Scholar]
Robins J, Hernán M, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
Robins JM. Correcting for non-compliance in randomized trials using structural nested mean models. Communications in Statistics-Theory and methods. 1994;23(8):2379–2412. [Google Scholar]
Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. [Google Scholar]
Rotnitzky A, Robins JM. Inverse probability weighting in survival analysis. Encyclopedia of Biostatistics. 2005 [Google Scholar]
Rubin D. The use of matched sampling and regression adjustment to remove bias in observational studies. Biometrics. 1973:185–203. [Google Scholar]
Rubin DB. Bayesian inference for causal effects: The role of randomization. The Annals of Statistics. 1978;6(1):34–58. [Google Scholar]
Shen Y, Ning J, Qin J. Analyzing length-biased data with semiparametric transformation and accelerated failure time models. Journal of the American Statistical Association. 2009;104(487):1192–1202. doi: 10.1198/jasa.2009.tm08614. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang MC. Nonparametric estimation from cross-sectional survival data. Journal of the American Statistical Association. 1991;86(413):130–143. [Google Scholar]
Wang MC, Jewell NP, Tsai WY. Asymptotic properties of the product limit estimate under random truncation. The Annals of Statistics. 1986;14(4):1597–1605. [Google Scholar]
Wicksell SD. The corpuscle problem: a mathematical study of a biometric problem. Biometrika. 1925;17(1/2):84–99. [Google Scholar]
Wolfson C, Wolfson DB, Asgharian M, M’Lan CE, Østbye T, Rockwood K, Hogan DB. A reevaluation of the duration of survival after the onset of dementia. New England Journal of Medicine. 2001;344(15):1111–1116. doi: 10.1056/NEJM200104123441501. [DOI] [PubMed] [Google Scholar]
Xie J, Liu C. Adjusted Kaplan–Meier estimator and log-rank test with inverse probability of treatment weighting for survival data. Statistics in Medicine. 2005;24(20):3089–3110. doi: 10.1002/sim.2174. [DOI] [PubMed] [Google Scholar]
Zelen M, Feinlein M. On the theory of screening for chronic diseases. Biometrika. 1969;56(3):601–614. [Google Scholar]

[R1] Asgharian M, M’Lan CE, Wolfson DB. Length-biased sampling with right censoring. Journal of the American Statistical Association. 2002;97(457):201–209. [Google Scholar]

[R2] Asgharian M, Wolfson DB. Asymptotic behavior of the unconditional NPMLE of the length-biased survivor function from right censored prevalent cohort data. The Annals of Statistics. 2005;33(5):2109–2131. [Google Scholar]

[R3] Asgharian M, Wolfson DB, Zhang X. Checking stationarity of the incidence rate using prevalent cohort survival data. Statistics in Medicine. 2006;25(10):1751–1767. doi: 10.1002/sim.2326. [DOI] [PubMed] [Google Scholar]

[R4] Bergeron PJ, Asgharian M, Wolfson DB. Covariate bias induced by length-biased sampling of failure times. Journal of the American Statistical Association. 2008;103(482):737–742. [Google Scholar]

[R5] Cheng Y, Wang M. Estimating propensity scores and causal survival functions using prevalent survival data. Biometrics. 2012;68:707–716. doi: 10.1111/j.1541-0420.2012.01754.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Cochran W, Rubin D. Controlling bias in observational studies: A review. Sankhyā: The Indian Journal of Statistics, Series A. 1973:417–446. [Google Scholar]

[R7] Cole SR, Hernán MA. Adjusted survival curves with inverse probability weights. Computer methods and programs in biomedicine. 2004;75(1):45–49. doi: 10.1016/j.cmpb.2003.10.004. [DOI] [PubMed] [Google Scholar]

[R8] Cox DR. Some Sampling Problems in Technology. New York: Wiley Interscience, in New Developments in Survey Sampling; 1969. pp. 506–527. [Google Scholar]

[R9] Cox DR, Lewis P. The statistical analysis of series of events. John Wiley and Sons; 1966. [Google Scholar]

[R10] De Uña-álvarez J. Nonparametric estimation under length-biased sampling and type I censoring: a moment based approach. Annals of the Institute of Statistical Mathematics. 2004;56(4):667–681. [Google Scholar]

[R11] Fisher RA. The effect of methods of ascertainment upon the estimation of frequencies. Annals of Human Genetics. 1934;6(1):13–25. [Google Scholar]

[R12] Hernán M, Brumback B, Robins J. Marginal structural models to estimate the causal effect of zidovudine on the survival of hiv-positive men. Epidemiology. 2000;11(5):561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]

[R13] Huang CY, Qin J. Nonparametric estimation for length-biased and right-censored data. Biometrika. 2011;98(1):177. doi: 10.1093/biomet/asq069. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Luo X, Tsai W. Nonparametric estimation for right-censored length-biased data: a pseudo-partial likelihood approach. Biometrika. 2009;96(4):873–886. [Google Scholar]

[R15] Neyman J. Statistics–servant of all science. Science. 1955;122(3166):401–406. doi: 10.1126/science.122.3166.401. [DOI] [PubMed] [Google Scholar]

[R16] Neyman J. On the application of probability theory to agricultural experiments. essay on principles. section 9. Translation of excerpts by D. Dabrowska and T. Speed. Statistical Science. 1990;6:462–47. [Google Scholar]

[R17] Nieto FJ, Coresh J. Adjusting survival curves for confounders: a review and a new method. American Journal of Epidemiology. 1996;143(10):1059. doi: 10.1093/oxfordjournals.aje.a008670. [DOI] [PubMed] [Google Scholar]

[R18] Ning J, Qin J, Shen Y. Non-parametric tests for right-censored data with biased sampling. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2010;72(5):609–630. doi: 10.1111/j.1467-9868.2010.00742.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] Pepe MS, Fleming TR. Weighted Kaplan-Meier statistics: Large sample and optimality considerations. Journal of the Royal Statistical Society. Series B (Methodological) 1991;53(2):341–352. [Google Scholar]

[R20] Qin J, Shen Y. Statistical methods for analyzing right-censored length-biased data under Cox model. Biometrics. 2010;66(2):382–392. doi: 10.1111/j.1541-0420.2009.01287.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Robins J. Causal inference from complex longitudinal data. Latent variable modeling and applications to causality. 1997:69–117. [Google Scholar]

[R22] Robins J, Hernán M, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]

[R23] Robins JM. Correcting for non-compliance in randomized trials using structural nested mean models. Communications in Statistics-Theory and methods. 1994;23(8):2379–2412. [Google Scholar]

[R24] Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. [Google Scholar]

[R25] Rotnitzky A, Robins JM. Inverse probability weighting in survival analysis. Encyclopedia of Biostatistics. 2005 [Google Scholar]

[R26] Rubin D. The use of matched sampling and regression adjustment to remove bias in observational studies. Biometrics. 1973:185–203. [Google Scholar]

[R27] Rubin DB. Bayesian inference for causal effects: The role of randomization. The Annals of Statistics. 1978;6(1):34–58. [Google Scholar]

[R28] Shen Y, Ning J, Qin J. Analyzing length-biased data with semiparametric transformation and accelerated failure time models. Journal of the American Statistical Association. 2009;104(487):1192–1202. doi: 10.1198/jasa.2009.tm08614. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Wang MC. Nonparametric estimation from cross-sectional survival data. Journal of the American Statistical Association. 1991;86(413):130–143. [Google Scholar]

[R30] Wang MC, Jewell NP, Tsai WY. Asymptotic properties of the product limit estimate under random truncation. The Annals of Statistics. 1986;14(4):1597–1605. [Google Scholar]

[R31] Wicksell SD. The corpuscle problem: a mathematical study of a biometric problem. Biometrika. 1925;17(1/2):84–99. [Google Scholar]

[R32] Wolfson C, Wolfson DB, Asgharian M, M’Lan CE, Østbye T, Rockwood K, Hogan DB. A reevaluation of the duration of survival after the onset of dementia. New England Journal of Medicine. 2001;344(15):1111–1116. doi: 10.1056/NEJM200104123441501. [DOI] [PubMed] [Google Scholar]

[R33] Xie J, Liu C. Adjusted Kaplan–Meier estimator and log-rank test with inverse probability of treatment weighting for survival data. Statistics in Medicine. 2005;24(20):3089–3110. doi: 10.1002/sim.2174. [DOI] [PubMed] [Google Scholar]

[R34] Zelen M, Feinlein M. On the theory of screening for chronic diseases. Biometrika. 1969;56(3):601–614. [Google Scholar]

PERMALINK

Propensity Score Estimation in the Presence of Length-biased Sampling: A Nonparametric Adjustment Approach

Ashkan Ertefaie

Masoud Asgharian

David Stephens

Abstract

1. Introduction

2. Length-biased sampling

2.1. Notations

2.2. Potential outcomes

3. Propensity score estimation under length-biased sampling

4. Survival Curve Estimation

5. Simulation Studies

Table 1.

5.1. Survival Curve Estimation

Figure 1.

6. Discussion

Acknowledgement

Appendix

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Propensity Score Estimation in the Presence of Length-biased Sampling: A Nonparametric Adjustment Approach

Ashkan Ertefaie

Masoud Asgharian

David Stephens

Abstract

1. Introduction

2. Length-biased sampling

2.1. Notations

2.2. Potential outcomes

3. Propensity score estimation under length-biased sampling

4. Survival Curve Estimation

5. Simulation Studies

Table 1.

5.1. Survival Curve Estimation

Figure 1.

6. Discussion

Acknowledgement

Appendix

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases