On a simple estimation of the proportional odds model under right truncation

Peng Liu; Kwun Chuen Gary Chan; Ying Qing Chen

doi:10.1007/s10985-022-09584-2

. 2023 Jan 5;29(3):537–554. doi: 10.1007/s10985-022-09584-2

On a simple estimation of the proportional odds model under right truncation

Peng Liu ^1,^✉, Kwun Chuen Gary Chan ², Ying Qing Chen ³

PMCID: PMC10258175 PMID: 36602639

Abstract

Retrospective sampling can be useful in epidemiological research for its convenience to explore an etiological association. One particular retrospective sampling is that disease outcomes of the time-to-event type are collected subject to right truncation, along with other covariates of interest. For regression analysis of the right-truncated time-to-event data, the so-called proportional reverse-time hazards model has been proposed, but the interpretation of its regression parameters tends to be cumbersome, which has greatly hampered its application in practice. In this paper, we instead consider the proportional odds model, an appealing alternative to the popular proportional hazards model. Under the proportional odds model, there is an embedded relationship between the reverse-time hazard function and the usual hazard function. Building on this relationship, we provide a simple procedure to estimate the regression parameters in the proportional odds model for the right truncated data. Weighted estimations are also studied.

Keywords: Biased sampling, Odds ratio, Reverse-time hazard function

Introduction

Truncation is common in survival analysis where the incomplete nature of the observations is due to a systematic biased selection process originated in the study design. Right truncated data arise naturally when an incubation period (i.e., the time between disease incidence and the onset of clinical symptoms) cannot be observed completely in a retrospective study. In survival analysis, right truncation will lead to biased sampling in which shorter observations will be oversampled (Gürler 1996). For example, to study AIDS caused by blood transfusion (Lagakos et al. 1988), the incubation period is the time from a contaminated blood transfusion to the time when symptoms and signs of AIDS are first apparent. However, in those studies, the following-up period are usually limited. Therefore, only those developed AIDS before the end of study can be identified.

Many authors have studied right truncated data: Woodroofe (1985) and Wang et al. (1986) focused on the asymptotic properties the product limit estimator under random truncation. Keiding and Gill (1990) studied asymptotic properties of random left truncation estimator by a reparametrization of the left truncation model as a three-state Markov process. Lagakos et al. (1988) considered nonparametric estimation and inference of right truncated data by treating the process in reverse time, they showed that $λ^{B} (t) = λ (τ - t)$ , where $τ$ is the study duration, $λ^{B} (t)$ and $λ (t)$ are reverse-time hazard and forward-time hazard, respectively. The authors also discussed the implications and limitations of introducing reverse time hazard to analyze right truncated data. Gross (1992) further explained the necessity of reverse time hazard in the Cox model setting.

However, in most of the current literature, researchers study right truncated data in nonparametric setting, fairly few studied semiparametric models, among them, Kalbfleisch and Lawless (1989) formulating the Cox model on the reverse time hazard (or retro hazard, Lagakos et al. (1988); Keiding and Gill (1990)). For other related work on reverse time hazard, please refer to Gross (1992); Chen et al. (2004), among others.

In this paper, we study right truncated data under a semiparametric proportional odds model. Different from a proportional hazards model, the reverse-time hazard in proportional odds model has a simple log-linear relationship with the forward-time hazard, which leads to an intuitive estimator. While Sundaram (2009)’s method can also be adapted to proportional odds model for right truncated data, she focused on applying a reversed-time argument to an estimator for left truncated data. Our estimator, on the other hand, utilize a direct relationship between the reverse-time hazard, the forward-time hazard and the baseline odds function, so that we obtain a simpler estimator. Weighted functions are also being inserted into the estimating equation to obtain more efficient estimates.

The rest of the paper is organized as follows. Section 2 describes the inference procedure as well as asymptotic results, Sect. 3 shows simulation and real data results, Sect. 5 provides some discussion. Proof of theorems are left into the Appendix part.

Inference procedure

Assume that the failure time of interest T follows the semiparametric proportional odds model:

\begin{matrix} log \{\frac{1 - S (t ∣ Z)}{S (t ∣ Z)}\} = α (t) + Z^{⊤} β, \end{matrix}

and the observed failure time is subject to a right truncation time variable R. The observed data is $(T_{i}, R_{i}), i = 1, \dots, n$ , where $T_{i} \leq R_{i}$ . Let $τ$ be the study duration, which is greater than $max {T_{1}, T_{2}, \dots, T_{n}}$ . An (observed) reverse-time sample, $(T_{i}^{*}, R_{i}^{*}), i = 1, \dots, n$ can be constructed, where $T^{*} = τ - T, R^{*} = τ - R$ , so that $T^{*}$ is left truncated by the variable $R^{*}$ . Denote $({\tilde{T}}^{*}, {\tilde{R}}^{*})$ as the reverse-time sample (potentially truncated). Then the hazard function of ${\tilde{T}}^{*}$ is a quantity originated in $τ$ and counts backward in time. The reverse hazard and cumulative reverse hazard function of backward recurrence time is defined as

\begin{matrix} λ^{B} (t ∣ Z) = lim_{Δ t \to 0} \frac{Pr {{\tilde{T}}^{*} \in (t - Δ t, t] ∣ {\tilde{T}}^{*} \leq t, Z}}{Δ t} = \frac{f (t ∣ Z)}{F (t ∣ Z)}, \\ Λ^{B} (t ∣ Z) = \int_{t}^{τ} λ^{B} (s ∣ Z) d s . \end{matrix}

We would like to mention that a similar definition of the reverse hazard can also be found in Kalbfleisch and Lawless (1989) and Jiang (2011). Denote $v (t) = exp (α (t))$ , and $λ (t) = f (t) / S (t)$ as the forward-time hazard, then

\begin{matrix} log λ (t ∣ Z) - log λ^{B} (t ∣ Z) = & α (t) + Z^{⊤} β, λ^{B} (t ∣ Z) \\ = & \frac{1}{{1 + v (t) exp (Z^{⊤} β)} v (t)} \frac{d v (t)}{dt} . \end{matrix}

Consider the counting process

\begin{matrix} N_{i} (t) = I (t \leq T_{i} \leq R_{i}), Y_{i} (t) = I (T_{i} \leq t \leq R_{i}), \end{matrix}

and denote

\begin{matrix} M_{i} (t, β) = N_{i} (t) - \int_{t}^{τ} Y_{i} (s) \frac{1}{{exp (Z_{i}^{⊤} β) v (s) + 1} v (s)} d v (s) . \end{matrix}

Then $M_{i} (t, β)$ is a martingale with respect to the self-exciting (canonical) filtration (Keiding and Gill 1990; Stralkowska-Kominiak and Stute 2009) and

\begin{matrix} M_{i} (d t, β) = d N_{i} (t) + Y_{i} (t) \frac{1}{{exp (Z_{i}^{⊤} β) v (t) + 1} v (t)} d v (t) . \end{matrix}

Multiply both sides of (2) by ${exp (Z_{i}^{⊤} β) v (t) + 1}$ and summing over n observations,

\begin{matrix} \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β) v (t) + 1} d N_{i} (t) + \sum_{i = 1}^{n} Y_{i} (t) \frac{d v (t)}{v (t)} \\ = \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β) v (t) + 1} M_{i} (d t, β) . \end{matrix}

Divide both left-hand side and right-hand side by $\sum_{i = 1}^{n} Y_{i} (t)$ , we obtain:

\begin{matrix} \frac{\sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β) v (t) + 1} d N_{i} (t)}{\sum_{i = 1}^{n} Y_{i} (t)} + \frac{d v (t)}{v (t)} = \frac{\sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β) v (t) + 1} M_{i} (d t, β)}{\sum_{i = 1}^{n} Y_{i} (t)} . \end{matrix}

which is equivalent to:

\begin{matrix} v (t) \frac{\sum_{i = 1}^{n} exp (Z_{i}^{⊤} β) d N_{i} (t)}{\sum_{i = 1}^{n} Y_{i} (t)} + \frac{\sum_{i = 1}^{n} d N_{i} (t)}{\sum_{i = 1}^{n} Y_{i} (t)} + \frac{d v (t)}{v (t)} \\ = \sum_{i = 1}^{n} \frac{exp (Z_{i}^{⊤} β) v (t) + 1}{\sum_{i = 1} n Y_{i} (t)} M_{i} (d t, β) . \end{matrix}

Denote the left-hand side of (4) as:

\begin{matrix} U (β, d t) = \frac{d v (t)}{v (t)} + p_{n} (t) d t - q_{n} (t, β) v (t) d t, \end{matrix}

where

\begin{matrix} p_{n} (t) d t = \frac{\sum_{i = 1}^{n} d N_{i} (t)}{\sum_{j = 1}^{n} Y_{j} (t)}, q_{n} (t, β) d t = - \frac{\sum_{i = 1}^{n} exp (Z_{i}^{⊤} β) d N_{i} (t)}{\sum_{j = 1}^{n} Y_{j} (t)} . \end{matrix}

From standard counting process arguments (Anderson and Gill, 1982;Aalen10), we know that the stochastic integral with respect to the counting process martingale $M_{i} (d t, β)$ is also a martingale, motivate by the following equation

\begin{matrix} E [\frac{1}{n}, U, (β, d t)] = E [\frac{1}{n}, \sum_{i = 1}^{n}, \frac{exp (Z_{i}^{⊤} β) v (t) + 1}{\sum_{i = 1} n Y_{i} (t)}, M_{i}, (d t, β)] . \end{matrix}

We construct the following estimating equation

\begin{matrix} \frac{1}{n} U (β, d t) = 0 . \end{matrix}

Only v(t) is unknown in (5), let the estimate of v(t) be ${\hat{v}}_{n} (t, β)$ . Denote

\begin{matrix} P_{n} (t) = exp \{\int_{t}^{τ}, \frac{\sum_{i = 1}^{n} d N_{i} (s)}{\sum_{j = 1}^{n} Y_{j} (s)}\}, Q_{n} (t, β) = \int_{t}^{τ} \frac{\sum_{i = 1}^{n} exp (Z_{i}^{⊤} β) d N_{i} (s)}{\sum_{j = 1}^{n} Y_{j} (s)}, \end{matrix}

then

\begin{matrix} {\hat{v}}_{n} (t, β) = \frac{P_{n} (t)}{\int_{t}^{τ} P_{n} (s) Q_{n} (d s, β)} . \end{matrix}

Multiply (2) by $Z_{i} {exp (Z_{i}^{⊤} β) v (t) + 1} / n$ and summing over n observations, we obtain

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} Z_{i} [{exp (Z_{i}^{⊤} β) v (t) + 1} d N_{i} (t) + Y_{i} (t) \frac{d v (t)}{v (t)}] \\ = \frac{1}{n} \sum_{i = 1}^{n} Z_{i} {exp (Z_{i}^{⊤} β) v (t) + 1} M_{i} (d t, β) . \end{matrix}

By virtue of the same idea of (5), take integration on both sides of (7), we can also construct another equation:

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} Z_{i} [{exp (Z_{i}^{⊤} β) v (t) + 1} d N_{i} (t) + Y_{i} (t) \frac{d v (t)}{v (t)}] = 0 \end{matrix}

Substituting (6) into (8), we can obtain the estimate of $β$ by solving the following equation:

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} Z_{i} [\{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) + 1\} d N_{i} (t) + Y_{i} (t) \frac{{\hat{v}}_{n} (d t, β)}{{\hat{v}}_{n} (t, β)}] = 0 . \end{matrix}

Moreover, since

\begin{matrix} \frac{{\hat{v}}_{n} (d t, β)}{{\hat{v}}_{n} (t, β)} = - \frac{\sum_{k = 1}^{n} d N_{k} (t)}{\sum_{l = 1}^{n} Y_{l} (t)} - \frac{\sum_{k = 1}^{n} exp (Z_{k}^{⊤} β) d N_{k} (t)}{\sum_{l = 1}^{n} Y_{l} (t)} {\hat{v}}_{n} (t, β), \end{matrix}

then

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) + 1\} d N_{i} (t) = 0, \end{matrix}

where

\begin{matrix} \bar{Z} (t) = \frac{\sum_{i = 1}^{n} Z_{i} Y_{i} (t)}{\sum_{j = 1}^{n} Y_{j} (t)} . \end{matrix}

Finally, let

\begin{matrix} S_{n} (β) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) + 1\} d N_{i} (t), \end{matrix}

and denote the solution of $S_{n} (β) = 0$ be ${\hat{β}}_{n}$ , we have the following theorem:

Theorem 1

Under assumptions A1-A4 in the Appendix, $\sqrt{n} ({\hat{β}}_{n} - β_{0})$ converges weakly to a mean-zero normal distribution, with covariance matrix $U^{- 1} V {(U^{- 1})}^{⊤}$ , where V is the covariance matrix of $\sqrt{n} S_{n} (β_{0})$ , $U = {lim}_{n \to \infty} {\partial S_{n} (β) / \partial β} ∣_{β = β_{0}}$ . The kth row of U is:

\begin{matrix} lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} \\ \times \{Z_{ik} exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + exp (Z_{i}^{⊤} β_{0}) \frac{\partial {\hat{v}}_{n} (t, β)}{\partial β_{k}} ∣_{β = β_{0}}\} d N_{i} (t) . \end{matrix}

Remark

For proportional odds model with the normal logit link:

\begin{matrix} log \{\frac{S (t ∣ Z)}{1 - S (t ∣ Z)}\} = α (t) + Z^{⊤} β . \end{matrix}

Define

\begin{matrix} {\tilde{M}}_{i} (t, β) = N_{i} (t) - \int_{t}^{τ} Y_{i} (s) \frac{exp (Z^{⊤} β)}{1 + exp (Z^{⊤} β) v (s)} d v (s), \end{matrix}

we claim that ${\tilde{M}}_{i} (t, β)$ is a martingale. Recall that $v (t) = exp (α (t))$ , following (10), we have

\begin{matrix} S (t | Z) = \frac{exp (α (t) + Z^{⊤} β)}{1 + exp (Z^{⊤} β) v (t)}, \end{matrix}

as a result, we can obtain

\begin{matrix} f (t | Z) = \frac{exp (Z^{⊤} β) v^{'} (t)}{{(1 + exp (Z^{⊤} β) v (t))}^{2}}, F (t | Z) = \frac{1}{1 + exp (Z^{⊤} β) v (t)} . \end{matrix}

Following the definition of reverse hazard in Sect. 2, we can write the reverse hazard as

\begin{matrix} {\tilde{λ}}^{B} (t | Z) = \frac{f (t | Z)}{F (t | Z)} = \frac{exp (Z^{⊤} β) v^{'} (t)}{1 + exp (Z^{⊤} β) v (t)} . \end{matrix}

From the general definition of martingale in Fleming and Harrington (1991) (pp. 25), we can easily show that ${\tilde{M}}_{i} (t, β)$ is a martingale. While for model (1),

\begin{matrix} λ^{B} (t | Z) = \frac{v^{'} (t)}{1 + exp (Z^{⊤} β) v (t)}, \end{matrix}

and $N_{i} (t) - \int_{t}^{τ} Y_{i} (s) λ^{B} (t | Z) d t$ is the martingale.

The corresponding estimating equation under model (10) has the following form

\begin{matrix} S_{n}^{(1)} (β) = \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t, β)\} \{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) + 1\} d N_{i} (t), \end{matrix}

where

\begin{matrix} \bar{Z} (t, β) = \frac{\sum_{i = 1}^{n} Z_{i} Y_{i} (t) exp (Z_{i}^{⊤} β)}{\sum_{j = 1}^{n} Y_{j} (t) exp (Z_{j}^{⊤} β)} . \end{matrix}

Equation (11) also can be used to estimate $β$ , however, comparing with (9), (11) is more complicated and more computational intensive, while the derivative of (9) with respect to $β$ can be easily obtained. As a result, (9) can be easily solved by the newton raphson algorithm. In the following simulations, we will use estimating equation (9).

In addition to the unweighted object function (9), weighted object function can also being included to obtain a class of weighted estimators of $β_{0}$ . This procedure is often used to minimize the sandwich estimate as well as improve the efficiency. The weighted version of object function is

\begin{matrix} S_{n, W} (β) \\ = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) + 1\} d N_{i} (t) = 0, \end{matrix}

here $W_{n} (t)$ is a predictable weight function with respect to the canonical filtration which converges to a non-random function w(t). One of the common used weight function is the Prentice-Wilcoxon type function $W_{n 1} (t) = {\hat{S}}_{LB} (t)$ , where ${\hat{S}}_{LB} (\cdot)$ is the Lynden Bell estimate of the baseline survival function for right truncated failure time data. Denote the corresponding estimate of $β$ as ${\hat{β}}_{n, w}$ . Then we have the following theorem:

Theorem 2

Under the same assumptions as Theorem 1, when $n \to \infty$ , for a prespecified weight function $W_{n} (\cdot) \to w (\cdot)$ , $\sqrt{n} ({\hat{β}}_{n, w} - β_{0})$ converges weakly to a mean-zero normal distribution, with covariance matrix $U_{w}^{- 1} V_{w} {(U_{w}^{- 1})}^{⊤}$ , where $V_{w}$ is the covariance matrix of $\sqrt{n} S_{n, w} (β_{0})$ , $U_{w} = {lim}_{n \to \infty} {\partial S_{n, w} (β) / \partial β} ∣_{β = β_{0}}$ . The kth row of $U_{w}$ is:

\begin{matrix} lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) \{Z_{i} - \bar{Z} (t)\} [Z_{ik} exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0})) \\ (+ exp (Z_{i}^{⊤} β_{0}) \{\partial {\hat{v}}_{n} (t, β) / \partial β_{k}\} ∣_{β = β_{0}}] d N_{i} (t) . \end{matrix}

Recently, many people considered problem of finding the optimal weight in a weighted estimating equation, including Chen and Cheng (2005); Chen and Wang (2000); Chen et al. (2012), among others. To achieve this goal, we only need to find the w(t) such that $U_{w} {(β_{0})}^{- 1} V_{w} (β_{0}) U_{w} {(β_{0})}^{- 1}$ achieves the minimum. Since both the empirical weight function $W_{n} (t)$ and its limit w(t) do not rely on unknown parameter $β_{0}$ , it is reasonable to set $β_{0} = 0$ . Another explanation for letting $β_{0} = 0$ is that it represents the baseline distribution. Therefore, let $β_{0} = 0$ , then we have:

\begin{matrix} U_{w} (β_{0}) & = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) \{Z_{i} - \bar{Z} (t)\} Z_{i} exp (Z_{i}^{⊤} β_{0}) v (t) d N_{i} (t) \\ = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} exp (Z_{i}^{⊤} β_{0}) v (t) d N_{i} (t) \\ = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \frac{exp (Z_{i}^{⊤} β_{0}) v^{'} (t)}{exp (Z_{i}^{⊤} β_{0}) v (t) + 1} d t \\ = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} (t) {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \frac{v^{'} (t)}{v (t) + 1} d t . \end{matrix}

\begin{matrix} V_{w} (β_{0}) & = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} {(t)}^{2} {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} \\ \times {\{exp (Z_{i}^{⊤} β_{0}) v (t) + 1\}}^{2} Y_{i} (t) \frac{1}{\{exp (Z_{i}^{⊤} β_{0}) v (t) + 1\} v (t)} v^{'} (t) d t \\ = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} {(t)}^{2} {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \{exp (Z_{i}^{⊤} β_{0}) v (t) + 1\} \frac{v^{'} (t)}{v (t)} d t \\ = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W_{n} {(t)}^{2} {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \{v (t) + 1\} \frac{v^{'} (t)}{v (t)} d t \end{matrix}

Apply the Cauchy-Schwarz inequality to $U_{w} {(β_{0})}^{- 1} V_{w} (β_{0}) U_{w} {(β_{0})}^{- 1}$ and let $β_{0} = 0$ , then it follows that the optimal weight is proportional to

\begin{matrix} w (t) = \frac{v (t)}{{(v (t) + 1)}^{2}} = S (t) \{1 - S (t)\}, \end{matrix}

which minimize the variance of ${\hat{β}}_{n}$ . Since when (15) holds, we have

\begin{matrix} U_{w} (β_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \frac{v (t) v^{'} (t)}{{(v (t) + 1)}^{3}} d t, \\ V_{w} (β_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {\{Z_{i} - \bar{Z} (t)\}}^{\otimes 2} Y_{i} (t) \frac{v (t) v^{'} (t)}{{(v (t) + 1)}^{3}} d t, \end{matrix}

which means when $β_{0} = 0$ , given $w (t) = S (t) {1 - S (t)}$ , we have $U_{w} {(β_{0})}^{- 1} V_{w} (β_{0}) U_{w} {(β_{0})}^{- 1}$ achieves the minimum value $U_{w} {(β_{0})}^{- 1}$ (or equivalently $V_{w} {(β_{0})}^{- 1}$ ).

In simulation, let $W_{n 2} (t) = {\hat{S}}_{LB} (t) \{1 - {\hat{S}}_{LB} (t)\}$ , the results are shown in Table , it can be seen that the weight $W_{n 2} (t)$ achieve the minimal variance among the three estimators.

Simulation and real data

We perform simulation studies to evaluate the finite sample properties of the proposed estimator. In simulation, let $α (t) = 3 log t$ , $β_{0} = {(1, 0.5)}^{⊤}$ , $Z_{1}$ is a continuous variable follows a uniform distribution from 0 to 2, $Z_{2}$ is a discrete variable follows a Bernoulli distribution with probability 0.5. The failure time variable is generated from model (1). The right truncation variable follows a uniform distribution from 0 to 4. This makes the truncation rate equals to 20%. For each simulation, 1000 datasets are generated, in each dataset, there are n observations, $n = 300, 400, 500, 600$ , respectively. $W_{n 1} (t)$ and $W_{n 2} (t)$ are chosen as the weight functions in weighted estimating equations. As is shown in Table 1, three estimation equations yield unbiased estimates and the empirical coverage probability is around nominal level 95%, when weighted function is incorporated into the estimation equation, the efficiency is greatly improved, and the variance achieve minimal for $W_{n 2} (t)$ under three estimates.

Table 1.

Simulation results

		${\hat{β}}_{n}^{(1)}$				${\hat{β}}_{n}^{(2)}$
n		Bias $\times 10^{3}$	SSE $\times 10^{3}$	SEE $\times 10^{3}$	Cov(%)	Bias $\times 10^{3}$	SSE $\times 10^{3}$	SEE $\times 10^{3}$	Cov(%)
300	Unweight	31	329	342	96	8	319	338	97
	Prentice-Wilcoxon	30	249	277	95	18	268	278	95
	$W_{n 2} (t)$	23	227	258	93	8	254	261	93
	Shen et al. (2017)	-2	271	NA	NA	39	274	NA	NA
400	Unweight	20	278	289	96	7	271	288	96
	Prentice-Wilcoxon	12	213	240	96	12	227	241	95
	$W_{n 2} (t)$	8	194	222	94	5	211	225	94
	Shen et al. (2017)	-9	210	NA	NA	7	251	NA	NA
500	Unweight	14	247	254	96	5	247	255	96
	Prentice-Wilcoxon	8	188	214	96	7	205	215	95
	$W_{n 2} (t)$	-1	172	198	94	-3	188	200	94
	Shen et al. (2017)	-21	187	NA	NA	9	235	NA	NA
600	Unweight	11	222	229	96	7	227	232	96
	Prentice-Wilcoxon	5	173	195	96	5	186	196	95
	$W_{n 2} (t)$	-4	155	180	95	-4	172	182	95
	Shen et al. (2017)	-20	164	NA	NA	-5	180	NA	NA

Open in a new tab

SSE The sampling standard deviation, SEE The sampling standard error, Cov The empirical coverage of approximate 95% confidence intervals

As pointed out by one of the referees and the associate editor, Shen et al. (2017) also studied right truncated data under linear transformation models, and we know that when the error term in the linear transformation model follows logistic distribution (Fine et al., 1998), the model becomes the proportional odds model. Let

\begin{matrix} N_{i}^{†} (t) = I (τ - T_{i} \leq t) = I (T_{i} \geq τ - t), \\ Y_{i}^{†} (t) = I (τ - R_{i} \leq t \leq τ - T_{i}) = I (T_{i} \leq τ - t \leq R_{i}), \end{matrix}

then the estimating equations (3) and (4) in Shen et al. (2017) can be written as

\begin{matrix} U (β, α (τ - t)) \\ = \sum_{i = 1}^{n} \int_{- \infty}^{τ} Z_{i} [d N_{i} (t) - Y_{i} (t) d (log \frac{exp (Z_{i}^{⊤} β + α (τ - t))}{1 + exp (Z_{i}^{⊤} β + α (τ - t))})] = 0, \\ \sum_{i = 1}^{n} [d N_{i} (t) - Y_{i} (t) d (log \frac{exp (Z_{i}^{⊤} β + α (τ - t))}{1 + exp (Z_{i}^{⊤} β + α (τ - t))})] = 0 . \end{matrix}

We recognize that Shen et al. (2017)’s methodology is general and works for all the linear transformation models, including the proportional odds model. However, our approach will be more convenient compared with Shen et al. (2017)’s under the proportional odds model, since our approach has a simpler form, and the estimation of the intercept $α (t)$ can be done beforehand and plugged in the final estimating equation, while Shen et al. (2017) can not achieve this and their estimation produce involves a complicated iteration which increases the risk of non-convergence. Besides, Shen et al. (2017) only deal with the reverse time but not the reverse hazard function, and we utilize the relationship between the reverse hazard function and the forward-time hazard function and produced a more intuitive estimator.

We conduct simulations for Shen et al. (2017)’s method and the results are reported in Table 1. The code was obtained from the authors via personal communication. However, one of the authors, Prof. Pao-Sheng Shen mentioned that they were unable to calculate the asymptotic variance and coverage probabilities, the existing results in their paper contain some errors, and their current code only consists of bias and standard error. As a result, we only report bias and standard error of Shen et al. (2017)’s method. All the simulations were conducted under the same model as ours. We also want to mention that we found the computation speed is very slow for Shen et al. (2017)’s method, though asymptotic variance and coverage probability were not calculated, their method is still more than 3 times slower than ours under the same model setting and the sample size. The SSE of Shen et al. (2017)’s method is smaller than our unweighted estimator, but is bigger than the two weighted estimators. For the second approach in their paper, i.e. the conditional maximum-likelihood approach, since the bias is large, we did not perform further comparisons here. We would like to mention that the large bias of the conditional maximum-likelihood approach is also confirmed in Vakulenko-Lagun et al. (2020).

As suggested by one of the reviewers, we also perform simulations without accounting for the truncation, and the results are shown in Table . We choose the truncation distribution as uniform distributions from 0 to 4, 2 and 1, respectively, which corresponds to 20% truncation rate (mild truncation), 40% truncation rate (moderate truncation) as well as 70% truncation rate (heavy truncation). As we can see from Table 2, all the estimators are biased, and a larger truncation rate will lead to a bigger bias and variance, though for the same truncation, variances will decrease when the sample sizes increase. These results also coincide with Table 2.1 (pp. 20) in Rennert (2018) and Table 1 in Rennert and Xie (2018), though the two articles deal with the doubly truncated data under the Cox model.

Table 2.

Simulation results when ignoring truncation

		${\hat{β}}_{n}^{(1)}$				${\hat{β}}_{n}^{(2)}$
n	Truncation	Bias $\times 10^{3}$	SSE $\times 10^{3}$	SEE $\times 10^{3}$	Cov(%)	Bias $\times 10^{3}$	SSE $\times 10^{3}$	SEE $\times 10^{3}$	Cov(%)
300	Mild	-39	284	309	95	-35	267	310	99
300	Moderate	-134	307	305	86	-74	297	308	96
300	Heavy	-306	555	629	91	-106	555	601	98
400	Mild	-67	229	260	95	-46	225	263	98
400	Moderate	-128	254	261	89	-71	265	264	96
400	Heavy	-379	245	251	70	-157	236	261	93
500	Mild	-66	200	229	96	-26	222	235	96
500	Moderate	-125	215	230	91	-72	227	234	96
500	Heavy	-398	215	222	55	-170	224	231	89
600	Mild	-53	173	209	96	-20	206	214	98
600	Moderate	-129	184	208	93	-72	213	212	93
600	Heavy	-406	223	202	45	-165	202	210	87

Open in a new tab

SSE, the sampling standard deviation; SEE, the sampling standard error; Cov, the empirical coverage of approximate 95% confidence intervals

To better illustrate how to employ the proposed method in real situation, we analyze the Centers for Disease Control’s blood-transfusion data, this data was used by Kalbfleisch and Lawless (1989) and Wang (1989). The data include 494 cases reported to the Center of Disease Control prior January, 1, 1987, and diagnosed before July, 1, 1986. Only 295 of the 494 has consistent data, and they got infection by a single blood transfusion or a short series of transfusions, analyse is restricted to this subset. We obtain the raw observation data via personal communication, Thomas Peterman, Centers for Disease Control and Prevention. The data contains three variables: T is the time from blood transfusion to the diagnosis of AIDS (in months), R is the time from blood transfusion to the end of the study (July, 1986, in months), Age is the age of the person when transfusing blood (in years). Comparing the data with Kalbfleisch and Lawless (1989)’s as well as Wang (1989)’s, the observation (X=16, T=33, Age=34) cannot be found in the raw data, thus is being deleted and the final sample size is 294, and a few fractions of the data are also corrected because these entries are not correct compared to the raw data.

We apply the proposed method to this data and treat Age as the covariate in regression. In Wang (1989)’s paper, the data are categorized into three age groups: ‘children’ aged 1-4, ‘adults’ aged 5-59, and ‘elderly patients’ aged 60 and older because of different patterns of survivorship, the survivor behaviour of groups ‘adults’ and ‘elderly patients’ are similar except for the right tail while there is an evident distinction compared with ‘children’, in current analysis, we delete the data from ‘children’, and focus on a combined sample of ‘adults’ and ‘elderly patients’ with a sample size equal to 260. Finally, the range of T is from 0 to 89, and the range of R is from 0 to 99. For all $i \in {1, \dots, 260}$ , we have $T_{i} \leq R_{i}$ . As a result, our dataset will not have the identifiability issue as mentioned in Seaman et al. (2022). We also applied Shen et al. (2017)’s method and the result is similar. All the results are shown in Table , where the weights are chosen as $W_{n 1} (t)$ and $W_{n 2} (t)$ , the estimated parameter between unweighted and weighted estimation equation does not show much difference, but the variance is reduced when weights are considered. In both situations mentioned above, Age has a very weakening positive effect on the odds ratio, but the effect is not significant.

Table 3.

Age effect for blood transfusion data

	Age	SSE
unweighted	-0.0128	0.0153
Prentice-Wilcoxon	-0.0120	0.0143
$W_{n 2} (t)$	-0.0122	0.0122
Shen et al. (2017)	-0.0125	0.0150

Open in a new tab

Discussion

Directly consider the right truncated data in normal time order can be failed because ‘at risk’ process is not adapt to the history of the process (Gross 1992). Retro hazard solves this problem which transform right truncated data to left truncated in reverse time (Woodroofe 1985). Statistical modelling is even more flexible by incorporating the nature structure of proportional odds model. The usual form of proportional odds model can also be utilized but the theoretical and computational burden for the estimator will be increased, employ (1) can substantially improve the situation.

Acknowledgements

We thank Thomas Peterman, Centers for Disease Control and Prevention provided us the CDC blood transfusion data. We also thank for Pao-sheng Shen, Tunghai University provided us the code for Shen et al. (2017) and Vakulenko-Lagun Bella, University of Haifa discussed the simulation results of Vakulenko-Lagun et al. (2020).

Appendix 1

Assumptions:

A1: $β_{0} \in R^{p}$ is the interior point of a compact set $B$ .

A2: Z is a bounded process.

A3: $V (β_{0})$ is non-negative.

A4: f(t) is continuous.

Assumption A1 is also used by Chen et al. (2012), A2 is a standard assumption to ensure martingale properties holds (Fleming and Harrington 1991), A3 is also a standard assumption to avoid theoretical discussion, it is also being used in Huang and Qin (2013), A4 is being used in prove the martingale representation of ${\hat{v}}_{n} (t, β_{0}) - v (t)$ . Besides that, we also need an condition to ensure that the truncated distribution to be correctly identified, let $F (\cdot)$ and G(t) be the distribution function of T and R, define $(a_{F}, b_{F})$ and $(a_{G}, b_{G})$ be the support of $F (\cdot)$ and $G (\cdot)$ of T and R under the meaning that $a_{W} = inf {x : W (x) > 0}, b_{W} = sup {x : W (x) < 1}$ , where W is a distribution function. Under right truncation, actually, only conditional distribution $P (T \leq x | T \leq b_{G})$ and $P (R \leq x | R \geq a_{F})$ can be estimated, thus we assume $a_{F} = a_{G} = 0$ , $b_{R} = \infty$ , so that the conditional distribution will be the actual distribution of T and R, we also assume $P (T \leq Y) = α > 0$ to ensure that there exist observations satisfy our condition, similar assumption and discussion also appeared in Woodroofe (1985); Wang (1989), and Sundaram (2009), among others.

Proof of Theorem 1

To prove the Theorem 1, the first step is to derive the martingale representation of ${\hat{S}}_{n} (β_{0})$ . To do this, we need the martingale representation of ${\hat{v}}_{n} (t, β_{0}) - v_{0} (t)$ . Notice that

\begin{matrix} \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) v_{0} (t) + 1} d N_{i} (t) + \sum_{i = 1}^{n} Y_{i} (t) \frac{d v_{0} (t)}{v_{0} (t)} \\ = \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) v_{0} (t) + 1} M_{i} (d t, β_{0}) \end{matrix}

\begin{matrix} \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1} d N_{i} (t) + \sum_{i = 1}^{n} Y_{i} (t) \frac{{\hat{v}}_{n} (d t, β_{0})}{dt} \frac{1}{{\hat{v}}_{n} (t, β_{0})} = 0 . \end{matrix}

Denote $w_{0} (t) = 1 / v_{0} (t)$ and ${\hat{w}}_{n} (t, β) = 1 / {\hat{v}}_{n} (t, β)$ , then (17) and (16) becomes:

\begin{matrix} \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) + w_{0} (t)} d N_{i} (t) - \sum_{i = 1}^{n} Y_{i} (t) d w_{0} (t) \\ = \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) + w_{0} (t)} M_{i} (d t, β_{0}), \end{matrix}

\begin{matrix} \sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) + {\hat{w}}_{n} (t, β_{0})} d N_{i} (t) - \sum_{i = 1}^{n} Y_{i} (t) {\hat{w}}_{n} (d t, β_{0}) = 0 . \end{matrix}

(19)-(18) and divide both side by $- \sum_{i = 1}^{n} Y_{i} (t)$ :

\begin{matrix} \frac{\partial {{\hat{w}}_{n} (t, β_{0}) - w_{0} (t)}}{\partial t} - p_{n} (t) d t {{\hat{w}}_{n} (t, β_{0}) - w_{0} (t)} \\ = \frac{\sum_{i = 1}^{n} {exp (Z_{i}^{⊤} β_{0}) + w_{0} (t)} M_{i} (d t, β_{0})}{\sum_{i = 1}^{n} Y_{i} (t)} . \end{matrix}

Then

\begin{matrix} {\hat{w}}_{n} (t, β_{0}) - w_{0} (t) = \frac{1}{P_{n} (t)} \sum_{i = 1}^{n} \int_{t}^{τ} P_{n} (s) \frac{\{exp (Z_{i}^{⊤} β_{0}) + w_{0} (s)\}}{\sum_{j = 1}^{n} Y_{j} (s)} M_{i} (d s, β_{0}) . \end{matrix}

In the interval $(0, τ)$ , since $0 < v_{0} (t) < \infty$ , by delta method,

\begin{matrix} {\hat{v}}_{n} (t, β_{0}) - v_{0} (t) & = - \frac{1}{w_{0}^{2} (t)} \{{\hat{w}}_{n} (t, β_{0}) - w_{0} (t)\} \\ = - \frac{v_{0}^{2} (t)}{P_{n} (t)} \sum_{i = 1}^{n} \int_{t}^{τ} P_{n} (s) \frac{v_{0} (s) exp (Z_{i}^{⊤} β_{0}) + 1}{\sum_{j = 1}^{n} Y_{j} (s) v_{0} (s)} M_{i} (d s, β_{0}) . \end{matrix}

At the point 0, (20) holds without condition because ${\hat{v}}_{n} (0, β) = v_{0} (t) = 0$ . At the point $τ$ , if denote $0 \times \infty = 0$ , then (20) also holds.

By using (20), for $S_{n} (β_{0})$ :

\begin{matrix} S_{n} (β_{0}) & = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1\} d N_{i} (t) \\ = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1\} M_{i} (d t, β_{0}) \\ - \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} \{Z_{i} - \bar{Z} (t)\} Y_{i} (t) \frac{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1}{exp (Z_{i}^{⊤} β_{0}) v_{0}^{2} (t) + v_{0} (t)} d v_{0} (t) \\ = I + II . \end{matrix}

In the following, we will show that the second part can also be represented as a summation of integral with respect to martingale.

\begin{matrix} II = & - \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \bar{Z} (t)} Y_{i} (t) \{\frac{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1 - exp (Z_{i}^{⊤} β_{0}) v_{0} (t) - 1}{exp (Z_{i}^{⊤} β_{0}) v_{0}^{2} (t) + v_{0} (t)}) \\ (+ \frac{1}{v_{0} (t)}\} d v_{0} (t) \\ = & - \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - \bar{Z} (t)} Y_{i} (t) \frac{exp (Z_{i}^{⊤} β_{0})}{exp (Z_{i}^{⊤} β_{0}) v_{0}^{2} (t) + v_{0} (t)} \{{\hat{v}}_{n} (t, β_{0}) - v_{0} (t)\} d v_{0} (t) . \end{matrix}

Substitute (20) into (21) and change the integration order, then

\begin{matrix} II & = \frac{1}{n} \sum_{j = 1}^{n} \int_{0}^{τ} \{P_{n}, (t), \frac{exp (Z_{j}^{⊤} β_{0}) v_{0} (t) + 1}{\sum_{k = 1} Y_{k} (t) v_{0} (t)}) \\ (\sum_{i = 1}^{n}, \int_{0}^{t}, {Z_{i} - \bar{Z} (s)}, Y_{i}, (s), \frac{exp (Z_{i}^{⊤} β_{0}) v_{0} (s)}{exp (Z_{i}^{⊤} β_{0}) v_{0} (s) + 1}, \frac{1}{P_{n} (s)}, d, v_{0}, (s)\} M_{j} (d t, β_{0}) . \end{matrix}

Denote

\begin{matrix} ξ_{i} (t, β_{0}) = & \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1\} \\ + P_{n} (t) \frac{exp (Z_{i}^{⊤} β_{0}) v_{0} (t) + 1}{\sum_{k = 1} Y_{k} (t) v_{0} (t)} \\ \times \sum_{j = 1}^{n} \int_{0}^{t} {Z_{j} - \bar{Z} (s)} Y_{j} (s) \frac{exp (Z_{j}^{⊤} β_{0}) v_{0} (s)}{exp (Z_{j}^{⊤} β_{0}) v_{0} (s) + 1} \frac{1}{P_{n} (s)} d v_{0} (s) . \end{matrix}

Then the martingale representation of $S_{n} (β_{0})$ is

\begin{matrix} S_{n} (β_{0}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} ξ_{i} (t, β_{0}) M_{i} (d t, β_{0}) . \end{matrix}

Through (22), it is obvious to prove that $S_{n} (β_{0})$ converges to zero 0 in probability by the weak law of large numbers.

Let

\begin{matrix} μ (t) = lim_{n \to \infty} \frac{\sum_{i = 1}^{n} Y_{i} (t) Z_{i}}{\sum_{j = 1}^{n} Y_{j} (t)}, v (t, β) = lim_{n \to \infty} {\hat{v}}_{n} (t, β) . \end{matrix}

Denote

\begin{matrix} s_{n} (β) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {Z_{i} - μ (t)} \{exp (Z_{i}^{⊤} β) {\hat{v}}_{n} (t, β) - exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0})\} d N_{i} (t) . \end{matrix}

The derivative of $S_{n} (β)$ and $s_{n} (β)$ are

\begin{matrix} S_{n}^{'} (β) = \frac{1}{n} \sum_{i = 1}^{n} {Z_{i} - \bar{Z} (t)} \{exp (Z_{i}^{⊤} β) Z_{i} {\hat{v}}_{n} (t, β) + exp (Z_{i}^{⊤} β) {\partial {\hat{v}}_{n} (t, β) / \partial β}\} d N_{i} (t), \\ s_{n}^{'} (β) = \frac{1}{n} \sum_{i = 1}^{n} {Z_{i} - μ (t)} \{exp (Z_{i}^{⊤} β) Z_{i} {\hat{v}}_{n} (t, β) + exp (Z_{i}^{⊤} β) {\partial {\hat{v}}_{n} (t, β) / \partial β}\} d N_{i} (t) . \end{matrix}

Notice $s_{n} (β_{0}) = 0$ . Assume that there exists $ε > 0$ such that (A5): $P {∣ Z_{i} - μ (t) ∣ > ε, i = 1, 2, \dots, n} > 0$ , which means covariate can not be identical for all individuals. Together with the assumption (A6):

\begin{matrix} ∣ E \{exp (Z^{⊤} β_{0}) Z {\hat{v}}_{n} (t, β_{0})\} + E [exp (Z^{⊤} β_{0}) {\partial {\hat{v}}_{n} (t, β) / \partial β} ∣_{β = β_{0}}] ∣ > 0 . \end{matrix}

we have $∣ {lim}_{n} s_{n}^{'} (β_{0}) ∣ > 0$ . Without loss of generality, let ${lim}_{n} s_{n}^{'} (β_{0}) > 0$ , then there exist a neighborhood of $β_{0}$ such that $s_{n} (β)$ is strictly increasing. Further notice that $S_{n} (β) = s_{n} (β) + o_{p} (1)$ , $S_{n}^{'} (β) = s_{n}^{'} (β) + o_{p} (1)$ , then $S_{n} (β)$ is strictly increasing in a neighborhood of $β_{0}$ , thus prove the consistency of ${\hat{β}}_{n}$ .

By martingale central limit theorem, the variance of $S_{n} (β_{0})$ is

\begin{matrix} V (β_{0}) & = lim_{n \to \infty} V_{n} = lim_{n \to \infty} < n^{- 1 / 2} S_{n} (β_{0}), n^{- 1 / 2} S_{n} (β_{0}) > (τ) \\ = lim_{n \to \infty} \frac{1}{n} \int_{0}^{τ} \sum_{i = 1}^{n} ξ_{i} {(t, β_{0})}^{\otimes 2} d \int_{t}^{τ} - Y_{i} (s) {exp (Z_{i}^{⊤} β_{0}) v^{2} (s) + v (s)}^{- 1} d v (s) \\ = lim_{n \to \infty} \frac{1}{n} \int_{0}^{τ} \sum_{i = 1}^{n} ξ_{i} {(t, β_{0})}^{\otimes 2} Y_{i} (t) {exp (Z_{i}^{⊤} β_{0}) v^{2} (t) + v (t)}^{- 1} d v (t) . \end{matrix}

Further using the delta method will complete the proof of Theorem 1.

$□$

Proof of Theorem 2

Since Theorem 1 and 2 are quite similar, in this part, we will omit the proof detail and only give the detailed expression of $V_{w}$ .

\begin{matrix} V_{w} (β_{0}) = lim_{n \to \infty} \frac{1}{n} \int_{0}^{τ} \sum_{i = 1}^{n} ξ_{i, w} {(t, β_{0})}^{\otimes 2} Y_{i} (t) {exp (Z_{i}^{⊤} β_{0}) v^{2} (t) + v (t)}^{- 1} d v (t) . \end{matrix}

where

\begin{matrix} ξ_{i, w} (t, β_{0}) = & W_{n} (t) \{Z_{i} - \bar{Z} (t)\} \{exp (Z_{i}^{⊤} β_{0}) {\hat{v}}_{n} (t, β_{0}) + 1\} \\ + P_{n} (t) \frac{exp (Z_{i}^{⊤} β_{0}) v_{0} (t) + 1}{\sum_{k = 1} Y_{k} (t) v_{0} (t)} \\ \times \sum_{j = 1}^{n} \int_{0}^{t} W_{n} (s) {Z_{j} - \bar{Z} (s)} Y_{j} (s) \frac{exp (Z_{j}^{⊤} β_{0}) v_{0} (s)}{exp (Z_{j}^{⊤} β_{0}) v_{0} (s) + 1} P_{n} {(s)}^{- 1} d v_{0} (s) . \end{matrix}

$□$

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Aalen OO, Andersen PK, Borgan Ø, Gill RD, Keiding N (2010) History of applications of martingales in survival analysis. arxiv preprint arXiv:1003.0188
Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Statist. 1982;10:1100–20. doi: 10.1214/aos/1176345976. [DOI] [Google Scholar]
Chen YQ, Wang M-C. Analysis of accelerated hazards models. J Am Statist Assoc. 2000;95:608–18. doi: 10.1080/01621459.2000.10474236. [DOI] [Google Scholar]
Chen YQ, Wang M-C, Huang Y. Semiparametric regression analysis on longitudinal pattern of recurrent gap times. Biostatistics. 2004;5:277–90. doi: 10.1093/biostatistics/5.2.277. [DOI] [PubMed] [Google Scholar]
Chen YQ, Cheng S. Semiparametric regression analysis of mean residual life with censored survival data. Biometrika. 2005;92:19–29. doi: 10.1093/biomet/92.1.19. [DOI] [Google Scholar]
Chen YQ, Hu N, Musoke P, Zhao LP. Estimating regression parameters in an extended proportional odds model. J Am Statist Assoc. 2012;107:318–30. doi: 10.1080/01621459.2012.656021. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fine JP, Ying Z, Wei LG. On the linear transformation model for censored data. Biometrika. 2012;85:980–6. doi: 10.1093/biomet/85.4.980. [DOI] [Google Scholar]
Fleming TR, Harrington DP. Counting processes and survival analysis. New York: John Wiley; 1991. [Google Scholar]
Gross ST. Regression models for truncated survival data. Scand J Stat. 1992;19:193–213. [Google Scholar]
Gürler Ü. Bivariate estimation with right-truncated data. J Am Statist Assoc. 1996;91:1152–65. [Google Scholar]
Huang C-Y, Qin J. Semiparametric estimation for the additive hazards model with left-truncated and right-censored data. Biometrika. 2013;100:877–88. doi: 10.1093/biomet/ast039. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jiang Y. Estimation of hazard function for right truncated data. Thesis: Georgia State University; 2011. [Google Scholar]
Kalbfleisch JD, Lawless JF. Inferences based on retrospective ascertainment: an analysis of the data on transfusion-related AIDS. J Am Statist Assoc. 1989;84:360–72. doi: 10.1080/01621459.1989.10478780. [DOI] [Google Scholar]
Keiding N, Gill R. Random truncation models and Markov processes. Ann Statist. 1990;18:582–602. doi: 10.1214/aos/1176347617. [DOI] [Google Scholar]
Lagakos SW, Barraj LM, De Gruttola V. Nonparametetric analysis of truncated survival data, with application to AIDS. Biometrika. 1988;75:515–23. doi: 10.1093/biomet/75.3.515. [DOI] [Google Scholar]
Rennert L (2018) Statistical methods for truncated survival data. Doctoral dissertation, University of Pennsylvania
Rennert L, Xie SX. Cox regression model with doubly truncated data. Biometrics. 2018;74:725–33. doi: 10.1111/biom.12809. [DOI] [PMC free article] [PubMed] [Google Scholar]
Seaman SR, Presanis A, Jackson C. Estimating a time-to-event distribution from right-truncated data in an epidemic: a review of methods. Stat Methods Med Res. 2022;31:1641–55. doi: 10.1177/09622802211023955. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shen PS, Liu Y, Maa DP, Ju Y. Analysis of transformation models with right-truncated data. Statistics. 2017;51:404–18. doi: 10.1080/02331888.2016.1268617. [DOI] [Google Scholar]
Stralkowska-Kominiak E, Stute W. Martingale representations of the Lynden-Bell estimator with applications. Stat Probabil Lett. 2009;79:814–20. doi: 10.1016/j.spl.2008.10.038. [DOI] [Google Scholar]
Sundaram R. Semiparametric inference of proportional odds model based on randomly truncated data. J Stat Plan Inference. 2009;139:1381–93. doi: 10.1016/j.jspi.2008.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vakulenko-Lagun B, Mandel M, Betensky RA. Inverse probability weighting methods for Cox regression with right-truncated data. Biometrics. 2020;76:484–955. doi: 10.1111/biom.13162. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vakulenko-Lagun B, Mandel M, Betensky RA. Inverse probability weighting methods for cox regression with right-truncated data. Biometrics. 2020;76:484–95. doi: 10.1111/biom.13162. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang M-C, Jewell NP, Tsai W-Y. Asymptotic properties of the product limit estimate under random truncation. Ann Statist. 1986;14:1597–605. doi: 10.1214/aos/1176350180. [DOI] [Google Scholar]
Wang M-C. A semiparametric model for randomly truncated data. J Am Statist Assoc. 1989;84:742–8. doi: 10.1080/01621459.1989.10478828. [DOI] [Google Scholar]
Woodroofe M. Estimating a distribution function with truncated data. Ann Statist. 1985;13:163–77. doi: 10.1214/aos/1176346584. [DOI] [Google Scholar]

[CR1] Aalen OO, Andersen PK, Borgan Ø, Gill RD, Keiding N (2010) History of applications of martingales in survival analysis. arxiv preprint arXiv:1003.0188

[CR2] Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Statist. 1982;10:1100–20. doi: 10.1214/aos/1176345976. [DOI] [Google Scholar]

[CR3] Chen YQ, Wang M-C. Analysis of accelerated hazards models. J Am Statist Assoc. 2000;95:608–18. doi: 10.1080/01621459.2000.10474236. [DOI] [Google Scholar]

[CR4] Chen YQ, Wang M-C, Huang Y. Semiparametric regression analysis on longitudinal pattern of recurrent gap times. Biostatistics. 2004;5:277–90. doi: 10.1093/biostatistics/5.2.277. [DOI] [PubMed] [Google Scholar]

[CR5] Chen YQ, Cheng S. Semiparametric regression analysis of mean residual life with censored survival data. Biometrika. 2005;92:19–29. doi: 10.1093/biomet/92.1.19. [DOI] [Google Scholar]

[CR6] Chen YQ, Hu N, Musoke P, Zhao LP. Estimating regression parameters in an extended proportional odds model. J Am Statist Assoc. 2012;107:318–30. doi: 10.1080/01621459.2012.656021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] Fine JP, Ying Z, Wei LG. On the linear transformation model for censored data. Biometrika. 2012;85:980–6. doi: 10.1093/biomet/85.4.980. [DOI] [Google Scholar]

[CR8] Fleming TR, Harrington DP. Counting processes and survival analysis. New York: John Wiley; 1991. [Google Scholar]

[CR9] Gross ST. Regression models for truncated survival data. Scand J Stat. 1992;19:193–213. [Google Scholar]

[CR10] Gürler Ü. Bivariate estimation with right-truncated data. J Am Statist Assoc. 1996;91:1152–65. [Google Scholar]

[CR11] Huang C-Y, Qin J. Semiparametric estimation for the additive hazards model with left-truncated and right-censored data. Biometrika. 2013;100:877–88. doi: 10.1093/biomet/ast039. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] Jiang Y. Estimation of hazard function for right truncated data. Thesis: Georgia State University; 2011. [Google Scholar]

[CR13] Kalbfleisch JD, Lawless JF. Inferences based on retrospective ascertainment: an analysis of the data on transfusion-related AIDS. J Am Statist Assoc. 1989;84:360–72. doi: 10.1080/01621459.1989.10478780. [DOI] [Google Scholar]

[CR14] Keiding N, Gill R. Random truncation models and Markov processes. Ann Statist. 1990;18:582–602. doi: 10.1214/aos/1176347617. [DOI] [Google Scholar]

[CR15] Lagakos SW, Barraj LM, De Gruttola V. Nonparametetric analysis of truncated survival data, with application to AIDS. Biometrika. 1988;75:515–23. doi: 10.1093/biomet/75.3.515. [DOI] [Google Scholar]

[CR16] Rennert L (2018) Statistical methods for truncated survival data. Doctoral dissertation, University of Pennsylvania

[CR17] Rennert L, Xie SX. Cox regression model with doubly truncated data. Biometrics. 2018;74:725–33. doi: 10.1111/biom.12809. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] Seaman SR, Presanis A, Jackson C. Estimating a time-to-event distribution from right-truncated data in an epidemic: a review of methods. Stat Methods Med Res. 2022;31:1641–55. doi: 10.1177/09622802211023955. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] Shen PS, Liu Y, Maa DP, Ju Y. Analysis of transformation models with right-truncated data. Statistics. 2017;51:404–18. doi: 10.1080/02331888.2016.1268617. [DOI] [Google Scholar]

[CR20] Stralkowska-Kominiak E, Stute W. Martingale representations of the Lynden-Bell estimator with applications. Stat Probabil Lett. 2009;79:814–20. doi: 10.1016/j.spl.2008.10.038. [DOI] [Google Scholar]

[CR21] Sundaram R. Semiparametric inference of proportional odds model based on randomly truncated data. J Stat Plan Inference. 2009;139:1381–93. doi: 10.1016/j.jspi.2008.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] Vakulenko-Lagun B, Mandel M, Betensky RA. Inverse probability weighting methods for Cox regression with right-truncated data. Biometrics. 2020;76:484–955. doi: 10.1111/biom.13162. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] Vakulenko-Lagun B, Mandel M, Betensky RA. Inverse probability weighting methods for cox regression with right-truncated data. Biometrics. 2020;76:484–95. doi: 10.1111/biom.13162. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] Wang M-C, Jewell NP, Tsai W-Y. Asymptotic properties of the product limit estimate under random truncation. Ann Statist. 1986;14:1597–605. doi: 10.1214/aos/1176350180. [DOI] [Google Scholar]

[CR25] Wang M-C. A semiparametric model for randomly truncated data. J Am Statist Assoc. 1989;84:742–8. doi: 10.1080/01621459.1989.10478828. [DOI] [Google Scholar]

[CR26] Woodroofe M. Estimating a distribution function with truncated data. Ann Statist. 1985;13:163–77. doi: 10.1214/aos/1176346584. [DOI] [Google Scholar]

PERMALINK

On a simple estimation of the proportional odds model under right truncation

Peng Liu

Kwun Chuen Gary Chan

Ying Qing Chen

Abstract

Introduction

Inference procedure

Theorem 1

Remark

Theorem 2

Simulation and real data

Table 1.

Table 2.

Table 3.

Discussion

Acknowledgements

Appendix 1

Proof of Theorem 1

Proof of Theorem 2

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

On a simple estimation of the proportional odds model under right truncation

Peng Liu

Kwun Chuen Gary Chan

Ying Qing Chen

Abstract

Introduction

Inference procedure

Theorem 1

Remark

Theorem 2

Simulation and real data

Table 1.

Table 2.

Table 3.

Discussion

Acknowledgements

Appendix 1

Proof of Theorem 1

Proof of Theorem 2

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases