Abstract
Dynamic (or varying) covariate effects often manifest meaningful physiological mechanisms underlying chronic diseases. However, standard approaches to evaluating disease prognostic factors typically adopt a static view of covariate effects, which can lead to important disease markers being undervalued. To address this issue, we take the perspective of globally concerned quantile regression and propose a flexible testing framework suited to assessing either constant or dynamic covariate effects. We study powerful Kolmogorov-Smirnov (K-S) and Cramér-von Mises (C-V) type test statistics and develop a simple resampling procedure to tackle their complicated limit distributions. We provide rigorous theoretical results, including the limit null distributions and the consistency of the proposed tests under a general class of alternative hypotheses, as well as justifications for the presented resampling procedure. Extensive simulation studies and a real data example demonstrate the utility of the new testing procedures and their advantages over existing approaches in assessing dynamic covariate effects.
Keywords: Hypothesis testing, Globally concerned quantile regression, Testing consistency, Resampling
1. Introduction
Identifying useful prognostic factors is often of critical interest in chronic disease studies. When the disease outcome is captured by a time-to-event, a commonly used approach is to model the mechanism of a prognostic factor influencing the time-to-event outcome via a standard survival regression model and then test the corresponding covariate effects (see reviews in Kleinbaum and Klein (2010) and Cox and Oakes (2018)). The standard survival regression models, such as the Cox proportional hazards (PH) regression model and the accelerated failure time (AFT) model, impose assumptions like proportional hazards and location-shift effects, which implicitly confine the prognostic factor of interest to be a static portent of disease progression.
There has been growing awareness that a prognostic factor may follow a dynamic association with a time-to-event disease outcome. Many reports in the literature (Dickson et al., 1989; Thorogood et al., 1990; Verweij and van Houwelingen, 1995; Bellera et al., 2010, for example) have suggested that postulating constant covariate effects is sometimes not adequate to reflect underlying physiological disease mechanisms, leading to distorted assessments of prognostic factors. For example, an analysis of a dialysis dataset reported by Peng and Huang (2008) suggested that the severity of restless leg syndrome (RLS) symptoms may be prognostic of mortality for short-term dialysis survivors but not for long-term dialysis survivors. The standard tests based on the Cox PH model and the AFT model failed to detect such a dynamic effect.
Quantile regression (Koenker and Bassett, 1978), which directly formulates covariate effects on quantile(s) of a response, provides a natural avenue for characterizing a dynamic effect of a prognostic factor. Specifically, given a time-to-event outcome T and a covariate Z̃ (which represents the prognostic factor of interest), a linear quantile regression model may assume

QT(τ|Z̃) = exp{Z⊤θ0(τ)}, τ ∈ Δ, (1)

where QT(τ|Z̃) denotes the τ-th conditional quantile of T given Z̃, Z = (1, Z̃)⊤, θ0(τ) ≡ (θ0(0)(τ), θ0(1)(τ))⊤ is an unknown coefficient vector, and Δ ⊆ (0, 1) is a pre-specified set including the quantile levels of interest. The coefficient θ0(1)(τ) represents the effect of Z̃ on the τ-th conditional quantile of T, and is allowed to change with τ. This implies that the prognostic factor is permitted to have different effects across different segments of the distribution of the time-to-event outcome.
Many authors have studied linear quantile regression with a time-to-event outcome (Powell, 1986; Ying et al., 1995; Portnoy, 2003; Zhou, 2006; Peng and Huang, 2008; Wang and Wang, 2009; Huang, 2010, for example). Most of the existing methods concern covariate effects at one or multiple pre-specified quantile levels (e.g. Δ is a singleton set {0.5}), and, following the terminology of Zheng et al. (2015), are locally concerned. As discussed in Zheng et al. (2015), locally concerned quantile regression cannot inform about the covariate effect on quantiles other than the specifically targeted ones (e.g. the median), and thus may miss important prognostic factors. Adopting the perspective of globally concerned quantile regression, one can simultaneously examine covariate effects over a continuum of quantile levels (e.g. Δ is an interval [0.1, 0.9]), and thus obtain a more comprehensive assessment of a prognostic factor. However, powerful tests tailored to evaluating covariate effects under the perspective of globally concerned quantile regression have not been formally studied, partly owing to the associated inferential complexity.
In this work, we develop a new framework for evaluating a survival prognostic factor following the spirit of globally concerned quantile regression. As a proof of concept, we shall confine the scope of this work to the standard survival setting where the time-to-event outcome T is subject to random censoring. Specifically, our proposal is to simultaneously assess the influence of the prognostic factor on a range of quantiles of T, indexed by a τ-interval, [τL, τU] ⊂ (0, 1). As the key rationale, a significant prognostic factor is allowed to have a dynamic τ-varying effect, which may be non-zero throughout the whole τ-interval (i.e. full effect), or only over a part of the τ-interval (i.e. partial effect). Under this view, when model (1) with Δ = [τL, τU] holds, the task of identifying a prognostic factor reduces to testing the null hypothesis

H0 : θ0(1)(τ) = 0 for all τ ∈ [τL, τU].
Moreover, without assuming any model, we may consider the null hypothesis formulated as

H̃0 : QT(τ|Z̃) = QT(τ) for all τ ∈ [τL, τU],

where QT(τ) = inf{t : Pr(T ≤ t) ≥ τ} denotes the τ-th unconditional (or marginal) quantile of T. The null hypothesis H̃0 corresponds to the setting where Z̃ has no influence on the conditional quantiles of T at any quantile level between τL and τU.
It is remarkable that under mild regularity conditions, H̃0 implies that model (1) holds with Δ = [τL, τU] and θ0(1)(τ) = 0 for τ ∈ [τL, τU]; on the other hand, model (1) holding with Δ = [τL, τU] and θ0(1)(τ) = 0 for τ ∈ [τL, τU] implies QT(τ|Z̃) = QT(τ) for τ ∈ [τL, τU]; see Lemma 1 in Appendix A. This finding sheds an important insight that a model-based test developed for H0 may be used towards testing the model-free null hypothesis H̃0. From an alternative view, this result suggests that the globally concerned quantile regression model (1) with Δ = [τL, τU] can be used as a working model to test H̃0, which adopts the view that the effect of a prognostic factor can be assessed through contrasting the conditional versus unconditional quantiles of T.
Regarding H̃0, we study two “omnibus” test statistics constructed based on the estimator of θ0(τ) obtained under the working model (1) with Δ = [τL, τU]. One test is a Kolmogorov-Smirnov (K-S) type test statistic defined upon the maximum “signal” strength (i.e. covariate effect) over τ’s in [τL, τU]. The other is a Cramér-von Mises (C-V) type test statistic based on the average “signal” strength over τ’s in [τL, τU]. These two types of test statistics are known to be very sensitive to any departure from the null hypothesis H0 under model (1). However, the analytic forms of their limit null distributions are generally complex and sometimes intractable. This challenge is more acute in the quantile regression setting, where coefficient estimates do not have a closed form, and the corresponding asymptotic variance matrix involves unknown density functions (Koenker, 2005). To overcome these difficulties, we propose to approximate the limit null distributions through a resampling procedure that perturbs the influence function associated with the adopted coefficient estimator under the working model (1), following strategies similar to those of Lin et al. (1993) and Li and Peng (2014). We derive a sample-based procedure to estimate the influence function without requiring the correct specification of model (1), thereby circumventing direct evaluation of the unknown density function via smoothing. The proposed resampling procedure is easy to implement and is shown to perform well even with realistic sample sizes. Moreover, we provide rigorous theoretical justifications for the proposed resampling procedure.
The rest of this paper is organized as follows. In Section 2, we first briefly review some existing results on the estimation of model (1), which we use as a working model for testing H̃0. We then present the proposed test statistics along with their theoretical properties. A resampling procedure is developed to carry out inference regarding H0 or H̃0 based on the proposed test statistics. We also discuss some computational strategies that help simplify or improve the implementation of the proposed method. In Section 3, we report extensive simulation studies conducted to evaluate the finite-sample performance of the proposed testing procedures. Our simulation results show that the proposed tests have accurate empirical sizes and can be much more powerful than benchmark methods when assessing a covariate with a dynamic effect. In Section 4, we further demonstrate the usefulness of the proposed testing procedures with a real data example. Concluding remarks and discussions are provided in Section 5.
2. The Proposed Testing Procedures
2.1. Estimation of θ0(τ) under model (1)
As explained in Section 1, we propose to use globally concerned quantile regression as a vehicle to address the testing problem regarding the general null hypothesis H̃0. The first step is to obtain an estimator of θ0(τ) (and thus of the covariate effect θ0(1)(τ)) from fitting the working model (1) to the observed data. Here and hereafter, we shall set the Δ in model (1) as Δ = [τL, τU], a pre-specified interval within (0, 1). Let C denote the time to censoring, X = min(T, C), and δ = I(T ≤ C). The observed data include n i.i.d. replicates of (X, δ, Z), denoted by {(Xi, δi, Zi)}i=1n.
To estimate θ0(τ) under model (1), we choose to adapt the existing results of Peng and Fine (2009) developed for competing risks data to the setting with randomly censored data. Compared to the other available estimators developed by Portnoy (2003) and Peng and Huang (2008), which require τL = 0, the estimator derived from Peng and Fine (2009) is more robust to any realistic violation of the global linearity assumed by model (1) (Peng, 2021). The influence function associated with Peng and Fine (2009)’s estimator also has a simpler form that can facilitate the development of the corresponding testing procedures.
The estimator of θ0(τ) adapted from Peng and Fine (2009)’s work, denoted by θ̂(τ), is obtained as the solution to the following estimating equation:

Sn(b, τ) ≡ n−1/2 Σi=1n Zi{ I(Xi ≤ exp(Zi⊤b)) δi Ĝ(Xi)−1 − τ } = 0, (2)

where Ĝ(·) is a reasonable estimator of G(x|Z) ≡ Pr(C ≥ x|Z). For simplicity of illustration, in the sequel, we shall assume C is independent of (T, Z̃) and thus take Ĝ(·) as the Kaplan-Meier estimator of the marginal survival function of C, G(x) ≡ Pr(C ≥ x). As noted by Peng and Fine (2009), solving (2) can be formulated as an L1-type minimization problem of the following convex objective function:

Un(b, τ) = Σi=1n δi Ĝ(Xi)−1 |log Xi − Zi⊤b| + | M − b⊤ Σl=1n Zl{ 2τ − δl Ĝ(Xl)−1 } |.

Here M is a sufficiently large number. This L1-type minimization problem can be easily solved using the rq() function in the R package quantreg by Koenker (2022).
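To fix ideas, the following R sketch (not the authors' implementation) carries out this minimization with one appended pseudo-observation in a weighted rq() call; the input names X, delta, Zt and the perturbation argument e, which anticipates step (1.c) below, are illustrative assumptions.

```r
library(quantreg)
library(survival)

# Minimal sketch of solving (2) via the L1-type objective U_n(b, tau).
# Setting e = 0 solves S_n(b, tau) = 0; a nonzero e solves the perturbed
# equation S_n(b, tau) = e needed in step (1.c) of Section 2.1.
pf_est <- function(X, delta, Zt, tau, e = c(0, 0), M = 1e6) {
  n <- length(X)
  Z <- cbind(1, Zt)                       # design matrix Z = (1, Z~)'
  # Kaplan-Meier estimator of the censoring survival function G
  km <- survfit(Surv(X, 1 - delta) ~ 1)
  Gfun <- stepfun(km$time, c(1, km$surv))
  w <- delta / pmax(Gfun(X), 1e-10)       # IPCW weights delta_i / G-hat(X_i)
  # Pseudo-covariate v carrying the |M - b'v| term; the subgradient of U_n
  # equals 2 * n^{1/2} * S_n(b, tau) minus the shift 2 * sqrt(n) * e below
  v <- colSums(Z * (2 * tau - w)) + 2 * sqrt(n) * e
  keep <- w > 0                           # censored points carry zero weight
  y_aug <- c(log(X[keep]), M)
  Z_aug <- rbind(Z[keep, , drop = FALSE], v)
  w_aug <- c(w[keep], 1)
  fit <- rq(y_aug ~ Z_aug - 1, tau = 0.5, weights = w_aug)
  unname(coef(fit))                       # (theta^(0)(tau), theta^(1)(tau))
}
```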
By the results of Peng and Fine (2009), the estimator θ̂(τ) enjoys desirable asymptotic properties. Specifically, under certain regularity conditions, we have (i) supτ∈[τL,τU] ∥θ̂(τ) − θ0(τ)∥ →p 0; and (ii) n1/2{θ̂(τ) − θ0(τ)} converges weakly to a mean zero Gaussian process for τ ∈ [τL, τU] with covariance function Φ(τ′, τ) = E{ξ1(τ′)ξ1(τ)⊤}. Here ξi(τ) (i = 1, …, n) are defined as

ξi(τ) = A(θ0(τ))−1 ηi(τ), with ηi(τ) = Zi{ I(Xi ≤ exp(Zi⊤θ0(τ))) δi G(Xi)−1 − τ } + ∫0∞ w(θ0(τ), t) y(t)−1 dMiG(t),

where G(x) = Pr(C > x), A(b) = E[ZZ⊤f(Z⊤b|Z)] with f(·|Z) denoting the conditional density of log X given Z, w(b, t) = E[ZY(t)I(X ≤ exp{Z⊤b})I(δ = 1)G(X)−1], and MiG(t) = I(Xi ≤ t, δi = 0) − ∫0t Yi(s)λG(s) ds, with Yi(t) = I(Xi ≥ t), y(t) = Pr(X ≥ t), and λG(t) = limΔ→0 Pr(C ∈ (t, t + Δ)|C ≥ t)/Δ. In addition, n1/2{θ̂(τ) − θ0(τ)} ≈ n−1/2 Σi=1n ξi(τ), where ≈ indicates asymptotic equivalence uniformly in τ ∈ [τL, τU]. Consequently, ξi(τ) is referred to as the influence function of θ̂(τ).
Note that the variance estimation for θ̂(τ) is complicated by the involvement of the unknown density f(·|Z) in the asymptotic covariance matrix Φ(τ′, τ). As justified by Peng and Fine (2009), a sample-based procedure that avoids smoothing-based density estimation can be used for variance estimation and is outlined below:
- (1.a) Compute a consistent variance estimate for Sn(θ0(τ), τ) given by

  Ṽ(τ) = n−1 Σi=1n {η̂i(τ)}⊗2,

  where η̂i(τ) is the sample analogue of ηi(τ), obtained by replacing θ0(τ), G, w, y, and λG with their empirical counterparts, and where, for a vector a, a⊗2 = aa⊤.
- (1.b) Find a symmetric and nonsingular matrix En(τ) ≡ {en,0(τ), en,1(τ)} such that En(τ){En(τ)}⊤ = Ṽ(τ).
- (1.c) Calculate Dn(τ) = n1/2{θ̂0∗(τ) − θ̂(τ), θ̂1∗(τ) − θ̂(τ)}, where θ̂j∗(τ) is the solution to the perturbed estimating equation Sn(b, τ) = en,j(τ), j = 0, 1.
- (1.d) Obtain an estimate for the asymptotic variance of n1/2{θ̂(τ) − θ0(τ)} as Vn(τ) = Dn(τ){Dn(τ)}⊤.
Here En(τ) can be computed via the eigenvalue-eigenvector decomposition of Ṽ(τ) using the R function eigen(). As another important remark, the above procedure ensures that the perturbation terms, en,j(τ), j = 0, 1, have the desired asymptotic order. As a result, this procedure remains valid when en,j(τ) in step (1.c) is replaced by u · en,j(τ) for some constant u, with the resulting Dn(τ) rescaled by u−1. Based on our numerical experience, incorporating such a constant u can help stabilize variance estimation when the sample size is small or when τ is close to 0 or 1. Variance estimation based on the above procedure is found to have satisfactory finite-sample performance in some unreported simulation studies.
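The following R sketch assembles steps (1.a)-(1.d) under the same illustrative assumptions as the pf_est() sketch above; in particular, it takes the matrix of estimated summands η̂i(τ) as an input rather than reconstructing the martingale-integral term.

```r
# Minimal sketch of steps (1.a)-(1.d), reusing pf_est() from above.
# eta_hat: assumed n x 2 matrix of the estimated summands eta-hat_i(tau);
# u: the adjusting constant discussed in the remark above.
pf_var <- function(X, delta, Zt, tau, theta_hat, eta_hat, u = 1) {
  n <- length(X)
  Vtilde <- crossprod(eta_hat) / n          # (1.a) n^{-1} sum eta-hat_i^{(x)2}
  eg <- eigen(Vtilde, symmetric = TRUE)     # (1.b) symmetric square root E_n
  En <- eg$vectors %*% diag(sqrt(pmax(eg$values, 0))) %*% t(eg$vectors)
  # (1.c) columns of D_n: solve S_n(b, tau) = u * e_{n,j} and rescale by 1/u
  Dn <- sapply(1:2, function(j) {
    theta_star <- pf_est(X, delta, Zt, tau, e = u * En[, j])
    sqrt(n) * (theta_star - theta_hat) / u
  })
  Dn %*% t(Dn)                              # (1.d) V_n(tau) = D_n D_n'
}
```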
2.2. The proposed test statistics and theoretical properties
Express θ̂(τ) = (θ̂(0)(τ), θ̂(1)(τ))⊤ and let σ̂(1)(τ) denote the square root of the second diagonal element of Vn(τ), which corresponds to the variance estimate for n1/2θ̂(1)(τ) under H̃0. We propose to construct two “omnibus” test statistics based on θ̂(1)(τ) and σ̂(1)(τ):

Tsup = supτ∈[τL,τU] n1/2|θ̂(1)(τ)| / σ̂(1)(τ)

and

Tint = ∫τLτU n1/2|θ̂(1)(τ)| / σ̂(1)(τ) dτ.
These two test statistics mimic the classic Kolmogorov-Smirnov (K-S) and Cramér-von Mises (C-V) test statistics for two-sample distribution comparisons (Darling, 1957). Under model (1), Tsup and Tint capture the maximum and the average magnitude of the (standardized) covariate effect over τ ∈ [τL, τU], respectively. By this design, both test statistics are sensitive to any type of departure from the null hypothesis H0 and can be used to construct powerful tests for H0.
Without assuming model (1), we can also show that Tsup and Tint provide valid tests for H̃0 and have power approaching one under a general class of alternative hypotheses, as specified in Theorem 2. The key insight is that even when model (1) does not hold, θ̂(τ) may still converge in probability to a deterministic function θ̄(τ) ≡ (θ̄(0)(τ), θ̄(1)(τ))⊤, defined as the solution to μ(b, τ) ≡ E[Z{I(log T ≤ Z⊤b) − τ}] = 0. It is easy to see that θ̄(τ) = θ0(τ) under model (1). By Lemma 1, it follows that under H̃0, θ̄(1)(τ) = 0 for τ ∈ [τL, τU]. As detailed in Theorems A1–A2 in Appendix B, under certain regularity conditions, we further have supτ∈[τL,τU] ∥θ̂(τ) − θ̄(τ)∥ →p 0, and n1/2{θ̂(τ) − θ̄(τ)} converges weakly to a mean zero Gaussian process for τ ∈ [τL, τU] with covariance function Σ(τ′, τ) = E{ξ̄1(τ′)ξ̄1(τ)⊤}, where ξ̄i(τ) (i = 1, …, n) are defined as

ξ̄i(τ) = A(θ̄(τ))−1 η̄i(τ),

with η̄i(τ) denoting ηi(τ) evaluated with θ0(τ) replaced by θ̄(τ).
A useful by-product from the proof of Theorem A2 is that

n1/2{θ̂(τ) − θ̄(τ)} = n−1/2 Σi=1n ξ̄i(τ) + op(1), uniformly in τ ∈ [τL, τU]. (3)
We can prove these results by adapting the arguments of Peng and Fine (2009), which utilize model assumption (1) only through its implication that μ(θ0(τ), τ) = 0 for τ ∈ [τL, τU]. This provides the critical justification for why θ̂(1)(τ) can be used to test H̃0 even when model (1) does not hold. The sample-based procedure reviewed in Section 2.1 is still applicable to estimate the asymptotic covariance matrix Σ(τ′, τ).
In Theorems 1 and 2, we establish useful asymptotic properties of Tsup and Tint without assuming model (1). Specifically, in Theorem 1, we provide the limit distributions of the proposed test statistics under the null hypothesis H̃0:
Theorem 1 Assuming the regularity conditions (C1)–(C5) in the Appendix hold, under the null hypothesis H0 or H̃0, we have

Tsup →d supτ∈[τL,τU] |𝒳(1)(τ)| / σ(1)(τ) and Tint →d ∫τLτU |𝒳(1)(τ)| / σ(1)(τ) dτ,

where 𝒳(1)(τ) is a mean zero Gaussian process defined in Appendix C and σ(1)(τ) is given in condition (C5).
We also investigate the asymptotic behavior of the proposed test statistics under a general class of alternative hypotheses. The findings are stated in Theorem 2.
Theorem 2 Assuming the regularity conditions (C1)–(C5) in the Appendix hold,

- Tsup is consistent against the alternative hypothesis Ha,1 : supτ∈[τL,τU] |θ̄(1)(τ)| > 0;
- Tint is consistent against the alternative hypothesis Ha,2 : ∫τLτU |θ̄(1)(τ)| dτ > 0.
The results of Theorem 2 indicate that the test statistics have power approaching 1 (as n goes to ∞) under alternatives subject to very mild constraints. Given the smoothness of θ̄(1)(τ) (see condition (C3)), a general scenario that ensures the consistency of both Tsup and Tint can be described as

θ̄(1)(τ∗) ≠ 0 for some τ∗ ∈ [τL, τU].

This suggests that the proposed tests are powerful in identifying a significant prognostic factor even when it only influences a segment of the outcome distribution, not necessarily the whole outcome distribution. This feature is conceptually appealing for handling a dynamic covariate effect, which may not have similar effect strength across different quantiles. The detailed proofs of Theorems 1 and 2 can be found in Appendix C.
2.3. The proposed resampling procedure to obtain p values
The results in Theorem 1 suggest that Tsup and Tint, like the classic K-S and C-V test statistics, have complex, non-standard limit null distributions. This motivates us to develop a resampling-based procedure to approximate their limit null distributions and obtain the corresponding p values for testing H̃0.
Our key strategy is to approximate the distribution of n1/2{θ̂(1)(τ) − θ̄(1)(τ)}, which reduces to n1/2θ̂(1)(τ) under H0, through perturbing the influence function ξ̄i(1)(τ), the second component of ξ̄i(τ). Similar ideas were used by other authors, for example, Lin et al. (1993) and Li and Peng (2014). The core justification of our proposal is provided by equation (3), which suggests that n−1/2 Σi=1n ζi ξ̂i(τ) may be used to approximate n1/2{θ̂(τ) − θ̄(τ)}, where ζ1, …, ζn are i.i.d. standard normal variates and ξ̂i(τ) is an estimate of ξ̄i(τ).
Specifically, we take the following steps:
- (2.a) Generate B independent sets of {ζ1(b), …, ζn(b)}, where ζ1(b), …, ζn(b) are independent random variables from the standard normal distribution and b = 1, 2, …, B.
- (2.b) Compute the estimates ξ̂i(1)(τ) of the influence functions ξ̄i(1)(τ) as the second component of

  ξ̂i(τ) = Dn(τ){En(τ)}−1 η̂i(τ), i = 1, …, n,

  where η̂i(τ) is defined as in step (1.a) of Section 2.1.
- (2.c) For b = 1, …, B, calculate

  Tsup(b) = supτ∈[τL,τU] |n−1/2 Σi=1n ζi(b) ξ̂i(1)(τ)| / σ̂(1)(τ) and Tint(b) = ∫τLτU |n−1/2 Σi=1n ζi(b) ξ̂i(1)(τ)| / σ̂(1)(τ) dτ.
- (2.d) The p values based on Tsup and Tint are calculated respectively as

  B−1 Σb=1B I{Tsup(b) ≥ Tsup} and B−1 Σb=1B I{Tint(b) ≥ Tint}.
The resampling procedure presented above is easy to implement without involving smoothing. The rigorous theoretical justification for the presented resampling procedure is provided in Appendix D.
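As an illustration, a minimal R sketch of steps (2.a)-(2.d) on a τ-grid is given below; the inputs (the matrix of estimated influence functions, the vector of standard error estimates, the grid spacing, and the observed statistics) are assumed to be available from the earlier steps.

```r
# Minimal sketch of steps (2.a)-(2.d) on an equally spaced grid.
# xi1:   assumed n x K matrix of xi-hat_i^(1)(t_k) over the grid
# sig1:  assumed length-K vector of sigma-hat^(1)(t_k)
# d:     grid spacing; T_sup, T_int: observed test statistics
resample_pvalues <- function(xi1, sig1, d, T_sup, T_int, B = 2500) {
  n <- nrow(xi1)
  draws <- replicate(B, {
    zeta <- rnorm(n)                    # (2.a) i.i.d. N(0, 1) perturbations
    W <- colSums(zeta * xi1) / sqrt(n)  # n^{-1/2} sum_i zeta_i xi-hat_i^(1)
    s <- abs(W) / sig1                  # standardized perturbed process
    c(sup = max(s), int = sum(s) * d)   # (2.c) perturbed statistics
  })
  # (2.d) p values: proportion of perturbed statistics exceeding the observed
  c(p_sup = mean(draws["sup", ] >= T_sup),
    p_int = mean(draws["int", ] >= T_int))
}
```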
2.4. Some Computational Considerations
Note that θ̂(1)(τ) and σ̂(1)(τ) are piecewise constant in τ; thus an exact calculation of the supremum or the integral involved in Tsup and Tint is possible. Alternatively, we may follow the recommendation of Zheng et al. (2015) to compute Tsup and Tint based on a simpler piecewise-constant approximation of θ̂(1)(τ) on a pre-determined fine τ-grid, 𝒢 = {τL = t1 < t2 < ⋯ < tK = τU}, with an equal grid size d = tk+1 − tk. In this case, the proposed test statistics can be calculated as

Tsup ≈ max1≤k≤K n1/2|θ̂(1)(tk)| / σ̂(1)(tk) and Tint ≈ Σk=1K {n1/2|θ̂(1)(tk)| / σ̂(1)(tk)} · d. (4)
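For instance, under the assumption of an equally spaced grid, (4) amounts to the following short R sketch, with theta1 and sig1 holding θ̂(1)(tk) and σ̂(1)(tk):

```r
# Minimal sketch of (4): observed statistics on a grid with spacing d
stat_obs <- function(theta1, sig1, n, d) {
  s <- sqrt(n) * abs(theta1) / sig1   # standardized effect at each t_k
  c(T_sup = max(s), T_int = sum(s) * d)
}
```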
When n is not large, the sample-based variance estimation (i.e. the computation of Vn(τ)) sometimes is not stable. Our remedy is to replace en,j(τ) in step (1.c) (see Section 2.1) with u · en,j(τ), where u is a pre-specified constant. We develop the following algorithm to determine a good choice of the adjusting constant u among a set of candidate values, 𝒰 = {1, 2, …, U}.
- (3.a) For each u ∈ 𝒰, calculate σ̂(1)(τ; u) for τ ∈ 𝒢, where σ̂(1)(τ; u) is the σ̂(1)(τ) computed with the adjusting constant u.
- (3.b) For each u ∈ 𝒰, calculate A(u) = maxτ∈𝒢 σ̂(1)(τ; u) − minτ∈𝒢 σ̂(1)(τ; u).
- (3.c) For each u ∈ 𝒰, calculate B(u) = maxτ∈𝒢 max{Vn(τ; u)} − minτ∈𝒢 min{Vn(τ; u)}, where Vn(τ; u) is Vn(τ) computed with the adjusting constant u. Here, for a matrix A, max(A) (or min(A)) denotes the largest (or the smallest) component of the matrix A.
- (3.d) Assign a large positive value to A[0] and B[0], say 10^5. Set k = 1 and u[0] = U + 1.
  (i) If A(k) ≤ A[k−1] and B(k) ≤ B[k−1], then let A[k] = A(k), B[k] = B(k), and u[k] = k. Otherwise, let A[k] = A[k−1], B[k] = B[k−1], and u[k] = u[k−1].
  (ii) Increase k by 1 and go back to (i) until k > U.
- (3.e) If u[U] < U + 1, then choose u as u[U]. Otherwise, no appropriate u can be selected from 𝒰.
By this algorithm, we provide an empirical strategy to select u based on two estimation instability measures: (A) A(k), which reflects the spread of σ̂(1)(τ; k) over τ given u = k; and (B) B(k), which measures the maximum fluctuation of the estimated variance matrices Vn(τ; k) across τ given u = k. It is clear that both measures would be large when unstable variance estimation occurs. Our algorithm first compares them with pre-specified initial values, A[0] and B[0], to rule out the occurrence of obviously outlying estimates of σ̂(1)(τ) or Vn(τ). Once these two measures are found to meet the stability criteria set by the initial values for some u ∈ 𝒰, the algorithm proceeds to check whether other u’s can yield smaller values of the instability measures. The output from this algorithm is either the value of u that produces the smallest instability measures, or an error message indicating that none of the constants in 𝒰 leads to the stable estimation required by the proposed testing procedure. Based on our numerical experience, setting 𝒰 = {1, 2, …, 6}, which corresponds to U = 6, works well for small sample sizes such as 200 or 400. In the rare case where this algorithm fails to identify an appropriate u, we recommend adaptively increasing the value of U until an appropriate u can be identified. Our extensive numerical experience suggests that incorporating the adjusting constant u selected by this algorithm results in good and stable numerical performance of the proposed tests. The algorithm can be easily generalized to allow 𝒰 to include non-integer values.
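A minimal R sketch of this selection algorithm is given below; the helper var_by_u() is an assumed wrapper that returns, for a given u, the vector of σ̂(1)(τ; u) over 𝒢 (sig1) and the list of matrices Vn(τ; u) (Vn), and the forms of the two instability measures follow the reconstruction above.

```r
# Minimal sketch of the selection algorithm (3.a)-(3.e)
select_u <- function(var_by_u, U = 6, init = 1e5) {
  A_k <- init; B_k <- init; u_sel <- U + 1   # (3.d) initial values
  for (k in 1:U) {
    est <- var_by_u(k)
    A_new <- max(est$sig1) - min(est$sig1)   # (3.b) spread of sigma-hat^(1)
    B_new <- max(sapply(est$Vn, max)) -      # (3.c) fluctuation of V_n
             min(sapply(est$Vn, min))
    if (A_new <= A_k && B_new <= B_k) {      # (i) both measures improve
      A_k <- A_new; B_k <- B_new; u_sel <- k
    }
  }
  if (u_sel > U) stop("no appropriate u in 1..U; consider increasing U")
  u_sel                                       # (3.e) selected constant
}
```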
3. Simulation Studies
We conduct extensive simulation studies to investigate the finite-sample performance of the proposed resampling-based testing procedures. To simulate randomly censored data, we consider six setups where T and Z̃ follow different relationships. In all setups, we generate Z̃ from Uniform(0, 1) and generate the censoring time C from Uniform(UL, UU), where UL and UU are properly specified to produce 15% or 30% censoring. Let Φ(·) denote the cumulative distribution function of the standard normal distribution. The six simulation setups are described as follows.
Setup I: Generate T such that Qτ{log(T)} = Φ−1(τ). Set (UL, UU) = (2, 3.8) to produce 15% censoring, and set (UL, UU) = (1, 2.5) to produce 30% censoring.

Setup II: Generate T such that Qτ{log(T)} = 0.2Z̃ + Φ−1(τ). Set (UL, UU) = (2.5, 3.9) to produce 15% censoring, and set (UL, UU) = (1.2, 2.8) to produce 30% censoring.

Setup III: Generate T such that Qτ{log(T)} = 0.5Z̃ + Φ−1(τ). Set (UL, UU) = (2.7, 4.9) to produce 15% censoring, and set (UL, UU) = (1.5, 3) to produce 30% censoring.

Setup IV: Generate T such that Qτ{log(T)} = l4(τ)Z̃ + Φ−1(τ), where l4(τ) is as plotted in Figure 1. Set (UL, UU) = (2, 3.9) to produce 15% censoring, and set (UL, UU) = (1, 2.5) to produce 30% censoring.

Setup V: Generate T such that Qτ{log(T)} = l5(τ)Z̃ + Φ−1(τ), where l5(τ) is as plotted in Figure 1. Set (UL, UU) = (5.2, 6.5) to produce 15% censoring, and set (UL, UU) = (1.5, 3.5) to produce 30% censoring.

Setup VI: Generate T such that Qτ{log(T)} = l6(τ)Z̃ + Φ−1(τ), where l6(τ) is as plotted in Figure 1. Set (UL, UU) = (3.5, 5.5) to produce 15% censoring, and set (UL, UU) = (1.1, 3.5) to produce 30% censoring.
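For concreteness, a minimal R sketch of the data generation shared by these setups (shown with the setup II configuration) is given below; it relies on the inverse-CDF argument that, when Qτ{log(T)} = l(τ)Z̃ + Φ−1(τ) is increasing in τ, log T = l(U)Z̃ + Φ−1(U) with U ~ Uniform(0, 1) has the stated conditional quantiles.

```r
# Minimal sketch of the simulation data generation; l() and the censoring
# bounds (UL, UU) vary by setup (setup II defaults shown)
gen_data <- function(n, l = function(tau) 0.2, UL = 2.5, UU = 3.9) {
  Zt  <- runif(n)                      # covariate Z~ ~ Uniform(0, 1)
  tau <- runif(n)                      # each subject's latent quantile level
  T   <- exp(l(tau) * Zt + qnorm(tau)) # inverse-CDF generation of T
  C   <- runif(n, UL, UU)              # censoring time ~ Uniform(UL, UU)
  data.frame(X = pmin(T, C), delta = as.numeric(T <= C), Zt = Zt)
}
```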
Under all setups, model (1) holds for τ ∈ (0, 1), and thus also for τ ∈ [0.1, 0.6], the pre-specified τ-interval of interest [τL, τU]. In Figure 1, we plot the true coefficient function θ0(1)(τ) for each setup. It is easy to see that setup (I) represents a null case, where Z̃ has no effect on any quantile of T. Setups (II) and (III) are two setups where Z̃ has nonzero constant effects over all τ ∈ [0.1, 0.6]. The constant effect in setup (II) has a magnitude of 0.2, which is smaller than that in setup (III), which is 0.5. In setups (IV), (V), and (VI), Z̃ has a dynamic effect varying across different τ’s. More specifically, Z̃ has a partial effect over the τ-interval [0.1, 0.49] in setup (IV). In setup (V), the magnitude of Z̃’s effect is symmetric around τ = 0.5, while the sign of the effect is opposite for τ < 0.5 and for τ > 0.5, and the effect equals 0 at τ = 0.5. In setup (VI), the τ-varying effect pattern of Z̃ is similar to that in setup (V) except that there is a small interval around τ = 0.5 where Z̃ has no effect.
We compare the proposed method with the Wald test based on the Cox PH model, denoted by “CPH (Wald)”, as well as the Wald test based on locally concerned quantile regression that focuses on τ = 0.4, 0.5, or 0.6, denoted by “CQR (Wald)”. To implement CQR (Wald), we adopt Peng and Huang (2008)’s estimates with variance estimated by bootstrapping. The resampling size used for both CQR (Wald) and the proposed testing procedures is set as 2500. In the sequel, we shall refer to the testing procedures based on Tsup and Tint as GST and GIT, respectively. For all the methods, we consider sample sizes 200, 400, and 800. We set 𝒰 = {1, …, 6} when implementing the algorithm for selecting the constant u.
In Table 1, we report the empirical rejection rates based on 1000 simulations. The results in setup (I) show that the proposed GIT and the existing tests, CQR (Wald) and CPH (Wald), have empirical sizes quite close to the nominal level 0.05. The proposed GST yields relatively larger empirical type I errors compared to the other tests. The empirical size of GST equals 0.1 when the sample size is 200 but decreases to 0.077 when the sample size increases to 800. Such an anti-conservative behavior of GST is not surprising because the K-S type test statistic is defined based on the largest value of n1/2|θ̂(1)(τ)|/σ̂(1)(τ) over τ ∈ [0.1, 0.6], which is more sensitive to a possible outlying value of this quantity at some τ.
Table 1. Empirical rejection rates of the proposed tests (GST and GIT) and the benchmark tests CQR (Wald) and CPH (Wald), based on 1000 simulations (nominal level 0.05).

Set-up | n | GST | GIT | CQR (Wald), τ = 0.4 | CQR (Wald), τ = 0.5 | CQR (Wald), τ = 0.6 | CPH (Wald)
---|---|---|---|---|---|---|---
15% censoring | |||||||
I | 200 | 0.100 | 0.073 | 0.066 | 0.062 | 0.057 | 0.049 |
400 | 0.091 | 0.078 | 0.072 | 0.072 | 0.066 | 0.051 | |
800 | 0.077 | 0.055 | 0.064 | 0.063 | 0.059 | 0.061 | |
II | 200 | 0.234 | 0.167 | 0.117 | 0.131 | 0.117 | 0.115 |
400 | 0.275 | 0.214 | 0.155 | 0.153 | 0.150 | 0.178 | |
800 | 0.410 | 0.362 | 0.277 | 0.265 | 0.247 | 0.322 | |
III | 200 | 0.566 | 0.485 | 0.359 | 0.401 | 0.360 | 0.450 |
400 | 0.786 | 0.772 | 0.585 | 0.592 | 0.576 | 0.722 | |
800 | 0.957 | 0.957 | 0.873 | 0.887 | 0.865 | 0.960 | |
IV | 200 | 0.377 | 0.254 | 0.097 | 0.060 | 0.053 | 0.063 |
400 | 0.652 | 0.478 | 0.116 | 0.065 | 0.063 | 0.067 | |
800 | 0.939 | 0.816 | 0.148 | 0.047 | 0.058 | 0.090 | |
V | 200 | 0.653 | 0.464 | 0.143 | 0.070 | 0.118 | 0.260 |
400 | 0.937 | 0.827 | 0.208 | 0.071 | 0.153 | 0.458 | |
800 | 0.999 | 0.993 | 0.291 | 0.053 | 0.279 | 0.757 | |
VI | 200 | 0.731 | 0.552 | 0.149 | 0.062 | 0.086 | 0.125 |
400 | 0.971 | 0.896 | 0.198 | 0.055 | 0.095 | 0.201 | |
800 | 1.000 | 0.995 | 0.260 | 0.033 | 0.142 | 0.364 | |
30% censoring | |||||||
I | 200 | 0.171 | 0.095 | 0.062 | 0.060 | 0.048 | 0.047 |
400 | 0.110 | 0.085 | 0.069 | 0.074 | 0.065 | 0.056 | |
800 | 0.066 | 0.052 | 0.063 | 0.059 | 0.050 | 0.038 | |
II | 200 | 0.302 | 0.186 | 0.115 | 0.122 | 0.105 | 0.122 |
400 | 0.305 | 0.221 | 0.152 | 0.156 | 0.138 | 0.188 | |
800 | 0.411 | 0.359 | 0.277 | 0.259 | 0.245 | 0.298 | |
III | 200 | 0.681 | 0.539 | 0.360 | 0.393 | 0.322 | 0.432 |
400 | 0.828 | 0.791 | 0.585 | 0.590 | 0.534 | 0.703 | |
800 | 0.959 | 0.957 | 0.874 | 0.877 | 0.855 | 0.952 | |
IV | 200 | 0.440 | 0.271 | 0.101 | 0.061 | 0.044 | 0.056 |
400 | 0.668 | 0.480 | 0.115 | 0.065 | 0.062 | 0.085 | |
800 | 0.947 | 0.804 | 0.150 | 0.048 | 0.046 | 0.089 | |
V | 200 | 0.799 | 0.573 | 0.135 | 0.069 | 0.103 | 0.092 |
400 | 0.960 | 0.846 | 0.206 | 0.068 | 0.135 | 0.140 | |
800 | 1.000 | 0.993 | 0.292 | 0.054 | 0.282 | 0.211 | |
VI | 200 | 0.803 | 0.587 | 0.148 | 0.063 | 0.077 | 0.053 |
400 | 0.978 | 0.903 | 0.199 | 0.052 | 0.082 | 0.064 | |
800 | 1.000 | 0.995 | 0.263 | 0.033 | 0.141 | 0.083 |
When the quantile effect of Z̃ is constant over τ (i.e. setups (II) and (III)), we note that in setup (II), where the effect size (i.e. the magnitude of the constant effect) is relatively small, CPH (Wald) has lower empirical power than the proposed GIT and GST, and the power improvement associated with GIT and GST is more evident at the smaller sample size 200. In setup (III), where the effect size is larger, CPH (Wald) still generally has lower empirical power than the proposed tests, but its empirical power becomes comparable to that of GIT when the sample size is large (i.e. n = 800). These observations suggest that even in the trivial constant effect cases, the proposed tests can outperform the traditional Cox regression based tests in data scenarios with small effect sizes or sample sizes. In both setups (II) and (III), the locally concerned CQR (Wald) consistently yields lower empirical power than the proposed globally concerned GIT and GST. This reflects the power benefit resulting from integrating information on covariate effects across different quantiles, as in GST and GIT, rather than focusing on the covariate effect at a single quantile, as in CQR (Wald).
In setups (IV), (V), and (VI), the effect of Z̃ is τ-varying, reflecting its dynamic association with T. In these cases, CPH (Wald), which assumes a constant covariate effect, can have poor power to detect the dynamic effect of Z̃ (e.g. 8.3% empirical power in setup (VI) with n = 800 in the presence of 30% censoring), while the proposed GST and GIT may yield much higher power (e.g. >99% power in setup (VI) with n = 800 in the presence of 30% censoring). The locally concerned CQR (Wald) can have higher power than CPH (Wald) when the targeted quantile level is within the τ-region where the effect of Z̃ is non-zero. When the targeted quantile level is outside the τ-region with a non-zero effect, such as τ = 0.6 in setup (IV) or τ = 0.5 in setups (V) and (VI), CQR (Wald) has even poorer power than CPH (Wald). This is well expected because these cases may serve as null cases for the locally concerned CQR (Wald). This confirms that CQR (Wald) is inadequate to capture a meaningful effect of Z̃ that is manifested at non-targeted quantiles.
We also compare the simulation results across settings that differ only in the censoring distribution. For each relationship between Z̃ and T specified by setups (I)–(VI), we consider three different censoring distributions to yield 0%, 15%, and 30% censoring. The results for settings with 15% and 30% censoring are presented in Table 1, and the results based on uncensored data are presented in Table A.1 in Appendix E. From our comparisons, we find that the quantile regression based tests, including GST, GIT and CQR (Wald), demonstrate small variations in empirical power as the censoring rate (or distribution) changes. In cases with a constant covariate effect, the Cox regression based test, CPH (Wald), also performs similarly across settings with different censoring rates. However, in setup (V), where the covariate effect is not constant over τ, CPH (Wald) has reasonably good power when there is no censoring or only 15% censoring, but its performance deteriorates considerably when the censoring rate increases to 30%. We have a similar observation for CPH (Wald) in setup (VI). A reasonable interpretation of these observations is that the capacity to detect a dynamic effect can be weakened by incorrectly assuming a constant proportional hazards effect, and can be further attenuated by the missing information due to censoring.
We also investigate whether the proposed tests are sensitive to the choice of 𝒰. We conduct additional simulation studies with 𝒰 set as {1, …, 3}, {1, …, 6}, and {1, …, 12} for the six setups with 15% censoring. The results are summarized in Table A.2 in the Appendix. From this table, we note that GIT is quite robust to the change in 𝒰, while GST demonstrates more variation across different choices of 𝒰. Another observation is that GST becomes less sensitive to the change in 𝒰 when the sample size becomes larger. A possible explanation for these results is similar to that for the observed anti-conservative behavior of GST. That is, GST, by its construction, is sensitive to any outlying value of σ̂(1)(τ) with τ ∈ [τL, τU], which is more likely to occur when the sample size is not large.
Aligned with the definitions of the proposed tests, the simulation results suggest that GST, compared to GIT, is more sensitive in detecting a departure from the null hypothesis, yielding higher power. This observation is also consistent with the anti-conservative behavior of GST observed in the null cases, which is reflected by empirical sizes notably greater than 0.05. With a smaller sample size, such as n = 200, GST can produce quite elevated type I errors, while GIT yields more reasonable empirical sizes. Therefore, in practice, one may need to exercise caution when applying GST to a small dataset, for which we recommend using GIT instead. In summary, our simulation results demonstrate that the proposed testing procedures have robust and satisfactory performance for detecting a covariate with either a constant or a dynamic effect. The new tests tend to exhibit greater advantages over the benchmark approaches when the covariate presents a dynamic effect, or when the covariate has a constant effect of small magnitude.
4. Real Data Analysis
To illustrate the utility of the proposed testing framework, we apply our method to investigate prognostic factors for dialysis survival based on a dataset collected from a cohort of 191 incident dialysis patients (Kutner et al., 2002). In this dataset, time to death is censored for about 35% of dialysis patients due to either renal transplantation or the end of the study as of December 31, 2005. In our analysis, we consider six potential prognostic factors (or covariates): age in years (AGE); the indicator of reporting fish consumption over the first year of dialysis (FISHH); the indicator of baseline HD dialysis modality (BHDPD); whether the patient has severe symptoms of restless leg syndrome (BLEGS); whether the education level is college or higher (HIEDU); and the indicator of being in the black race group (BLACK). In our analyses, we standardize AGE by subtracting the sample mean and then dividing the resulting quantity by the sample standard deviation.
As a part of exploratory analyses, we check the proportional hazard assumption for each covariate based on Grambsch and Therneau (1994)’s method, using the R function cox.zph() in the R package survival. The p-values corresponding to AGE, FISHH, BHDPD, BLEGS, HIEDU and BLACK are 0.43, 0.63, 0.55, 0.0006, 0.047 and 0.0004, respectively. These results suggest that the proportional hazard assumption may be violated for BLEGS, HIEDU and BLACK.
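This check can be reproduced with a few lines of R; the sketch below fits a univariate Cox model per covariate, with dat, time, and status as illustrative placeholders for the dialysis data.

```r
library(survival)
# PH check per covariate via Grambsch-Therneau weighted residuals
covs <- c("AGE", "FISHH", "BHDPD", "BLEGS", "HIEDU", "BLACK")
sapply(covs, function(v) {
  fit <- coxph(as.formula(paste("Surv(time, status) ~", v)), data = dat)
  cox.zph(fit)$table[v, "p"]   # p-value of the PH test for this covariate
})
```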
We fit model (1) for time to death (i.e. T) with each covariate separately. We set [τL, τU] as [0.1, 0.6] for FISHH, BLEGS, HIEDU, and BLACK, but set [τL, τU] as [0.1, 0.54] and [0.1, 0.49] for AGE and BHDPD, respectively. This is because the estimation of θ̂(τ) based on Peng and Fine (2009) does not converge for some τ’s larger than 0.54 and 0.49 when the covariate is AGE or BHDPD, respectively. Figure 2 presents the estimated coefficients with pointwise 95% confidence intervals across τ ∈ [τL, τU]. Figure 2 suggests that AGE and BLACK have strong and persistent effects across all or most quantiles of time to death, implying an apparent survival advantage for younger or black patients. For each of the remaining covariates, FISHH, BHDPD, BLEGS, and HIEDU, we note a partial effect pattern. For example, FISHH and BLEGS may only impact some lower quantiles of the survival time. BHDPD and HIEDU may only have quantile effects in the τ-intervals [0.15, 0.3] and [0.3, 0.4], respectively. These observations suggest the presence of dynamic covariate effects, as well as the need to appropriately accommodate such effects.
To evaluate each potential prognostic factor considered, we apply the proposed testing procedures, GST and GIT, along with the benchmark methods, CPH (Wald) and CQR (Wald), as described in Section 3. Table 2 summarizes the p values obtained from the different methods for evaluating each covariate. We note that all tests consistently suggest a strong effect of AGE or BLACK on the survival time. The locally concerned quantile regression tests, CQR (Wald), reveal τ-varying effects of FISHH, BHDPD, BLEGS, and HIEDU. For example, BLEGS may significantly influence the 10th and 20th percentiles of the survival time but not the 30th through 60th percentiles. HIEDU may also have a partial effect, influencing some quantiles, such as the 30th and 40th percentiles, but not the others. The classic Cox regression based test, CPH (Wald), however, fails to capture the partial effects of BLEGS and HIEDU; the p values for testing the effects of BLEGS and HIEDU based on CPH (Wald) are 0.35 and 0.25, respectively. This is possibly caused by imposing a restrictive static view on how a covariate can influence the survival time. In contrast, the proposed GIT and GST, through simultaneously examining covariate effects at quantile levels in [τL, τU], are able to detect the partial effect of BLEGS, with small p values ≤ 0.001, and to suggest a trend toward an association between HIEDU and the survival time, with p values 0.013 (GST) and 0.093 (GIT). The proposed GIT and GST also provide some evidence for the dynamic prognostic value of FISHH and BHDPD for dialysis survival. For example, as suggested by CQR (Wald), fish consumption in the first year may benefit dialysis patients with shorter survival times but may manifest little effect on long-term survival. In general, our analysis results are consistent with the analyses of Peng and Huang (2008) based on a multivariate censored quantile regression model. This example demonstrates the good practical utility of the proposed methods when varying covariate effects are present.
Table 2. p values from the proposed tests and the benchmark tests for evaluating each covariate in the dialysis data.

Covariate | GST | GIT | CQR (Wald), τ = 0.1 | τ = 0.2 | τ = 0.3 | τ = 0.4 | τ = 0.5 | τ = 0.6 | CPH (Wald)
---|---|---|---|---|---|---|---|---|---
AGE | <0.001 | <0.001 | 0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |
FISHH | <0.001 | 0.018 | 0.037 | 0.055 | 0.036 | 0.214 | 0.473 | 0.316 | 0.026 |
BHDPD | <0.001 | 0.005 | 0.090 | 0.021 | 0.152 | 0.228 | 0.229 | 0.030 | 0.008 |
BLEGS | <0.001 | 0.001 | <0.001 | 0.001 | 0.062 | 0.082 | 0.091 | 0.507 | 0.349 |
HIEDU | 0.013 | 0.093 | 0.596 | 0.137 | 0.003 | 0.032 | 0.068 | 0.241 | 0.245 |
BLACK | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |
5. Discussion
In this paper, we develop a new testing framework for evaluating a survival prognostic factor. The main thrust of the new framework lies in its flexibility of accommodating a dynamic covariate effect, which is achieved through adapting the spirit of globally concerned quantile regression. Our testing procedures are conveniently developed based on existing results on fitting a working quantile regression model with randomly censored data. It is important to note that the validity of the testing procedures does not require that the working model is the true model. Moreover, the proposed methods can be readily extended to handle more complex survival outcomes, such as time to event subject to competing risks.
As suggested by one referee, we would like to point out that QT(τ|Z̃) = QT(τ) holding for all τ ∈ (0, 1) implies the statistical independence between T and Z̃. Nevertheless, in this work, we confine our attention to H̃0 with τU less than 1. This is because right censoring typically precludes information on the upper tail of the distribution of T, and thus QT(τ) or QT(τ|Z̃) can become non-identifiable as τ approaches 1. The null hypothesis H̃0 entails a weaker version of the independence between T and Z̃ that can be better assessed with right censored data. Rejecting H̃0 can provide evidence for the dependence between T and Z̃, while accepting H̃0 may not sufficiently indicate the independence between T and Z̃.
Another worthwhile extension of this work is to generalize the current null hypothesis and testing procedures to permit evaluating multiple prognostic factors simultaneously. This work also lays a key foundation for developing a nonparametric screening method to help identify useful prognostic factors among a large number of candidates. These extensions will be reported in separate work.
Appendix
Appendix A: Lemma 1 and its proof
Lemma 1 Suppose the conditional distribution function of T given Z̃ is continuous and strictly monotone for all possible values of Z̃. Then QT(τ|Z̃) = QT(τ) for τ ∈ [τL, τU] is equivalent to model (1) holding with Δ = [τL, τU] and θ0(1)(τ) = 0 for τ ∈ [τL, τU].
Proof for Lemma 1. Suppose we have QT(τ|Z̃) = QT(τ) for τ ∈ [τL, τU]. It is clear that for τ ∈ [τL, τU], we can write QT(τ|Z̃) = exp{Z⊤θ0(τ)} with θ0(τ) = (log QT(τ), 0)⊤. This means that model (1) holds with Δ = [τL, τU] and θ0(1)(τ) = 0 for τ ∈ [τL, τU].

Conversely, suppose model (1) holds with Δ = [τL, τU] and θ0(1)(τ) = 0 for τ ∈ [τL, τU]. This means QT(τ|Z̃) = exp{θ0(0)(τ)} for τ ∈ [τL, τU]. Given that the conditional distribution function of T given Z̃ is continuous and strictly monotone, it follows from the definition of QT(τ|Z̃) that Pr(T ≤ exp{θ0(0)(τ)}|Z̃) = τ for τ ∈ [τL, τU]. Taking expectation on both sides of this equality with respect to Z̃, we then get Pr(T ≤ exp{θ0(0)(τ)}) = τ for τ ∈ [τL, τU]. Given the continuity and strict monotonicity of the distribution function of T, which is implied by the continuity and strict monotonicity of the conditional distribution function of T given Z̃, this implies that QT(τ) = exp{θ0(0)(τ)}. Thus, QT(τ|Z̃) = QT(τ) for τ ∈ [τL, τU]. This completes the proof of Lemma 1.
Appendix B: Asymptotic properties of θ̂(τ) without assuming model (1)
We assume the following regularity conditions:

- (C1) There exists a constant v such that Pr(C = v) > 0 and Pr(C > v) = 0.
- (C2) Z is uniformly bounded, i.e. there exists a constant c > 0 such that Pr(∥Z∥ ≤ c) = 1.
- (C3) (i) θ̄(τ) is Lipschitz continuous for τ ∈ [τL, τU]; (ii) f(y|z) is bounded above uniformly in y and z, where f(y|z) denotes the conditional density of log X given Z = z.
- (C4) For some ρ0 > 0 and c0 > 0, infb∈ℬ(ρ0) eigminA(b) ≥ c0, where ℬ(ρ0) = {b : infτ∈[τL,τU] ∥b − θ̄(τ)∥ ≤ ρ0} and A(b) = E[ZZ⊤f(Z⊤b|Z)]. Here ∥ · ∥ is the Euclidean norm and eigminA(b) represents the minimal eigenvalue of A(b).
Condition (C1) is adopted to simplify the theoretical arguments needed to ensure that Ĝ(·) is uniformly consistent for G(·). This condition is usually satisfied in studies subject to administrative censoring. Condition (C2) imposes the boundedness of covariates. Condition (C3) assumes that the limit coefficient process is smooth and that the conditional density is bounded. Condition (C4) requires that the asymptotic limit of Un(b, τ) be strictly convex in a neighborhood of θ̄(τ) for τ ∈ [τL, τU], implying the uniqueness of the solution to μ(b, τ) ≡ E[Z{I(log T ≤ Z⊤b) − τ}] = 0. This plays a critical role in establishing the uniform convergence of θ̂(τ) to θ̄(τ).
Theorem A1 Under regularity conditions (C1)–(C4), we have supτ∈[τL,τU] ∥θ̂(τ) − θ̄(τ)∥ →p 0.

Theorem A2 Under regularity conditions (C1)–(C4), n1/2{θ̂(τ) − θ̄(τ)} converges weakly to a mean zero Gaussian process for τ ∈ [τL, τU] with covariance function Σ(τ′, τ) = E{ξ̄1(τ′)ξ̄1(τ)⊤}.
The proofs of Theorems A1 and A2 closely resemble the proofs in Peng and Fine (2009) and thus are omitted.
Appendix C: Proofs of Theorems 1 and 2
We assume one additional regularity condition:
- (C5) infτ∈[τL,τU] σ(1)(τ) > 0, where {σ(1)(τ)}2 is the second diagonal element of Σ(τ, τ).
Proof of Theorem 1
Following the lines of Peng and Fine (2009), we can show that the sample-based variance estimation procedure presented in Section 2.1 provides consistent variance estimation, which implies supτ∈[τL,τU] ∥Vn(τ) − Σ(τ, τ)∥ →p 0.
Note that under the null hypothesis H̃0, we have θ̄(1)(τ) = 0 for τ ∈ [τL, τU] and consequently,

n1/2θ̂(1)(τ) = n1/2{θ̂(1)(τ) − θ̄(1)(τ)}, τ ∈ [τL, τU]. (5)

By Theorem A2, n1/2{θ̂(1)(τ) − θ̄(1)(τ)} converges weakly to a mean zero Gaussian process 𝒳(1)(τ) with covariance process Σ(1,1)(τ′, τ), where Σ(1,1)(τ′, τ) denotes the element in the second row and the second column of Σ(τ′, τ). In addition, condition (C5) and supτ∈[τL,τU] ∥Vn(τ) − Σ(τ, τ)∥ →p 0 imply supτ∈[τL,τU] |σ̂(1)(τ) − σ(1)(τ)| →p 0. Applying the result of Theorem A2 and Slutsky’s Theorem (Example 1.4.7 in Boucheron et al. (2013)) to (5), we then get that n1/2θ̂(1)(τ)/σ̂(1)(τ) converges weakly to 𝒳(1)(τ)/σ(1)(τ) in l∞(ℱT), where l∞(S) is the collection of all bounded functions f : S → ℝ for an index set S, and ℱT = [τL, τU]. Then, by the extended continuous mapping theorem (Theorem 1.11.1 in van der Vaart and Wellner (1996)), we can establish the limiting null distributions of Tsup and Tint as

Tsup →d supτ∈[τL,τU] |𝒳(1)(τ)|/σ(1)(τ) and Tint →d ∫τLτU |𝒳(1)(τ)|/σ(1)(τ) dτ.
This completes the proof of Theorem 1.
Proof of Theorem 2
We first investigate the asymptotic limit of Tsup under the alternative hypothesis Ha,1. Simple algebra shows that

Tsup ≥ supτ∈[τL,τU] n1/2|θ̄(1)(τ)|/σ̂(1)(τ) − supτ∈[τL,τU] n1/2|θ̂(1)(τ) − θ̄(1)(τ)|/σ̂(1)(τ).

By the extended continuous mapping theorem, we can show that supτ∈[τL,τU] n1/2|θ̂(1)(τ) − θ̄(1)(τ)|/σ̂(1)(τ) converges in distribution to supτ∈[τL,τU] |𝒳(1)(τ)|/σ(1)(τ) and thus is Op(1). At the same time, given supτ∈[τL,τU] |σ̂(1)(τ) − σ(1)(τ)| →p 0, under condition (C5), we get supτ∈[τL,τU] |θ̄(1)(τ)|/σ̂(1)(τ) →p ν0, where ν0 ≡ supτ∈[τL,τU] |θ̄(1)(τ)|/σ(1)(τ).

Under the alternative hypothesis Ha,1 and condition (C5), we have ν0 > 0, and hence supτ∈[τL,τU] n1/2|θ̄(1)(τ)|/σ̂(1)(τ) →p ∞ as n → ∞. Combining these facts with the displayed inequality above, it follows that Pr(Tsup > a) → 1 as n → ∞ for any a > 0 under the alternative hypothesis Ha,1. This immediately implies that Tsup is a consistent test against Ha,1 because Pr(Tsup > Csup,α) → 1 as n → ∞ given Ha,1 holds, where Csup,α denotes the α-level critical value determined upon the limit null distribution of Tsup, which is greater than 0.
Next, we consider Tint under the alternative hypothesis Ha,2. Write Tint as

Tint ≥ ∫τLτU n1/2|θ̄(1)(τ)|/σ̂(1)(τ) dτ − ∫τLτU n1/2|θ̂(1)(τ) − θ̄(1)(τ)|/σ̂(1)(τ) dτ.

By the continuous mapping theorem, combined with supτ∈[τL,τU] |σ̂(1)(τ) − σ(1)(τ)| →p 0 and condition (C5), we get ∫τLτU |θ̄(1)(τ)|/σ̂(1)(τ) dτ →p ν1, where ν1 ≡ ∫τLτU |θ̄(1)(τ)|/σ(1)(τ) dτ, and ∫τLτU n1/2|θ̂(1)(τ) − θ̄(1)(τ)|/σ̂(1)(τ) dτ converges in distribution to ∫τLτU |𝒳(1)(τ)|/σ(1)(τ) dτ and thus is Op(1). By condition (C5), the alternative hypothesis Ha,2 implies ν1 > 0. Then, following the same arguments used for Tsup, we can prove that Pr(Tint > a) → 1 as n → ∞ for any a > 0 under Ha,2. Therefore, Tint is a consistent test against the alternative hypothesis Ha,2.
Appendix D: Justification for the proposed resampling procedure
Given the observed data, denoted by 𝒟 = {(Xi, δi, Z̃i)}i=1n, since ζ1, …, ζn are i.i.d. standard normal random variables, we have

E{n−1/2 Σi=1n ζiξ̂i(τ) | 𝒟} = 0 and Cov{n−1/2 Σi=1n ζiξ̂i(τ′), n−1/2 Σi=1n ζiξ̂i(τ) | 𝒟} = n−1 Σi=1n ξ̂i(τ′)ξ̂i(τ)⊤.

By the arguments of Lin et al. (1993), the conditional distribution of n−1/2 Σi=1n ζiξ̂i(1)(τ) given 𝒟 converges weakly to 𝒳(1)(τ), the same limit as that of n1/2{θ̂(1)(τ) − θ̄(1)(τ)}, for almost all realizations of 𝒟. Applying the extended continuous mapping theorem as in the proof of Theorem 2, we have that under H̃0, the conditional distribution of Tsup(b) (or Tint(b)) given the observed data is asymptotically equivalent to the unconditional distribution of Tsup (or Tint). This justifies using the resampling procedure in Section 2.3 to obtain the p values of the proposed tests.
Appendix E: Additional simulation results
Table A.1. Empirical rejection rates based on uncensored data (1000 simulations).

Set-up | n | GST | GIT | CQR (Wald), τ = 0.4 | CQR (Wald), τ = 0.5 | CQR (Wald), τ = 0.6 | CPH (Wald)
---|---|---|---|---|---|---|---
I | 200 | 0.098 | 0.070 | 0.055 | 0.052 | 0.056 | 0.048 |
400 | 0.093 | 0.075 | 0.069 | 0.064 | 0.060 | 0.047 | |
800 | 0.076 | 0.058 | 0.053 | 0.053 | 0.048 | 0.061 | |
II | 200 | 0.215 | 0.156 | 0.104 | 0.108 | 0.108 | 0.121 |
400 | 0.275 | 0.216 | 0.162 | 0.156 | 0.139 | 0.183 | |
800 | 0.420 | 0.372 | 0.276 | 0.265 | 0.238 | 0.328 | |
III | 200 | 0.541 | 0.478 | 0.344 | 0.374 | 0.337 | 0.456 |
400 | 0.790 | 0.771 | 0.589 | 0.595 | 0.590 | 0.745 | |
800 | 0.958 | 0.961 | 0.883 | 0.886 | 0.873 | 0.963 | |
IV | 200 | 0.378 | 0.250 | 0.074 | 0.045 | 0.049 | 0.060 |
400 | 0.656 | 0.476 | 0.101 | 0.056 | 0.055 | 0.049 | |
800 | 0.935 | 0.808 | 0.118 | 0.034 | 0.045 | 0.085 | |
V | 200 | 0.618 | 0.452 | 0.106 | 0.057 | 0.121 | 0.428 |
400 | 0.939 | 0.828 | 0.169 | 0.071 | 0.165 | 0.737 | |
800 | 1.000 | 0.994 | 0.255 | 0.041 | 0.313 | 0.968 | |
VI | 200 | 0.729 | 0.543 | 0.095 | 0.047 | 0.088 | 0.228 |
400 | 0.971 | 0.898 | 0.154 | 0.048 | 0.097 | 0.446 | |
800 | 1.000 | 0.995 | 0.243 | 0.020 | 0.154 | 0.756 |
Table A.2. Empirical rejection rates of GST and GIT with 𝒰 set as {1,…, 3}, {1,…, 6}, and {1,…, 12} (15% censoring).

Set-up | n | GST, 𝒰 = {1,…, 3} | GIT, 𝒰 = {1,…, 3} | GST, 𝒰 = {1,…, 6} | GIT, 𝒰 = {1,…, 6} | GST, 𝒰 = {1,…, 12} | GIT, 𝒰 = {1,…, 12}
---|---|---|---|---|---|---|---
I | 200 | 0.128 | 0.074 | 0.091 | 0.067 | 0.092 | 0.065 |
400 | 0.126 | 0.079 | 0.086 | 0.067 | 0.081 | 0.063 | |
800 | 0.112 | 0.060 | 0.080 | 0.059 | 0.072 | 0.058 | |
II | 200 | 0.287 | 0.178 | 0.228 | 0.161 | 0.225 | 0.158 |
400 | 0.359 | 0.231 | 0.283 | 0.218 | 0.256 | 0.206 | |
800 | 0.472 | 0.379 | 0.415 | 0.369 | 0.376 | 0.361 | |
III | 200 | 0.666 | 0.549 | 0.593 | 0.510 | 0.585 | 0.513 |
400 | 0.841 | 0.789 | 0.779 | 0.773 | 0.761 | 0.757 | |
800 | 0.975 | 0.964 | 0.956 | 0.956 | 0.940 | 0.957 | |
IV | 200 | 0.427 | 0.257 | 0.364 | 0.242 | 0.362 | 0.243 |
400 | 0.702 | 0.490 | 0.666 | 0.470 | 0.649 | 0.471 | |
800 | 0.952 | 0.808 | 0.942 | 0.811 | 0.936 | 0.808 | |
V | 200 | 0.695 | 0.478 | 0.649 | 0.452 | 0.649 | 0.446 |
400 | 0.962 | 0.850 | 0.948 | 0.831 | 0.944 | 0.827 | |
800 | 1.000 | 0.994 | 1.000 | 0.991 | 1.000 | 0.993 | |
VI | 200 | 0.768 | 0.558 | 0.723 | 0.534 | 0.726 | 0.535 |
400 | 0.981 | 0.902 | 0.971 | 0.894 | 0.970 | 0.892 | |
800 | 1.000 | 0.997 | 1.000 | 0.997 | 1.000 | 0.998 |
References
- Bellera CA, MacGrogan G, Debled M, de Lara CT, Brouste V, and Mathoulin-Pélissier S (2010), “Variables with time-varying effects and the Cox model: some statistical concepts illustrated with a prognostic factor study in breast cancer,” BMC Medical Research Methodology, 10, 1–12.
- Boucheron S, Lugosi G, and Massart P (2013), Concentration Inequalities: A Nonasymptotic Theory of Independence, Oxford University Press.
- Cox DR and Oakes D (2018), Analysis of Survival Data, Chapman and Hall/CRC.
- Darling DA (1957), “The Kolmogorov-Smirnov, Cramér-von Mises tests,” The Annals of Mathematical Statistics, 28, 823–838.
- Dickson ER, Grambsch PM, Fleming TR, Fisher LD, and Langworthy A (1989), “Prognosis in primary biliary cirrhosis: model for decision making,” Hepatology, 10, 1–7.
- Grambsch PM and Therneau TM (1994), “Proportional hazards tests and diagnostics based on weighted residuals,” Biometrika, 81, 515–526.
- Huang Y (2010), “Quantile calculus and censored regression,” The Annals of Statistics, 38, 1607–1637.
- Kleinbaum DG and Klein M (2010), Survival Analysis, vol. 3, Springer.
- Koenker R (2005), Quantile Regression, vol. 38, Cambridge University Press.
- Koenker R (2022), “quantreg: Quantile Regression,” R package version 5.87.
- Koenker R and Bassett G (1978), “Regression quantiles,” Econometrica, 46, 33–50.
- Kutner NG, Clow PW, Zhang R, and Aviles X (2002), “Association of fish intake and survival in a cohort of incident dialysis patients,” American Journal of Kidney Diseases, 39, 1018–1024.
- Li R and Peng L (2014), “Varying coefficient subdistribution regression for left-truncated semi-competing risks data,” Journal of Multivariate Analysis, 131, 65–78.
- Lin DY, Wei LJ, and Ying Z (1993), “Checking the Cox model with cumulative sums of martingale-based residuals,” Biometrika, 80, 557–572.
- Peng L (2021), “Quantile regression for survival data,” Annual Review of Statistics and Its Application, 8, 413–437.
- Peng L and Fine JP (2009), “Competing risks quantile regression,” Journal of the American Statistical Association, 104, 1440–1453.
- Peng L and Huang Y (2008), “Survival analysis with quantile regression models,” Journal of the American Statistical Association, 103, 637–649.
- Portnoy S (2003), “Censored regression quantiles,” Journal of the American Statistical Association, 98, 1001–1012.
- Powell JL (1986), “Censored regression quantiles,” Journal of Econometrics, 32, 143–155.
- Thorogood J, Persijn G, Schreuder GM, D’Amaro J, Zantvoort F, van Houwelingen J, and van Rood J (1990), “The effect of HLA matching on kidney graft survival in separate posttransplantation intervals,” Transplantation, 50, 146–150.
- van der Vaart AW and Wellner JA (1996), Weak Convergence and Empirical Processes: With Applications to Statistics, Springer Series in Statistics, Springer.
- Verweij PJ and van Houwelingen HC (1995), “Time-dependent effects of fixed covariates in Cox regression,” Biometrics, 51, 1550–1556.
- Wang H and Wang L (2009), “Locally weighted censored quantile regression,” Journal of the American Statistical Association, 104, 1117–1128.
- Ying Z, Jung S, and Wei L (1995), “Survival analysis with median regression models,” Journal of the American Statistical Association, 90, 178–184.
- Zheng Q, Peng L, and He X (2015), “Globally adaptive quantile regression with ultra-high dimensional data,” The Annals of Statistics, 43, 2225–2258.
- Zhou L (2006), “A simple censored median regression estimator,” Statistica Sinica, 16, 1043–1058.