Semiparametric regression of panel count data with informative terminal event

XIANGBIN HU; LI LIU; YING ZHANG; XINGQIU ZHAO

doi:10.3150/22-bej1565

. Author manuscript; available in PMC: 2025 Apr 29.

Published in final edited form as: Bernoulli (Andover). 2023 Aug 22;29(4):2828–2853. doi: 10.3150/22-bej1565

Semiparametric regression of panel count data with informative terminal event

XIANGBIN HU ^1,^a, LI LIU ^2,^c, YING ZHANG ^3,^d, XINGQIU ZHAO ^1,^b

PMCID: PMC12040413 NIHMSID: NIHMS2018374 PMID: 40303898

Abstract

We study a semiparametric model for robust analysis of panel count data with an informative terminal event. To explore the explicit effect of the terminal event on recurrent events of interest, we propose a conditional mean model for a reversed counting process anchoring at the terminal event. Treating the distribution function of the terminal event as a nuisance functional parameter, we develop a predicted least squares-based two-stage estimation procedure with the spline-based sieve estimation technique, and derive the convergence rate of the proposed estimator. Furthermore, overcoming the difficulties caused by the convergence rate slower than $1 ∕ \sqrt{n}$ , we establish the asymptotic normality for the estimator of the finite-dimensional parameter and a functional of the estimator of the infinite-dimensional parameter. The proposed method is evaluated through extensive simulation studies and illustrated with an application to the Longitudinal Healthy Longevity Survey study on elder people in China.

Keywords: Asymptotic normality, counting process, empirical process, panel count data, predicted least squares, terminal event, two-stage estimation

1. Introduction

In many longitudinal follow-up studies, the observations of recurrent events usually occur at some random discrete time points, and only the event counts between the adjacent observation times are possibly recorded. Such data are referred to as panel count data (Kalbfleisch and Lawless, 1985). We take the number of serious diseases in a dataset of the Chinese Longitudinal Healthy Longevity Survey (CLHLS) (Zeng et al., 2017) as an example. In this study, the population-based survey on individuals who were at least 65 years old started in 1998 followed by six other waves in 2000, 2002, 2005, 2008, 2011, and 2014. During this longitudinal survey, the individuals were reached out for the information of severe illness since the last survey. For each individual, the survey dates were different, and the occurrences of severe illness between two adjacent survey dates were recorded resulting in panel count data on the counting process of occurrences of severe illness.

There were many studies for analysis of panel count data. Using the isotonic regression, Sun and Kalbfleisch (1995) first investigated the nonparametric estimation for the mean function of the counting process with panel count data. Wellner and Zhang (2000) and Lu, Zhang and Huang (2007) proposed the nonparametric maximum pseudo-likelihood and maximum likelihood estimation procedures for the mean function. Considering the covariate effect, Zhang (2002) and Wellner and Zhang (2007) studied the semiparametric maximum pseudo-likelihood and maximum likelihood estimations with the unknown baseline function estimated by the step function. Introducing the monotone spline approximation, Lu, Zhang and Huang (2009) improved the convergence rate of the estimators proposed by Zhang (2002) and Wellner and Zhang (2007). Zhang (2006), Balakrishnan and Zhao (2009), Zhao and Sun (2011), and Zhao and Zhang (2017) considered the two-sample or multi-sample hypothesis test for the mean function of the counting process with panel count data. For the problem of variable selection with panel count data, Tong et al. (2009) and Zhang, Sun and Wang (2013) studied the regularized estimation with the non-concave penalty and the seamless- $L_{0}$ (SELO) penalty, respectively. Recently, the statistical estimation and analysis approaches with panel count data were summarized in Sun and Zhao (2013) and Chiou et al. (2019).

In the above studies of panel count data, researchers modeled the counting process in a forward manner with the event count being 0 at the starting time of the study. However, this approach is not practically convenient when one is particularly interested in exploring the recurrent events near an informative terminal event, such as the severe disease profiles near death as exemplified in CLHLS study. In the early studies, the frailty model (Sun, Tong and He, 2007; Sun et al., 2012; Zhao, Li and Sun, 2013a; 2013b; Zhou et al., 2017) was the most commonly used method to describe the effect of an informative terminal event. This model introduced a latent frailty variable to characterize the correlation between the counting process and the terminal event and supposed that they were conditionally independent given the latent variable. In the CLHLS study, elder people tended to encounter more serious diseases before death, and a high incidence of severe illness can, in turn, lead them to death. However, in the frailty model, the frailty, the latent random variable while useful to acknowledge the association between the recurrent event process and the terminal event, lacks an explicit interpretation of the relationship between them. Recently, to evaluate the rate of recurrent events prior to the terminal events, Chan and Wang (2010) considered the time-backward processes, which started at the terminal event time and counted in a time-going-backward way. Then their model can be used to directly illustrate the effect of the terminal event through an unspecified nonparametric function. Recognizing the special data structure, the backward model for longitudinal data with the starting time anchoring at the informative terminal event seems more relevant in such applications. Treating the terminal event time as a fixed effect covariate and following the backward model in Chan and Wang (2010), Li et al. (2017) studied the semiparametric model for longitudinal data with an informative terminal event, and Shen et al. (2021) further investigated the model with an additional effect of the number of observations. Kong et al. (2018) established the asymptotic properties of the backward semiparametric model for longitudinal data with an informative terminal event. Li et al. (2018) also treated the terminal event time in the longitudinal data as a covariate, and they proposed a more general two-dimensional nonparametric function to represent the effect of the terminal event. To the best of our knowledge, there was no study for the backward semiparametric model with panel count data. For the backward nonparametric model, Liu et al. (2022) extended the method of Kong et al. (2018) to model panel count data with an informative terminal event backwards. They proposed a two-stage spline-based sieve nonparametric maximum likelihood estimation procedure for the inference of underlying recurrent events with panel count data associated with an informative right-censored terminal event.

This paper studies a semiparametric regression model for panel count data with an informative terminal event, where the stochastic mechanism for the underlying counting process arising from recurrent events is completely unspecified. We use the monotone I-spline function to approximate the unknown mean function of the process, and propose a least squares-based two-stage estimation by treating the distribution of the terminal event as a nuisance functional parameter. In stage 1, we estimate the conditional distribution of the terminal event given covariates under a conventional semiparametric model such as the Cox model (Cox, 1972; Breslow, 1972). In stage 2, we construct a loss function based on predicted least squares for estimation of the reversed counting process model. Furthermore, we establish the asymptotic properties of the proposed two-stage estimator.

The rest of this paper is organized as follows. In Section 2, we introduce the semiparametric model and the loss function for model estimation. We establish the consistency, the convergence rate, and the asymptotic normality of the proposed estimator in Section 3. Simulation and application results are reported in Sections 4 and 5, respectively. Some concluding remarks are made in Section 6. All technical proofs are given in the Appendix.

2. Model setting and estimation procedure

Denote the number of recurrent events of interest occurred up to time $t$ by the counting process ${N (t) : 0 \leq t \leq τ}$ , where $τ$ is a fixed time point. Set $K$ to be the total number of observations and $T = (T_{1}, \dots, T_{K})$ to be the observation times of $N (t)$ . Then the observed panel counts on the counting process are represented by $N = (N (T_{1}), \dots, N (T_{K}))$ . Let $U$ and $C$ denote the terminal event time and the censoring time, respectively. We can only observe $Y = U \land C$ and $Δ = 1_{{U \subset C}}$ . Let $Z$ be a covariate vector associated with recurrent events and the terminal event. Let $X = (Y, Δ, K, T, N, Z)$ . Then for a panel count data study with $n$ subjects, the observed data consist of $X = {X_{i}, i = 1, \dots, n}$ , where $X_{i} = (Y_{i}, Δ_{i}, K_{i}, T_{i}, N_{i}, Z_{i})$ with $T_{i} = (T_{i, 1}, \dots, T_{i, K_{i}})$ and $N_{i} = {N_{i} (T_{i, 1}), \dots, N_{i} (T_{i, K_{i}})}$ for $i = 1, \dots, n$ .

Setting the number of recurrent events from time $t$ to the terminal event $U$ to be $\tilde{N} (t; U)$ , we suppose that given the covariate and the terminal event time, the conditional expectation of $\tilde{N} (t; U)$ is

E (\tilde{N} (t; U) ∣ U = u, Z = z) = e^{β^{T} z} Λ (u - t), 0 \leq t \leq u \leq τ,

(1)

where $Λ$ is a non-negative and non-decreasing function with $Λ (0) = 0$ . Noting that $N (t_{2}) - N (t_{1}) = \tilde{N} (t_{1}; U) - \tilde{N} (t_{2}; U)$ for all $0 \leq t_{1} \leq t_{2} \leq u \leq τ$ , then

E ((N (t_{2}) - N (t_{1})) ∣ U = u, Z = z) = e^{β^{T} z} (Λ (u - t_{1}) - Λ (u - t_{2})) .

(2)

Suppose that the conditional distribution function of $U$ given $Z$ satisfies the Cox model

F (u ∣ Z = z) = P (U \leq u ∣ Z = z) = 1 - e^{- H (u) e^{γ^{T} z}},

(3)

where $H (u)$ is the baseline cumulative hazard function of $U$ . Then the unknown parameters and functions to be estimated under models (1) and (3) are ( $β$ , $Λ$ ; $H$ , $γ$ ). Although the covariates associated with the terminal event may not be the same as the covariates associated with the counting process, we would like to include all the potential covariates $Z$ for exploring their effects on both the counting process and terminal events and for the sake of easy notation. That is, we denote the union of covariates associated with the counting process and the terminal event by $Z$ . In models (1) and (3), the coefficients for unrelated covariates are 0. We need the following basic assumptions before the analysis: (i) given $Z$ , $C$ and $U$ are independent; (ii) given $Z$ , $C$ is noninformative to $Λ$ ; and (iii) given ( $Y$ , $Δ$ , $Z$ ), ( $K$ , $T$ ) is noninformative to $Λ$ . In the following, we let $𝒫$ and $P_{n}$ denote the probability measure and the empirical measure, respectively.

Set $Δ N_{j} = N (T_{j}) - N (T_{j - 1})$ and $Δ Λ_{j} (u) = Λ (u - T_{j - 1}) - Λ (u - T_{j})$ for $j = 1, \dots, K$ with $T_{0} = 0$ . To use the least squares approach, we consider

P_{n} [\sum_{j = 1}^{K} {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (U)}^{2}] = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{K_{i}} {Δ N_{i, j} - e^{β^{T} Z_{i}} Δ Λ_{i, j} (U_{i})}^{2} .

However, some $U_{i}$ are unknown due to censoring. We turn to consider the predicted least squares as a loss function. Define

m (β, Λ, F; X) ≔ E [\sum_{j = 1}^{K} {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (U)}^{2} ∣ Y, Δ, K, T, N, Z] = \sum_{j = 1}^{K} [Δ {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (Y)}^{2} + \frac{1 - Δ}{1 - F (Y ∣ Z)} \int_{Y}^{\infty} {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (u)}^{2} d F (u ∣ Z)] .

(4)

By (4), we propose the empirical predicted least squares-based loss function as

ℓ_{n} (β, Λ, F; X) = P_{n} m (β, Λ, F; X) = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{K_{i}} Δ_{i} {Δ N_{i, j} - e^{β^{T} Z_{i}} Δ Λ_{i, j} (Y_{i})}^{2} + \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{K_{i}} \frac{1 - Δ_{i}}{1 - F (Y_{i} ∣ Z_{i})} \int_{Y_{i}}^{\infty} {Δ N_{i, j} - e^{β^{T} Z_{i}} Δ Λ_{i, j} (u)}^{2} d F (u ∣ Z_{i}),

(5)

where $Δ N_{i, j} = N_{i} (T_{i, j}) - N_{i} (T_{i, j - 1})$ and $Δ Λ_{i, j} (Y_{i}) = Λ (Y_{i} - T_{i, j - 1}) - Λ (Y_{i} - T_{i, j})$ .

Replacing $F (u ∣ Z_{i})$ by $1 - \exp {- H (u) \exp (γ^{T} Z_{i})}$ , a reasonable estimator is the minimizer of the loss function (5). Nevertheless, since the loss function consists of the unknown distribution function $F$ , it is difficult to obtain the minimizer directly. Then we consider the two-stage estimation procedure by treating $F$ as the nuisance functional parameter. In stage 1, we estimate $γ$ and $H$ by the partial likelihood estimator ${\hat{γ}}_{n}$ and the Breslow estimator ${\hat{H}}_{n}$ (Breslow, 1972), respectively. Then we obtain the estimator of the conditional distribution function of the terminal event $\hat{F} (u ∣ z) = 1 - \exp {- {\hat{H}}_{n} (u) \exp ({\hat{γ}}_{n}^{T} z)}$ . In stage 2, replacing $F$ by ${\hat{F}}_{n}$ in (5), ( ${\hat{β}}_{n}$ , ${\hat{Λ}}_{n}$ ) is obtained by minimizing the loss function $ℓ_{n} (β, Λ, {\hat{F}}_{n}; X)$ .

To estimate $Λ$ , we use the monotone I-spline function approximation. To this end, we divide [0, $τ$ ] into $m_{n} + 1$ subintervals by $0 = t_{1} = \dots = t_{d} < t_{d + 1} < \dots < t_{m_{n} + d} < t_{m_{n} + d + 1} = \dots = < t_{m_{n} + 2 d} = τ$ with knots ${t_{i} : i = 1, \dots, m_{n} + 2 d}$ , where $d$ represents the order of I-spline functions. Let the I-spline basis functions be ${I_{l} (s), l = 1, \dots, q_{n}}$ , where $q_{n} = m_{n} + d$ . Then we define the functional space for $Λ$ :

Φ_{n} = {\sum_{l = 1}^{q_{n}} α_{l} I_{l} (s) : α_{l} \geq 0, l = 1, \dots, q_{n}} .

Define $I (s) = {(I_{1} (s), \dots, I_{q_{n}} (s))}^{T}$ and $α = {(α_{1}, \dots, α_{q_{n}})}^{T}$ , and replace $Λ (s)$ by $I {(s)}^{T} α$ . Then we can minimize the loss function by the constrained BFGS algorithm (Lange, 2001). Setting the minimizer of the loss function to be $({\hat{β}}_{n}, {\hat{α}}_{n})$ , the spline estimator of $Λ (s)$ is ${\hat{Λ}}_{n} (s) = I {(s)}^{T} {\hat{α}}_{n}$ .

3. Asymptotic properties

In this section, we establish the asymptotic properties of $({\hat{β}}_{n}, {\hat{Λ}}_{n})$ . First, we define the following function classes

ℋ_{r} = {g : ∣ g^{(r - 1)} (s) - g^{(r - 1)} (t) ∣ \leq c_{0} ∣ s - t ∣ for all 0 \leq s, t \leq τ}, Φ = {Λ \in ℋ_{r} : Λ is a nondecreasing continuous function on [0, τ] with Λ (0) = 0}, ℱ = {F : F (\cdot ∣ z) is a distribution function on [0, \infty) for z \in 𝒵},

where $g^{(r)}$ is the $r$ derivative of $g$ for $r \geq 1$ and $𝒵 \subset R^{p}$ . For a bounded and convex set $ℛ \subset R^{p}$ , denote the interior of $ℛ$ by $ℛ^{\circ}$ . Set $F_{Z}$ to be the distribution function of $Z$ with a bounded support $𝒵$ , and $(β_{0}, Λ_{0}, F_{0}) \in ℛ^{\circ} \times Φ \times ℱ$ to be the true value of ( $β$ , $Λ$ , $F$ ). Let $ℬ$ and $ℬ^{p}$ be the collection of Borel sets in $R$ and $R^{p}$ , respectively. Then for $B_{1}$ , $B_{2} \in ℬ_{[0, τ]} ≔ {B \cap [0, τ] : B \in ℬ}$ and $C \in ℬ^{p}$ , we define

μ_{1} (B_{1} \times B_{2} \times C) = \int_{C} \int \sum_{k = 1}^{\infty} P (K = k ∣ U = u, Z = z) \times \sum_{j = 1}^{k} P ((u - T_{j}) \in B_{1}, (u - T_{j - 1}) \in B_{2} ∣ K = k, U = u, Z = z) d F_{0} (u ∣ z) d F_{Z} (z), μ_{2} (B_{1} \times B_{2}) = μ_{1} (B_{1} \times B_{2} \times R^{p}) .

Setting $Δ Λ (s_{1}, s_{2}) = Λ (s_{2}) - Λ (s_{1})$ , for any functions $Λ_{1}$ , $Λ_{2} \in Φ$ , we define the metric

d_{1}^{2} (Λ_{1}, Λ_{2}) = {∣ ∣ Δ Λ_{1} (s_{1}, s_{2}) - Δ Λ_{2} (s_{1}, s_{2}) ∣ ∣}_{L_{2} (μ_{2})}^{2} = E [\sum_{j = 1}^{K} {(Δ Λ_{1, j} (U) - Δ Λ_{2, j} (U))}^{2}] = E [\sum_{j = 1}^{K} {Δ {(Δ Λ_{1, j} (Y) - Δ Λ_{2, j} (Y))}^{2} + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {(Δ Λ_{1, j} (u) - Δ Λ_{2, j} (u))}^{2} d F_{0} (u ∣ Z)}] .

For any functions $F_{1}$ , $F_{2} \in ℱ$ , we define the metric

d_{2} (F_{1}, F_{2}) = sup_{u, z} ∣ F_{1} (u ∣ z) - F_{2} (u ∣ z) ∣ .

For any ( $β_{1}$ , $Λ_{1}$ ) and ( $β_{2}$ , $Λ_{2}$ ) in the space $ℛ \times Φ$ , we define the metric

d_{3} ((β_{1}, Λ_{1}), (β_{2}, Λ_{2})) = {{‖ β_{1} - β_{2} ‖}_{2}^{2} + d_{1}^{2} (Λ_{1}, Λ_{2})}^{1 ∕ 2} .

To establish the asymptotic properties of the proposed estimator, we need the following regularity conditions.

(C1) $0 < Λ_{0} (τ) < \infty$ .

(C2) The true values of $γ$ and $H$ satisfy $γ_{0} \in ℛ^{\circ}$ and $H_{0} (τ) < \infty$ , respectively. Furthermore, the derivative of $H_{0} (u)$ has a uniform positive lower bound for all $u \in [M_{1}, τ]$ , where $M_{1} < τ$ is a constant representing the minimum value of the support of $U$ .

(C3) $E [{N (T_{K})}^{2}] < \infty$ .

(C4) The probability of censoring $ϱ = P (Y < U)$ satisfies that $0 < ϱ < 1$ .

(C5) The measure $μ_{2} \times F_{Z}$ is absolutely continuous with respect to $μ_{1}$ .

(C6) $P (a^{T} Z \neq c) > 0$ for all $a \neq 0 \in R^{p}$ and for all $c \in R$ .

(C7) There is a constant $M_{2} > 0$ such that $P (K \leq M_{2}) = 1$ .

(C8) The number of subinterval in [0, $τ$ ] satisfies $m_{n} = O (n^{ν})$ for $0 < ν < 1 ∕ 2$ . Furthermore,

\max_{d + 1 \leq i \leq m_{n} + d + 1} ∣ t_{i} - t_{i - 1} ∣ = O (n^{- ν}) and \frac{\max_{d + 1 \leq i \leq m_{n} + d + 1} ∣ t_{i} - t_{i - 1} ∣}{{min}_{d + 1 \leq i \leq m_{n} + d + 1} ∣ t_{i} - t_{i - 1} ∣} \leq M_{3},

uniformly for $n$ with a constant $M_{3} > 0$ .

(C9) $P (T_{j} - T_{j - 1} \geq M_{4})$ for all $j = 1, \dots, K) = 1$ and $P (Y \geq M_{4}) = 1$ with a constant $M_{4} > 0$ .

Remark 1. Conditions (C1) and (C3) are mild for the statistical analysis of panel count data. Conditions (C2) and (C4) are common in survival data analysis. According to Wellner and Zhang (2007), Conditions (C5) and (C6) are necessary for the identifiability of the semiparametric model. Condition (C7) indicates that the number of observations is bounded, which is standard in many applications for panel count data. Condition (C8) is a regularity condition for the spline approximation by Lu, Zhang and Huang (2007, 2009). By Wellner and Zhang (2007), Condition (C9) is regular in applications of panel count data, meaning that the adjacent observation times are separable.

Theorem 3.1 (Consistency). Suppose Conditions (C1)–(C9) hold. Then we have the following:

( $β_{0}$ , $Λ_{0}$ ) is the unique minimum of $𝒫 m (β, Λ, F_{0}; X)$ .
$d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) \to 0$ almost surely.

We need the following additional conditions to derive the convergence rate and establish asymptotic normality.

(C10) $\inf_{z \in 𝒵} P (U \geq τ ∣ Z = z) > 0$ and $P (C \geq τ) > 0$ .

(C11) $μ_{2}$ is absolutely continuous with respect to Lebesgue measure with a derivative ${\dot{μ}}_{2}$ , and ${\dot{μ}}_{2}$ has a uniform positive lower bound.

(C12) There is a constant $0 < M_{5} < \infty$ such that $1 ∕ M_{5} < Λ_{0}^{'} (s) < M_{5}$ for all $s \in [τ^{'}, τ]$ , where $0 < τ^{'} \leq τ$ such that $Λ_{0} (τ^{'}) > 0$ .

(C13) There is a sufficiently large constant $c$ such that $E [\exp (c N (τ)) ∣ Z]$ is uniformly bounded for $Z \in 𝒞$ .

(C14) There is a constant $η \in (0, 1)$ such that for all $a \in R^{p}$ , we have $a^{T} Var (Z ∣ S_{1}, S_{2}) a \geq η a^{T} E (Z Z^{T} ∣ S_{1}, S_{2}) a$ a.e. for ( $S_{1}$ , $S_{2}$ , $Z$ ) having the distribution $μ_{1}$ .

Remark 2. By Kong et al. (2018), Condition (C10) is necessary for the uniform weak convergence rate of ${\hat{F}}_{n}$ on a finite interval. According to Wellner and Zhang (2007) and Lu, Zhang and Huang (2009), Conditions (C11)–(C14) are common in analysis of panel count data.

Theorem 3.2 (Convergence Rate). Suppose that Conditions (C1)–(C14) hold. Then, taking $ν = 1 ∕ (1 + 2 r)$ , we have $d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) = O_{p} (n^{- r ∕ (1 + 2 r)})$ .

Remark 3. Although the overall convergence rate of ( ${\hat{β}}_{n}$ , ${\hat{Λ}}_{n}$ ) is slower than $1 ∕ \sqrt{n}$ , the convergence rate of ${\hat{β}}_{n}$ is still $1 ∕ \sqrt{n}$ , and we can also find a functional of ${\hat{Λ}}_{n}$ having the convergence rate $1 ∕ \sqrt{n}$ .

Theorem 3.3 (Asymptotic Normality). Suppose that Conditions (C1)–(C14) hold, and $Λ_{0} \in ℋ_{r}$ with $r \geq 2$ . Then we have the following:

(i)
For all $h_{1} \in ℛ$ and $h_{2} \in ℋ_{r}$ , we have

\sqrt{n} R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0}) + \sqrt{n} R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}) ⇝ N (0, σ_{0} {[h_{1}, h_{2}]}^{2}),

where $R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0})$ , $R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0})$ , and $σ_{0} {[h_{1}, h_{2}]}^{2}$ are defined in the Appendix.

(ii)
Furthermore, we have

\sqrt{n} ({\hat{β}}_{n} - β_{0}) ⇝ N (0, {(A^{*})}^{- 1} B^{*} {({(A^{*})}^{- 1})}^{T}),

where $A^{*}$ and $B^{*}$ are defined in the Appendix.

The result of the theorem can be used for making statistical inference of covariate effects on the recurrent event process.

4. Simulation studies

In this section, we conducted some simulation studies to evaluate the finite-sample performance of the proposed method. We generated the covariate vector $Z_{i} = {(Z_{i 1}, Z_{i 2})}^{T}$ by $Z_{i 1} \sim Unif (0, 1)$ and $Z_{i 2} \sim Bernoulli (0.5)$ . Given the covariate vector $Z_{i}$ , the terminal event $U_{i}$ , satisfied model (3) with $γ_{0} = {(γ_{1}, γ_{2})}^{T} = {(- 1, - 2)}^{T}$ and $H_{0} (u) = u^{2} - 6$ for $u \in [6, \infty)$ . The censoring time $C_{i}$ was generated from the Cox model with covariates $Z_{i}$ along with the coefficients ${(κ, 2 κ)}^{T}$ and the baseline cumulative hazard function $c - 6$ for $c \in [6, \infty)$ , where $κ$ was −1.611 and −0.594 to yield 20% and 40% censoring rate, respectively. Then we had $Y_{i} = U_{i} \land C_{i}$ and $Δ_{i} = 1_{{U_{i} \leq C_{i}}}$ with $τ = 10$ . For the observation time process, we generated a sequence of independent times $Δ T_{i j} \sim Unif (0.1, 3)$ for $j = 1, 2, \dots$ , and $K_{i}$ was the maximum number of $k$ such that $T_{i k} = \sum_{j = 1}^{k} Δ T_{i j} \leq Y_{i}$ . Then we obtained the observation time points ${T_{i 1}, \dots, T_{i K_{i}}}$ . Under model (1), we considered the following two different cases of $Λ_{0}$

Case 1 : Λ_{0} (s) = s and Case 2 : Λ_{0} (s) = \frac{10 s}{s + 1}

to generate the counting process $N_{i} = {N_{i} (T_{i 1}), \dots, N_{i} (T_{i K_{i}})}$ from the Poisson process with $β_{0} = {(β_{1}, β_{2})}^{T} = {(1, 1.5)}^{T}$ . That is $N_{i} (T_{i 1})$ was generated from the Poisson distribution with mean ${Λ_{0} (U_{i}) - Λ_{0} (U_{i} - T_{i 1})} \exp (β_{0} Z_{i}^{T})$ was generated from the Poisson distribution with mean ${Λ_{0} (U_{i} - T_{i (j - 1)}) - Λ_{0} (U_{i} - T_{i j})} \exp (β_{0} Z_{i}^{T})$ for $j = 2, \dots, K_{i}$ . For the knots of the I-spline basis functions, we set $t_{d + 1}, \dots, t_{d + m_{n}}$ to be the $1 ∕ (m_{n} + 1), \dots, m_{n} (m_{n} + 1)$ percentiles of ${Y_{i} - T_{i j}} : j = 1, \dots, K_{i}; i = 1, \dots, n$ with $d = 4$ and $m_{n} = [n^{1 ∕ 3}]$ . Since it was difficult to empirically estimate the asymptotic variance given in Theorem 3.3, the standard error of the proposed estimate of regression parameter $β_{0}$ was estimated based on 100 bootstrap samples. The initial value of the BFGS iteration was taken as $α = 1_{q_{n}}$ and $β = 0$ , where $1_{q_{n}}$ was the $q_{n}$ -dimensional vector with elements 1. The simulation results were summarized based on 500 replications with sample size $n = 100$ and 200.

For comparison purposes, we also applied the forward proportional mean model with panel count data (Wellner and Zhang, 2007; Lu, Zhang and Huang, 2009). We implemented the maximum pseudolikelihood spline (MPLS) and the maximum likelihood spline (MLS) by “panelReg” with methods “MPLs” and “MLs” in the R package “spef”.

We show the estimation results of $Λ$ under our reversed mean model for Case 1 in Figure 1. The plots for Case 2 are similar, and we show them in Figure 2. In those figures, the dash lines display the averages of the estimated functions, the solid lines are the true value $Λ_{0}$ for comparison, and the dotted-dash lines are the 2.5% and 97.5% pointwise percentiles of the estimated functions, which reveal the uncertainty of the estimated functions. We can see that the averages of the estimated functions are close to $Λ_{0}$ , meaning that ${\hat{Λ}}_{n}$ is consistent. The simulation results for the regression parameter $β$ based on MPLS, MLS, and our proposed model are summarized in Table 1. Note that in Case 1, we set $Λ_{0} (s) = s$ , for which the proposed model and the forward proportional mean model with panel count data (Wellner and Zhang, 2007; Lu, Zhang and Huang, 2009) are essentially the same. Hence, Table 1 shows that MPLS and MLS were valid for Case 1 as expected and had slightly better estimation efficiency than the proposed method due to the fact that no estimation for the conditional distribution function was needed. It is also interesting to note that for Case 1, the censoring rate does not seem to impact the estimation results of the counting process much. We believe it is due to the fact that our model is equivalent to the forward proportional mean model with no effect of the terminal event being considered, for which the censoring rate for the terminal event is not even relevant. However, for Case 2, the forward proportional mean model was misspecified, which resulted in biased inferences for MPLS and MLS. In addition, Table 1 also shows that the biases of our estimates are small and the sample standard deviations (SSD) are close to the estimated standard errors (ESE). Both of them decrease as the sample size increases, and they also decrease as the censoring rate decreases in Case 2. The empirical coverage probabilities (CP) of the 95% Wald confidence intervals are close to 0.95. The simulation studies provide the numerical evidence to support the asymptotic properties depicted in Section 3. It appears that the inference based on the asymptotic theory for our proposed method is valid in finite sample with moderate sample size, say $n > 100$ .

Table 1.

Simulation results for the estimation of parameter $β$ .

	Censoring rate = 20%			Censoring rate = 40%
	Proposed	MPLS	MLS	Proposed	MPLS	MLS
	Case 1, $n = 100$
Bias	(0.001,0.001)	(−0.001,−0.003)	(0.001,0.001)	(−0.001,0.001)	(−0.001,−0.001)	(−0.002,0.002)
SSD	(0.079,0.051)	(0.078,0.052)	(0.067,0.045)	(0.084,0.057)	(0.079,0.058)	(0.071,0.051)
ESE	(0.075,0.054)	(0.073,0.055)	(0.065,0.049)	(0.076,0.055)	(0.074,0.055)	(0.066,0.049)
CP	(0.934,0.946)	(0.916,0.940)	(0.938,0.956)	(0.910,0.932)	(0.922,0.914)	(0.922,0.906)
	Case 1, $n = 200$
Bias	(0.006,0.001)	(0.006,0.001)	(0.005,−0.001)	(0.001,0.001)	(0.001,−0.001)	(0.001,−0.001)
SSD	(0.052,0.038)	(0.053,0.039)	(0.047,0.034)	(0.052,0.037)	(0.050,0.041)	(0.045,0.036)
ESE	(0.052,0.038)	(0.050,0.039)	(0.045,0.035)	(0.054,0.039)	(0.052,0.039)	(0.046,0.035)
CP	(0.954,0.952)	(0.928,0.948)	(0.938,0.942)	(0.966,0.932)	(0.948,0.912)	(0.962,0.932)
	Case 2, $n = 100$
Bias	(0.003,0.015)	(−0.148,−0.256)	(−0.172,−0.305)	(−0.002,0.007)	(−0.155,−0.256)	(−0.176,−0.300)
SSD	(0.111,0.080)	(0.176,0.102)	(0.191,0.098)	(0.140,0.089)	(0.175,0.101)	(0.193,0.101)
ESE	(0.106,0.078)	(0.180,0.099)	(0.193,0.101)	(0.131,0.087)	(0.184,0.100)	(0.197,0.102)
CP	(0.922,0.922)	(0.870,0.272)	(0.860,0.156)	(0.936,0.940)	(0.852,0.286)	(0.842,0.184)
	Case 2, $n = 200$
Bias	(0.001,0.012)	(−0.141,−0.260)	(−0.166,−0.308)	(0.010,0.014)	(−0.135,−0.248)	(−0.160,−0.292)
SSD	(0.079,0.055)	(0.127,0.073)	(0.135,0.073)	(0.092,0.066)	(0.123,0.073)	(0.133,0.072)
ESE	(0.073,0.054)	(0.124,0.069)	(0.136,0.070)	(0.089,0.061)	(0.129,0.071)	(0.138,0.070)
CP	(0.932,0.932)	(0.784,0.066)	(0.762,0.018)	(0.938,0.916)	(0.818,0.080)	(0.774,0.018)

Open in a new tab

5. Application

In this section, we used the proposed semiparametric approach to analyze the incidence of serious diseases for elder people in China based on the datasets of the Chinese Longitudinal Healthy Longevity Survey (CLHLS) in the period 1998 to 2014 (Zeng et al., 2017). The CLHLS was conducted by the Center for Healthy Aging and Development Studies (CHADS) of the National School of Development at Peking University and the Chinese Center for Disease Control and Prevention (CDC), starting in 1998 with 6 follow-up waves in 2000, 2002, 2005, 2008, 2011 and 2014. Aiming to provide a better understanding of the determinants of health and longevity, the CLHLS interviewed a large number of elder people in the 22 provinces of China, who were at least 65 years old at the interviews, and collected the information about their medical history, socioeconomic status, lifestyles, family and demographic profile.

In this study, for the $i th$ elder person, we took the number of months from the date of the first survey to the date of the $j th$ follow-up wave of survey to be $T_{i j}$ for $j = 1, \dots, K_{i}$ , where $K_{i} \leq 6$ representing the number of follow-up surveys. $τ = 197$ is the longest follow-up time possibly occurred in this study. Denote the incidence of serious diseases for this subject before the $j th$ follow-up survey to be $N (T_{i j})$ , the incidence of serious diseases from the $j th$ follow-up survey to death to be $\tilde{N} (T_{i j})$ , the terminal event time due to death to be $U_{i}$ , and the censoring time due to loss-of-connection to be $C_{i}$ . Then $Y_{i} = U_{i} \land C_{i}$ is the follow-up time, and $Δ_{i} = 1_{{U_{i} \leq C_{i}}}$ the indicator of the observation of death.

We focused on the difference of the incidence of serious diseases between elders living in urban and rural. For this analysis, we considered 5 covariates that include three demographic variables: residence status ( $Z_{1} = 1$ for urban and $Z_{1} = 0$ for rural), age ( $Z_{2}$ ), and gender ( $Z_{3} = 1$ for male and $Z_{3} = 0$ for female); and two clinical variables: indicator of hypertension ( $Z_{4} = 1$ for systolic blood pressure ≥ 140 mmHg and $Z_{4} = 0$ for others), and peak lung flow ( $Z_{5}$ ) at the first interview. We chose the individuals who had at least one follow-up survey. Hence a total of 4831 individuals interviewed in both 1998 and 2000 were selected for analysis. After removing 1099 individuals with missing or erroneous records, and 1160 individuals who had lived in both areas during the study period, we finally included 2572 individuals in the analysis, among which 73.7% had the terminal event, death. Table 2 shows the number of elders with different categories of age, gender, blood pressure, and peak lung flow stratified by urban and rural, respectively. In this table, the p-values of $𝒳^{2}$ tests reveal that the age of elders living in urban is different from elders living in rural at significance level 0.01; and the gender, the hypertension, and the degree of peak lung flow are not different between elders living in urban and rural at significance level 0.01.

Table 2.

The number of participants with different types of covariates for different residence status.

	Total	Age				Gender
	–	≤79	80-89	90-99	≥100	Male	Female
Urban	1152	15	636	381	120	507	645
Rural	1420	13	689	406	312	590	830
Total	2572	28	1325	787	432	1097	1475
p-value	–	< 0.001^***				0.224

	Systolic Pressure		Peak Lung Flow
	≥140	≤139	≤99	100-199	200-299	300-399	≥400
Urban	820	332	373	484	232	53	10
Rural	1005	415	423	467	267	71	12
Total	1825	747	796	1131	499	124	22
p-value	0.856		0.406

Open in a new tab

^***

represents significance level of 0.01.

Although the unit of peak lung flow ( $Z_{5}$ ) was not specified in the dataset, it was clinically important because people with larger peak lung flow value generally have higher functional cardiorespiratory system capacity. We standardized the covariates $Z_{2}$ and $Z_{5}$ to put them on the same scale before the analysis. We considered the I-spline sieve estimation for $Λ (\cdot)$ with order $d = 4$ and seven internal knots located at $t_{d + 1} = τ ∕ 8, \dots t_{d + 7} = 7 τ ∕ 8$ for the I-spline basis functions. We chose the initial value $α = 1$ and $β = 0$ in the BFGS algorithm. Similar to Section 4, we used MLS for comparison and obtained the bootstrap standard error of the proposed estimates of the regression parameters based on 100 bootstrap samples.

In Figure 3, the solid line represents the estimate of $Λ$ under our backward model. Table 3 summarizes the inferences on the covariates effects on the incidence of serious diseases under our reversed mean model and the MLS. Compared to the MLS method, for which only $Z_{1}$ showed a significant positive effect at the 0.01 level, our proposed model seemed to have a better power to pick out more statistically significant covariates. $Z_{1}$ is significant at the 0.05 level; $Z_{3}$ and $Z_{5}$ are marginally significant at the 0.1 level. Specifically, $Z_{1}$ has positive effect on the incidence of serious diseases, which may reflect the fact that people living in urban have a better opportunity to access advanced healthcare services than people living in rural that allow then to have more serious diseases identified. That $Z_{3}$ has negative effect on the incidence of serious diseases may be due to the longer lifetime in females. The positive effect of $Z_{5}$ is reasonable because of the higher functional cardiorespiratory system capacity for people with larger peak lung flow.

Table 3.

Inference results for the CLHLS data.

	Reversed Mean Model					MLS
	$Z_{1}$	$Z_{2}$	$Z_{3}$	$Z_{4}$	$Z_{5}$	$Z_{1}$	$Z_{2}$	$Z_{3}$	$Z_{4}$	$Z_{5}$
Estimates	0.246	−0.061	−0.175	−0.065	0.106	0.284	0.068	−0.073	0.106	0.040
ESE	0.100	0.063	0.101	0.122	0.056	0.077	0.044	0.076	0.083	0.039
p-value	0.014^**	0.333	0.084^*	0.595	0.059^*	0.001^***	0.130	0.330	0.200	0.310

Open in a new tab

represents a significant inference at level of 0.1

^**

represents a significant inference at level of 0.05

^***

represents a significant inference at level of 0.01.

6. Concluding remarks

For analyzing complex panel count data with an informative terminal event, we proposed a reversed mean model to depict its explicit relationship with recurrent events. For estimating unknown parameters of the proposed model, we developed a two-stage spline-based sieve estimation procedure to reduce the computation burden. Overcoming the theoretical challenges from the estimator having the overall convergence rate slower than the standard rate, we established the joint asymptotic normality for a functional of the estimator, and further concluded that the finite-dimensional estimator still achieves the standard convergence rate and is asymptotically normal.

Note that the proposed estimation procedure is robust in the sense that the stochastic mechanism of the recurrent event process is completely unspecified. When the underlying counting process is a Poisson-type process, we can use the maximum likelihood approach to improve the estimation efficiency. Since the likelihood function in this situation is much more complicated, extra efforts are needed to study the theoretical properties, which are currently under investigation. Though Cox model (3) and Breslow-induced estimator ${\hat{F}}_{n} (t ∣ Z)$ were adopted in our implementation for stage 1 due to their popularity and good asymptotic properties, they are not the only choice. Indeed, our theories only require that the estimator of the conditional distribution function of the terminal event time can be asymptotically represented by the sum of a series of i.i.d. terms, such as the representation in Lemma 1. Then the asymptotic properties for ${\hat{Λ}}_{n}$ and ${\hat{β}}_{n}$ in Theorems 3.1-3.3 still hold.

The proposed method focuses on modeling the data with some conditions that the observation times are independent of the recurrent events and the covariates are time-independent. Though these conditions were commonly adopted in analysis for panel count data, the use of the proposed method is somehow restricted in view of real-world applications. A further direction is to consider the informative observation times and time-dependent covariates for analysis of panel count data with an informative terminal event.

Supplementary Material

Suppl

NIHMS2018374-supplement-Suppl.pdf^{(175.4KB, pdf)}

Acknowledgments

The authors would like to thank the editor, the associate editor, and the referee for their valuable comments and suggestions.

Funding

This work is supported in part by the Research Grant Council of Hong Kong (15301218, 15303319) and the CAS AMSS-PolyU Joint Laboratory of Applied Mathematics for Xingqiu Zhao, the Natural Science Foundation of China (12171374) for Li Liu, and NIH/NIGMS (2 U54 GM115458-06) for Ying Zhang.

Appendix: Proofs of main results

A.1. Proof of Theorem 3.1

Proof. (i) We show that ( $β_{0}$ , $Λ_{0}$ ) is the unique minimum of $𝒫_{m} (β, Λ, F_{0}; X)$ . After some algebraic calculations, we have

𝒫 m (β, Λ, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) = 𝒫 [\sum_{j = 1}^{K} {e^{β_{0}^{T} Z} Δ Λ_{0, j} (U) - e^{β^{T} Z} Δ Λ_{j} (U)}^{2}] \geq 0 .

It follows that $𝒫_{m} (β, Λ, F_{0}; X) \geq 𝒫_{m} (β_{0}, Λ_{0}, F_{0}; X)$ , and $𝒫_{m} (β, Λ, F_{0}; X) = 𝒫_{m} (β_{0}, Λ_{0}, F_{0}; X)$ if and only if $Δ Λ (s_{1}, s_{2}) \exp (β^{T} z) = Δ Λ_{0} (s_{1}, s_{2}) \exp (β_{0}^{T} z)$ a.e. with respect to $μ_{1}$ . This implies that $(Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2})) \exp (β^{T} z) = Δ Λ_{0} (s_{1}, s_{2}) (\exp (β_{0}^{T} z - \exp (β^{T} z))$ a.e. with respect to $μ_{1}$ . By Condition (C5), $μ_{2} \times F_{Z}$ is absolutely continuous with respect to $μ_{1}$ . Using Fubini’s theorem, we obtain

\int {a (s_{1}, s_{2}) (Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}))} d μ_{2} \int {b (z) e^{β^{T} z}} d F_{Z} = \int {a (s_{1}, s_{2}) Δ Λ_{0} (s_{1}, s_{2})} d μ_{2} \int {b (z) (e^{β_{0}^{T} z} - e^{β^{T} z})} d F_{Z},

for all $μ_{2}$ measurable function $a (s_{1}, s_{2})$ and $F_{Z}$ measurable function $b (z)$ . Taking $a (s_{1}, s_{2}) = (Δ Λ (s_{1}, s_{2}) - Δ_{0} Λ (s_{1}, s_{2})) 1_{A}$ and $b (z) = \exp (β^{T} z) 1_{B}$ for $A \in ℬ_{[0, τ]}^{2}$ and $B \in ℬ^{p}$ , we have

\int_{A} {(Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}))}^{2} d μ_{2} \int_{B} e^{2 β^{T} z} d F_{Z} = \int_{A} {(Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2})) Δ Λ_{0} (s_{1}, s_{2})} d μ_{2} \int_{B} {(e^{β_{0}^{T} z} - e^{β^{T} z}) e^{β^{T} z}} d F_{Z} .

Similarly, $a (s_{1}, s_{2}) = (Δ Λ_{0} (s_{1}, s_{2}) 1_{A}$ and $b (z) = (\exp (β_{0}^{T} z)) 1_{B}$ yield that

\int_{A} {Δ Λ_{0} (s_{1}, s_{2}) (Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}))} d μ_{2} \int_{B} {(e^{β_{0}^{T} z} - e^{β^{T} z}) e^{β^{T} z}} d F_{Z} = \int_{A} {(Δ Λ_{0} (s_{1}, s_{2}))}^{2} d μ_{2} \int_{B} {(e^{β_{0}^{T} z} - e^{β^{T} z})}^{2} d F_{Z} .

Then for all the product sets $A \times B$ , we obtain

\int_{A \times B} {(Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}))}^{2} e^{2 β^{T} z} d μ_{2} \times F_{Z} = \int_{A \times B} {(Δ Λ_{0} (s_{1}, s_{2}))}^{2} {(e^{β_{0}^{T} z} - e^{β^{T} z})}^{2} d μ_{2} \times F_{Z} .

That is ${(Δ Λ (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}))}^{2} \exp (2 β^{T} z) = {(Δ Λ_{0} (s_{1}, s_{2}))}^{2} {(\exp (β_{0}^{T} z) - \exp (β^{T} z))}^{2}$ a.e. with respect to $μ_{2} \times F_{Z}$ , which is equivalent to ${(Δ Λ (s_{1}, s_{2}) ∕ Δ Λ_{0} (s_{1}, s_{2}) - 1)}^{2} = {(\exp ({(β_{0} - β)}^{T} z - 1))}^{2}$ a.e. with respect to $μ_{2} \times F_{Z}$ . Intergrading the above equality with respect to $μ_{2}$ , we obtain that the right hand side is a constant a.e. with respect to $F_{Z}$ . Then Condition (C6) implies that $β = β_{0}$ and $Δ Λ (s_{1}, s_{2}) = Δ Λ_{0} (s_{1}, s_{2})$ a.e. with respect to $μ_{2}$ .

(ii) To prove the consistency, we first show that ${\hat{λ}}_{n}$ is uniformly bounded. By Lemma A1 of Lu, Zhang and Huang (2007), under Condition (C8), there is a $Λ_{n}^{*} \in Φ_{n}$ such that ${‖ Λ_{n}^{*} - Λ_{0} ‖}_{\infty} = O (n^{- ν r})$ . Consider a direction vector $h_{1, n} \in ℛ$ with ${‖ h_{1, n} ‖}_{2}^{2} = O (n^{- a})$ for a constant $0 < a < 1 ∕ 2$ and a bounded positive monotone nondecreasing direction function $h_{2, n} \in Φ_{n}$ with ${‖ Δ h_{2, n} (s_{1}, s_{2}) ‖}_{L_{2} (μ_{2})}^{2} = O (n^{- ν r} + n^{- (1 - ν ∕ 2)})$ , where $Δ h_{2, n} (s_{1}, s_{2}) = h_{2, n} (s_{2}) - h_{2, n} (s_{1})$ . Then for any constant $α > 0$ , we have ${‖ Δ Λ_{n}^{*} (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}) + α Δ h_{2, n} (s_{1}, s_{2}) ‖}_{L_{2} (μ_{2})}^{2} = O (n^{- ν r} + n^{- (1 - ν) ∕ 2})$ and $\inf_{s_{2} - s_{1} \geq M_{4}} (Δ Λ_{n}^{*} (s_{1}, s_{2}) - Δ Λ_{0} (s_{1}, s_{2}) + α Δ h_{2, n} (s_{1}, s_{2})) > 0$ for sufficiently large $n$ with $M_{4}$ defined in Condition (C9). By some direct calculations,

\dot{m} (α) = \partial m (β_{0} + α h_{1, n}, Λ_{n}^{*} + α h_{2, n}, {\hat{F}}_{n}; X) ∕ \partial α = - 2 ψ (β_{0} + α h_{1, n}, Λ_{n}^{*} + α h_{2, n}, {\hat{F}}_{n}; X) [h_{1, n}, h_{2, n}],

where

ψ (β, Λ, F; X) [h_{1}, h_{2}] = \sum_{j = 1}^{K} [Δ {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (Y)} e^{β^{T} Z} {Δ Λ_{j} (Y) h_{1}^{T} Z + Δ h_{2, j} (Y)} + \frac{1 - Δ}{1 - F (Y ∣ Z)} \int_{Y}^{\infty} {Δ N_{j} - e^{β^{T} Z} Δ Λ_{j} (u)} e^{β^{T} Z} {Δ Λ_{j} (u) h_{1}^{T} Z + Δ h_{2, j} (u)} d F (u ∣ Z)] .

(6)

Note that we obtain ( ${\hat{β}}_{n}$ , ${\hat{Λ}}_{n}$ ) by minimizing $P_{n} m (β, Λ, {\hat{F}}_{n}; X)$ under the constraint that $(β, Λ) \in ℛ \times Φ_{n}$ . Then we can verify $d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) = o_{p} (1)$ by showing that $P_{n} \dot{m} (α) > 0$ and $P_{n} \dot{m} (- α) < 0$ for any constant $α > 0$ . Similar to Lemma 3, we have

{ψ (β, Λ, F; X) [h_{1}, h_{2}] : (β, Λ) \in ℛ \times Φ, F \in ℱ, (h_{1}, h_{2}) \in ℛ \times Φ, Λ and h_{2} are uniformly bounded}

is Donsker. Therefore, we obtain $(P_{n} - 𝒫) \dot{m} (α) = O_{p} (1 ∕ \sqrt{n})$ . Furthermore, Lemma 2 implies that

𝒫 \dot{m} (α) ≳ - 2 𝒫 ψ (β_{0} + α h_{1, n}, Λ_{n}^{*} + α h_{2, n}, F_{0}; X) [h_{1, n}, h_{2, n}] - d_{2} ({\hat{F}}_{n}, F_{0}) = - 2 𝒫 [\sum_{j = 1}^{K} {(e^{β_{0}^{T} Z} Δ Λ_{0, j} (U) - e^{{(β_{0} + α h_{1, n})}^{T} Z} (Δ Λ_{n, j}^{*} (U) + α Δ h_{2, n, j} (U))) \times e^{{(β_{0} + α h_{1, n})}^{T} Z} ((Δ Λ_{n, j}^{*} (U) + α Δ h_{2, n, j} (U)) h_{1, n}^{T} Z + Δ h_{2, n, j} (U))}] - d_{2} ({\hat{F}}_{n}, F_{0}) = 2 𝒫 [\sum_{j = 1}^{K} {(b (1) - b (0)) e^{{(β_{0} + α h_{1, n})}^{T} Z} ((Δ Λ_{n, j}^{*} (U) + α Δ h_{2, n, j} (U)) h_{1, n}^{T} Z + Δ h_{2, n, j} (U))}] - d_{2} ({\hat{F}}_{n}, F_{0}),

where $b (ξ) = \exp ({(β_{0} + ξ α h_{1, n})}^{T} Z) (Δ Λ_{0, j} (U) + ξ (Δ Λ_{n, j}^{*} (U) - Δ Λ_{0, j} (U) + α Δ h_{2, n, j} (U)))$ , and the notation $c_{1} ≳ c_{2}$ means that $c_{1} \geq c c_{2}$ for a constant $c$ . By the mean value theorem, there exists a $0 < ξ^{*} < 1$ such that $b (1) - b (0) = b^{'} (ξ^{*})$ , where

b^{'} (ξ) = e^{{(β_{0} + ξ α h_{1, n})}^{T} Z} (Δ Λ_{n, j}^{*} (U) - Δ Λ_{0, j} (U) + α Δ h_{2, n, j} (U)) + e^{{(β_{0} + ξ α h_{1, n})}^{T} Z} α h_{1, n}^{T} Z (Δ Λ_{0, j} (U) + ξ (Δ Λ_{n, j}^{*} (U) - Δ Λ_{0, j} (U) + α Δ h_{2, n, j} (U))) .

Since $Λ_{0}$ , $Λ_{n}^{*}$ , and $h_{2, n}$ are bound on $[0, τ]$ , and $β_{0}$ and $h_{1, n}$ are bounded vectors, it follows that $b^{'} (ξ^{*}) ≳ (Δ Λ_{n, j}^{*} (U) - Δ Λ_{0, j} (U) + α Δ h_{2, n, j} (U)) + h_{1, n}^{T} Z$ . Thus,

𝒫 \dot{m} (α) \geq c 𝒫 [\sum_{j = 1}^{K} {((Δ Λ_{n, j}^{*} (U) - Δ Λ_{0, j} (U) + α Δ h_{2, n, j} (U)) + h_{1, n}^{T} Z) (h_{1}^{T} Z + Δ h_{2, j} (U))}] - d_{2} ({\hat{F}}_{n}, F_{0}) \geq c O (n^{- ν r} + n^{- (1 - ν) ∕ 2} + n^{- a}) - d_{2} ({\hat{F}}_{n}, F_{0}),

for a constant $c$ . Note that $n^{- ν r} + n^{- (1 - ν ∕ 2)} \geq n^{- r ∕ (1 + 2 r)} > (1 ∕ \sqrt{n}$ , $0 < a < 1 ∕ 2$ , and $d_{2} ({\hat{F}}_{n}, {\hat{F}}_{0}) = O_{p} (1 ∕ \sqrt{n})$ . This yields that $P_{n} \dot{m} (α) \geq (P_{n} - 𝒫) \dot{m} (α) + c O (n^{- ν r} + n^{- (1 - ν) ∕ 2} + n^{- a}) - d_{2} ({\hat{F}}_{n}, F_{0}) > 0$ with probability converging to one. We can similarly show that $P_{n} \dot{m} (- α) < 0$ except on an event with probability converging to zero. Therefore, for all $ε > 0$ , we have $P (d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) > ε) \to 0$ as $n \to \infty$ . It follows that for any $∊ > 0$ , there exists a measurable set $Ξ$ with $P (Ξ) \geq 1 - ∊$ such that ${\hat{Λ}}_{n} (s)$ is uniformly bounded for $s \in [0, τ]$ on $Ξ$ .

We restrict us on the measurable set $Ξ$ at the moment. By the Cauchy–Schwarz inequality, under Conditions (C1) and (C7), we have

𝒫 m (β, Λ, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) = E [\sum_{j = 1}^{K} {(e^{β^{T} Z} Δ Λ_{j} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (U))}^{2}] ≲ E [\sum_{j = 1}^{K} {(Δ Λ_{j} (U) - Δ Λ_{0, j} (U))}^{2}] + E [\sum_{j = 1}^{K} {(e^{β^{T} Z} - e^{β_{0}^{T} Z})}^{2}] + 2 {E [\sum_{j = 1}^{K} {(Δ Λ_{j} (U) - Δ Λ_{0, j} (U))}^{2}]}^{\frac{1}{2}} {E [\sum_{j = 1}^{K} {(e^{β^{T} Z} - e^{β_{0}^{T} Z})}^{2}]}^{\frac{1}{2}} ≲ E [\sum_{j = 1}^{K} {(Δ Λ_{j} (U) - Δ Λ_{0, j} (U))}^{2}] + E [{(e^{β^{T} Z} - e^{β_{0}^{T} Z})}^{2}] .

By the mean value theorem, there exists a $β_{ζ} \in ℛ$ such that $E [{(\exp (β^{T} Z) - \exp (β_{0}^{T} Z))}^{2}] = E [\exp (2 β_{ζ}^{T} Z) {Z^{T} (β - β_{0})}^{2}] ≲ {| | β - β_{0} | |}_{2}^{2}$ . It follows that

𝒫 m (β, Λ, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) ≲ {‖ β - β_{0} ‖}_{2}^{2} + d_{1}^{2} (Λ, Λ_{0}) = d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0})) .

Furthermore, Note that $𝒫_{m} (β, Λ, F_{0}; X) - 𝒫_{m} (β_{0}, Λ_{0}, F_{0}; X) \geq 0$ with equality if and only if $β = β_{0}$ and $Δ Λ (s_{1}, s_{2}) = Δ Λ_{0} (s_{1}, s_{2})$ a.e. with respect to $μ_{2}$ . Hence, for every $δ > 0$ , there exists an $ε > 0$ such that ${d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) \geq δ} \subset {𝒫_{m} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}; X) - 𝒫_{m} (β_{0}, Λ_{0}, F_{0}; X) > ε}$ . Note that

0 \leq 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) = 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}; X) - 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) + 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - P_{n} m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) + P_{n} m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - P_{n} m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) + P_{n} m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫 m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) + 𝒫 m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫 m (β_{0}, Λ_{n}^{*}, F_{0}; X) + 𝒫 m (β_{0}, Λ_{n}^{*}, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) .

(7)

According to Conditions (C2) and (C7), $0 \leq 𝒫 m (β_{0}, Λ_{n}^{*}, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) ≲ {‖ Λ_{n}^{*} - Λ_{0} ‖}_{\infty}^{2} = o (1)$ . The definition of $({\hat{β}}_{n}, {\hat{Λ}}_{n})$ yields that $P_{n} m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) \leq P_{n} m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X)$ . By Lemma 2 of the online Supplementary Material (Hu et al., 2023), we have $𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) = o_{p} (1)$ and $𝒫 m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫 m (β_{0}, Λ_{n}^{*}, F_{0}; X) = o_{p} (1)$ . By Lemma 3 of the online Supplementary Material (Hu et al., 2023), { $m (β, Λ, F; X) : β \in ℛ$ , $Λ \in Φ$ , $Λ$ is uniformly bounded, $F \in ℱ$ , $d_{2} (F, F_{0}) \leq δ$ } is Donsker, meaning that it is Glivenko-Cantelli. Noting that $d_{2} ({\hat{F}}_{n}, F_{0}) = O_{p} (n^{- 1 ∕ 2})$ , we have $(P_{n} - 𝒫) m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) = o_{p} (1)$ and $(P_{n} - 𝒫) m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) = o_{p} (1)$ . Combining them with (7), we have $0 \leq 𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}; X) - 𝒫 m (β_{0}, Λ_{0}, F_{0}; X) \leq o_{p} (1)$ . Therefore, ${𝒫 m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}; X) > 𝒫 m (β_{n}, Λ_{n}, F_{0}; X) + ε}$ goes into a null set as $n \to \infty$ . Then $({\hat{β}}_{n}, {\hat{Λ}}_{n}) \to (β_{0}, Λ_{0})$ almost uniformly, recalling that the relation holds on the measurable set $Ξ$ with $P (Ξ) \geq 1 - ∊$ . Thus, the almost sure convergence of ( ${\hat{β}}_{n}$ , ${\hat{Λ}}_{n}$ ) follows by Lemma 1.9.2 of van der Vaart and Wellner (1996). □

A.2. Proof of Theorem 3.2

Proof. We use Lemma 5 of the online Supplementary Material (Hu et al., 2023) to prove the rate of convergence.

First, by some direct calculations, we have

𝒫 (m (β_{0}, Λ_{0}, F; X) - m (β, Λ, F; X)) = - E [\sum_{j = 1}^{K} {(e^{β^{T} Z} Δ Λ_{j} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (U))}^{2}] + 𝒫 [\sum_{j = 1}^{K} \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {(e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u))}^{2} d F_{0} (u ∣ Z) - \sum_{j = 1}^{K} \frac{1 - Δ}{1 - F (Y ∣ Z)} \int_{Y}^{\infty} {(e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u))}^{2} d F (u ∣ Z)] .

The bound of $E [\sum_{j = 1}^{K} {\exp (β^{T} Z) Δ Λ_{j} (U) - \exp (β_{0}^{T} Z) Δ Λ_{0, j} (u)}^{2}]$ can be assessed by the arguments similar to those for the proof of Theorem 3.2 in Wellner and Zhang (2007). For $Λ \in Φ$ , $β \in R^{p}$ and $(S_{1}, S_{2}, Z) \sim μ_{1}$ , let $g (ξ) = \exp (β_{0}^{T} Z) Δ Λ_{ξ} (S_{1}, S_{2})$ , where $Δ Λ_{ξ} (S_{1}, S_{2}) = ξ Δ Λ (S_{1}, S_{2}) + (1 - ξ) Δ Λ_{0} (S_{1}, S_{2})$ and $β_{ξ} = ξ β + (1 - ξ) β_{0}$ with $ξ \in (0, 1)$ . Then we have $\exp (β^{T} Z) Δ Λ (S_{1}, S_{2}) - \exp (β_{0}^{T} Z) Δ Λ_{0} (S_{1}, S_{2}) = g (1) - g (0)$ . By the mean value theorem, there is a $ξ \in (0, 1)$ such that

g (1) - g (0) = g^{'} (ξ) = e^{β_{ξ}^{T} Z} [(Δ Λ (S_{1}, S_{2}) - Δ Λ_{0} (S_{1}, S_{2})) + Δ Λ_{ξ} (S_{1}, S_{2}) {(β - β_{0})}^{T} Z] = e^{β_{ξ}^{T} Z} [{1 + \frac{(Δ Λ (S_{1}, S_{2}) - Δ Λ_{0} (S_{1}, S_{2}))}{Δ Λ_{0} (S_{1}, S_{2}) ∕ ξ}} {(β - β_{0})}^{T} Z Δ Λ_{0} (S_{1}, S_{2}) + (Δ Λ (S_{1}, S_{2}) - Δ Λ_{0} (S_{1}, S_{2}))],

where $g^{'}$ is the derivative of $g$ . Setting $g_{1} = {(β - β_{0})}^{T} Z Δ Λ_{0} (S_{1}, S_{2})$ , $g_{2} = (Δ Λ (S_{1}, S_{2}) - Δ Λ_{0} (S_{1}, S_{2}))$ and $g_{3} = 1 + ξ (Δ Λ (S_{1}, S_{2}) - Δ Λ_{0} (S_{1}, S_{2})) ∕ Δ Λ_{0} (S_{1}, S_{2})$ , we have $g (1) - g (0) = \exp (β_{ξ}^{T} Z) (g_{1} g_{3} + g_{2})$ This yields that

E [\sum_{j = 1}^{K} {e^{β^{T} Z} Δ Λ_{j} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (U)}^{2}] = E_{μ_{1}} [{g (1) - g (0)}^{2}] ≳ E_{μ_{1}} [{(g_{1} g_{3} + g_{2})}^{2}] .

Similar to the proof of Theorem 3.2 in Wellner and Zhang (2007), Condition (C14) implies that $E_{μ_{1}}^{2} [g_{1} g_{2}] \leq (1 - η) E_{μ_{1}} [{(g_{1})}^{2}] E_{μ_{1}} [{(g_{2})}^{2}]$ . According to Lemma 8.8 of van der Vaart (2002), we have $E_{μ_{1}} [{(g_{1} g_{3} + g_{2})}^{2}] ≳ E_{μ_{1}} [{(g_{1})}^{2}] + E_{μ_{1}} [{(g_{2})}^{2}] ≳ d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0}))$ . Therefore,

𝒫 (m (β_{0}, Λ_{0}, F; X) - m (β, Λ, F; X)) ≲ - d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0})) + 𝒫 [\sum_{j = 1}^{K} \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)}^{2} d F_{0} (u ∣ Z) - \sum_{j = 1}^{K} \frac{1 - Δ}{1 - F (Y ∣ Z)} \int_{Y}^{\infty} {e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)}^{2} d F (u ∣ Z)] .

By Conditions (C1), (C2) and (C7), Cauchy–Schwarz inequality and Lemma 2 of the online Supplementary Material (Hu et al., 2023),

∣ 𝒫 [\sum_{j = 1}^{K} (1 - Δ) {\frac{\int_{Y}^{\infty} {(e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u))}^{2} d F_{0} (u ∣ Z)}{1 - F_{0} (Y ∣ Z)} - \frac{\int_{Y}^{\infty} {(e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u))}^{2} d F (u ∣ Z)}{1 - F (Y ∣ Z)}}] ∣ ≲ {𝒫 [\sum_{j = 1}^{K} {(e^{β^{T} Z} Δ Λ_{j} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (U))}^{2}] + 𝒫 [\sum_{j = 1}^{K} 2 ∣ e^{β^{T} Z} Δ Λ_{j} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (U) ∣ ∣ e^{β^{T} Z} Δ Λ_{j}^{'} (U) - e^{β_{0}^{T} Z} Δ Λ_{0, j}^{'} (U) ∣]} d_{2} (F, F_{0}) ≲ d_{3} ((β, Λ), (β_{0}, Λ_{0})) d_{2} (F, F_{0}) + d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0})) d_{2} (F, F_{0}) .

This yields that

𝒫 (m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) - m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X)) ≲ - d_{3}^{2} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) + d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) d_{2} ({\hat{F}}_{n}, F_{0}) + d_{3}^{2} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) d_{2} ({\hat{F}}_{n}, F_{0}) .

Second, we need to find a $ϕ_{n} (η)$ such that

E sup_{{(β, Λ) \in ℛ \times Φ_{n} : d_{3} ((β, Λ), (β_{0}, Λ_{0})) < η}} ∣ (P_{n} - 𝒫) (m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) ∣ ≲ \frac{ϕ_{n} (η)}{\sqrt{n}} .

By Lemma 4 of the online Supplementary Material (Hu et al., 2023), for sufficiently large $n$ , we have

\log N_{[]} (ε, ℳ_{η} ({\hat{F}}_{n}), {‖ \cdot ‖}_{P, B}) ≲ q_{n} \log (η ∕ ε),

where $ℳ ({\hat{F}}_{n}) = {m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) : β \in ℛ, Λ \in Φ_{n}, d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0})) \leq η^{2}}$ . For $(β, Λ) \in ℛ \times Φ_{n}$ satisfying $d_{3} ((β, Λ), (β_{0}, Λ_{0})) < η$ , similar to the proof of Lemma 4 of the online Supplementary Material (Hu et al., 2023), we have

∣ m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) ∣ ≲ \sum_{j = 1}^{K} [(Δ N_{j} + 1) {Δ ∣ e^{β^{T} Z} Δ Λ_{j} (Y) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y) ∣ + \frac{1 - Δ}{1 - {\hat{F}}_{n} (Y ∣ Z)} \sum_{j = 1}^{K} \int_{Y}^{\infty} ∣ e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u) ∣ d {\hat{F}}_{n} (u ∣ Z)}] .

Furthermore, since $\exp (β^{T} Z)$ , $Δ Λ_{j}$ , $\exp (β_{0}^{T} Z)$ and $Δ Λ_{0, j}$ are bounded and $d_{2} ({\hat{F}}_{n}, F_{0}) = o_{p} (1)$ , we have $\exp (∣ m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0} {\hat{F}}_{n}; X) ∣) ≲ \exp (C N (T_{K}))$ . The above two inequalities yield that

𝒫 [e^{∣ m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) ∣} {∣ m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) ∣}^{2}] ≲ 𝒫 [\sum_{j = 1}^{K} Δ {e^{β^{T} Z} Δ Λ_{j} (Y) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y)}^{2} + \sum_{j = 1}^{K} \frac{1 - Δ}{1 - {\hat{F}}_{n} (Y ∣ Z)} \int_{Y}^{\infty} {e^{β^{T} Z} Δ Λ_{j} (u) - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)}^{2} d {\hat{F}}_{n} (u ∣ Z)] ≲ d_{3}^{2} ((β, Λ), (β_{0}, Λ_{0})) + d_{3} ((β, Λ), (β_{0}, Λ_{0})) d_{2} ({\hat{F}}_{n}, F_{0}) .

That means for sufficiently large $n$ , ${‖ m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0} {\hat{F}}_{n}; X) ‖}_{P, B}^{2} ≲ η^{2}$ . By Lemma 3.4.3 of van der Vaart and Wellner (1996),

E {‖ n^{1 ∕ 2} (P_{n} - 𝒫) ‖}_{ℳ_{n} ({\hat{F}}_{n})} ≲ J_{[]} (η, ℳ_{η} ({\hat{F}}_{n}), {‖ \cdot ‖}_{P, B}) {1 + J_{[]} (η, ℳ_{n} ({\hat{F}}_{n}), {‖ \cdot ‖}_{P, B}) ∕ (η^{2} n^{1 ∕ 2})},

where $J_{[]} (η, ℳ_{η} ({\hat{F}}_{n}), {‖ \cdot ‖}_{P, B}) ≔ \int_{0}^{η} {1 + \log N_{[]} (ε, ℳ_{n} ({\hat{F}}_{n}), {‖ \cdot ‖}_{P, B})}^{1 ∕ 2} d ε ≲ q_{n}^{1 ∕ 2} η$ . It follows that

E sup_{{(β, Λ) \in ℛ \times Φ_{n} : d_{3} ((β, Λ), (β_{0}, Λ_{0})) < η}} ∣ (P_{n} - 𝒫) (m (β, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) ∣ ≲ \frac{\sqrt{n q_{n}} η + q_{n}}{n} .

Setting $ϕ_{n} (η) = \sqrt{q_{n}} η + q_{n} ∕ \sqrt{n}$ such that $ϕ_{n} (η) ∕ η$ decreases about $η$ , for a sequence $r_{n} = O (n^{a})$ , we have $r_{n}^{2} ϕ (1 ∕ r_{n}) = \sqrt{q_{n}} r_{n} + q_{n} r_{n}^{2} ∕ \sqrt{n}$ . Note that $q_{n} = O (n^{ν})$ , $0 < ν < 1 ∕ 2$ . This yields that $r_{n}^{2} ϕ (1 ∕ r_{n}) = O (n^{a + ν ∕ 2} + n^{2 a + ν - 1 ∕ 2})$ . Since $a \leq (1 - ν) ∕ 2$ ensures $r_{n}^{2} ϕ (1 ∕ r_{n}) ≲ \sqrt{n}$ , we choose $r_{n} = O (n^{(1 - ν) ∕ 2})$ .

Finally, we determine $ν$ satisfying $P_{n} (m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) \leq O_{p} (1 ∕ r_{n}^{2})$ . Note that for $Λ_{0} \in ℋ_{r}$ , there is a $Λ_{n}^{*} \in Φ_{n}$ such that ${‖ Λ_{n}^{*} - Λ_{0} ‖}_{\infty} = O (n^{- ν r})$ . By the definition of $({\hat{β}}_{n}, {\hat{Λ}}_{n})$ and $0 \leq 𝒫_{m} (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫_{m} (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) ≲ {‖ Λ_{n}^{*} - Λ_{0} ‖}_{\infty}^{2}$ , we have

P_{n} (m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) = P_{n} m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - P_{n} (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) + P_{n} m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫 m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) + 𝒫 m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - 𝒫 m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) + 𝒫 m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) - P_{n} m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) \leq n^{- ν r + ε} (P_{n} - 𝒫) [{m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)} ∕ n^{- ν r + ε}] + O_{p} (n^{- 2 ν r}) .

Set $\tilde{ℳ} ({\hat{F}}_{n}) = {m (β_{0}, Λ, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X) : Λ \in Φ_{n}, {‖ Λ - Λ_{0} ‖}_{\infty} \leq O (n^{- ν r})}$ . According to Lemma 4 of the online Supplementary Material (Hu et al., 2023), $\tilde{ℳ} ({\hat{F}}_{n})$ is Donsker. After some algebraic calculations, for any $f \in \tilde{ℳ} ({\hat{F}}_{n})$ , we have $P {(f ∕ n^{- ν r + ε})}^{2} \to 0$ as $n \to 0$ for any $ε > 0$ . Using Corollary 2.3.12 of van der Vaart and Wellner (1996), we have $(P_{n} - 𝒫) (m (β_{0}, Λ_{n}^{*}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) = o_{p} (n^{- ν r + ε - 1 ∕ 2})$ . When $0 < ε \leq 1 ∕ 2 - r ν$ , we have $P_{n} (m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) \leq O_{p} (n^{- 2 ν r})$ , meaning that $ν \geq 1 ∕ (1 + 2 r)$ ensures $P_{n} (m ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) - m (β_{0}, Λ_{0}, {\hat{F}}_{n}; X)) \leq O_{p} (r_{n}^{- 2})$ . Thus, taking $ν = 1 ∕ (1 + 2 r)$ , we have $d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) = O_{p} (n^{- r ∕ (1 + 2 r)})$ . □

A.3. Proof of Theorem 3.3

Proof. (i) Define $\tilde{ℋ} = {(h_{1}, h_{2}) : h_{1} \in ℛ, h_{2} \in ℋ_{r}}$ . For $(h_{1}, h_{2}) \in \tilde{ℋ}$ , let $Q_{n} (β, Λ, F) [h_{1}, h_{2}] = P_{n} ψ (β, Λ, F; X) [h_{1}, h_{2}]$ and $Q (β, Λ, F) [h_{1}, h_{2}] = 𝒫 ψ (β, Λ, F; X) [h_{1}, h_{2}]$ with $ψ (β, Λ, F; X) [h_{1}, h_{2}]$ given in (6). Following Theorem 1 of Zhao and Zhang (2017), it suffices to verify the following conditions (B1)-(B5) to prove this theorem.

(B1) $Q (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] = 0$ and $Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] = o_{p} (n^{- 1 ∕ 2})$ .

(B2) $\sqrt{n} (Q_{n} - Q) ({\hat{β}}_{n}, {\hat{Λ}}_{n}, \hat{F}) [h_{1}, h_{2}] - \sqrt{n} (Q_{n} - Q) (β_{0}, Λ_{0}, F_{0}) = [h_{1}, h_{2}] = o_{p} (1)$ .

(B3) $Q (β, Λ, F) [h_{1}, h_{2}]$ is Fréchet-differentiable with respect to ( $β$ , $Λ$ ) at ( $β_{0}$ , $Λ_{0}$ , $F_{0}$ ) with a continuous derivative ${\dot{Q}}_{1, β_{0}, Λ_{0}, F_{0}} [h_{1}, h_{2}]$ ; $Q (β, Λ, F) [h_{1}, h_{2}]$ is Fréchet-differentiable with respect to $F$ at ( $β_{0}$ , $Λ_{0}$ , $F_{0}$ ) with a continuous derivative ${\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} [h_{1}, h_{2}]$ .

(B4) $Q ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] - Q (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] - {\dot{Q}}_{1, β_{0}, Λ_{0}, F_{0}} ({\hat{β}}_{n} - β_{0}, {\hat{Λ}}_{n} - Λ_{0}) [h_{1}, h_{2}] - {\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}] = o_{p} (n^{- 1 ∕ 2})$ .

(B5) $\sqrt{n} Q_{n} (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] + \sqrt{n} {\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}]$ converges in distribution to a tight Gaussian progress.

For (B1), under model (1), we have $Q (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] = 0$ . By the definition of ( ${\hat{β}}_{n}$ , ${\hat{Λ}}_{n}$ ), for all $(h_{1}, h_{2}) \in ℛ \times Φ_{n}$ , we obtain $\lim_{η \to 0} P_{n} m ({\hat{β}}_{n} + η h_{1}, {\hat{Λ}}_{n} + η h_{2} . {\hat{F}}_{n}; X) ∕ η = 0$ . This implies that $Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] = 0$ for all $(h_{1}, h_{2}) \in ℛ \times Φ_{n}$ . By Lemma A1 of Lu, Zhang and Huang (2007) and the properties of spline functions, for any $h_{2} \in ℋ_{r}$ , we can find an $h_{2, n} \in Φ_{n}$ satisfying ${‖ h_{2, n} - h_{2} ‖}_{\infty} = O (n^{- r ∕ (1 + 2 r)})$ and ${‖ h_{2, n}^{'} - h_{2}^{'} ‖}_{\infty} = O (1)$ , where $h_{2}^{'}$ is the derivative of $h_{2}$ . Thus, for each $h_{2} \in ℋ_{r}$ , we need to prove $Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] = P_{n} ψ ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] = o_{p} (n^{- 1 ∕ 2})$ to verify $Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] = o_{p} (n^{- 1 ∕ 2})$ . Note that

Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] = {Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] - Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}) [0, h_{2} - h_{2, n}]} + {Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}) [0, h_{2} - h_{2, n}] - Q_{n} (β_{0}, Λ_{0}, F_{0}) [0, h_{2} - h_{2, n}]} + Q_{n} (β_{0}, Λ_{0}, F_{0}) [0, h_{2} - h_{2, n}] ≔ I_{1 n} + I_{2 n} + I_{3 n} .

For the first term $I_{1 n}$ , Lemma 2 of the online Supplementary Material (Hu et al., 2023) yields that

𝒫 ∣ I_{1 n} ∣ = 𝒫 ∣ Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] - Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}) [0, h_{2} - h_{2, n}] ∣ \leq 𝒫 [\sum_{j = 1}^{K} (1 - Δ) ∣ \frac{\int_{Y}^{\infty} {Δ N_{j} - e^{{\hat{β}}_{n}^{T} Z} Δ {\hat{Λ}}_{n, j} (u)} (Δ h_{2, j} (u) - Δ h_{2, n, j} (u)) d {\hat{F}}_{n} (u ∣ Z)}{1 - {\hat{F}}_{n} (Y ∣ Z)} - \frac{\int_{Y}^{\infty} {Δ N_{j} - e^{{\hat{β}}_{n}^{T} Z} Δ {\hat{Λ}}_{n, j} (u)} (Δ h_{2, j} (u) - Δ h_{2, n, j} (u)) d F_{0} (u ∣ Z)}{1 - F_{0} (Y ∣ Z)} ∣ e^{{\hat{β}}_{n}^{T} Z}] ≲ d_{2} ({\hat{F}}_{n}, F_{0}) ({‖ h_{2} - h_{2, n} ‖}_{\infty} + {‖ h_{2}^{'} - h_{2, n}^{'} ‖}_{\infty}) = o_{p} (n^{- 1 ∕ 2}) .

For the second term $I_{2 n}$ , after some algebraic calculations, we have

𝒫 ∣ I_{2 n} ∣ = 𝒫 ∣ Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, F_{0}) [0, h_{2} - h_{2, n}] - Q_{n} (β_{0}, Λ_{0}, F_{0}) [0, h_{2} - h_{2, n}] ∣ \leq {‖ h_{2} - h_{2, n} ‖}_{\infty} 𝒫 [\sum_{k = 1}^{K} Δ ∣ {Δ N_{j} - e^{{\hat{β}}_{n}^{T} Z} Δ {\hat{Λ}}_{n_{j}} (Y)} e^{{\hat{β}}_{n}^{T} Z} - {Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y)} e^{β_{0}^{T} Z} ∣ + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} ∣ {Δ N_{j} - e^{{\hat{β}}_{n}^{T} Z} Δ {\hat{Λ}}_{n, j} (u)} e^{{\hat{β}}_{n}^{T} Z} - {Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)} e^{β_{0}^{T} Z} ∣ d F_{0} (u ∣ Z)] ≲ {{‖ {\hat{β}}_{n} - β_{0} ‖}_{2} + d_{3} ((2 {\hat{β}}_{n}, {\hat{Λ}}_{n}), (2 β_{0}, Λ_{0}))} {‖ h_{2} - h_{2, n} ‖}_{\infty} = o_{p} (n^{- 1 ∕ 2}) .

For the third term $I_{3 n}$ , note that $Q (β_{0}, Λ_{0}, F_{0}; X) [0, h_{2} - h_{2, n}] = 0$ . By the independence of $X_{i}$ and $X_{j}$ , it follows that

𝒫 I_{3 n}^{2} = n^{- 1} 𝒫 (\frac{1}{n} \sum_{i = 1}^{n} ψ^{2} (β_{0}, Λ_{0}, F_{0}; X_{i}) [0, h_{2} - h_{2, n}]) ≲ n^{- 1} 𝒫 [\sum_{j = 1}^{K} {Δ ∣ Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y) ∣ e^{β_{0}^{T} Z} + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {∣ Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y) ∣ e^{β_{0}^{T} Z} d F_{0} (u ∣ Z)}]}^{2} {‖ h_{2} - h_{2, n} ‖}_{\infty}^{2} ≲ n^{- 1} {‖ h_{2} - h_{2, n} ‖}_{\infty}^{2} .

Then we have $Q_{n} ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [0, h_{2} - h_{2, n}] = o_{p} (n^{- 1 ∕ 2})$ , and (B1) holds.

For (B2), after some algebraic calculations, we have

\sqrt{n} (Q_{n} - Q) ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] - \sqrt{n} (Q_{n} - Q) (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] = \sqrt{n} (P_{n} - 𝒫) (ψ ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) [h_{1}, h_{2}] - ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}]) .

For each fixed bounded $(h_{1}, h_{2}) \in \tilde{ℋ}$ , set

{\bar{Ψ}}_{η} (h_{1}, h_{2}) = {ψ (β, Λ, F; X) [h_{1}, h_{2}] - ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}] : β \in ℛ, Λ \in Φ_{n}, F \in ℱ d_{3} ((β, Λ), (β_{0}, Λ_{0})) < η, d_{2} (F, F_{0}) < η, Λ is uniformly bounded} .

Similar to Lemma 3 of the online Supplementary Material (Hu et al., 2023), it follows that ${\bar{Ψ}}_{η} (h_{1}, h_{2})$ is Donsker. By Condition (C6) and Lemma 2 of the online Supplementary Material (Hu et al., 2023), after some algebraic calculations, we obtain $𝒫 {(ψ (β, Λ, F; X) [h_{1}, h_{2}] - ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}])}^{2} ≲ d_{3} {((β, Λ), (β_{0}, Λ_{0}))}^{2} + d_{2} {(F, F_{0})}^{2}$ . Then Corollary 2.3.12 of van der Vaart and Wellner (1996) implies that

\sqrt{n} (P_{n} - 𝒫) (ψ ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}; X) [h_{1}, h_{2}] - ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}]) = o_{p} (1),

and (B2) holds.

For (B3), $Q (β, Λ, F) [h_{1}, h_{2}]$ is Fréchet-differentiable with respect to ( $β$ , $Λ$ ) at ( $β_{0}$ , $Λ_{0}$ , $F_{0}$ ) because $Q (β, Λ, F) [h_{1}, h_{2}]$ is a smooth functional with respect to ( $β$ , $Λ$ , $F$ ). Similarly, $Q (β, Λ, F) [h_{1}, h_{2}]$ is Fréchet-differentiable with respect to $F$ at ( $β_{0}$ , $Λ_{0}$ , $F_{0}$ ). By some direct calculations, we obtain

{\dot{Q}}_{1, β_{0}, Λ_{0}, F_{0}} ({\hat{β}}_{n} - β_{0}, {\hat{Λ}}_{n} - Λ_{0}) [h_{1}, h_{2}] = \frac{d}{d ε} {𝒫 [\sum_{j = 1}^{K} {Δ (Δ N_{j} - (Δ Λ_{0, j} (Y) + ε (Δ {\hat{Λ}}_{n, j} (Y) - Δ Λ_{0, j} (Y))) e^{{(β_{0} + ε ({\hat{β}}_{n} - β_{0}))}^{T} Z} \times (Δ h_{2, j} (Y) + (Δ Λ_{0, j} (Y) + ε (Δ {\hat{Λ}}_{n, j} (Y) - Δ Λ_{0, j} (Y))) h_{1}^{T} Z) e^{{(β_{0} + ε ({\hat{β}}_{n} - β_{0}))}^{T} Z} + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} (Δ N_{j} - (Δ Λ_{0, j} (u) + ε (Δ {\hat{Λ}}_{n, j} (u) - Δ Λ_{0, j} (u))) e^{{(β_{0} + ε ({\hat{β}}_{n} - β_{0}))}^{T} Z}) {\times (Δ h_{2, j} (u) + (Δ Λ_{0, j} (u) + ε (Δ {\hat{Λ}}_{n, j} (u) - Δ Λ_{0, j} (u))) h_{1}^{T} Z) e^{{(β_{0} + ε ({\hat{β}}_{n} - β_{0}))}^{T} Z} d F_{0} (u ∣ Z)}]} ∣}_{ε = 0} ≔ - R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0}) - R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}),

where

R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0}) = - 𝒫 [e^{β_{0}^{T} Z} \sum_{j = 1}^{K} {Δ (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y)) (Δ h_{2, j} (Y) + Δ Λ_{0, j} (Y) h_{1}^{T} Z) + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)) (Δ h_{2, j} (u) + Δ Λ_{0, j} (u) h_{1}^{T} Z) d F_{0} (u ∣ Z)} Z^{T}] ({\hat{β}}_{n} - β_{0})

(8)

and

R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}) = - 𝒫 [e^{β_{0}^{T} Z} \sum_{j = 1}^{K} {Δ (Δ N_{j} h_{1}^{T} Z - 2 e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y) h_{1}^{T} Z - e^{β_{0}^{T} Z} Δ h_{2, j} (Y)) \times (Δ {\hat{Λ}}_{n, j} (Y) - Δ Λ_{0, j} (Y)) + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} (Δ {\hat{Λ}}_{n, j} (u) - Λ_{0, j} (u)) \times (Δ N_{j} h_{1}^{T} Z - 2 e^{β_{0}^{T} Z} Δ Λ_{0, j} (u) h_{1}^{T} Z - e^{β_{0}^{T} Z} Δ h_{2, j} (u)) d F_{0} (u ∣ Z)}] .

(9)

Since the equation

{d \frac{\int_{Y}^{\infty} g (u - T_{j}) d (F_{0} + ε ({\hat{F}}_{n} - F_{0})) (u ∣ Z)}{1 - F_{0} (Y ∣ Z) - ε ({\hat{F}}_{n} - F_{0}) (Y ∣ Z)} ∕ d ε ∣}_{ε = 0} = \frac{(1 - F_{0} (Y ∣ Z)) \int_{Y}^{\infty} g (u - T_{j}) d ({\hat{F}}_{n} - F_{0}) (u ∣ Z) + ({\hat{F}}_{n} - F_{0}) (Y ∣ Z) \int_{Y}^{\infty} g (u - T_{j}) d F_{0} (u ∣ Z)}{{(1 - F_{0} (Y ∣ Z))}^{2}} = \frac{1}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {g (u - T_{j}) - \int_{Y}^{\infty} \frac{g (s - T_{j})}{1 - F_{0} (Y ∣ Z)} d F_{0} (s ∣ Z)} d ({\hat{F}}_{n} - F_{0}) (u ∣ Z)

holds for any differentiable function $g$ , we obtain

{\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}] = {\frac{d}{d ε} {Q (β_{0}, Λ_{0}, F_{0} + ε ({\hat{F}}_{n} - F_{0})) [h_{1}, h_{2}]} ∣}_{ε = 0} = \frac{d}{d ε} {𝒫 [\sum_{j = 1}^{K} \frac{1 - Δ}{1 - F_{0} (Y ∣ Z) - ε ({\hat{F}}_{n} - F_{0}) (Y ∣ Z)} \int_{Y}^{\infty} {Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)} \times {e^{β_{0}^{T} Z} {Δ h_{2, j} (u) + Δ Λ_{0, j} (u) h_{1}^{T} Z} d (F_{0} + ε ({\hat{F}}_{n} - F_{0})) (u ∣ Z)]} ∣}_{ε = 0} = 𝒫 [\int_{Y}^{\infty} {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d ({\hat{F}}_{n} - F_{0}) (u ∣ Z)],

where

{\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] = \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \sum_{j = 1}^{K} {{\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] - \int_{Y}^{\infty} \frac{{\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (s; X) [h_{1}, h_{2}]}{1 - F_{0} (Y ∣ Z)} d F_{0} (s ∣ Z)},

and ${\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] = {Δ N_{j} - \exp (β_{0}^{T} Z) Δ Λ_{0, j} (u)} \exp (β_{0}^{T} Z) {Δ h_{2, j} (u) + Δ Λ_{0, j} (u) h_{1}^{T} Z}$ . Then (B3) is verified.

For (B4), since $d_{3} (({\hat{β}}_{n}, {\hat{Λ}}_{n}), (β_{0}, Λ_{0})) = O_{p} (n^{- r ∕ (1 + 2 r)})$ and by the Taylor expansion, we have $\exp ({\hat{β}}_{n}^{T} Z) = \exp (β_{0}^{T} Z) + \exp (β_{0}^{T} Z) Z^{T} ({\hat{β}}_{n} - β_{0}) + o_{p} (1 ∕ \sqrt{n})$ . By the above equation and Lemma 2 of the online Supplementary Material (Hu et al., 2023), we obtain

Q ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] - Q (β_{0}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] = 𝒫 [e^{β_{0}^{T} Z} \sum_{j = 1}^{K} {Δ (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ {\hat{Λ}}_{n, j} (Y)) (Δ h_{2, j} (Y) + Δ {\hat{Λ}}_{n, j} (Y) h_{1}^{T} Z) + \frac{1 - Δ}{1 - {\hat{F}}_{n} (Y ∣ Z)} \times \int_{Y}^{\infty} (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ {\hat{Λ}}_{n, j} (u)) (Δ h_{2, j} (u) + Δ {\hat{Λ}}_{n, j} (u) h_{1}^{T} Z) d {\hat{F}}_{n} (u ∣ Z)} Z^{T}] ({\hat{β}}_{n} - β_{0}) = 𝒫 [e^{β_{0}^{T} Z} \sum_{j = 1}^{K} {Δ (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ {\hat{Λ}}_{n, j} (Y)) (Δ h_{2, j} (Y) + Δ {\hat{Λ}}_{n, j} (Y) h_{1}^{T} Z) + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \times \int_{Y}^{\infty} (Δ N_{j} - 2 e^{β_{0}^{T} Z} Δ {\hat{Λ}}_{n, j} (u)) (Δ h_{2, j} (u) + Δ {\hat{Λ}}_{n, j} (u) h_{1}^{T} Z) d F_{0} (u ∣ Z)} Z^{T}] ({\hat{β}}_{n} - β_{0}) + o_{p} (n^{- 1 ∕ 2}) = - R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0}) + o_{p} (n^{- 1 ∕ 2}) .

(10)

Similarly, since $d_{1} ({\hat{Λ}}_{n}, Λ_{0}) = O_{p} (n^{- r ∕ (1 + 2 r)})$ and $d_{1} ({\hat{Λ}}_{n}^{'}, Λ_{0}^{'}) = o_{p} (1)$ , using Lemma 2 of the online Supplementary Material (Hu et al., 2023), we have

Q (β_{0}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] - Q (β_{0}, Λ_{0}, {\hat{F}}_{n}) [h_{1}, h_{2}] = 𝒫 [\sum_{j = 1}^{K} {Δ (e^{2 β_{0}^{T} Z} (Δ Λ_{0, j} {(Y)}^{2} - Δ {\hat{Λ}}_{n, j} {(Y)}^{2}) h_{1}^{T} Z + (Δ {\hat{Λ}}_{n, j} (Y) - Δ Λ_{0, j} (Y)) \times (Δ N_{j} e^{β_{0}^{T} Z} h_{1}^{T} Z - e^{2 β_{0}^{T} Z} Δ h_{2, j} (Y))) + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} (e^{2 β_{0}^{T} Z} (Δ {\hat{Λ}}_{n, j} {(u)}^{2} - Δ Λ_{0, j} {(u)}^{2}) h_{1}^{T} Z + (Δ {\hat{Λ}}_{n, j} (u) - Δ Λ_{0, j} (u)) (Δ N_{j} e^{β_{0}^{T} Z} h_{1}^{T} Z - e^{2 β_{0}^{T} Z} Δ h_{2, j} (u))) d F_{0} (u ∣ Z)}] + o_{p} (n^{- 1 ∕ 2}) = - R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}) - 𝒫 [\sum_{j = 1}^{K} {Δ e^{2 β_{0}^{T} Z} {(Δ {\hat{Λ}}_{n, j} (Y) - {Δ Λ}_{0, j} (Y))}^{2} h_{1}^{T} Z + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} (e^{2 β_{0}^{T} Z} {(Δ {\hat{Λ}}_{n, j} (u) - Δ Λ_{0, j} (u))}^{2} h_{1}^{T} Z) d F_{0} (u ∣ Z)}] + o_{p} (n^{- 1 ∕ 2}) = - R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}) + o_{p} (n^{- 1 ∕ 2}) .

(11)

By (10) and (11), it follows that $Q ({\hat{β}}_{n}, {\hat{Λ}}_{n}, {\hat{F}}_{n}) [h_{1}, h_{2}] - Q (β_{0}, Λ_{0}, {\hat{F}}_{n}) [h_{1}, h_{2}] = {\dot{Q}}_{1, β_{0}, Λ_{0}, F_{0}} ({\hat{β}}_{n} - β_{0}, {\hat{Λ}}_{n} - Λ_{0}) [h_{1}, h_{2}] + o_{p} (1 ∕ \sqrt{n})$ . By Lemma 2 of the online Supplementary Material (Hu et al., 2023), we can obtain that

∣ Q (β_{0}, Λ_{0}, {\hat{F}}_{n}) [h_{1}, h_{2}] - Q (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] - {\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}] ∣ = ∣ 𝒫 [(1 - Δ) \frac{{\hat{F}}_{n} (Y ∣ Z) - F_{0} (Y ∣ Z)}{1 - F_{0} (Y ∣ Z)} \sum_{j = 1}^{K} {\frac{\int_{Y}^{\infty} {\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d {\hat{F}}_{n} (u ∣ Z)}{1 - {\hat{F}}_{n} (Y ∣ Z)} - \frac{\int_{Y}^{\infty} {\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d F_{0} (u ∣ Z)}{1 - F_{0} (Y ∣ Z)}}] ∣ ≲ {‖ {\hat{F}}_{n} - F_{0} ‖}_{\infty} 𝒫 [(1 - Δ) \sum_{j = 1}^{K} ∣ \frac{\int_{Y}^{\infty} {\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d {\hat{F}}_{n} (u ∣ Z)}{1 - {\hat{F}}_{n} (Y ∣ Z)} - \frac{\int_{Y}^{\infty} {\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d F_{0} (u ∣ Z)}{1 - F_{0} (Y ∣ Z)} ∣] ≲ {‖ {\hat{F}}_{n} - F_{0} ‖}_{\infty}^{2} = o_{p} (n^{- 1 ∕ 2}) .

Thus, (B4) holds.

Finally, we consider (B5). Note that

\sqrt{n} Q_{n} (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] + \sqrt{n} {\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}] = \sqrt{n} P_{n} ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}] ∣ + \sqrt{n} 𝒫 [\int_{Y}^{\infty} {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d ({\hat{F}}_{n} - F_{0}) (u ∣ Z)] .

According to Lemma 1 of the online Supplementary Material (Hu et al., 2023), we have

𝒫 [\int_{Y}^{\infty} {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}] d ({\hat{F}}_{n} (u ∣ Z) - F_{0} (u ∣ Z))] = 𝒫 [\int_{Y}^{\infty} \frac{\partial {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}]}{\partial u} ({\hat{F}}_{n} (u ∣ Z) - F_{0} (u ∣ Z)) d u - {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (Y; X) [h_{1}, h_{2}] ({\hat{F}}_{n} (Y ∣ Z) - F_{0} (Y ∣ Z))] = 𝒫 [\frac{1}{n} \sum_{i = 1}^{n} {\int_{Y}^{\infty} \frac{\partial {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}]}{\partial u} Ω (u, Z; {\tilde{Y}}_{i}, {\tilde{Δ}}_{i}, {\tilde{Z}}_{i}) d u - {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (Y; X) [h_{1}, h_{2}] Ω (Y, Z; {\tilde{Y}}_{i}, {\tilde{Δ}}_{i}, {\tilde{Z}}_{i})}] ≔ P_{n} φ (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}) [h_{1}, h_{2}],

where

φ (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}) [h_{1}, h_{2}] = 𝒫_{X} [{\int_{Y}^{\infty} \frac{\partial {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (u; X) [h_{1}, h_{2}]}{\partial u} Ω (u, Z; \tilde{Y}, \tilde{Δ}, \tilde{Z}) d u - {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}} (Y; X) [h_{1}, h_{2}] Ω (Y, Z; \tilde{Y}, \tilde{Δ}, \tilde{Z})}] .

By the central limit theorem, setting

σ_{0} {[h_{1}, h_{2}]}^{2} = E [{ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}, h_{2}] + φ (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}) [h_{1}, h_{2}]}^{2}],

(12)

we have $\sqrt{n} Q_{n} (β_{0}, Λ_{0}, F_{0}) [h_{1}, h_{2}] + \sqrt{n} {\dot{Q}}_{2, β_{0}, Λ_{0}, F_{0}} ({\hat{F}}_{n} - F_{0}) [h_{1}, h_{2}] ⇝ N (0, σ_{0} {[h_{1}, h_{2}]}^{2})$ , and (B5) holds.

By Theorem 1 of Zhao and Zhang (2017), (B1)-(B5) yields that

\sqrt{n} R_{1} (h_{1}, h_{2}) ({\hat{β}}_{n} - β_{0}) + \sqrt{n} R_{2} (h_{1}, h_{2}) ({\hat{Λ}}_{n} - Λ_{0}) ⇝ N (0, σ_{0} {[h_{1}, h_{2}]}^{2}) .

(ii) To prove the asymptotic normality of ${\hat{β}}_{n}$ , we need to find an ( $h_{1}^{*}$ , $h_{2}^{*}$ ) such that $R_{2} (h_{1}^{*}, h_{2}^{*}) ({\hat{Λ}}_{n} - Λ_{0}) = 0$ . After some algebraic calculations, we obtain

R_{2} (h_{1}^{*}, h_{2}^{*}) ({\hat{Λ}}_{n} - Λ_{0}) = 𝒫 [\sum_{j = 1}^{K} {(Δ {\hat{Λ}}_{n, j} (U) - Δ Λ_{0, j} (U)) E [(Δ h_{2, j}^{*} (U) + Δ Λ_{0, j} (U) h_{1}^{* T} Z) e^{2 β_{0}^{T} Z} ∣ U, K, T]}] .

Setting $R^{*} (U, K, T) = E [\exp (2 β_{0}^{T} Z) Z ∣ U, K, T] ∕ E [\exp (2 β_{0}^{T} Z) ∣ U, K, T]$ , the above equality implies that $Δ h_{2, j}^{*} (U) = - h_{1}^{* T} (U, K, T) Δ Λ_{0, j} (U)$ . Then we have

Δ Λ_{j} (U) h_{1}^{* T} Z + Δ h_{2, j}^{*} (U) = Δ Λ_{j} (U) h_{1}^{* T} (Z - R^{*} (U, K, T)) .

(13)

It follows that $R_{1} (h_{1}^{*}, h_{2}^{*}) (({\hat{β}}_{n} - β_{0}) = h_{1}^{* T} A^{*} ({\hat{β}}_{n} - β_{0})$ , where

A^{*} = 𝒫 [\sum_{j = 1}^{K} {e^{2 β_{0}^{T} Z} Δ Λ_{0, j} {(U)}^{2} {(Z - R^{*} (U, K, T))}^{\otimes 2}}] .

Furthermore, by (13), we obtain

ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}^{*}, h_{2}^{*}] = h_{1}^{* T} \sum_{j = 1}^{K} [Δ {Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y)} e^{β_{0}^{T} Z} Δ Λ_{0, j} (Y) (Z - R^{*} (Y, K, T)) + \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \int_{Y}^{\infty} {Δ N_{j} - e^{β_{0}^{T} Z} Δ Λ_{0, j} (u)} e^{β^{T} Z} Δ Λ_{0, j} (u) (Z - R^{*} (u, K, T)) d F_{0} (u ∣ Z)] ≔ h_{1}^{* T} ψ^{*} (β_{0}, Λ_{0}, F_{0}; X), φ (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}) [h_{1}^{*}, h_{2}^{*}] = h_{1}^{* T} 𝒫_{X} = [{\int_{Y}^{\infty} \frac{\partial {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}}^{*} (u; X)}{\partial u} Ω (u, Z; \tilde{Y}, \tilde{Δ}, \tilde{Z}) d u - {\bar{φ}}_{β_{0}, Λ_{0}, F_{0}}^{*} (Y; X) Ω (Y, Z; \tilde{Y}, \tilde{Δ}, \tilde{Z})}] ≔ h_{1}^{* T} φ^{*} (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}),

where

{\bar{φ}}_{β_{0}, Λ_{0}, F_{0}}^{*} (u; X) = \frac{1 - Δ}{1 - F_{0} (Y ∣ Z)} \sum_{j = 1}^{K} {{\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}}^{*} (u; X) - \int_{Y}^{\infty} \frac{{\tilde{φ}}_{j, β_{0}, Λ_{0}, F_{0}}^{*} (s; X)}{1 - F_{0} (Y ∣ Z)} d F_{0} (s ∣ Z)}

and ${\tilde{φ}}^{*}_{j, β_{0}, Λ_{0}, F_{0}} (u; X) = {Δ N_{j} - \exp (β_{0}^{T} Z) Δ Λ_{0, j} (u)} \exp (β_{0}^{T} Z) {Δ Λ_{j} (u) (Z - R^{*} (u, K, T))$ . After some algebraic calculations, we have

σ_{0} {[h_{1}^{*}, h_{2}^{*}]}^{2} = E [{ψ (β_{0}, Λ_{0}, F_{0}; X) [h_{1}^{*}, h_{2}^{*}] + φ (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z}) [h_{1}^{*}, h_{2}^{*}]}^{2}] = h_{1}^{* T} E [{ψ^{*} (β_{0}, Λ_{0}, F_{0}; X) + φ^{*} (β_{0}, Λ_{0}, F_{0}; \tilde{Y}, \tilde{Δ}, \tilde{Z})}^{2}] h_{1}^{*} ≔ h_{1}^{* T} B^{*} h_{1}^{*} .

It follows that $\sqrt{n} h_{1}^{* T} A^{*} ({\hat{β}}_{n} - β_{0}) ⇝ N (0, h_{1}^{* T} B^{*} h_{1}^{*})$ for all $h_{1}^{*} \in ℛ$ . Then we obtain $\sqrt{n} ({\hat{β}}_{n} - β_{0}) ⇝ N (0, {(A^{*})}^{- 1} B^{*} {({(A^{*})}^{- 1})}^{T})$ . □

Footnotes

Supplementary Material

Lemmas (DOI: 10.3150/22-BEJ1565SUPP; .pdf). The supplementary material contains some Lemmas.

References

Balakrishnan N and Zhao X (2009). New multi-sample nonparametric tests for panel count data. Ann. Statist 37 1112–1149. MR2509069 10.1214/08-AOS599 [DOI] [Google Scholar]
Breslow NE (1972). Discussion of the paper by D. R. Cox. J. R. Stat. Soc. Ser. B. Stat. Methodol 34 216–217. [Google Scholar]
Chan KCG and Wang M-C (2010). Backward estimation of stochastic processes with failure events as time origins. Ann. Appl. Stat 4 1602–1620. MR2758343 10.1214/09-AOAS319 [DOI] [PMC free article] [PubMed] [Google Scholar]
Chiou SH, Huang C-Y, Xu G and Yan J (2019). Semiparametric regression analysis of panel count data: A practical review. Int. Stat. Rev 87 24–43. MR3940137 10.1111/insr.12271 [DOI] [PMC free article] [PubMed] [Google Scholar]
Cox DR (1972). Regression models and life-tables. J. Roy. Statist. Soc. Ser. B 34 187–220. MR0341758 [Google Scholar]
Hu X, Liu L, Zhang Y and Zhao X (2023). Supplement to “Semiparametric regression of panel count data with informative terminal event.” 10.3150/22-BEJ1565SUPP [DOI] [PMC free article] [PubMed] [Google Scholar]
Kalbfleisch JD and Lawless JF (1985). The analysis of panel data under a Markov assumption. J. Amer. Statist. Assoc 80 863–871. MR0819585 [Google Scholar]
Kong S, Nan B, Kalbfleisch JD, Saran R and Hirth R (2018). Conditional modeling of longitudinal data with terminal event. J. Amer. Statist. Assoc 113 357–368. MR3803470 10.1080/01621459.2016.1255637 [DOI] [PMC free article] [PubMed] [Google Scholar]
Lange K. (2001). Numerical Analysis for Statisticians. Statistics and Computing. NewYork: Springer. MR1681963 [Google Scholar]
Li L, Wu C-H, Ning J, Huang X, Shih Y-CT and Shen Y (2018). Semiparametric estimation of longitudinal medical cost trajectory. J. Amer. Statist. Assoc 113 582–592. MR3832210 10.1080/01621459.2017.1361329 [DOI] [PMC free article] [PubMed] [Google Scholar]
Li Z, Frost HR, Tosteson TD, Zhao L, Liu L, Lyons K, Chen H, Cole B, Currow D and Bakitas M (2017). A semiparametric joint model for terminal trend of quality of life and survival in palliative care research. Stat. Med 36 4692–4704. MR3731248 10.1002/sim.7445 [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu L, Su W, Yin G, Zhao X and Zhang Y (2022). Nonparametric inference for reversed mean models with panel count data. Bernoulli 28 2968–2997. MR4474569 10.3150/21-bej1444 [DOI] [PMC free article] [PubMed] [Google Scholar]
Lu M, Zhang Y and Huang J (2007). Estimation of the mean function with panel count data using monotone polynomial splines. Biometrika 94 705–718. MR2410018 10.1093/biomet/asm057 [DOI] [Google Scholar]
Lu M, Zhang Y and Huang J (2009). Semiparametric estimation methods for panel count data using monotone B-splines. J. Amer. Statist. Assoc 104 1060–1070. MR2750237 10.1198/jasa.2009.tm08086 [DOI] [Google Scholar]
Shen B, Chen C, Liu D, Datta S, Ghahramani N, Chinchilli VM and Wang M (2021). Joint modeling of longitudinal data with informative cluster size adjusted for zero-inflation and a dependent terminal event. Stat. Med 40 4582–4596. MR4315439 10.1002/sim.9081 [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun J and Kalbfleisch JD (1995). Estimation of the mean function of point processes based on panel count data. Statist. Sinica 5 279–289. MR1329298 [Google Scholar]
Sun J, Tong X and He X (2007). Regression analysis of panel count data with dependent observation times. Biometrics 63 1053–1059. MR2414582 10.1111/j.1541-0420.2007.00808.x [DOI] [PubMed] [Google Scholar]
Sun J and Zhao X (2013). Statistical Analysis of Panel Count Data. Statistics for Biology and Health. NewYork: Springer. MR 3136574 10.1007/978-1-4614-8715-9 [DOI] [Google Scholar]
Sun L, Song X, Zhou J and Liu L (2012). Joint analysis of longitudinal data with informative observation times and a dependent terminal event. J. Amer. Statist. Assoc 107 688–700. MR2980077 10.1080/01621459.2012.682528. [DOI] [Google Scholar]
Tong X, He X, Sun L and Sun J (2009). Variable selection for panel count data via non-concave penalized estimating function. Scand. J. Stat 36 620–635. MR2572579 10.1111/j.1467-9469.2009.00658.x [DOI] [Google Scholar]
van der Vaart A (2002). Semiparametric statistics. In Lectures on Probability Theory and Statistics (Saint-Flour, 1999). Lecture Notes in Math. 1781 331–457. Berlin: Springer. MR1915446 [Google Scholar]
van der Vaart AW and Wellner JA (1996). Weak Convergence and Empirical Processes: With Applications to Statistics. Springer Series in Statistics. New York: Springer. MR1385671 10.1007/978-1-4757-2545-2 [DOI] [Google Scholar]
Wellner JA and Zhang Y (2000). Two estimators of the mean of a counting process with panel count data. Ann. Statist 28 779–814. MR1792787 10.1214/aos/1015951998 [DOI] [Google Scholar]
Wellner JA and Zhang Y (2007). Two likelihood-based semiparametric estimation methods for panel count data with covariates. Ann. Statist 35 2106–2142. MR2363965 10.1214/009053607000000181 [DOI] [Google Scholar]
Zeng Y, Vaupel J, Xiao Z, Liu Y and Zhang C Chinese Longitudinal Healthy Longevity Survey (CLHLS) 1998–2014. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor] 2017-April-11. 10.3886/ICPSR36692.v1 [DOI] [Google Scholar]
Zhang H, Sun J and Wang D (2013). Variable selection and estimation for multivariate panel count data via the seamless- $L_{0}$ penalty. Canad. J. Statist 41 368–385. MR3061885 10.1002/cjs.11172 [DOI] [Google Scholar]
Zhang Y. (2002). A semiparametric pseudolikelihood estimation method for panel count data. Biometrika 89 39–48. MR1888344 10.1093/biomet/89.1.39 [DOI] [Google Scholar]
Zhang Y. (2006). Nonparametric k-sample tests with panel count data. Biometrika 93 777–790. MR2285071 10.1093/biomet/93.4.777 [DOI] [Google Scholar]
Zhao H, Li Y and Sun J (2013a). Analyzing panel count data with a dependent observation process and a terminal event. Canad. J. Statist 41 174–191. MR3030791 10.1002/cjs.11143 [DOI] [Google Scholar]
Zhao H, Li Y and Sun J (2013b). Semiparametric analysis of multivariate panel count data with dependent observation processes and a terminal event. J. Nonparametr. Stat 25 379–394. MR3056091 10.1080/10485252.2012.758724 [DOI] [Google Scholar]
Zhao X and Sun J (2011). Nonparametric comparison for panel count data with unequal observation processes. Biometrics 67 770–779. MR2829131 10.1111/j.1541-0420.2010.01504.x [DOI] [PubMed] [Google Scholar]
Zhao X and Zhang Y (2017). Asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data. Statist. Sinica 27 931–950. MR3675037 [Google Scholar]
Zhou J, Zhang H, Sun L and Sun J (2017). Joint analysis of panel count data with an informative observation process and a dependent terminal event. Lifetime Data Anal. 23 560–584. MR3705825 10.1007/s10985-016-9375-y [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Suppl

NIHMS2018374-supplement-Suppl.pdf^{(175.4KB, pdf)}

[R1] Balakrishnan N and Zhao X (2009). New multi-sample nonparametric tests for panel count data. Ann. Statist 37 1112–1149. MR2509069 10.1214/08-AOS599 [DOI] [Google Scholar]

[R2] Breslow NE (1972). Discussion of the paper by D. R. Cox. J. R. Stat. Soc. Ser. B. Stat. Methodol 34 216–217. [Google Scholar]

[R3] Chan KCG and Wang M-C (2010). Backward estimation of stochastic processes with failure events as time origins. Ann. Appl. Stat 4 1602–1620. MR2758343 10.1214/09-AOAS319 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Chiou SH, Huang C-Y, Xu G and Yan J (2019). Semiparametric regression analysis of panel count data: A practical review. Int. Stat. Rev 87 24–43. MR3940137 10.1111/insr.12271 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] Cox DR (1972). Regression models and life-tables. J. Roy. Statist. Soc. Ser. B 34 187–220. MR0341758 [Google Scholar]

[R6] Hu X, Liu L, Zhang Y and Zhao X (2023). Supplement to “Semiparametric regression of panel count data with informative terminal event.” 10.3150/22-BEJ1565SUPP [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Kalbfleisch JD and Lawless JF (1985). The analysis of panel data under a Markov assumption. J. Amer. Statist. Assoc 80 863–871. MR0819585 [Google Scholar]

[R8] Kong S, Nan B, Kalbfleisch JD, Saran R and Hirth R (2018). Conditional modeling of longitudinal data with terminal event. J. Amer. Statist. Assoc 113 357–368. MR3803470 10.1080/01621459.2016.1255637 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Lange K. (2001). Numerical Analysis for Statisticians. Statistics and Computing. NewYork: Springer. MR1681963 [Google Scholar]

[R10] Li L, Wu C-H, Ning J, Huang X, Shih Y-CT and Shen Y (2018). Semiparametric estimation of longitudinal medical cost trajectory. J. Amer. Statist. Assoc 113 582–592. MR3832210 10.1080/01621459.2017.1361329 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Li Z, Frost HR, Tosteson TD, Zhao L, Liu L, Lyons K, Chen H, Cole B, Currow D and Bakitas M (2017). A semiparametric joint model for terminal trend of quality of life and survival in palliative care research. Stat. Med 36 4692–4704. MR3731248 10.1002/sim.7445 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Liu L, Su W, Yin G, Zhao X and Zhang Y (2022). Nonparametric inference for reversed mean models with panel count data. Bernoulli 28 2968–2997. MR4474569 10.3150/21-bej1444 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Lu M, Zhang Y and Huang J (2007). Estimation of the mean function with panel count data using monotone polynomial splines. Biometrika 94 705–718. MR2410018 10.1093/biomet/asm057 [DOI] [Google Scholar]

[R14] Lu M, Zhang Y and Huang J (2009). Semiparametric estimation methods for panel count data using monotone B-splines. J. Amer. Statist. Assoc 104 1060–1070. MR2750237 10.1198/jasa.2009.tm08086 [DOI] [Google Scholar]

[R15] Shen B, Chen C, Liu D, Datta S, Ghahramani N, Chinchilli VM and Wang M (2021). Joint modeling of longitudinal data with informative cluster size adjusted for zero-inflation and a dependent terminal event. Stat. Med 40 4582–4596. MR4315439 10.1002/sim.9081 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Sun J and Kalbfleisch JD (1995). Estimation of the mean function of point processes based on panel count data. Statist. Sinica 5 279–289. MR1329298 [Google Scholar]

[R17] Sun J, Tong X and He X (2007). Regression analysis of panel count data with dependent observation times. Biometrics 63 1053–1059. MR2414582 10.1111/j.1541-0420.2007.00808.x [DOI] [PubMed] [Google Scholar]

[R18] Sun J and Zhao X (2013). Statistical Analysis of Panel Count Data. Statistics for Biology and Health. NewYork: Springer. MR 3136574 10.1007/978-1-4614-8715-9 [DOI] [Google Scholar]

[R19] Sun L, Song X, Zhou J and Liu L (2012). Joint analysis of longitudinal data with informative observation times and a dependent terminal event. J. Amer. Statist. Assoc 107 688–700. MR2980077 10.1080/01621459.2012.682528. [DOI] [Google Scholar]

[R20] Tong X, He X, Sun L and Sun J (2009). Variable selection for panel count data via non-concave penalized estimating function. Scand. J. Stat 36 620–635. MR2572579 10.1111/j.1467-9469.2009.00658.x [DOI] [Google Scholar]

[R21] van der Vaart A (2002). Semiparametric statistics. In Lectures on Probability Theory and Statistics (Saint-Flour, 1999). Lecture Notes in Math. 1781 331–457. Berlin: Springer. MR1915446 [Google Scholar]

[R22] van der Vaart AW and Wellner JA (1996). Weak Convergence and Empirical Processes: With Applications to Statistics. Springer Series in Statistics. New York: Springer. MR1385671 10.1007/978-1-4757-2545-2 [DOI] [Google Scholar]

[R23] Wellner JA and Zhang Y (2000). Two estimators of the mean of a counting process with panel count data. Ann. Statist 28 779–814. MR1792787 10.1214/aos/1015951998 [DOI] [Google Scholar]

[R24] Wellner JA and Zhang Y (2007). Two likelihood-based semiparametric estimation methods for panel count data with covariates. Ann. Statist 35 2106–2142. MR2363965 10.1214/009053607000000181 [DOI] [Google Scholar]

[R25] Zeng Y, Vaupel J, Xiao Z, Liu Y and Zhang C Chinese Longitudinal Healthy Longevity Survey (CLHLS) 1998–2014. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor] 2017-April-11. 10.3886/ICPSR36692.v1 [DOI] [Google Scholar]

[R26] Zhang H, Sun J and Wang D (2013). Variable selection and estimation for multivariate panel count data via the seamless- $L_{0}$ penalty. Canad. J. Statist 41 368–385. MR3061885 10.1002/cjs.11172 [DOI] [Google Scholar]

[R27] Zhang Y. (2002). A semiparametric pseudolikelihood estimation method for panel count data. Biometrika 89 39–48. MR1888344 10.1093/biomet/89.1.39 [DOI] [Google Scholar]

[R28] Zhang Y. (2006). Nonparametric k-sample tests with panel count data. Biometrika 93 777–790. MR2285071 10.1093/biomet/93.4.777 [DOI] [Google Scholar]

[R29] Zhao H, Li Y and Sun J (2013a). Analyzing panel count data with a dependent observation process and a terminal event. Canad. J. Statist 41 174–191. MR3030791 10.1002/cjs.11143 [DOI] [Google Scholar]

[R30] Zhao H, Li Y and Sun J (2013b). Semiparametric analysis of multivariate panel count data with dependent observation processes and a terminal event. J. Nonparametr. Stat 25 379–394. MR3056091 10.1080/10485252.2012.758724 [DOI] [Google Scholar]

[R31] Zhao X and Sun J (2011). Nonparametric comparison for panel count data with unequal observation processes. Biometrics 67 770–779. MR2829131 10.1111/j.1541-0420.2010.01504.x [DOI] [PubMed] [Google Scholar]

[R32] Zhao X and Zhang Y (2017). Asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data. Statist. Sinica 27 931–950. MR3675037 [Google Scholar]

[R33] Zhou J, Zhang H, Sun L and Sun J (2017). Joint analysis of panel count data with an informative observation process and a dependent terminal event. Lifetime Data Anal. 23 560–584. MR3705825 10.1007/s10985-016-9375-y [DOI] [PubMed] [Google Scholar]

PERMALINK

Semiparametric regression of panel count data with informative terminal event

XIANGBIN HU

LI LIU

YING ZHANG

XINGQIU ZHAO

Abstract

1. Introduction

2. Model setting and estimation procedure

3. Asymptotic properties

4. Simulation studies

Figure 1.

Figure 2.

Table 1.

5. Application

Table 2.

Figure 3.

Table 3.

6. Concluding remarks

Supplementary Material

Acknowledgments

Funding

Appendix: Proofs of main results

A.1. Proof of Theorem 3.1

A.2. Proof of Theorem 3.2

A.3. Proof of Theorem 3.3

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Semiparametric regression of panel count data with informative terminal event

XIANGBIN HU

LI LIU

YING ZHANG

XINGQIU ZHAO

Abstract

1. Introduction

2. Model setting and estimation procedure

3. Asymptotic properties

4. Simulation studies

Figure 1.

Figure 2.

Table 1.

5. Application

Table 2.

Figure 3.

Table 3.

6. Concluding remarks

Supplementary Material

Acknowledgments

Funding

Appendix: Proofs of main results

A.1. Proof of Theorem 3.1

A.2. Proof of Theorem 3.2

A.3. Proof of Theorem 3.3

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases