Semiparametric Random Effects Models for Longitudinal Data with Informative Observation Times

Yang Li; Yanqing Sun

doi:10.4310/SII.2016.v9.n3.a7

Author manuscript; available in PMC: 2017 May 15.

Published in final edited form as: Stat Interface. 2016;9(3):333–341. doi: 10.4310/SII.2016.v9.n3.a7


PMCID: PMC5431605 NIHMSID: NIHMS851581 PMID: 28515829

Abstract

Longitudinal data frequently arise in many fields such as medical follow-up studies focusing on specific longitudinal responses. In such situations, the responses are recorded only at discrete observation times. Most existing approaches for longitudinal data analysis assume that the observation or follow-up times are independent of the underlying response process, either completely or given some known covariates. We present a joint analysis approach in which possible correlations among the responses, observation and follow-up times can be characterized by time-dependent random effects. Estimating equations are developed for parameter estimation and the resulting estimates are shown to be consistent and asymptotically normal. A simulation study is conducted to assess the finite sample performance of the approach and the method is applied to data arising from a skin cancer study.

Keywords and phrases: estimating equations, informative censoring, informative observation process, joint analysis approach, longitudinal data

1. INTRODUCTION

Longitudinal data arise in many fields such as medical follow-up studies that focus on longitudinal responses. In such situations, each study subject is observed only at finite discrete times rather than continuously. Therefore, the responses are known only at a set of observation times but missing otherwise. The resulting data are usually incomplete and unbalanced among individuals.

Analysis of longitudinal data concerns two processes: one is the underlying response process, which is usually of practical interest but not continuously observable. The other refers to the observation process, which determines the discrete observation times. Many authors have considered the analysis of longitudinal data, for example, Diggle et al. (1994) who presented a relatively comprehensive review about the commonly considered models and estimation methods. Lin and Ying (2001), Welsh et al. (2002), Wellner and Zhang (2007) and Sun (2010) developed some semiparametric and nonparametric procedures for regression analysis. These approaches all assume that the two processes mentioned above are independent, either completely or conditional on some known covariates. To relax this assumption, Sun et al. (2007b), Zhao and Tong (2011) and Zhao et al. (2013b) modeled the possible correlations by time-independent random effects. However, these methods assume follow-up times to be independent from both the response and the observation processes given covariates.

In many situations, the underlying response process, the observation and follow-up times may be correlated. For example, both observation times and responses may depend on the stage of disease progression, which can also often determine the follow-up time. Lipsitz et al. (2002) considered general linear models for longitudinal data where the responses were assumed to have a multivariate Gaussian distribution. Sun et al. (2007), He et al. (2009) and Sun et al. (2012) proposed joint model based approaches; however, it is assumed that the shared random effects are fixed over time or follow some specific distributions, and the covariates are either multiplicative or additive in their effects to the response process. Without such specific distribution assumption, Sun et al. (2005) and Zhao et al. (2013) considered marginal model based methods; however, the models indicate that when the observation process is common for everyone, people with the same covariates are expected to have the same responses throughout the study. It is apparent that such assumptions may not be realistic in many applications.

We present a joint analysis approach for longitudinal data in which the possible correlations can be characterized by time-dependent random effects with arbitrary distributions. For the response process, a class of semiparametric transformation models is considered. Estimating equations are developed for parameter estimation and the resulting estimators are shown to be consistent and asymptotically normal. The remainder of this paper is organized as follows. We introduce notation and present the relevant models in Section 2. Section 3 presents the estimation procedure and establishes asymptotic properties of the proposed estimators. Section 4 describes a model-checking technique, and Section 5 presents an extensive simulation study that evaluates the finite sample properties of the estimation procedure. An illustrative example is given in Section 6 and some discussion and remarks are provided in Section 7.

2. NOTATION AND MODELS

Consider a longitudinal study in which subjects are observed only at discrete times. For subject i (i = 1, …, n), let Y_i(t) denote the response process and let N_i(t) be the observation process, which gives the cumulative number of observations up to time t. In practice, one often observes Ñ_i(t) = N_i(t ∧ C_i), where a ∧ b = min(a, b) and C_i denotes a possible censoring or follow-up time. Let {T_{i,1}, …, T_{i,m_i}} be the discrete times when Y_i(t) is observed and let Z_i(t) be a p-dimensional vector of covariates, assumed to be continuously traceable in the study. In the following, we present a joint modeling approach and model the possible correlation between Y_i(t), N_i(t) and C_i through an unobserved random process b_i(t) = (b_{1i}(t), b_{2i}(t), b_{3i}(t))′. Define ℬ_it = {b_i(s), s ≤ t} and 𝒵_it = {Z_i(s), s ≤ t}. We assume that the b_i(t)’s are independent and identically distributed with b_{1i}(t) > 0 and b_{2i}(t) > 0, that ℬ_it is independent of 𝒵_it, and that, given 𝒵_it and ℬ_it, C_i, N_i(t) and Y_i(t) are mutually independent. We also assume that the mean function of Y_i(t) follows the semiparametric transformation model

E {Y_{i} (t) ∣ Z_{i} (t), b_{i} (t)} = g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} b_{1 i} (t),

(1)

where g(·) is a known twice continuously differentiable and strictly increasing link function, θ is a vector of unknown regression parameters and μ₀(t) denotes an unspecified smooth function of t. We assume that E{b_{1i}(t) | dÑ_i(t) = 1, 𝒵_it} = 1 for identifiability. In particular, when g(x) = x, μ₀(t) represents the baseline mean function, which is estimable at {T_{i,1}, …, T_{i,m_i}}.

The observation process N_i(t) is assumed to follow the marginal proportional rates model given by

E {d N_{i} (t) ∣ Z_{i} (t), b_{i} (t)} = exp {γ^{'} Z_{i} (t)} b_{2 i} (t) d Λ_{0} (t),

(2)

where E{b_{2i}(t)} = 1, γ is a vector of unknown parameters and dΛ₀(t) is an unknown baseline rate function. Both of the above models can be viewed as natural generalizations of the transformation model and proportional rates model studied in Li et al. (2010), Zhao et al. (2011) and Zhao et al. (2013), among others. Compared with the existing models, the proposed models are relatively flexible in handling the possible dependence since neither the form nor the distribution of b_i(t) needs to be specified. By taking different forms of g(·) and b_i(t), model (1) allows for various types of dependence of the mean function of Y_i(t) on N_i(t) and Z_i(t). In particular, when either b_{1i}(t) or b_{2i}(t) is unity or independent of the other, the two processes Y_i(t) and N_i(t) are independent given 𝒵_it. Therefore, the estimation procedure proposed next also applies to data with noninformative observation times as a special case.

For the follow-up or censoring time C_i, we consider the following additive hazards model

λ_{i} (t ∣ Z_{i} (t), b_{i} (t)) = λ_{0} (t) + ξ^{'} Z_{i} (t) + b_{3 i} (t),

(3)

where E{b_{3i}(t)} = 0, λ₀(t) is an unknown baseline hazard function and ξ is an unknown vector of regression parameters. The random effects b_{1i}(t), b_{2i}(t) and b_{3i}(t) characterize possible correlations between C_i and Y_i(t), N_i(t); in particular, b_{3i}(t) = 0 implies noninformative censoring. The same model has also been studied in Kalbfleisch and Prentice (2002), Lin et al. (1998), Zhang et al. (2005) and Sun et al. (2013), among others. In the following, we study the joint analysis of the proposed models with the focus on estimation of the regression parameters θ along with γ and ξ.

3. ESTIMATION PROCEDURE

In this section, we present an estimation procedure for θ which is usually of primary interest. To this end, first note that Ñ_i(t) jumps by one at time t if and only if C_i ≥ t and dN_i(t) = 1. Based on the conditional independence assumption between C_i, N_i(t) and Y_i(t) given 𝒵_it and ℬ_it, we have, under (2)

\begin{array}{l} E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} = E [E {I (t \leq C_{i}) d N_{i} (t) ∣ Z_{i t}, ℬ_{i t}} | Z_{i t}] \\ = E [E {I (t \leq C_{i}) ∣ Z_{i t}, ℬ_{i t}} E {d N_{i} (t) ∣ Z_{i t}, ℬ_{i t}} | Z_{i t}] \\ = E {I (t \leq C_{i}) b_{2 i} (t) ∣ Z_{i t}} exp {γ^{'} Z_{i} (t)} d Λ_{0} (t) . \end{array}

(4)

By the law of iterated expectations and model (3), the first factor in the last line of (4) equals

\begin{array}{l} E {I (t \leq C_{i}) b_{2 i} (t) ∣ Z_{i t}} \\ = E {exp {- Λ_{0}^{*} (t) - B_{i} (t) - ξ^{'} Z_{i}^{*} (t)} b_{2 i} (t) | Z_{i t}}, \end{array}

where $Λ_{0}^{*} (t) = \int_{0}^{t} λ_{0} (s) d s, B_{i} (t) = \int_{0}^{t} b_{3 i} (s) d s$ and $Z_{i}^{*} (t) = \int_{0}^{t} Z_{i} (s) d s$ . Hence,

E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} = exp {η^{'} X_{i}^{*} (t)} d Λ_{1}^{*} (t),

(5)

where η = (γ′, ξ′)′, $X_{i}^{*} (t) = {(Z_{i}^{'} (t), - Z_{i}^{' *} (t))}^{'}$ and $d Λ_{1}^{*} (t) = exp {- Λ_{0}^{*} (t)} E [b_{2 i} (t) \exp {- B_{i} (t)}] d Λ_{0} (t)$ .
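The step from (4) to (5) can be spelled out as follows. This is a sketch of the intermediate algebra, relying only on the assumption from Section 2 that ℬ_it is independent of 𝒵_it, so that the expectation over the random effects does not depend on the covariate history; the notation follows the definitions above.

```latex
\begin{aligned}
E\{I(t \le C_i)\, b_{2i}(t) \mid \mathcal{Z}_{it}\}
  &= e^{-\Lambda_0^*(t) - \xi' Z_i^*(t)}\, E\bigl[b_{2i}(t)\, e^{-B_i(t)}\bigr], \\
E\{d\tilde{N}_i(t) \mid \mathcal{Z}_{it}\}
  &= e^{\gamma' Z_i(t) - \xi' Z_i^*(t)}\, e^{-\Lambda_0^*(t)}\,
     E\bigl[b_{2i}(t)\, e^{-B_i(t)}\bigr]\, d\Lambda_0(t) \\
  &= e^{\eta' X_i^*(t)}\, d\Lambda_1^*(t).
\end{aligned}
```

The covariate-dependent factors collect into exp{η′X_i^*(t)}, and everything else is absorbed into the unspecified increment dΛ₁^*(t), which is why neither λ₀(t) nor the distribution of b_i(t) needs to be modeled.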

Let τ be a known constant representing the length of the study. Define $d M_{i}^{*} (t; η) = d {\tilde{N}}_{i} (t) - e^{η^{'} X_{i}^{*} (t)} d Λ_{1}^{*} (t)$ and $d M_{i}^{*} (t) = d M_{i}^{*} (t; η_{0})$ , where η₀ denotes the true value of η. It is straightforward to show that $M_{i}^{*} (t)$ is a mean-zero stochastic process. It follows that η and $Λ_{1}^{*} (t)$ can be estimated by η̂ and ${\hat{Λ}}_{1}^{*} (t; \hat{η})$ , respectively, by solving the following two estimating equations

U_{η} (η) = \sum_{i = 1}^{n} \int_{0}^{τ} {X_{i}^{*} (t) - {\bar{X}}^{*} (t; η)} d {\tilde{N}}_{i} (t) = 0,

(6)

and

\sum_{i = 1}^{n} [d {\tilde{N}}_{i} (t) - e^{η^{'} X_{i}^{*} (t)} d Λ_{1}^{*} (t)] = 0.

(7)

where $\bar{X}^{*} (t; η) = S^{(1)} (t; η) / S^{(0)} (t; η)$ and $S^{(k)} (t; η) = n^{- 1} \sum_{i = 1}^{n} e^{η^{'} X_{i}^{*} (t)} X_{i}^{*} {(t)}^{\otimes k}$ for k = 0, 1 and 2. Here and throughout, $a^{\otimes 0} = 1$, $a^{\otimes 1} = a$ and $a^{\otimes 2} = a a^{'}$. Define ${\hat{Λ}}_{1}^{*} (t) = {\hat{Λ}}_{1}^{*} (t; \hat{η})$, $\bar{x}^{*} (t) = \lim_{n \to \infty} \bar{X}^{*} (t; η_{0})$ and $s^{(k)} (t) = \lim_{n \to \infty} S^{(k)} (t; η_{0})$.
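To make (6) and (7) concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a single time-independent scalar covariate, so that X_i^*(t) = (Z_i, −Z_i t), solves (6) by Newton-Raphson, and then evaluates the jumps of Λ₁^* implied by (7). The function names and the toy data are hypothetical and chosen only for illustration.

```python
import math

def x_star(z, t):
    """X_i^*(t) = (Z_i, -Z_i^*(t)), with Z_i^*(t) = z * t for a time-independent scalar z."""
    return (z, -z * t)

def score_and_information(eta, Z, times):
    """Return U_eta(eta) from (6) and minus its derivative (a 2x2 matrix)."""
    U = [0.0, 0.0]
    info = [[0.0, 0.0], [0.0, 0.0]]
    for i, obs_times in enumerate(times):
        for t in obs_times:
            xs = [x_star(z, t) for z in Z]
            w = [math.exp(eta[0] * x[0] + eta[1] * x[1]) for x in xs]
            s0 = sum(w)  # the n^{-1} factors cancel in every ratio below
            xbar = [sum(wk * x[a] for wk, x in zip(w, xs)) / s0 for a in range(2)]
            xi = x_star(Z[i], t)
            for a in range(2):
                U[a] += xi[a] - xbar[a]
                for b in range(2):
                    s2 = sum(wk * x[a] * x[b] for wk, x in zip(w, xs)) / s0
                    info[a][b] += s2 - xbar[a] * xbar[b]
    return U, info

def solve_eta(Z, times, n_iter=50, tol=1e-10):
    """Newton-Raphson iterations eta <- eta + info^{-1} U, started at zero."""
    eta = [0.0, 0.0]
    for _ in range(n_iter):
        U, info = score_and_information(eta, Z, times)
        det = info[0][0] * info[1][1] - info[0][1] * info[1][0]
        step = [(info[1][1] * U[0] - info[0][1] * U[1]) / det,
                (-info[1][0] * U[0] + info[0][0] * U[1]) / det]
        eta = [eta[0] + step[0], eta[1] + step[1]]
        if max(abs(s) for s in step) < tol:
            break
    return eta

def breslow_increments(eta, Z, times):
    """Jumps of Lambda_1^* solving (7): (# observations at t) / sum_i exp(eta' X_i^*(t))."""
    counts = {}
    for obs_times in times:
        for t in obs_times:
            counts[t] = counts.get(t, 0) + 1
    return {t: d / sum(math.exp(eta[0] * z - eta[1] * z * t) for z in Z)
            for t, d in sorted(counts.items())}

# Hypothetical toy data: four subjects, binary covariate, observation times in (0, 1).
Z = [0, 0, 1, 1]
times = [[0.2, 0.6], [0.4, 0.9], [0.1, 0.5, 0.8], [0.3, 0.7]]
eta_hat = solve_eta(Z, times)
d_lambda = breslow_increments(eta_hat, Z, times)
```

Unlike a Cox partial likelihood, the sums in S^{(k)} run over all n subjects with no at-risk indicator, because censoring has already been absorbed into η and Λ₁^* through (5).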

For the estimation of θ, consider

\begin{array}{l} E {Y_{i} (t) d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \\ = E {Y_{i} (t) I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}} \\ = \frac{E {Y_{i} (t) I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}}}{E {I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}}} E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \end{array}

by the definition of d Ñ_i(t) and simple manipulation. By the conditional independence of C_i, N_i(t) and Y_i(t) given 𝒵_it and ℬ_it, the last expression becomes

\begin{array}{l} E {Y_{i} (t) d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \\ = \frac{E [E {Y_{i} (t) I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}, ℬ_{i t}} ∣ Z_{i t}]}{E {I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}}} E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \\ = \frac{g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} E {b_{1 i} (t) I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}}}{E {I (d {\tilde{N}}_{i} (t) = 1) ∣ Z_{i t}}} \times E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \\ = g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} E {b_{1 i} (t) ∣ d {\tilde{N}}_{i} (t) = 1, Z_{i t}} \times E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} \\ = g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} E {d {\tilde{N}}_{i} (t) ∣ Z_{i t}} . \end{array}

under model (1). Combining this with (5), it follows that

E {Y_{i} (t) d {\tilde{N}}_{i} (t) ∣ Z_{i t}} = e^{η^{'} X_{i}^{*} (t)} g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} d Λ_{1}^{*} (t) .

(8)

We define

d M_{i} (t; θ, η) = Y_{i} (t) d {\tilde{N}}_{i} (t) - e^{η^{'} X_{i}^{*} (t)} g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} d Λ_{1}^{*} (t)

and dM_i(t) = dM_i(t; θ₀, η₀), where θ₀ denotes the true value of θ. Then M_i(t) is a mean-zero stochastic process, which naturally suggests the following estimating equations to estimate θ and μ₀(t):

\sum_{i = 1}^{n} [Y_{i} (t) d {\tilde{N}}_{i} (t) - e^{{\hat{η}}^{'} X_{i}^{*} (t)} g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} d {\hat{Λ}}_{1}^{*} (t)] = 0, 0 \leq t \leq τ,

(9)

and

\sum_{i = 1}^{n} \int_{0}^{τ} W (t) Z_{i} (t) \times [Y_{i} (t) d {\tilde{N}}_{i} (t) - e^{{\hat{η}}^{'} X_{i}^{*} (t)} g {μ_{0} (t) e^{θ^{'} Z_{i} (t)}} d {\hat{Λ}}_{1}^{*} (t)] = 0,

(10)

where W(t) is a possibly data-dependent weight function. We denote the estimates of θ and μ₀(t) by θ̂ and μ̂₀(t; θ̂, η̂), respectively. Define μ̂₀(t) = μ̂₀(t; θ̂, η̂).

In general, neither θ̂ nor μ̂₀(t) has a closed form, and an iterative algorithm may be needed to solve (9) and (10). In some special cases, however, the estimators can be written explicitly. For example, when g(x) = log(x), it can be shown that

{\hat{μ}}_{0} (t) = exp {\frac{\sum_{i = 1}^{n} Y_{i} (t) d {\tilde{N}}_{i} (t)}{\sum_{i = 1}^{n} e^{{\hat{η}}^{'} X_{i}^{*} (t)} d {\hat{Λ}}_{1}^{*} (t)} - {\hat{θ}}^{'} \bar{Z} (t; \hat{η})}

and

\hat{θ} = {\sum_{i = 1}^{n} \int_{0}^{τ} W (t) {Z_{i} (t) - \bar{Z} (t; \hat{η})} Z_{i}^{'} (t) e^{{\hat{η}}^{'} X_{i}^{*} (t)} d {\hat{Λ}}_{1}^{*} (t)}^{- 1} \times \sum_{i = 1}^{n} \int_{0}^{τ} W (t) {Z_{i} (t) - \bar{Z} (t; \hat{η})} Y_{i} (t) d {\tilde{N}}_{i} (t) .

where $\bar{Z} (t; \hat{η}) = \frac{\sum_{i = 1}^{n} Z_{i} (t) e^{{\hat{η}}^{'} X_{i}^{*} (t)}}{\sum_{i = 1}^{n} e^{{\hat{η}}^{'} X_{i}^{*} (t)}}$ .
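The closed-form expressions for the log link can be evaluated directly once η̂ is available. The sketch below is illustrative only: it assumes W(t) = 1, a scalar time-independent covariate (so X_i^*(t) = (Z_i, −Z_i t)), and uses the jump of Λ̂₁^* from (7) at each observation time. The function name `log_link_estimates` and the noise-free toy data are hypothetical.

```python
import math

def log_link_estimates(Z, obs, eta):
    """Closed-form theta-hat and mu0-hat for g(x) = log(x) and W(t) = 1.

    Z   : list of scalar, time-independent covariates, one per subject
    obs : obs[i] is a list of (time, response) pairs for subject i
    eta : (gamma, xi), treated here as already estimated
    """
    by_time = {}
    for i, pairs in enumerate(obs):
        for t, y in pairs:
            by_time.setdefault(t, []).append((i, y))

    num, den, pieces = 0.0, 0.0, {}
    for t, events in sorted(by_time.items()):
        w = [math.exp(eta[0] * z - eta[1] * z * t) for z in Z]
        s0 = sum(w)
        zbar = sum(wk * z for wk, z in zip(w, Z)) / s0   # Z-bar(t; eta)
        dlam = len(events) / s0                          # jump of Lambda_1^* from (7)
        num += sum((Z[i] - zbar) * y for i, y in events)
        den += sum((z - zbar) * z * wk for z, wk in zip(Z, w)) * dlam
        # ratio sum_i Y_i dN_i / sum_i exp(eta' X_i^*) dLambda, plus Z-bar, kept for mu0-hat
        pieces[t] = (sum(y for _, y in events) / (s0 * dlam), zbar)

    theta = num / den
    mu0 = {t: math.exp(ratio - theta * zbar) for t, (ratio, zbar) in pieces.items()}
    return theta, mu0

# Hypothetical noise-free check: every subject observed on a common grid,
# log-mean 2t + 0.3 * Z, and eta = (0, 0).
Z = [0, 1, 0, 1, 1]
grid = [0.25, 0.5, 0.75]
obs = [[(t, 2.0 * t + 0.3 * z) for t in grid] for z in Z]
theta_hat, mu0_hat = log_link_estimates(Z, obs, (0.0, 0.0))
```

In this balanced, noise-free configuration the formulas return θ = 0.3 and μ₀(t) = e^{2t} up to floating-point error, which is a convenient sanity check before applying them to irregular data.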

To establish the asymptotic properties of θ̂, we define

\begin{array}{l} {\hat{M}}_{i}^{*} (t) = {\tilde{N}}_{i} (t) - \int_{0}^{t} e^{{\hat{η}}^{'} X_{i}^{*} (s)} d {\hat{Λ}}_{1}^{*} (s), \\ {\hat{M}}_{i} (t) = \int_{0}^{t} Y_{i} (s) d {\tilde{N}}_{i} (s) - \int_{0}^{t} e^{{\hat{η}}^{'} X_{i}^{*} (s)} g {{\hat{μ}}_{0} (s) e^{{\hat{θ}}^{'} Z_{i} (s)}} d {\hat{Λ}}_{1}^{*} (s), \\ {\hat{E}}_{Z} (t; \hat{θ}, \hat{η}) = \frac{\sum_{i = 1}^{n} Z_{i} (t) \dot{g} {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{θ}}^{'} Z_{i} (t) + {\hat{η}}^{'} X_{i}^{*} (t)}}{\sum_{i = 1}^{n} \dot{g} {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{θ}}^{'} Z_{i} (t) + {\hat{η}}^{'} X_{i}^{*} (t)}}, \\ e_{z} (t) = \lim_{n \to \infty} {\hat{E}}_{Z} (t; θ_{0}, η_{0}) \quad \text{and} \quad {\hat{E}}_{Z} (t) = {\hat{E}}_{Z} (t; \hat{θ}, \hat{η}) . \end{array}

The following theorem establishes the consistency and asymptotic normality of θ̂ and η̂.

Theorem 1

Assume that the conditions (C1)–(C5) given in the Appendix hold. Then θ̂ and η̂ are consistent estimators of θ₀ and η₀, respectively. Moreover, n^{1/2}(θ̂ − θ₀) and n^{1/2}(η̂ − η₀) converge weakly to mean-zero normal distributions with covariance matrices that can be consistently estimated by ${\hat{Σ}}_{θ} = {\hat{A}}_{θ}^{- 1} \hat{Σ} {\hat{A}}_{θ}^{- 1}$ and ${\hat{Σ}}_{η} = {\hat{Ω}}_{η}^{- 1} \hat{Ψ} {\hat{Ω}}_{η}^{- 1}$ , respectively, where $\hat{Σ} = n^{- 1} \sum_{i = 1}^{n} {({\hat{ξ}}_{1 i} - {\hat{ξ}}_{2 i} - {\hat{ξ}}_{3 i})}^{\otimes 2}, \hat{Ψ} = n^{- 1} \sum_{i = 1}^{n} {\hat{ζ}}_{i}^{\otimes 2}$ ,

\begin{array}{l} {\hat{ξ}}_{1 i} = \int_{0}^{τ} W (t) (Z_{i} (t) - {\hat{E}}_{Z} (t)) d {\hat{M}}_{i} (t) \\ {\hat{ξ}}_{2 i} = \int_{0}^{τ} \frac{W (t) \hat{D} (t; \hat{θ}, \hat{η})}{S^{(0)} (t; \hat{η})} d {\hat{M}}_{i}^{*} (t), \\ {\hat{ξ}}_{3 i} = \int_{0}^{τ} {\hat{A}}_{η} {\hat{Ω}}_{η}^{- 1} (X_{i}^{*} (t) - {\bar{X}}^{*} (t; \hat{η})) d {\hat{M}}_{i}^{*} (t), \\ {\hat{ζ}}_{i} = \int_{0}^{τ} (X_{i}^{*} (t) - {\bar{X}}^{*} (t; \hat{η})) d {\hat{M}}_{i}^{*} (t), \\ {\hat{A}}_{θ} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W (t) \dot{g} {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} \times {Z_{i} (t) - {\hat{E}}_{Z} (t)}^{\otimes 2} e^{{\hat{θ}}^{'} Z_{i} (t) + {\hat{η}}^{'} X_{i}^{*} (t)} {\hat{μ}}_{0} (t) d {\hat{Λ}}_{1}^{*} (t), \\ \hat{D} (t; \hat{θ}, \hat{η}) = \frac{1}{n} \sum_{i = 1}^{n} {Z_{i} (t) - {\hat{E}}_{Z} (t)} g {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{η}}^{'} X_{i}^{*} (t)}, \\ {\hat{A}}_{η} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} W (t) g {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{η}}^{'} X_{i}^{*} (t)} \times {Z_{i} (t) - {\hat{E}}_{Z} (t)} {X_{i}^{*} (t) - {\bar{X}}^{*} (t; \hat{η})}^{'} d {\hat{Λ}}_{1}^{*} (t), \\ {\hat{Ω}}_{η} = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ} {X_{i}^{*} (t) - {\bar{X}}^{*} (t; \hat{η})}^{\otimes 2} e^{{\hat{η}}^{'} X_{i}^{*} (t)} d {\hat{Λ}}_{1}^{*} (t) . \end{array}

The proof of the theorem above is sketched in Appendix A.

4. MODEL CHECKING

As mentioned above, a main advantage of the proposed methodology is that it is applicable to a class of correlated models through the link function g(·) and random effects b_i(t). On the other hand, one may question how to choose an appropriate form of g(·) for the response process. To answer this question, one may develop some model selection procedure and choose an optimal g(·) among several candidate models. However, such a strategy can be very difficult for longitudinal data because of their incompleteness. To assess the adequacy of the proposed models with a given link function g(·), one can develop an omnibus goodness-of-fit test based on the cumulative sum of the residual process (Lin et al., 1993; Lin et al., 2000; Li et al., 2010; Zhao et al., 2013) as follows

ℱ (t, z) = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{t} I (Z_{i} (s) \leq z) d {\hat{M}}_{i} (s),

where {Z_i(s) ≤ z} means that each component of Z_i(s) is no greater than the corresponding component of z. In general, the distribution of ℱ(t, z) is unknown or very difficult to obtain. Under the proposed models, ℱ(t, z) is expected to fluctuate randomly around 0. In Appendix B, it is shown that the null distribution of ℱ(t, z) can be approximated by that of the mean-zero Gaussian process

\hat{ℱ} (t, z) = n^{- 1 / 2} \sum_{i = 1}^{n} {{\hat{u}}_{1 i} (t, z) - {\hat{u}}_{2 i} (t, z) - {\hat{V}}_{η} (t, z) {\hat{Ω}}_{η}^{- 1} {\hat{ζ}}_{i} - {\hat{V}}_{θ} (t, z) {\hat{A}}_{θ}^{- 1} ({\hat{ξ}}_{1 i} - {\hat{ξ}}_{2 i} - {\hat{ξ}}_{3 i})} e_{i},

(11)

where e₁, e₂, …, e_n are independent standard normal variables independent of the observed data,

\begin{array}{lcl} {\hat{u}}_{1 i} (t, z) & = & \int_{0}^{t} {I (Z_{i} (s) \leq z) - {\hat{E}}_{I} (s, z; \hat{θ}, \hat{η})} d {\hat{M}}_{i} (s), \\ {\hat{u}}_{2 i} (t, z) & = & \int_{0}^{t} \frac{\hat{Γ} (s; \hat{θ}, \hat{η})}{S^{(0)} (s; \hat{η})} d {\hat{M}}_{i}^{*} (s), \\ \hat{Γ} (t; \hat{θ}, \hat{η}) & = & n^{- 1} \sum_{i = 1}^{n} {I (Z_{i} (t) \leq z) - {\hat{E}}_{I} (t, z; \hat{θ}, \hat{η})} \times g {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{η}}^{'} X_{i}^{*} (t)}, \\ {\hat{V}}_{η} (t, z) & = & n^{- 1} \sum_{i = 1}^{n} \int_{0}^{t} g {{\hat{μ}}_{0} (s) e^{{\hat{θ}}^{'} Z_{i} (s)}} e^{{\hat{η}}^{'} X_{i}^{*} (s)} {I (Z_{i} (s) \leq z) - {\hat{E}}_{I} (s, z; \hat{θ}, \hat{η})} \times {X_{i}^{*} (s) - {\bar{X}}^{*} (s; \hat{η})}^{'} d {\hat{Λ}}_{1}^{*} (s), \\ {\hat{V}}_{θ} (t, z) & = & n^{- 1} \sum_{i = 1}^{n} \int_{0}^{t} \dot{g} {{\hat{μ}}_{0} (s) e^{{\hat{θ}}^{'} Z_{i} (s)}} I (Z_{i} (s) \leq z) \times {Z_{i} (s) - {\hat{E}}_{Z} (s)}^{'} e^{{\hat{θ}}^{'} Z_{i} (s) + {\hat{η}}^{'} X_{i}^{*} (s)} {\hat{μ}}_{0} (s) d {\hat{Λ}}_{1}^{*} (s), \\ {\hat{E}}_{I} (t, z; \hat{θ}, \hat{η}) & = & \frac{\sum_{i = 1}^{n} I (Z_{i} (t) \leq z) \dot{g} {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{θ}}^{'} Z_{i} (t) + {\hat{η}}^{'} X_{i}^{*} (t)}}{\sum_{i = 1}^{n} \dot{g} {{\hat{μ}}_{0} (t) e^{{\hat{θ}}^{'} Z_{i} (t)}} e^{{\hat{θ}}^{'} Z_{i} (t) + {\hat{η}}^{'} X_{i}^{*} (t)}}, \\ e_{I} (t, z) & = & \lim_{n \to \infty} {\hat{E}}_{I} (t, z; θ_{0}, η_{0}), \end{array}

and ζ̂_i, ξ̂_{1i}, ξ̂_{2i}, ξ̂_{3i} are the same as defined in the previous section. Therefore, for a given set of data, one can obtain a large number of realizations of ℱ̂(t, z) by repeatedly generating standard normal random samples {e₁, e₂, …, e_n}. A formal goodness-of-fit test can then be performed, with the p-value calculated by comparing $\sup_{0 \leq t \leq τ, z} | ℱ (t, z) |$ with a large number of realizations of $\sup_{0 \leq t \leq τ, z} | \hat{ℱ} (t, z) |$ .
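The resampling step can be sketched in a few lines of Python. This is an illustrative outline rather than the authors' code: it assumes that the observed process ℱ(t, z) and each subject's bracketed term in (11) have already been evaluated on a finite grid of (t, z) points, and the names `multiplier_pvalue`, `observed` and `influence` are hypothetical.

```python
import math
import random

def sup_abs(values):
    """Supremum of |.| over a finite grid of (t, z) points."""
    return max(abs(v) for v in values)

def multiplier_pvalue(observed, influence, n_draws=1000, seed=0):
    """Gaussian-multiplier p-value for the supremum statistic.

    observed  : values of F(t, z) on the grid
    influence : influence[i][k] is subject i's bracketed term in (11) at grid point k
    """
    rng = random.Random(seed)
    n = len(influence)
    n_grid = len(observed)
    stat = sup_abs(observed)
    exceed = 0
    for _ in range(n_draws):
        e = [rng.gauss(0.0, 1.0) for _ in range(n)]  # e_1, ..., e_n, independent of the data
        draw = [sum(influence[i][k] * e[i] for i in range(n)) / math.sqrt(n)
                for k in range(n_grid)]
        if sup_abs(draw) >= stat:
            exceed += 1
    return exceed / n_draws

# Hypothetical inputs: three subjects, two grid points.
influence = [[0.1, -0.2], [0.3, 0.1], [-0.2, 0.05]]
p_small_stat = multiplier_pvalue([0.0, 0.0], influence)
p_large_stat = multiplier_pvalue([1e6, 0.0], influence)
```

The data-dependent terms are held fixed across draws; only the multipliers e_i are regenerated, so each realization costs a single weighted sum per grid point.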

5. A SIMULATION STUDY

In this section, we present results obtained from an extensive simulation study conducted to assess the finite sample behavior of the estimation procedure proposed in the previous sections. In the study, the covariate Z_i was assumed to be a Bernoulli random variable with the probability of success being 0.5. Given Z_i and some unobserved random effects b_i(t) = (b₁_i(t), b₂_i(t), b₃_i(t))′, the hazard function of the censoring time C_i was assumed to have the form

λ_{i} (t ∣ Z_{i}, b_{i} (t)) = λ_{0} - ξ Z_{i} + b_{3 i} (t),

(12)

with the length of study τ being 1. The number of observations N_i(t) was assumed to follow a Poisson process on (0, C_i) with the rate function

E {d N_{i} (t) ∣ Z_{i}, b_{i} (t)} = exp {γ Z_{i}} b_{2 i} (t) d Λ_{0} (t) .

(13)

In practice, the exact time of C_i may not be observable and d Ñ_i(t) is observed instead of dN_i(t); thus we considered E{Ñ_i(t)|Z_i, ℬ_it} for the observation times. From (12) and (13),

E {d {\tilde{N}}_{i} (t) ∣ Z_{i}, ℬ_{i t}} = exp {γ Z_{i} + ξ Z_{i} t} d Λ_{1}^{*} (t),

where $d Λ_{1}^{*} (t) = exp {- λ_{0} t - B_{i} (t)} b_{2 i} (t) d Λ_{0} (t)$ . Given Z_i and b_i(t), Ñ_i(t) was assumed to follow a nonhomogeneous Poisson process and the total number of observation times m_i was generated with mean E{m_i} = E{Ñ_i(τ )|Z_i, ℬ_iτ }. Then the observation times {T_i,₁, …, T_{i,m_i}} were taken as m_i order statistics from the density function

f_{\tilde{N}} (t) = \frac{exp {γ Z_{i} + ξ Z_{i} t} d Λ_{1}^{*} (t)}{\int_{0}^{τ} exp {γ Z_{i} + ξ Z_{i} t} d Λ_{1}^{*} (t)} .

To generate Y_i(T_i,j) at each observation time T_i,j, we considered

E {Y_{i} (T_{i, j}) ∣ Z_{i}, b_{i} (t)} = g {μ_{0} (t) e^{θ Z_{i}}} b_{1 i} (t),

and obtained Y_i(T_{i,j}) by first generating $Y_{i}^{*} (T_{i, j})$ from a Poisson distribution with the mean function of $Y_{i}^{*} (t)$ being equal to $g {μ_{0} (t) e^{θ Z_{i}}} b_{1 i} (t) E {I (t \leq C_{i}) ∣ Z_{i}, ℬ_{i t}}$, and then taking $Y_{i} (T_{i, j}) = \frac{Y_{i}^{*} (T_{i, j})}{E {I (T_{i, j} \leq C_{i}) ∣ Z_{i}, ℬ_{i t}}}$ . The results given below are based on sample sizes of 100 and 200 with 1,000 replications and W(t) = 1.

We took λ₀ = 2, $d Λ_{0} (t) = \frac{5}{t} (e^{0.5} - e^{- 0.5}) (e^{t} - e^{- t}) d t, b_{1 i} = \frac{2 e^{v_{i}}}{e - 1 / e}, b_{2 i} (t) = \frac{2 t e^{u_{i} + v_{i} t}}{(e^{0.5} - e^{- 0.5}) (e^{t} - e^{- t})}$ and b_{3i} = v_i with u_i and v_i being random numbers generated from uniform distributions over (−0.5, 0.5) and (−1, 1), respectively. Table 1 shows the estimation results for θ based on the simulated data with the link function g(x) = log(x), μ₀(t) = e^{2t}, and the true values of (γ, ξ) being equal to (0, 0), (0, 0.2), (0.5, 0), (0.5, 0.2). The table includes the estimated bias given by the average of the proposed estimates θ̂ minus the true value θ₀, the average of the estimated standard errors (SEE), the empirical sampling standard error (SSE) and the 95% empirical coverage probability (CP). The proposed approach appears to perform well. Specifically, the proposed estimate seems to be nearly unbiased and the estimated standard errors agree well with the empirical ones. Also as expected, the CP’s are close to the nominal level and the standard errors become smaller as the sample size increases.
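As a quick plausibility check on these choices, the sketch below numerically integrates the random effects over the stated uniform distributions and confirms that the marginal means of b_{1i} and b_{2i}(t) are close to 1. It assumes u_i and v_i are independent, which the text does not state explicitly, and it checks only marginal means, not the conditional identifiability constraint of Section 2. The helper names are hypothetical.

```python
import math

def mean_over_uniform(f, lo, hi, m=2000):
    """Midpoint-rule approximation of E f(U) for U ~ Uniform(lo, hi)."""
    h = (hi - lo) / m
    return sum(f(lo + (k + 0.5) * h) for k in range(m)) / m

def b1(v):
    """b_1i = 2 e^{v} / (e - 1/e)."""
    return 2.0 * math.exp(v) / (math.e - 1.0 / math.e)

def b2(t, u, v):
    """b_2i(t) = 2 t e^{u + v t} / ((e^{0.5} - e^{-0.5}) (e^{t} - e^{-t}))."""
    return (2.0 * t * math.exp(u + v * t)
            / ((math.exp(0.5) - math.exp(-0.5)) * (math.exp(t) - math.exp(-t))))

def mean_b1():
    """E b_1i with v ~ Uniform(-1, 1)."""
    return mean_over_uniform(b1, -1.0, 1.0)

def mean_b2(t):
    """E b_2i(t), integrating over u ~ Uniform(-0.5, 0.5) inside v ~ Uniform(-1, 1).

    Independence of u and v is an assumption made for this check.
    """
    return mean_over_uniform(
        lambda v: mean_over_uniform(lambda u: b2(t, u, v), -0.5, 0.5, m=200),
        -1.0, 1.0, m=200)

means = {"b1": mean_b1(), "b2(0.5)": mean_b2(0.5)}
```

Since b_{3i} = v_i is uniform on (−1, 1), its mean is 0 by symmetry, in line with the constraint stated below model (3).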

Table 1.

Estimation results for θ with the link function g(x) = log(x).

θ₀	n = 100				n = 200

	0	0.2	0.5		0	0.2	0.5
				(γ₀, ξ₀) = (0, 0)
Bias	−0.004	−0.006	−0.018		−0.003	0.006	−0.008
SEE	0.186	0.199	0.218		0.131	0.140	0.154
SSE	0.193	0.208	0.220		0.129	0.149	0.149
CP	0.944	0.935	0.943		0.958	0.943	0.962
				(γ₀, ξ₀) = (0, 0.2)
Bias	0.033	0.029	0.024		0.021	0.028	0.024
SEE	0.180	0.192	0.211		0.129	0.137	0.152
SSE	0.187	0.206	0.214		0.133	0.137	0.155
CP	0.939	0.929	0.953		0.942	0.947	0.948
				(γ₀, ξ₀) = (0.5, 0)
Bias	0.005	0.002	0.000		−0.005	−0.001	−0.008
SEE	0.169	0.181	0.199		0.121	0.129	0.142
SSE	0.174	0.185	0.205		0.124	0.134	0.145
CP	0.943	0.950	0.946		0.942	0.943	0.949
				(γ₀, ξ₀) = (0.5, 0.2)
Bias	0.017	0.033	0.012		0.020	0.024	0.018
SEE	0.169	0.177	0.196		0.120	0.127	0.139
SSE	0.171	0.183	0.199		0.123	0.128	0.142
CP	0.940	0.937	0.952		0.938	0.946	0.945


In addition to the scenarios presented in Table 1, we investigated scenarios with various link functions and random effects. For example, the results given in Table 2 were obtained with the same setups as those for Table 1 except that g(x) = x and μ₀(t) = 2t. These results all suggest that the proposed procedure performs well in practical situations. To further study how various link functions affect the estimation results, we also calculated the averaged sum of absolute residuals ( $\bar{RES}$ ) for each scenario, defined as

Table 2.

Estimation results for θ with the link function g(x) = x.

θ₀	n = 100				n = 200

	0	0.2	0.5		0	0.2	0.5
				(γ₀, ξ₀) = (0, 0)
Bias	0.008	0.005	−0.006		0.009	0.004	0.009
SEE	0.269	0.261	0.249		0.191	0.186	0.178
SSE	0.287	0.277	0.246		0.201	0.187	0.185
CP	0.932	0.928	0.948		0.939	0.953	0.940
				(γ₀, ξ₀) = (0, 0.2)
Bias	0.035	0.041	0.047		0.042	0.036	0.040
SEE	0.259	0.254	0.245		0.186	0.181	0.174
SSE	0.282	0.265	0.257		0.191	0.184	0.184
CP	0.927	0.934	0.924		0.929	0.936	0.921
				(γ₀, ξ₀) = (0.5, 0)
Bias	−0.007	0.015	0.010		0.001	0.011	0.010
SEE	0.247	0.239	0.233		0.176	0.172	0.166
SSE	0.256	0.259	0.249		0.180	0.179	0.177
CP	0.939	0.930	0.927		0.939	0.933	0.936
				(γ₀, ξ₀) = (0.5, 0.2)
Bias	0.052	0.051	0.045		0.040	0.051	0.042
SEE	0.244	0.239	0.231		0.174	0.171	0.166
SSE	0.252	0.258	0.237		0.178	0.171	0.169
CP	0.932	0.917	0.935		0.929	0.947	0.930


\bar{RES} = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{m_{i}} ∣ d {\hat{M}}_{i} (T_{i, j}) ∣ .

Table 3 presents the results obtained for scenarios represented by Tables 1 and 2 when n = 200, where the baseline mean function is common for Y_i(t) given b_i(t) and Z_i. The results show that when the choice of g(·) is reasonable, such residuals are comparable whether the covariate effects are additive (for g(x) = log(x)) or multiplicative (for g(x) = x) to the response process.

Table 3.

Averaged sum of residuals based on results from Tables 1 and 2 when n = 200.

θ₀	g(x) = log(x)			g(x) = x

	0	0.2	0.5	0	0.2	0.5
(γ₀, ξ₀)= (0,0)	3.134	3.584	4.260	3.116	3.473	4.147
(γ₀, ξ₀)= (0, 0.2)	3.311	3.795	4.525	3.288	3.678	4.435
(γ₀, ξ₀)= (0.5, 0)	4.115	4.872	5.982	4.117	4.658	5.736
(γ₀, ξ₀)= (0.5, 0.2)	4.404	5.177	6.367	4.379	5.056	6.255


One question of practical interest is whether, for longitudinal data with an informative observation process, some existing procedure applies to the situations considered in models (1)–(3). While there are a limited number of procedures for regression analysis based on a class of transformation models for the response process, most of them model possible correlation between Y_i(t) and Ñ_i(t) by incorporating a specific function of Ñ_i(s), s ≤ t, into the marginal mean of Y_i(t) (Sun et al., 2005; Li et al., 2013; Zhao et al., 2013), for example, a function denoted by h(·) in Zhao et al. (2013). One possible drawback is that such procedures depend heavily on the specific form of h(·), which cannot capture correlations of an arbitrary form. To illustrate this numerically, we considered both the proposed estimation procedure and the one given in Zhao et al. (2013). Note that the latter also considered a possibly dependent terminal event time D_i but assumed a noninformative C_i given Z_i. For ease of comparison, we made D_i > C_i in our scenarios and used each subject’s last observation time as C_i when applying the competing procedure. Table 4 presents the estimation results for θ obtained for g(x) = log(x), $b_{1 i} = \frac{1}{2} (exp {0.5 - v_{i}} - exp {- 0.5 - v_{i}} + G_{i}), b_{2 i} (t) = \frac{(t + 1) exp {v_{i} (t + 1)}}{e^{0.5 (t + 1)} - e^{- 0.5 (t + 1)}}$ , b_{3i} = v_i, $d Λ_{0} (t) = \frac{20 t}{t + 1} {e^{0.5 (t + 1)} - e^{- 0.5 (t + 1)}}$ , μ₀(t) = exp{5t}, with v_i and G_i being random numbers from the uniform distribution over (−0.5, 0.5) and the gamma distribution with mean 1 and variance 0.5, respectively. In the table, BIAS represents the estimated bias of the proposed estimate; ${BIAS}_{1}^{*}$ and ${BIAS}_{2}^{*}$ denote the estimated biases of the procedure given by Zhao et al. (2013) using h(ℱ_it) = Ñ(t−) and h(ℱ_it) = 0, respectively. The results suggest that the proposed estimates still appear to be unbiased, but the competing method could give substantially biased estimates for θ when the correlations between Y_i(t), N_i(t) and C_i introduced by b_i(t) are misrepresented by h(·) or totally ignored.

Table 4.

Estimation results of θ based on the proposed procedure and the one given by Zhao et al. (2013), when g(x) = log(x) and ξ₀ = 0.

θ₀	n = 100			n = 200

	BIAS	${BIAS}_{1}^{*}$	${BIAS}_{2}^{*}$	BIAS	${BIAS}_{1}^{*}$	${BIAS}_{2}^{*}$
γ₀ = 0.5	0.008	−0.140	−0.118	0.000	−0.148	−0.135
0.2	0.003	−0.162	−0.141	−0.002	−0.154	−0.143
0.5	−0.009	−0.167	−0.139	−0.011	−0.169	−0.149
γ₀ = 0.8	−0.007	−0.208	−0.196	0.000	−0.212	−0.181
0.2	−0.005	−0.210	−0.202	−0.002	−0.224	−0.198
0.5	−0.009	−0.220	−0.192	−0.011	−0.246	−0.195

6. AN APPLICATION

In this section, we apply the proposed methodology to longitudinal data arising from a skin cancer study conducted by the University of Wisconsin Comprehensive Cancer Center in Madison, Wisconsin (Li et al., 2011; Zhang et al., 2013). One main objective of this double-blind, placebo-controlled randomized Phase III clinical trial was to evaluate the effectiveness of 0.5 g/m²/day PO difluoromethylornithine (DFMO) in reducing the recurrence rates of basal cell carcinoma (BCC) for patients with a history of skin cancers. At each visit, the number of BCC occurrences since the previous visit was recorded. Each patient was scheduled to be assessed every six months; however, as expected, the actual observation times varied from patient to patient. Besides a patient’s treatment group (placebo or DFMO), the study also provided information on the number of prior skin cancer occurrences, which has been shown to be significantly related to the skin cancer recurrence process. For the analysis, we focus on the 290 patients with at least one observation. Among them, 161 patients had one or two skin cancer occurrences prior to the study, and the others had experienced more.

In the following, we consider covariates defined by Z_i = (Z_{i1}, Z_{i2})′, where Z_{i1} = 1 if patient i was given the DFMO treatment and Z_{i1} = 0 otherwise, and Z_{i2} = 1 if the patient had experienced more than two (up to 35) skin cancer occurrences and Z_{i2} = 0 if not, i = 1, …, 290. Y_i(t) represents the total number of BCC occurrences observed up to time t. The longest follow-up time was scaled to be τ = 1, which corresponds to 1,879 days in the original data set.

To apply the proposed estimation procedure, we assumed that the skin cancer recurrence process, the observation process and the hazard of censoring can be described by models (1)–(3), respectively. Following the notation above, the primary interest is to estimate θ₁, the effect of DFMO. Table 5 presents the analysis results obtained by applying the proposed estimation procedure with W(t) = 1. We considered two link functions: g(x) = x and g(x) = log(x), and the results include the point estimates (Est.), the estimated standard errors (SEE), the estimated 90% confidence intervals (CI’s) and p-values for tests with the null hypotheses assuming no covariate effects. At the significance level of α = 0.1, the results suggest that DFMO has significantly reduced the recurrence rates of BCC, and a more severe skin cancer history appears to be positively correlated with the recurrence rate of skin cancer. Such results appear consistent with those concluded by Li et al. (2014) for both choices of link functions. In addition, the results also suggest that both the observation and follow-up times significantly depend on the covariates.
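For readers who want to relate the reported point estimates and standard errors to intervals and p-values, the following sketch computes Wald-type quantities under the asymptotic normality of Theorem 1. That the reported intervals and p-values are of this Wald type is an assumption here; the text does not say how they were computed. The function name `wald_summary` is hypothetical, and the inputs are the rounded values reported for γ₁ and for θ₁ under g(x) = log(x).

```python
from statistics import NormalDist

def wald_summary(est, se, level=0.90):
    """Two-sided Wald confidence interval and p-value for H0: parameter = 0."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(0.5 + level / 2.0)
    lower, upper = est - z_crit * se, est + z_crit * se
    p_value = 2.0 * (1.0 - std_normal.cdf(abs(est / se)))
    return lower, upper, p_value

# Rounded inputs taken from the reported estimates (Est., SEE).
gamma1_ci = wald_summary(0.529, 0.072)       # observation-process effect of DFMO
theta1_log_ci = wald_summary(-0.225, 0.123)  # DFMO effect under g(x) = log(x)
```

Because the inputs are rounded to three decimals, the recomputed limits can differ from the reported ones in the last digit.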

Table 5. Analysis results for the skin cancer data.

Parameter	Est.	SEE	90% CI	p-value
γ₁	0.529	0.072	(0.410, 0.648)	< 0.001
γ₂	0.566	0.072	(0.448, 0.684)	< 0.001
ξ₁	1.203	0.171	(0.922, 1.484)	< 0.001
ξ₂	1.038	0.171	(0.757, 1.319)	< 0.001
Link function g(x) = x
θ₁	−0.448	0.187	(−0.814, −0.082)	0.017
θ₂	1.164	0.225	(0.723, 1.064)	< 0.001
Link function g(x) = log(x)
θ₁	−0.225	0.123	(−0.427, −0.024)	0.066
θ₂	0.972	0.118	(0.777, 1.167)	< 0.001
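To illustrate how the columns of Table 5 relate to one another, the sketch below computes a Wald-type interval and a two-sided p-value from a point estimate and its standard error. It assumes a normal approximation of the kind suggested by the asymptotic normality of the estimators; the text does not state explicitly how the intervals were formed, and the function name `wald_summary` is introduced here only for illustration.

```python
from statistics import NormalDist


def wald_summary(est, see, level=0.90):
    """Wald interval and two-sided p-value under a normal approximation."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(0.5 + level / 2.0)  # about 1.645 for a 90% interval
    lower, upper = est - z_crit * see, est + z_crit * see
    z_stat = est / see
    p_value = 2.0 * (1.0 - std_normal.cdf(abs(z_stat)))
    return lower, upper, p_value


# theta_1 under g(x) = log(x), with Est. and SEE taken from Table 5
lower, upper, p_value = wald_summary(-0.225, 0.123)
print(f"90% CI: ({lower:.3f}, {upper:.3f}), p = {p_value:.3f}")
```

For this row the computed values agree with the tabulated interval and p-value up to rounding of the inputs.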

To assess the adequacy of the models above, we applied the goodness-of-fit test derived in Section 4 and obtained p-values of 0.801 and 0.383 for g(x) = x and g(x) = log(x), respectively. Neither test rejects the model, so both link functions appear reasonable for the data; the larger p-value gives some support for preferring g(x) = x.

7. CONCLUDING REMARKS

This paper considers regression analysis of longitudinal data when both the observation and follow-up times may be informative about the underlying response process of interest. For this problem, we present a class of semiparametric transformation models for the response process that allow possible correlations to be characterized by time-dependent random effects. Compared with existing models that assume either independence or structured dependence based on fixed forms or distributions, the proposed models provide flexibility in modeling both the underlying response process and its correlation with the other processes. For parameter estimation, an easy-to-implement estimating equation approach is developed, and both the finite-sample and asymptotic properties of the resulting estimators are established. In addition, an extensive simulation study indicates that the approach works well in practical situations, and the approach is applied to the skin cancer study that motivated the research.

We note several possible directions for future work. First, for simplicity, we assumed that the dependence between Y_i(t) and N_i(t) in models (1)–(2) can be completely characterized by the random effects b_i(t) and the covariates Z_i(t). In practice, however, one may want to include additional terms inside g(·) when further information is available. For example, if it is known from pivotal trials or experience that a longitudinal response depends on the length of time since subject i was last observed, one may consider modifying model (1) as follows:

$$E\{Y_i(t) \mid Z_i(t), b_i(t)\} = g\{μ_0(t)\, e^{θ' Z_i(t) + α (t - T_{i,j})}\}\, b_{1i}(t),$$

where j = max{k : T_{i,k} ≤ t}, so that T_{i,j} is subject i’s last observation time at or before t. In such cases, the same methodology applies directly for estimating θ and α together, by replacing θ and Z_i(t) with (θ′, α′)′ and (Z_i(t)′, t − T_{i,j})′, respectively, in the estimation procedure. Second, the focus of the article has been on regression analysis of the response process Y_i(t); therefore, b_i(t) was treated as a shared latent vector. However, if one is solely interested in the correlation among Y_i(t), N_i(t) and C_i at certain times, one would usually need a distributional assumption on b_i(t) and could then apply existing procedures for inference (Lipsitz et al., 2002; He et al., 2009; Sun et al., 2007, 2007b; Li et al., 2013). Other than the effects of b_i(t), we have assumed the proportional rates and additive hazards models for N_i(t) and C_i, respectively. In the context of dependent processes, a procedure that is robust to the choice of such models is another interesting direction for future research.
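The augmented covariate in the modified model can be sketched as below. The function names are hypothetical, and measuring the gap from time 0 when no observation has yet occurred is a convention assumed for this sketch; the text does not specify that case.

```python
from bisect import bisect_right


def time_since_last_observation(obs_times, t):
    """Return t - T_{i,j}, where T_{i,j} is the latest observation time <= t.

    obs_times must be sorted in increasing order. If no observation has
    occurred by time t, the gap is measured from time 0 (an assumption of
    this sketch, not something stated in the text).
    """
    j = bisect_right(obs_times, t)  # number of observation times <= t
    last = obs_times[j - 1] if j > 0 else 0.0
    return t - last


def augmented_covariate(z, obs_times, t):
    """Stack Z_i(t) with the gap time, giving (Z_i(t)', t - T_{i,j})'."""
    return tuple(z) + (time_since_last_observation(obs_times, t),)


obs = [0.10, 0.35, 0.60]  # hypothetical scaled observation times for one subject
print(augmented_covariate((1, 0), obs, 0.50))
```

Using `bisect_right` means that an observation occurring exactly at t counts as the last observation, in line with the definition j = max{k : T_{i,k} ≤ t}.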

Acknowledgments

The authors wish to thank the editor, the associate editor and the two reviewers for their constructive comments and suggestions, which led to a great improvement of this manuscript. This work was partially supported by funds provided by the National Science Foundation (grant DMS-1208978 to Sun), the National Institutes of Health (grant 2 R37 AI054165 to Sun) and The University of North Carolina at Charlotte (to Sun and FRG 1-11172 to Li).

APPENDIX A

Proof of Theorem 1

To derive the asymptotic properties of the proposed estimator θ̂, we need the following regularity conditions.

(C1) $\{\tilde{N}_i(\cdot), Y_i(\cdot), C_i, Z_i(\cdot)\}_{i=1}^{n}$ are independent and identically distributed.

(C2) There exists a τ > 0 such that P(C_i ≥ τ) > 0.

(C3) Both Ñ_i(t) and Y_i(t) (0 ≤ t ≤ τ, i = 1, …, n) are bounded.

(C4) W(t) and Z_i(·), i = 1, …, n, have bounded variations and W(t) converges almost surely to a deterministic function w(t) uniformly in t ∈ [0, τ].

(C5) $A_θ = E \int_0^τ W(t)\, \dot{g}\{μ_0(t) e^{θ_0' Z_i(t)}\} \{Z_i(t) - e_z(t)\}^{\otimes 2} e^{θ_0' Z_i(t) + η_0' X_i^*(t)} μ_0(t)\, dΛ_1^*(t)$ and $Ω_η = E[\int_0^τ \{X_i^*(t) - \bar{x}^*(t)\}^{\otimes 2} e^{η_0' X_i^*(t)}\, dΛ_1^*(t)]$ are both positive definite.

Define

$$U_1(θ; \hat{η}) = \sum_{i=1}^{n} \int_0^τ W(t) Z_i(t) \left[ Y_i(t)\, d\tilde{N}_i(t) - e^{\hat{η}' X_i^*(t)} g\{\hat{μ}_0(t) e^{θ' Z_i(t)}\}\, d\hat{Λ}_1^*(t) \right] = 0$$

and note that μ̂₀(t) satisfies

$$\sum_{i=1}^{n} \left[ Y_i(t)\, d\tilde{N}_i(t) - e^{\hat{η}' X_i^*(t)} g\{\hat{μ}_0(t) e^{θ' Z_i(t)}\}\, d\hat{Λ}_1^*(t) \right] = 0, \quad 0 \leq t \leq τ. \tag{14}$$
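As a worked special case that is not spelled out in the text, consider the identity link g(x) = x. Equation (14) is then linear in μ̂₀(t) and, at any time t where the denominator below is nonzero, can be solved explicitly for given θ:

$$\sum_{i=1}^{n} Y_i(t)\, d\tilde{N}_i(t) = \hat{μ}_0(t) \sum_{i=1}^{n} e^{\hat{η}' X_i^*(t) + θ' Z_i(t)}\, d\hat{Λ}_1^*(t) \quad \Longrightarrow \quad \hat{μ}_0(t) = \frac{\sum_{i=1}^{n} Y_i(t)\, d\tilde{N}_i(t)}{\sum_{i=1}^{n} e^{\hat{η}' X_i^*(t) + θ' Z_i(t)}\, d\hat{Λ}_1^*(t)}.$$

For a general link g, (14) would instead be solved numerically for μ̂₀(t) at each such t.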

Let

$$\hat{A}_θ(θ) = -n^{-1}\, \partial U_1(θ; \hat{η}) / \partial θ', \qquad \hat{A}_η(η) = -n^{-1}\, \partial U_1(θ_0; η) / \partial η',$$

$$A_θ = \lim_{n \to \infty} \hat{A}_θ(θ_0) \quad \text{and} \quad A_η = \lim_{n \to \infty} \hat{A}_η(η_0).$$

The consistency of θ̂ and η̂ follows from the facts that U₁(θ₀; η̂) and U_η(η₀) both tend to 0 in probability as n → ∞, and that Â_θ(θ) and −n⁻¹∂U_η(η)/∂η both converge uniformly to the positive definite matrices A_θ and Ω_η over θ and η, respectively, in neighborhoods around the true values θ₀ and η₀. Then the Taylor series expansions of U₁(θ̂; η̂) at (θ₀; η̂) and (θ₀; η₀) yield

$$n^{1/2} (\hat{θ} - θ_0) = A_θ^{-1} n^{-1/2} U_1(θ_0; \hat{η}) + o_p(1) = A_θ^{-1} \{ n^{-1/2} U_1(θ_0; η_0) - A_η n^{1/2} (\hat{η} - η_0) \} + o_p(1).$$

The proof of Theorem 1 is sketched as follows:

(1) First, differentiating U₁(θ; η̂) and (14) with respect to θ, we obtain

$$\hat{A}_θ(θ) = n^{-1} \sum_{i=1}^{n} \int_0^τ W(t)\, g\{\hat{μ}_0(t) e^{\hat{θ}' Z_i(t)}\} \{Z_i(t) - \hat{E}_Z(t)\}^{\otimes 2} e^{θ' Z_i(t) + \hat{η}' X_i^*(t)}\, d\hat{Λ}_1^*(t).$$

(2) Taylor expansions of U₁(θ₀; η₀) and (14) around μ₀(t) yield

$$U_1(θ_0; η_0) = \sum_{i=1}^{n} \int_0^τ w(t) (Z_i(t) - e_z(t))\, dM_i(t) - \sum_{i=1}^{n} \int_0^τ w(t) (Z_i(t) - e_z(t))\, g\{μ_0(t) e^{θ_0' Z_i(t)}\} e^{η_0' X_i^*(t)}\, d\{\hat{Λ}_1^*(t; η_0) - Λ_1^*(t)\} + o_p(n^{1/2}).$$

It follows from (7) that

$$\hat{Λ}_1^*(t; η_0) - Λ_1^*(t) = \frac{1}{n} \sum_{i=1}^{n} \int_0^t \frac{dM_i^*(s)}{s^{(0)}(s)} + o_p(n^{-1/2}).$$

Thus

$$U_1(θ_0; η_0) = \sum_{i=1}^{n} (ξ_{1i} - ξ_{2i}) + o_p(n^{1/2}), \tag{15}$$

where $ξ_{1i} = \int_0^τ w(t) (Z_i(t) - e_z(t))\, dM_i(t)$, $ξ_{2i} = \int_0^τ \frac{w(t)\, d(t)}{s^{(0)}(t)}\, dM_i^*(t)$ and $d(t) = \lim_{n \to \infty} \hat{D}(t; θ_0, η_0)$.

(3) Differentiating U₁(θ₀; η) and (14) with respect to η′ yields

$$\hat{A}_η(η) = n^{-1} \sum_{i=1}^{n} \int_0^τ W(t)\, g\{\hat{μ}_0(t) e^{θ_0' Z_i(t)}\} e^{η' X_i^*(t)} \{Z_i(t) - \hat{E}_Z(t)\} \{X_i^*(t) - \bar{X}^*(t; η)\}'\, d\hat{Λ}_1^*(t; η). \tag{A.1}$$

(4) By equation (6) and arguments similar to those of Lin et al. (2000), one can show that

$$n^{1/2} \{\hat{η} - η_0\} = Ω_η^{-1} n^{-1/2} \sum_{i=1}^{n} ζ_i + o_p(1), \tag{A.2}$$

where $Ω_η = E[\int_0^τ \{X_i^*(t) - \bar{x}^*(t)\}^{\otimes 2} e^{η_0' X_i^*(t)}\, dΛ_1^*(t)]$ and $ζ_i = \int_0^τ (X_i^*(t) - \bar{x}^*(t))\, dM_i^*(t)$.

Combining the results in steps (1)–(4), we have

$$U_1(θ_0; \hat{η}) = \sum_{i=1}^{n} (ξ_{1i} - ξ_{2i} - ξ_{3i}) + o_p(n^{1/2}),$$

and hence

$$\sqrt{n} (\hat{θ} - θ_0) = A_θ^{-1} n^{-1/2} \sum_{i=1}^{n} (ξ_{1i} - ξ_{2i} - ξ_{3i}) + o_p(1), \tag{A.3}$$

where $ξ_{3i} = \int_0^τ A_η Ω_η^{-1} \{X_i^*(t) - \bar{x}^*(t)\}\, dM_i^*(t)$. The conclusions of Theorem 1 then follow from the multivariate central limit theorem.
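As a worked final step, assuming the summands in (A.3) have finite second moments, the central limit theorem applied to (A.3) gives a limiting covariance of sandwich form. The symbol Σ_θ is introduced here for illustration only and is not notation from the paper:

$$\sqrt{n} (\hat{θ} - θ_0) \xrightarrow{d} N\left(0,\ A_θ^{-1} Σ_θ (A_θ^{-1})'\right), \qquad Σ_θ = E\{(ξ_{1i} - ξ_{2i} - ξ_{3i})^{\otimes 2}\}.$$

A plug-in estimate would replace A_θ by Â_θ(θ̂) and Σ_θ by the sample average of the estimated summands; whether this coincides exactly with the variance estimator stated with Theorem 1 should be checked against Section 3.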

APPENDIX B

Proof of the null distribution of ℱ(t, z)

Define $V(\hat{θ}, \hat{η}) = \sum_{i=1}^{n} \int_0^t I(Z_i(s) \leq z)\, d\hat{M}_i(s)$. By a Taylor expansion,

$$ℱ(t, z; \hat{θ}, \hat{η}) = n^{-1/2} V(θ_0, η_0) + \frac{\partial V(θ_0, η_0)}{n\, \partial η'} \sqrt{n} (\hat{η} - η_0) + \frac{\partial V(θ_0, \hat{η})}{n\, \partial θ'} \sqrt{n} (\hat{θ} - θ_0) + o_p(1).$$

By following arguments and manipulations similar to those in Appendix A, it can be shown that

$$V(θ_0, η_0) = \sum_{i=1}^{n} \{u_{1i}(t, z) - u_{2i}(t, z)\} + o_p(n^{1/2}),$$

where $u_{1i}(t, z) = \int_0^t \{I(Z_i(s) \leq z) - e_I(s, z)\}\, dM_i(s)$, $u_{2i}(t, z) = \int_0^t \frac{Γ(s)}{s^{(0)}(s)}\, dM_i^*(s)$ and $Γ(t) = \lim_{n \to \infty} \hat{Γ}(t; θ_0, η_0)$.

Also, $\frac{\partial V(θ_0, η_0)}{n\, \partial η'}$ and $\frac{\partial V(θ_0, \hat{η})}{n\, \partial θ'}$ can be estimated by −V̂_η(t, z) and −V̂_θ(t, z), respectively. In addition, from (A.2) and (A.3) we have

$$n^{1/2} \{\hat{η} - η_0\} = Ω_η^{-1} n^{-1/2} \sum_{i=1}^{n} ζ_i + o_p(1)$$

and

$$\sqrt{n} (\hat{θ} - θ_0) = A_θ^{-1} n^{-1/2} \sum_{i=1}^{n} (ξ_{1i} - ξ_{2i} - ξ_{3i}) + o_p(1).$$

Therefore, for fixed t, ℱ(t, z; θ̂, η̂) can be expressed as a sum of i.i.d. mean-zero terms. By the multivariate central limit theorem, ℱ(t, z) converges in finite-dimensional distribution to a mean-zero Gaussian distribution. Since ℱ(t, z) is tight by empirical process theory, ℱ(t, z) converges weakly to a mean-zero Gaussian process that can be approximated by ℱ̂(t, z) given by equation (11).

Contributor Information

Yang Li, Department of Mathematics and Statistics, UNC Charlotte, Charlotte, NC 28223.

Yanqing Sun, Department of Mathematics and Statistics, UNC Charlotte, Charlotte, NC 28223.

References

1. Diggle PJ, Liang KY, Zeger SL. The Analysis of Longitudinal Data. Oxford University Press; Oxford: 1994.
2. Cheng SC, Wei LJ. Inferences for a semiparametric model with panel data. Biometrika. 2000;87:89–97.
3. He X, Tong X, Sun J. Semiparametric analysis of panel count data with correlated observation and follow-up times. Lifetime Data Analysis. 2009;15:177–196. doi: 10.1007/s10985-008-9105-1.
4. Hu XJ, Sun J, Wei LJ. Regression parameter estimation from panel counts. Scandinavian Journal of Statistics. 2003;30:25–43.
5. Huang CY, Wang MC, Zhang Y. Analysing panel count data with informative observation times. Biometrika. 2006;93:763–775. doi: 10.1093/biomet/93.4.763.
6. Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. Wiley; New York: 2002.
7. Li N, Sun L, Sun J. Semiparametric transformation models for panel count data with dependent observation process. Statistics in Biosciences. 2010;2(2):191–210. doi: 10.1002/cjs.10118.
8. Li N, Zhao H, Sun J. Semiparametric transformation models for panel count data with correlated observation and follow-up times. Statistics in Medicine. 2013;32(17):3039–3054. doi: 10.1002/sim.5724.
9. Li Y, Zhao H, Sun J, Kim KM. Nonparametric tests for panel count data with unequal observation processes. Computational Statistics & Data Analysis. 2014;73:103–111.
10. Lin DY, Oakes D, Ying Z. Additive hazards regression with current status data. Biometrika. 1998;85(2):289–298.
11. Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society, Series B. 2000;62:711–730.
12. Lin DY, Ying Z. Semiparametric and nonparametric regression analysis of longitudinal data. Journal of the American Statistical Association. 2001;96:103–126.
13. Lipsitz SR, Fitzmaurice GM, Ibrahim JG, Gelber R, Lipshultz S. Parameter estimation in longitudinal studies with outcome-dependent follow-up. Biometrics. 2002;58:621–630. doi: 10.1111/j.0006-341x.2002.00621.x.
14. Sun J, Kalbfleisch JD. Estimation of the mean function of point processes based on panel count data. Statistica Sinica. 1995;5:279–289.
15. Sun J, Park D-H, Sun L, Zhao X. Semiparametric regression analysis of longitudinal data with informative observation times. Journal of the American Statistical Association. 2005;100:882–889.
16. Sun J, Sun L, Liu D. Regression analysis of longitudinal data in the presence of informative observation and censoring times. Journal of the American Statistical Association. 2007;102:1397–1406.
17. Sun J, Tong X, He X. Regression analysis of panel count data with dependent observation times. Biometrics. 2007b;63:1053–1059. doi: 10.1111/j.1541-0420.2007.00808.x.
18. Sun J, Wei LJ. Regression analysis of panel count data with covariate-dependent observation and censoring times. Journal of the Royal Statistical Society, Series B. 2000;62:293–302.
19. Sun L, Song X, Zhou J, Liu L. Joint analysis of longitudinal data with informative observation times and a dependent terminal event. Journal of the American Statistical Association. 2013;107(498):688–700.
20. Sun Y. Estimation of semiparametric regression model with longitudinal data. Lifetime Data Analysis. 2010;16(2):271–298. doi: 10.1007/s10985-009-9136-2.
21. Wellner JA, Zhang Y. Two estimators of the mean of a counting process with panel count data. Annals of Statistics. 2000;28:779–814.
22. Wellner JA, Zhang Y. Two likelihood-based semiparametric estimation methods for panel count data with covariates. Annals of Statistics. 2007;35:2106–2142.
23. Welsh AH, Lin X, Carroll RJ. Marginal longitudinal nonparametric regression: locality and efficiency of spline and kernel methods. Journal of the American Statistical Association. 2002;97:482–493.
24. Zhang Y. A semiparametric pseudolikelihood estimation method for panel count data. Biometrika. 2002;89:39–48.
25. Zhang Z, Sun J, Sun L. Statistical analysis of current status data with informative observation times. Statistics in Medicine. 2005;24:1399–1407. doi: 10.1002/sim.2001.
26. Zhao H, Li Y, Sun J. Analyzing panel count data with dependent observation process and a terminal event. The Canadian Journal of Statistics. 2013;41(1):174–191.
27. Zhao X, Balakrishnan N, Sun J. Nonparametric inference based on panel count data. Test. 2011;20:1–42.
28. Zhao X, Zhou J, Sun L. Semiparametric transformation models with time-varying coefficients for recurrent and terminal events. Biometrics. 2011;67:404–414. doi: 10.1111/j.1541-0420.2010.01458.x.
29. Zhao X, Tong X. Semiparametric regression analysis of panel count data with informative observation times. Computational Statistics and Data Analysis. 2011;55(1):291–300.
30. Zhao X, Tong X, Sun J. Robust estimation for panel count data with informative observation times. Computational Statistics and Data Analysis. 2013b;57:33–40.

