Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Apr 1.
Published in final edited form as: Lifetime Data Anal. 2018 May 23;25(2):322–340. doi: 10.1007/s10985-018-9433-8

Semiparametric Temporal Process Regression of Survival-Out-of-Hospital

Tianyu Zhan 1
PMCID: PMC6251773  NIHMSID: NIHMS970376  PMID: 29796979

Abstract

The recurrent/terminal event data structure has undergone considerable methodological development in the last 10–15 years. An example of the data structure that has arisen with increasing frequency involves the recurrent event being hospitalization and the terminal event being death. We consider the response Survival-Out-of-Hospital, defined as a temporal process (indicator function) taking the value 1 when the subject is currently alive and not hospitalized, and 0 otherwise. Survival-Out-of-Hospital is a useful alternative strategy for the analysis of hospitalization/survival in the chronic disease setting, with the response variate representing a refinement to survival time through the incorporation of an objective quality-of-life component. The semiparametric model we consider assumes multiplicative covariate effects and leaves unspecified the baseline probability of being alive-and-out-of-hospital. Using zero-mean estimating equations, the proposed regression parameter estimator can be computed without estimating the unspecified baseline probability process, although baseline probabilities can subsequently be estimated for any time point within the support of the censoring distribution. We demonstrate that the regression parameter estimator is asymptotically normal, and that the baseline probability function estimator converges to a Gaussian process. Simulation studies are performed to show that our estimating procedures have satisfactory finite sample performances. The proposed methods are applied to the Dialysis Outcomes and Practice Patterns Study (DOPPS), an international end-stage renal disease study.

Keywords: Empirical process, Multiple imputation, Recurrent event, Relative risk, Semiparametric model, Terminal event

1 Introduction

Often in clinical or epidemiological studies, both a recurrent event process and terminal event are of interest. This is particularly true given the recent proliferation of administrative databases available for secondary analysis. Correspondingly, a considerable number of methods have been developed for the recurrent/terminal event data structure. One option is to model the marginal mean/rate (Cook and Lawless, 1997; Ghosh and Lin, 2002; Schaubel et al, 2006; Cook et al, 2009), essentially averaging over the survival experience. Another class of approaches involves jointly modeling survival and the conditional recurrent event rate given survival (Huang and Wang, 2004; Liu et al, 2004; Ye et al, 2007; Zeng and Cai, 2010; Kalbeisch et al, 2013).

In this report, we study a useful alternative framework for the analysis of recurrent/terminal event data. In particular, a frequently arising example of this data structure involves hospitalization representing the recurrent event, with death serving as the terminal event. Although hospital admission can be regarded as a point process, the length of stay for a hospitalization may be several days and, therefore, should not be ignored in the analysis. This concept is recognized in the works of Hu et al (2011) and Zhu et al (2014), for example. Specifically, we consider the response Survival-Out-of-Hospital, defined as a temporal process (indicator function) taking the value 1 when the subject is currently alive and not hospitalized, and 0 otherwise. Survival-out-of-hospital may be viewed as a refinement of survival time in the study of chronic illness such as end-stage renal disease (ESRD), with the refinement being the incorporation of each patient’s hospital admission and length of stay information. An appealing characteristic of survival-out-of-hospital is that it incorporates morbidity data in an objective and easily understood manner. Data of this structure are increasingly available, given the relatively recent proliferation of publicly available health administrative data sets (Holland and Lam, 2000; Sands et al, 2006; Carson et al, 2009).

Since Survival-Out-of-Hospital is indexed by a continuous timeline, temporal process regression would appear to be a natural conceptualization. Fine et al (2004) developed functional generalized linear models, for which covariates effects are completely unspecified and are estimated nonparametrically over time. Such an approach has also been generalized to multivariate survival settings to model both mean and association structures (Yan and Fine, 2005). To increase precision, the partly functional temporal process model has been developed, with covariate effects being nonparametric for some covariate elements and parametric for others (Yan and Huang, 2009; Estes et al, 2015). These functional generalized linear models generally focus on time-varying covariate effects. In addition, martingale-based estimating equations have been proposed for directly modeling the survival function by solving a sequence of monotone equations (Peng and Huang, 2007). Approaches listed in this paragraph are part of the inspiration for the methods we propose in this report. However, as will become more clear later in our report, none of these approaches are applicable to our setting given the specifics of our analytic objectives, along with the assumed data structure and model of interest.

Various other existing methods are pertinent to the data structure of our interest, but not applicable to our research question. For instance, methods proposed in Andersen et al (2003) and Scheike and Zhang (2007) involved directly modeling a state transition probability or a state occupation probability in a multi-state model. In these approaches, each state is assumed to be visited not more than once. As such, none of these approaches are directly applicable to our particular research question and data structure, since patients can move in and out of hospital many times prior to death. In addition, pseudo-observation approaches (Andersen et al, 2003; Grand and Putter, 2016), despite their utility, generally only allow for censoring that does not depend on the covariate vector, an assumption that may be violated in observational studies. As described in Section 2, the methods we propose allow censoring to depend on the covariate vector.

In this report, we propose a semiparametric temporal process regression method, where covariates have multiplicative effects on a completely unspecified baseline probability function. This model can be thought of a process version of the generalized linear model with the process indexed by time. In terms of estimation, the regression parameter is estimated by the solution to an estimating equation which is free of the baseline probability function. We propose a nonparametric estimator for the baseline probability process, a closed form, which can be computed after estimating the regression parameter. The estimating functions do not require inverse weighting in the setting where censoring times are known (e.g., if all censoring was administrative, occurring at the same calendar date). To accommodate the more commonly occurring scenario where censoring is random, we employ multiple imputation (Little and Rubin, 2002) to recover censoring times unobserved due to death (Schaubel and Zhang, 2010).

Our method has several distinguishing features. First, covariate effects are on the relative risk (as opposed to odds ratio) scale. The baseline probability process is specified as a nonparametric function of follow-up time, to increase flexibility and robustness. Through a development that parallels the derivation of the Cox partial likelihood score function (Cox, 1972), we derive an estimating function for the regression parameter that is free of the baseline probability function. This reduces the complexity of computation and permits the use of standard statistical software. Moreover, the baseline probability function can be subsequently estimated at any specific time point before the maximum observation time. We also propose estimating the integral of the baseline probability, which can then be used to predict expected survival time out-of-hospital over a finite time interval.

In this article, we are interested in the joint event of being out-of-hospital and being alive. This response variable takes quality of life information into consideration, while leaving the dependence structure between the temporal indicator process and terminal event time completely unspecified. Our method is different from the current practice in clinical trials, which is to analyze the time to the first recurrent event or terminal event (Lewis, 1999; Pfeffer et al, 2003). The proposed approaches are also distinct from a weighted composite endpoint of all recurrent and terminal events (Neaton et al, 2005; Mao and Lin, 2016).

The remainder of the article is organized as follows. In Section 2, we introduce notation, formulate the model assumptions, and propose estimating procedures for the regression parameter and baseline probability function. In Section 3, we show that the regression parameter estimator converges to a Normal distribution, and estimator of the baseline probability function converges to a Gaussian process. Simulation studies are performed to evaluate our method under various scenarios in Section 4. We then illustrate our method through an analysis of survival-out-of-hospital using data from the Dialysis Outcomes and Practice Patterns Study (DOPPS), a long-running international prospective study of ESRD patients. Finally, concluding remarks are provided at Section 6.

2 Model and Methods

Suppose there are a total of n independent subjects. Let Di denote the death (terminal event) time of subject i (i = 1, 2, …, n). We let Ci be the censoring time, and let Zi(t) be a p-dimensional covariate vector which may contain time-varying elements (assumed to be external; Kalbeisch and Prentice (2002), p.g. 196). Here, we consider follow-up time t ∈ [0, τ], where τ is a pre-specified constant satisfying Pr(Ciτ) > 0 for i = 1, 2, …, n. In practice, τ could be chosen as the maximum of observation time, Xi = CiDi, where ab = min(a, b).

Let Hi(t) = 1 if subject i is in the hospital at time t, and 0 if out of hospital. The probability of interest is the probability that a subject i is alive and out-of-hospital at time t, i.e.,

πi(t)=P{Hi(t)=0,Di>t|Zi(t)}. (1)

We assume that the covariates have multiplicative effects on an unspecified baseline probability function π0(t), such that

πi(t)=π0(t)exp{β0TZi(t)}, (2)

where β0 is a p-dimensional vector. This is similar to the Cox proportional hazard assumption (Cox, 1972), but πi(t) is interpreted as the probability of being out of hospital and alive for subject i at time t. Note the distinction between (2) and an intensity (Andersen and Gill, 1982) or marginal rate function (Lin et al, 2000).

With respect to the motivation for (1), modeling Survival-Out-of-Hospital is essentially focusing on a composite event (surviving and not being hospitalized) rather than its components. For instance, in studying patients with a chronic disease, it is often of interest to determine which patient characteristics are associated with ‘good outcomes’, in general sense, such as surviving and thriving (e.g., not being in the hospital). This outcome is relevant with respect to addressing the question, “Who is healthy enough to survive, without the burden of being hospitalized?” Direct analysis of the composite event is an alternative to a joint analysis of the components, hospitalization and death (Liu et al, 2004; Ye et al, 2007). Various existing methods in the survival analysis literature have targeted an outcome that can be cast as an aggregate. For example, Ghosh and Lin (2000) and Ghosh and Lin (2002) proposed modeling the marginal mean number of recurrent events, implicitly averaging over the survival experience. In the competing risk setting, Gray (1988) and Fine and Gray (1999) analyzed the cumulative incidence (subdistribution) hazard, as opposed to the cause-specific hazards.

Regarding the specific formulation of model (2), the motivation for using semiparametric loglinear regression is two-fold. First, due to the high dimensionality of π0(t) (since t is continuous), we will show that one can profile out π0(t) when estimating β0. Secondly, we will show that the estimating equation resembles the Cox regression score function, such that standard software can be leveraged for parameter estimation.

For brevity of notation, we define Ai(t) = I{Di > t} as the survival indicator, and Ai0(t)=I{Hi(t)=0,Di>t} as the survival-out-of-hospital indicator, of chief interest in this report. It is important to note that Zi(t) and Ai0(t) are still available on subject i (i.e., before the censoring time) even after the terminal event has occurred. This is due to the fact that Ai0(t) is known to equal 0 for t > Di, and since all time-dependent elements of Zi(t) are assumed to be external (such that death does not preclude subsequent observation). In summary, observed data for subject i are given by {Hi(t), Xi, I(CiDi), Zi(s)} for t ∈ [0, Xi] and s ∈ [0, τ].

2.1 Known Censoring

To begin, suppose that censoring time Ci is always known. This would be the case in a closely monitored prospective study from which patients could not be randomly lost to follow-up. Another example would be a retrospective study derived from a reportable-disease registry, which would typically feature staggered entry and the observation period ending on a fixed calendar date. In such cases, Ci would be known even for subjects with Ci > Di. This set-up does not match most observational studies, but it is a useful starting point. We assume that censoring time is independent of our target event, Ai0(t), conditional on Zi(t); more explicitly, we can express the assumption as follows,

E{Ai0(t)|Zi(t),Cit}=E{Ai0(t)|Zi(t)}. (3)

Consider the following two estimating functions,

i=1n0τZi(t)[Ai0(t)πi(t)]I(Cit)dt, (4)
i=1n[Ai0(t)πi(t)]I(Cit). (5)

These estimating functions have expectation zero under model (2) and assuming conditionally independent censoring (3). Note that equation (5) could be evaluated at any specific time point t ∈ [0, τ], and does not have to be an observed event time. Solving equation (5) for π0(t) treating β0 as known, then substituting the resulting π̂0(t) back into equation (4) yields the following zero-mean estimating function for β0,

U(β)=i=1n0τ{Zi(t)Z¯(t;β)}Ai0(t)I(Cit)dt, (6)

where (t; β) = S(1)(t; β)/S(0)(t; β), S(k)(t;β)=n1i=1nZi(t)kI(Cit)exp{βTZi(t)} for k = 0, 1, 2, where a⊗0 = 1, a⊗1 = a, and a⊗2 = aaT. We also define the following notations. Let s(k)(t; β) = E[Z1(t)kI(C1t)exp{βTZ1(t)}] and (t; β) = s(1)(t; β)/s(0)(t; β) be the limiting value of S(k)(t; β) and (t; β), respectively.

Equation (6) is reminiscent of the Cox regression score function, a property which can be exploited computationally (i.e., to ease programming effort). For instance, if the time scale is days (like our real-data application in Section 5) or some other discrete measure, then standard proportional hazards software (e.g., coxph(·) in R, proc phreg in SAS) can be used after augmenting the data set. Specifically, the augmented data would contain one record for each time unit t a subject is uncensored, with the event indicator for time unit t would be Ai0(t). Such data would left truncated such that, within subject, the (j + 1)th record begins (i.e., its left subinterval boundary) where the jth records ended (i.e., its right interval boundary). Alternatively, if the time scale were truly continuous, then the augmented data would contain, for each subject, one record for every time at which Zi(t), (t; β), or Ai0(t) changes. Additionally, for the jth record of subject i spanning, say, (tj−1, tj], a weight of (tjtj−1) would be used, along with offset log(tjtj−1). Note that the Breslow approximation (for tie-handling) would be used, as is the default in SAS’s phreg. Naturally, an alternative is to write explicit code to solve (6) using Newton-Raphson.

Denoting the solution to (6) by β̂, one can estimate π0(t) through the closed form,

π^0(t)=i=1nAi0(t)I(Cit)i=1nI(Cit)exp[β^TZi(t)]. (7)

Note that π̂0(t) is closely related to the Breslow estimator of baseline cumulative hazard function (Breslow, 1972). With respect to leveraging standard software, π̂0(t) can be computed directly through successive differencing in the previously described scenarios wherein the time scale is discrete (with one record per uncensored time unit per subject). If time is continuous, then π̂0(t) would need to be computed explicitly, with the value changing each time any of Ai0(t), or I(Cit) or Zi(t) changes its value for any subject i in the data set.

Due to the use of the log link, π̂0(t) > 1 is possible. This could occur, for example, in settings where both hospitalization and mortality rates are quite low. This does not induce bias for β̂ since, as described earlier, π0(t) is profiled out from estimating equation (6). When subsequently estimating π0(t) at specific time points from (7), one could cap π̂0(t) by 1. Note that an analogous issue is encountered in the semiparametric additive hazards model (Lin and Ying, 1994), for which estimated increments in the cumulative baseline hazard are not bounded below by 0.

We define the integral of the baseline survival-out-of-hospital probability as

Π0(L)=0Lπ0(t)dt, (8)

which could be interpreted as the expected length of being alive and out-of-hospital up to time L for a subject with the baseline (i.e., 0) value of each covariate. We could estimate Π0(L) by Π^0(L)=0Lπ^0(t)dt. Note that π̂0(t) in (7) would jump up when a subject is discharged from the hospital, and jump down when a subject either dies or is admitted to the hospital. This property would facilitate computation in real-data applications, since Π̂0(L) would essentially be a sum of rectangles with height π̂0(t) and (time) unit length.

2.2 Random Censoring

Now, consider the more typical scenario in which the censoring time Ci is not fixed at t = 0, with its randomness implying that Ci is not known in cases where Di is observed for subject i. In this set-up, one cannot carry out estimation through (6) and (7), defined in the preceding subsection, due to Ci being missing for subjects observed to die. A simple solution is to set censoring time as the maximum follow-up time across all subjects, which may lead to substantial bias in estimating β0 and π0(t). Similarly, setting the censoring time to Di for subjects with Ci > Di would generally lead to inaccurate inference.

One could use weighting techniques to recover missing censoring time, essentially weights subjects with Ci > Di by a conditional survival probability (Ghosh and Lin, 2002; Mao and Lin, 2016). However, a weighted version of estimating equation (6) is tedious to carry out, since the time line is continuous. In this report we consider an alternative imputation approach which is easy to implement using standard software.

Our solution is to impute Ci when Di < Ci and, for this purpose, we assume the following proportional hazard model for the censoring time,

λiC(t)=λ0C(t)exp{θ0TZi(t)}, (9)

where λ0C(t) is an unspecified baseline hazard function for Ci. In this manuscript, we only consider censoring which is independent of the process of interest, Ai0(t), per the assumption implied by (3). In light of the model for Ci, we now define the corresponding counting process NiC(t)=I(CitDi), and its increment dNiC(t)=NiC(t+dt)NiC(t). Similarly, the counting process for death time is defined by NiD(t)=I(DitCi). Let Yi(t) = I(Xit) be the at risk process, where (as indicated earlier) Xi = CiDi. Standard partial likelihood (Cox, 1975) techniques can be applied to the observed censoring time data {Xi,I(CiDi),Zi(t);t[0,τ]}i=1n to compute θ̂, which is known to be a strongly consistent estimator of θ0 (Andersen and Gill, 1982). The baseline cumulative hazard function for Λ0C(t) is estimated through the Breslow method (Breslow, 1972).

We will create M imputed datasets, where normally M = 10 would suffice. The proposed methods are also valid for M = 1, although this would be less efficient. Consider the mth imputed dataset, for a subject with CiDi, we set the imputed censoring time to the known censoring time. For a subject with Ci > Di, we impute C^im from the estimated conditional survival function,

G^i(t;θ^)=I(tDi)exp[Λ^iC(t;θ^)+Λ^iC(Di;θ^)]. (10)

More explicitly, we could express the imputed censoring time Cim as

Cim=[1NiD(Xi)]Ci+NiD(Xi)C^im.

We provide further commentary on the proposed imputation procedure in Section 6.

Consider the uncensored indicator, I(Cimt;θ), where we include θ in the parenthesis to emphasize that Cim depends on the parameter θ. In finite samples, we impute C^im from Ĝi(t; θ̂) for subjects with Ci > Di to obtain I(Cimt;θ^). Define Gi(t;θ0)=I(tDi)exp[ΛiC(t;θ0)+ΛiC(Di;θ0)]. Therefore, I(Cimt;θ0) refers to the scenario where we impute C^im from the true underlying Gi(t; θ0). When (9) is correctly specified, I(Cit) and I(Cimt;θ0) follow the same distribution. Let S(k)m(t;β,θ)=n1i=1nZi(t)kI(Cimt;θ)exp{βTZi(t)} k = 0, 1, 2, and set m(t; β, θ) = S(1)〈m (t; β, θ)/S(0)〈m (t; β, θ). For the mth imputed data set, the estimation of β0 and π0(t) is as defined in Section 2.1, but with Ci replaced by Cim. More explicitly, β̂m is computed through the following estimating function,

Um(β,θ^)=i=1n0τ{Zi(t)Z¯m(t;β,θ^)}Ai0(t)I(Cimt;θ^)dt, (11)

with π^0m(t) then given by

π^0m(t)=i=1nAi0(t)I(Cimt;θ^)i=1n{I(Cimt;θ^)exp[ZiT(t)β^m]}. (12)

Having computed β̂m and π^0m(t) (m = 1, 2, …, M), β0 and π0(t) will be estimated by the following pooled estimators:

β^M=M1m=1Mβ^m, (13)
π^0M(t)=i=1nm=1MAi0(t)I(Cimt;θ^)i=1nm=1MI(Cimt;θ^)exp[ZiT(t)β^M]. (14)

Asymptotic properties for π^0M(t) given in (14) are developed by expressing the numerator and denominator as empirical processes indexed by time t. Similar to π̂0(t) defined in (7), π^0M(t) would be larger then 1 at some time points; as such, we would cap this estimated probability by 1 after obtaining β̂M. An imputation version of Π^0M(L), denoted by π^0M(t), is obtained by integrating (14).

3 Asymptotic Properties

We assume the following set of regularity conditions:

  1. {Hi(t), Xi, I(CiDi), Zi(s)} for t ∈ [0, Xi] and s ∈ [0, τ], i = 1, 2, …, n are independent and identically distributed.

  2. Pr(Ciτ) > 0 for i = 1, 2, …, n, where τ is a pre-specified constant.

  3. |Zij(0)|+0τ|dZij(t)|<cZ< almost surely for i = 1, 2, …, n, j = 1, 2, …, p, i.e., Zi(t) has bounded total variations.

  4. Ω(β0)=E[0τ{s(2)(t;β0)/s(0)(t;β0)z¯(t;β0)2}A10(t)I(C1t)dt] is positive definite.

  5. For β ∈ ℬδ, where ℬδ is a small neighborhood around β0, s(0) (t; β), and s(1)(t; β) are bounded away from zero.

  6. For β ∈ ℬδ, k = 0, 1, 2, s(k) (t; β) are continuous uniformly on t ∈ [0, τ], and are bounded on [0, τ] × ℬδ.

We summarize the essential asymptotic properties of β̂, π̂0(t) and imputed versions β̂M, π^0M(t) in the following theorems. Proofs are sketched in the Supplemental Materials.

3.1 Known Censoring

We begin by describing the asymptotic properties of estimators applicable to the known-censoring set-up from Section 2.1.

Theorem 1

Under assumptions (2) and (3) and the afore-listed regularity conditions, β̂ is a consistent estimator of β0, and n1/2(β̂β0) converges in distribution to a mean-zero Normal random variable with a variance-covariance matrix

(β0)=Ω(β0)1E[u1(β0)u1(β0)T]Ω(β0)1, (15)

where ui(β) and Ω(β) are defined in Appendix 7. A consistent estimator of Σ(β0) is obtained by replacing limiting values by its empirical counter parts.

We now describe the asymptotic behavior of π̂0(t) from (7).

Theorem 2

Under assumptions (2) and (3), n1/2(π̂0π0) converges weakly to a mean-zero Gaussian process with a variance and covariance matrix between n1/2[π̂0(s) − π0(s)] and n1/2[π̂0(t) − π0(t)] given by σ(s, t) = E[ξ1(s)ξ1(t)], where

ξi(t)=fiπ1(t;β0)fiπ2(t;β0)s(0)(t;β0)

and fiπ1(t;β) and fiπ2(t;β) are further defined in Appendix 7.

3.2 Random Censoring

Now we describe the limiting behavior of estimator for the random-censoring set-up from Section 2.2.

Theorem 3

If censoring mechanism (9), assumptions (2) and (3) are correctly specified, then under the previously listed regularity conditions, β̂M is a consistent estimator of β0, and n1/2(β̂Mβ0) converges in distribution to a mean-zero Normal random variable with a variance-covariance matrix

M(β0,θ0)=[Ω(β0)]1E{[M1m=1Mu1m(β0,θ0)]2}[Ω(β0)]1,

where uim(β,θ) is defined in Appendix 7.

The asymptotic behavior of imputed version π^0M(t) from (14) is summarized in the following theorem.

Theorem 4

If censoring mechanism (9), assumptions (2) and (3) are correctly specified, then n1/2(π^0Mπ0) converges weakly to a mean-zero Gaussian process with a variance and covariance matrix between n1/2[π^0M(s)π0(s)] and n1/2[π^0M(t)π0(t)] given by σM (s, t) = E[ξ1M (s)ξ1M (t)], where

ξiM(t)=fiπ1(t;β0,θ0,M)fiπ2(t;β0,θ0,M)s(0)1(t;β0,θ0)

and fiπ1(t;β,θ,M) and fiπ2(t;β,θ,M) are further defined in Appendix 7.

4 Simulation Studies

We evaluate the finite-sample performance of our method through simulation studies. For each setting, n = 500 subjects are generated. The external covariate, Z(t) = Z1 + Z2(t), is generated from two components: Z1 and Z2(t), where Z1 ~ Uniform(0.5, 1) and Z2(t)=j=110I[10×(j1)<t10×j]Z3j with Z3j ~ Uniform(0, 1) for j = 1, 2, …, 10 and t = 1, 2, …, 100. The target model is survival-out-of-hospital probability, π(t) = π0(t)exp{β0Z(t)}, where π0(t) = I(t ≤ 10)(1 − 0.07t) + I(t > 10)(0.3 − 0.0025t) or 0.3 − 0.0025t, for t = 1, 2, …, 100. This model of the joint outcome can be generated through the hazard function for death time D, λD(t), and P{H(t) = 0|D > t, Z(t)}. We set λD(t)=λ0Dexp[α0Z(t)], where λ0D=0.008, α0 = log(0.7) and log(1.2). In this case, the out-of hospital indicator given survival follows the conditional probability P{H(t)=0|D>t,Z(t)}=π(t)exp [0tλD(s)ds]. The censoring time, C, is generated from the hazard function λC(t)=λ0Cexp[γ0Z(t)], where λ0C=0.008 and γ0 = log(1.5), such that C and the target event I{H(t) = 0, D > t} are conditionally independent given Z(t).

We consider 4 simulation settings, based on two versions of π0(t) and two magnitudes of α0 outlined in the previous paragraph. For π0(t) = I(t ≤ 10)(1−0.07t) + I(t > 10)(0.3 − 0.0025t), we consider β0 at −log(2), −log(1.5) and −log(1.3). In this scenario, the π̂0(t) in (7) may be larger than 1, but we make no adjustments when estimating β0. For π0(t) = 0.3−0.0025t, we evaluate β0 at log(1.5), −log(1.5) and log(1.3). Two magnitudes of α0 at log(0.7) and log(1.2) result in 76% and 60% subjects being censored. In addition to the known censoring scenario, we evaluate the performance of the proposed imputation method with M = 1 and M = 10. Parameters used for each of the four simulation settings are listed in Table 1.

Table 1.

Simulation Study: Data Configurations

Parameter Setting 1 Setting 2 Setting 3 Setting 4
λ0D
0.008 0.008 0.008 0.008
α0 log(0.7) log(1.2) log(0.7) log(1.2)
π0(t) I(t ≤ 10)(1 − 0.07t)+I(t > 10)(0.3 − 0.0025t) I(t ≤ 10)(1 − 0.07t)+I(t > 10)(0.3 − 0.0025t) 0.3 − 0.0025t 0.3 − 0.0025t
β −log(2), −log(1.5) or −log(1.3) −log(2), −log(1.5) or −log(1.3) log(1.5), −log(1.5) or log(1.3) log(1.5), −log(1.5) or log(1.3)
λ0C
0.008 0.008 0.008 0.008
γ0 log(1.5) log(1.5) log(1.5) log(1.5)

Table 2 and Table 3 summarize the simulation results for β̂ and Π̂0(50), respectively. In each setting, the bias of β̂ and Π̂0(50) are very small, indicating that our estimators are consistent even based on M = 1. Moreover, empirical standard deviations (ESDs) are generally close to the average asymptotic standard errors (ASEs), showing that our proposed variance estimator appear to be applicable to finite samples. The empirical coverage probabilities (ECPs) are also around 0.95, implying the accuracy of large-sample confidence intervals. For the random-censoring settings, we also calculate asymptotic relative efficiencies (AREs), defined as the ratio of mean square error (i.e., squared bias plus variance) between M = 1 and 10. The larger than 1 AREs indicate that multiple imputation is generally more efficient than single imputation.

Table 2.

Simulation results for β̂

Censoring Setting β0 BIAS ASE ESD ECP ARE
C known Setting 1 −0.693 −0.005 0.149 0.149 0.955 -
−0.405 0.001 0.133 0.136 0.946 -
−0.262 0.001 0.126 0.127 0.952 -
Setting 2 −0.693 0.003 0.180 0.185 0.950 -
−0.405 0.005 0.167 0.173 0.944 -
−0.262 −0.002 0.161 0.159 0.950 -
Setting 3 0.405 0.000 0.131 0.131 0.955 -
−0.405 0.001 0.162 0.161 0.947 -
0.262 0.000 0.134 0.135 0.949 -
Setting 4 0.405 0.004 0.182 0.188 0.934 -
−0.405 −0.010 0.206 0.205 0.955 -
0.262 −0.001 0.184 0.186 0.951 -

C random Setting 1 −0.693 0.004 0.149 0.153 0.946 1.021
M = 1 −0.405 0.007 0.132 0.134 0.951 1.03
−0.262 −0.002 0.125 0.124 0.948 1.003
Setting 2 −0.693 0.001 0.180 0.179 0.955 0.931
−0.405 0.000 0.167 0.168 0.953 0.972
−0.262 0.008 0.161 0.162 0.947 0.964
Setting 3 0.405 −0.002 0.130 0.132 0.947 0.963
−0.405 0.006 0.162 0.165 0.947 1.01
0.262 0.000 0.135 0.128 0.963 0.886
Setting 4 0.405 0.001 0.182 0.187 0.942 1.039
−0.405 −0.001 0.206 0.209 0.943 1.028
0.262 0.005 0.185 0.180 0.956 0.913

Table 3.

Simulation results for Π̂0(50)

Censoring Setting Π0(50) BIAS ASE ESD ECP ARE
C known Setting 1 15.100 0.371 2.882 2.972 0.948 -
15.100 0.222 2.582 2.697 0.945 -
15.100 0.186 2.424 2.489 0.946 -
Setting 2 15.100 0.430 3.520 3.788 0.941 -
15.100 0.273 3.221 3.399 0.930 -
15.100 0.332 3.100 3.066 0.943 -
Setting 3 11.812 0.151 1.964 2.030 0.943 -
11.812 0.246 2.455 2.481 0.952 -
11.812 0.188 2.027 2.096 0.942 -
Setting 4 11.812 0.270 2.755 2.919 0.926 -
11.812 0.564 3.177 3.274 0.952 -
11.812 0.347 2.813 2.872 0.937 -

C random Setting 1 15.100 0.196 2.869 3.022 0.942 1.009
M = 1 15.100 0.100 2.529 2.622 0.944 1.028
15.100 0.241 2.433 2.438 0.945 1.017
Setting 2 15.100 0.390 3.468 3.546 0.943 0.905
15.100 0.350 3.243 3.295 0.949 0.954
15.100 0.158 3.073 3.135 0.929 0.91
Setting 3 11.812 0.218 1.980 2.028 0.938 1.004
11.812 0.156 2.426 2.473 0.935 0.949
11.812 0.158 2.039 1.980 0.963 0.906
Setting 4 11.812 0.339 2.769 3.010 0.943 1.092
11.812 0.429 3.144 3.319 0.941 1.104
11.812 0.219 2.781 2.733 0.957 0.896

5 Real Data Analysis

We applied the proposed methods to data from the Dialysis Outcomes and Practice Patterns Study (DOPPS) Phase 5. The DOPPS is a prospective, observational study designed to elucidate aspects of hemodialysis practice that are associated with the best outcomes for hemodialysis patients (Young et al, 2000). In particular, Phase 5 data were collected between 2012 and 2015. Our research interests include identifying demographic and clinical variables that are associated with survival-out-of-hospital probability, and characterizing the underlying survival-out-of-hospital process.

The study population for DOPPS is of prevalent patients. We restricted our study sample to include the n = 6, 032 patients who entered DOPPS within 3 months of initiating dialysis since the interest is in time-since-dialysis-initiation instead of time-since-DOPPS-entry. Patients included in our analysis were from 470 hemodialysis units across 11 different countries, with the counties including: Belgium, Canada, China, Gulf Coast Consortium, Germany, Italy, Japan, Spain, Sweden, the United Kingdom and the U.S.. Covariates include age, race, gender, height, time on dialysis at study entry, as well as the following list of comorbid conditions: coronary artery disease (CAD), cancer, cardiovascular disease (CVD), stroke, congestive heart failure (CHF), diabetes, hypertension, chronic obstructive pulmonary disease (COPD), psychiatric disorder and peripheral vascular disease (PVD).

Since hospitalization and death times are recorded in days, t represents day (i.e, day post DOPPS entry) in our analysis. The mean number of hospital admissions was 0.525 per patient, while the median length of stay per visit was 5 days. Observed follow-up time had a median of 409 days. Around 16% of subjects were observed to die (i.e., for whom Ci > Di). Censoring time cannot realistically be considered to be fixed, since patients are frequently lost to follow-up for reasons other than death or closure of the DOPPS database. To recover missing censoring time, we created M = 10 imputed data sets as described in Section 2.2.

As shown in Table 4, coronary artery disease (p = 0.037), cancer (p = 0.010), cardiovascular disease (p = 0.031), congestive heart failure (p < 0.001) and peripheral vascular disease (p = 0.012) have significant negative effects on survival-out-of-hospital, while hypertension (p = 0.006) has significant positive effect. Patients from Canada (p = 0.009), Japan (p < 0.001) and Spain (p < 0.001) have significantly higher survival-out-of-hospital probability than patients from the U.S. (reference). For continuous variables, age had a significant negative effect (p < 0.001), while height had significant positive effect (p < 0.001). The estimated baseline survival-out-of-hospital probability π^010(t) is shown in Figure 1. The curve generally decreases as follow-up time increases, although the decrease is not monotone.

Table 4.

Analysis of DOPPS data: Covariate effects on survival-out-of-hospital probability (based on M = 10 imputations)

Covariate β̂
SE^(β)
p exp(β̂)
(Age in years)/5 −0.019 0.002 < 0.001* 0.982
Time on dialysis (years) −0.044 0.068 0.444 0.957
(Height in cm)/10 0.039 0.006 < 0.001* 1.040
Female 0.008 0.014 0.550 1.008
CADa −0.040 0.018 0.037* 0.961
Cancer −0.062 0.023 0.010* 0.940
CVDb −0.055 0.022 0.031* 0.946
Stroke 0.005 0.024 0.711 1.005
CHFc −0.073 0.017 < 0.001* 0.930
Diabetes 0.012 0.011 0.329 1.012
Hypertension 0.037 0.013 0.006* 1.038
COPDd −0.028 0.027 0.325 0.972
Psychiatric Disorder −0.027 0.022 0.249 0.973
PVDe −0.050 0.019 0.012* 0.951
Belgium 0.002 0.034 0.715 1.002
Canada 0.074 0.025 0.009* 1.076
China −0.010 0.035 0.654 0.990
Gulf −0.012 0.024 0.620 0.988
Germany 0.013 0.026 0.610 1.013
Italy 0.035 0.027 0.255 1.035
Japan 0.106 0.019 < 0.001* 1.112
Spain 0.093 0.024 < 0.001* 1.097
Sweden 0.033 0.029 0.301 1.033
UK 0.039 0.030 0.266 1.040
a

: Coronary artery disease

b

: Cardiovascular disease

c

: Congestive heart failure

d

: Chronic obstructive pulmonary disease

e

: Peripheral vascular disease

*

: p < 0.05

Fig. 1.

Fig. 1

Fitted baseline survival-out-of-hospital probability

6 Concluding Remarks

In this report, we propose semiparametric methods for analyzing the probability of survival-out-of-hospital, a novel end-point pertinent to a frequently arising instance of the recurrent/terminal event data structure. Estimation proceeds through estimating equations which are analogous to those employed in Cox regression. Multiple imputation is implemented to accommodate missing censoring times that are unobserved due to the subject dying. Asymptotic properties of estimators are derived, and simulation studies show the proposed methods have satisfactory finite sample performance.

We applied the methods to data from Phase 5 of the Dialysis Outcomes and Practice Patterns Study in order to identify significant predictors of survival-out-of-hospital. Coronary artery disease, cancer, cardiovascular disease, congestive heart failure and peripheral vascular disease are found to be comorbidity factors with significant negative effects on survival-out-of-hospital events, while hypertension has significant positive effect. Patients from Canada, Japan and Spain have significantly higher survival-out-of-hospital probability than patients in the U.S.. Moreover, increasing age had a significantly negative effect, while height had significantly positive effect on survival-out-of-hospital.

A key advantage of the proposed methods is that the baseline probability process does not need to be specified. The ability to profile out π0(t) in the estimation of β0 is facilitated by the use of a log link function in model (2). This was the reason for using the log link is lieu of its more frequently used alternatives. This is important, since the baseline probability function will often represent a nuisance parameter. Modeling the baseline parametrically could be tedious to carry out accurately, and could lead to bias in the regression parameter (of chief interest) if an incorrect parametric form is assumed. Note that, although the proposed baseline estimator shares a structure similar to that of the Breslow (1972) estimator, there are some important differences. First, our baseline estimator is not monotone, consistent with the temporal process it is targeting. Second, the estimator can be computed for any value of t, since the baseline probability reflects prevalence, as opposed to an intensity or occurrence rate.

Under the proposed methods, if C > D then C is considered to be missing data. Of the two most frequently employed techniques for handling missing data (namely inverse weighting and imputation), we chose to impute unobserved censoring times. Note that a related approach was developed by Schaubel and Zhang (2010). The multiple imputation method we use for unobserved censoring times represents so-called improper imputation, in the sense that the imputation model parameters are fixed at their estimated values. For an alternative strategy, proper imputation could also be considered, where the imputation model parameters are drawn from their estimated predictive posterior distribution (Little and Rubin, 2002). Both imputation methods would lead to consistent estimators of covariate effects but, potentially, with slightly different efficiency (Schaubel and Cai, 2006). The proposed methods are valid under single imputation, M = 1, which would not be the case under proper imputation. More specifically, Rubin’s renown variance formula for multiple imputation requires M > 1 since it explicitly utilizes the variability across the M estimates. As with its proper counterpart, it is well-known that improper imputation leads to consistent estimation. However, due to the inapplicability of Rubin’s formula under improper imputation, one must derive the variance explicitly (per Theorems 3–4 and their corresponding proofs in the Supplementary Materials). Examples of other methods that have handled imputation in a similar fashion include Lu and Tsiatis (2001), who imputed cause of death when missing. Note that, although M = 1 is valid, we recommend M = 5 or M = 10; using the mean of M imputed estimators is generally more efficient than using a single impute, although returns do diminish.

We proposed the Cox (1972) model for imputing censoring times, due to its flexibility and dominance in the analysis of censored epidemiologic data. Parameters estimated through the proposed methods are consistent only provided the censoring model is correctly specified. Assumptions on the censoring model could be assessed through standard techniques, such as Schoenfeld and Martingale residuals (Kalbfleisch and Prentice, 2002). Since dMi (t; β) defined at Section 3.1 is the difference between observed prevalence and predicted values, one could use the cumulative sum of dMi (t; β), analogous to the diagnostic techniques developed in Lin et al (2000) for recurrent event data.

Supplementary Material

10985_2018_9433_MOESM1_ESM

Acknowledgments

This work was supported in part by National Institutes of Health Grant R01 DK070869 and by an M-Cubed grant from the University of Michigan. The DOPPS is administered by Arbor Research Collaborative for Health and supported by scientific research grants from Amgen (since 1996), Kyowa Hakko Kirin (since 1999, in Japan), Sanofi Renal (since 2009), Abbott (since 2009), Baxter (since 2011), and Vifor Fresenius Renal Pharma (since 2011), without restrictions on publications. The authors thank Arbor Research Collaborative for Health for providing access to the DOPPS data and, in particular, Keith McCullough for creating the analysis files.

7 Appendix

We first list notation that appeared in Section 3.1:

Ω^(β)=n1i=1n0τ{S(2)(t;β)/S(0)(t;β)Z¯(t;β)2}Ai0(t)I(Cit)dt.
Ω(β)=E[0τ{s(2)(t;β)/s(0)(t;β)z¯(t;β)2}A10(t)I(C1t)dt].
ui(β)=0τ{Zi(t)z¯(t;β)}dMi(t;β).
dMi(t;β)=Ai0(t)I(Cit)dtexp[βTZi(t)]I(Cit)dt.
fiπ1(t;β)=Ai0(t)I(Cit)exp[βTZi(t)]I(Cit)π0(t).
fiπ2(t;β)=s(1)(t;β)Tπ0(t)fiβ(β).
fiβ(β)=Ω(β)1ui(β).

Formulas pertaining to Section 3.2 are as follows:

Ω^m(β,θ)=n1i=1n0τ{S(2)m(t;β,θ)/S(0)m(t;β,θ)Z¯m(t;β;θ)2}Ai0(t)I(Cimt;θ)dt.
Ω1(β,θ)=E[0τ{s(2)1(t;β,θ)/s(0)1(t;β,θ)z¯1(t;β,θ)2}A10(t)I(C11t;θ)dt].
uim(β,θ)=0τ{Zi(t)z¯1(t;β,θ)}dMim(t;β,θ).
dMim(t;β,θ)=Ai0(t)I(Cimt;θ)dtexp[βTZi(t)]I(Cimt;θ)dt.
fiπ1(t;β,θ,M)=M1m=1MI(Cimt;θ)Ai0(t)M1m=1MI(Cimt;θ)exp[βTZi(t)]π0(t).
fiπ2(t;β,θ,M)=s(1)1(t;β,θ)Tπ0(t)fiβ(β,θ,M).
fiβ(β,θ,M)=[Ω(β)]1M1m=1Muim(β,θ).

References

  1. Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. The annals of statistics. 1982;10:1100–1120. [Google Scholar]
  2. Andersen PK, Klein JP, Rosthøj S. Generalised linear models for correlated pseudo-observations, with applications to multi-state models. Biometrika. 2003:15–27. [Google Scholar]
  3. Breslow NE. Contribution to the discussion of the paper by D.R. Cox. Journal of the Royal Statistical Society, Series B. 1972;34(2):216–217. [Google Scholar]
  4. Carson RC, Juszczak M, Davenport A, Burns A. Is maximum conservative management an equivalent treatment option to dialysis for elderly patients with significant comorbid disease? Clinical Journal of the American Society of Nephrology. 2009;4(10):1611–1619. doi: 10.2215/CJN.00510109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cook RJ, Lawless JF. Marginal analysis of recurrent events and a terminating event. Statistics in medicine. 1997;16(8):911–924. doi: 10.1002/(sici)1097-0258(19970430)16:8<911::aid-sim544>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
  6. Cook RJ, Lawless JF, Lakhal-Chaieb L, Lee KA. Robust estimation of mean functions and treatment effects for recurrent events under event-dependent censoring and termination: application to skeletal complications in cancer metastatic to bone. Journal of the American Statistical Association. 2009;104(485):60–75. [Google Scholar]
  7. Cox DR. Regression models and life tables (with discussion) Journal of the Royal Statistical Society. 1972;34:187–220. [Google Scholar]
  8. Cox DR. Partial likelihood. Biometrika. 1975:269–276. [Google Scholar]
  9. Estes JP, Nguyen DV, Dalrymple LS, Mu Y, Şentürk D. Time-varying effect modeling with longitudinal data truncated by death: conditional models, interpretations, and inference. Statistics in medicine. 2015 doi: 10.1002/sim.6836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American statistical association. 1999;94(446):496–509. [Google Scholar]
  11. Fine JP, Yan J, Kosorok MR. Temporal process regression. Biometrika. 2004;91(3):683–703. [Google Scholar]
  12. Ghosh D, Lin D. Nonparametric analysis of recurrent events and death. Biometrics. 2000;56(2):554–562. doi: 10.1111/j.0006-341x.2000.00554.x. [DOI] [PubMed] [Google Scholar]
  13. Ghosh D, Lin DY. Marginal regression models for recurrent and terminal events. Statistica Sinica. 2002:663–688. [Google Scholar]
  14. Grand MK, Putter H. Regression models for expected length of stay. Statistics in medicine. 2016;35(7):1178–1192. doi: 10.1002/sim.6771. [DOI] [PubMed] [Google Scholar]
  15. Gray RJ. A class of k-sample tests for comparing the cumulative incidence of a competing risk. The Annals of statistics. 1988:1141–1154. [Google Scholar]
  16. Holland DC, Lam M. Predictors of hospitalization and death among predialysis patients: a retrospective cohort study. Nephrology Dialysis Transplantation. 2000;15(5):650–658. doi: 10.1093/ndt/15.5.650. [DOI] [PubMed] [Google Scholar]
  17. Hu XJ, Lorenzi M, Spinelli JJ, Ying SC, McBride ML. Analysis of recurrent events with non-negligible event duration, with application to assessing hospital utilization. Lifetime data analysis. 2011;17(2):215–233. doi: 10.1007/s10985-010-9183-8. [DOI] [PubMed] [Google Scholar]
  18. Huang CY, Wang MC. Joint modeling and estimation for recurrent event processes and failure time data. Journal of the American Statistical Association. 2004;99(468):1153–1165. doi: 10.1198/016214504000001033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Kalbeisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. 2. New York: Wiley; 2002. [Google Scholar]
  20. Kalbeisch JD, Schaubel DE, Ye Y, Gong Q. An estimating function approach to the analysis of recurrent and terminal events. Biometrics. 2013;69(2):366–374. doi: 10.1111/biom.12025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Lewis JA. Statistical principles for clinical trials (ich e9): an introductory note on an international guideline. Statistics in medicine. 1999;18(15):1903–1942. doi: 10.1002/(sici)1097-0258(19990815)18:15<1903::aid-sim188>3.0.co;2-f. [DOI] [PubMed] [Google Scholar]
  22. Lin D, Ying Z. Semiparametric analysis of the additive risk model. Biometrika. 1994;81(1):61–71. [Google Scholar]
  23. Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2000;62(4):711–730. [Google Scholar]
  24. Little RJ, Rubin DB. Statistical analysis with missing data. New York: Wiley; 2002. [Google Scholar]
  25. Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60(3):747–756. doi: 10.1111/j.0006-341X.2004.00225.x. [DOI] [PubMed] [Google Scholar]
  26. Mao L, Lin DY. Semiparametric regression for the weighted composite endpoint of recurrent and terminal events. Biostatistics. 2016;17(2):390. doi: 10.1093/biostatistics/kxv050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Neaton JD, Gray G, Zuckerman BD, Konstam MA. Key issues in end point selection for heart failure trials: composite end points. Journal of cardiac failure. 2005;11(8):567–575. doi: 10.1016/j.cardfail.2005.08.350. [DOI] [PubMed] [Google Scholar]
  28. Peng L, Huang Y. Survival analysis with temporal covariate effects. Biometrika. 2007;94(3):719–733. [Google Scholar]
  29. Pfeffer MA, Swedberg K, Granger CB, Held P, McMurray JJ, Michelson EL, Olofsson B, Östergren J, Yusuf S, Investigators C, Investigators C, Committees et al. Effects of candesartan on mortality and morbidity in patients with chronic heart failure: the charm-overall programme. The Lancet. 2003;362(9386):759–766. doi: 10.1016/s0140-6736(03)14282-1. [DOI] [PubMed] [Google Scholar]
  30. Sands JJ, Etheredge GD, Shankar A, Graff J, Loeper J, McKendry M, Farrell R. Predicting hospitalization and mortality in end-stage renal disease (esrd) patients using an index of coexisting disease (iced)-based risk stratification model. Disease Management. 2006;9(4):224–235. doi: 10.1089/dis.2006.9.224. [DOI] [PubMed] [Google Scholar]
  31. Schaubel DE, Cai J. Multiple imputation methods for recurrent event data with missing event category. Canadian Journal of Statistics. 2006;34(4):677–692. [Google Scholar]
  32. Schaubel DE, Zhang M. Estimating treatment effects on the marginal recurrent event mean in the presence of a terminating event. Lifetime data analysis. 2010;16(4):451–477. doi: 10.1007/s10985-009-9149-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Schaubel DE, Zeng D, Cai J. A semiparametric additive rates model for recurrent event data. Lifetime Data Analysis. 2006;12(4):389–406. doi: 10.1007/s10985-006-9017-x. [DOI] [PubMed] [Google Scholar]
  34. Scheike TH, Zhang MJ. Direct modelling of regression effects for transition probabilities in multistate models. Scandinavian Journal of Statistics. 2007;34(1):17–32. [Google Scholar]
  35. Yan J, Fine JP. Functional association models for multivariate survival processes. Journal of the American Statistical Association. 2005;100(469):184–196. [Google Scholar]
  36. Yan J, Huang J. Partly functional temporal process regression with semiparametric profile estimating functions. Biometrics. 2009;65(2):431–440. doi: 10.1111/j.1541-0420.2008.01071.x. [DOI] [PubMed] [Google Scholar]
  37. Ye Y, Kalbeisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics. 2007;63(1):78–87. doi: 10.1111/j.1541-0420.2006.00677.x. [DOI] [PubMed] [Google Scholar]
  38. Young EW, Goodkin DA, Mapes DL, Port FK, Keen ML, Chen K, Maroni BL, Wolfe RA, Held PJ. The dialysis outcomes and practice patterns study (dopps): an international hemodialysis study. Kidney International. 2000;57:S74–S81. [Google Scholar]
  39. Zeng D, Cai J. A semiparametric additive rate model for recurrent events with an informative terminal event. Biometrika. 2010;97:699–712. doi: 10.1093/biomet/asq039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Zhu L, Zhao H, Sun J, Pounds S. A conditional approach for regression analysis of longitudinal data with informative observation time and non-negligible observation duration. Communications in Statistics-Theory and Methods. 2014;43(23):4998–5011. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

10985_2018_9433_MOESM1_ESM

RESOURCES