Analysis of combined incident and prevalent cohort data under a proportional mean residual life model

Chi Hyun Lee; Jing Ning; Richard J Kryscio; Yu Shen

doi:10.1002/sim.8098

. Author manuscript; available in PMC: 2019 May 30.

Published in final edited form as: Stat Med. 2019 Jan 24;38(12):2103–2114. doi: 10.1002/sim.8098

Analysis of combined incident and prevalent cohort data under a proportional mean residual life model

Chi Hyun Lee ^1,^*, Jing Ning ¹, Richard J Kryscio ^2,³, Yu Shen ¹

PMCID: PMC6461486 NIHMSID: NIHMS1014550 PMID: 30680767

Summary

The Nun Study, a longitudinal study to examine risk factors for the progression of dementia, consists of subjects who were already diagnosed with dementia (i.e., prevalent cohort) and those who do not have dementia (i.e., incident cohort) at study enrollment. When assessing the risk factors’ effects on the survival time from dementia diagnosis until death, utilizing data from both cohorts supports more efficient statistical inference because the two cohorts provide valuable complementary information. A major challenge in analyzing the combined cohort data is that the prevalent cases are not representative of the target population. Moreover, the dates of dementia diagnosis are not ascertained for the prevalent cohort in the Nun Study. Hence, the survival time for the prevalent cohort is only partially observed from study enrollment until death or censoring, with the time from dementia diagnosis to study enrollment missing. In this paper, we propose an efficient estimation method that uses both incident and prevalent cohorts under the proportional mean residual life model. By assuming proportionality of the mean residual life time with covariates in the incident cohort, we can utilize the natural relationship between the mean residual life function and the hazard function of the survival time measured from enrollment until death for the prevalent cohort. We evaluate the efficiency gain from using the combined cohort data through simulations and demonstrate that the proposed method is valid and efficient.

Keywords: Combined cohort data, incident cohort, Nun Study, prevalent cohort, proportional hazards model, proportional mean residual life model

1 |. INTRODUCTION

Prospective observational studies are commonly used to identify and evaluate risk factors that are associated with disease-specific survival. Such studies occasionally include both incident and prevalent cohorts. For example, the Nun Study of Aging and Alzheimer’s Disease (Nun Study),¹ which motivates this work, involves an incident cohort of subjects who have not experienced dementia onset and are followed over time to monitor the potential diagnosis of dementia and death; and the prevalent cohort of subjects who already have dementia but have not experienced death at the time of study entry. The two cohorts provide valuable complementary information: the incident cohort is a random sample from the target population; and the prevalent cohort includes more deaths since subjects are sampled in the midst of dementia. Thus, analyzing the combined data from both cohorts yields more efficient statistical results. However, statistical analysis using the combined data has received less attention in the literature.

The data from the Nun Study consist of 501 subjects after excluding 177 participants who had missing key covariates (22) or withdrew consent (155). Among the participants represented in the data, 77 (about 15%) already had dementia and 424 were not yet diagnosed with dementia at study entry; these participants comprise the prevalent and incident cohorts, respectively. During the prospective follow-up, 153 subjects among the incident cohort were diagnosed with dementia. The dates of diagnosis of dementia were not available for the 77 subjects with dementia in the prevalent cohort. The combined cohort data are illustrated in Figure S1 of the web-based supplementary materials. In the statistical literature, the Nun Study data have been used primarily to illustrate Markov transition models,^2,3,4,5,6 which has excluded the data from the prevalent cohort. We aim to take advantage of data from both the prevalent and incident cohorts for more efficient evaluation of the relationship between the risk factors and the survival time after diagnosis of dementia. In addition to the challenge of properly adjusting for sampling bias, a major issue when analyzing the combined data from the Nun Study is that the dates of dementia diagnosis for the prevalent cases were not ascertained. Thus, we only observe the time from study enrollment to death (referred to as the “forward recurrence time”) with the information of the time from diagnosis of dementia to study enrollment (referred to as the “backward recurrence time”) missing for the prevalent cohort.

We consider the proportional mean residual life (PMRL) model ⁷ to assess the effect of risk factors on the residual survival time. By assuming proportionality of the mean residual life time with covariates, we can utilize the natural relationship between the mean residual life function and the hazard function of the forward recurrence time to analyze the combined cohort data. In Section 2, we introduce notations to depict the combined cohort data and present the connection between the PMRL model and the proportional hazards (PH) model. We review existing estimation methods for data from the incident cohort only and the prevalent cohort only, and propose efficient estimating equations for the combined cohorts in Section 3. The asymptotic properties are also established in this section. We investigate finite sample properties through simulation studies under various settings in Section 4. In Section 5, we use the proposed method to analyze the Nun Study data. We provide some remarks in Section 6.

2 |. NOTATIONS AND MODEL

We consider data from both the incident and prevalent cohorts with respective sample sizes of n₁ and n₂. For the incident cohort, we denote T⁰ and a p × 1 vector X as the duration from disease diagnosis to death and the time-independent covariates, respectively. Let C be the duration from disease diagnosis to a censoring event. Then, the observed data from the incident cohort consist of independent and identically distributed (i.i.d.) {(T_i, A_i, X_i), i = 1, … , n₁}, where $T_{i} = \min (T_{i}^{0}, C_{i})$ and $Δ_{i} = I (T_{i}^{0} \leq C_{i})$ . We assume that the censoring time C is conditionally independent of T⁰ given covariates X. We note that the incident cohort is representative of the target population. For prevalent cases, the dates of dementia diagnosis, which occurred prior to enrollment, are unknown. Thus, only partial information on survival times that is measured from the study enrollment is available. We introduce additional notations to represent the event times observed from the prevalent cohort. Let V⁰ and a p × 1 vector X^v denote the duration from enrollment to death and the time-independent covariates for the prevalent cohort, respectively. Unlike the censoring time C for the incident cohort, the censoring time C^v is measured from enrollment until a censoring event. The observed prevalent cohort data are i.i.d. ${(V_{i}, Δ_{i}^{υ}, X_{i}^{υ}), i = 1, ..., n_{2}}$ , where $V_{i} = \min (V_{i}^{0}, C_{i}^{υ})$ and $Δ_{i}^{υ} = I (V_{i}^{0} \leq C_{i}^{υ})$ . The censoring time for the prevalent cohort, C^v, is assumed to be conditionally independent of V⁰ given covariates X^v. Based on research about dementia,^8,9 it is reasonable to assume that the natural history of dementia follows a stationary Poisson process. Under such an assumption, the prevalent cohort is subject to length-biased sampling.

The mean residual life function for the underlying survival time T⁰ at time t can be defined as m(t | X) = E(T⁰−t | T⁰ > t, X). To assess the covariate effects on the mean residual time, we assume the PMRL model ⁷ as

m (t | X) = m_{0} (t) \exp (β^{┬} X),

(1)

where m₀(t) is the unspecified positive baseline mean residual life function and β is a p × 1 vector of coefficients. We may use existing methods to fit the model to data from the incident cohort only. ^10,11 However, the observed data from the prevalent cohort cannot directly fit model (1) because the survival times are length biased and the backward recurrence times are missing. Under length-biased sampling, it is shown that the conditional density function of the forward recurrence time V⁰ given covariates X is

f_{V^{0} | X} (υ | X) = \frac{S (υ | X)}{m (0 | X)},

where S(· | X) is the conditional survival function of T⁰ and m(0 | X) is the mean survival time of T⁰ given X. ¹² It follows that the hazard function of the forward recurrence time is

λ^{υ} (t | X) = \frac{S (t | X) / m (0 | X)}{\int_{t}^{τ} S (u | X) / m (0 | X) d u} = \frac{S (t | X)}{\int_{t}^{τ} S (u | X) d u} = \frac{1}{E (T^{0} - t | T^{0} > t, X)} = \frac{1}{m (t | X)}

where τ is the finite upper bound that satisfies Pr(T > τ) > 0. Therefore, as discussed by Maguluri and Zhang,¹³ Chen and Cheng,¹⁰ and Chen et al.,¹¹ the PMRL model for T⁰ implies the following PH model for the forward recurrence time V⁰:

λ^{υ} (t | X) = {m (t | X)}^{- 1} = {m_{0} (t)}^{- 1} \exp (- β^{┬} X) = λ_{0}^{υ} (t) \exp (- β^{┬} X),

(2)

where $λ_{0}^{υ} (\cdot)$ is the positive unspecified baseline hazard function of the forward recurrence time.

3 |. ESTIMATION METHODS

3.1. |. Estimation for Incident Cohort

For data from an incident cohort, Maguluri and Zhang ¹³ proposed an estimation method under the PMRL model when censoring was absent. Chen et al. ¹¹ extended the method to accommodate right censoring using the inverse probability of censoring weighted (IPCW) approach. The IPCW estimating equation assumes that censoring is independent of the covariates. While the assumption can be relaxed to tackle a censoring distribution that is dependent on the covariates, as discussed in the paper, the censoring mechanism needs to be modelled. An alternative semiparametric estimation procedure was developed based on the counting process theory by Chen and Cheng. ¹⁰ We briefly review their method in this section.

Based on the definition of m(t | X) and using an inversion formula, we can derive the conditional survival function of T⁰ given X,

S (t | X) = \frac{m (0 | X)}{m (t | X)} \exp {- \int_{0}^{t} \frac{1}{m (u | X)} d u} .

Under model (1), it follows that

m_{0} (t) d Λ_{i} (t) = \exp (- β^{┬} X_{i}) d t + d m_{0} (t),

(3)

where Λ_i(t) is the cumulative hazard function of $T_{i}^{0}$ . Let N_i(t) = I(T_i ≤ t)Δ_i and Y_i(t) = I(T_i ≥ t). Define

M_{i} (t; β, m_{0}) = N_{i} (t) - \int_{0}^{t} Y_{i} (s) d Λ_{i} (s; β, m_{0}),

(4)

where dΛ_i(t;β, m₀) = {exp(−β^┬X_i)dt + dm₀(t)} for i = 1, …,n₁.Expression (4) is a zero-mean martingale when β = β* and $m_{0} (\cdot) = m_{0}^{*} (\cdot)$ , where β* and $m_{0}^{*}$ are the true parameter and the true baseline mean function, respectively. Based on equation (3) and expression (4), the following estimating equations are constructed to estimate $m_{0} (\cdot)$ and β,

\frac{1}{n_{1}} \sum_{i = 1}^{n_{1}} [m_{0} (t) d N_{i} (t) - Y_{i} (t) {\exp (- β^{┬} X_{i}) d t + d m_{0} (t)}] = 0

(5)

\frac{1}{n_{1}} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} X_{i} [m_{0} (t) d N_{i} (t) - Y_{i} (t) {\exp (- β^{┬} X_{i}) d t + d m_{0} (t)}] = 0

(6)

A closed form solution is available for m₀(·) from equation (5),

{\hat{m}}_{0} (t; β) = {\hat{S} (t)}^{- 1} \int_{t}^{τ} \hat{S} (u) Q (u; β) d u,

where $\hat{S} (t) = \exp {- \int_{0}^{t} \sum_{i = 1}^{n_{1}} d N_{i} (u) / \sum_{i = 1}^{n_{1}} Y_{i} (t)}$ and $Q (t; β) = \sum_{i = 1}^{n_{1}} Y_{i} (t) \exp (- β^{┬} X_{i}) / \sum_{i = 1}^{n_{1}} Y_{i} (t)$ . After replacing $m_{0} (t)$ with ${\hat{m}}_{0} (t; β)$ in equation (6), we have the estimating function for β

U_{I} (β) = \frac{1}{n_{1}} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} {X_{i} - \bar{X} (t)} {{\hat{m}}_{0} (t; β) d N_{i} (t) - Y_{i} (t) \exp (- β^{┬} X_{i}) d t},

(7)

where $\bar{X} (t) = \sum_{i = 1}^{n_{1}} Y_{i} (t) X_{i} / \sum_{i = 1}^{n_{1}} Y_{i} (t)$ . The estimator ${\hat{β}}_{I}$ can be obtained from the solution to $U_{I} (β) = 0$ . Chen and Cheng¹⁰ showed that $n_{1}^{1 / 2} ({\hat{β}}_{I} - β^{*})$ converges weakly to a normal distribution with mean zero and covariance matrix $A_{I}^{- 1} Σ_{I} A_{I}^{- 1}$ under the regularity conditions (C1)−(C5) listed in Appendix A.1. We define matrices A_I and Ʃ_I in Appendix A.2. The covariance matrix $A_{I}^{- 1} Σ_{I} A_{I}^{- 1}$ can be consistently estimated by ${{\hat{A}}_{I} ({\hat{β}}_{I})}^{- 1} {\hat{Σ}}_{I} ({\hat{β}}_{I}) {{\hat{A}}_{I} ({\hat{β}}_{I})}^{- 1}$ , where

{\hat{Σ}}_{I} (β) = \frac{1}{n_{1}} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} {X_{i} - \bar{X} (t)}^{\otimes 2} Y_{i} (t) {\hat{m}}_{0} (t; β) {\exp (- β^{┬} X_{i}) d t + d {\hat{m}}_{0} (t; β)},

{\hat{A}}_{I} (β) = \frac{1}{n_{1}} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} {X_{i} - \bar{X} (t)}^{\otimes 2} Y_{i} (t) \exp (- β^{┬} X_{i}) d t,

in which $a^{\otimes 2} = a a^{┬}$ for any vector a.

3.2 |. Estimation for Prevalent Cohort

As discussed, data arising from prevalent sampling are subject to length bias, which hinders one from applying the method proposed for the incident cohort. Under the PMRL model, Bai et al. ¹⁴ proposed a semiparametric method for right-censored length-biased data, adopting the IPCW approach. That method properly addressed the induced dependent censoring issue and sampling bias, which are commonly encountered in length-biased data with right censoring. However, that method is not directly applicable to our motivating data because the survival times are not available due to missing backward recurrence times. Due to the special relationship between the PMRL and the PH models shown in equation (2), it is sufficient to estimate the covariate effects using only the observed forward recurrence times from the prevalent cohort. Note that we are estimating the same regression coefficient β for the target population under model (1) with the prevalent cohort data as with the incident cohort data. This approach has been studied for right-censored length-biased data by Chan et al. ¹⁵ for cross-sectional sampled data with no follow-up or data with no information on the disease diagnosis time. The prevalent cohort data in our study belong to the latter case.

Denote $N_{i}^{υ} (t) = I (V_{i} \leq t) Δ_{i}^{υ}$ and $Y_{i}^{υ} (t) = I (V_{i} \geq t), for i = 1, \dots, n_{2}$ , Define $S^{(k)} (β, t) = n^{- 1} \sum_{i = 1}^{n_{2}} X_{i}^{υ \otimes k} \exp (- β^{┬} X_{i}^{υ}) Y_{i}^{υ} (t)$ for k = 0,1, and 2, where $a^{\otimes 0} = 1$ , $a^{\otimes 1} = a$ , $a^{\otimes 2} = a a^{T}$ for any vector a. Based on the relationship shown in equation (2), we can estimate the regression parameter _ by adopting the partial likelihood score function,

U_{P} (β) = \frac{1}{n_{2}} \sum_{i = 1}^{n_{2}} \int_{0}^{τ} {X_{i}^{υ} - ε (β, t)} d N_{i}^{υ} (t),

(8)

where $ε (β, t) = S^{(1)} (β, t) / S^{(0)} (β, t)$ . The solution to $U_{P} (β) = 0$ is the estimator ${\hat{β}}_{P}$ Under the regularity conditions (C1)–(C4), and (C6) listed in Appendix A.1, the distribution of $n_{2}^{1 / 2} ({\hat{β}}_{P} - β^{*})$ converges to a normal distribution with mean zero and covariance matrix $A_{P}^{- 1} Σ_{P} A_{P}^{- 1}$ , where A_P and Ʃ_P are defined in Appendix A.3. We can consistently estimate $A_{P}^{- 1} Σ_{P} A_{P}^{- 1}$ by ${{\hat{A}}_{P} ({\hat{β}}_{P})}^{- 1} {\hat{Σ}}_{P} ({\hat{β}}_{P}) {{\hat{A}}_{P} ({\hat{β}}_{P})}^{- 1}$ , where

{\hat{Σ}}_{P} (β) = \frac{1}{n_{2}} \sum_{i = 1}^{n_{2}} {[\int_{0}^{τ} {X_{i}^{υ} - ε (β, t)} d N_{i}^{υ} (t)]}^{\otimes 2},

{\hat{A}}_{P} (β) = \frac{1}{n_{2}} \sum_{i = 1}^{n_{2}} \int_{0}^{τ} [\frac{S^{(2)} (β, t)}{S^{(0)} (β, t)} - {ε (β, t)}^{\otimes 2}] d N_{i}^{υ} (t) .

Note that the estimating function (8) is equivalent to the score function for conventional survival data under the PH model, except for the unknown regression coefficients being negative of β. Thus, we can implement the estimation method using readily available software.

3.3 |. Estimation Using the Combined Cohorts

Although the data arising from the two cohorts have distinct data structures with different time variables, they are from the same target population. Thus, we may use the combined cohort data to make inference for the target cohort under model (1) regarding survival times. To improve statistical efficiency, we propose an estimation method that combines the two weighted estimating functions using data from the incident and prevalent cohorts. We consider a class of weighted linear combinations of the estimating functions (7) and (8):

U_{C} (β) = \frac{1}{n} {W_{1} n_{1} U_{I} (β) + W_{2} n_{2} U_{p} (β)} = \frac{1}{n} [W_{1} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} {X_{i} - \bar{X} (t)} {{\hat{m}}_{0} (t; β) d N_{i} (t) - Y_{i} (t) \exp (- β^{┬} X_{i}) d t} + W_{2} \sum_{i = 1}^{n_{2}} \int_{0}^{τ} {X_{i}^{υ} - ε (β, t)} d N_{i}^{υ} (t)],

(9)

where W₁ and W₂ are p × p weight matrices. We combine the estimating equations derived from each cohort instead of using the weighted average of the two estimators, ${\hat{β}}_{I}$ and ${\hat{β}}_{p}$ , to avoid imposing a restrictive condition that the optimal estimator is a linear combination of the two estimators. Note that the total sample size increases to n = n₁ + n₂ by combining the data from the two cohorts. We can obtain a class of estimators ${\tilde{β}}_{C}$ by solving $U_{C} (β) = 0 for β$ .

Among the class of estimators ${\tilde{β}}_{C}$ , we derive the estimator with the smallest asymptotic variance by finding the optimal W = (W₁, W₂). Let $ρ = \lim_{n_{1}}_{\to \infty, n_{2} \to \infty} n_{1} / (n_{1} + n_{2})$ . Based on the large sample properties of the estimators ${\hat{β}}_{I}$ and ${\hat{β}}_{p}$ , the asymptotic covariance matrix of $n^{1 / 2} ({\tilde{β}}_{C} - β *)$ is

Ω_{C} (W) = {ρ W_{1} A_{I} + (1 - ρ) W_{2} A_{p}}^{- 1} {ρ W_{1} \sum_{I} W_{1}^{T} + (1 - ρ) W_{2} \sum_{p} W_{2}^{T}} {[{ρ W_{1} A_{I} + (1 - ρ) W_{2} A_{p}}^{- 1}]}^{T} .

By the matrix Cauchy–Schwarz inequality,¹⁶ for any W,

Ω_{C} (W) \geq Ω_{o p t} = {ρ A_{I} \sum_{I}^{- 1} A_{I} + (1 - ρ) A_{P} \sum_{P}^{- 1} A_{P}}^{- 1} .

We can attain the efficiency bound $Ω_{o p t}$ when the weight matrices $W_{1} = A_{I} \sum_{I}^{- 1}$ and $W_{2} = A_{p} \sum_{p}^{- 1}$ , which are the optimal weights. Since the optimal weights depend on the unknown parameter β, we proceed to a two-step estimation. We first derive an estimator that is consistent with β* by solving $U_{C} (β) = 0$ with W₁ = W₂ = I_p×p, where I_p×p is the identity matrix, to obtain the first-step estimator ${\hat{β}}_{C}$ . Then, the efficient estimator ${\hat{β}}_{o p t}$ is the solution to

U_{o p t} (β) = \frac{1}{n} {{\hat{W}}_{1} n_{1} U_{I} (β) + {\hat{W}}_{2} n_{2} U_{p} (β)} = 0,

where ${\hat{W}}_{1} = {\hat{A}}_{I} ({\hat{β}}_{C}) {{\hat{Σ}}_{I} ({\hat{β}}_{C})}^{- 1}$ and ${\hat{W}}_{2} = {\hat{A}}_{P} ({\hat{β}}_{C}) {{\hat{Σ}}_{P} ({\hat{β}}_{C})}^{- 1}$ . The asymptotic properties of ${\hat{β}}_{o P t}$ are summarized in the following theorem.

Theorem 1. Under the regularity conditions listed in Appendix A.1, $n^{1 / 2} ({\hat{β}}_{o p t} - β^{*})$ converges weakly to a normal distribution with mean zero and covariance matrix $Ω_{o p t}$ .

The detailed proofs of Theorem 1 are provided in Appendix A.4. The covariance matrix $Ω_{o p t}$ can be consistently estimated ${\hat{Ω}}_{o p t}$ ,

{[\hat{ρ} {\hat{A}}_{I} ({\hat{β}}_{o p t}) {{\hat{Σ}}_{I} ({\hat{β}}_{o p t})}^{- 1} {\hat{A}}_{I} ({\hat{β}}_{o p t}) + (1 - \hat{ρ}) {\hat{A}}_{P} ({\hat{β}}_{o p t}) {{\hat{Σ}}_{P} ({\hat{β}}_{o p t})}^{- 1} {\hat{A}}_{P} ({\hat{β}}_{o p t})]}^{- 1},

where $\hat{ρ} = n_{1} / n$ .

4 |. SIMULATION STUDY

We conducted simulation studies to investigate the finite sample properties of the proposed estimation method for the combined cohort data. We simulated 1000 datasets that consist of n₁ subjects from the incident cohort and n₂ subjects from the prevalent cohort. Total sample sizes of n = n₁ + n₂ = 200 and 400 were considered with various combinations. We considered two covariates: X₁ from a Bernoulli distribution with probability 0.5 and X₂ from a uniform distribution (0, 1) for both cohorts. Conditioning on X₁ and X₂, the survival time T⁰ was generated from the same target population under the mean residual life model m(t | X₁, X₂) = (at+b) exp(β₁X₁ +β₂X₂), where parameters for the baseline mean function (a, b) = (0.1, 0.5) and the true coefficients (β₁, β₂) = (0.5, −0.5). For the incident cohort, we randomly generated n₁ observations, $(T_{i}^{0}, X_{1 i}, X_{2 i})$ , i = 1, … , n₁. For the prevalent cohort, we generated the left truncation time A from a uniform distribution and only kept observations that satisfy T⁰ > A. We continued the sampling procedure until we sampled n₂ observations $(V_{j}^{0}, X_{1 j}^{υ}, X_{2 j}^{υ})$ j = 1, … , n₂, where $V_{j}^{0} = T_{j}^{0} - A_{j}$ , and $X_{1 j}^{υ} = X_{1 j}, X_{2 j}^{υ} = X_{2 j}$ for subject j with $T_{j}^{0} > A_{j}$ . Since both cohorts are subject to right censoring, we generated censoring times C and C^v from a uniform distribution (0, τ_C ) and chose τ_C to allow for 15% and 30% of censoring rates overall. Under this setting, the censoring rate of each cohort is about the same. The distributions of C and C^v share the same support because the follow-up periods for both cohorts are the same in practice. The generated dataset consists of ${(T_{i}, Δ_{i}, X_{1 i}, X_{2 i}), (V_{j}, Δ_{j}^{υ}, X_{1 j}^{υ}, X_{2 j}^{υ}); i = 1, \dots, n_{1}, j = 1, \dots, n_{2}}$ .

We denoted ${\hat{β}}_{I}$ as the estimator using the simulated incident cohort data only, ${\hat{β}}_{P}$ using the simulated prevalent cohort data only, and ${\hat{β}}_{C}$ and ${\hat{β}}_{o p t}$ as the proposed estimators using data from both cohorts with identity weight matrices and the optimal weights, respectively. Tables 1 and 2 summarize the simulation results. When the overall censoring rate is as low as 15%, all estimators present virtually unbiased point estimates, the asymptotic standard errors are close to the empirical standard deviations of the point estimates, and the coverage probabilities are close to the nominal level of 95%. We note that the relative efficiency of the estimators ${\hat{β}}_{I}$ and ${\hat{β}}_{P}$ highly depends on the number of samples in each cohort. When there are more samples and hence more failure events in the incident cohort than in the prevalent cohort (i.e., n₁ > n₂), ${\hat{β}}_{I}$ has smaller variance, which indicates that it is more efficient than ${\hat{β}}_{P}$ , and vice versa. When the proposed method is used for the combined cohort data, we have an increased sample size of n₁ + n₂. Thus, we observe smaller variance estimates for ${\hat{β}}_{C}$ and ${\hat{β}}_{o p t}$ compared to ${\hat{β}}_{I}$ and ${\hat{β}}_{P}$ under all settings. To assess the efficiency gain of the proposed estimators over ${\hat{β}}_{I}$ and ${\hat{β}}_{P}$ , we compute the relative efficiency, which is defined as the ratio of the mean squared errors of the estimators. For example, when n₁ = 100, n₂ = 100, and the censoring rate is 15%, ${\hat{β}}_{C}$ for β₁ is 1.86 and 1.93 times more efficient than ${\hat{β}}_{I}$ and ${\hat{β}}_{P}$ , respectively; and ${\hat{β}}_{o p t}$ is respectively 2.08 and 2.15 times more efficient. The proposed estimator with optimal weights ${\hat{β}}_{o p t}$ is relatively more efficient than ${\hat{β}}_{C}$ across all settings. While the point estimates for ${\hat{β}}_{o p t}$ tend to be slightly more biased than ${\hat{β}}_{C}$ due to the two-step estimation procedure, the mean squared errors of ${\hat{β}}_{o p t}$ are smaller in every setting.

TABLE 1.

Summary statistics of simulation results for estimating (β₁, β₂) = (0.5, −0.5) with n = 200. Monte Carlo mean of the estimates (Est), the empirical standard deviation (SD), the mean standard error (SE), the mean squared error (MSE) and the coverage probability (CP) using incident cohort only $({\hat{β}}_{I})$ , prevalent cohort only $({\hat{β}}_{P})$ , and both incident and prevalent cohorts ( ${\hat{β}}_{C}$ and ${\hat{β}}_{o p t}$ ) with sample sizes of n₁ and n₂ for incident and prevalent cohorts, respectively, and censoring rates (cr) of 15% and 30%.

				β₁					Β₂

n₁	n₂	cr		Est	SD	SE	MSE	CP	Est	SD	SE	MSE	CP
125	75	15%	${\hat{β}}_{I}$	0.481	0.208	0.206	0.043	0.943	−0.489	0.347	0.359	0.120	0.950
			${\hat{β}}_{P}$	0.526	0.276	0.274	0.077	0.948	−0.502	0.492	0.468	0.242	0.950
			${\hat{β}}_{C}$	0.502	0.171	0.172	0.029	0.954	−0.484	0.302	0.297	0.091	0.952
			${\hat{β}}_{o p t}$	0.487	0.162	0.164	0.026	0.950	−0.493	0.279	0.282	0.078	0.947
		30%	${\hat{β}}_{I}$	0.422	0.199	0.198	0.046	0.934	−0.429	0.336	0.346	0.118	0.936
			${\hat{β}}_{P}$	0.519	0.306	0.300	0.094	0.951	−0.499	0.547	0.517	0.299	0.940
			${\hat{β}}_{C}$	0.471	0.186	0.181	0.035	0.948	−0.450	0.322	0.314	0.106	0.934
			${\hat{β}}_{o p t}$	0.443	0.164	0.165	0.030	0.940	−0.456	0.281	0.285	0.081	0.934
100	100	15%	${\hat{β}}_{I}$	0.476	0.232	0.229	0.054	0.945	−0.470	0.390	0.401	0.153	0.950
			${\hat{β}}_{P}$	0.513	0.236	0.235	0.056	0.951	−0.524	0.401	0.399	0.161	0.938
			${\hat{β}}_{C}$	0.498	0.169	0.171	0.029	0.955	−0.499	0.286	0.294	0.081	0.951
			${\hat{β}}_{o p t}$	0.486	0.161	0.163	0.026	0.950	−0.497	0.273	0.280	0.074	0.944
		30%	${\hat{β}}_{I}$	0.417	0.222	0.222	0.056	0.927	−0.416	0.381	0.387	0.152	0.938
			${\hat{β}}_{P}$	0.509	0.256	0.258	0.066	0.951	−0.539	0.446	0.442	0.200	0.952
			${\hat{β}}_{C}$	0.476	0.182	0.182	0.034	0.942	−0.486	0.307	0.314	0.095	0.958
			${\hat{β}}_{o p t}$	0.450	0.165	0.167	0.030	0.942	−0.473	0.284	0.288	0.081	0.953
75	125	15%	${\hat{β}}_{I}$	0.473	0.263	0.263	0.070	0.943	−0.469	0.463	0.459	0.215	0.933
			${\hat{β}}_{P}$	0.512	0.217	0.209	0.047	0.939	−0.514	0.365	0.353	0.134	0.950
			${\hat{β}}_{C}$	0.501	0.172	0.170	0.030	0.940	−0.495	0.297	0.289	0.088	0.947
			${\hat{β}}_{o p t}$	0.489	0.163	0.163	0.027	0.946	−0.492	0.286	0.277	0.082	0.943
		30%	${\hat{β}}_{I}$	0.421	0.252	0.254	0.069	0.942	−0.415	0.452	0.443	0.212	0.940
			${\hat{β}}_{P}$	0.515	0.234	0.227	0.055	0.946	−0.512	0.407	0.388	0.166	0.945
			${\hat{β}}_{C}$	0.491	0.185	0.181	0.034	0.944	−0.478	0.326	0.311	0.107	0.953
			${\hat{β}}_{o p t}$	0.466	0.167	0.168	0.029	0.949	−0.469	0.307	0.289	0.095	0.944

Open in a new tab

TABLE 2.

Summary statistics of simulation results for estimating (β₁, β₂) = (0.5, −0.5) with n = 400. Monte Carlo mean of the estimates (Est), the empirical standard deviation (SD), the mean standard error (SE), the mean squared error (MSE) and the coverage probability (CP) using incident cohort only $({\hat{β}}_{I})$ , prevalent cohort only $({\hat{β}}_{P})$ , and both incident and prevalent cohorts ( ${\hat{β}}_{C}$ and ${\hat{β}}_{o p t}$ ) with sample sizes of n₁ and n₂ for incident and prevalent cohorts, respectively, and censoring rates (cr) of 15% and 30%.

				β₁					Β₂

n₁	n₂	cr		Est	SD	SE	MSE	CP	Est	SD	SE	MSE	CP
250	150	15%	${\hat{β}}_{I}$	0.479	0.142	0.146	0.021	0.946	−0.472	0.252	0.254	0.064	0.957
			${\hat{β}}_{P}$	0.514	0.184	0.190	0.034	0.970	−0.529	0.334	0.322	0.113	0.949
			${\hat{β}}_{C}$	0.496	0.116	0.120	0.013	0.954	−0.496	0.206	0.207	0.042	0.954
			${\hat{β}}_{o p t}$	0.488	0.111	0.115	0.012	0.955	−0.493	0.196	0.199	0.038	0.960
		30%	${\hat{β}}_{I}$	0.424	0.138	0.141	0.025	0.916	−0.422	0.243	0.246	0.065	0.941
			${\hat{β}}_{P}$	0.509	0.204	0.208	0.042	0.962	−0.534	0.371	0.357	0.138	0.941
			${\hat{β}}_{C}$	0.467	0.124	0.126	0.017	0.947	−0.473	0.217	0.219	0.048	0.957
			${\hat{β}}_{o p t}$	0.446	0.113	0.116	0.016	0.927	−0.460	0.198	0.201	0.041	0.953
200	200	15%	${\hat{β}}_{I}$	0.475	0.162	0.163	0.027	0.945	−0.469	0.280	0.284	0.079	0.947
			${\hat{β}}_{P}$	0.516	0.172	0.164	0.030	0.931	−0.508	0.272	0.278	0.074	0.955
			${\hat{β}}_{C}$	0.501	0.121	0.120	0.015	0.940	−0.490	0.199	0.206	0.040	0.961
			${\hat{β}}_{o p t}$	0.490	0.115	0.115	0.013	0.951	−0.488	0.191	0.198	0.037	0.956
		30%	${\hat{β}}_{I}$	0.420	0.157	0.157	0.031	0.914	−0.420	0.273	0.275	0.081	0.941
			${\hat{β}}_{P}$	0.516	0.188	0.179	0.035	0.938	−0.517	0.309	0.307	0.095	0.952
			${\hat{β}}_{C}$	0.481	0.130	0.128	0.017	0.929	−0.475	0.217	0.221	0.048	0.958
			${\hat{β}}_{o p t}$	0.456	0.116	0.118	0.015	0.928	−0.464	0.202	0.204	0.042	0.943
150	250	15%	${\hat{β}}_{I}$	0.478	0.192	0.188	0.037	0.944	−0.453	0.330	0.328	0.111	0.939
			${\hat{β}}_{P}$	0.508	0.144	0.145	0.021	0.963	−0.512	0.245	0.246	0.060	0.946
			${\hat{β}}_{C}$	0.500	0.119	0.119	0.014	0.949	−0.495	0.200	0.203	0.040	0.950
			${\hat{β}}_{o p t}$	0.493	0.116	0.115	0.014	0.943	−0.491	0.194	0.196	0.038	0.949
		30%	${\hat{β}}_{I}$	0.422	0.187	0.182	0.041	0.918	−0.410	0.323	0.318	0.112	0.933
			${\hat{β}}_{P}$	0.504	0.156	0.158	0.024	0.959	−0.512	0.272	0.270	0.074	0.952
			${\hat{β}}_{C}$	0.483	0.125	0.127	0.016	0.955	−0.482	0.215	0.218	0.047	0.950
			${\hat{β}}_{o p t}$	0.465	0.118	0.119	0.015	0.937	−0.470	0.203	0.205	0.042	0.959

Open in a new tab

With an increased censoring rate of 30%, we find some bias for ${\hat{β}}_{I}$ , where only the incident cohort data are used. A similar trend was observed in the original simulation studies on ${\hat{β}}_{I}$ conducted by Chen and Cheng. ¹⁰ In the simulation results under a censoring rate of 30%, we observe that the estimators ${\hat{β}}_{C}$ and ${\hat{β}}_{o p t}$ are less biased and more efficient than ${\hat{β}}_{I}$ . Therefore, combining information from the prevalent cohort data with that from the incident cohort data is desirable, especially under heavy censoring rates.

5 |. APPLICATION

The Nun Study, introduced in Section 1, has been conducted to examine risk factors for the progression of dementia, with a cohort of 678 members of the School Sisters of Notre Dame religious congregation who were 75 years of age or older and recruited between 1991 and 1993. ¹ Each participant received an assessment of her cognitive and physical function near-annually up to 10 years. At each examination, the participant’s cognitive status was recorded as one of the five following states: cognitively intact for age, cognitive deficit that does not affect activities of daily living, cognitive deficit in one or more activities of daily living, clinical dementia, and death. Covariates such as age at each exam, presence of the apolipoprotein E-e4 allele (APOE4), and the level of education were collected.

To illustrate the proposed estimation method, we use the combined cohort data, which consist of 501 subjects with complete data from the Nun Study. Among them, 153 incident and 77 prevalent cases were used in the analysis. In the data, the exact time of death was recorded if it occurred before the last follow-up. If a subject did not die by the last examination, her survival time was censored. Among the incident cases, 29 (19%) subjects were right censored; and only two (2.6%) were right censored among the prevalent cases. The overall censoring rate was as low as 13.5%. For the incident cohort, the data include the survival time from dementia diagnosis until death or the censoring event. When a subject was assessed as clinically demented at one of the annual examinations, we assumed that dementia occurred in the middle of two consecutive examinations. However, for the prevalent cohort data, we only have the information that the subject was demented prior to enrollment; hence, the backward recurrence time is missing. Instead, we have the forward recurrence times from study enrollment until death or the censoring event for the prevalent cohort. We considered two covariates of interest: the level of education and the presence of the genetic risk factor APOE4. The distribution of the covariates are summarized in Table 3 by each cohort and for the combined cohorts.

TABLE 3.

Distribution of risk factors by cohort.

Variable	Incident only (n₁ = 153)	Prevalent only (n₂ = 77)	Combined cohorts (n = 230)
APOE4
Presence	39 (25%)	29 (38%)	68 (30%)
Absence	114 (75%)	48 (62%)	162 (70%)
EDCAT
College and higher	134 (87%)	49 (64%)	183 (80%)
Others	19 (13%)	28 (36%)	47 (20%)

Open in a new tab

We conducted regression analyses to estimate the effects of the educational level and APOE4 on the mean residual survival time under the PMRL model (1). The analyses were carried out using the incident cohort only, the prevalent cohort only, and the combined cohort data with optimal weights. In the analysis of the incident cohort data, the support of the censoring distribution is greater than that of the survival distribution, which satisfies the assumption for the method using only the incident cohort. ¹⁰ The estimated distributions of the survival time and the censoring time are provided as Figure S2 in the web-based supplementary materials. We present the results in Table 4 . None of the estimated regression parameters were found to be significantly associated with the mean residual survival time, which is consistent with the findings in the literature. Qiu et al. ¹⁷ and Helmer et al. ¹⁸ showed that educational level was not significantly correlated with the mortality of subjects who had dementia, while a lower level of education was found to be associated with higher risk of dementia in other studies. ¹⁹ Mez et al. ²⁰ suggested that the incidence of dementia may mediate the effect of APOE4 on mortality, given that both APOE4 and dementia are high risk factors for decreased survival times among older adults. Thus, among subjects diagnosed with dementia,

TABLE 4.

Regression analysis under the proportional mean residual life model using data from incident cohort only, prevalent cohort only, and combined cohorts. Estimated parameter (Est) and standard error (SE).

	Incident only		Prevalent only		Combined cohorts

	Est	SE	Est	SE	Est	SE
APOE4
(presence=1, absence=0)	0.279	0.154	−0.278	0.245	0.148	0.131
EDCAT
(college and higher=1, others=0)	−0.231	0.197	−0.470	0.253	−0.287	0.155

Open in a new tab

APOE4 has not been found to be a significant risk factor for death.

Under the assumption that the incident and prevalent cohorts are from the same population, we can examine the proportional means assumption by checking the proportional hazards assumption using the prevalent cohort. We confirmed that the assumption is reasonable: the p-values are 0.66 and 0.19 for the presence of APOE4 and the level of education, respectively; and 0.39 for the global test. However, it should be noted that the model diagnostic test may have low power. Another assumption is that data observed in the prevalent cohort are subject to length bias (i.e., the incidence of dementia follows a stationary Poisson process). However, the information about the time from dementia onset to enrollment is missing for the prevalent cohort data, which hinders one from checking this assumption. As an alternative, we can compute the dementia incidence rate using the incident cohort only data, provided that the two cohorts are from the same population. The incidence rate was fairly constant over the follow-up period, with no specific trend. Hence, the stationarity assumption is reasonable for our application.

6 |. CONCLUSION

In observational studies, prevalent samples are commonly collected along with the incident cohort from a single study population. Combining data from incident and prevalent cohorts can substantially improve efficiency and ensure robustness of the estimators when assessing the risk factors’ effects on survival times. This is an efficient way of utilizing the data because the combined cohort data are usually available at no additional cost. While statistical methods for the analysis of the combined cohort data would make invaluable contributions to many studies, such methods are limited in the literature.

In this paper, we assume the PMRL model for the target population. One advantage of assuming such a model is that it directly leads to the PH model on the forward recurrence times for the prevalent cohort. Hence, we can use the conventional survival method for the incident cohort under the PMRL model. For the prevalent cohort data, which has a nonstandard structure with missing backward recurrence times, we can use the PH model without additional assumptions or extra effort. Thus, the proposed estimation method involves two estimating functions that are constructed differently for data from the incident and prevalent cohorts.

In the estimating function for the combined data (9), we only use data from the incident cohort to derive the consistent estimator ${\hat{m}}_{0} (t; β)$ for $m_{0} (t)$ . To estimate the baseline mean function m₀(t) more efficiently, one may consider combining the data om the prevalent cohort. Based on equation (2), m₀(t) is the inverse of the baseline hazard function of the forward recurrence time, $λ_{0}^{υ} (t)$ . Hence, a naive approach is to estimate the inverse of $λ_{0}^{υ} (t)$ based on the Nelson–Aalen estimator for the cumulative hazard ⁰function. However, the estimated baseline cumulative hazar⁰d function is nonsmooth and results in a noisy estimator for $λ_{0}^{υ} (t)$ as in conventional survival analyses, which leads to an unstable estimation of $m_{0} (t)$ . As an alternative, one may adopt the kernel smoothing method to estimate the baseline hazard function, $λ_{0}^{υ} (t)$ .²¹ A major drawback of applying the smoothing method is that the choice of bandwidth, which is crucial, involves computationally intensive procedures. Further studies on combining data for more efficient estimation of $m_{0} (t)$ are of interest.

Subjects were examined periodically in the Nun Study. While the dates of death were accurately recorded, the onset of dementia is only known to occur within time intervals (i.e., interval censored). In our application, we adopted a simple approach by assuming that the event occurred in the middle of the interval since that was not the focus of the current paper. Further research that tackles the issue of interval censoring is certainly warranted.

Supplementary Material

NIHMS1014550-supplement-2.pdf^{(72.6KB, pdf)}

ACKNOWLEDGMENTS

The work was partially supported by the U.S. National Institutes of Health through grants CA193878 and CA016672. The Nun Study data reported in this article were collected from the SMART project (AG386561). The authors also acknowledge the Texas Advanced Computing Center at The University of Texas at Austin for providing HPC resources that contributed to the research results reported within this paper.

APPENDIX

A. LARGE SAMPLE PROPERTIES OF THE ESTIMATORS

A.1. Regularity conditions

(C1) Given any $X = x, Pr (T^{0} < C / x) > 0$ ; and given any $X^{υ} = x, \Pr (V^{0} < C^{υ} / x) > 0$ .

(C2) The parameter space of β is a compact subset of $ℝ^{p}$ , and the true parameter value β* is in the interior of the parameter space.

(C3) The true baseline mean function $m_{0}^{*} (t)$ is continuously differentiable on [0, τ].

(C4) A p × 1 vector of covariates X is bounded by some constant, and not contained in a (p − 1)-dimensional hyperplane.

(C5) $A_{I} = \int_{0}^{τ} E [{X - μ_{X} (t)}^{\otimes 2} S^{*} (t / X) \exp (- β^{* T} X)]$ dt is nonsingular, where µ_X(t) is the limit of $\bar{X} (t)$ as $n_{1} \to \infty$ and $S^{*} (t / X) = \Pr (T > t / X)$ .

(C6) $A_{P} = \int_{0}^{τ} [\frac{s^{(2)} (β^{*}, t)}{s^{(0)} (β^{*}, t)} - {e (β^{*}, t)}^{\otimes 2}] s^{(0)} (β^{*}, t) λ_{0}^{υ} (t)$ dt is positive definite, where $s^{(k)} (β, t)$ is the limit of $S^{(k)} (β, t)$ for k = 0,1, and 2, and $e (β, t)$ is the limit of $ε (β, t)$ as $n_{2} \to \infty$ .

A.2. Asymptotic properties of ${\hat{β}}_{I}$

The asymptotic properties of the estimator ${\hat{β}}_{I}$ have been established in the appendix of Chen and Cheng. 10 Here, we briefly outline the results. Given that ${\hat{m}}_{0} (t; β^{*})$ converges to $m_{0}^{*} (t)$ almost surely, we have

n_{1}^{1 / 2} U_{I} (β^{*}) = n_{1}^{- 1 / 2} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} (X_{i} - \bar{X} (t) - \frac{E [S (t / X) {X - μ_{X} (t)}]}{E {S (t / X)}}) m_{0}^{*} (t) d M_{i} (t) + o_{P} (1) .

Thus $n_{1}^{1 / 2} U_{I} (β^{*})$ converges weakly to a normal distribution with mean zero and covariance matrix

\sum_{I} = \int_{0}^{τ} E [{X - μ_{X} (t)}^{\otimes 2} S^{*} (t / X) m_{0}^{*} (t) {\exp (- β^{* T} X) d t + d m_{0}^{*} (t)}] .

Provided that $\partial {\hat{m}}_{0} (t, β^{*}) / \partial β= - m_{0}^{*} (t) μ_{X} (t) + o_{P} (1)$ , it is shown that $\partial U_{I} (β^{*}) / \partial β$ converges in probability to A_I, is defined in (C5). By applying the Taylor series expansion, one can show that $n_{1}^{1 / 2} U_{I} (β^{*}) = {- \partial U_{I} (β^{*}) / \partial β} n_{1}^{1 / 2} ({\hat{β}}_{I} - β) + o_{P} (1)$ . Hence, it follows that $n_{1}^{1 / 2} ({\hat{β}}_{I} - β)$ converges weakly to a normal distribution with mean zero and covariance matrix $A_{I}^{- 1} \sum_{I} A_{I}^{- 1}$ .

A.3. Asymptotic properties of ${\hat{β}}_{P}$

We establish the asymptotic properties of ${\hat{β}}_{P}$ following the large sample studies conducted by Andersen and Gill ²² for conventional survival data. Let $M_{i}^{υ} (t) = N_{i}^{υ} (t) - \int_{0}^{t} λ_{i}^{υ} (s) d s$ , where $λ_{i}^{υ} (t) = λ_{0}^{υ} (t) \exp (- β^{┬} X_{i})$ . We can represent the estimating function $U_{P} (β)$ evaluated at β* as follow:

n_{2}^{1 / 2} U_{P} (β^{*}) = n_{2}^{- 1 / 2} \sum_{i = 1}^{n_{2}} \int_{0}^{τ} {X_{i}^{υ} - ε (β^{*}, t)} d M_{i}^{υ} (t) + o_{P} (1) .

The distribution of $n_{2}^{1 / 2} U_{P} (β^{*})$ is asymptotically normal with mean zero and covariance matrix

\sum_{P} = \int_{0}^{τ} [\frac{s^{(2)} (β^{*}, t)}{s^{(2)} (β^{*}, t)} - {e (β^{*}, t)}^{\otimes 2}] s^{(0)} (β^{*}, t) λ_{0}^{υ} (t) d t .

By the Taylor series expansion, $n_{1}^{1 / 2} U_{P} (β^{*}) = {- \partial U_{P} (β^{*}) / \partial β} n_{2}^{1 / 2} ({\hat{β}}_{P} - β^{*}) + o_{P} (1)$ . Note that $\partial U_{P} (β^{*}) / \partial β$ converges in probability to A_P, which is defined in (C6). Thus, $n_{2}^{1 / 2} ({\hat{β}}_{P} - β^{*})$ asymptotically follows a normal distribution with mean zero and covariance matrix $A_{P}^{- 1} Σ_{P} A_{P}^{- 1}$ .

A.4. Proofs of Theorem 3.1

Given the optimal weights $W_{1} = A_{I} Σ_{I}^{- 1}$ and $W_{2} = A_{P} Σ_{P}^{- 1}$ , we rewrite the estimating function $U_{o p t} (β)$ in summations of i.i.d. vectors, as follows.

n^{1 / 2} U_{o p t} (β) = \sqrt{\frac{n_{1}}{n}} A_{I} \sum_{I}^{- 1} \frac{1}{\sqrt{n_{1}}} \sum_{i = 1}^{n_{1}} \int_{0}^{τ} (X_{i} - \bar{X} (t) - \frac{E [S (t / X) {X - μ_{X} (t)}]}{E {S (t / X)}}) m_{0}^{*} (t) d M_{i} (t) + \sqrt{\frac{n_{2}}{n}} A_{P} \sum_{P}^{- 1} \frac{1}{\sqrt{n_{2}}} \sum_{i = 1}^{n_{2}} \int_{0}^{τ} {X_{i}^{υ} - ε (β, t)} d M_{i}^{υ} (t) + o_{P} (1) .

Based on the asymptotic properties of ${\hat{β}}_{I}$ and ${\hat{β}}_{P}$ , it follows that $n^{1 / 2} U_{o p t} (β^{*})$ is asymptotically normal with mean zero and covariance matrix $Σ = ρ A_{I} Σ_{I}^{- 1} A_{I} + (1 - ρ) A_{P} Σ_{P}^{- 1} A_{P}$ . This is straightforward because the incident and prevalent cohorts are independent.

By the Taylor series expansion of $U ({\hat{β}}_{o p t})$ around β*, we have

n^{1 / 2} ({\hat{β}}_{o p t} - β^{*}) = {- \frac{\partial}{\partial β} U_{o p t} (\bar{β})}^{- 1} n^{1 / 2} U_{o p t} (β^{*}),

where $\bar{β}$ is on the line segment between ${\hat{β}}_{opt}$ and β*, and

\frac{\partial}{\partial β} U_{o p t} (\bar{β}) = \frac{n_{1}}{n} A_{I} \sum_{I}^{- 1} \frac{\partial}{\partial β} U_{I} (\bar{β}) + \frac{n_{2}}{n} A_{P} \sum_{P}^{- 1} \frac{\partial}{\partial β} U_{P} (\bar{β}) .

We can easily show that $\partial U_{o p t} (\bar{β}) / \partial β$ converges in probability to $ρ A_{I} Σ_{I}^{- 1} A_{I} + (1 - ρ) A_{P} Σ_{P}^{- 1} A_{P}$ , which is equal to Ʃ. Therefore, $n^{1 / 2} ({\hat{β}}_{o p t} - β^{*})$ is asymptotically normal with mean zero and covariance $Ω_{o p t} = Σ^{- 1} Σ Σ^{- 1} = Σ$ .

Denote an arbitrarily small neighborhood of β* as ß Following the arguments in Chen and Cheng,¹⁰ $\Pr ({\hat{β}}_{o p t} \in B) = 1$ because $U_{o p t} (β^{*}) \to 0$ can be extended to any β ϵ ß under the regularity conditions on uniform convergence. Thus, ${\hat{β}}_{o p t}$ is consistent with β*.

Given the consistency of Â_I(β), Â_P(β), ${\hat{Σ}}_{I} (β)$ , and ${\hat{Σ}}_{P} (β)$ , and assuming that Ʃ_I and Ʃ_P are nonsingular, we can show that the estimators of the optimal weights ${\hat{W}}_{1} = {\hat{A}}_{I} ({\hat{β}}_{C}) {{\hat{Σ}}_{I} ({\hat{β}}_{C})}^{- 1}$ and ${\hat{W}}_{2} = {\hat{A}}_{P} ({\hat{β}}_{C}) {{\hat{Σ}}_{P} ({\hat{β}}_{C})}^{- 1}$ converge in probability to $A_{I} Σ_{I}^{- 1}$ and $A_{P} Σ_{P}^{- 1}$ , respectively, where ${\hat{β}}_{C}$ is a consistent estimator of β*

References

1.Snowdon DA, Greiner LH, Mortimer JA, Riley KP, Greiner PA, Markesbery WR. Brain infarction and the clinical expression of Alzheimer disease. The Nun Study. JAMA 1997;277:813–817. [PubMed] [Google Scholar]
2.Tyas SL, Salazar JC, Snowdon DA, et al. Transitions to mild cognitive impairments, dementia, and death: findings from the Nun Study. Am J Epidemiol 2007;165:1231–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Yu L, Tyas SL, Snowdon DA, Kryscio RJ. Effects of ignoring baseline on modeling transitions from intact cognition to dementia. Comput Stat Data Anal 2009;53:3334–3343. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Yu L, Griffith WS, Tyas SL, Snowdon DA, Kryscio RJ. A nonstationary Markov transition model for computing the relative risk of dementia before death. Stat Med 2010;. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Wei S, Xu L, Kryscio RJ. Markov transition model to dementia with death as a competing event. Comput Stat Data Anal 2014;80:78–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Wei S, Kryscio RJ. Semi-Markov models for interval censored transient cognitive states with back transitions and a competing risk. Stat Methods Med Res 2016;25:2909–2924. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Oakes D, Dasu T. A note on residual life. Biometrika 1990;77:409–410. [Google Scholar]
8.Addona V, Wolfson DB. A formal test for the stationarity of the incidence rate using data from a prevalent cohort study with follow-up. Lifetime Data Anal 2006;12:267–284. [DOI] [PubMed] [Google Scholar]
9.Asgharian M, Wolfson DB, Zhang X. Checking stationarity of the incidence rate using prevalent cohort survival data. Stat Med 2006;25:1751–1767. [DOI] [PubMed] [Google Scholar]
10.Chen YQ, Cheng S. Semiparametric regression analysis of mean residual life with censored survival data. Biometrika 2005;92:19–29. [Google Scholar]
11.Chen YQ, Jewell NP, Lei X, Cheng SC. Semiparametric estimation of proportional mean residual life model in presence of censoring. Biometrics 2005;61:170–178. [DOI] [PubMed] [Google Scholar]
12.Cox DR. Renewal Theory London: Methuen; 1962. [Google Scholar]
13.Maguluri G, Zhang CH. Estimation in the mean residual life regression model. J R Stat Soc Series B Stat Method 1994;56:477–489. [Google Scholar]
14.Bai F, Huang J, Zhou Y. Semiparametric inference for the proportional mean residual life model with right-censored length-biased data. Stat Sin 2016;26:1129–1158. [Google Scholar]
15.Chan KCG, Chen YQ, Di CZ. Proportional mean residual life model for right-censored length-biased data. Biometrika 2012;99:995–1000. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Chaganty NR, Joe H. Efficiency of generalized estimating equations for binary responses.. J R Stat Soc Series B Stat Method 2004;66:851–860. [Google Scholar]
17.Qiu C, Bäckman L, Winblad B, Agüero-Torres H, Fratiglioni L. The influence of education on clinically diagnosed dementia incidence and mortality data from the Kungsholmen Project. Arch Neurol 2001;58:2034–2039. [DOI] [PubMed] [Google Scholar]
18.Helmer C, Joly P, Letenneur L, Commenges D, Dartigues JF. Mortality with dementia: results from a French prospective community-based cohort. Am J Epidemiol 2001;154:642–648. [DOI] [PubMed] [Google Scholar]
19.Sharp ES, Gatz M. The relationship between education and dementia: an updated systematic review. Alzheimer Dis Assoc Disord 2011;25:289–304. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Mez J, Marden JR, Mukherjee S, et al. Alzheimer’s disease genetic risk variants beyond APOEe4 predict mortality. Alzheimers Dement 2017;doi: 10.1016/j.dadm.2017.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Wells MT. Nonparametric kernel estimation in counting processes with explanatory variables. Biometrika 1994;81:759– 801. [Google Scholar]
22.Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Stat 1982;10:1100–1120. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS1014550-supplement-2.pdf^{(72.6KB, pdf)}

[R1] 1.Snowdon DA, Greiner LH, Mortimer JA, Riley KP, Greiner PA, Markesbery WR. Brain infarction and the clinical expression of Alzheimer disease. The Nun Study. JAMA 1997;277:813–817. [PubMed] [Google Scholar]

[R2] 2.Tyas SL, Salazar JC, Snowdon DA, et al. Transitions to mild cognitive impairments, dementia, and death: findings from the Nun Study. Am J Epidemiol 2007;165:1231–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Yu L, Tyas SL, Snowdon DA, Kryscio RJ. Effects of ignoring baseline on modeling transitions from intact cognition to dementia. Comput Stat Data Anal 2009;53:3334–3343. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Yu L, Griffith WS, Tyas SL, Snowdon DA, Kryscio RJ. A nonstationary Markov transition model for computing the relative risk of dementia before death. Stat Med 2010;. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Wei S, Xu L, Kryscio RJ. Markov transition model to dementia with death as a competing event. Comput Stat Data Anal 2014;80:78–88. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Wei S, Kryscio RJ. Semi-Markov models for interval censored transient cognitive states with back transitions and a competing risk. Stat Methods Med Res 2016;25:2909–2924. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Oakes D, Dasu T. A note on residual life. Biometrika 1990;77:409–410. [Google Scholar]

[R8] 8.Addona V, Wolfson DB. A formal test for the stationarity of the incidence rate using data from a prevalent cohort study with follow-up. Lifetime Data Anal 2006;12:267–284. [DOI] [PubMed] [Google Scholar]

[R9] 9.Asgharian M, Wolfson DB, Zhang X. Checking stationarity of the incidence rate using prevalent cohort survival data. Stat Med 2006;25:1751–1767. [DOI] [PubMed] [Google Scholar]

[R10] 10.Chen YQ, Cheng S. Semiparametric regression analysis of mean residual life with censored survival data. Biometrika 2005;92:19–29. [Google Scholar]

[R11] 11.Chen YQ, Jewell NP, Lei X, Cheng SC. Semiparametric estimation of proportional mean residual life model in presence of censoring. Biometrics 2005;61:170–178. [DOI] [PubMed] [Google Scholar]

[R12] 12.Cox DR. Renewal Theory London: Methuen; 1962. [Google Scholar]

[R13] 13.Maguluri G, Zhang CH. Estimation in the mean residual life regression model. J R Stat Soc Series B Stat Method 1994;56:477–489. [Google Scholar]

[R14] 14.Bai F, Huang J, Zhou Y. Semiparametric inference for the proportional mean residual life model with right-censored length-biased data. Stat Sin 2016;26:1129–1158. [Google Scholar]

[R15] 15.Chan KCG, Chen YQ, Di CZ. Proportional mean residual life model for right-censored length-biased data. Biometrika 2012;99:995–1000. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Chaganty NR, Joe H. Efficiency of generalized estimating equations for binary responses.. J R Stat Soc Series B Stat Method 2004;66:851–860. [Google Scholar]

[R17] 17.Qiu C, Bäckman L, Winblad B, Agüero-Torres H, Fratiglioni L. The influence of education on clinically diagnosed dementia incidence and mortality data from the Kungsholmen Project. Arch Neurol 2001;58:2034–2039. [DOI] [PubMed] [Google Scholar]

[R18] 18.Helmer C, Joly P, Letenneur L, Commenges D, Dartigues JF. Mortality with dementia: results from a French prospective community-based cohort. Am J Epidemiol 2001;154:642–648. [DOI] [PubMed] [Google Scholar]

[R19] 19.Sharp ES, Gatz M. The relationship between education and dementia: an updated systematic review. Alzheimer Dis Assoc Disord 2011;25:289–304. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Mez J, Marden JR, Mukherjee S, et al. Alzheimer’s disease genetic risk variants beyond APOEe4 predict mortality. Alzheimers Dement 2017;doi: 10.1016/j.dadm.2017.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Wells MT. Nonparametric kernel estimation in counting processes with explanatory variables. Biometrika 1994;81:759– 801. [Google Scholar]

[R22] 22.Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Stat 1982;10:1100–1120. [Google Scholar]

PERMALINK

Analysis of combined incident and prevalent cohort data under a proportional mean residual life model

Chi Hyun Lee

Jing Ning

Richard J Kryscio

Yu Shen

Summary

1 |. INTRODUCTION

2 |. NOTATIONS AND MODEL

3 |. ESTIMATION METHODS

3.1. |. Estimation for Incident Cohort

3.2 |. Estimation for Prevalent Cohort

3.3 |. Estimation Using the Combined Cohorts

4 |. SIMULATION STUDY

TABLE 1.

TABLE 2.

5 |. APPLICATION

TABLE 3.

TABLE 4.

6 |. CONCLUSION

Supplementary Material

ACKNOWLEDGMENTS

APPENDIX

A. LARGE SAMPLE PROPERTIES OF THE ESTIMATORS

A.1. Regularity conditions

A.2. Asymptotic properties of ${\hat{β}}_{I}$

A.3. Asymptotic properties of ${\hat{β}}_{P}$

A.4. Proofs of Theorem 3.1

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Analysis of combined incident and prevalent cohort data under a proportional mean residual life model

Chi Hyun Lee

Jing Ning

Richard J Kryscio

Yu Shen

Summary

1 |. INTRODUCTION

2 |. NOTATIONS AND MODEL

3 |. ESTIMATION METHODS

3.1. |. Estimation for Incident Cohort

3.2 |. Estimation for Prevalent Cohort

3.3 |. Estimation Using the Combined Cohorts

4 |. SIMULATION STUDY

TABLE 1.

TABLE 2.

5 |. APPLICATION

TABLE 3.

TABLE 4.

6 |. CONCLUSION

Supplementary Material

ACKNOWLEDGMENTS

APPENDIX

A. LARGE SAMPLE PROPERTIES OF THE ESTIMATORS

A.1. Regularity conditions

A.2. Asymptotic properties of β^I

A.3. Asymptotic properties of β^P

A.4. Proofs of Theorem 3.1

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

A.2. Asymptotic properties of ${\hat{β}}_{I}$

A.3. Asymptotic properties of ${\hat{β}}_{P}$