Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2016 Nov 22;17(2):350–363. doi: 10.1093/biostatistics/kxv042

Sieve estimation in a Markov illness-death process under dual censoring

Audrey Boruvka 1,*, Richard J Cook 1
PMCID: PMC5963425  PMID: 26598559

Abstract

Semiparametric methods are well established for the analysis of a progressive Markov illness-death process observed up to a noninformative right censoring time. However, often the intermediate and terminal events are censored in different ways, leading to a dual censoring scheme. In such settings, unbiased estimation of the cumulative transition intensity functions cannot be achieved without some degree of smoothing. To overcome this problem, we develop a sieve maximum likelihood approach for inference on the hazard ratio. A simulation study shows that the sieve estimator offers improved finite-sample performance over common imputation-based alternatives and is robust to some forms of dependent censoring. The proposed method is illustrated using data from cancer trials.

Keywords: Cox model, Interval censoring, Method of sieves, Profile likelihood, Progression-free survival

1. Introduction

Vital status for individuals in a clinical trial is often readily available. Detection of non-fatal events requires closer surveillance, which can prove difficult and costly to maintain over time. As a result survival times are subject to right censoring, but the occurrence of intermediate events may be right-censored earlier or interval-censored between assessments. In general, we refer to this scenario as dual censoring. Various forms of dual censoring arise in trials involving tumor progression. Guidelines call for the analysis of so-called time to progression (TTP), coinciding with detection of progression, or progression-free survival (PFS), given by the earliest of TTP and death (FDA, 2007). TTP is typically right-censored at death or the preceding (negative) assessment, which induces dependent censoring. PFS is thus deemed preferable to TTP (FDA, 2007, p. 8), but this outcome is subject to systematic imputation.

Multistate models have been suggested as a more natural framework for assessing treatment effects on progression and death. A chain of events model (Figure 1, left), for example, is useful for settings in which progression always precedes death (Frydman, 1995b). Semicompeting risks (Figure 1, middle) have been proposed for the case where death may precede progression (Hu and Tsodikov, 2014). Xu and others (2010) observe that semicompeting risks essentially amount to the progressive illness-death model (Fix and Neyman, 1951; Figure 1, right), which is fully specified by the state-transition intensity functions.

Fig. 1.

Fig. 1.

Multistate alternatives to TTP and PFS: chain of events (left), semicompeting risks (middle), and progressive illness-death (right) models.

Among the three state-transition structures, methods to deal with specific instances of dual censoring are most developed for the illness-death model. Frydman (1995a) considers the nonparametric maximum likelihood estimator (NPMLE) from interval-censored progression times with known progression status. This is generalized by Frydman and Szarek (2009) to account for unknown status, which often arises when the last assessment is negative and long precedes right-censoring or death. Bebchuk and Betensky (2001) combine local likelihood and multiple imputation to estimate transition intensities under progression times right-censored before death. Joly and others (2002) propose spline-based penalized likelihood for the (Cox, 1972) proportional hazards model for an interval-censored variant of this observation scheme. Jackson (2011) considers a piecewise exponential analog by way of time-dependent covariates.

These works recognize that progression and death are observed in different ways, but the broader problem of dual censoring has not yet been considered. Methods for time-to-event endpoints that leave any dependence on time unspecified are generally preferred in practice. However, non- and semi-parametric maximum likelihood estimators require the locations of support for the distribution of each transition time, and these are ambiguous whenever the progression status is unknown. To address these issues, we develop a sieve estimator for a multistate extension of the Cox model and compare its numerical performance with routine analysis of imputation-based PFS under a variety of censoring scenarios.

2. Dual censoring of the progressive illness-death process

Let Inline graphic be a one-jump counting process representing the transition from state Inline graphic to state Inline graphic (Inline graphic) in the progressive illness-death model and Inline graphic be the corresponding transition time. So Inline graphic is the time to progression, Inline graphic is the time to progression-free death, and Inline graphic is the time of death following progression. Over the observation period Inline graphic, Inline graphic, suppose that the survival time Inline graphic is observed up to a right censoring time Inline graphic, Inline graphic, but progression status Inline graphic is not necessarily known for all Inline graphic, Inline graphic. For example, progression may be right-censored at some random time preceding Inline graphic. Alternatively, progression status could assessed periodically, leading to interval censoring.

Whatever the form of this inspection process, we presume that it yields a potential censoring interval Inline graphic for the progression time Inline graphic. We say “potential” because we may not know with certainty that Inline graphic. Put Inline graphic to denote whether or not the survival time is observed. Let Inline graphic whenever progression status is known to be negative at Inline graphic and Inline graphic otherwise. Similarly, let Inline graphic indicate that progression status is known to be positive at Inline graphic. So Inline graphic denotes that, based on the available data, we are certain Inline graphic for some Inline graphic. Otherwise either Inline graphic, indicating that Inline graphic, or progression status is unknown at Inline graphic. If the status is unknown, then Inline graphic and we cannot rule out the possibility that either Inline graphic or Inline graphic.

2.1. Example: Bone lesions and their complications

Dual right-censored data are encountered in cancer trials evaluating the effect of bisphosphonates on bone metastases and their complications, known as skeletal-related events (SREs). The time of an SRE is often self-evident, but can otherwise be measured accurately through frequent clinic visits, so SREs are typically considered subject only to right censoring. Growth of new or existing bone lesions is assessed by radiographic surveys, which are carried out less frequently. This results in interval-censored lesion progression times. Standard practice is to evaluate SREs and lesions as separate endpoints, as SREs provide the most direct measure of clinical benefit. Time-to-event analysis of either outcome is complicated by the fact that the mortality rate is non-negligible. Use of PFS can circumvent this issue. However, since the treatment is intended to manage symptoms rather than prolong survival, the measured effect on PFS will likely underestimate any symptom benefit. The illness-death model offers an alternative that isolates the effect of interest.

A dual-censored observation from this multistate process with first SRE as the intermediate event is illustrated in the top panel of Figure 2, where we know that no SREs occurred within an initial loss to follow-up time and that the subject survived at least up to final right-censoring time Inline graphic. We cannot rule out the possibility that progression may have occurred between these two times, so Inline graphic. The censoring interval Inline graphic here is indeed “potential” from the observed data because in reality, the subject did not experience any SREs. The PFS endpoint has no standard definition in this setting. Practitioners might simply discard all data collected after the initial loss to follow-up time so that PFS is right-censored early. Alternatively, the negative progression status at this earlier right censoring time could be carried forward to Inline graphic, giving PFS with a form of last observation carried forward (LOCF) imputation.

Fig. 2.

Fig. 2.

Top: A dual right-censored observation in which progression, coinciding here with first SRE, status is unknown at the last observation time Inline graphic. Bottom: A dual-censored observation in which lesion progression status is observed to be positive, but the progression time is known only up to the interval Inline graphic.

The bottom panel of Figure 2 considers lesion progression rather than SRE. Here a new lesion developed some time between the first and second radiographic surveys, which gives a censoring interval Inline graphic that captures the progression time with certainty: Inline graphic and Inline graphic. Loss to follow-up occurs after the second survey but before death, giving Inline graphic and Inline graphic. Guidelines suggest imputing PFS to the time at which progression is first detected, carrying forward the last known progression status to death, and sensitivity analysis to examine variations on this imputation scheme (FDA, 2007, Appendix 2).

Let Inline graphic be the at-risk process for any transition out of state Inline graphic, so that Inline graphic if state Inline graphic is occupied at time Inline graphic and Inline graphic otherwise. Define the Inline graphic transition probability Inline graphic, with Inline graphic and Inline graphic. Suppose that the observation scheme renders the complete data Inline graphic coarsened at random in the sense of Heitjan and Rubin (1991). Then the likelihood of a dual-censored observation Inline graphic is

2.1. (2.1)

where Inline graphic is the transition intensity process or instantaneous transition probability at time Inline graphic. Whatever model we choose for the transition intensity function Inline graphic, the likelihood a priori maximizes to infinity; Inline graphic Inline graphic can be made arbitrarily large at any time we observe Inline graphic exactly. The usual way out is to replace Inline graphic by the jump discontinuity Inline graphic in the cumulative transition intensity function Inline graphic. However, consider an individual with unknown progression status Inline graphic and known survival time Inline graphic. Surely, we need Inline graphic to increase at Inline graphic, but the observed data are insufficient to jointly estimate Inline graphic and Inline graphic. Nonparametric maximum likelihood will assign mass to at least one of the two potential transition times, but the manner in which support is allocated is subject to bias. Since the so-called risk set for Inline graphic transitions is empty at Inline graphic, the likelihood can be increased appreciably by allocating more mass to potential or observed support for the distribution of Inline graphic early in the observation period. So the initial increments in the NPMLE for Inline graphic will tend to be large in finite samples. All of these difficulties can be mitigated by maximizing the likelihood with respect to a sieve—a finite-dimensional approximation to Inline graphic whose size increases with Inline graphic. Such an approach is generally known as the method of sieves (Grenander, 1981).

3. Method of sieves for dual-censored data

Here a sieve is defined for a given random sample Inline graphic, Inline graphic, of dual-censored observations. Each element of a sieve corresponds to a piecewise parametric cumulative intensity function Inline graphic defined on a data-driven partition of the observation period Inline graphic. Let Inline graphic and Inline graphic respectively denote the set of left- and right-endpoints from the collection of known censoring intervals Inline graphic. Define Inline graphic as the set of right-endpoints from the maximal intersections (Wong and Yu, 1999) for Inline graphic; that is, the set of Inline graphic from Inline graphic such that Inline graphic, Inline graphic, and Inline graphic is either Inline graphic or Inline graphic for every Inline graphic. In addition, let Inline graphic, for Inline graphic, denote the set of exactly observed terminal event times Inline graphic with known progression status and non-empty Inline graphic risk set at Inline graphic.

From Frydman (1995a, Theorem 1), the NPMLE for Inline graphic based on the subsample with known progression status (Inline graphic) can be uniquely defined as the discrete maximizer concentrating its support on Inline graphic, Inline graphic. This implies that Inline graphic should, at minimum, have support on Inline graphic. To ensure that Inline graphic and Inline graphic are jointly estimable, the sieve partition must not isolate any Inline graphic with Inline graphic and unknown progression status Inline graphic. We can achieve this by defining Inline graphic on a partition Inline graphic, such that Inline graphic for Inline graphic, Inline graphic, Inline graphic, and every subinterval Inline graphic contains at least one point from Inline graphic. Here Inline graphic is a tuning parameter that determines the rate at which the sieve or partition size Inline graphic increases with Inline graphic.

For consistency, we need Inline graphic as Inline graphic. This can be met if the true parameter Inline graphic is sufficiently smooth and increasing on Inline graphic and the distribution of the inspection times has positive support on Inline graphic for some small Inline graphic. We express these and other assumptions throughout this paper more precisely in Appendix A of the supplementary material available at Biostatistics online, but essentially this means Inline graphic must be dense in Inline graphic as Inline graphic. Such a requirement is stronger than the ones imposed by Joly and others (2002) and Frydman and Szarek (2009), which allow for unobservable terminal event times with negative progression status: Inline graphic. A consequence of this is that the support for the distribution of Inline graphic is not apparent from the available data, so imposing at least a weakly parametric model for Inline graphic is needed to achieve consistency.

Apart from the location of support points, estimation from dual-censored data poses two additional challenges: (i) inference under convergence rates possibly slower than the parametric rate, as encountered with various forms of interval censoring, and (ii) consideration of Inline graphic that depends arbitrarily on aspects of the event history, such as the duration in state 1. We avoid these complications by considering a sieve estimator for the Cox model with fixed covariates. This permits inference on the familiar hazard ratio via Murphy and van der Vaart's (2000) profile likelihood theory and, barring the standard Markov assumption, puts little restriction on any dependence with time. A variety of extensions or alternatives could be considered, but we adopt this model as a starting point.

4. Sieve estimation of the Cox model

Sieve estimators have been previously proposed for interval-censored survival data. Huang and Rossini (1997) examine the proportional odds model. Zhang and others (2010) devise a spline-based sieve for the Cox model. Our setting is complicated by multiple event types and censoring schemes, but these works provide a useful basis for extension. Herein assume that each Inline graphic has cumulative intensity function

4. (4.1)

where Inline graphic is an Inline graphic transition-type-specific Inline graphic-vector based on the fixed covariate Inline graphic, Inline graphic is a regression parameter, and Inline graphic is now a nondecreasing cumulative baseline Inline graphic transition intensity function. Note that the parameter Inline graphic is common to each transition type, but Inline graphic can be suitably constructed from Inline graphic to give type-specific covariate effects (Andersen and Borgan, 1985, pp. 478–480). For example, if we wish to estimate the effect of the scalar covariate Inline graphic on each transition type separately, we may put Inline graphic, Inline graphic, Inline graphic, and Inline graphic. The effect of Inline graphic on the risk of the Inline graphic, Inline graphic, and Inline graphic transitions then corresponds to Inline graphic, Inline graphic, and Inline graphic, respectively.

Under (4.1) the cumulative Inline graphic transition intensity process Inline graphic depends on only the current state occupied and thus satisfies the Markov property Inline graphic. So Inline graphic and Inline graphic, for any Inline graphic (e.g. (Andersen and Borgan, 1985, Theorem II.6.7)). Let Inline graphic denote the likelihood of an observation Inline graphic given by (2.1), evaluated under these transition probabilities. Then the sieve maximum likelihood estimator (SMLE), Inline graphic, corresponds to the maximizer of the log-likelihood function Inline graphic over Inline graphic. The sieve Inline graphic is defined by its piecewise parametric family and partition Inline graphic. For a sufficiently large partition size, one would not anticipate Inline graphic to be particularly sensitive to the parametric form on the subintervals. This general notion is demonstrated in the survival case by Huang and Rossini (1997). We defer a discussion on selection of the partition to the end of Section 4.2. The remainder of this section describes the estimation scheme, with illustrations for the piecewise exponential sieve. Numerical results for this sieve are examined in Sections 5 and 6.

4.1. Parameter estimation

Suppose that the cumulative baseline intensity functions from a given sieve Inline graphic are specified by the (finite-dimensional) parameter Inline graphic. The piecewise exponential sieve, for example, is characterized by the piecewise constant values taken by the cumulative intensity function. These range through positive values in Inline graphic-space. In general, let Inline graphic denote the parameters specifying Inline graphic for Inline graphic and Inline graphic. Then the SMLE satisfies the score equations Inline graphic. These can be solved using the following self-consistency algorithm, which is akin to the routine outlined by Frydman (1995b) under the null model with Inline graphic fixed at zero and no dual censoring.

Step 4.1 —

For Inline graphic and Inline graphic, define Inline graphic as a partition of Inline graphic in which each subinterval contains Inline graphic points from Inline graphic. Set Inline graphic, Inline graphic, and Inline graphic to some “neutral” value that ensures Inline graphic is increasing. For example, with the piecewise exponential sieve Inline graphic.

Step 4.2 —

Find a candidate increment Inline graphic. For Inline graphic, apply the Newton-Raphson method: Inline graphic. Obtain Inline graphic via the self-consistency equations (Turnbull, 1976) that result from re-arranging the score equation Inline graphic to give a recursive expression for Inline graphic. For the piecewise exponential sieve, this is loosely:

graphic file with name M202.gif

where the conditional expectations are evaluated under Inline graphic and Inline graphic. These are equal to one if Inline graphic provides Inline graphic exactly. A precise expression for this ratio is provided in Appendix D of the supplementary material available at Biostatistics online.

Step 4.3 —

Increment Inline graphic and Inline graphic by Inline graphic and Inline graphic, respectively, with Inline graphic the smallest nonnegative integer ensuring no decrease in the log-likelihood. This gives Inline graphic. If, for some small positive value Inline graphic, Inline graphic then stop. Otherwise, set Inline graphic to Inline graphic and return to Step 4.2.

For each Inline graphic the log-likelihood is concave in Inline graphic, which implies that the Newton-Rapshon method yields a profile maximizer for Inline graphic. Similar properties are not readily available for Inline graphic. So the score equations may neither uniquely characterize the SMLE nor identify global maxima. Multiple (local) maxima may be detected with different starting values and examination of the profile log-likelihood. Our experience thus far has uncovered rare instances where the increment-halving procedure in Step 4.3 reduces the first and only candidate increment to its starting value. In this narrow form of local maxima, the algorithm could be initialized with starting values based on imputed data.

4.2. Variance estimation

In Appendices B and C of supplementary material available at Biostatistics online, we show that if the Inline graphicth (Inline graphic or 2) derivative of Inline graphic is continuous, positive, and bounded on Inline graphic and some regularity conditions hold, then the SMLE Inline graphic converges to the truth Inline graphic at the rate Inline graphic with Inline graphic. However, Inline graphic achieves the semiparametric efficiency bound. Both the limiting distribution of Inline graphic and interval estimation for Inline graphic remain as open problems.

Holding Inline graphic fixed in the self-consistency algorithm described in Section 4.1 evaluates the profile log-likelihood needed to estimate standard error for Inline graphic under Murphy and van der Vaart (2000, Corollary 3), which gives an approximation to the curvature in the profile log-likelihood at Inline graphic akin to numerical differentiation. This entails successively perturbing the entries in Inline graphic by a chosen value Inline graphic. The data-driven procedure outlined in Boruvka and Cook (2015, Section 6) reduces the choice to specifying typical Inline graphic and extreme Inline graphic (absolute) values for any given entry in Inline graphic.

Estimation thus entails setting a number of parameters—namely the sieve constants Inline graphic, sieve rate Inline graphic, typical and large values for Inline graphic, and the threshold Inline graphic. The Inline graphic achieving the fastest rate of convergence is Inline graphic, although better finite sample properties may be obtained with a larger sieve. In practice, we set Inline graphic to the (presumed) asymptotically optimal value for discrete inspection processes and closer to Inline graphic under dual right censoring. We have not formally investigated performance for different values of Inline graphic, but this could be set to some positive value invariant to Inline graphic that represents the presumed degree of non-linearity in Inline graphic relative to the other cumulative transition intensity functions. Empirical motivation for these choices is provided in Section 5, but further study is warranted. Our experience suggests that estimates are not particularly sensitive to the choice of the remaining parameters provided that Inline graphic is moderately valued, Inline graphic is relatively large, and Inline graphic is sufficiently small. In the simulation studies described below, we set Inline graphic, Inline graphic, and Inline graphic to ensure convergence within a reasonable number of iterations over the censoring schemes and sample sizes considered.

5. Simulation study

Numerical properties of the piecewise exponential SMLE were investigated for right- and interval-censored variants of dual censoring. In both cases, we considered the same model with cumulative Inline graphic, Inline graphic, transition intensity Inline graphic, where Inline graphic, Inline graphic, Inline graphic, Inline graphic uniform on Inline graphic, Inline graphic the product of Inline graphic and the Inline graphic transition type indicator, Inline graphic and Inline graphic. Here Inline graphic influences only the exit time from initial state 0 and its effect is the same for each transition type. However, neither of these properties were assumed in estimating Inline graphic. Throughout Inline graphic was right censored by the fixed time Inline graphic representing study closure. Under these fixed parameters, roughly 56% of subjects in the sample progressed (Inline graphic), 12% were event-free at Inline graphic Inline graphic, and 16% survived to study closure (Inline graphic).

The censoring scheme acting on the progression status is described in Sections 5.1 and 5.2 below, where we summarize findings from 10 000 Monte Carlo replicates of the sample sizes Inline graphic under four general scenarios: (1) independent dual censoring, (2) independent dual censoring with increased censoring of progression, (3) conditionally independent dual censoring given Inline graphic, and (4) dependent dual censoring. The sieve parameters were held fixed at Inline graphic, Inline graphic for dual right-censored data, and Inline graphic for interval-censored progression. The first scenario was revisited with alternative values for Inline graphic.

In each scenario, we also considered estimates of the Cox model obtained by some combination of early right censoring, mid- or right-endpoint imputation the progression time, or carrying the last negative progression status forward to the final right censoring time or death (LOCF). Details on these alternatives and referenced displays can be found in Appendices E and F of supplementary material available at Biostatistics online.

5.1. Dual right censoring

To obtain dual right-censored data, an early censoring time Inline graphic was generated by Inline graphic with probability Inline graphic, Inline graphic. Otherwise Inline graphic followed some distribution with Inline graphic. This gave a dual right censoring scheme in which Inline graphic coincides with administrative censoring and Inline graphic is a dropout time taking place earlier in the observation period. The four scenarios were, respectively, specified as: (1) Inline graphic, where Inline graphic is exponential-distributed with mean Inline graphic, Inline graphic and Inline graphic; (2) Inline graphic, Inline graphic, Inline graphic and Inline graphic; (3) Inline graphic, Inline graphic, Inline graphic and Inline graphic; and (4) Inline graphic, where Inline graphic follows Inline graphic truncated to Inline graphic, Inline graphic and Inline graphic. The rates of exact observation, singly right-censored and doubly right-censored data were roughly 70, 15, and 15%, respectively, under Scenarios 1 and 3. Under Scenario 2, the censoring rates were Inline graphic15 and 40%. In Scenario 4, these were 25 and 20%.

The SPMLE based on “singly” right-censored data was also considered for three alternative outcomes given, respectively, by the observed transition times right-censored by Inline graphic, PFS right-censored at Inline graphic, and PFS with LOCF imputation under exactly observed survival times with unknown progression status. These two variants of PFS are depicted in Figure 2.

Table E.1 of supplementary material available at Biostatistics online summarizes performance in estimating Inline graphic. Results for the SMLE support the asymptotic properties stated in Appendices B and C of supplementary material available at Biostatistics online with average bias generally diminishing in larger samples, average standard error estimates nearing the Monte Carlo sample standard deviations, and empirical coverage probabilities of the 95% confidence intervals at or near the nominal level. The SPMLE from right-censored data at Inline graphic shows higher variability and lower bias under independent censoring (Scenarios 1–3). Under dependent censoring (Scenario 4), the SPMLE has larger finite-sample bias. The SPMLE for PFS right-censored at Inline graphic also performed relatively well under independent censoring; however, its regression coefficient is defined on the basis of the restrictive assumption that Inline graphic. The PFS variant incorporating LOCF imputation is clearly biased under independent censoring, particularly when the rate of dual censoring is higher. LOCF imputation fared better in Scenario 4. This is not surprising since Inline graphic often closely preceded Inline graphic in this dependent censoring scheme.

Figure E.1 of supplementary material available at Biostatistics online depicts the pointwise average and percentiles of the SMLE Inline graphic under Scenarios 1–4 with Inline graphic. Estimates appear unbiased, with the exception of overestimates for Inline graphic early in the observation period. Results under Inline graphic and 500 (not shown here or the supplement) indicate that bias and variability decreases with increasing sample size, but are otherwise similar. The SPMLE obtained by right censoring observations at Inline graphic demonstrate little to no bias under independent censoring. This is however not the case under the dependent censoring scheme of Scenario 4. Estimates for Inline graphic are clearly biased (Figure E.2 of supplementary material available at Biostatistics online), with the pointwise Inline graphicth percentiles consistently smaller than the truth.

From Table E.2 of supplementary material available at Biostatistics online, it is apparent that the largest sieve size (Inline graphic) achieves the smallest finite-sample bias with little to no increase in variability compared with the sieves under Inline graphic and Inline graphic. A larger sieve, sample size or degree of dual censoring increased computational demands for estimation, but the routine we implemented typically converged within a few seconds in all settings considered (Table E.3 of supplementary material available at Biostatistics online).

5.2. Interval-censored progression times

To generate interval-censored progression times, status was inspected on the basis of Inline graphic “scheduled” visits, evenly spaced on Inline graphic. “Actual” visit times followed Inline graphic independent normal distributions centered at the scheduled times with common standard deviation Inline graphic and truncated at zero, Inline graphic, and the midpoints between consecutive scheduled times. So the inspection times were continuously distributed on Inline graphic with greater density around the scheduled targets. This setup is similar to the one in Zeng and others (2015); however, here the spread of the inspection times better cover Inline graphic so that we can reasonably expect the SMLE to be consistent over the observation period.

Under the independent censoring schemes of Scenarios 1–3, every inspection after the first was missed with probability Inline graphic, where Inline graphic. Dependent censoring in Scenario 4 was obtained by discarding inspections taking place after Inline graphic. In all scenarios, the last observation time Inline graphic offered one further inspection of progression status with a fixed probability of Inline graphic. Parameters in each scenario were set to: (1) Inline graphic, Inline graphic and Inline graphic; (2) Inline graphic, Inline graphic and Inline graphic; (3) Inline graphic, Inline graphic and Inline graphic; and (4) Inline graphic and Inline graphic follows Inline graphic truncated to Inline graphic. With Inline graphic and Inline graphic, the probability of a missing inspection was Inline graphic, irrespective of Inline graphic. Under Inline graphic and Inline graphic, the probability remained the same for subjects with Inline graphic. With Inline graphic, inspections were two times more likely to be missed. Under Scenario 4, Inline graphic can be interpreted as a dropout time closely preceding death. In Scenarios 1 and 3, progression status was known by Inline graphic in just over half of the sample. For Scenarios 2 and 4 this rate was 44 and 40%, respectively. In all four scenarios, the rate was 20% among progression-free subjects, so status was known more often among subjects who progressed.

We also fit the SPMLE to two forms of singly right-censored data. The first arises by midpoint-imputing progression times if progression status is known to be positive, as depicted in the lower panel of Figure 2; otherwise, the negative progression status is carried forward to Inline graphic. The second is the guideline-based definition for PFS, given by the earliest of progression detection, death, and right-censoring at Inline graphic.

From Table F.1 of supplementary material available at Biostatistics online numerical results for the SMLE Inline graphic support the asymptotic properties in Appendices B and C with bias generally decreasing with increasing sample size, average standard error estimates reasonably approximating the Monte Carlo sample standard deviations, and empirical coverage probabilities of the 95% confidence intervals close to the nominal level. The SPMLE based on midpoint- and LOCF-imputed data and PFS had, on average, larger finite-sample bias. Bias in both of these estimators generally did not diminish with increasing sample size.

Pointwise average and percentiles of the SMLE Inline graphic are depicted in Figure F.1 of supplementary material available at Biostatistics online. The SMLE overestimates increments in Inline graphic early in the observation period. This pattern persists across scenarios and sample sizes, but the bias decreases with larger Inline graphic. The imputation-based SPMLE for Inline graphic is clearly biased (Figure F.2 of supplementary material available at Biostatistics online), with the degree of bias largest under survival-dependent interval censoring (Scenario 4). Imputation-based estimates for survivor and hazard functions typically exhibit a step pattern according to the density of the inspection times, as noted by Panageas and others (2007). For Inline graphic, this artifact of the observation scheme is a persistent departure from the true shape of the cumulative intensity function.

On average, a smaller sieve with Inline graphic achieved increased bias and similar variability compared with Inline graphic (Table F.2 of supplementary material available at Biostatistics online). A larger sieve (Inline graphic) gave similar variability, but did not always yield an improvement in average bias. Parameter estimation under interval censoring is more computationally demanding than under dual right-censored data, with average processing times about 100 times slower than those seen in Section 5.1 (Table F.3 of supplementary material available at Biostatistics online).

6. Applications

We return to the examples of Section 2.1, which demonstrate two variants of dual censoring—one arising from loss to follow-up for SREs and the other from periodic assessment for lesion progression. Data were obtained from similarly designed trials where SREs were recorded at clinic visits every 3 weeks and lesion progression was diagnosed on the basis of radiographic surveys every 3–6 months. Actual assessment times roughly followed this schedule, but with enough variation to justify use of the sieve. Since the assessment times were largely determined by a prespecified schedule, one might guess that coarsening at random assumption is plausible. However, loss to follow-up for both SREs and lesion progression occurred due to treatment discontinuation and death, although discontinuation rates were similar in the treatment groups. The simulation study offers some reassurance that the SMLE performs relatively well under survival dependent loss to follow-up. Another consideration is the plausibility of the Markov proportional hazards assumption, though one could argue that this model offers an adequate tool for detecting a difference in the risk of progression between treatment groups. Further investigation of the SMLE's requirements as they relate to the study design and features of the data is warranted, but out of scope for a simple demonstration of the proposed estimator.

6.1. Dual right censoring: Skeletal-related events

Rosen and others (2001) reported that two bisphosphonates, zolendronic acid and pamidronate disodium, showed equivalent efficacy and safety in preventing SREs among patients with breast cancer and multiple myeloma. This conclusion was partly drawn from the evaluation of time to the first SRE within 9 months of randomization in an international trial of 1600 patients. Here we evaluate time to first SRE and death via an illness-death model among the trial's North American breast cancer cohort. Within this subsample of 777 patients, the available trial data provide SREs up to 30 months following randomization. The majority of patients died during this period, so observation of SREs typically ceased earlier.

Under this three-state, outcome Inline graphic was observed exactly in just over one-third of the sample. Incomplete transition times and known progression status was observed for 28% of the patients. Almost 15% had unknown progression status but exact survival time, leaving the remaining 23% of the sample dual censored. Table 1 and Figure 3 give the SMLE under Inline graphic (Inline graphic) and Inline graphic. Estimates obtained from both smaller and larger sieves provide similar results, with changes in Inline graphic Inline graphic0.009. Also depicted is the SPMLE obtained by discarding any observations after the initial right censoring time. The same conclusion can be drawn from both approaches; under the assumed Markov illness-death process, any influence of zoledronic acid on the risk of bone interventions and death is not significantly different from that of pamidronate. The two methods diverge in estimating Inline graphic (Figure 3). Since patients near death would presumably be unable to attend clinic visits, early right censoring likely yields underestimates. This may explain why the SPMLE for Inline graphic is substantially smaller.

Table 1.

Regression coefficients for zolendronic acid versus pamidronate specific to first SRE, Inline graphic, death without SRE, Inline graphic, and death following SRE, Inline graphic

Early-censored
LOCF
SMLE
SPMLE
PFS PFS
Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Estimate 0.02 0.16 Inline graphic0.01 0.02 0.16 Inline graphic0.01 0.03 0.06
SE 0.11 0.19 0.15 0.11 0.28 0.14 0.10 0.09
Inline graphic-value 0.89 0.40 0.94 0.86 0.58 0.97 0.78 0.51
HR 95% LCL 0.82 0.80 0.74 0.82 0.67 0.75 0.85 0.89
UCL 1.26 1.72 1.32 1.26 2.04 1.32 1.24 1.27

Fig. 3.

Fig. 3.

The SMLE (solid) and early-censored SPMLE (dotted) for the cumulative baseline transition intensity functions between study entry (state 0), first SRE (state 1), and death (state 2).

6.2. Interval-censored progression times: Lesion progression

Hortobagyi and others (1996) showed that pamidronate reduced SREs in a placebo-controlled trial of 380 breast cancer patients with bone metastases. Lesion progression was considered as a secondary outcome. This was assessed using radiographic surveys scheduled at 3- to 6-month intervals over the course of follow-up, rendering the time to lesion progression interval-censored. Surveys were carried out up to 30 months after randomization, but over half of the patients died during this observation period. To account for interval censoring and the occurrence of death, we analyze lesion progression and survival as an illness-death process. Both the progression status and survival time was observed in 28% of the sample. An additional 13% had known progression status but right-censored survival time. In the remaining subjects, right-censoring (11%) or survival (48%) took place long after the last (negative) radiographic survey, resulting in unknown progression status. We defined “long after” as more than 6 weeks, which enabled us to carry forward recent lesion status to the last observation time. Similar results were obtained by carrying forward fewer weeks. This narrow form of LOCF imputation is problematic, but can be avoided when it is possible to randomly assess progression at death.

Table 2 suggests that pamidronate had no influence on mortality, but there is evidence that the bisphosphonate reduces the risk of lesion progression. Based on the SMLE with Inline graphic and Inline graphic, an individual treated with pamidronate had 0.68 (95% CI 0.47–0.97) times the rate of progression versus a patient who received placebo. Results obtained under different sieve sizes were similar. The SPMLE from midpoint- and LOCF-imputed data did not detect any significant treatment effect. The difference between the SMLE and SPMLE for the cumulative transition intensities is large (Figure 4) and likely indicative of bias due to imputation, considering the simulation results in Figure F.2 of supplementary material available at Biostatistics online.

Table 2.

Regression coefficients for pamidronate versus control specific to lesion progression, Inline graphic, death without lesion progression, Inline graphic, and death following lesion progression, Inline graphic

Imputation-based
SMLE
SPMLE
PFS
Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Estimate Inline graphic0.39 Inline graphic0.04 Inline graphic0.05 Inline graphic0.23 Inline graphic0.11 Inline graphic0.03 -0.17
SE 0.18 0.21 0.20 0.17 0.14 0.21 0.11
Inline graphic-value 0.03 0.85 0.82 0.20 0.42 0.88 0.12
HR 95% LCL 0.47 0.64 0.65 0.57 0.68 0.65 0.68
UCL 0.97 1.45 1.41 1.12 1.18 1.45 1.04

Fig. 4.

Fig. 4.

The SMLE (solid) and imputation-based SPMLE (dotted) for the cumulative baseline transition intensity functions between study entry (state 0), lesion progression (state 1), and death (state 2).

7. Discussion

This paper examined dual censoring and its challenges for semiparametric maximum likelihood estimation. Methods for special cases of dual-censored data have been previously developed, but the issue of support finding and the resulting imperative for smoothing has not been granted much attention. Our proposed estimator addresses the problem in a general manner, using a model familiar to practitioners. The result gives a multistate alternative to PFS that enables separate assessment of treatment effect on progression and survival without progression. A primary assumption of the maximum likelihood approach is that the observation scheme renders the underlying transition times coarsened at random. The simulation study shows that this requirement implies that the estimator is robust to survival-dependent censoring of progression provided that the censoring rate for survival is relatively low.

Supplementary material

Supplementary material is available at http://biostatistics.oxfordjournals.org.

Funding

This work was supported by a Discovery Grant from the Natural Sciences and Engineering Research Council of Canada (RGPIN 155849) and the Canadian Institutes of Health Research (FRN 13887). Richard Cook is a Canada Research Chair in Statistical Methods for Health Research.

Supplementary Material

Supplementary Data

Acknowledgments

The authors thank Novartis Pharmaceuticals for permission to use data from the bone metastases trials and Jerry Lawless for comments on an earlier version of this manuscript. Conflict of Interest: None declared.

References

  1. Andersen P. K., Borgan Ø. (1985). Counting process models for life history data: A review. Scandinavian Journal of Statistics 122, 97–158. [Google Scholar]
  2. Bebchuk J. D., Betensky R. A. (2001). Local likelihood analysis of survival data with censored intermediate events. Journal of the American Statistical Association 96, 449–457. [Google Scholar]
  3. Boruvka A., Cook R. J. (2015). A Cox-Aalen model for interval-censored data. Scandinavian Journal of Statistics 422, 414–426. [Google Scholar]
  4. Cox D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 342, 187–220. [Google Scholar]
  5. FDA (2007). Clinical trial endpoints for the approval of cancer drugs and biologics: Guidance for industry. Rockville, MD: Food and Drug Administration.
  6. Fix E., Neyman J. (1951). A simple stochastic model of recovery, relapse, death and loss of patients. Human Biology 233, 205–241. [PubMed] [Google Scholar]
  7. Frydman H. (1995a). Nonparametric estimation of a Markov ‘illness-death’ process from interval-censored observations, with application to diabetes survival data. Biometrika 824, 773–789. [Google Scholar]
  8. Frydman H. (1995b). Semiparametric estimation in a three-state duration-dependent Markov model from interval-censored observations with application to aids data. Biometrics 512, 502–511. [PubMed] [Google Scholar]
  9. Frydman H., Szarek M. (2009). Nonparametric estimation in a Markov ‘illness-death’ process from interval censored observations with missing intermediate transition status. Biometrics 651, 143–151. [DOI] [PubMed] [Google Scholar]
  10. Grenander U. (1981) Abstract Inference. New York: Wiley. [Google Scholar]
  11. Heitjan D. F., Rubin D. B. (1991). Ignorability and coarse data. Annals of Statistics 194, 2244–2253. [Google Scholar]
  12. Hortobagyi G. N., Theriault R. L., Porter L., Blayney D., Lipton A., Sinoff C., Wheeler H., Simeone J. F., Seaman J., Knight R. D.. and others (1996). Efficacy of pamidronate in reducing skeletal complications in patients with breast cancer and lytic bone metastases. New England Journal of Medicine 33524, 1785–1792. [DOI] [PubMed] [Google Scholar]
  13. Hu C., Tsodikov A. (2014). Joint modeling approach for semicompeting risks data with missing nonterminal event status. Lifetime Data Analysis 204, 563–583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Huang J., Rossini A. J. (1997). Sieve estimation for the proportional-odds failure-time regression model with interval censoring. Journal of the American Statistical Association 92439, 960–967. [Google Scholar]
  15. Jackson C. (2011). Multi-state models for panel data: The msm package for R. Journal of Statistical Software 388, 1–28. [Google Scholar]
  16. Joly P., Commenges D., Helmer C., Letenneur L. (2002). A penalized likelihood approach for an illness-death model with interval-censored data: Application to age-specific incidence of dementia. Biostatistics 33, 433–443. [DOI] [PubMed] [Google Scholar]
  17. Murphy S. A., van der Vaart A. W. (2000). On profile likelihood. Journal of the American Statistical Association 95450, 449–465. [Google Scholar]
  18. Panageas K. S., Ben-Porat L., Dickler M. N., Chapman P. B., Schrag D. (2007). When you look matters: The effect of assessment schedule on progression-free survival. Journal of the National Cancer Institute 996, 428–432. [DOI] [PubMed] [Google Scholar]
  19. Rosen L. S., Gordon D., Kaminski M., Howell A., Belch A., Mackey J., Apffelstaedt J., Hussein M., Coleman R. E., Reitsma D. J.. and others (2001). Zoledronic acid versus pamidronate in the treatment of skeletal metastases in patients with breast cancer or osteolytic lesions of multiple myeloma: A phase 3, double-blind, comparative trial. Cancer Journal 75, 377–387. [PubMed] [Google Scholar]
  20. Turnbull B. W. (1976). The empirical distribution function with arbitrarily grouped, censored and truncated data. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 383, 290–295. [Google Scholar]
  21. Wong G. Y. C., Yu Q. (1999). Generalized MLE of a joint distribution function with multivariate interval-censored data. Journal of Multivariate Analysis 692, 155–166. [Google Scholar]
  22. Xu J., Kalbfleisch J. D., Tai B. (2010). Statistical analysis of illness-death processes and semicompeting risks data. Biometrics 663, 716–725. [DOI] [PubMed] [Google Scholar]
  23. Zeng L., Cook R. J., Wen L., Boruvka A. (2015). Bias in progression-free survival analysis due to intermittent assessment of progression. Statistics in Medicine 3424, 3181–3193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Zhang Y., Hua L., Huang J. (2010). A spline-based semiparametric maximum likelihood estimation method for the Cox model with interval-censored data. Scandinavian Journal of Statistics 372, 338–354. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES