Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Apr 15.
Published in final edited form as: Clin Cancer Res. 2012 Jan 26;18(8):2301–2308. doi: 10.1158/1078-0432.CCR-11-2097

The Use and Interpretion of Competing Risks Regression Models

James J Dignam 1,2, Qiang Zhang 2, Maria N Kocherginsky 1
PMCID: PMC3328633  NIHMSID: NIHMS352464  PMID: 22282466

Abstract

Purpose

Competing risks observations, where patients are subject to a number of potential failure events, are a feature of most clinical cancer studies. With competing risks, several modeling approaches are available to evaluate the relationship of covariates to cause-specific failures. We discuss the use and interpretation of commonly used competing risks regression models.

Experimental Design

For competing risks analysis, the influence of covariate can be evaluated in relation to cause-specific hazard or on the cumulative incidence of the failure types. We present simulation studies to illustrate how covariate effects differ between these approaches. We then show the implications of model choice in an example from a Radiation Therapy Oncology Group (RTOG) clinical trial for prostate cancer.

Results

The simulation studies illustrate that, depending on the relationship of a covariate to both the failure type of principal interest and the competing failure type, different models can result in substantially different effects. For example, a covariate that has no direct influence on the hazard of a primary event can still be significantly associated with the cumulative probability of that event, if the covariate influences the hazard of a competing event. This is a logical consequence of a fundamental difference between the model formulations. The example from RTOG similarly shows differences in the influence of age and tumor grade depending on the endpoint and the model type used.

Conclusions

Competing risks regression modeling requires that one consider the specific question of interest and subsequent choice of the best model to address it.

Keywords: competing risks, cause-specific hazard, cumulative incidence, prognostic covariate

Introduction

Competing risks methods are common in biomedical research, particularly in cancer, where the need to deal with multiple potential outcomes is nearly ubiquitous. For example, competing risks are encountered when cancer patients are followed after treatment, and their first failure event may be local recurrence, distant metastases, onset of second primary cancer, or death precluding these events. Statistical analysis and interpretation of competing risks data differ from survival analysis with only a single cause of failure. First, appropriate methods must be applied in order to obtain a correct estimate the cumulative probability of each event (15). Secondly, when comparing cause-specific outcomes between groups, one must also consider the influence of competing risks and choose the test appropriate for the question of primary interest (68). For example, one may want to compare the cause-specific hazard of a given event type, which heuristically reflects the rate per unit of time of the event among those having not failed from other events, or the cumulative incidence of the event, which reflects the rate of the event as well as the influence of competing events.

In this article, we address these analyses in the context of regression modeling in the presence of competing risks, focusing on the two most commonly used approaches (9,10). As in the case of estimation and testing (8), the quantities produced by different competing risks models on the same data, generally referred to as ‘hazard ratios’, can differ substantially and may lead to different or even seemingly contradictory inferential conclusions. In fact, the method of summary and inference for competing risks data has occasionally been a source of debate and contention, primarily because each of the different approaches may only illuminate one important aspect of the data while possibly obscuring others (1114). Similarly, in the context of modeling, it has been correctly pointed out that effects of prognostic covariates in one model type do not necessarily pertain those that would be obtained in the other type of model (15,16). We argue that this does not diminish the usefulness of either approach, and the choice is rather a matter of the question of interest.

After briefly reviewing competing risks methods in the next section, we present simulation studies of competing risks models under various realistic scenarios. We then present an example from a clinical trial of the Radiation Therapy Oncology Group. Finally, we offer guidance for which modeling approach might be most appropriate depending on the question of interest.

Background

Competing Risks Data, Estimators, and Tests

Fundamentals of competing risks have been reviewed extensively elsewhere (15,8,16,17). Briefly, in competing risks data, an individual can potentially fail from any of several, say K, event types, but only the time to failure for the earliest (in time) of these (or the last follow-up time if no failure has yet occurred) is observed. Thus, the observations take the form of (T, δ), denoting a failure time T and a failure status indicator δ that records either the type of failure that occurred or indicates that no failure has yet occurred. Although only one event time is recorded, there is partial information on all failure types. For example, if the patient failed from δ = cause 1 at T=24 months, it is known that the patient has achieved 24 months free of failure from cause 2.

The principal identifiable quantity in competing risks observations is the cause-specific hazard function λk(t), which heuristically represents the probability of failure due to cause k at a moment in time, given that no failure of any kind has occurred thus far. The cumulative cause-specific hazard Λk(t) equals the cause-specific hazard summed from start of observation to time t. The cause-specific hazards for k mutually exclusive events are additive to the hazard of failure from any of the events, so that the quantity Λ(t) = Λk(t) + Λk(t) + … + Λk(t) is the cumulative hazard function for failures from any cause. The cumulative hazard function has corresponding survival function S(t) (probability of remaining event-free past time t) via the identity S(t)=exp(−Λ(t)).

Another important quantity is the cumulative probability of event k over time in the presence of other competing events. The cumulative incidence function Fk(t) for cause k is defined mathematically as

Fk(t)=Pr(failuretimeTt,cause=k)=0tS(u)λk(u)du

Fk(t) involves both the probability of having not failed from some other event first up to time t (represented by (S(u)) and the cause-specific hazard for the event of interest (λk(u)) at that time. At each time point, the K cumulative incidence functions are additive to the probability of failure from any cause, or 1 − S(t). The cumulative probability of event k occurring in the presence of competing risks is often incorrectly estimated by 1−Sk(t), where Sk(t) is calculated using the Kaplan-Meier estimator and treating events other than those due to cause k as censored. Many authors have discussed this issue specifically in the oncology context (1,35).

Tests to determine differences in survival functions between groups are most often formulated as tests comparing the underlying hazard functions, using some variant of the logrank test. For cause-specific hazards, the same testing approach is valid (9). Similarly, the most common method to compare cumulative incidence functions is carried out via an underlying quantity that is an analogue to the hazard for the cumulative incidence function, known as the subdistribution hazard (18). The subdistribution hazard can be thought of the probability of failure due to cause k at a moment in time, given that no cause k failures have occurred thus far (with those who have failed from other causes considered among those still event-free with respect to cause k). Several recent articles have compared these tests and discussed their use and interpretation (68).

Regression Models for Competing Risks Data

To summarize the effect of treatment and of patient or disease covariates in the competing risks setting, regression models can be used. As in the case of the estimators and tests just described, one must consider the metric on which to assess these covariate effects. Models for competing risks have been reviewed in detail elsewhere (15, 16), and so we briefly revisit these here.

Modeling cause-specific hazards

The familiar Cox proportional hazards model is readily applied to modeling cause-specific hazards (9). Denote the set of covariates by a vector Z. The model has the form

λk(t)=λ0k(t)exp(Zβ)

Here, the cause-specific hazard is a function of some unspecified ‘baseline’ cause-specific hazard and a set of covariates. Cause-specific hazard ratio (CHR) estimates from this model are largely interpreted in the same way as hazard ratios in the absence of competing risks. However, when there is dependence between failure types, the effects may reflect the influence of competing events, sometimes in a counterintuitive way (19,20). Also, as we will illustrate, if one models via cause-specific hazard, the covariate effects obtained do not necessarily pertain to the cumulative incidence of a given event type (3,10,15).

Modeling cumulative incidence

A similar formulation the Cox model for hazards can be used for a quantity known as the ‘subdistribution hazard’ (10). Heuristically, the ‘subdistribution hazard’ λ*k can be thought of as the hazard for an individual who either fails from cause k or does not, and in the latter case has an infinite failure time for cause k. While this may seem unusual, indeed in the case of mutually exclusive event types, those who fail from one cause are invulnerable to failure from other causes. This mathematical construct allows relating a covariate to the cumulative incidence of a specific event type using the model

λk(t)=λok(t)exp(Zβ)

which is similar to the Cox model in that covariates act to multiply the baseline subdistribution hazard in a time-independent manner. Interpretation of the subdistribution hazard ratio (SHR) is similar to that of the Cox model CHR, but there are additional important considerations that we discuss later.

A number of other approaches to regression modeling for cumulative incidence have been proposed. Because the cumulative incidence function is a function of the cause-specific hazards, and estimators can be ‘constructed’ from estimates of cause-specific failure hazards for the different events involved, one may opt to specify familiar Cox models for one or all of these (21,22). Parametric distributions for one or all of the cause-specific hazards can be used, and a number of such models have been proposed in the biomedical context (2325). In another approach, Klein and Andersen use generalized linear models, which typically are restricted to data without censoring (26). This is accomplished through creation of a baseline cumulative incidence function estimate based on ‘pseudo-values’, and subsequent specification of a regression model that relates covariates to cumulative incidence of the given event. Fine has generalized the original subdistribution hazards model to derive a model that has some similarities with this approach (27).

Simulation Study of Competing Risks Regression Models

To understand when the results of the modeling approaches may differ, we simulated competing risks observations and computed the estimators from the models. From a bivariate survival distribution (28) we generate the time to a designated primary event X1 and a competing event X2. We also generate an independent censoring time C. Then we obtain T = minimum(X1, X2, C) and an event type indicator (i.e., which time was the minimum) to form the competing risks observations. We compute the Cox CHR (9) and Fine-Gray SHR (10) for both event types. These estimates are averaged over a large number of simulated data sets to illustrate the behavior of the models under various scenarios.

Independent competing risks

For the following simulations (Table 1), correlation between failure times was set at zero, implying that failure hazards (and thus failure times) from the two causes are unrelated. In this instance, the cause-specific hazards that are estimable from the competing risks observations (time and event type of first failure) provide valid estimates for the ‘true’ hazards for each event type in the bivariate distribution (known as marginal hazards). However, independence of failure times is not likely the case for many biomedical data situations, and because at most one of the failure times is observed in actual competing risks data, dependence among times cannot be determined (29,30). This is the fundamental problem in competing risks that leads to the need for specialized methods.

Table 1.

Comparison of competing risks regression models under different competing risks scenarios, with 250 subjects per group. Competing risks data were simulated from a bivariate exponential failure distribution with the hazard parameters indicated and independent censoring that resulted in about 33% of lifetimes being censored. Averages of estimated parameters are based on 3000 simulated datasets.

Model Estimates (Group B/A ratio)
Specified Hazards Cox Fine and Gray
Scenario Group A Group B CHR 95% CI SHR 95% CI
I: No difference for either event
 Event 1 λ11 = 1.00 λ21 = 1.00 0.993 0.731–1.349 0.994 0.737–1.343
 Event 2 λ12 = 1.00 λ22 = 1.00 1.000 0.736–1.359 1.003 0.743–1.355
II: Event 1 rate lower in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 2.013 1.428–2.839 1.794 1.286–2.503
 Event 2 λ12 = 1.00 λ22 = 1.00 0.999 0.750–1.331 0.750 0.567–0.991
III: Both event rates lower in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 2.002 1.451–2.763 1.291 0.956–1.742
 Event 2 λ12 = 0.50 λ22 = 1.00 2.003 1.452–2.762 1.292 0.958–1.744
IV: Large and small effects in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 2.002 1.441–2.802 1.555 1.133–2.135
 Event 2 λ12 = 0.75 λ22 = 1.00 1.333 0.988–1.799 0.939 0.704–1.251

Scenario I

In the null case where hazards are identical between groups for both Event 1 and Event 2, both the CHR and SHR models estimate the effects as null (ratio = 1), as expected.

Scenario II

Only Event 1 is related to group membership, with the hazard rate in Group B equaling twice that of Group A. For the CHR model, the effects for Group B vs. A are as expected (CHR≈2.0 for Event 1 and CHR≈1.0 for Event 2). The SHR model gives a somewhat smaller group effect for Event 1 (SHR= 1.79). However, for Event 2, the PSH model indicates that Group B is less likely to fail (SHR = 0.75). This happens because, even though the Event 2 hazards are identical, fewer in Group A fail due to Event 1 and are therefore available to fail from Event 2.

Scenario III

Both events are strongly influenced by group, with both hazards twice as large in Group B vs. Group A. The CHR model estimates show these two–fold hazard ratios. However, the effect is attenuated again in the SHR model (SHR = 1.29), because for each event type, a large number of individuals fail from the competing event.

Scenario IV

Group A has substantially lower hazard for Event 1 and only moderately lower hazard for Event 2, and the CHR model shows these effects. The Event 1 SHR is attenuated while the Event 2 SHR is null. This is again due to the fact that the model reflects the influence of the competing event. With relatively more individuals available to fail from Event 2 in Group A (because they fail less from Event 1), the cumulative incidence of Event 2 (and thus the SHR) is similar between Group A and Group B.

Dependent competing risks

We next ran simulations with moderately high positive correlation of approximately 0.60 between event times (Table 2). That is, in a given individual, if the event 1 time was small, then the event 2 time would tend to also be small, and similarly for large failure times. In oncology, positive correlation between event types is plausible, as for example, patients likely to experience local-regional recurrence may also be at higher risk for distant metastases. Some specific results of interest are as follows.

Table 2.

Comparison of competing risks regression models under different competing risks scenarios, with 250 subjects per group. Competing risks data were simulated from a bivariate exponential failure distribution with dependence between failure times and hazard parameters indicated and independent censoring that resulted in about 33% of lifetimes being censored. Averages of estimated parameters are based on 3000 simulated datasets

Model Estimates (Group B/A ratio)
Specified Hazards Cox Fine and Gray
Scenario Group A Group B CHR 95% CI SHR 95% CI
I: No difference for either event
 Event 1 λ11 = 1.00 λ21 = 1.00 1.006 0.725–1.395 1.006 0.730–1.387
 Event 2 λ12 = 1.00 λ22 = 1.00 0.999 0.720–1.385 0.998 0.724–1.375
II: Event 1 rate lower in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 3.200 2.083–4.914 3.097 2.039–4.704
 Event 2 λ12 = 1.00 λ22 = 1.00 0.789 0.588–1.058 0.570 0.428–0.760
III: Both event rates lower in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 1.997 1.416–2.816 1.337 0.971–1.840
 Event 2 λ12 = 0.50 λ22 = 1.00 2.014 1.427–2.842 1.350 0.981–1.859
IV: Large and small effects in Group A
 Event 1 λ11 = 0.50 λ21 = 1.00 2.585 1.762–3.791 2.132 1.480–3.070
 Event 2 λ12 = 0.75 λ22 = 1.00 1.130 0.830–1.538 0.793 0.589–1.067

Scenario II

The estimated effects for the CHR model and SHR model for Event 1 are similar. For Event 2, both models show an effect in favor of Group B. Even though the actual Event 2 hazards from the bivariate distribution generating the data are identical between groups, the introduction of dependence between event times causes the group effect associated with Event 1 to become apparent in the cause-specific hazard estimates for Event 2 as well.

Scenario IV

In this case with a large effect for Event 1 and a smaller effect for Event 2 in Group A, the two models also tend to have more congruent estimates of effects than under independence.

In data analyses, where observing only the time to first event, we do not know the correlation between potential event times. Even when multiple events are observed, treatment usually changes after the first failure occurs. Thus, analyses are typically performed on time to first event, partitioned into causes, without incorporation of any correlation that might exist among failure times.

Amount of censoring

Since the cumulative incidence functions partition the cumulative probability of failing from any event into probabilities for the specific causes, more failures of one type will naturally result in fewer eventual total failures of the other types, in the case where all patients are observed until failure. However, as long as a sufficient proportion of patients are censored, this condition need not hold, and indeed one group may concurrently have greater failure probability for several competing events.

Table 3 shows a simulation study under Scenario II, where Group B has twice the hazard for Event 1 only. When there is no censoring, Group B has significantly lower cumulative incidence of Event 2 (SHR = 0.69), since more patients in this group have failed from Event 1. As the proportion censored increases (for example, if in a study the data were looked at earlier in time), this effect is less apparent. When 67% of patients are currently failure-free (i.e., censored), there is no significant excess of Event 2 failures in Group B. Thus, the proportion censored can influence the estimates obtained for models based on cumulative incidence. Note that the CHR estimates are not influenced by censoring (under dependence between event times, the cause-specific hazards can also be influenced).

Table 3.

Effect of the proportion of cases event-free (censored) on the estimate of the effect of failure from a competing event.

Event 2 Effect Estimates Event 1 Effect Estimates
Cox Fine-Gray Cox Fine-Gray
Scenario II: No event 2 hazard difference CHR 95% CI SHR 95% CI CHR 95% CI SHR 95% CI
% of cases censored
 0% 0.999 0.829–1.203 0.691 0.574–0.832 2.002 1.602–2.502 1.757 1.411–2.187
 10% 0.997 0.818–1.217 0.709 0.583–0.863 2.005 1.581–2.542 1.766 1.400–2.228
 25% 0.999 0.807–1.124 0.733 0.594–0.903 2.004 1.552–2.587 1.776 1.385–2.277
 50% 0.999 0.774–1.289 0.787 0.614–1.008 2.006 1.479–2.719 1.811 1.348–2.433
 67% 0.997 0.724–1.372 0.847 0.621–1.155 2.007 1.372–2.937 1.861 1.286–2.693
 85% 0.989 0.600–1.630 0.921 0.564–1.505 2.020 1.112–3.668 1.947 1.085–3.493

In practical terms, this situation can arises in long-term follow-up after successful treatment., For example, in our previous article, we showed that women who received tamoxifen had greater cumulative incidence of non-cancer deaths and second primary cancers (other than uterine cancer, which is significantly increased with tamoxifen use) after 12 years (13.7% vs. 10.1%) than those who received placebo (8). This observation alone does not imply a deleterious effect of tamoxifen with respect to these events, and the CHR for this endpoint indicated an 8% relative increase that was statistically nonsignificant (8).

Example from Clinical Oncology

Analysis by Causes of Death after Treatment for Localized Prostate Cancer

Radiation Therapy Oncology Group (RTOG) 8610 was a randomized trial comparing radiation therapy alone to radiation therapy plus androgen deprivation therapy (ADT) after curative surgery for localized prostate cancer (31). Men with prostate cancer may obtain benefit from ADT via reduction in risk of local recurrence and eventual distant metastases, which account for most prostate cancer deaths. On the other hand, depletion of hormones can have negative health consequences with respect to other diseases, thus having the potential to increase other cause mortality. Age at diagnosis is important with respect to both the growth behavior of the prostate tumor and risk for non-cancer deaths. A third important covariate is tumor cell differentiation classified into grades, which is strongly related to prostate cancer death but should not have any direct bearing on non-cancer deaths. Among 471 trial eligible men who participated in the trial, we examine treatment, age at diagnosis, and grade from the perspective of modeling the cause-specific hazard and the cumulative incidence (using Fine-Gray and Klein-Andersen models, using the software of Klein (32) for the latter) of prostate cancer and other-cause deaths.

The addition of hormones reduces the hazard of prostate cancer deaths (Table 4, A) by 33% (CHR =0.67). The influence of treatment on the cumulative incidence of prostate cancer death (via the subdistribution hazard) is similar in this case (SHR=0.66). The Klein-Andersen model estimate is also similar. For age at diagnosis, the cause-specific hazard model indicates a statistically non-significant 11% decrease in failure risk per 10 years of increased age (partly due to some nonlinearity in the age effect). However, the cumulative incidence regression indicates a statistically significant 25% reduction in cancer deaths per 10 years of increased age (SHR=0.75). This is likely largely due to the fact that with increasing age, risk of death from causes other than cancer increases greatly, even among cancer patients. Grade is strongly associated with prostate cancer death hazard and cumulative incidence, according to both the SHR and Klein-Andersen models.

Table 4.

Comparison of competing risks regression models examining treatment and two covariates for competing outcomes in prostate cancer (RTOG 8610)

Model Effect Estimates
Cox CSH Fine-Gray SDH Klein-Andersen
Event type (death) CHR 95% CI SHR 95% CI SHR 95% CI
A. Prostate Cancer
 ADT (vs RT only) 0.67 0.49–0.92 0.66 0.48–0.91 0.67 0.49–0.93
 Age* 0.89 0.71–1.13 0.75 0.60–0.95 0.79 0.63–1.00
 Grade 2 vs 1 1.84 1.04–3.23 1.83 1.05–3.17 1.87 1.06–3.31
 Grade 3 vs 1 2.87 1.66–4.98 2.83 1.65–4.87 2.94 1.70–5.08
B. Other causes
 ADT (vs RT only) 1.13 0.85–1.51 1.26 0.95–1.68 1.20 0.89–1.61
  Age 2.02 1.60–2.57 1.93 1.54–2.43 1.88 1.49–2.38
  Grade 2 vs 1 0.87 0.59–1.28 0.75 0.52–1.08 0.82 0.56–1.20
  Grade 3 vs 1 0.91 0.62–1.35 0.60 0.41–0.87 0.61 0.41–0.90
All deaths
ADT (vs RT only) 0.88 0.71–1.09 - - - -
 Age 1.36 1.15–1.61 - - - -
 Grade 2 vs 1 1.13 0.83–1.55 - - - -
 Grade 3 vs 1 1.44 1.06–1.97 - - - -
*

per 10 year increment in age

For other-cause deaths (Table 4, B), ADT is seen to nominally increase the hazard (CHR=1.13), while for the cumulative incidence model, a 26% greater risk is seen that approaches statistical significance. If indeed ADT reduces prostate cancer deaths, as clearly indicated in for that endpoint, then there will naturally accumulate more other-cause deaths among patients receiving it, leading to greater cumulative incidence of this event. This is an important consideration when attempting to infer the extent to which treatment is a causative factor in other-cause deaths. Age at diagnosis is similarly related to the cause-specific hazard of other-cause death. For tumor grade, a ‘protective’ effect with for high grade is suggested in the cumulative incidence models. However, this is most likely due to the strong association of this factor with prostate cancer death, and not with any direct influence on death from other causes.

An important additional issue we do not address in this paper is whether the general model form is appropriate on whichever scale one is working. In particular, the proportionality assumption cannot be simultaneously satisfied for both cause-specific hazards and subdistribution hazards, although often either model will still provide a reasonably informative summary of covariate effects (3335).

Discussion

In this article, we discuss two popular alternative methods to exploring the influence of covariates on outcomes when there are competing risks. We showed in simulation studies and data examples that the influence of a covariate on either a) the risk (hazard) of a specific event or b) the cumulative probability (incidence) of that event will differ for the same data. Thus, it is important to be aware of the outcome metric one is evaluating in relation to the covariate and interpret the result appropriately. Both approaches are correct for their respective purposes, and each may lend important information about the covariate.

Covariate effects in the cause-specific hazard model described here pertain to the event of interest only, without regard to how the covariates act on competing events (9). In fact, individuals failing from other causes simply become no longer at risk for failure and are thus treated essentially as censored observations. The extent to which this model characterizes the covariate’s influences solely on risk of the event of interest depends on how causes of failures may be inter-related, a quantity that is not directly observable in competing risks data. However, as the simulations show, even under moderate dependence between failure types, the cause-specific hazard ratio tends to reflect the influence of the covariate isolated to the event of interest, regardless of how that same covariate is related to competing events. On the other hand, cumulative incidence models measure the effect of the covariate on the specific event cumulative probability, and this covariate effect may derive from the direct effect on making the event of interest more or less likely to occur, or the indirect effect of making competing events more or less likely to occur. Thus, for example, patient age is related to avoidance of dying from prostate cancer through possibly less aggressive prostate disease as well as increased probability of simply dying of other causes first.

Given the choice of modeling approaches, one may ask when to use which? This depends on question(s) of interest, and in many cases both may be informative. Because it is desirable to establish that, at least in part, a covariate such as treatment group or a potentially prognostic marker are indeed acting on the event of interest, CHR models might be preferred for treatment or prognostic marker effect testing. Indeed, as the simulations show, when covariate effects are ‘shared’ among competing events, it may be the case that none achieves statistical significance when modeled on the cumulative incidence scale.

On the other hand, the effect of covariates for a given outcome, focusing largely on those who do not fail from other causes first, certainly does not yield the complete picture, and indeed can be misleading. For example, the effect of hormone therapy specifically on prostate cancer death is dramatic, but other-cause deaths are actually more common, and the effect of treatment on these events is null at best and has the potential to be deleterious. Thus, if one were interested in evaluating the ultimate benefit in terms of prostate cancer deaths avoided in a given population, then the model for cumulative incidence of prostate cancer more realistically models what treatment benefit is to be realized in a population. In our example, because the effect on other-cause death is essentially null, results for the two models were similar. However, other factors, such as age at diagnosis, have the potential to dramatically alter the ‘yield’ from such a treatment intervention, if say, it was applied in populations too old to benefit. Thus, for building prognostic nomograms or considering health economic effects of treatment, such those of interest in comparative effectiveness research, modeling on the cumulative incidence scale is of critical importance. These models give a better sense of the influences of treatment and other covariates on an absolute scale.

Competing risks methods have been widely adopted in biomedical research and oncology applications in particular, resulting in richer interpretation of the complex event history that cancer patients undergo. While methods continue to be developed to better answer the needs of researchers, we hope that this discussion is helpful in applying and interpreting numerous useful tools available at present and frequently seen in the oncology literature.

Statement of Translational Relevance.

Competing risks are common in clinical cancer research, as patients are subject to multiple potential failure events, both disease related and otherwise. In order to evaluate how treatment and patient or disease characteristics relate to these outcomes, competing risks regression models are used. Depending on which model is used, a distinctly different picture of the relationship of covariates to outcomes may be seen. It is important to choose a modeling approach that addresses the question of interest and subsequently interpret the results accordingly. Here, we provide examples and guidance for which model may be most appropriate depending on the specific goals and interests of the analysis.

Acknowledgments

Support for this research was partially provided by Public Health Service grants NCI P30-CA-14599 and RTOG U10 CA21661 from the National Cancer Institute, National Institutes of Health, US Department of Health and Human Services, and by Pennsylvania Department of Health 2009 Formula Grant 4100050889.

References

  • 1.Gelman R, Gelber R, Henderson IC, Coleman CN, Harris JR. Improved methodology for analyzing local and distant recurrence. J Clin Oncol. 1990;8:548–555. doi: 10.1200/JCO.1990.8.3.548. [DOI] [PubMed] [Google Scholar]
  • 2.Korn EL, Dorey FJ. Applications of crude incidence curves. Stat Med. 1992;11:813–829. doi: 10.1002/sim.4780110611. [DOI] [PubMed] [Google Scholar]
  • 3.Gaynor JJ, Feuer EJ, Tan CC, Wu DH, Little CR, Straus DJ, et al. On the use of cause-specific failure and conditional failure probabilities: Examples from clinical oncology data. J Am Stat Assoc. 1993;88:400–09. [Google Scholar]
  • 4.Caplan RJ, Pajak TF, Cox JD. Analysis of the probability and risk of cause-specific failure. Int J Radiat Oncol Biol Phys. 1994;29:1183–1186. doi: 10.1016/0360-3016(94)90416-2. [DOI] [PubMed] [Google Scholar]
  • 5.Gooley TA, Leisenring W, Crowley J, Storer BE. Estimation of failure probabilities in the presence of competing risks: new representations of old estimators. Stat Med. 1999;18:695–706. doi: 10.1002/(sici)1097-0258(19990330)18:6<695::aid-sim60>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]
  • 6.Freidlin B, Korn EL. Testing treatment effects in the presence of competing risks. Stat Med. 2005;24:1703–1712. doi: 10.1002/sim.2054. [DOI] [PubMed] [Google Scholar]
  • 7.Williamson PR, Kolamunnage-Dona R, Tudur Smith C. The influence of competing-risks setting on the choice of hypothesis test for treatment effect. Biostatistics. 2007;8:689–694. doi: 10.1093/biostatistics/kxl040. [DOI] [PubMed] [Google Scholar]
  • 8.Dignam JJ, Kocherginsky MN. Choice and interpretation of statistical tests used when competing risks are present. J Clin Oncol. 2008;26:4027–4034. doi: 10.1200/JCO.2007.12.9866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy N, Farewell VT, Breslow NE. The analysis of failure times in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]
  • 10.Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999;94:496–509. [Google Scholar]
  • 11.Bentzen SM, Vaeth M, Pedersen DE, Overgaard J. Why actuarial estimates should be used in reporting late normal-tissue effects of cancer treatment …now! Int J Radiat Oncol Biol Phys. 1995;32:1531–1534. doi: 10.1016/0360-3016(95)00262-W. [DOI] [PubMed] [Google Scholar]
  • 12.Caplan RJ, Pajak TF, Cox JD. In response to Bentzen et al., IJROBP 32:1531–1534;1995. Int J Radiat Oncol Biol Phys. 1995;32(5):1547. doi: 10.1016/0360-3016(95)99002-W. [DOI] [PubMed] [Google Scholar]
  • 13.Chappell R. RE: Caplan et al and Bentzen et al. Int J Radiat Oncol Biol Phys. 1996;36:988–989. doi: 10.1016/s0360-3016(97)89873-2. [DOI] [PubMed] [Google Scholar]
  • 14.Denham JW, Hamilton CS, O’Brien P. Regarding actuarial late effect analyses: Bentzen et al., IJROBP 32:1531–1534;1995 and Caplan et al., IJROBP 32:1547;1995. Int J Radiat Oncol Biol Phys. 1996;35(1):197. doi: 10.1016/s0360-3016(96)85032-2. [DOI] [PubMed] [Google Scholar]
  • 15.Kim HT. Cumulative incidence in competing risks data and competing risks regression analysis. Clin Cancer Res. 2007;13:559–565. doi: 10.1158/1078-0432.CCR-06-1210. [DOI] [PubMed] [Google Scholar]
  • 16.Tai BC, Grundy R, Machin D. On the importance of accounting for competing risks in pediatric brain cancer: II. Regression modeling and sample size. Int J Radiat Oncol Biol Phys. 2011;79:1139–1146. doi: 10.1016/j.ijrobp.2009.12.024. [DOI] [PubMed] [Google Scholar]
  • 17.Allignol A, Schumacher M, Wanner C, Dreschler C, Beyersmann J. Understanding competing risks: a simulation point of view. BMC Med Res Methodol. 2011;11(1):86. doi: 10.1186/1471-2288-11-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gray RJ. A class of K-sample tests for comparing the cumulative incidence of a competing risk. Ann Stat. 1988;16:1141–1154. [Google Scholar]
  • 19.Slud EV, Byar D. How dependent causes of death can make risk factors appear protective. Biometrics. 1988;44:265–69. [PubMed] [Google Scholar]
  • 20.Di Serio C. The protective impact of a covariate on competing failures with an example from a bone marrow transplantation study. Lifetime Data Anal. 1997;3:99–122. doi: 10.1023/a:1009672300875. [DOI] [PubMed] [Google Scholar]
  • 21.Cheng SC, Fine JP, Wei LJ. Prediction of cumulative incidence function under the proportional hazards model. Biometrics. 1998;54:219–228. [PubMed] [Google Scholar]
  • 22.Andersen PK, Borgan O, Gill R, Keiding N. Statistical Models Based on Counting Processes. New York: Springer; 1992. [Google Scholar]
  • 23.Benichou J, Gail MH. Estimates of absolute cause-specific risk in cohort studies. Biometrics. 1990;46:813–826. [PubMed] [Google Scholar]
  • 24.Bryant J, Dignam JJ. Semiparametric models for cumulative incidence functions. Biometrics. 2004;60:182–190. doi: 10.1111/j.0006-341X.2004.00149.x. [DOI] [PubMed] [Google Scholar]
  • 25.Jeong JH, Fine JP. Parametric regression on cumulative incidence function. Biostatistics. 2007;8:184–196. doi: 10.1093/biostatistics/kxj040. [DOI] [PubMed] [Google Scholar]
  • 26.Klein JP, Andersen PK. Regression modeling of competing risks data based on pseudovalues of the cumulative incidence function. Biometrics. 2005;61:223–229. doi: 10.1111/j.0006-341X.2005.031209.x. [DOI] [PubMed] [Google Scholar]
  • 27.Fine JP. Regression modeling of competing crude failure probabilities. Biostatistics. 2001;2:85–97. doi: 10.1093/biostatistics/2.1.85. [DOI] [PubMed] [Google Scholar]
  • 28.Hougaard P. A class of multivariate failure time distributions. Biometrika. 1986;73:671–678. [Google Scholar]
  • 29.Tsiatis AA. A nonidentifiability aspect of the problem of competing risks. Proc Natl Acad Sci USA. 1975;72:20–22. doi: 10.1073/pnas.72.1.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gail MH. A review and critique of some models used in competing risk analysis. Biometrics. 1975;31:209–222. [PubMed] [Google Scholar]
  • 31.Pilepich MV, Winter K, John MJ, Mesic JB, Sause W, Rubin P, et al. Phase III radiation therapy oncology group (RTOG) trial 86–10 of androgen deprivation adjuvant to definitive radiotherapy in locally advanced carcinoma of the prostate. Int J Radiat Oncol Biol Phys. 2001;50:1243–1252. doi: 10.1016/s0360-3016(01)01579-6. [DOI] [PubMed] [Google Scholar]
  • 32.Klein JP, Gerster M, Andersen PK, Tarima S, Perme MP. SAS and R functions to compute pseudo-values for censored data regression. Comput Methods Programs Biomed. 2008;89:289–300. doi: 10.1016/j.cmpb.2007.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Latouche A, Boisson V, Chevret S, Porcher R. Misspecified regression model for the subdistribution hazard of a competing risk. Stat Med. 2007;26:965–74. doi: 10.1002/sim.2600. [DOI] [PubMed] [Google Scholar]
  • 34.Beyersmann J, Schumacher M. Comment on: Misspecified regression model for the subdistribution hazard of a competing risk. Stat Med. 2007;26:1649–51. doi: 10.1002/sim.2727. [DOI] [PubMed] [Google Scholar]
  • 35.Grambauer N, Schumacher M, Beyersmann J. Proportional subdistribution hazards modeling offers a summary analysis, even if misspecified. Stat Med. 2010;29(7–8):875–84. doi: 10.1002/sim.3786. [DOI] [PubMed] [Google Scholar]

RESOURCES