Evaluating center performance in the competing risks setting: Application to outcomes of wait-listed end-stage renal disease patients

Sai H Dharmarajan; Douglas E Schaubel; Rajiv Saran

doi:10.1111/biom.12739

. Author manuscript; available in PMC: 2019 Jun 19.

Published in final edited form as: Biometrics. 2017 Jul 6;74(1):289–299. doi: 10.1111/biom.12739

Evaluating center performance in the competing risks setting: Application to outcomes of wait-listed end-stage renal disease patients

Sai H Dharmarajan ^1,^*, Douglas E Schaubel ^1,^**, Rajiv Saran ²

PMCID: PMC6582632 NIHMSID: NIHMS894618 PMID: 28682445

Summary

It is often of interest to compare centers or healthcare providers on quality of care delivered. We consider the setting where evaluation of center performance on multiple competing events is of interest. We propose estimating center effects through cause-specific proportional hazards frailty models that allow correlation among a centers cause-specific effects. Estimation of our model proceeds via penalized partial likelihood and is implemented in R. To evaluate center performance, we also propose a directly standardized excess cumulative incidence (ECI) measure. Therefore, based on our proposed methods, practitioners can evaluate centers either through the cause-specific hazards or the cumulative incidence functions. We demonstrate, through simulations, the advantages of the proposed methods to detect outlying centers, by comparing the proposed methods and existing methods which assume uncorrelated random center effects. In addition, we develop a Correlation Score Test to test the null hypothesis that the competing event processes within a center are correlated. Using data from the Scientific Registry of Transplant Recipients, we apply our method to evaluate the performance of Organ Procurement Organizations on two competing risks: (i) receipt of a kidney transplant and (ii) death on the wait-list.

Keywords: Cause-specific hazards, Center Effects, Competing Risks, Correlation Score Test, Cumulative Incidence, Kidney Transplantation

1. Introduction

The availability of electronic health records and the demand for value-driven healthcare have led to greatly increased interest in the methods for evaluation of center performance (Ash et al., 2012). For continuous or binary outcomes, center effects are usually estimated as either fixed or random effects models. Evaluation of center performance is then generally carried out by comparing these estimated risk-adjusted center effects to some fixed quantity, or the average center effect, or by using graphical checks (Spiegelhalter et al., 2012).

The proposed methods are motivated by the end-stage renal disease (ESRD) setting. There are thousands more patients in need of transplantation than there are donor kidneys. As a result medically suitable ESRD patients are placed on a waiting list. For example, in 2015, there were 98,956 patients on the kidney waiting list at year-end, but only 11,594 deceased-donor kidney transplants (Hart et al., 2016). In the United States, there are 58 wait-lists, each administered by an Organ Procurement Organization (OPO). Our objective here is to evaluate OPOs with respect to (i) kidney transplantation and (ii) pre-transplant death (competing risks) among wait-listed patients.

While there has been extensive research conducted into establishing methods for institutional comparisons with respect to binary and continuous outcomes, apart from a few recent studies, time-to-event outcomes have received considerably less attention. He and Schaubel (2014a) assessed the standardized mortality ratio (SMR) measure based on the Cox model and developed an alternative based on stratification. In another study, He and Schaubel (2014b) developed a direct standardized measure of center performance.

Oftentimes in clinical and epidemiological settings, there is more than one competing outcome of interest. In such cases, there are two approaches to conceptualize the event times for the competing risks. The first approach assumes that, for every patient, a latent event time (Gail, 1975; Crowder, 2001) exists for each outcome and only the minimum of these (Cox, 1959) is observed. Under this conceptualization, latent event times must act independently in order for marginal quantities (e.g., cause- or event-specific survival function) to be identifiable. A second approach, adopted in our report, assumes that only one event time, pertaining to the cause of failure, exists for each subject (Kalbfleisch and Prentice, 2002). Data from such settings can now be analyzed through the analysis of cause-specific hazards (Kalbfleisch and Prentice, 2002; Prentice and Kalbfleisch, 1978).

With competing risks data, a comparison of centers with respect to all-cause mortality has the potential to obscure important findings by averaging of dissimilar results (Van Rompaye et al., 2010). An analysis by cause has the potential to yield more interpretable and insightful conclusions (Putter et al., 2007). Fan and Schaubel (2016) proposed, as a center performance measure, the difference between the estimated cumulative incidence of transplant for patients at a given center and the average of the estimated cumulative incidences. Based on similar techniques, Van Rompaye, Erikson and Goetghebeur (2015) developed an ‘excess cause-specific cumulative incidence’ (ECI). For indirectly standardized measures, center performance is evaluated at the patient mix or covariate distribution of each center. Although useful for internal benchmarking, directly standardized measures are preferred for comparisons across centers (Varewyck et al., 2014). Note that random center effects may be preferable to fixed effects in the presence of small center sizes (Ash et al., 2012; Ohlssen et al., 2006; Kalbfleisch and Wolfe, 2013).

Most existing methods for clustered competing risks model the within-cluster dependence through a random effect, and concentrate on a single risk (or separate models for each risk) (Katsahian and Boudreau, 2011; Do Ha et al., 2014). In contrast, we propose a class of frailty models which allow a centers cause-specific random effects be correlated. This approach utilizes the additional information available in the form of correlation between cause-specific random effects within a center.

In this article, we develop a directly standardized ECI measure to contrast center performance on competing outcomes. We utilize an easily implementable penalized partial likelihood method (Ripatti and Palmgreen, 2000). Note that Gorfine and Hsu (2011) and Gorfine et al. (2014) also developed frailty models for correlated event times within-cluster. However, an Expectation-Maximization (EM) algorithm was used which requires numerical integration at each E-step. In comparison, our estimation procedure does not require any numerical integration and is implemented through a single call to coxme function of the coxme package (Therneau, 2009).

If competing events are indeed uncorrelated, fitting separate models is appropriate and easier than the proposed methods. Therefore, we also develop a convenient score test for the presence of correlation between competing risks within-center. The score test does not require fitting the joint model and, thus, provides an a priori checks the appropriateness of using separate cause-specific models, in lieu of the proposed methods.

2. Proposed Methods

2.1. Model and Likelihood

There are J centers or clusters, with each center j having n_j members (j = 1, …, J) so that there are $\sum_{j = 1}^{J} n_{j} = n$ individuals in the entire sample. For each subject i(i = 1, …, n_j) in center j, let $T_{i j}^{0}$ and C_ij denote the failure time and the censoring time, respectively, and let X_ij be a vector of time-independent covariates. The observed event time is then defined as $T_{i j} = \min (T_{i j}^{0}, C_{i j})$ . Each subject fails due to one of K causes, we use Δ_ij (Δ_ij ∈ {0, …, K}) to indicate the cause of the observed failure for subject i in center j, with Δ_ij = 0 if $T_{i j}^{0} > C_{i j}$ . The observed data consist of {T_ij, Δ_ij, X_ij, A_ij} for i = 1, …, n_j and (j = 1, …, J), where A_ij = 1 if subject i belongs to center j and 0 otherwise.

Additionally, we define a vector of center-specific random effects or frailties, for the jth center, γ_j = (γ_j1, …, γ_jK)^T, given which the event times for all subjects within that center are assumed to be conditionally independent. Thus, the cause-specific hazard function for cause k, for the subject i in the center j, is given by:

λ_{i j k} (t | X_{i j}, γ_{j k}) = \lim_{h ↓ 0} \frac{1}{h} P r (t ⩽ T_{i j}^{0} < t + h, Δ_{i j} = k | T_{i j}^{0} ⩾ t, X_{i j}, γ_{j k})

and is assumed to be following the proportional hazards model:

λ_{i j k} (t | X_{i j}, γ_{j k}) = λ_{0 k} (t) \exp {β_{k}^{T} X_{i j} + γ_{j k}}

(1)

for k = 1,..,K where β₁, …, β_k and λ₀₁, …λ_0k are cause-specific regression coefficients and cause-specific baseline hazards respectively. Here, we assume that the vector of covariates X_ij is the same for all causes, but it can be replaced by cause-specific vectors of covariates X_ijk. The center-specific random effects imply a correlation between the cause-specific hazards across subjects within a center. Further, by assuming that the center-specific random effect vectors arise from a multivariate normal distribution with mean zero and covariance matrix V _j, i.e., γ_j ~ MV N(0,V _j), our model allows for the association of different cause-specific hazards across individuals within a center. It is important to note that our model implies that the cause-specific hazards for different causes may be correlated across individuals within a center and not that the cause-specific event times within each individual are correlated. Indeed, as we do not adopt the latent failure time paradigm, our model is agnostic about the existence of different cause-specific event times within each individual.

We focus on the case of K = 2 competing causes, and allow for center-specific random effects for the two different causes to be negatively associated, i.e., Corr(γ_j1, γ_j2) ⩽ 0. To this end we reformulate the cause-specific hazards in equation (1) as

λ_{i j 1} (t | X_{i j}, b_{j}^{0}, b_{j}^{1}) = λ_{01} (t) \exp {β_{1}^{T} X_{i j} + b_{j}^{1} + b_{j}^{0}}

(2)

λ_{i j 2} (t | X_{i j}, b_{j}^{0}, b_{j}^{2}) = λ_{02} (t) \exp {β_{1}^{T} X_{i j} + b_{j}^{2} - b_{j}^{0}}

(3)

where $b_{j}^{1} + b_{j}^{0} = γ_{j 1}$ and $b_{j}^{2} - b_{j}^{0} = γ_{j 2}$ . We have decomposed a center’s cause-specific random-effect into two independent components: a shared random-effect, $b_{j}^{0}$ , acting in opposite directions on the hazards of the two different risks, and a cause-specific random effect component $b_{j}^{k}$ . This implies that $Cov (γ_{j 1}, γ_{j 2}) = - Var (b_{j}^{0})$ . We further assume that jointly $b_{j} = (b_{j}^{0}, b_{j}^{1}, b_{j}^{2}) \sim p (b_{j}; D_{j}) = M V N (0, D_{j} (θ_{j}))$ , where D_j(θ_j) is a diagonal covariance matrix with unknown parameters denoted by the vector θ_j.

We now construct the likelihood function for the model implied in equation (1) in terms of the parameters $(λ_{0} (t), β_{k}^{T}, θ_{j})$ . Note that, for any given subject, $λ_{i j} (t | X_{i j}, b_{j}) = \sum_{k = 1}^{K} λ_{i j k} (t | X_{i j}, b_{j})$ . Thus, the cause-specific densities can be represented as f_ijk(t|X_ij, b_j) = λ_ijk(t|X_ij, b_j)S_ij(t|X_ij, b_j) for k = {1,..,K}, where $S_{i j} (t | X_{i j}, b_{j}) = \exp {- \sum_{k = 1}^{K} λ_{i j k} (t | X_{i j}, b_{j})}$ . Hence, the likelihood function can be written in terms of cause-specific hazard functions. Let the at-risk indicator for subject i in center j be given by Y_ij(t) = I(T_ij ⩾ t). Using the notation given in Section 2.1, we write the likelihood for subjects in center j as:

L_{j} = \int \prod_{i = 1}^{n_{j}} \prod_{k = 1}^{K} {λ_{0 k} (T_{i j}) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}}}^{I (Δ_{i j} = k)} \times [\exp (- \int_{0}^{t} Y_{i j} (u) λ_{0 k} (u) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}} d u)] p (b_{j}; D_{j} (θ_{j})) d b_{j}

(4)

where the integral sign represents the unobserved frailties given by b_j being integrated out and Z_ijk are design vectors setup to obtain the cause-specific hazard models in equations (2) and (3). Specifically, if subject i is in center j then Z_ij1 = (1, 1, 0) and Z_ij2 = (−1, 0, 1), and if subject i does not belong to center j then Z_ij1 = Z_ij2 = (0, 0, 0). It is important to note that for the construction of the above likelihood, we assumed the following: (1) Conditional on {X_ij, Z_ijk, b_j}, the event times and censoring times are independent and the censoring times are non-informative for {β_k, λ_0k, k = 1,2}, (2) X_ij and b_j are independent.

2.2. Estimation

It follows from equation (4) above that the overall likelihood of the data is given by:

L = \int \prod_{j = 1}^{J} \prod_{i = 1}^{n_{j}} \prod_{k = 1}^{K} {λ_{0 k} (T_{i j}) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}}}^{I (Δ_{i j} = k)} \times [\exp (- Λ_{0 k} (T_{i j}) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}})] \times p (b_{j}; D_{j} (θ_{j})) d b_{j},

(5)

where $Λ_{0 k} (T_{i j}) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}} = \int_{0}^{T_{i j}} Y_{i j} (u) λ_{0 k} (u) \exp {β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}} d u$ .

Let b = {b₁^T, …b_J^T}^T be a vector of all random-effects, obtained by stacking the center-specific vectors of random effects b_j, j = 1, …, J. Correspondingly, we define p(b; D(θ)) = MV N(0, D(θ)) such that D(θ) is a block-diagonal covariance matrix composed of blocks formed by D_j(θ_j). We further assume that $θ_{j} = θ_{j^{'}} = (θ_{0}, θ_{1}, θ_{2})$ ; i.e., the center-specific random effect vectors, b_j are i.i.d with $Var (b_{j}^{l}) = θ_{l}, l = {0, 1, 2}$ .

The integrand in equation (5) above can be viewed as the full likelihood of the data under our model, composed of the conditional likelihood of the data given random effects b, multiplied by the likelihood of the random effects. Taking the log, we define:

l_{f u l l} = l_{c o n d} + l_{b} = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} I (Δ_{i j} = k) {\log (λ_{0 k} (T_{i j})) + β_{k}^{T} X_{i j} + b_{j}^{T} Z_{i j k}} - \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} Λ_{0 k} (T_{i j}) + β_{k}^{T} X_{i j} + {b_{j}}^{T} Z_{i j k} + \log | D |^{- \frac{1}{2}} - \frac{1}{2} b^{T} D^{- 1} b

(6)

The above equation is a penalized log-likelihood for the observed data. As in Ripatti and Palmgren (2000), treating b as a fixed effect and using profile likelihood to estimate Λ_0k(t) parameters, then plugging back the resulting Breslow (1974) estimator ${\hat{Λ}}_{0 k} (t)$ into equation (6) yields the following penalized partial log-likelihood (PPLL):

l_{p p l l} = l_{1} + l_{b} = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} I (Δ_{i j} = k) {β_{k}^{T} X_{i j} + {b_{j}}^{T} Z_{i j k} - \log \sum_{r = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) \exp {β_{k}^{T} X_{q r} + {b_{r}}^{T} Z_{q r k}}} + \log | D |^{- \frac{1}{2}} - \frac{1}{2} b^{T} D^{- 1} b .

(7)

As recommended in Ripatti and Palmgren (2000), we suggest obtaining the estimates of ((β_k, b),k = {1, 2}) as solutions to the PPLL. To estimate θ we need to integrate out b. As in Breslow and Clayton (1993), we use a Laplace saddle point approximation to the integration of penalized partial likelihood L_PPLL = exp(l_ppll), with respect to db. Doing so, we obtain an expression for the log of the integrated likelihood as:

l_{I N T} = - \frac{1}{2} \log | D | - \frac{1}{2} \log | K'' (\hat{b}) | - K (\hat{b})

K (\hat{b}) = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} I (Δ_{i j} = k) {β_{k}^{T} X_{i j} + {\hat{b}}_{j}^{T} Z_{i j k} - \log \sum_{r = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) \exp {β_{k}^{T} X_{q r} + {\hat{b}}_{r}^{T} Z_{q r k}}} + \log | D |^{- \frac{1}{2}} - \frac{1}{2} {\hat{b}}^{T} D^{- 1} \hat{b}

and $\hat{b}$ denotes the solution to the partial derivatives of K(b) with respect to b, i.e., $\hat{b}$ solves:

K^{'} (b) = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} I (Δ_{i j} = k) [Z_{i j k} - \frac{\sum_{r = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) Z_{q r k} \exp {β_{k} X_{q r} + {b_{r}}^{T} Z_{q r k}}}{\sum_{r = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) e x p {β_{k} X_{q r} + {b_{r}}^{T} Z_{q r k}}}] - D^{- 1} b = 0.

(8)

The quantity $K " (\hat{b})$ is the set of second partial derivatives of K(b) at $\hat{b}$ . $K'' (\hat{b})$ is also the second partial derivative of l_PPLL, evaluated at $\hat{b}$ . If we define H as the matrix of second derivatives or Hessian of the PPLL with respect to (β, b), such that:

H = [\begin{array}{l} H_{11} & H_{12} \\ H_{21} & H_{22} \end{array}] = - I (β, b) + [\begin{matrix} 0 & 0 \\ 0 & D^{- 1} \end{matrix}]

where $I (β, b) = - \partial^{2} l_{1} / \partial (β, b) \partial {(β, b)}^{'}$ , then $H {(β, \hat{b})}_{22} = K'' (\hat{b})$ . We then have:

l_{I N T} \approx l_{1} (β, \hat{b}) + l_{b} (θ, \hat{b}) - \frac{1}{2} \log | H {(β, \hat{b})}_{22} |

(9)

As demonstrated by Ripatti and Palmgren (2000), ignoring the last term on the right hand side of equation (9) while estimating (β, b) leads to very little loss of information. This corresponds to using the PPLL to estimate (β, b) via a Newton-Raphson algorithm. We have the following estimating equation for β:

\partial l_{P P L L} / \partial β = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} \sum_{k = 1}^{K} I (Δ_{i j} = k) [X_{i j} - \frac{\sum_{j = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) X_{q r} \exp {β_{k} X_{q r} + {b_{r}}^{T} Z_{q r k}}}{\sum_{j = 1}^{J} \sum_{q = 1}^{n_{j}} Y_{q r} (T_{i j}) \exp {β_{k} X_{q r} + {b_{r}}^{T} Z_{q r k}}}] = 0

(10)

The estimating equation for b is similarly obtained by setting ∂l_PPLL/∂b to zero, and is identical to equation (8). Thus, equation (8), required for the saddle point Laplace approximation, is automatically satisfied when PPLL is used to estimate b. To estimate D(θ) we plug the estimated values $(\hat{β})$ into equation (9) and solve for θ that maximizes l_INT. This gives us the following estimating equation:

- \frac{1}{2} [t r (D^{- 1} \frac{\partial D}{\partial θ}) + t r (H_{22}^{- 1} \frac{\partial D^{- 1}}{\partial θ}) - {\hat{b}}^{T} D^{- 1} \frac{\partial D}{\partial θ} D^{- 1} \hat{b}] = 0

(11)

For a diagonal covariance matrix, as in our case, we obtain the following solution:

{\hat{θ}}_{l} = \frac{{({\hat{b}}^{l})}^{T} ({\hat{b}}^{l}) + t r (H_{22}^{l} {({\hat{b}}^{l})}^{- 1})}{J}, l = {0, 1, 2}

(12)

where ${\hat{b}}^{l} = {{\hat{b}}_{1}^{l}, \dots, {\hat{b}}_{J}^{l}}$ and $H_{22}^{l} ({\hat{b}}^{l})$ is the sub-matrix corresponding to ${\hat{b}}^{l}$ terms. The proposed estimation algorithm begins with an initial guess of θ, then alternates between using the PPLL to estimate (β, b) as listed above and using equation (12) to update θ until convergence. As suggested by Gray (1992), the variance of ${({\hat{β}}^{T}, {\hat{b}}^{T})}^{T}$ is obtained as:

\hat{V} (\hat{β}, \hat{b}) = H {(\hat{β}, \hat{b})}^{- 1} I (\hat{β}, \hat{b}) H {(\hat{β}, \hat{b})}^{- 1}

(13)

To obtain the asymptotic distribution for $(\hat{β}, \hat{b}, {\hat{λ}}_{0 k} (s))$ , we assumed that the increments ${\hat{λ}}_{0 k} (s)$ are independent of $(\hat{β}, \hat{b})$ . Under this assumption we estimated the variance of ${\hat{λ}}_{0 k} (s)$ via a non-parametric bootstrap approach where the values of $(\hat{β}, \hat{b})$ were treated as fixed by setting $X \hat{β} + \hat{b}$ as an offset in the linear predictor of the instantaneous hazard. Thus, our desired asymptotic variance-covariance matrix for $(\hat{β}, \hat{b}, {\hat{λ}}_{0 k} (s))$ was obtained using equation (13) to estimate the variance of $(\hat{β}, \hat{b})$ and a non-parametric bootstrap approach to estimate the variance of ${\hat{λ}}_{0 k} (s)$ . In doing so we assume independence between ${\hat{λ}}_{0 k} (s)$ and $(\hat{β}, \hat{b})$ . Our simulation studies suggest this to be a safe assumption. In reality, the increments of ${\hat{λ}}_{0 k} (s)$ and $(\hat{β}, \hat{b})$ may be weakly correlated. However, with increasing sample size one would expect this correlation to get weaker and have a negligible impact on the standard errors of estimates. Then, ignoring this correlation in return for substantial gains in computational efficiency seems appropriate. It should also be noted that, while using the Laplace approximation to the marginal log-likelihood leads to little loss of information, it might result in a slight underestimation of standard errors of fixed and random effect parameters if the cluster sizes are very small, as demonstrated in Ripatti and Palmgren (2000).

2.3. Center Effect Measures: Cumulative Incidence

We define the cumulative incidence function (CIF) of cause k for subject i at center j as:

F_{i j k} (t) = P (T_{i}^{0} ⩽ t, Δ_{i} = k | A_{i} = j, X_{i j}),

(14)

the probability that an individual i in center j experiences a cause k event by time t. To evaluate the performance of center j with respect to type k events, we first define the average risk of events of type k at that center as F_jk(t) = E_X[F_ijk(t)], which is estimated as:

{\hat{F}}_{j k} (t) = {\hat{E}}_{X} [F_{i j k} (t)] = \frac{\sum_{i = 1}^{n_{j}} F_{i j k} (t)}{n_{j}}

(15)

Note that the above equation can be interpreted as potential risk for event k, at time t, that would be observed if the entire study population was treated at center j, assuming there are no unmeasured confounders. To compare the performance of center j to that of other centers we difference this potential risk with the average of such potential risks across all the centers. We call this measure the excess cumulative incidence. This is denoted as δ_jk(t) = F_jk(t) − E_A[F_jk(t)] and estimated as:

{\hat{δ}}_{j k} (t) = {\hat{F}}_{j k} (t) - \frac{\sum_{q = 1}^{J} {\hat{F}}_{q k} (t)}{J}

(16)

2.4. Estimating Center Effects

We estimate cumulative incidence functions, defined in equation (14) using the cause-specific hazards estimated from section 2.2. We note that the cause-specific CIF for cause k, individual i at center j can be written as:

F_{i j k} (t) = \int_{0}^{t} S_{i j} (s) λ_{i j k} (s) d s,

(17)

for which an estimate ${\hat{F}}_{i j k} (t)$ is then obtained by plugging into equation (17) the following estimated quantities:

{\hat{λ}}_{i j k} (s) = {\hat{λ}}_{0 k} (s) \exp ({\hat{β}}_{k} X_{i j} + {\hat{b}}_{j} Z_{i j k}); {\hat{S}}_{i j} (s) = \exp {- \sum_{k = 1}^{2} {\hat{Λ}}_{0 k} (s) \exp ({\hat{β}}_{k} X_{i j} + {\hat{b}}_{j} Z_{i j k})}

where ${\hat{β}}_{k}$ , ${\hat{b}}_{j}$ are estimates obtained as detailed in Section 2.2, and ${\hat{Λ}}_{0 k} (t) = \int_{0}^{t} {\hat{λ}}_{0 k} (s) d s$ is the cumulative cause-specific baseline hazard function obtained by integrating the Breslow-Aalen (Breslow 1974) estimate of the cause-specific baseline hazard function. Estimates of F_jk(t) and the excess cumulative incidence at center j, ${\hat{δ}}_{j k} (t)$ , are subsequently obtained by plugging ${\hat{F}}_{i j k} (t)$ into equations (14) and (16) respectively.

To obtain the variance of the cause-specific cumulative incidence and excess cumulative incidence functions, we apply a parametric bootstrap approach. Specifically, we re-sample the estimated parameters ${\hat{β}}_{k}$ , ${\hat{b}}_{j}$ and ${\hat{λ}}_{0 k} (s)$ from their estimated asymptotic distributions to obtain bootstrapped estimates of the cumulative incidence functions. The variance of ${\hat{F}}_{j k} (t)$ and ${\hat{δ}}_{j k} (t)$ are estimated as variance of the corresponding bootstrapped estimates.

3. Score test of Correlation of Cause-specific Hazards

As mentioned in Section 2.1, equation (1), the cause-specific hazard function for cause k, for the ith subject in center j, is assumed to follow:

λ_{i j k} (t | X_{i}, γ_{j k}) = λ_{0 k} (t) \exp {β_{k}^{T} X_{i} + γ_{j k}}

Thus, the likelihood for the observed data in center j is:

L_{j} = \int \prod_{i = 1}^{n_{j}} \prod_{k = 1}^{K} {λ_{0 k} (t_{i}) \exp {β_{k}^{T} X_{i} + γ_{j k}}}^{Δ_{i k} (t)} \times [\exp (- \int_{0}^{τ} Y_{i} (u) λ_{0 k} (u) \exp {β_{k}^{T} X_{i} + γ_{j k}} d t)] p (γ_{j}; V (θ)) d γ_{j}

(18)

To develop a score test of the correlation of cause-specific hazards within centers, we consider a special case of the model in equation (1) when only K = 2 causes are present. Assume that the center-specific random effects or frailty for cause 2 and cause 1 differ by a multiplicative constant, i.e., γ_j2 = ωγ_j1, implying the following specification for the cause-specific hazards:

λ_{i j 1} (t | X_{i}) = λ_{01} (t) \exp {β_{1}^{T} X_{i} + γ_{j 1}}; λ_{i j 2} (t | X_{i}) = λ_{02} (t) \exp {β_{1}^{T} X_{i} + ω γ_{j 1}}

(19)

The presence of a correlation between the cause-specific hazards within centers is then assessed by testing H₀ : ω = 0. When ω = 0, there is little evidence for a linear relationship between center-specific random effects for causes 1 and 2. Conversely, even if the center-specific random effects are not perfectly correlated as implied by the specification in (19) but have a dependence of the form specified in model (1) we would expect to reject the test of H₀ : ω = 0 in favor of H_a : ω ≠ 0. This is because, in case of any non-zero correlation between the center-specific random effects, the specification in (19) with some ω ≠ 0 should provide a better fit to the observed data than that with ω = 0. Thus, we propose to test for the presence of correlation between cause-sepcific hazards in model (1), i.e., H₀ : Cov(γ_j1,γ_j2) = 0, using the specification in (19) and testing H₀ : ω = 0.

Under the joint model for the cause-specific hazards in (19), likelihood for observed data in center j is given by:

L_{j} = \int \prod_{i = 1}^{n_{j}} {λ_{01} (t_{i}) \exp {β_{1}^{T} X_{i} + γ_{j 1}}}^{Δ_{i 1} (t)} {λ_{02} (t_{i}) \exp {β_{2}^{T} X_{i} + ω γ_{j 1}}}^{Δ_{i 2} (t)} \times [\exp (- \int_{0}^{τ} Y_{i} (u) λ_{01} (u) \exp {β_{1}^{T} X_{i} + γ_{j 1}} d u)] [\exp (- \int_{0}^{τ} Y_{i} (u) λ_{02} (u) \exp {β_{2}^{T} X_{i} + ω γ_{j 1}} d u)] \times p (γ_{j 1}; θ) d γ_{j 1} .

(20)

The marginal log-likelihood for the observed data at all centers is then given by:

\log l (ω, β_{k}, λ_{0 k}) = \sum_{j = 1}^{J} \sum_{i = 1}^{n_{j}} (\sum_{k = 1}^{2} Δ_{i k} (t) {\log λ_{0 k} (t_{i}) + {β_{k}^{T} X_{i}}}) + \log \int K_{j} (z_{j}, t) p (z_{j}; θ) d z_{j}

where z_j = logγ_j1, and

K_{j} (z, t) = z_{j}^{\sum_{i = 1}^{n_{j}} N_{i 1} (t -) + ω N_{i 2} (t -)} \times \exp {- z_{j} (\int_{0}^{t} Y_{i} (u) λ_{02} (u) \exp {β_{2}^{T} X_{i}} d u) - z_{j}^{ω} (\int_{0}^{t} Y_{i} (u) λ_{02} (u) \exp {β_{2}^{T} X_{i}} d u)} .

3.1. Correlation Score Test

Using the above formulation, the score test for correlation of the two cause-specific hazards tests H₀ : ω = 0. The score function is:

U_{ω} (ω, β_{k}, λ_{0 k}) = \sum_{j} \frac{\int {\sum_{i = 1}^{n_{j}} N_{i 2} (t_{i}) - (\int_{0}^{t} Y_{i} (u) λ_{02} (u) \exp {β_{2}^{T} X_{i}} d u)} z_{j}^{ω} \log z_{j} K_{j} (z_{j}) p (z_{j}; θ) d z_{j}}{\int K_{j} (z_{j}) p (z_{j}; θ) d z_{j}}

Setting ω = 0 and replacing β_k, λ_0k and θ with their estimates when ω = 0, we have:

U_{ω} (ω, β_{k}, λ_{0 k}) = \sum_{j} \frac{\int {\sum_{i = 1}^{n_{j}} N_{i 2} (t_{i}) - (\int_{0}^{t} Y_{i} (u) {\hat{λ}}_{02} (u) \exp {{\hat{β}}_{2}^{T} X_{i}} d u)} \log z_{j} {\hat{K}}_{j} (z_{j}) p (z_{j}; \hat{θ}) d z_{j}}{\int {\hat{K}}_{j} (z_{j}) p (z_{j}; θ) d z_{j}} = \sum_{j} {\hat{M}}_{2 j} . \hat{\log z_{j}}

${\hat{M}}_{2 j}$ is an estimate of the ${\sum_{i = 1}^{n_{j}} N_{i 2} (t_{i}) - (\int_{0}^{t} Y_{i} (u) λ_{02} (u) \exp {β_{2}^{T} X_{i}} d u)}$ , the sum of the martingale residuals for cause 2 at center j; and $\hat{\log z_{j}} = E [\log z_{j} | O_{j}]$ , i.e., the posterior expectation of the log frailties given the observed data in center j, O_j. If the frailties z_j are assumed to follow a log normal distribution, there is no closed form expression for $\hat{\log z_{j}}$ , however we can use the estimates ${\hat{γ}}_{j 1}$ obtained by maximizing the penalized partial log-likelihood for cause 1. Balan et al. (2016) note that the test of H₀ : ω = 0 can be carried out by testing if ${\hat{M}}_{2 j}$ and $\hat{\log z_{j}}$ are correlated. Thus, the correlation score test (CST) tests if there is a linear dependency between ${\hat{M}}_{2 j}$ and $\hat{\log z_{j}}$ and uses the regular t statistic from linear regression as the test statistic, $t = r \sqrt{(J - 2) / (1 - r^{2})}$ . Under H₀ : ω = 0, asymptotically, t follows a t distribution with J − 2 degrees of freedom.

4. Simulation Studies

In the first (of two) set of simulations, we evaluated the fixed effect parameter estimators, variance components of the random effects, and Correlation Score Test. There were K = 2 competing risks, and J = 50 or J = 100 centers (configurations 1 and 2, respectively). The center-specific random effects γ_j1, γ_j2 followed a mean zero multivariate normal (MVN) distribution with variance components $σ_{j} = (σ_{1}^{2}, σ_{2}^{2}, ρ_{12}) = (0.25, 0.25, - 0.5)$ . Using the re-parameterization described in Sections 2.1 and 2.2, this corresponds to the center-specific random effects vector $b_{j} = (b_{j}^{0}, b_{j}^{1}, b_{j}^{2})$ being generated from a MVN with mean zero and diagonal covariance matrix D with elements θ_j = (θ₀, θ₁, θ₂) = (0.125,0.125,0.125). The sample size within each center was fixed at n_j = 20 or n_j = 50 for different sub-configurations. In addition, we considered a single N(0,1) covariate X_i with regression coefficients β₁ = 0.5 and β₂ = 1.25 for causes k = 1 and k = 2 respectively. Given β_k, γ_j and the covariate X_i we generated a failure time $T_{i}^{0}$ for each subject within center j from an exponential distribution with rate parameter $μ = \sum_{k = 1}^{2} μ_{k} = \sum_{k = 1}^{2} \exp (β_{k} X_{i} + γ_{j k})$ . We assigned a cause of failure for subject i in center j given a failure at time t using $P r (Δ_{i} = k | T_{i}^{0} = t) = μ_{k} / μ$ . Finally, all censoring occurred at time τ = 0.4 in all configurations.

As shown in Table 1, the proposed method performs very well in estimating the parameters of interest. Also in Table 1, we present results of simulations where the center-specific random effects γ_j1, γ_j2 were generated from a mean zero MVN with $σ_{j} = (σ_{1}^{2}, σ_{2}^{2}, ρ_{12}) = (0.25, 0.25, 0)$ in order to assess the loss in efficiency due to unnecessarily estimating a correlation parameter when the true random effects are not correlated.

Table 1.

Estimating Regression Coefficients and Variance Components: Results from 500 Simulated Datasets

J	n_j		True Value	Bias	ESD	CP	True Value	Bias	ESD	CP
50	20	β₁	0.5	0.007	0.075	0.946	0.5	0.000	0.075	0.954
		β₂	1.25	0.002	0.072	0.950	1.25	0.002	0.074	0.942
		θ₁	0.125	−0.003	0.068	–	0	0.022	0.036	–
		θ₂	0.125	−0.001	0.088	–	0.125	−0.027	0.095	–
		θ₃	0.125	0.005	0.087	–	0.125	−0.021	0.089	–
50	50	β₁	0.5	−0.001	0.043	0.962	0.5	−0.001	0.043	0.962
		β₂	1.25	0.000	0.044	0.954	1.25	−0.004	0.046	0.944
		θ₁	0.125	−0.002	0.051	–	0	0.020	0.027	–
		θ₂	0.125	0.003	0.066	–	0.125	−0.020	0.073	–
		θ₃	0.125	−0.004	0.057	–	0.125	−0.026	0.069	–
100	20	β₁	0.5	0.003	0.050	0.960	0.5	0.000	0.051	0.960
		β₂	1.25	0.001	0.051	0.946	1.25	0.001	0.053	0.942
		θ₁	0.125	−0.005	0.053	–	0	0.017	0.029	–
		θ₂	0.125	0.003	0.066	–	0.125	−0.019	0.074	–
		θ₃	0.125	−0.001	0.064	–	0.125	−0.021	0.065	–
100	50	β₁	0.5	0.001	0.032	0.942	0.5	−0.001	0.033	0.944
		β₂	1.25	0.000	0.031	0.952	1.25	0.002	0.030	0.964
		θ₁	0.125	0.002	0.037	–	0	0.015	0.023	–
		θ₂	0.125	−0.001	0.043	–	0.125	−0.017	0.053	–
		θ₃	0.125	0.000	0.041	–	0.125	−0.012	0.049	–

Open in a new tab

In Table 2, we evaluate the proposed CST and a likelihood ratio test (LRT) of the correlation between cause-specific hazards, via H₀ : ρ = 0. For each (J, n_j) configuration, the Type 1 error rate was calculated as the mean number of times H₀ when the random effects were generated from a mean zero MVN with σ_j = (0.25, 0.25, 0). Similarly, the Power was the mean number of rejections when the random effects were generated from a mean zero MVN with σ_j = (0.25, 0.25, −0.5). The CST seems to do almost as well as the LRT, attaining a type I error rate closer to the nominal 0.05 and achieving nearly as much power. More importantly, the CST is carried out in much less computation time, since it does not require fitting the full model.

Table 2.

Power and Type I error of proposed Correlation Score Test (CST), and Likelihood Ratio (LR) tests. The null hypothesis is no correlation between cause-specific hazards within center: Results from 500 Simulated Datasets

Number of Centers	Subjects per Center	Type I Error		Power
(J)	(n_j)	LRT	CST	LRT	CST
50	20	0.006	0.032	0.416	0.358
50	50	0.028	0.026	0.782	0.692
100	20	0.022	0.048	0.710	0.654
100	20	0.034	0.036	0.982	0.960

Open in a new tab

In the second simulation study, we evaluated our estimators of the center-specific random effects {γ_j1, γ_j2}. Again, K = 2, J = 50, and X_i ~ N(0,1) with regression coefficients β₁ = 0.5 and β₂ = 1.25 for k = 1 and k = 2 respectively. Of the 50 centers, we fixed the value of the random effects for center j^′ and allowed the random effects for the remaining 49 centers to come from a mean 0 MVN with $σ_{j} = (σ_{1}^{2}, σ_{2}^{2}, ρ_{12}) = (0.25, 0.25, - 0.5)$ . The sample size for each of these 49 centers, n_j, j ≠ j^′ was set equal to the random draw from a N(100,40²) variate bounded at 20. Given β_k, γ_j and X_i, we generated $T_{i}^{0}$ from an exponential distribution with rate parameter $μ_{i} = \sum_{k = 1}^{2} μ_{i k}$ , where μ_ik = exp(β_kX_i+γ_jk), and assigned a cause of failure using $P r (Δ_{i} = k | T_{i}^{0} = t) = μ_{i k} / μ$ . Censoring again occurred at time τ = 0.4.

We studied the performance of our estimators at different values of the random effects ${γ_{j^{'} 1}, γ_{j^{'} 2}}$ and at different $n_{j^{'}}$ values. We compared the proposed method to an approach that fits separate frailty models for each k and therefore ignores the correlation between the center-specific random effects. As shown in Table 3, the proposed method produces center effect estimates with smaller mean square error, regardless of the center size and effect.

Table 3.

Estimating Center-Specific Effects: Results from 500 Simulations

			Proposed Method				Ignornig Correaltion of Random Effects
n_j′		True Value	Bias	ESD	ASE	CP	Relative MSE
20	γ_j^′1	0.0	−0.019	0.231	0.322	0.988	1.113
	γ_j^′2	0.0	−0.015	0.239	0.305	0.980	1.084
	γ_j^′1	0.5	−0.175	0.241	0.297	0.970	1.232
	γ_j^′2	−0.5	0.168	0.249	0.327	0.972	1.248
	γ_j^′1	1.0	−0.276	0.244	0.276	0.870	1.252
	γ_j^′2	−1.0	0.397	0.244	0.354	0.838	1.700
40	γ_j^′1	0.0	−0.007	0.222	0.263	0.988	1.093
	γ_j^′2	0.0	−0.015	0.209	0.243	0.986	1.041
	γ_j^′1	0.5	−0.097	0.208	0.232	0.946	1.203
	γ_j^′2	−0.5	0.108	0.214	0.271	0.974	1.274
	γ_j^′1	1.0	−0.141	0.203	0.209	0.916	1.242
	γ_j^′2	−1.0	0.268	0.221	0.306	0.914	1.799
60	γ_j^′1	0.0	−0.010	0.202	0.231	0.968	1.098
	γ_j^′2	0.0	−0.005	0.187	0.210	0.976	1.074
	γ_j^′1	0.5	−0.066	0.195	0.200	0.962	1.142
	γ_j^′2	−0.5	0.070	0.209	0.240	0.958	1.148
	γ_j^′1	1.0	−0.097	0.168	0.178	0.948	1.227
	γ_j^′2	−1.0	0.219	0.219	0.276	0.890	1.666
80	γ_j^′1	0.0	−0.018	0.181	0.210	0.988	1.095
	γ_j^′2	0.0	−0.020	0.179	0.191	0.964	1.080
	γ_j^′1	0.5	−0.071	0.180	0.180	0.954	1.170
	γ_j^′2	−0.5	0.066	0.180	0.218	0.970	1.153
	γ_j^′1	1.0	−0.105	0.161	0.161	0.894	1.206
	γ_j^′2	−1.0	0.193	0.201	0.256	0.918	1.554
100	γ_j^′1	0.0	−0.020	0.170	0.194	0.976	1.095
	γ_j^′2	0.0	−0.015	0.157	0.176	0.978	1.065
	γ_j^′1	0.5	−0.076	0.15	0.167	0.964	1.171
	γ_j^′2	−0.5	0.059	0.197	0.203	0.932	1.123
	γ_j^′1	1.0	−0.095	0.141	0.149	0.928	1.218
	γ_j^′2	−1.0	0.155	0.196	0.242	0.944	1.494

Open in a new tab

An expanded version of Table 3 is available in the Web Appendix (see Web Table 1). While both methods produce shrinkage, leveraging information on the correlation structure of the center-specific random effects leads to estimates with reduced shrinkage and higher rates of coverage. These gains in bias and coverage become more pronounced with decreasing sample sizes, and as the true values of the center effects deviate from the mean of the random effect distribution.

To examine our proposed excess cumulative incidence (ECI) center effect measure, we conducted simulations where the center-specific effects {γ_j1, γ_j2} were known for all centers. We set J = 50, with n_j set equal to the maximum of 20 and a N(100,40²) variate. Center-specific effects {γ_j1, γ_j2} were each fixed at one realization from a MVN with mean 0 and $σ_{j} = (σ_{1}^{2}, σ_{2}^{2}, ρ_{12}) = (0.25, 0.25, - 0.5)$ ; theses were then treated as true center effects. We set X_i ~ N(0,1), with β₁ = 0.5 and β₂ = 1.25 for causes 1 and 2 respectively. Failure times and causes were then generated as presented earlier. Censoring was again at τ = 0.4. The true ECI for each center was calculated at t = 0.3. In Table 4, we compare the proposed method with fitting separate cause-specific Cox frailty models. In terms of mean squared error of the ECI estimates, the proposed method generally out-performs the separate-models approach. A striking example, from Table 4, is the ECI estimates for Center j = 23, whose true ECI values for cause 1 and cause 2 are at opposite extremes.

Table 4.

Estimating Excess Cumulative Incidence: Results from 500 Simulation

			Proposed Method				Ignornig Correlation of Random Effects
Cause	Center	True Value	Bias	ESD	ASE	CP	Relative MSE
1	14	−0.170	0.029	0.022	0.027	0.850	1.018
	16	−0.096	0.024	0.029	0.034	0.926	0.750
	17	−0.181	0.003	0.017	0.023	0.990	3.432
	38	−0.138	0.017	0.024	0.029	0.958	1.226
	1	−0.179	0.023	0.020	0.026	0.918	1.462
	36	0.006	−0.009	0.038	0.038	0.932	0.950
	4	−0.033	0.001	0.034	0.037	0.948	1.010
	49	−0.070	0.018	0.029	0.034	0.944	0.777
	32	−0.047	0.010	0.031	0.035	0.960	0.969
	34	0.005	0.001	0.035	0.039	0.948	1.028
	23	0.344	−0.022	0.045	0.047	0.934	1.279
	19	0.118	−0.013	0.043	0.045	0.904	0.939
	13	0.142	−0.015	0.044	0.046	0.942	1.036
	15	0.127	−0.007	0.043	0.044	0.938	1.082
	18	0.210	−0.022	0.047	0.048	0.904	0.988
2	26	−0.222	0.020	0.019	0.025	0.932	2.265
	25	−0.140	0.014	0.027	0.029	0.936	1.196
	20	−0.137	0.013	0.025	0.030	0.952	1.209
	23	−0.199	0.014	0.021	0.025	0.950	2.122
	5	−0.078	0.006	0.030	0.032	0.950	1.056
	29	−0.017	0.004	0.034	0.033	0.922	0.954
	11	−0.020	0.002	0.031	0.034	0.954	0.987
	45	−0.043	0.006	0.032	0.033	0.934	0.965
	34	−0.058	0.007	0.029	0.033	0.952	0.993
	9	−0.009	0.001	0.031	0.033	0.946	1.016
	41	0.203	−0.016	0.037	0.037	0.928	1.065
	40	0.158	−0.012	0.035	0.036	0.934	1.076
	31	0.157	−0.016	0.038	0.038	0.900	0.969
	17	0.371	−0.020	0.037	0.034	0.902	1.327
	14	0.111	−0.009	0.033	0.035	0.926	1.245

Open in a new tab

5. Application

We applied the proposed methods to evaluate Organ Procurement Organizations (OPOs) with respect to two competing risks: (i) deceased-donor kidney transplantation (ii) death (prior to transplantation). We use data from the Scientific Registry of Transplant Recipients (SRTR). The SRTR data system includes data on all donor, wait-listed candidates, and transplant recipients in the U.S., submitted by the members of the Organ Procurement and Transplantation Network (OPTN), and has been described elsewhere. The Health Resources and Services Administration (HRSA), U.S. Department of Health and Human Services provides oversight to the activities of the OPTN and SRTR contractors.

The study cohort included patients wait-listed between 1/1/2010 and 4/30/2010. Patients were followed from the date of listing until the earliest of receipt of a kidney transplant, death, removal from wait-list, or the end of the observation period, 12/31/2012. Using the proposed methods, we compared OPOs across the U.S. with respect to the cumulative incidence of receiving a deceased-donor transplant and the cumulative incidence of death prior to transplantation. The time point we chose was two years post wait-listing, an appropriate time horizon based on previous related analyses (e.g., Fan and Schaubel, 2016). Patients receiving a living donor transplant were treated as independently censored, which is appropriate from the perspective that living-donor transplantation depends on many factors related to a patient’s specific circumstances and largely independent of OPO. Note that living-donor transplantation was not a cause of our interest, rendering unappealing its inclusion as a separate cause.

Our study population included n = 11,759 patients across J = 58 OPOs across the U.S. A total of 2,408 patients (20.5%) received a deceased-donor kidney transplant, while 1,114 (9.5%) died first. We adjusted for the following patient-level covariates: age at listing, race, sex, body mass index, primary renal diagnosis, panel reactive antibody level and blood type. Owing to the large dimension of the covariate vector, we used a two-stage approach, as done in Kalbfleisch and Wolfe (2013), to obtain the risk-adjusted center effects (see also He and Schaubel, 2014b). Specifically, we estimated the patient-level covariates at the first stage by fitting a Cox model stratified by OPO. At the second stage, we estimated the cause-specific OPO effects by fitting the proposed model, using the patient-level linear predictor from the first stage as an offset. The estimated variance components are given by ${\hat{σ}}_{j} = ({\hat{σ}}_{1}^{2}, {\hat{σ}}_{2}^{2}, {\hat{ρ}}_{12}) = (0.619, 0.031, 0.210)$ . The estimated correlation was determined to be statistically significant, with the CST yielding a p-value of 0.021.

Figure 1 displays the estimated OPO-specific ECI’s at 2 years post-listing, along with 95% confidence intervals. The ECIs of transplantation ranged from −0.120 to 0.404, and the ECIs of death ranged from −0.126 to 0.115. For a given OPO, a high ECI for transplantation and a low ECI for death represent good performance. We classified OPOs as low- or high-outliers based on the 95% confidence intervals.

We compared the proposed method to a method that ignores the correlation between the cause-specific center effects with respect to outlier classification (Web Table 2). While the two methods produced nearly identical classifications of OPOs based on the incidence of transplant, the proposed method classified 6 more OPOs as outliers than fitting separate frailty models by cause. This is a consequence of the reduction in shrinkage in the ECI estimates by the proposed method, due to leveraging the information on the correlation structure.

6. Discussion

In this report, we develop methods for evaluating center performance in the competing risks setting. We propose estimating center effects through cause-specific proportional hazards frailty models that allow correlation among a centers cause-specific hazards. We also propose a score test to test for the presence of correlation between a center’s cause-specific hazards.

In our application, the cause-specific center effects do not seem to be strongly correlated. In scenarios where the correlation between cause-specific center effects is on the higher side, as maybe the case, for example, if there exists an unmeasured covariate influencing both outcomes, using the proposed method instead of currently available methods may produce a larger change in classification of centers than seen here. Since fitting the proposed model may be computationally cumbersome, we recommend first using the proposed CST, to determine if the proposed model is warranted (the alternative being cause-specific frailty models).

To ease computational burden while adjusting for case-mix in our application, we use a two-stage approach. In the first stage, we fit a model stratified by OPO to estimate the regression parameters associated with a large number of patient characteristics. In the second stage, we used the estimated regression parameters as an offset in the linear predictor of the instantaneous hazard in a random-effects model. Note that, following this two-stage approach has the added benefit of avoiding problems due to confounding between the patient-level covariates and the OPO-specifc random-effects. As mentioned in Section 2.1, correlation between covariates and random-effects is a violation of our model assumption which may lead to biased estimates of center effects. However, using the above mentioned two-stage approach seems to rectify this issue. This is because, in the second stage, our random effects are estimated given $X \hat{β}$ , where $\hat{β}$ is estimated from the stratified model. This ensures that an unbiased estimate of $\hat{β}$ is used while estimating the random effects. The random effects then estimated represent an estimate of variation between centers after all the within center variation has been accounted for accurately. It is possible that the random-effects may still be correlated with center-level averages of the covariates X, and that this variation could further be partitioned into variation due to differences in center-level averages of the covariates X and other remaining variation between centers. The question of adjusting further for between-center differences while using a random-effects model may be a policy decision. An alternative, one-stage, approach to account for confounding by patient-level covariates is to use the between-method decomposition of covariates as suggested by Sjölander et al. (2013), where center-level averages of the covariates X are adjusted for.

Supplementary Material

Supp DataS1

NIHMS894618-supplement-Supp_DataS1.csv^{(58.7KB, csv)}

Supplementary Materials

NIHMS894618-supplement-Supplementary_Materials.pdf^{(223.2KB, pdf)}

Acknowledgements

The authors thank the Associate Editor and Referee for their thoughtful suggestions which led to substantial improvement of the manuscript. This work was supported in part by National Institutes of Health Grant R01-DK070869. The data reported here have been supplied by the Minneapolis Medical Research Foundation (MMRF) as the contractor for the Scientific Registry of Transplant Recipients (SRTR). The interpretation and reporting of these data are the responsibility of the authors and in no way should be seen as an official policy of or interpretation by the SRTR or the U.S. Government.

Footnotes

^7.

Supplementary Materials

Web Appendix A, referenced in Section 4, and a web supplement containing R code and an example data file is available with this paper at the Biometrics website on Wiley Online Library.

References

Ash AS, Fienberg SE, Louis TA, Normand ST, Stukel TA, and Utts J (2012). Statistical issues in assessing hospital performance. White paper, Committee of Presidents of Statistical Societies. [Google Scholar]
Balan T, Boonk SE, Vermeer MH, and Putter H (2016). Score test for association between recurrent events and a terminal event. Statistics in medicine 35(18), 3037–3048. [DOI] [PubMed] [Google Scholar]
Breslow NE (1974). Covariance analysis of censored survival data. Biometrics 30, 89–99. [PubMed] [Google Scholar]
Breslow NE, and Clayton DG (1993). Approximate inference in generalized linear models. Journal of the American Statistical Association 88, 9–25. [Google Scholar]
Cox DR (1959). The analysis of exponentially distributed lifetimes with two types of failure. Journal of Royal Statistical Society, Series B 21, 411–421. [Google Scholar]
Crowder MJ (2001) Classical competing risks. London: Chapman and Hall/CRC. [Google Scholar]
Do Ha I, Christian NJ, Jeong JH, Park J, and Lee Y (2014). Analysis of clustered competing risks data using subdistribution hazard models with multivariate frailties. Statistical methods in medical research. Published Online. DOI: 10.1177/0962280214526193. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fan L, and Schaubel DE (2016). Comparing center-specific cumulative incidence functions. Lifetime data analysis 22(1), 1–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gail MH (1975). A review and critique of some models used in competing risk analysis. Biometrics 31, 209–222. [PubMed] [Google Scholar]
Gorfine M, and Hsu L (2011). FrailtyBased Competing Risks Model for Multivariate Survival Data. Biometrics 67(2), 415–426. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gorfine M, Hsu L, Zucker DM, and Parmigiani G (2014). Calibrated predictions for multivariate competing risks models. Lifetime Data Analysis 20(2), 234–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gray RJ (1992). Flexible methods for analyzing survival data using splines, with applications to breast cancer prognoses. Journal of the American Statistical Association 87, 942–951. [Google Scholar]
Hart A, Smith JM, Skeans MA, Gustafson SK, Stewart DE, Cherikh WS, Wainright JL, Boyle G, Snyder JJ, Kasiske BL and Israni AK (2016). OPTN/SRTR Annual Data Report 2014: Kidney. American Journal of Transplant 16 (Suppl 2), 11–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
He K, and Schaubel DE (2014a). Methods for comparing centerspecific survival outcomes using direct standardization. Statistics in medicine 33(12), 2048–2061. [DOI] [PMC free article] [PubMed] [Google Scholar]
He K, and Schaubel DE (2014b). Standardized Mortality Ratio for Evaluating Center-Specific Mortality: Assessment and Alternative. Statistics in Biosciences 7(2), 1–26. [Google Scholar]
Kalbfleisch JD, and Prentice RL (2002). The statistical analysis of failure time data, 2nd Edition. New York: Wiley. [Google Scholar]
Kalbfleisch JD, and Wolfe R (2013). On monitoring outcomes of medical providers. Statistics in Biosciences 5, 286–302. [Google Scholar]
Katsahian S, and Boudreau C (2011). Estimating and testing for center effects in competing risks. Statistics in medicine 30(13), 1608–1617. [DOI] [PubMed] [Google Scholar]
Ohlssen DI, Sharples LD, and Spiegelhalter DJ (2006). A hierarchical modelling frame-work for identifying unusual performance in health care providers. Journal of the Royal Statistical Society, Series A 170, 865–890. [Google Scholar]
Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy V, Farewell VT, and Breslow NE (1978). The analysis of failure times in the presence of competing risks. Biometrics 34, 541–554. [PubMed] [Google Scholar]
Putter H, Fiocco M, and Geskus R (2007). Tutorial in biostatistics: competing risks and multi-state models. Statistics in Medicine 26, 2389–2430. [DOI] [PubMed] [Google Scholar]
Ripatti S, and Palmgren J (2000). Estimation of multivariate frailtymodels using penalized partial likelihood. Biometrics 56, 1016–1022. [DOI] [PubMed] [Google Scholar]
Sjölander A, Lichtenstein P, Larsson H, Pawitan Y. (2013). Between-within models for survival analysis. Statistics in Medicine 32, 3067–3076. [DOI] [PubMed] [Google Scholar]
Spiegelhalter D, Sherlaw-Johnson C, Bardsley M, Blunt I, Wood C, and Grigg O (2012). Statistical methods for healthcare regulation: rating, screening and surveillance. Journal of Royal Statistical Society, Series A 175(1), 1–47. [Google Scholar]
Therneau T (2015). Package ‘coxme’. Mixed Effects Cox Models. R Package version 2.2–5.
VanRompaye B, Goetghebeur E, and Jaffar S (2010). Design and testing for clinical trials faced with misclassified causes of death. Biostatistics 11, 546–558. [DOI] [PMC free article] [PubMed] [Google Scholar]
Van Rompaye B, Eriksson M, and Goetghebeur E (2015). Evaluating hospital performance based on excess causespecific incidence. Statistics in medicine 34(8), 1334–1350. [DOI] [PMC free article] [PubMed] [Google Scholar]
Varewyck M, Goetghebeur E, Eriksson M, and Vansteelandt S (2014). On shrinkage and model extrapolation in the evaluation of clinical center performance. Biostatistics 15(4), 651–664. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhao L, Shi J, Shearon TH, and Li Y (2015). A Dirichlet process mixture model for survival outcome data: assessing nationwide kidney transplant centers. Statistics in medicine 34(8), 1404–1416. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp DataS1

NIHMS894618-supplement-Supp_DataS1.csv^{(58.7KB, csv)}

Supplementary Materials

NIHMS894618-supplement-Supplementary_Materials.pdf^{(223.2KB, pdf)}

[R1] Ash AS, Fienberg SE, Louis TA, Normand ST, Stukel TA, and Utts J (2012). Statistical issues in assessing hospital performance. White paper, Committee of Presidents of Statistical Societies. [Google Scholar]

[R2] Balan T, Boonk SE, Vermeer MH, and Putter H (2016). Score test for association between recurrent events and a terminal event. Statistics in medicine 35(18), 3037–3048. [DOI] [PubMed] [Google Scholar]

[R3] Breslow NE (1974). Covariance analysis of censored survival data. Biometrics 30, 89–99. [PubMed] [Google Scholar]

[R4] Breslow NE, and Clayton DG (1993). Approximate inference in generalized linear models. Journal of the American Statistical Association 88, 9–25. [Google Scholar]

[R5] Cox DR (1959). The analysis of exponentially distributed lifetimes with two types of failure. Journal of Royal Statistical Society, Series B 21, 411–421. [Google Scholar]

[R6] Crowder MJ (2001) Classical competing risks. London: Chapman and Hall/CRC. [Google Scholar]

[R7] Do Ha I, Christian NJ, Jeong JH, Park J, and Lee Y (2014). Analysis of clustered competing risks data using subdistribution hazard models with multivariate frailties. Statistical methods in medical research. Published Online. DOI: 10.1177/0962280214526193. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] Fan L, and Schaubel DE (2016). Comparing center-specific cumulative incidence functions. Lifetime data analysis 22(1), 1–21. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Gail MH (1975). A review and critique of some models used in competing risk analysis. Biometrics 31, 209–222. [PubMed] [Google Scholar]

[R10] Gorfine M, and Hsu L (2011). FrailtyBased Competing Risks Model for Multivariate Survival Data. Biometrics 67(2), 415–426. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Gorfine M, Hsu L, Zucker DM, and Parmigiani G (2014). Calibrated predictions for multivariate competing risks models. Lifetime Data Analysis 20(2), 234–251. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Gray RJ (1992). Flexible methods for analyzing survival data using splines, with applications to breast cancer prognoses. Journal of the American Statistical Association 87, 942–951. [Google Scholar]

[R13] Hart A, Smith JM, Skeans MA, Gustafson SK, Stewart DE, Cherikh WS, Wainright JL, Boyle G, Snyder JJ, Kasiske BL and Israni AK (2016). OPTN/SRTR Annual Data Report 2014: Kidney. American Journal of Transplant 16 (Suppl 2), 11–46. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] He K, and Schaubel DE (2014a). Methods for comparing centerspecific survival outcomes using direct standardization. Statistics in medicine 33(12), 2048–2061. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] He K, and Schaubel DE (2014b). Standardized Mortality Ratio for Evaluating Center-Specific Mortality: Assessment and Alternative. Statistics in Biosciences 7(2), 1–26. [Google Scholar]

[R16] Kalbfleisch JD, and Prentice RL (2002). The statistical analysis of failure time data, 2nd Edition. New York: Wiley. [Google Scholar]

[R17] Kalbfleisch JD, and Wolfe R (2013). On monitoring outcomes of medical providers. Statistics in Biosciences 5, 286–302. [Google Scholar]

[R18] Katsahian S, and Boudreau C (2011). Estimating and testing for center effects in competing risks. Statistics in medicine 30(13), 1608–1617. [DOI] [PubMed] [Google Scholar]

[R19] Ohlssen DI, Sharples LD, and Spiegelhalter DJ (2006). A hierarchical modelling frame-work for identifying unusual performance in health care providers. Journal of the Royal Statistical Society, Series A 170, 865–890. [Google Scholar]

[R20] Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy V, Farewell VT, and Breslow NE (1978). The analysis of failure times in the presence of competing risks. Biometrics 34, 541–554. [PubMed] [Google Scholar]

[R21] Putter H, Fiocco M, and Geskus R (2007). Tutorial in biostatistics: competing risks and multi-state models. Statistics in Medicine 26, 2389–2430. [DOI] [PubMed] [Google Scholar]

[R22] Ripatti S, and Palmgren J (2000). Estimation of multivariate frailtymodels using penalized partial likelihood. Biometrics 56, 1016–1022. [DOI] [PubMed] [Google Scholar]

[R23] Sjölander A, Lichtenstein P, Larsson H, Pawitan Y. (2013). Between-within models for survival analysis. Statistics in Medicine 32, 3067–3076. [DOI] [PubMed] [Google Scholar]

[R24] Spiegelhalter D, Sherlaw-Johnson C, Bardsley M, Blunt I, Wood C, and Grigg O (2012). Statistical methods for healthcare regulation: rating, screening and surveillance. Journal of Royal Statistical Society, Series A 175(1), 1–47. [Google Scholar]

[R25] Therneau T (2015). Package ‘coxme’. Mixed Effects Cox Models. R Package version 2.2–5.

[R26] VanRompaye B, Goetghebeur E, and Jaffar S (2010). Design and testing for clinical trials faced with misclassified causes of death. Biostatistics 11, 546–558. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] Van Rompaye B, Eriksson M, and Goetghebeur E (2015). Evaluating hospital performance based on excess causespecific incidence. Statistics in medicine 34(8), 1334–1350. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] Varewyck M, Goetghebeur E, Eriksson M, and Vansteelandt S (2014). On shrinkage and model extrapolation in the evaluation of clinical center performance. Biostatistics 15(4), 651–664. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Zhao L, Shi J, Shearon TH, and Li Y (2015). A Dirichlet process mixture model for survival outcome data: assessing nationwide kidney transplant centers. Statistics in medicine 34(8), 1404–1416. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Evaluating center performance in the competing risks setting: Application to outcomes of wait-listed end-stage renal disease patients

Sai H Dharmarajan

Douglas E Schaubel

Rajiv Saran

Summary

1. Introduction

2. Proposed Methods

2.1. Model and Likelihood

2.2. Estimation

2.3. Center Effect Measures: Cumulative Incidence

2.4. Estimating Center Effects

3. Score test of Correlation of Cause-specific Hazards

3.1. Correlation Score Test

4. Simulation Studies

Table 1.

Table 2.

Table 3.

Table 4.

5. Application

Figure 1.

6. Discussion

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Evaluating center performance in the competing risks setting: Application to outcomes of wait-listed end-stage renal disease patients

Sai H Dharmarajan

Douglas E Schaubel

Rajiv Saran

Summary

1. Introduction

2. Proposed Methods

2.1. Model and Likelihood

2.2. Estimation

2.3. Center Effect Measures: Cumulative Incidence

2.4. Estimating Center Effects

3. Score test of Correlation of Cause-specific Hazards

3.1. Correlation Score Test

4. Simulation Studies

Table 1.

Table 2.

Table 3.

Table 4.

5. Application

Figure 1.

6. Discussion

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases