Skip to main content
Blood logoLink to Blood
. 2014 Apr 17;123(23):3664–3671. doi: 10.1182/blood-2014-01-552984

Validation and refinement of the Disease Risk Index for allogeneic stem cell transplantation

Philippe Armand 1,, Haesook T Kim 2, Brent R Logan 3,4, Zhiwei Wang 3, Edwin P Alyea 1, Matt E Kalaycio 5, Richard T Maziarz 6, Joseph H Antin 1, Robert J Soiffer 1, Daniel J Weisdorf 7, J Douglas Rizzo 3, Mary M Horowitz 3, Wael Saber 3
PMCID: PMC4047501  PMID: 24744269

Key Points

  • The DRI successfully stratified patients in a very large allogeneic transplantation registry cohort.

  • The DRI was refined by using this cohort to build a more inclusive and conditioning intensity–independent index.

Abstract

Because the outcome of allogeneic hematopoietic cell transplantation (HCT) is predominantly influenced by disease type and status, it is essential to be able to stratify patients undergoing HCT by disease risk. The Disease Risk Index (DRI) was developed for this purpose. In this study, we analyzed 13 131 patients reported to the Center for International Blood and Marrow Transplant Research who underwent HCT between 2008 and 2010. The DRI stratified patients into 4 groups with 2-year overall survival (OS) ranging from 64% to 24% and was the strongest prognostic factor, regardless of age, conditioning intensity, graft source, or donor type. A randomly selected training subgroup of 9849 patients was used to refine the DRI, using a multivariable regression model for OS. This refined DRI had improved prediction ability for the remaining 3282 patients compared with the original DRI or other existing schemes. This validated and refined DRI can be used as a 4- or 3-group index, depending on the size of the cohort under study, for prognostication; to facilitate the interpretation of single-center, multicenter, or registry studies; to adjust center outcome data; and to stratify patients entering clinical trials that enroll patients across disease categories.

Introduction

Although many factors influence the outcome of allogeneic hematopoietic cell transplantation (HCT), disease type and disease status at the time of transplantation are the strongest determinants of post-HCT survival. It is therefore essential to categorize patients by disease risk in HCT studies. This is already feasible for other variables, such as donor HLA match1 or comorbidity burden.2 The Disease Risk Index (DRI) was developed as a tool to assign patients into one of 4 overall survival (OS) risk groups based on disease type and status at the time of transplantation.3 This index was developed by using a single-institution patient cohort and has since been successfully applied in other studies.4,5 The primary goals of this study were to examine the validity of the DRI in a multicenter setting and to refine the DRI by using a much larger data set from the Center for International Blood and Marrow Transplant Research (CIBMTR). The refinement was intended to create a conditioning regimen–independent index, to categorize Burkitt lymphoma (BL), which we could not do in the original study, to perform a finer categorization of some diseases (eg, myelodysplastic syndromes [MDSs] and acute lymphoblastic leukemia [ALL]), and to use the increased statistical power to appropriately classify less common disease type-disease status combinations. Ultimately, the aim of the DRI is to provide a robust tool that can be used for prognostication, for the analysis and interpretation of retrospective data, whether conducted in single-center, multicenter, or registry settings or within the context of the federally mandated center outcome reporting. The DRI can also be used for the stratification of patients entering prospective HCT clinical trials.

Methods

Data source

The CIBMTR is a combined research program of the Medical College of Wisconsin and the National Marrow Donor Program. CIBMTR comprises a voluntary network of more than 450 transplantation centers worldwide that contribute detailed data on consecutive allogeneic and autologous HCTs to a centralized statistical center. Observational studies conducted by the CIBMTR are performed in compliance with all applicable federal regulations pertaining to the protection of human research participants. Protected health information used in the performance of such research is collected and maintained in CIBMTR’s capacity as a public health authority under the Health Insurance Portability and Accountability Act Privacy Rule. Additional details regarding the data source are described elsewhere.6

Patients

The study cohort consisted of patients reported to the CIBMTR who underwent HCT between 2008 and 2010, excluding autologous and syngeneic transplantations. Among the 17 223 patients in this data set, we excluded 2361 patients with missing disease type, disease subtype, or disease status information; patients transplanted for benign or rare disorders (eg, histiocytic disorders, large granular lymphocyte or natural killer cell leukemias); and 1731 pediatric (age <18 years) patients. For the remaining 13 131 patients, the baseline characteristics at HCT, disease information, transplantation type, and OS outcomes were obtained from the CIBMTR database. We categorized cytogenetics for patients with acute myeloid leukemia (AML)7 and MDSs8 according to previously proposed risk schemes that were specifically derived from HCT patient cohorts. For AML, t(8;21), inv(16), and t(15;17) were classified as favorable, complex karyotype (4 or more abnormalities) as adverse, and all others as intermediate. For MDSs, abnormal chromosome 7 and complex karyotype (≥ 4 abnormalities) were classified as adverse and all others as intermediate. For patients with ALL, cytogenetics were classified into 3 groups: Philadelphia chromosome–positive (Ph+; defined by t(9;22) on karyotypic analysis or BCR-ABL translocation by fluorescence in situ hybridization, where available); t(4;11); and all others. Patients with nodular lymphocyte–predominant Hodgkin lymphoma were grouped with Hodgkin lymphoma (HL); patients with small lymphocytic lymphoma were grouped with chronic lymphocytic leukemia.

Statistical analysis

Patient baseline characteristics were reported descriptively. The primary end point was OS. No secondary HCT outcome was examined for this study. OS was defined as the time from stem cell infusion to death from any cause. Patients who were alive or lost to follow-up were censored at the time last seen alive. The 1-year follow-up completeness index was 99% and 83% overall. OS was calculated by using the Kaplan-Meier method. The log-rank test was used for comparisons of Kaplan-Meier curves. Potential prognostic factors for OS, including disease risk, age, conditioning intensity, donor and recipient gender, cytomegalovirus (CMV) serostatus, graft source, donor HLA type, comorbidity index, and Karnofsky performance status (KPS) at HCT were examined in the proportional hazards model. The proportional hazards assumption for each variable of interest was tested, and interaction terms were examined. Because conditioning intensity (myeloablative conditioning [MAC] vs reduced-intensity conditioning/nonmyeloablative [RIC/NMA]) did not meet the proportional hazards assumption, the Cox model was stratified by conditioning intensity. All P values were 2-sided at a significance level of .05. To refine the DRI, we randomly split the 13 131 patients into a training cohort comprising three-fourths of the patients (n = 9849) and a testing cohort comprising the remaining one-fourth of the patients (n = 3282). The risk groups were then recreated on the basis of the hazard ratios (HRs) for mortality of each individual disease type-disease status category in a multivariable model, with AML intermediate cytogenetics in first complete remission (CR1) as the reference category. We used thresholds of 2.0, 1.33, and 0.89 to separate the groups (separated by a 50% proportional increase in hazard for death). We then compared the performance of the refined DRI and various other risk scores in the testing cohort by using both the multivariable c-statistic9 (rcorr.cens procedure from Hmisc package in R; R Foundation for Statistical Computing, Vienna, Austria), which quantifies the predictive ability of a stratification scheme, as well as the assessment of model fit measured by the difference in the Akaike information criterion (AIC) between multivariable models built with and without the disease risk variable. For this comparison, we excluded patients with BL who were not classified in the original DRI.

Because the very-high-risk group represents only a small percentage of patients, we examined the behavior for various possible sample sizes of the 4-group index and a potential 3-group index, built by combining the very-high-risk and high-risk categories. We performed a simulation study with sample sizes of 100, 200, 300, 400, 500, 750, 1000, 2000, 3000, 4000, and 5000 patients. For each sample size, 100 random samples of that size were drawn from the entire 13 131 patients. We then fit Cox models on each sample by using a 3-group or 4-group DRI and pooled the outcomes from the 100 random samples. All calculations were performed by using SAS 9.3 (SAS Institute, Inc., Cary, NC), and R version 2.13.

Results

Patient characteristics

The baseline characteristics of the 13 131 patients are shown in Table 1. The cohort included a broad representation of diseases, disease status, donor types, and graft sources; more than 5000 patients received transplantations for lymphoid malignancies, including nearly 2000 for non-Hodgkin lymphoma (NHL). Approximately half the patients were conditioned with an MAC regimen and half with an RIC/NMA regimen. The median follow-up for survivors was 24 months (range, 2 to 50 months).

Table 1.

Baseline patient characteristics (N = 13 131)

Characteristic No. %*
Age, years
 Median 52
 Range 18-80
 <40 3067 23
 40-49 2648 20
 50-64 6042 46
 ≥65 1374 10
Gender
 Male 7562 58
 Female 5569 42
Disease
 ALL 1726 13
  CR1 1023 8
  CR2 407 3
  CR3 61 0
  Relapse/induction failure 235 2
 AML 5313 40
  Favorable cytogenetics 224 2
  Intermediate/unavailable cytogenetics 4838 37
  Adverse cytogenetics 251 2
  CR1 2800 21
  CR2 1014 8
  CR3 162 1
  Relapse/induction failure 1337 10
 CLL 746 6
  CR 81 1
  PR 400 3
  Relapse/induction failure 265 2
 CML 511 4
  Chronic phase 1 274 2
  Chronic phase 2 116 1
  Accelerated phase 69 1
  Blast crisis 52 0
 HL 436 3
  CR 126 1
  PR 225 2
  Relapse/induction failure 85 1
 MDS 1539 12
  Low-risk§ 886 7
  High-risk§ 653 5
  Intermediate-risk/unavailable cytogenetics|| 1294 10
  Adverse cytogenetics|| 245 2
  Untreated 438 3
  CR 177 1
  Improved but not CR 448 3
  Relapsed/no response/progression 476 4
 Multiple myeloma 489 4
  CR 78 1
  PR 165 1
  VGPR 96 1
  Relapse/induction failure 150 1
 Myeloproliferative neoplasms 426 3
  Untreated 197 2
  CR 5 0
  Improved but not CR 77 1
  Relapsed/no response/progression 147 1
 NHL 1945 15
  Indolent B-cell NHL 587 4
   CR 183 1
   PR 276 2
   Relapse/induction failure 128 1
  Aggressive B-cell NHL 540 4
   CR 181 1
   PR 205 2
   Relapse/induction failure 154 1
  Mantle cell lymphoma 355 3
   CR 160 1
   PR 149 1
   Relapse/induction failure 46 0
  T-cell lymphoma 428 3
   CR 171 1
   PR 164 1
   Relapse/induction failure 93 1
  BL 35 0
   CR 23 0
   PR 7 0
   Relapse/induction failure 5 0
HCT-CI#
 0 5284 40
 1-2 3674 28
 3+ 3975 30
 Unavailable 198 2
KPS at HCT
 <90 4467 34
 90-100 8152 62
 Unknown 512 4
Donor match**
 MRD 4932 38
 Non-MRD 8199 62
 Well-matched URD 5020 38
 Partially matched URD 1416 11
 Mismatched URD 131 1
 Mismatched relative 521 4
 Matching unknown 83 1
 UCB donor 1028 8
Graft source
 PB 10508 80
 BM 1595 12
 UCB 1028 8
Conditioning
 MAC 7020 53
 Nonmyeloablative/RIC 6111 47
CMV serostatus
 Recipient-negative and donor-negative 3245 25
 Recipient-negative and donor-positive 1396 11
 Recipient-positive and donor-negative 3501 27
 Recipient-positive and donor-negative 3702 28
 Recipient or donor unknown 1287 10
Gender matching
 Male to male 4452 34
 Male to female 2967 23
 Female to male 2553 19
 Female to female 2106 16
 Donor gender unknown 1053 8
Year of HCT
 Median 2009
 Range 2008-2010
Follow-up for survivors, months
 Median 24
 Range 2-50

BM, bone marrow; CI, conditioning intensity; CLL, chronic lymphocytic leukemia; CML, chronic myelogenous leukemia; MRD, matched related donor; PB, peripheral blood; UCB, umbilical cord blood; URD, unrelated donor; VGPR, very good partial remission.

*

Percentages may not add to 100 because of rounding.

Classified according to Armand et al.7

Includes 3 cases with untreated disease.

§

Low-risk MDS refers to MDS with ≤5% blasts (refractory anemia with or without ringed sideroblasts and refractory cytopenia with multilineage dysplasia); high-risk MDS refers to MDS with >5% blasts (refractory anemia with excess blasts [RAEB-1 and RAEB-2]).

||

Classified according to Armand et al.8

Indolent B-NHL includes follicular lymphoma, marginal zone lymphoma, and lymphoplasmacytic lymphoma. Small lymphocytic lymphoma is included in the CLL group.

#

Classified according to Sorror et al.2

**

Classified according to Weisdorf et al.1

Validation of the original DRI

To validate the original DRI, patients were assigned to a DRI category as previously described3 (Table 2). Because patients with NHL or HL in first partial remission (PR) and subsequent PR were grouped together in this data set, all patients with NHL or HL in PR were categorized as low-risk stage, regardless of conditioning intensity, for this validation. Patients with BL who were not included in the original DRI were excluded from the validation step but were included in the refinement step. The results are shown in Figure 1. The DRI provided a highly significant risk stratification, with 2-year OS of 64% (95% confidence interval [95% CI], 61% to 67%) for patients in the low-risk group (12% of the cohort), 51% (95% CI, 50% to 53%) in the intermediate-risk group (64% of patients), 34% (95% CI, 32% to 36%) in the high-risk group (23% of patients), and 24% (95% CI, 17% to 31%) in the very-high-risk group (1% of patients) (P < .0001 for all pairwise comparisons between groups). We also built a multivariable model for OS by using the DRI categories and other baseline variables. As shown in Table 3, the DRI was associated with highly significant differences in OS, with an HR for mortality of 2.97 (95% CI, 2.44 to 3.62) in the very-high-risk group compared with the low-risk group (P < .0001); in this cohort, the DRI was the variable most strongly associated with survival.

Table 2.

Summary of disease and stage risk groups from original DRI

Disease Disease risk
 AML favorable cytogenetics Low
 CLL
 CML
 Indolent B-cell NHL
 ALL Intermediate
 AML intermediate cytogenetics
 MDS intermediate cytogenetics
 Myeloproliferative neoplasms
 Multiple myeloma
 HL
 DLBCL/transformed indolent B-cell NHL
 Mantle cell lymphoma
 Low-risk T-cell lymphoma
 AML adverse cytogenetics High
 MDS adverse cytogenetics
 High-risk T-cell lymphoma
Stage Stage risk
 Any CR Low
 PR (including improved but <CR MDS/MPN)
 Untreated
 Chronic phase CML
 Induction failure High
 Active relapse
 Accelerated or blast phase CML
Overall assignment
Disease risk Stage risk DRI assignment
Low Low Low
Low High Intermediate
Intermediate Low
Intermediate High High
High Low
High High Very high

For this study, patients in first and subsequent PR were grouped and assigned to low risk regardless of conditioning intensity. BL was excluded from the cohort for this validation because it was not included in the original DRI.

DLBCL, diffuse large B-cell lymphoma; MPN, myeloproliferative neoplasms.

Figure 1.

Figure 1

OS in entire cohort, stratified by the original DRI.

Table 3.

Multivariable analysis for OS

Variable OS
HR P
Risk group (original DRI)
 Low Ref
 Intermediate 1.46 <.0001
 High 2.30 <.0001
 Very high 2.97 <.0001
Age, years
 <40 Ref
 40-49 1.11 .006
 50-64 1.23 <.0001
 ≥65 1.47 <.0001
Donor/graft source*
 PB MRD Ref
 BM MRD 0.94 .45
 PB MUD WM 1.12 .0002
 BM MUD WM 1.06 .32
 PB MUD PM 1.41 <.0001
 BM MUD PM 1.35 .002
 PB MMRD 1.79 <.0001
 BM MMRD 1.11 .33
 PB MMUD 1.51 .001
 BM MMUD 1.96 .020
 Cord blood 1.78 <.0001
Conditioning NA NA
 MAC
 RIC
CMV serostatus
 Recipient-negative Ref
 Recipient-positive 1.12 <.0001
Gender
 Male to male Ref
 Female to male 1.01 .69
 Female to female 0.92 .026
 Male to female 0.94 .050
HCT-CI
 0 Ref
 1-2 1.09 .006
 3+ 1.30 <.0001
KPS at HCT
 90-100 Ref
 <90 1.41 <.0001
 Missing 1.20 .005

MMRD, mismatched related donor; MMUD, mismatched unrelated donor; MUD, matched unrelated donor; NA, not applicable; PM, partially matched; Ref, reference; WM, well-matched.

*

All possible combinations of graft source and donor match were considered separately in this model because there was a significant interaction between bone marrow graft and mismatched related donor when graft source and donor match were considered independently.

Models are stratified on conditioning intensity, so no HRs or P values are provided for those variables.

Includes 308 patients with missing information whose outcome is similar to that of patients with HCT-CI of 0 in this study.

Subgroup validation

We built 17 distinct multivariable models for patient subgroups of interest: patients who were conditioned with MAC regimens, were conditioned with RIC/NMA regimens, received peripheral blood grafts, received bone marrow grafts, received umbilical cord blood grafts, were younger than age 40 years, were age 40 to 64 years, were age 65 years or older, were transplanted from a matched sibling, were transplanted from a well-matched unrelated donor,1 were transplanted from a partially matched unrelated donor, were transplanted from a mismatched donor, had a KPS <90% at HCT, had a KPS of 90% to 100%, were CMV seropositive or were transplanted from a seropositive donor, were CMV seronegative or were transplanted from a seronegative donor, and patients with any disease except for AML. In each of these 17 distinct subgroups, the 4 DRI risk groups were significantly different (P < .05), with HRs of 1.3 to 1.7 for intermediate-risk, 1.9 to 2.8 for high-risk, and 2.5 to 4.7 for very-high-risk subgroups, all compared with low-risk DRI.

Assessment of other prognostic factors

By using the multivariable models previously mentioned, we examined the impact of the AML and MDS cytogenetics grouping schemes used here on OS. In a model restricted to patients with AML, the HRs for mortality associated with adverse (n = 251) and favorable (n = 224) cytogenetics were 1.5 (95% CI, 1.3 to 1.8) and 0.6 (95% CI, 0.5 to 0.8), respectively, (both P < .0001) compared with intermediate-risk cytogenetics (n = 4838). In a model for patients with MDSs, the HR for adverse (n = 245) compared with intermediate-risk cytogenetics (n = 1294) was 1.2 (95% CI, 1.0 to 1.4; P = .050). We also specifically examined the outcome of patients with t(8;21) AML (n = 95) who were categorized as low-risk in the original HCT cytogenetics study10 and intermediate-risk in a subsequent validation study.7 In this cohort, they were best classified as low risk (HR, 0.6 [95% CI, 0.5 to 0.9] compared with normal karyotype AML; P = .009). Cytogenetics in ALL (as described in “Methods”) were not associated with outcome: HR, 0.9 (95% CI, 0.7 to 1.1) for Ph-positive ALL (n = 184 patients) compared with Ph-negative/non t(4;11) (P = .4). To ensure that the DRI could also be used with non-HCT–specific cytogenetics assignment, we repeated the validation using the cytogenetics risk grouping scheme from the Medical Research Council/National Cancer Research Institute11 for AML and the International Prognostic Scoring System (IPSS)12 for MDSs. The HRs for mortality in the multivariable models for the intermediate-risk (1.4; 95% CI, 1.2 to 1.5), high-risk (2.0; 95% CI,1.9 to 2.2), and very-high-risk (2.6; 95% CI, 2.2 to 3.1) groups compared with the low-risk group remained highly significant (all pairwise P < .005), although this decreased the stratification ability of the DRI slightly (c-statistic, 0.631 using the Medical Research Council and IPSS cytogenetics schemes vs 0.637 using the schemes from the original DRI).

Among the 2381 patients with lymphoma, 889 (37%) had undergone a prior autologous hematopoietic cell transplantation (61% of patients with NHL and 39% of those with HL). This did not affect survival in the multivariable model (HR,1.1; 95% CI, 1.0 to 1.2; P = .2), even in models restricted to NHL or to HL. Moreover, there was no significant interaction between DRI assignment and prior autologous hematopoietic cell transplantation. For all subtypes of lymphoma examined, patients with second complete remission (CR2+) had outcomes indistinguishable from those in first CR (CR1), and better than those in any PR. Finally, we re-examined the assignment of risk categories for T-cell NHL and found in this study that only enteropathy-associated T-cell lymphoma and extranodal natural killer cell T-cell lymphoma appeared to be at significantly higher risk than others when adjusted for disease status at HCT; however, because this was based on a small number of patients, all patients with T-cell NHL were grouped together for subsequent analyses.

Refinement of the DRI

We used the large size of this cohort to entirely rebuild the DRI in order to confirm the previous assignments, especially for rarer disease type-disease status combinations. For this analysis, we randomly split the cohort into a training set (three-fourths of the patients) and a testing set (the remaining one-fourth of the patients). In the training cohort, we used the multivariable model described earlier but included separate terms for every possible disease type-disease status combination. We did not make any assumptions about disease groupings in the model; however, categories with similar outcomes were regrouped for presentation of the final results. In this case, we did not build separate models for MAC and RIC/NMA patients, as we had for the original DRI.3 We assumed that a DRI built on the entire cohort would be independent of conditioning intensity and not subject to the possible selection bias associated with the choice of conditioning regimen (see “Discussion”). Risk groups were established on the basis of proportional increases in the HR for mortality, centered on AML intermediate cytogenetics in CR1. The results are shown in Table 4, which also shows the original DRI assignment for comparison. We then tested the performance of this refined DRI in the testing cohort compared with the original DRI and with other proposed disease-risk stratification schemes. The multivariable c-statistics (with higher values denoting improved predictive ability) were 0.635 for the scheme used in the Patient Activation Measure score,13 0.626 for the stratification scheme used in Blood and Marrow Transplant Clinical Trials Network trials,14 0.637 for the original DRI, and 0.643 for the refined DRI. Similarly, the reduction in AIC (with a larger reduction implying better model fit) for the refined DRI was 126 compared with 81 for the Patient Activation Measure score, 37 for the Blood and Marrow Transplant Clinical Trials Network scheme, and 86 for the original DRI. The survival of patients in the entire cohort stratified by using the refined DRI is shown in Figure 2A; Figure 2B shows the survival of patients in both the training and testing cohorts stratified by the refined DRI.

Table 4.

Refinement of the DRI

Disease Stage No. of patients HR* Original DRI Percentage of patients New DRI Group 2-y OS (%) 95% CI
Hodgkin lymphoma CR 126 0.36 Int 14 Low 66 63-68
CLL CR 81 0.47 Low Low
Mantle cell lymphoma CR 160 0.51 Int Low
Indolent NHL CR 183 0.53 Low Low
AML favorable cytogenetics CR 190 0.64 Low Low
Indolent NHL PR 276 0.71 Low Low
CLL PR 400 0.78 Low Low
CML chronic phase 1/2 390 0.82 Low Low
CML advanced phase 69 0.92 Int 63 Int 51 50-52
Mantle cell lymphoma PR 149 0.95 Int Int
Myeloproliferative neoplasm Any 426 0.98 Int Int
AML intermediate cytogenetics CR 3611 Ref Int Int
ALL CR1 1023 1.00 Int Int
T-cell NHL CR 171 1.00 Int Int
Multiple myeloma CR/VGPR/PR 339 1.03 Int Int
Aggressive NHL CR 181 1.05 Int Int
Low-risk MDS adverse cytogenetics Early 103 1.06 High Int
T-cell NHL PR 164 1.06 Int Int
Low-risk MDS intermediate cytogenetics Early 516 1.09 Int Int
HL PR 225 1.09 Int Int
Low-risk MDS intermediate cytogenetics Advanced 235 1.18 Int Int
Indolent NHL Advanced 128 1.21 Int Int
CLL Advanced 265 1.22 Int Int
High-risk MDS intermediate cytogenetics Early 364 1.24 Int Int
Aggressive NHL PR 205 1.26 Int Int
T-cell NHL Advanced 93 1.41 High 20 High 33 31-35
AML favorable cytogenetics Advanced 34 1.42 Int High
HL Advanced 85 1.48 High High
High-risk MDS intermediate cytogenetics Advanced 179 1.56 Int High
High-risk MDS adverse cytogenetics Early 80 1.58 High High
ALL CR2 407 1.58 Int High
AML adverse cytogenetics CR 175 1.59 High High
Mantle cell lymphoma Advanced 46 1.59 High High
High-risk MDS adverse cytogenetics Advanced 30 1.59 Very high High
BL CR 23 1.65 NA High
Multiple myeloma Advanced 150 1.65 High High
ALL CR3 61 1.70 Int High
Low-risk MDS adverse cytogenetics Advanced 32 1.86 Very high High
AML intermediate cytogenetics Advanced 1227 1.89 High High
CML blast phase 52 2.02 Int 4 Very high 23 20-27
ALL Advanced 235 2.23 High Very high
Aggressive NHL Advanced 154 2.54 High Very high
AML adverse cytogenetics Advanced 76 2.83 Very high Very high
BL PR Advanced 12 5.21 NA Very high

Int, intermediate.

*

Hazard ratio for mortality compared with AML intermediate cytogenetics in CR1.

Advanced stage refers to induction failure or active relapse, including stable or progressive disease for NHL, HL, and CLL.

Those categories were not included in the original DRI.

Figure 2.

Figure 2

OS, stratified by new DRI. (A) Stratified by new DRI groups. (B) Stratified by new DRI groups, separated in training and testing cohorts. (C) Stratified by new DRI groups and conditioning intensity.

In multivariable models using this refined DRI, there were no significant interactions between the risk group assignment and any of the other model variables. The new DRI stratified patients regardless of conditioning intensity (Figure 2C). Nonetheless, we repeated the refinement process in the MAC and RIC/NMA cohorts separately. The resultant risk groups were identical in the 2 conditioning groups, with one exception. For lymphoma, patients in PR who underwent RIC HCT had an outcome similar to those in CR, but patients in PR who underwent MAC HCT had a worse outcome than those in CR. Despite this, and as discussed further, we chose to keep the refined DRI independent of conditioning intensity, as shown in Table 4.

For small size cohorts, the very-high-risk group, which will often comprise only a small proportion of patients, can be merged with the high-risk group, and the DRI can be used as a 3-group breakdown. To approximate what cohort size would work best in a data set like this one, we performed simulations by randomly selecting subcohorts of various sample sizes from our entire cohort (see “Methods”). The results (not shown) indicate that a 3-group breakdown (merging the very-high-risk and high-risk groups) is preferable for a cohort size of 300 or smaller, and a 4-group breakdown is preferable for cohort sizes larger than 300.

Discussion

To validate and refine the original DRI, we used a cohort of more than 13 000 patients. The original DRI, which was built on training and validation groups of patients completely separate from the one used in this study, performed very well in this cohort (Figure 1). It stratified patients into 4 groups with very different OS and provided the strongest prognostic information in this cohort. It was also very stable across age groups, conditioning intensity, graft source, and donor type. As discussed in the original proposal for the DRI, we envision that this tool should not be fixed but should instead be refined by the transplant community as new information becomes available. Thus, we anticipate that in the future, molecular information (eg, FLT-3 internal tandem duplication status for AML and MYC/BCL-2 status for aggressive B-cell NHL) will be incorporated and will alter the risk assignments. We therefore used the large sample size of this cohort to begin this refinement process. For example, we added BL to the DRI; we were also able to clarify the assignment of some of the patients with rarer diseases in HCT cohorts, such as HL or mantle cell lymphoma (Table 4). We also found a similar survival for ALL patients regardless of Ph status, as preliminarily suggested in other studies.15,16 Moreover, we show that patients with MDSs are best stratified based on the combination of blast percentage, cytogenetics, and response to prior therapy; this breakdown provided better discrimination than the IPSS12 in our cohort (data not shown). Despite these changes, the OS rates in each group of this refined DRI are similar to those in the original DRI. In this large data set, we were also able to refine the DRI and confirm that this refined DRI performed slightly better than the original one in a testing cohort separate from the training cohort. The differences in c-statistics between the schemes compared here are relatively small. This reflects in part the use of a multivariable c-statistic, in which the difference in predictive power of the disease risk variable is further diluted by the predictive power of other variables in the model. Indeed, univariable c-statistic calculations showed the same ranking of predictive ability but with larger differences (data not shown). The differences in model fit as assessed with the AIC yielded the same ranking. Overall, the purpose of those analyses was only to compare different schemes in an independent cohort, and the results suggest that the refined DRI appears to outperform existing schemes, likely because it was derived by using the largest cohort of patients of all 4 schemes with no preexisting assumptions.

The DRI should be collapsed to a 3-group system for smaller studies, with the very-high-risk and the high-risk groups merged. The choice between using the 3- or 4-group index, or collapsing further, should depend on the characteristics of the particular cohort under analysis, considering such factors as duration of follow-up, number of events, and survival distribution; the choice involves a trade-off between getting more accurate estimates of risk with more groups and the additional variability in estimation resulting from using smaller groups. In general, one rule of thumb may be that each category requires at least 10 events, where events are patient deaths if the end point is OS. Even with 10 events in each category, if the HRs are similar between two categories, it would be preferable to merge these two categories to increase power. In our simulation, a 3-group system appeared optimal for cohorts of fewer than ∼300 patients, but we emphasize that the choice of a 3- or 4-group breakdown is best dictated by the characteristics of the cohort under study and that the numbers provided here are to be used only as a general guide. When using the DRI for stratified randomization in a clinical trial, the 3-group version will likely be most useful, especially if other stratification variables are used.

The issue of conditioning intensity is noteworthy. As described previously, and in a remarkable concordance between the original DRI and this study, the risk assignment of patients with lymphoma in PR seemed to differ according to conditioning intensity, with a higher risk (relative to patients in CR) only for patients receiving MAC. This runs somewhat contrary to the common assumption that patients with more chemotherapy-refractory disease benefit from higher conditioning intensity. There are at least 2 possible explanations for this finding. One is that it reflects selection bias, because patients with more aggressive disease, as assessed clinically but not in a way that can be captured with the variables available to us, may be preferentially treated with MAC HCT. The other is that, in fact, such patients, because they are more chemotherapy refractory, may not benefit from the additional cytotoxicity of a MAC regimen and may suffer from the additional toxicity of those regimens without a commensurate benefit in disease control. Unfortunately, the nature of our studies does not allow us to distinguish those possibilities, which must be left to further prospective trials. For this reason, and also to keep the DRI separate from the choice of transplantation method, we chose to keep the assignments independent of conditioning intensity, unlike in our original study. More broadly, we emphasize that this study was not meant to compare the outcomes of MAC and RIC hematopoietic stem cell transplantation, or to suggest an optimal conditioning regimen for any group of patients. The strong likelihood of selection bias in the choice of conditioning regimen would preclude this type of analysis in a retrospective study like ours. In this respect, the salient conclusion of this study is therefore not that conditioning intensity does not have an impact on hematopoietic stem cell transplantation outcome, but that the DRI applies to patients regardless of conditioning intensity.

The derivation of the DRI was based solely on OS and did not directly take into consideration relapse risk. Relapse and time to relapse are more difficult to reliably ascertain in multicenter studies, given the different possible definitions of relapse (eg, molecular, cytogenetic, hematologic, radiologic, clinical). Obtaining such data without a detailed analysis of relapse time and ascertainment method for each patient would be unreliable, and obtaining it with the appropriate quality control for a cohort this size would be prohibitively time-consuming. Moreover, not only the risk but also the tempo of relapse may be quite different between different diseases. It is therefore questionable whether the type of analysis performed for this study would be appropriate in a cross-disease analysis. Finally, OS is the least subjective HCT outcome and is usually very closely associated with progression-free survival anyway, reflecting the poor outcomes of patients who relapse after HCT.5 For those reasons, we chose to focus exclusively on OS.

It is also conceivable that longer follow-up could alter the assignments but, again, this is unlikely in an HCT cohort because disease-related variables meet the proportional hazards assumption so well in this cohort and in general in HCT studies. The chosen cohort allowed us to validate and refine the DRI for patients transplanted in the most recent timeframe that allows collection of 2-year survival outcome, which is particularly useful because year of transplantation has been clearly associated with HCT outcome.17

We propose this refined DRI as a system for risk-stratifying heterogeneous populations of patients undergoing HCT, based on OS. This index applies regardless of age, conditioning regimen, donor type, and graft source. We anticipate that the DRI will be used by clinicians for prognostication and by researchers to improve the quality of retrospective studies and prospective clinical trials that enroll patients across disease type and disease status categories. We also hope that this work encourages the performance of HCT trials that are not disease specific, at least when the success of the investigational treatment is not likely to be disease specific, by removing the concerns over outcome variability introduced by disease type and disease status heterogeneity.

Acknowledgments

This work was supported by an American Society of Clinical Oncology Career Development Award (P.A.), by grant U19 AI 29530 from the National Institute of Allergy and Infectious Diseases (NIAID), and grant PO1 HL 070149 from the National Heart, Lung and Blood Institute (NHLBI). The Center for International Blood and Marrow Transplant Research is supported by Public Health Service grant/cooperative Agreement U24-CA76518 from the National Cancer Institute (NCI), the NHLBI, and the NIAID, by grant/cooperative agreement 5U01HL069294 from NHLBI and NCI, by contract HHSH234200637015C with Health Resources and Services Administration/Department of Health and Human Services), by grants N00014-06-1-0704 and N00014-08-1-0058 from the Office of Naval Research, and by grants from Allos, Inc.; Amgen, Inc.; Angioblast; an anonymous donation to the Medical College of Wisconsin; Ariad; Be the Match Foundation; Blue Cross and Blue Shield Association; Buchanan Family Foundation; CaridianBCT; Celgene Corporation; CellGenix, GmbH; Children’s Leukemia Research Association; Fresenius-Biotech North America, Inc.; Gamida Cell Teva Joint Venture Ltd.; Genentech, Inc.; Genzyme Corporation; GlaxoSmithKline; Kiadis Pharma; The Leukemia & Lymphoma Society; The Medical College of Wisconsin; Millennium Pharmaceuticals, Inc.; Milliman USA, Inc.; Miltenyi Biotec, Inc.; National Marrow Donor Program; Optum Healthcare Solutions, Inc.; Otsuka America Pharmaceutical, Inc.; Seattle Genetics; Sigma-Tau Pharmaceuticals; Soligenix, Inc.; Swedish Orphan Biovitrum; Therakos, Inc.; and Wellpoint, Inc.

Footnotes

The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

Authorship

Contribution: P.A. designed the research, analyzed the data, and wrote the paper; H.T.K. designed the research, analyzed the data, and edited the paper; B.R.L. designed the research, collected and analyzed the data, and edited the paper; Z.W. collected and analyzed the data and edited the paper; E.P.A., M.E.K., and R.T.M. edited the paper and served as cochairs of the Center for International Blood and Marrow Transplant Research Chronic Leukemia Working Committee; J.H.A., R.J.S., D.J.W., J.D.R., and M.M.H. designed the research and edited the paper; and W.S. designed the research, analyzed the data, and edited the paper.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Correspondence: Philippe Armand, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston MA 02215; e-mail: parmand@partners.org.

References

  • 1.Weisdorf D, Spellman S, Haagenson M, et al. Classification of HLA-matching for retrospective analysis of unrelated donor transplantation: revised definitions to predict survival. Biol Blood Marrow Transplant. 2008;14(7):748–758. doi: 10.1016/j.bbmt.2008.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Sorror ML, Maris MB, Storb R, et al. Hematopoietic cell transplantation (HCT)-specific comorbidity index: a new tool for risk assessment before allogeneic HCT. Blood. 2005;106(8):2912–2919. doi: 10.1182/blood-2005-05-2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Armand P, Gibson CJ, Cutler C, et al. A disease risk index for patients undergoing allogeneic stem cell transplantation. Blood. 2012;120(4):905–913. doi: 10.1182/blood-2012-03-418202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Slack JL, Dueck AC, Fauble VD, et al. Reduced toxicity conditioning and allogeneic stem cell transplantation in adults using fludarabine, carmustine, melphalan, and antithymocyte globulin: outcomes depend on disease risk index but not age, comorbidity score, donor type, or human leukocyte antigen mismatch. Biol Blood Marrow Transplant. 2013;19(8):1167–1174. doi: 10.1016/j.bbmt.2013.05.001. [DOI] [PubMed] [Google Scholar]
  • 5.Thanarajasingam G, Kim HT, Cutler C, et al. Outcome and prognostic factors for patients who relapse after allogeneic hematopoietic stem cell transplantation. Biol Blood Marrow Transplant. 2013;19(12):1713–1718. doi: 10.1016/j.bbmt.2013.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Horowitz M. The role of registries in facilitating clinical research in BMT: examples from the Center for International Blood and Marrow Transplant Research. Bone Marrow Transplant. 2008;42(Suppl 1):S1–S2. doi: 10.1038/bmt.2008.101. [DOI] [PubMed] [Google Scholar]
  • 7.Armand P, Kim HT, Zhang MJ, et al. Classifying cytogenetics in patients with acute myelogenous leukemia in complete remission undergoing allogeneic transplantation: a Center for International Blood and Marrow Transplant Research study. Biol Blood Marrow Transplant. 2012;18(2):280–288. doi: 10.1016/j.bbmt.2011.07.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Armand P, Deeg HJ, Kim HT, et al. Multicenter validation study of a transplantation-specific cytogenetics grouping scheme for patients with myelodysplastic syndromes. Bone Marrow Transplant. 2010;45(5):877–885. doi: 10.1038/bmt.2009.253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Harrell FE, Jr, Lee KL, Califf RM, Pryor DB, Rosati RA. Regression modelling strategies for improved prognostic prediction. Stat Med. 1984;3(2):143–152. doi: 10.1002/sim.4780030207. [DOI] [PubMed] [Google Scholar]
  • 10.Armand P, Kim HT, DeAngelo DJ, et al. Impact of cytogenetics on outcome of de novo and therapy-related AML and MDS after allogeneic transplantation. Biol Blood Marrow Transplant. 2007;13(6):655–664. doi: 10.1016/j.bbmt.2007.01.079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Grimwade D, Hills RK, Moorman AV, et al. National Cancer Research Institute Adult Leukaemia Working Group. Refinement of cytogenetic classification in acute myeloid leukemia: determination of prognostic significance of rare recurring chromosomal abnormalities among 5876 younger adult patients treated in the United Kingdom Medical Research Council trials. Blood. 2010;116(3):354–365. doi: 10.1182/blood-2009-11-254441. [DOI] [PubMed] [Google Scholar]
  • 12.Greenberg P, Cox C, LeBeau MM, et al. International scoring system for evaluating prognosis in myelodysplastic syndromes. Blood. 1997;89(6):2079–2088. [PubMed] [Google Scholar]
  • 13.Parimon T, Au DH, Martin PJ, Chien JW. A risk score for mortality after allogeneic hematopoietic cell transplantation. Ann Intern Med. 2006;144(6):407–414. doi: 10.7326/0003-4819-144-6-200603210-00007. [DOI] [PubMed] [Google Scholar]
  • 14.Anasetti C, Logan BR, Lee SJ, et al. Blood and Marrow Transplant Clinical Trials Network. Peripheral-blood stem cells versus bone marrow from unrelated donors. N Engl J Med. 2012;367(16):1487–1496. doi: 10.1056/NEJMoa1203517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mohty M, Labopin M, Volin L, et al. Acute Leukemia Working Party of EBMT. Reduced-intensity versus conventional myeloablative conditioning allogeneic stem cell transplantation for patients with acute lymphoblastic leukemia: a retrospective study from the European Group for Blood and Marrow Transplantation. Blood. 2010;116(22):4439–4443. doi: 10.1182/blood-2010-02-266551. [DOI] [PubMed] [Google Scholar]
  • 16.Postow M, Kim H, Sun L, et al. Philadelphia chromosome is not an adverse prognostic factor in acute lymphoblastic leukemia after myeloablative allogeneic hematopoietic stem cell transplantation [abstract]. Biol Blood Marrow Transplant. 2011;17(2):S244–S245. [Google Scholar]
  • 17.Gooley TA, Chien JW, Pergam SA, et al. Reduced mortality after allogeneic hematopoietic-cell transplantation. N Engl J Med. 2010;363(22):2091–2101. doi: 10.1056/NEJMoa1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Blood are provided here courtesy of The American Society of Hematology

RESOURCES