Abstract
Clinical trials commonly use adjudication committees to refine endpoints, but observational research or genome-wide association studies rarely do. Our goals were to establish definitions of cause-specific death after unrelated donor allogeneic hematopoietic cell transplantation (URD-HCT), estimate discordance between reported and adjudicated cause-specific death, and identify factors contributing to inconsistency in cause-specific deathdetermination. A consensus panel adjudicated cause-specific deathin 1,484 patients who died within 1 year after HCT, derived from 3,532 acute leukemia or myelodysplasia patients after URD-HCT 2000-2011 reported by 151 U.S. transplant centers to CIBMTR. Deaths were classified as disease-related (DRM) or transplant-related (TRM). The panel agreed with >99% of deaths reported by centers as DRM and 80% reported as TRM. Year of transplant (cohort effect) and disease status significantly influenced agreement between panel and centers. Sensitivity analysis of deaths <100 days post-transplant yielded lowest agreement between the panel and centers for myelodysplastic syndrome patients. Standard pre-defined criteria for adjudicating cause-specific deathled to consistent application to similar clinical scenarios and clearer delineation of cause-specific deathcategories. Other studies of competing events like cancer-specific vs treatment-related mortality would benefit from our results. Our detailed algorithm should result in more consistent reporting of cause-specific deathby centers.
Keywords: Allogeneic HCT, Adjudication, Cause-specific mortality, Acute leukemia
INTRODUCTION
Allogeneic hematopoietic cell transplantation (HCT) offers the only curative therapy for some hematologic malignancy/disorder patients, but has a 1-year post-HCT mortality rate exceeding 30%.1 Allogeneic HCT can precipitate a multi-factorial cascade of events, the sequence and severity of which differs between patients. Not all patients who die after HCT experience all potential post-HCT events. Moreover, surviving patients may experience a similar sequence of events as those who did not survive. Patients’ clinical courses can significantly differ across and within cause-specific mortality. Additionally, co-morbidities carry their own risks and are difficult to discern from HCT-specific causes of death.
As an example, a patient with moderate (not severe) graft-versus host disease (GvHD) treated with multiple systemic immunosuppressive agents develops an infection and dies. The initiating (primary) and contributory (secondary) causes are not easily delineated, leading to ambiguity of whether GvHD or Infection should be reported as the primary cause-specific death. Patients with severe GvHD, requiring prolonged immunosuppression might die of severe GvHD without infection. Likewise, patients can die of infection in the absence of GvHD. Hence GvHD and Infection are not always concurrent causes of death. Discerning the initiating and contributing cause-specific deaths are critical in HCT patients who often have competing and correlated outcomes.
Endpoint Assessment Committees are often used to determine clinical trial endpoints,2-3 but are rarely used for observational research. The only published study investigating cause-specific death (GvHD, infection, disease, other) within the first year post-HCT used the primary cause of death reported by the transplant center.4 Two additional studies examined cause-specific death in HCT patients who had survived beyond 2 and 5 years post-HCT.5-6 The first study defined outcomes (death due to disease recurrence, GvHD or infection) but did not review or adjudicate individual cases,5 while the second study used cause of death reported per the National Death Index in addition to review of medical data for individual cases.6 These methods work for landmark analyses or observational studies that describe changes over time, but genetic studies investigating cause-specific deaths that are incorrectly or inconsistently assigned could result in biased estimation of the association between genetic variants and each cause.7
In preparation for a genome-wide association study (GWAS) of cause-specific mortality after unrelated donor (URD) allogeneic HCT, we convened a consensus panel to review and adjudicate cause-specific deaths to reduce endpoint misclassification and subsequent over- or under-estimation of genetic effects. Our on-going GWAS named DISCOVeRY-BMT: Determining the Influence of Susceptibility-COnveying Variants Related to 1-Year mortality after unrelated donor Blood and Marrow Transplant is designed to investigate donor and recipient genetic factors that contribute to 1-year cause-specific mortality after URD-HCT. We report our cause-specific death definitions, process for adjudication and degree of concordance between the causes of death reported by individual transplant centers and the consensus panel.
SUBJECTS AND METHODS
Research Ethics
All patients and donors provided written informed consent for their clinical data to be used for research purposes and were not compensated for their participation. This study was reviewed and approved by the Roswell Park Cancer Institute Institutional Review Board. All patient data were de-identified. Summary data are provided in this manuscript, with the exception of Supplementary Tables 4 and 5 which contain patient-specific data that have been altered slightly to further protect patient identity and confidentiality.
Study Population
Two independent cohorts were studied to determine the consistency of adjudication results. These cohorts were defined as a training and validation cohort for the main genome-wide association study.
Cohort 1
This cohort included 2,609 10/10 HLA-matched, first, T-cell replete, URD-HCT recipients treated with myeloablative or reduced intensity conditioning regimens from 2000-2008 for acute myeloid or lymphocytic leukemia (AML, ALL) or myelodysplastic syndrome (MDS) who were reported to the Center for International Blood and Marrow Transplant Research (CIBMTR) and had banked biorepository samples from recipient and donor.8 Of 2,609 patients, 1,116 (43%) died within 1-year after HCT.
Cohort 2
This cohort includes 572 patients who had an URD-HCT between 2009-2011 who otherwise met the same criteria as Cohort 1, together with 351 patients who were 8/8 HLA-matched (HLA-DQB1 not assessed) URD-HCT between 2000-2011, but otherwise met the same criteria as Cohort 1. Of 923 patients in Cohort 2, 368 (40%) died within 1-year after HCT.
Patient data and recipient/donor blood samples were contributed by 151 transplant centers. Procedures for the completion and review of CIBMTR data collection forms, as well as cause(s) of death, differ by transplant center. The goal of adjudication was to reduce variability in ascertaining the cause-specific deaths in patients with similar sequences of events leading to death.
Cause of Death Adjudication
The CP consisted of 2 adult HCT physicians (MP, PLM), a pediatric hematologist/oncologist (KO), and a HCT clinical epidemiologist (TH). Causes of death and additional action plans (e.g. request for clinical information from the transplant center) were recorded for each case by an independent co-investigator (XZ) using pre-specified nomenclature and notation. Adjudication of Cohort 1 was completed over 8 months via 3 in-person meetings at CIBMTR (Milwaukee WI) and weekly teleconferences. Adjudication for Cohort 2 was completed over 2 days via an in-person session at CIBMTR.
Case report form summaries were provided to the consensus panel and included detailed data summarized in Supplementary Table 1. Each clinical summary was discussed by the consensus panel using information available in submitted forms, autopsy reports, error correction forms and source documents. Discussions continued until a unanimous consensus was reached regarding the causes of death or that additional information was needed from the transplant center. When additional information was needed before adjudicating cause-specific death, up to 3 data queries requesting source documentation or forms data clarification were submitted to the transplant center.
Cause of Death Category Definitions
The primary cause of death was broadly defined as “Disease-Related Mortality” (DRM - related to leukemia/MDS relapse/progression, including death due to toxicity or infection from post-HCT anti-leukemic therapy) or “Transplant-Related Mortality” (TRM – any cause of death not included in DRM), similar to previous HCT studies.3-6 TRM subtypes were further classified as GvHD, Infection, Organ Failure, and Other. Cause-specific deaths were categorized in a hierarchical manner: Disease, GvHD, Infection, Organ Failure then Other, in descending priority.
Table 1 provides detailed definitions and description of clinical scenarios. Briefly, DRM included documented post-HCT disease progression, relapse or death before day +30 post-HCT in patients with a high disease burden pre-transplant. Autopsy confirming presence of disease was coded as DRM. Treatments such as re-induction chemotherapy, donor lymphocyte infusion and second HCT after the index HCT may have caused “TRM-like” deaths however were coded as DRM due to the hierarchical structure and priority for the cause-specific death definitions.
Table 1.
Definition of Clinical Scenarios for Adjudication of Primary and Secondary Causes of Death
| Primary Cause- specific death |
Secondary cause-specific death |
Definition of Clinical Scenarios |
|---|---|---|
| Disease | None |
|
| GVHD | None |
|
| Infection | None |
|
| Organ Failure | None |
|
| Other | None |
|
| GvHD | Infection |
|
| GvHD | Infection, Organ Failure |
|
| GvHD | Organ Failure |
|
| GvHD | Other |
|
| Infection | GvHD |
|
| Infection | Other |
|
| Other | Infection |
|
| Other | Organ Failure |
|
Legend: CR: Complete Remission, HCT: hematopoietic cell transplantation, GvHD: graft-versus-host disease, GI: gastrointestinal tract, VOD/SOS: veno-occlusive disease/sinosidal obstructive syndrome, ARDS/IP: Adult respiratory distress syndrome/interstitial pneumonitis, TTP/HUS: thrombotic thrombocytopenic purpura/hemolytic uremic syndrome, CNS: central nervous system
GvHD deaths included severe acute or chronic GvHD with active treatment at time of death. Infection deaths included bacterial, viral, fungal, and/or protozoan infections causing end organ damage. Organ Failure deaths were defined as transplant-related toxicity not due to disease progression, GvHD or infection. Organ Failure included, for example, veno-occlusive disease/sinusoidal obstructive syndrome (VOD/SOS), non-infectious interstitial pneumonitis, adult respiratory distress syndrome (ARDS), myocardial infarction, and renal failure in the absence of infection and GvHD. “Other” causes of death included rare events: vascular events including hemorrhage or thrombosis (e.g. pulmonary emboli, stroke), secondary malignancies, primary or secondary graft failure, accident, suicide or unknown.
The consensus panel could include an unlimited number of secondary or contributing causes of death. Based on the hierarchical nature of the definitions, secondary causes were included only for TRM and were coded in the same categories as the primary cause (GvHD, Infection, Organ Failure, Other). Secondary causes contributed to death but were not as severe as the primary cause or were closer to the time of death. Rare exceptions (affecting ≤3 cases per category) to the rules were allowed for unusual patient circumstances.
Internal and External Validity
Internal validity was tested using two approaches.9 First, 11 sequential cases from the Cohort 1 were blindly re-reviewed 2 months later. Second, 25 non-sequential cases were randomly selected by a non-panel member (XZ), and blindly re-reviewed by the consensus panel after all cases were adjudicated.
External validity was measured using a fourth in-person meeting at CIBMTR with 2 adult HCT physicians not involved in the study (JAH, PJM). Twenty-one previously adjudicated simple and complex cases from Cohort 1 were selected by a consensus panel member (TH), then adjudicated in the same manner as prior panel meetings.
Statistical Analysis
A first order Agreement Coefficient (AC1) was used to assess transplant center/consensus panel agreement. As with Cohen’s Kappa statistic (K), AC1 values range from 0-1 with a higher value indicating greater concordance.10-11 K was not used as the primary agreement measure due to the unblinded nature of the consensus panel review10,12-13 and the disproportionate prevalence of the competing causes of death,11,14-16 AC1 confidence intervals were taken from the 0.025 and 0.975 quantiles of the bootstrapped AC1 distribution (n=10 000) of the observed transplant center/consensus panel cause-specific death contingency table.17 A Bias Index was calculated to demonstrate the proportion of the total sample that moved into (positive value) or out of (negative value) a cause-specific death category,18-19 and the Prevalence Index was used to measure the proportion of all cases in each transplant center/consensus panel agreement cell.12,14
Several analyses were performed to measure the association with, and dependencies between, covariates and transplant center/consensus panel agreement on primary cause-specific death. Logistic regression models were used to test for patient characteristics associated with transplant center/consensus panel agreement on broad categories of death (DRM vs TRM) and specific TRM categories (GvHD, Organ Failure, Infection). Quasi-symmetric log-linear models were constructed to assess dependencies of each covariate with transplant center/consensus panel agreement on cause-specific death.20-22 Covariates tested include recipient gender and age, year of transplant, disease (AML, ALL, MDS) and disease status (early, intermediate, advanced). All statistical analyses were performed using R statistical software.23
RESULTS
Demographics
Patient, disease and treatment characteristics are shown in Table 2 for both cohorts. Cohort 2 was older and heavier than Cohort 1, had more patients with MDS and advanced disease, less with ALL, and more peripheral blood grafts. Other characteristics were well balanced between the cohorts, with the exception of the planned differences in year of HCT based on the cohort ascertainment.
Table 2.
Recipient Characteristics of the First and Second Cohorts
| Recipient Characteristics |
First Cohort N=2609 n (%) |
Second Cohort N=923 n (%) |
P |
|---|---|---|---|
|
| |||
| Age, years | 0.002 | ||
| ≥40 | 1528 (59%) | 594 (64%) | |
| <40 | 1081 (41%) | 329 (36%) | |
|
| |||
| Male | 1475 (57%) | 504 (55%) | NS |
|
| |||
| Race | NS | ||
| White | 2442 (94%) | 867 (94%) | |
| Non-White | 114 (4%) | 44 (5%) | |
| Missing/unknown | 53 (2%) | 12 (1%) | |
|
| |||
| Disease | <0.001 | ||
| AML | 1535 (59%) | 572 (62%) | |
| ALL | 653 (25%) | 124 (13%) | |
| MDS | 421 (16%) | 227 (25%) | |
|
| |||
| AML Disease Status | <0.001 | ||
| Early | 677 (44%) | 310 (54%) | |
| Intermediate | 386 (25%) | 121 (21%) | |
| Advanced | 472 (31%) | 141 (25%) | |
|
| |||
| ALL Disease Status | NS | ||
| Early | 248 (38%) | 57 (50%) | |
| Intermediate | 278 (43%) | 42 (30%) | |
| Advanced | 127 (19%) | 25 (20%) | |
|
| |||
| MDS Disease Status | <0.001 | ||
| Early | 277 (75%) | 83 (37%) | |
| Intermediate | -- | -- | |
| Advanced | 144 (25%) | 144 (63%) | |
|
| |||
| KPS/LPS | NS | ||
| 90-100 | 1591 (61%) | 518 (56%) | |
| <80 | 758 (29%) | 279 (30%) | |
| Missing | 260 (10%) | 126 (14%) | |
|
| |||
| Year of HCT | -- | ||
| 2000-02 | 450 (17%) | 33 (4%) | |
| 2003-05 | 926 (35%) | 90 (9%) | |
| 2006-08 | 1233 (47%) | 121 (13%) | |
| 2009-11 | 0 | 679 (74%) | |
|
| |||
| BMI** | 0.011 | ||
| Underweight | 50 (2%) | 25 (3%) | |
| Normal | 977 (37%) | 293 (32%) | |
| Overweight | 811 (31%) | 310 (34%) | |
| Obese | 764 (29%) | 295 (32%) | |
| Missing | 7 (<1%) | 0 | |
|
| |||
| Graft Source | <0.001 | ||
| Peripheral Blood | 1648 (63%) | 660 (72%) | |
| Bone Marrow | 961 (37%) | 263 (28%) | |
Legend: NS: P>0.1, NA: Not applicable, AML: Acute Myeloid Leukemia, ALL: Acute Lymphoblastic Leukemia, MDS: Myelodysplastic Syndrome, Early: AML/ALL in CR1 or MDS RA/RARS, Intermediate: AML/ALL in CR2, Advanced: AML/ALL not in remission or MDS RAEB1/2, KPS: Karnofsky Performance Score, LPS: Lansky Performance Score, BMI: body mass index.
7 patients in cohort 1 and 52 patients in cohort 2 missing KPS/LPS were excluded;
BMI definitions for ≥18 years: underweight <18, normal 18 - <25, overweight 25- <30, obese ≥30 mg/kg2; BMI for ≥2 and <18 years: underweight <5th percentile, normal 5 - <85th percentile, overweight ≥85 - <95th percentile, obese ≥95th percentile; BMI for <2 years coded as normal. [Reference 27]
Data Queries and Resolutions
The consensus panel identified 99/1,484 (6.7%) cases as needing additional information from the transplant center in order to determine the causes of death. Transplant center responses were received for 76/99 (77%) of data queries, and cause-specific death was resolved by the consensus panel for all 76 cases. The remaining 23 cases with no response from the transplant center were re-reviewed by the consensus panel for final resolution of cause-specific death.
Internal and External Validity
All measures of internal and external validity showed excellent concordance for both primary and secondary cause-specific death. The 11 sequential cases demonstrated 100% agreement between first and second review, and the randomly selected 25 cases were 95% concordant on primary and secondary cause-specific death, including their order. External reviewers were in 100% agreement on all 21 previously adjudicated cases.
Primary Cause of Death – DRM vs. TRM
For Cohort 1, the primary cause-specific death categorized as DRM vs TRM demonstrated 87.5% agreement between the transplant center and consensus panel (AC1=0.75, 95% CI 0.73-0.80) and the Bias Index was 0.12. The consensus panel agreed with 463/465 cases (99.6% agreement) reported by the transplant center as DRM. The consensus panel agreed with 521/651 cases (80% agreement) reported by the transplant center as TRM. The consensus panel adjudicated the remaining 130 cases as DRM instead of TRM.
For Cohort 2, similar results demonstrated an overall agreement of 89% (AC1=0.78, 95% CI 0.71-0.84). The consensus panel agreed with the transplant center in 98.8% (162/164) for DRM and 81% (165/204) for TRM. The Bias Index was 0.10, indicating little difference in results of TRM/DRM adjudication between the two cohorts.
Primary Cause of Death – Disease vs. GvHD vs. Infection vs. Organ Failure vs. Other
Table 3 summarizes the transplant center/consensus panel agreement on DRM, GvHD, Infection, Organ Failure, and Other cause-specific death categories after adjudicating the primary causes of death for Cohorts 1 and 2, while Figure 1 graphically summarizes the same information for both cohorts combined. Supplementary Table 2 expresses the reclassification by the consensus panel in percentages and Supplementary Table 3 shows the statistical measures of agreement. The overall agreement for Cohort 1 considering all five cause-specific death categories was 73% (AC1=0.67, 95% CI 0.64-0.70). The consensus panel agreed with the transplant center 80 and 99% for GvHD and Disease deaths with the majority of discordance in cases reported by the transplant center as Organ Failure or Other (Table 3, Supplementary Table 2, and Figure 1). Of the 220 cases reported by the transplant center as Organ Failure, 115 (52%) were reclassified by the consensus panel as Disease (mainly, organ failure due to additional therapy for post-HCT relapse), GvHD (which led to multi-organ failure), Infection (which led to sepsis/multi-organ failure) or Other (Table 3, Figure 1). For the 113 cases reported by the transplant center as Other cause-specific death, the consensus panel was able to classify 75 (67%) of them into a more specific category (Table 3, Supplementary Tables 2 and 3, Figure 1).
Table 3.
Distribution of the Most Common Categories of Primary Cause of Death Within One Year after Transplant, Reported by the Transplant Center vs. Adjudicated by the Consensus Panel
| Consensus Panel – Cohort 1 | N (%) Agreement |
||||||
|---|---|---|---|---|---|---|---|
| Primary Cause- specific death |
Disease | GvHD | Infection | Organ Failure | Other | ||
| Transplant Center | Disease | 463 | 0 | 0 | 2 | 0 | 465 (99.6%) |
| GvHD | 16 | 105 | 8 | 3 | 0 | 132 (79.5%) | |
| Infection | 44 | 24 | 101 | 12 | 5 | 186 (54.3%) | |
| Organ Failure | 45 | 31 | 27 | 105 | 12 | 220 (47.8%) | |
| Other | 25 | 13 | 16 | 21 | 38 | 113 (33.6%) | |
| N (%) Agreement | 593 (78.1%) | 173 (60.7%) | 152 (66.4%) | 143 (73.4%) | 55 (69.1%) | 1116 | |
| Consensus Panel – Cohort 2 | N (%) Agreement | ||||||
|---|---|---|---|---|---|---|---|
| Primary Cause- specific death |
Disease | GvHD | Infection | Organ Failure | Other | ||
| Transplant Center | Disease | 162 | 0 | 1 | 1 | 0 | 164 (98.8%) |
| GvHD | 3 | 38 | 0 | 0 | 1 | 42 (90.5%) | |
| Infection | 9 | 20 | 29 | 6 | 2 | 66 (43.9%) | |
| Organ Failure | 13 | 13 | 8 | 16 | 8 | 58 (27.6%) | |
| Other | 14 | 4 | 1 | 7 | 12 | 38 (31.6%) | |
| N (%) Agreement | 201 (80.6%) | 75 (50.7%) | 39 (74.4%) | 30 (53.3%) | 23 (52.2%) | 368 | |
Figure 1. Visual Summary of the Proportion of Causes of Death in Each Category as Reported by the Transplant Center that were Reclassified by the Consensus Panel, for Both Cohorts.
Black boxes indicate >15% of deaths in that TRM category were reclassified
Boxes with dots indicate ≥5 but <15% of deaths in that TRM category were reclassified
White boxes indicate <5% of deaths in that TRM category were reclassified
Grey boxes indicate the percent agreement of the Consensus Panel with the Transplant Center
Similarly, Cohort 2 had an overall agreement of 70% (AC1=0.64, 95% CI 0.58-0.69), with 68% of Other and 72% of Organ Failure deaths reported by the transplant center categorized by the consensus panel into a different category (Table 3), and again the consensus panel agreed with 91-99% of transplant center cause-specific deaths for GvHD and Disease. Additional agreement statistics are summarized in Supplementary Tables 2 and 3.
All Contributing Causes of Death
Table 4 summarizes the transplant center/consensus panel agreement for all cause-specific deaths for Cohorts 1 and 2. Agreement was highest for “Disease, with or without another cause”, followed by “GvHD and Infection, without or without another cause”, “GvHD without infection, with or without another cause”, “Infection without GvHD, with or without another cause” and “Organ Failure without GvHD without Infection, with or without another cause”. Between the 2 cohorts, the consensus panel adjudicated 189 deaths to GvHD and Infection (+/− another contributing causes), 181 deaths to Infection without GvHD (+/− another contributing causes), and 138 deaths to GvHD without Infection (+/− another contributing causes). These 3 clinical scenarios occurred with similar frequencies. Results between the two cohorts were consistent, despite adjudication schedule differences, inclusion of 8/8 HLA matching in Cohort 2, different years of HCT and different patient characteristics (Table 2).
Table 4.
Distribution of the Most Common Combinations of Causes of Death Reported by the Transplant Center vs. Adjudicated by the Consensus Panel
| Consensus Panel – Cohort 1 |
N (%) Agreement | |||||||
|---|---|---|---|---|---|---|---|---|
| All Causes of Death | Disease +/− Other |
GvHD +/− Other (No Infection) |
GvHD + Infection +/− Other |
Infection +/− Other (No GvHD) |
Organ Failure +/− Other (No GvHD, No Infection) |
Other (No GvHD, Infection, Organ Failure) |
||
|
| ||||||||
| Transplant Center | Disease +/−Other | 482 | 0 | 0 | 1 | 2 | 1 | 486 (99%) |
| GvHD +/−Other (No Infection) |
14 | 88 | 25 | 1 |
6 | 1 | 135 (65%) | |
| GvHD + Infection +/− Other | 7 | 4 | 51 | 1 | 0 | 1 | 64 (80%) | |
|
| ||||||||
| Infection +/− Other (No GvHD) |
36 | 2 | 40 | 102 | 10 | 2 | 192 (53%) | |
|
| ||||||||
| Organ Failure +/− Other (No GvHD, No Infection) |
38 | 10 | 10 | 26 | 84 | 3 | 171 (49%) | |
| Other (No GvHD, Infection or Organ Failure) |
16 | 7 | 4 | 9 | 11 | 21 | 68 (31%) | |
| N (%) Agreement | 593 (81%) | 111 (79%) | 130 (39%) | 140 (73%) | 113 (74%) | 29 (72%) | 1116 | |
| Consensus Panel – Cohort 2 |
N (%) Agreement |
|||||||
|---|---|---|---|---|---|---|---|---|
| All Causes of Death | Disease +/− Other |
GvHD +/− Other (No Infection) |
GvHD + Infection +/− Other |
Infection +/− Other (No GvHD) |
Organ Failure +/− Other (No GvHD, No Infection) |
Other (No GvHD, Infection, Organ Failure) |
||
| Transplant Center | Disease +/−Other | 170 | 0 | 0 | 1 | 1 | 0 | 172 (99%) |
| GvHD +/−Other (No Infection) |
3 | 20 | 14 | 0 |
0 | 0 | 37 (54%) | |
| GvHD + Infection +/− Other | 2 | 1 | 20 | 1 | 0 | 0 | 24 (83%) | |
|
| ||||||||
| Infection +/− Other (No GvHD) |
8 | 3 | 21 | 31 | 2 | 2 | 67 (46%) | |
|
| ||||||||
| Organ Failure +/− Other (No GvHD, No Infection) |
11 | 2 | 2 | 7 | 14 | 7 | 43 (33%) | |
|
| ||||||||
| Other (No GvHD, Infection or Organ Failure) |
7 | 1 | 2 | 1 | 3 | 11 | 25 (44%) | |
|
| ||||||||
| N (%) Agreement | 201 (85%) | 27 (74%) | 59 (34%) | 41 (76%) | 20 (70%) | 20 (55%) | 368 | |
Clinical Scenarios
Supplementary Tables 4 and 5 detail examples of the adjudication results where the transplant center and consensus panel disagreed and agreed, respectively. These examples illustrate the common and rare scenarios that were adjudicated and provide more information about the decisions made and factors considered during the adjudication.
Factors Associated with Transplant Center and Consensus Panel Agreement
The multivariate models indicated that both the transplant centers and consensus panel coded the causes of death based on the patient’s disease (AML, ALL or MDS), hence the proportion of transplant center/consensus panel agreement did not vary by disease. However, the consensus panel relied on disease status (in remission or not) to adjudicate cause-specific deaths, whereas this was not consistently done by the transplant centers. This difference, in combination with the consistent application of cause-specific death definitions by the consensus panel but not transplant centers, accounted for most of the transplant center/consensus panel discordance. In addition, the transplant center cause-specific deathsdemonstrated a cohort effect: agreement between the transplant center and consensus panel was lower earlier in the cohort (HCT in 2000-2003) compared to later (HCT in 2009-2011).
Assessment of Transplant Center and Consensus Panel Agreement among Early Deaths
Table 5 summarizes the results of a sensitivity analysis of transplant center/consensus panel agreement for both cohorts on early deaths before day +100 post-URD-HCT since these were the most difficult to adjudicate. This analysis demonstrates how transplant center/consensus panel agreement is associated with disease status pre-URD-HCT. Agreement was highest for early disease status AML/ALL deaths before day +30 post-HCT (81%), followed by early or intermediate disease AML/ALL deaths before day +100 post-HCT (73-74%). Agreement was lowest among early disease risk MDS deaths before day +30 post-HCT (18%).
Table 5.
Sensitivity Analysis of Agreement between Reported and Adjudicated Cause-Specific Death < 100 Days after URD-HCT
| Timing of Death Occurrence After URD-HCT |
Disease Risk | Rate of Agreement between TC & CP |
Summary of Consensus Panel Changes |
|---|---|---|---|
| AML & ALL patients, Cohorts 1 & 2 | |||
| Day 0-30, N=121 deaths (4%) of 2884 patients | |||
| Early | 26/32 (81%) | 6 cases were recoded as a different TRM category | |
| Intermediate | 13/24 (54%) | 11 cases were recoded as a different TRM category | |
| Advanced | 32/65 (49%) | 6 cases were recoded as a different TRM category 27 cases were recoded from TRM to DRM |
|
| Day 0-100, N=478 deaths (17%) of 2884 patients | |||
| Early | 110/150 (73%) | 38 cases were recoded as a different TRM category 3 cases were recoded from TRM to DRM |
|
| Intermediate | 80/108 (74%) | 25 cases were recoded as a different TRM category 3 cases were recoded from TRM to DRM |
|
| Advanced | 141/220 (64%) | 33 cases were recoded as a different TRM category 46 cases were recoded from TRM to DRM |
|
| MDS patients, Cohorts 1 & 2 | |||
| Day 0-30, N=21 deaths (3%) of 648 patients | |||
| Early | 2/11 (18%) | 6 cases were recoded as a different TRM category 3 cases were recoded from TRM to DRM |
|
| Advanced | 6/10 (60%) | 1 case was recoded as a different TRM category 1 case were recoded from DRM to TRM 2 cases were recoded from TRM to DRM |
|
| Day 0-100, N=122 deaths (19%) of 648 patients | |||
| Early | 39/63 (62%) | 20 cases were recoded as a different TRM category 4 cases were recoded from TRM to DRM |
|
| Advanced | 35/59 (59%) | 13 cases were recoded as a different TRM category 9 cases were recoded from DRM to TRM 2 cases were recoded from TRM to DRM |
|
Legend: AML: Acute Myeloid Leukemia, ALL: Acute lymphoblastic Leukemia, MDS: Myelodysplastic Syndrome, URD-HCT: unrelated donor allogeneic hematopoietic cell transplantation, TC: Transplant Center, CP: Consensus Panel
AML+ALL Disease Risk = Early: 1st complete remission, Intermediate: ≥2nd complete remission, Advanced: Not in complete remission at URD-HCT
MDS Disease Risk = Early: Refractory anemia with or without ringed sideroblasts or not otherwise specified, Advanced: all other MDS subtypes
Recommendations for Future Adjudication Panels
Based on our findings, the following groups should be prioritized for future studies needing adjudication of cause-specific death: all MDS patients who died before day +100, and all patients regardless of disease or disease status whose cause of death was reported as “Other” or “Organ Failure”. While the rate of agreement was higher for AML/ALL patients in remission pre-HCT, and for deaths reported as due to “GVHD” or “Infection”, there was still about a 20-27% discordance between reported and adjudicated death. This rate of discordance is enough to bias analyses of cause-specific mortality, and thus these groups are recommended for adjudication as well. Based on our report of 99% agreement between reported and adjudicated death, cause of death reported as recurrence or relapse of disease does not need to be adjudicated. This represented 42% of all deaths in our study population.
DISCUSSION
Application of our algorithm to future studies should focus efforts on adjudication of cases reported as TRM, particularly Infection, Organ Failure or Other, as well as AML and ALL patients who were not in remission at the time of HCT and all MDS patients. The current CIBMTR guidance provided to transplant centers for reporting causes of death uses the Centers for Disease Control and Prevention and World Health Organization definition for the underlying cause of death, defined as “the disease or injury that initiated the chain of events that led directly or inevitably to death”.24-26 However, many transplant centers report the last event in a cascade of multiple events. We defined the cause-specific death categories based on the CIBMTR guidance, our clinical experience, prior publications of similar studies and our intent to study genetic contributions to important, common causes of death. Other investigators studying the risk factors for these outcomes, or evaluating a multi-state modelling approach to define the risk of one phenotype (eg, infection) after developing another (eg, GvHD) could attribute causes of death with different definitions, prioritization or hierarchy.
Having precise categories of cause-specific death with pre-specified definitions allowed the consensus to categorize most of the Other cause-specific death cases into more specific categories (GvHD, Infection, Organ Failure, Disease). The ability to attribute more than one cause-specific death to each case by incorporating both timing and severity of each outcome, allowed better separation of cases into single vs multi-factorial causes of death. For patients with relapsed disease, the primary cause-specific death is defined as Disease, even if patients died with complications including infection, organ failure, GvHD. This practice is not uniformly applied in clinical research and varied over time in the CIBMTR cohort. This lack of uniformity between centers and over time could be due to changes in data manager education, the real-time reporting of individual cases by >150 transplant centers over a 12 year period, changes to the CIBMTR forms over time, availability of medical records (before/after EMRs) or a lack of standards for how to select and prioritize cause-specific deaths when multiple competing serious events occur.
Our consensus panel methodically adjudicated all deaths in the DISCOVeRY-BMT study with internal and external validity controls.9 There were no validation inconsistencies using short (2-day) or prolonged (8-month) review. Compact and prolonged schedules represent feasible designs for future studies, as long as quality assurance measures are used. This study was possible because the investigator team understood the critical importance of the review process and was dedicated to accuracy and consistency.
This approach to defining cause-specific death will enable us to perform a reproducible and accurate evaluation of genetic variants associated with TRM. We evaluated the potential impact of endpoint misclassification on our future GWAS study (DISCOVeRY-BMT) using the methods described by Wray, et al7 in the context of our adjudication results. Since 20% of TRM cases reported by the transplant center were reclassified as DRM, the estimate of total genetic variance explained by SNPs for TRM and DRM would decrease by 25-33% depending on the extent to which genetic variants contribute to TRM. Therefore, if we had not performed the adjudication, the total amount of true genetic variance contributing to these clinical endpoints after URD-HCT would have been difficult to assess and the results would have been subject to confounding by an increased false positive discovery rate.
Our findings are applicable to any GWAS investigating competing events, such as cancer-specific mortality vs treatment-related mortality, especially when multiple centers contribute data. Adjudication of the primary endpoint in observational research helps to eliminate potential cohort effects, reduces outcome heterogeneity and increases the likelihood of discovering true genetic associations. Our detailed and easy to use outcome algorithm (Table 1) provides the standards and definitions necessary to yield more consistent and accurate reporting of cause-specific death by participating CIBMTR centers.
Supplementary Material
Highlights.
Clinical scenarios/definitions of cause-specific death after URD-HCT are presented
High rate of agreement between reported and adjudicated disease-related deaths
Discordance of 20% between reported and adjudicated transplant-related deaths
Discordance varied among GVHD, Infection, Organ Failure and Other causes of death
ACKNOWLEDGEMENTS
John P. Klein, PhD, Department of Biostatistics, Medical College of Wisconsin, provided guidance on study design and statistical methodology. This study was supported by a grant from the National Heart, Lung, and Blood Institute (R01 HL102278) to Drs. Hahn and Sucheston-Campbell. The Bioinformatics Core at Roswell Park Cancer Institute is partially funded by a grant from the National Cancer Institute (P30 CA016056). The Center for International Blood and Marrow Transplant Research is supported by Public Health Service Grant/Cooperative Agreement U24-CA076518 from the National Cancer Institute (NCI), the National Heart, Lung and Blood Institute (NHLBI) and the National Institute of Allergy and Infectious Diseases (NIAID); a Grant/Cooperative Agreement 5U10HL069294 from NHLBI and NCI; a contract HHSH250201200016C with Health Resources and Services Administration (HRSA/DHHS); two Grants N00014-12-1-0142 and N00014-13-1-0039 from the Office of Naval Research; and grants from *Actinium Pharmaceuticals; Allos Therapeutics, Inc.; *Amgen, Inc.; Anonymous donation to the Medical College of Wisconsin; Ariad; Be the Match Foundation; *Blue Cross and Blue Shield Association; *Celgene Corporation; Chimerix, Inc.; Fred Hutchinson Cancer Research Center; Fresenius-Biotech North America, Inc.; *Gamida Cell Teva Joint Venture Ltd.; Genentech, Inc.;*Gentium SpA; Genzyme Corporation; GlaxoSmithKline; Health Research, Inc. Roswell Park Cancer Institute; HistoGenetics, Inc.; Incyte Corporation; Jeff Gordon Children’s Foundation; Kiadis Pharma; The Leukemia & Lymphoma Society; Medac GmbH; The Medical College of Wisconsin; Merck & Co, Inc.; Millennium: The Takeda Oncology Co.; *Milliman USA, Inc.; *Miltenyi Biotec, Inc.; National Marrow Donor Program; Onyx Pharmaceuticals; Optum Healthcare Solutions, Inc.; Osiris Therapeutics, Inc.; Otsuka America Pharmaceutical, Inc.; Perkin Elmer, Inc.; *Remedy Informatics; *Sanofi US; Seattle Genetics; Sigma-Tau Pharmaceuticals; Soligenix, Inc.; St. Baldrick’s Foundation; StemCyte, A Global Cord Blood Therapeutics Co.; Stemsoft Software, Inc.; Swedish Orphan Biovitrum; *Tarix Pharmaceuticals; *TerumoBCT; *Teva Neuroscience, Inc.; *THERAKOS, Inc.; University of Minnesota; University of Utah; and *Wellpoint, Inc. The views expressed in this article do not reflect the official policy or position of the National Institute of Health, the Department of the Navy, the Department of Defense, Health Resources and Services Administration (HRSA) or any other agency of the U.S. Government. *Corporate Members
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
This study was previously presented at an oral session during the annual Tandem BMT meeting in February 2014 and was published as an abstract in Biol Blood Marrow Transplant 20: S35, 2014 (abstract #20).
Conflict-of-Interest
The co-authors have no conflicts of interest to disclose.
REFERENCES
- 1).Majhail NS, Chitphakdithai P, Logan B, et al. Significant improvement in survival after unrelated donor hematopoietic cell transplantation in the recent era. Biol Blood Marrow Transplant. 2015;21:142–150. doi: 10.1016/j.bbmt.2014.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2).Walovitch R, Yao B, Chokron P, Le H, Bubley G. Subjective endpoints in clinical trials: the case for blinded independent central review. J Clin Trials. 2013;5:111–17. [Google Scholar]
- 3).Copelan E, Casper JT, Carter SL, et al. A scheme for defining cause of death and its application in the T cell depletion trial. Biol Blood Marrow Transplant. 2007;13:1469–76. doi: 10.1016/j.bbmt.2007.08.047. [DOI] [PubMed] [Google Scholar]
- 4).Gratwohl A, Brand R, Frassoni F, et al. Cause of death after allogeneic haematopoietic stem cell transplantation (HSCT) in early leukaemias: an EBMT analysis of lethal infectious complications and changes over calendar time. Bone Marrow Transplant. 2005;36:757–769. doi: 10.1038/sj.bmt.1705140. [DOI] [PubMed] [Google Scholar]
- 5).Wingard JR, Majhail NS, Brazauskas R, et al. Long-term survival and late deaths after allogeneic hematopoietic cell transplantation. J Clin Oncol. 2011;29:2230–39. doi: 10.1200/JCO.2010.33.7212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6).Martin PJ, Counts GW, Appelbaum FR, et al. Life expectancy in patients surviving more than 5 years after hematopoietic cell transplantation. J Clin Oncol. 2010;28:1011–16. doi: 10.1200/JCO.2009.25.6693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7).Wray NR, Lee SH, Kendler KS. Impact of diagnostic misclassification on estimation of genetic correlations using genome-wide genotypes. Eur J Hum Gen. 2012;20:668–674. doi: 10.1038/ejhg.2011.257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8).Pasquini MC, Wang Z, Horowitz MM, Gale RP. 2010 report from the Center for International Blood and Marrow Transplant Research (CIBMTR): current uses and outcomes of hematopoietic cell transplants for blood and bone marrow disorders. Clin Transpl. 2010;210:87–105. [PubMed] [Google Scholar]
- 9).Dechartres A, Boutron I, Roy C, Ravaud P. Inadequate planning and reporting of adjudication committees in clinical trials: recommendation proposal. J Clin Epidemiol. 2009;62:695–702. doi: 10.1016/j.jclinepi.2008.09.011. [DOI] [PubMed] [Google Scholar]
- 10).Gwet KL. Inter-Rater Reliability: Dependency on Trait Prevalence and Marginal Homogeneity. Stat Methods IRR Assessment. 2002;2:1–9. [Google Scholar]
- 11).Gwet KL. Computing inter-rater reliability and its variance in the presence of high agreement. Br J Math Stat Psychol. 2008;61:29–48. doi: 10.1348/000711006X126600. [DOI] [PubMed] [Google Scholar]
- 12).Banerjee M, Capozzoli M, McSweeney L, Sinha D. Beyond Kappa: A Review of Interrater Agreement Measures. Canadian J Stat. 1999;27:3–23. [Google Scholar]
- 13).Sim J, Wright CC. The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005;85:257–68. [PubMed] [Google Scholar]
- 14).Hoehler FK. Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol. 2000;53:499–503. doi: 10.1016/s0895-4356(99)00174-2. [DOI] [PubMed] [Google Scholar]
- 15).Wongpakaran N, Wongpakaran T, Wedding D, Gwet KL. A comparison of Cohen’s Kappa and Gwet’s AC1 when calculating inter-rater reliability coefficients: a study conducted with personality disorder samples. BMC Med Res Methodol. 2013;13:61. doi: 10.1186/1471-2288-13-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16).Ludbrook J. Statistical techniques for comparing measurers and methods of measurement: a critical review. Clin Exp Pharmacol Physiol. 2002;29:527–36. doi: 10.1046/j.1440-1681.2002.03686.x. [DOI] [PubMed] [Google Scholar]
- 17).Efron B. Better bootstrap confidence intervals (with discussions) J Amer Stat Assoc. 1987;82:171–200. [Google Scholar]
- 18).Hoehler FK. Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol. 2000;53:499–503. doi: 10.1016/s0895-4356(99)00174-2. [DOI] [PubMed] [Google Scholar]
- 19).Cicchetti DV, Feinstein AR. High agreement but low kappa, II: resolving the paradoxes. J Clin Epidemiol. 1990;43:551–558. doi: 10.1016/0895-4356(90)90159-m. [DOI] [PubMed] [Google Scholar]
- 20).Agresti A. Series in probability and statistics. 3rd ed Wiley; Hoboken, NJ: 2013. Categorical data analysis; p. 714. [Google Scholar]
- 21).Agresti A, Lang JB. Quasi-symmetric latent class models, with application to rater agreement. Biometrics. 1993;49:131–9. [PubMed] [Google Scholar]
- 22).Gottard A, Marchetti GM, Agresti A. Quasi-Symmetric Graphical Log-Linear Models. Scand J Stat. 2011;38:447–465. [Google Scholar]
- 23).R Core Development Team . R: A language and environment for statistical computing. R Foundation for Statistical Computing; Vienna, Austria: 2012. [Google Scholar]
- 24). [Accessed 2/27/15];Instructions for Recipient Death Data (Form 2900) http://www.cibmtr.org/DataManagement/TrainingReference/Manuals/DataManagement/Documents/CIBMTR%20Recipient%20Death%20Data%20Form%202900.pdf.
- 25).CDC [Accessed 2/27/15];Multiple Causes of Death 1999-2013. http://wonder.cdc.gov/wonder/help/mcd.html.
- 26).World Health Organization [Accessed 2/27/15];Health Topics: Mortality. http://www.who.int/topics/mortality/en/
- 27). [Accessed 7/11/14];Use and Interpretation of the WHO and CDC growth charts for children from birth to 20 years in the United States. http://www.cdc.gov/nccdphp/dnpa/growthcharts/resources/growthchart.pdf.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

