Skip to main content
eLife logoLink to eLife
. 2017 Jun 27;6:e25306. doi: 10.7554/eLife.25306

Replication Study: Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia

Xiaochuan Shan 1, Juan Jose Fung 2, Alan Kosaka 2, Gwenn Danet-Desnoyers 1; Reproducibility Project: Cancer Biology*
Editor: Karen Adelman3
PMCID: PMC5487217  PMID: 28653617

Abstract

In 2015, as part of the Reproducibility Project: Cancer Biology, we published a Registered Report (Fung et al., 2015), that described how we intended to replicate selected experiments from the paper "Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia" (Dawson et al., 2011). Here, we report the results of those experiments. We found treatment of MLL-fusion leukaemia cells (MV4;11 cell line) with the BET bromodomain inhibitor I-BET151 resulted in selective growth inhibition, whereas treatment of leukaemia cells harboring a different oncogenic driver (K-562 cell line) did not result in selective growth inhibition; this is similar to the findings reported in the original study (Figure 2A and Supplementary Figure 11A,B; Dawson et al., 2011). Further, I-BET151 resulted in a statistically significant decrease in BCL2 expression in MV4;11 cells, but not in K-562 cells; again this is similar to the findings reported in the original study (Figure 3D; Dawson et al., 2011). We did not find a statistically significant difference in survival when testing I-BET151 efficacy in a disseminated xenograft MLL mouse model, whereas the original study reported increased survival in I-BET151 treated mice compared to vehicle control (Figure 4B,D; Dawson et al., 2011). Differences between the original study and this replication attempt, such as different conditioning regimens and I-BET151 doses, are factors that might have influenced the outcome. We also found I-BET151 treatment resulted in a lower median disease burden compared to vehicle control in all tissues analyzed, similar to the example reported in the original study (Supplementary Figure 16A; Dawson et al., 2011). Finally, we report meta-analyses for each result.

DOI: http://dx.doi.org/10.7554/eLife.25306.001

Research Organism: Human, Mouse

Introduction

The Reproducibility Project: Cancer Biology (RP:CB) is a collaboration between the Center for Open Science and Science Exchange that seeks to address concerns about reproducibility in scientific research by conducting replications of selected experiments from a number of high-profile papers in the field of cancer biology (Errington et al., 2014). For each of these papers a Registered Report detailing the proposed experimental designs and protocols for the replications was peer reviewed and published prior to data collection. The present paper is a Replication Study that reports the results of the replication experiments detailed in the Registered Report (Fung et al., 2015), for a paper by Dawson et al., and uses a number of approaches to compare the outcomes of the original experiments and the replications.

In 2011, Dawson et al. demonstrated targeting BET proteins as an effective strategy for modulating aberrant gene expression associated with mixed-lineage leukaemia (MLL), a genetically distinct form of acute leukaemia. Selective inhibition of cell viability and BCL2 expression in leukaemia cells harboring MLL fusions was observed with the BET bromodomain inhibitor, I-BET151, in contrast to leukaemia cells with alternate oncogenic drivers (Dawson et al., 2011). Furthermore, efficacy of I-BET151 was tested in a xenograft model of MLL, which resulted in a statistically significant increase in survival compared to vehicle control treated animals.

The Registered Report for the paper by Dawson et al. described the experiments to be replicated (Figures 2A, 3D, 4B and D, and Supplementary Figures 11A-B and 16A), and summarized the current evidence for these findings (Fung et al., 2015). Recent studies have investigated the efficacy of targeting BET bromodomains in other cancer types. Studies using structurally distinct BET inhibitors, JQ1 and OTX015, have reported these inhibitors to be highly active in various cell lines, mouse models, and primary patient samples of MLL and other types of AML (Chen et al., 2013; Coudé et al., 2015; Fiskus et al., 2014; Herrmann et al., 2013; Mertz et al., 2011; Zuber et al., 2011). Furthermore, I-BET151 was reported to be active against AML with mutations involving the nucleophosmin (NPM1) gene (NPM1c AML) (Dawson et al., 2014). Additional studies examining the antitumor effects of BET bromodomain inhibition include other cancer types, such as gastric cancer (Montenegro et al., 2016), childhood sarcoma (Bid et al., 2016), triple negative breast cancer (da Motta et al., 2017; Shu et al., 2016), and ovarian cancer (Zhang et al., 2016). Additionally, evidence indicating potential acquired resistance to BET inhibitors has been reported (Fong et al., 2015; Kumar et al., 2015; Rathert et al., 2015), along with studies which have found that treatment with combinatorial drugs can mitigate acquired resistance (Asangani et al., 2016; Kurimchak et al., 2016; Yao et al., 2015). Furthermore, a recent study reported synergistic effects of combinatorial treatment with a disruptor of telomeric silencing 1-like (DOT1L) inhibitor and I-BET151 in MLL cells and mouse leukaemia models (Gilan et al., 2016). Studies have also addressed the nonclinical safety of BET inhibition and found that BET bromodomain inhibition can impact normal intestinal homeostasis and repair (Nakagawa et al., 2016). There are several variations of I-BET bromodomain inhibitors currently in clinical trials (Chaidos et al., 2015; French, 2016; Wadhwa and Nicolaides, 2016). A phase I trial to determine the recommended dose of BET inhibitor OTX015/MK-8628 reported that although the inhibitor was found to be tolerated, toxic effects such as thrombocytopenia were commonly observed (Amorim et al., 2016). An additional clinical study testing BET inhibition with OTX015/MK-8628 in four patients with advanced NUT midline carcinoma, with confirmed BRD4-NUT fusions, found early clinical benefits reported for two patients with one achieving post-treatment disease stabilization (Stathis et al., 2016).

The outcome measures reported in this Replication Study will be aggregated with those from the other Replication Studies to create a dataset that will be examined to provide evidence about reproducibility of cancer biology research, and to identify factors that influence reproducibility more generally.

Results and discussion

Evaluating the selective inhibition of a MLL-fusion leukaemic cell line with I-BET151

Using the same BET bromodomain inhibitor as Dawson et al., we aimed to independently replicate an experiment analyzing the ability of I-BET151 to selectively inhibit the growth of the MLL-fusion leukaemic cell line MV4;11 (MLL-AF4 fusion). Similar to the original study, the K-562 leukaemic cell line, which is oncogenically driven by tyrosine kinase activation, served as a negative control. This experiment is comparable to what was reported in Figure 2A and Supplementary Figure 11A-B of Dawson et al. (2011) and described in Protocols 1–2 in the Registered Report (Fung et al., 2015). While the original study included a variety of human leukaemia cell lines, this replication attempt was restricted to the MV4;11 and K-562 cell lines, which were utilized in further experiments. For each cell line, cellular IC50 values were determined (Figure 1, Figure 1—figure supplement 1, Figure 1—figure supplement 2) with a mean absolute IC50 value of 9.03 nM [n = 5, SD = 8.27 nM] for I-BET151 treated MV4;11 cells, while the K-562 IC50 estimates were not able to be accurately determined following published guidelines (Sebaugh, 2011). This compares to the original study that reported an IC50 value of 26 nM for MV4;11 cells, with the K-562 IC50 value also unable to be accurately determined (Dawson et al., 2011).

Figure 1. Cellular I-BET151 dose response curves in MLL-fusion leukaemia and non-MLL-fusion leukaemia cell lines.

Cell viability assays were performed for a MLL-fusion leukaemia cell line (MV4;11) and a non-MLL-fusion leukaemia cell line (K-562) with the I-BET151 inhibitor. Curves and absolute IC50 values were determined for each biological repeat. (A) Dose response curves for each biological repeat [n = 5] for the MV4;11 cell line. Percent cell viability is relative to DMSO treated cells. I-BET151 doses range from 1 pM to 1 mM. Exploratory one-sample t-test of IC50 values compared to a constant of 100 µM; t(4) = 20.63, p=3.26×10−5; Cohen’s d = 9.23, 95% CI [3.13, 15.45]. (B) The mean absolute IC50 value for MV4;11 cells and 95% confidence interval [n = 5] for this replication attempt is plotted with the IC50 value reported in Dawson et al. (2011) displayed as a single point (red circle) for comparison. (C) Dose response curves for each biological repeat [n = 4] for the K-562 cell line. Percent cell viability is relative to DMSO treated cells with I-BET151 dose range from 1 pM to 1 mM. The IC50 estimates were not able to be accurately determined following published guidelines (Sebaugh, 2011). Additional details for this experiment can be found at https://osf.io/zm3j4/.

DOI: http://dx.doi.org/10.7554/eLife.25306.002

Figure 1.

Figure 1—figure supplement 1. Cellular I-BET151 dose response curves for each MV4;11 biological repeat.

Figure 1—figure supplement 1.

This is the same experiment as in Figure 1. The dose response curve of each biological repeat for the MV4;11 cell line plotted individually with the absolute IC50 value listed. Additional details for this experiment can be found at https://osf.io/zm3j4/.
Figure 1—figure supplement 2. Cellular I-BET151 dose response curves for each K-562 biological repeat.

Figure 1—figure supplement 2.

This is the same experiment as in Figure 1. The dose response curve of each biological repeat for the K-562 cell line plotted individually. The IC50 estimates were not able to be accurately determined following published guidelines (Sebaugh, 2011), and are listed as greater than 100 µM, which is the highest dose before inhibition started to be observed. Additional details for this experiment can be found at https://osf.io/zm3j4/.

The analysis plan specified in the Registered Report (Fung et al., 2015) proposed to compare the IC50 values from MV4;11 cells to those determined for K-562 cells. However, as stated above this could not be performed because of the inability to determine the K-562 IC50 values. Instead, we performed an exploratory analysis using the MV4;11 IC50 values compared to a constant of 100 µM, the highest dose in K-562 cells before inhibition started to be observed, which was statistically significant (one-sample t-test; t(4) = 20.63, p=3.26×10−5; Cohen’s d = 9.23, 95% CI [3.13, 15.45]). To summarize, for this experiment we found results that were in the same direction as the original study and statistically significant where predicted.

Analysis of BCL2 gene expression following I-BET151 treatment

A key antiapoptotic gene, BCL2, which has been implicated in MLL-fusion leukaemias (Robinson et al., 2008; Wang et al., 2011), was reported in Dawson et al. (2011) to be selectively inhibited by I-BET151, suggesting a possible mode of action. We sought to replicate this experiment, which is similar to what was reported in Figure 3D of Dawson et al. (2011) and described in Protocol 3 in the Registered Report (Fung et al., 2015). While the original study included multiple leukaemia cell lines with an MLL-fusion, this replication attempt was restricted to the MV4;11 cell line, which was utilized in other experiments. Similar to the cellular IC50 experiment, the K-562 cell line served as a control for the specificity towards MLL-fusion leukaemias. We used quantitative real-time-polymerase chain reaction (qRT-PCR) to assess BCL2 expression in MV4;11 and K-562 cells treated with I-BET151 or vehicle control (Figure 2). Using the 2-∆∆Ct method, treatment of MV4;11 cells with I-BET151 resulted in a 0.501 [n = 3, SD = 0.048] mean fold change in BCL2 expression relative to vehicle control, while K-562 cells remained largely unchanged [n = 3, M = 1.06, SD = 0.028]. This is similar to Dawson et al. (2011), which reported an ~0.22 mean fold change in BCL2 expression for MV4;11 cells and an ~0.94 mean fold change for the K-562 cell line.

Figure 2. BCL2 expression in I-BET151 treated MV4;11 and K-562 cells.

Figure 2.

MV4;11 and K-562 cells were treated with 500 nM I-BET151, or an equivalent volume of DMSO. Total RNA was isolated 6 hr after treatment and qRT-PCR was used to detect BCL2 and B2M expression. Fold change in BCL2 expression normalized to B2M and relative to DMSO is presented for I-BET151 treated MV4;11 and K-562 cells. Expression level of BCL2 in DMSO was assigned a value of 1. Means reported and error bars represent SD from three independent biological repeats. Two-sample t-test comparing fold gene expression values from MV4;11 cells to K-562 cells; t(4) = 17.23, uncorrected p=6.66×10−5, a priori Bonferroni adjusted significance threshold = 0.0167; (Bonferroni corrected p=0.0004). One-sample t-test comparing fold gene expression from K-562 cells to a constant of 1 (DMSO treated cells); t(2) = 3.53, uncorrected p=0.0719, a priori Bonferroni adjusted significance threshold = 0.0167; (Bonferroni corrected p=0.216). One-sample t-test comparing fold gene expression from MV4;11 cells to a constant of 1 (DMSO treated cells); t(2) = 17.86, uncorrected p=0.003, a priori Bonferroni adjusted significance threshold = 0.0167; (Bonferroni corrected p=0.009). Additional details for this experiment can be found at https://osf.io/np6gq/.

DOI: http://dx.doi.org/10.7554/eLife.25306.005

To provide a direct comparison to the original data, we are reporting the analysis specified a priori in the Registered Report (Fung et al., 2015). We planned to conduct one two-sample t-test, between K-562 and MV4;11 fold gene expression values, and two one-sample t-tests, on K-562, or MV4;11, fold gene expression values compared to a constant of 1, which represents the DMSO treated gene expression. To account for these multiple comparisons, the Bonferroni correction was used, making the a priori Bonferroni adjusted significance threshold 0.0167. We performed an unpaired, two-sample t-test on the fold gene expression values from K-562 cells compared to MV4;11 cells, which was statistically significant (t(4) = 17.23, uncorrected p=6.66×10−5, corrected p=0.0004). Thus, the null hypothesis that I-BET151 treatment on BCL2 expression is similar for MV4;11 and K-562 cells can be rejected. Additionally, we performed a one-sample t-test on the fold gene expression values from K-562 cells compared to a constant of 1 (t(2) = 3.53, uncorrected p=0.072, corrected p=0.216), and a one-sample t-test on the fold gene expression values from MV4;11 cells compared to a constant of 1 (t(2) = 17.86, uncorrected p=0.003, corrected p=0.009). To summarize, for this experiment we found results that were statistically significant where predicted and in the same direction as the original study.

Generation of disseminated xenograft MLL mouse model and testing of I-BET151 compound in vivo

In order to test the efficacy of I-BET151 as a therapeutic agent in MLL-fusion leukaemia, we sought to replicate an experiment similar to what was reported in Figure 4B and D and Supplementary Figure 16A (Dawson et al., 2011) and described in Protocols 4–5 in the Registered Report (Fung et al., 2015). Prior to conducting this efficacy experiment, we determined the maximum tolerated dose (MTD) of I-BET151 in this xenograft mouse model, which imitates the pathology of human leukaemia by disseminating to the bone marrow (Lopes de Menezes et al., 2005). Five weeks after intravenous injection of MV4;11 cells into 7-week-old female NOD-SCID mice, the animals were successfully engrafted as determined by detectable human leukaemia cells in the peripheral blood. This was an increase from the 3 weeks reported in the original study, but was necessary due to the lack of detectable leukaemia cells in mice at 3 weeks post-injection during this replication attempt. Mice were randomized into treatment groups and after 21 days of daily intraperitoneal (IP) injections of vehicle control or three different doses of I-BET151, the MTD was determined as 20 mg/kg/day (Figure 3) based on the criteria pre-specified in the Registered Report (Fung et al., 2015). For comparison the original study delivered I-BET151 daily at 30 mg/kg. While it is not uncommon for variations in stock solution to cause differing compound potency and toxicity (Kannt and Wieland, 2016), a reduced dose could have an impact on I-BET151 efficacy. Indeed, recent work from authors of the original study reported I-BET151 administered at 10 mg/kg had no impact on survival in a MLL-AF9 leukaemia mouse model (Gilan et al., 2016), while I-BET151 administered at 15 mg/kg did provide a survival benefit in a mouse model of NPM1c AML, albeit with a smaller effect size (Dawson et al., 2014).

Figure 3. Maximal tolerated dose in an I-BET151 treated xenograft mouse model of MLL-fusion leukaemia.

Female NOD-SCID mice were conditioned with Busulfan and intravenously injected with 1 × 107 MV4;11 cells in vehicle (PBS). Once disease was established (detectable MV4;11 cells from retro-orbital bleeds), mice were randomized into four cohorts and received daily IP injections of vehicle control [n = 5], 10 mg/kg I-BET151 [n = 5], 20 mg/kg I-BET151 [n = 5], or 30 mg/kg I-BET151 [n = 5]. The animals were monitored and their weight was measured for the duration of the 21 day treatment. The Y axis represents mouse weight in grams, the X axis represents days since the start of treatment. Additional information can be found at https://osf.io/juzmg/.

DOI: http://dx.doi.org/10.7554/eLife.25306.006

Figure 3.

Figure 3—figure supplement 1. Maximal tolerated dose in an I-BET151 treated xenograft mouse model of MLL-fusion leukaemia.

Figure 3—figure supplement 1.

This is the same experiment as in Figure 3. Female NOD-SCID mice xenotransplanted with MV4;11 cells were randomly assigned to receive daily IP injections of treatment. At the time of sacrifice, disease burden was evaluated in peripheral blood (PB), bone marrow (BM), and spleen cells. Number of mice analyzed: n = 5 for vehicle group, n = 5 for I-BET151 10 mg/kg group, n = 5 for I-BET151 20 mg/kg, and n = 5 for I-BET151 30 mg/kg. (A) Dot plot representing the percent of MV4;11 as the percent of HLA-A,B,C+ cells determined by flow cytometry analyses of the total cell population. Medians reported as crossbars. (B) Dot plot representing the absolute number of MV4;11 cells as determined using CountBright absolute counting beads. Medians reported as crossbars. Additional details for this experiment can be found at https://osf.io/juzmg/.

Following the same procedure as the MTD protocol, another cohort of 7-week-old female NOD-SCID mice were injected with MV4;11 cells and monitored until engraftment was established, which also occurred 5 weeks after injection. Mice were treated daily with I-BET151 at 20 mg/kg or vehicle control by IP injection for up to 21 days or until mice were humanely euthanized at signs of distress or disease (e.g. hind limb paralysis). At the time of sacrifice, disease presence was determined by evaluating the percentage of human leukaemia cells present in the peripheral blood, spleen, and bone marrow (Figure 4B). Disease was defined, a priori, as 0.5% of human leukaemia cells over the total live nucleated cells, which included human and mouse cells, in the sample. After the start of treatment, mice treated with vehicle control achieved a median survival of 19.0 days [n = 11], which is indistinguishable from the 18.5 days [n = 12] observed in mice treated with I-BET151 (Figure 4A). A planned comparison between these survival distributions using a log-rank (Mantel-Cox) test was not statistically significant (p=0.536). This compares to the original study that reported a median survival of 14 days, [n = 5] for vehicle control treated animals that was lengthened to the experimental end-point of 21 days with I-BET151 treatment. While the median survival after the start of treatment in control treated mice are similar between these studies, the overall median survival from the MV4;11 injection are farther apart, with the original study reporting a median time of 35 days and this replication attempt reporting 59 days due to the added time from injection to beginning of treatment. Comparatively, other published studies using this model have reported a range of median survival times, such as 41 days (O'Farrell et al., 2003), 51 days (Dorn et al., 2009; Lopes de Menezes et al., 2005; Ma et al., 2015), and 60 days (Hardwicke et al., 2009) after intravenous injection of MV4;11 cells into NOD-SCID mice. To summarize, for this experiment we found the survival results were not in the same direction as the original study and not statistically significant where predicted.

Figure 4. Efficacy study of I-BET151 in xenograft mouse model of MLL-fusion leukaemia.

Female NOD-SCID mice were xenotransplanted with 1 × 107 MV4;11 cells after conditioning with Busulfan. Following establishment of disease (detectable MV4;11 cells from retro-orbital bleeds), mice were randomly assigned to receive daily IP injections of 20 mg/kg I-BET151 or vehicle control. (A) Kaplan-Meier plot of survival during the course of the study. Green arrowhead indicates when treatment commenced on day 40 with treatment continuing for the pre-specified period of 21 days. Animals with no detectable disease at the time of sacrifice (less than 0.5% of MV4;11 cells) or that were unable to be evaluated at the time of death were censored (denoted by a cross). Number of mice monitored: n = 11 for vehicle group and n = 12 for I-BET151 group. Log-rank (Mantel-Cox) test of I-BET151 treatment compared to vehicle control (p=0.536). (B–C) At the time of sacrifice, disease burden was evaluated in peripheral blood (PB), bone marrow (BM), and spleen cells. Number of mice analyzed: n = 8 for vehicle group and n = 10 for I-BET151 group. (B) The percent of MV4;11 cells was determined by flow cytometric analysis as the percent of HLA-A,B,C+ cells in the total nucleated population (both mouse and human cells). Dot plot with medians reported as crossbars. (C) The absolute number of MV4;11 cells each sample was also determined using CountBright absolute counting beads. Dot plot with medians reported as crossbars. Additional details for this experiment can be found at https://osf.io/jakpw/.

DOI: http://dx.doi.org/10.7554/eLife.25306.008

Figure 4.

Figure 4—figure supplement 1. Mouse body weight and cell death analysis in I-BET151 treated xenograft mouse model of MLL-fusion leukaemia.

Figure 4—figure supplement 1.

This is the same experiment as in Figure 4. Female NOD-SCID mice xenotransplanted with MV4;11 cells were randomly assigned to receive daily IP injections of 20 mg/kg I-BET151 or vehicle control after detection of disease. (A) During the course of treatment animals were monitored and their weight was measured twice a week over the 21 day treatment period. Number of mice monitored: n = 11 for vehicle group and n = 12 for I-BET151 group. (B–C) At the time of sacrifice, cell death (apoptotic and necrotic cells) was evaluated in peripheral blood (PB), bone marrow (BM), and spleen cells. Number of mice analyzed: PB: n = 4 for vehicle group and n = 7 for I-BET151 group; BM: n = 6 for vehicle group and n = 8 for I-BET151 group; Spleen: n = 6 for vehicle group and n = 8 for I-BET151 group. (B) The percent of MV4;11 cells that were apoptotic was determined by flow cytometric analysis as the percent of Annexin V+PI- cells in the HLA-A,B,C+ population. Dot plot with medians reported as crossbars. (C) The percent of MV4;11 cells that were necrotic was determined by flow cytometric analysis as the percent of Annexin V+PI+ cells in the HLA-A,B,C+ population. Dot plot with medians reported as crossbars. Additional details for this experiment can be found at https://osf.io/jakpw/.
Figure 4—figure supplement 2. Detection of HLA by immunohistochemistry in mouse tissues.

Figure 4—figure supplement 2.

This is from the same experiment as in Figure 4. Female NOD-SCID mice xenotransplanted with MV4;11 cells were randomly assigned to receive daily IP injections of 20 mg/kg I-BET151 or vehicle control after detection of disease. Tissues containing observed tumors were fixed, sectioned, and stained with an antibody against human leukocyte antigen (HLA-A,B,C) and counterstained with hematoxylin. (A) Positive control staining of human tonsil tissue that was processed in parallel with other tissue sections (magnification: 400X left, 100X right). Representative images of various tissues from MV4;11 engrafted mice who received vehicle treatment, which include (B) stomach (magnification: 400X left, 100X right), (C) gastrointestinal tract (magnification: 400X left, 100X right), (D) mesentery (magnification: 400X left, 20X right), (E) ovary (magnification: 400X left, 20X right), and (F) peritoneal cavity (magnification: 400X). Mouse tissues (e.g. gastrointestinal mucosa) were negatively stained. Additional details for this experiment can be found at https://osf.io/jakpw/.
Figure 4—figure supplement 3. Flow cytometry gating strategies.

Figure 4—figure supplement 3.

(A) Representative images of gating strategy to assess leukaemia burden in peripheral blood, spleen, and bone marrow cells. The FL-3 vs FSC plot was used to gate on CountBright absolute counting beads and total nucleated cells. From the total nucleated population, the HLA-A,B,C APC vs FSC plot was used to gate on HLA-A,B,C+ cells (human leukaemia cells). (B) Representative images of gating strategy to assess cell death in peripheral blood, spleen, and bone marrow cells. The HLA-A,B,C APC vs FSC plot was used to gate on human leukaemia cells. From the human leukaemia cell population, the PI vs Annexin V FITC plot was used to gate on apoptotic cells (Annexin V+PI population) and necrotic cells (Annexin V+PI+ population).

In addition to monitoring survival, disease burden was determined for vehicle control and I-BET151 treated mice. For the efficacy study, the median disease burden observed in the peripheral blood, bone marrow, and spleen were smaller in I-BET151 treated mice compared to the control mice (Figure 4B,C). This was also observed in the MTD study (Figure 3—figure supplement 1). For example, the median percent of leukaemia cells detected in the bone marrow of vehicle control mice was 10.7%, which was reduced to 5.3% in I-BET151 treated mice (Figure 4B). This compares to the representative example reported in Supplementary Figure 16A of Dawson et al. (2011), which reported 14% leukaemia burden in the bone marrow of the control mouse and 1.5% in the I-BET151 treated mouse where low-level residual leukaemia was reported after the 21 day treatment. To summarize, for the disease burden aspect of this experiment we found the results were in the same direction as the original study.

To assess if I-BET151 impacted cell death, specifically apoptosis and necrosis, flow cytometric analysis was also performed on the isolated cells from the vehicle control or I-BET151 treated mice. The degree of apoptosis and necrosis was assessed within the leukaemia population by staining with Annexin V and propidium iodide (PI). Observationally, there was no notable difference in the peripheral blood or spleen, with a minor decrease in both apoptotic (Annexin V+PI-) and necrotic cells (Annexin V+PI+) observed in the bone marrow of I-BET151 treated mice compared to vehicle control (Figure 4—figure supplement 1).

Unexpectedly, at the time of sacrifice, animal health issues were observed beyond the anticipated disease presentation for this model. These included gastrointestinal bloating in three of the I-BET151 treated mice and solid tumors detected in four of the vehicle control treated mice and two of the I-BET151 treated mice. While it has been previously noted that NOD-SCID mice have a high rate of spontaneous thymic lymphomas, no other tumor types were previously reported and tumors did not manifest until 20 weeks of age (Prochazka et al., 1992). To explore the origin of the observed tumors, sections were stained using a human leukocyte antigen (HLA-A,B,C) antibody (Figure 4—figure supplement 2). In all tested samples there was extensive staining, indicating the injected human leukaemia cells spread throughout the mouse outside the hematological tissues. This suggests these mice most likely succumbed to fulminant leukaemia manifesting often as extramedullary disease (EMD), a diagnosis that has been observed in a subset of patients with various types of AML (Ganzel et al., 2016). Indeed, the original study reported EMD in the majority of the control mice, in contrast to none of the I-BET151 treated mice (Dawson et al., 2011). Collectively, these observations raise the possibility that although a survival benefit was not observed with I-BET151 treatment in this replication attempt, despite an overall reduced disease burden, that health issues not directly related to leukaemia could have impacted mortality observed in some of the animals, particularly those treated with I-BET151. Additionally, busulfan conditioning could have influenced engraftment kinetics and disease progression in this animal model and potentially impacted the responsiveness of MV4;11 cells to I-BET151 treatment.

Meta-analyses of original and replicated effects

We performed a meta-analysis using a random-effects model, where possible, to combine each of the effects described above as pre-specified in the confirmatory analysis plan (Fung et al., 2015). To provide a standardized measure of the effect, a common effect size was calculated for each effect from the original and replication studies. Cohen’s d is the standardized difference between two independent means using the pooled sample standard deviation. For a one-sample test, Cohen’s d is the difference between the sample mean and the null value using the sample standard deviation. The hazard ratio (HR) is the ratio of the probability of a particular event, in this case death, in one group compared to the probability in another group. The estimate of the effect size of one study, as well as the associated uncertainty (i.e. confidence interval), compared to the effect size of the other study provides another approach to compare the original and replication results (Errington et al., 2014; Valentine et al., 2011). Importantly, the width of the confidence interval for each study is a reflection of not only the confidence level (e.g. 95%), but also variability of the sample (e.g. SD) and sample size.

A meta-analysis of the I-BET151 IC50 values in MV4;11 and K-562 cells was not conducted since the original study reported single IC50 values. Comparing the original and replication results, the IC50 value reported in Dawson et al. (2011) for MV4;11 cells fell outside the 95% CI of the values generated during this replication attempt (Figure 1B). In K-562 cells, both the original and replication studies could not accurately determine the IC50 estimates indicating K-562 cells are largely unaffected by I-BET151.

There were three comparisons made with the qRT-PCR data analyzing BCL2 expression following I-BET151 treatment (Figure 5A,B). For two of the comparisons, the fold BCL2 expression in each cell line was compared to a constant of 1, which represents the DMSO treated group (Figure 5A). The comparison of K-562 fold gene expression compared to a constant of 1 resulted in d = −2.04, 95% CI [−4.17, 0.13] in this study, whereas d = 1.30, 95% CI [−0.38, 2.88] for the data estimated a priori from Figure 3D in the original study (Dawson et al., 2011). A meta-analysis of these effects resulted in d = −0.307, 95% CI [−3.57,2.96], which was not statistically significant (p=0.854). The results are in opposition when considering the direction of the effect. The point estimate of the replication effect size was not within the confidence interval of the original result, nor was the point estimate of the original effect size within the confidence interval of the replication result. The comparison of MV4;11 fold gene expression compared to a constant of 1, resulted in d = 10.31, 95% CI [1.54, 19.86] in this replication attempt, d = 26.00, 95% CI [4.10, 48.25] for the data estimated from the original study (Dawson et al., 2011) and d = 15.13, 95% CI [0.94, 29.32] for a meta-analysis of the two effects. Both the original and replication results are consistent when considering the direction of the effect, with the point estimate of the replication effect size was within the confidence interval of the original result and vice versa. Further, the random effects meta-analysis resulted in a statistically significant effect (p=0.037). For the third comparison between MV4;11 and K-562 fold gene expression, this replication attempted resulted in d = 14.07, 95% CI [4.73, 23.59], which compares to d = 17.34, 95% CI [5.90, 29.04] for the data estimated from the original study (Dawson et al., 2011) (Figure 5B). A meta-analysis of these effects resulted in d = 15.37, 95% CI [5.99, 24.75], which was statistically significant (p=0.001). Additionally, both results are consistent when considering the direction of the effect with both effect size point estimates falling within the confidence interval of the other study.

Figure 5. Meta-analyses of each effect.

Figure 5.

Effect size and 95% confidence interval are presented for Dawson et al. (2011), this replication study (RP:CB), and a random effects meta-analysis of those two effects. Sample sizes used in Dawson et al. (2011) and this replication attempt are reported under the study name. (A) Fold BCL2 expression in K-562 cells compared to a constant of 1 (DMSO) (meta-analysis p=0.854), and fold BCL2 expression in MV4;11 cells compared to a constant of 1 (DMSO) (meta-analysis p=0.037). (B) Fold BCL2 expression in MV4;11 cells compared to the fold BCL2 expression in K-562 cells (meta-analysis p=0.001). (C) HR for mice treated daily with I-BET151 compared to vehicle control (meta-analysis p=0.220). Additional details for these meta-analyses can be found at https://osf.io/vfp47/.

DOI: http://dx.doi.org/10.7554/eLife.25306.012

The comparison of the overall survival distributions between mice treated with I-BET151 compared to those that were treated with vehicle control resulted in a HR of 1.44, 95% CI [0.45, 4.56] for this replication attempt compared to a HR of 10.32, 95% CI [1.51, 70.63] for the original study (Dawson et al., 2011). A meta-analysis (Figure 5C) of these effects resulted in a HR of 3.30, 95% CI [0.49,22.18], which was not statistically significant (p=0.220). Both results are consistent when considering the direction of the effect, however the point estimate of the replication effect size was not within the confidence interval of the original result, and vice versa. The large confidence intervals of the meta-analysis along with a statistically significant Cochran’s Q test (p=0.085), suggests heterogeneity between the original and replication studies.

This direct replication provides an opportunity to understand the present evidence of these effects. Any known differences, including reagents and protocol differences, were identified prior to conducting the experimental work and described in the Registered Report (Fung et al., 2015). However, this is limited to what was obtainable from the original paper and through communication with the original authors, which means there might be particular features of the original experimental protocol that could be critical, but unidentified. So while some aspects, such as cell lines, strain of mice, number of cells plated/injected, and specific BET inhibitor were maintained, others were changed at the time of study design (Fung et al., 2015) or in the execution of the replication. This includes differences such as the conditioning for xenotransplantation (sub-lethal irradiation to busulfan) and dose of I-BET151 used for the efficacy study (30 mg/kg to 20 mg/kg). The regimen used in this replication attempt has not been fully evaluated with MV4;11 cells regarding its impact on disease progression and treatment responsiveness, an important consideration, since the method of conditioning can influence overall and hematological toxicity, as well as overall survival, despite similar engraftment levels (Robert-Richard et al., 2006; Saland et al., 2015). Furthermore, other aspects were unknown or not easily controlled for. These include variables such as cell line genetic drift (Hughes et al., 2007; Kleensang et al., 2016), mouse sex (Clayton and Collins, 2014), circadian biological responses to therapy (Fu and Kettner, 2013), the microbiome of recipient mice (Macpherson and McCoy, 2015), housing temperature in mouse facilities (Kokolus et al., 2013), and differing compound potency resulting from different stock solutions (Kannt and Wieland, 2016) or from variation in cell division rates (Hafner et al., 2016). Whether these or other factors influence the outcomes of this study is open to hypothesizing and further investigation, which is facilitated by direct replications and transparent reporting.

Materials and methods

As described in the Registered Report (Fung et al., 2015), we attempted a replication of the experiments reported in Figures 2A, 3D, 4B and D, and Supplementary Figures 11A-B and 16A of Dawson et al. (2011). A detailed description of all protocols can be found in the Registered Report (Fung et al., 2015). Additional detailed experimental notes, data, and analysis are available on the Open Science Framework (OSF) (RRID:SCR_003238) (https://osf.io/hcqqy/; Shan et al., 2017).

Cell culture

MV4;11 (ATCC, cat# CRL-9591, RRID:CVCL_0064) and K-562 cells (ATCC, cat#CCL-243, RRID:CVCL_0004) were maintained in RPMI-1640 medium supplemented with 10% fetal bovine serum (FBS) and 1% penicillin/streptomycin at 37°C in a humidified atmosphere at 5% CO2. Quality control data for both the MV4;11 and K-562 cell lines related to the in vitro experiments are available at https://osf.io/vb9q4/ and quality control data for the MV4;11 cell line related to the in vivo experiments are available at https://osf.io/5tvwe/. This includes results confirming the cell lines were free of mycoplasma contamination and common mouse pathogens. Additionally, STR DNA profiling of the cell lines was performed and all cells were confirmed to be the indicated cell lines when queried against STR profile databases.

Therapeutic compounds

25 mg of I-BET151 (Sigma-Aldrich, cat# SML0666; lot# 033M4620V) was dissolved in 601.8 µl filtered dimethyl sulfoxide (DMSO) to generate a 0.1 M stock, which was stored at 4°C until use for in vitro experiments. For in vivo experiments, I-BET151 (lot# 084M4722V) was dissolved in DMSO to generate a 60 mg/ml stock. This was further diluted to the appropriate working solution with vehicle (10% wt/vol Kleptose HPB in 0.9 %/g NaCl, pH 5.0), filter sterilized, and aliquoted (21 total) into glass vials. Final concentration of DMSO in the working solution was 5% (v/v). Aliquots were stored at 4°C until use when aliquots were brought to room temperature before injection.

Cellular dose response assay

The doubling time of MV4;11 and K-562 cells were empirically determined by seeding at 4 × 104 cells per well into two 96-well plates. The day after seeding and another three days later an MTS cell proliferation assay (Promega CellTiter-Aqueous One, cat# G3582) was performed according to the manufacturer’s instructions with a 4 hr incubation at 37°C. Absorbance was read at 490 nm and the plate background was subtracted at 650 nm using a plate reader (Molecular Devices, SpectraMax 190) and Softmax Pro data acquisition and analysis software (RRID:SCR_014240), version 3.1.2. The doubling time of each cell line was calculated to be 48 hr and 36.2 hr for the MV4;11 and K-562 cell lines, respectively. Doubling time was calculated using the formula (doubling time = incubation time*ln(2)/ln(second plate average reading/first plate average reading)).

For cellular dose response, MV4;11 and K-562 cells were seeded at 4 × 104 cells per well (non-edge wells) into a 96-well plate and incubated overnight. Cells were treated with serial dilutions of I-BET151 to yield 10 dilutions ranging from 0.001 nM to 1 mM, which differs from the conditions listed in the Registered Report due to solubility limits. I-BET151 doses from 0.001 nM to 100 µM were diluted with medium to give a final DMSO concentration of 0.1%, while the 1 mM concentration gave a final DMSO concentration of 1% due to drug solubility. Both concentrations of DMSO were used to treat control cells in addition to medium alone wells that were used for background subtraction. All conditions were done in technical triplicate. Following an incubation of 3 population doublings of the cell lines, an MTS cell proliferation assay was performed as described for the doubling time experiment. Percent viability was calculated as a percentage of DMSO treated cells after background subtraction (medium only wells). Values for each biological repeat were fit to a four-parameter logistic curve and absolute IC50 values were calculated with R software (RRID:SCR_001905), version 3.3.2 (R Core Team, 2016).

Gene expression analysis

MV4;11 and K-562 cells were seeded at 1 × 105 cells per well in a 48 well plate in triplicate for each condition and incubated overnight. Cells were then treated with either 500 nM I-BET151 or an equivalent volume of DMSO (final concentration of 0.1% (v/v)) for 6 hr. Cells were then harvested and RNA isolated using the RNAspin mini kit (GE Healthcare, cat# 25-0500-70) according to manufacturer's instructions. RNA concentration and purity was determined (data available at https://osf.io/n8frz/) and samples were stored at −80°C until further analyzed. Total RNA was reverse transcribed into cDNA using SuperScript III First-Strand Synthesis System (GE Healthcare, cat# 27-9261-01) according to manufacturer's instructions. qRT-PCR reactions were then performed in technical triplicate using BCL2 and B2M-specific primers (sequences listed in Registered Report [Fung et al., 2015]), and SYBR Green PCR Master Mix (ThermoFisher, cat# 4344463) according to manufacturer’s instructions. Reaction volume was 20 µl and consisted of 10 µl 2X SYBR Green PCR Master Mix, 20 µM forward and reverse primers, and 5 µl RNA diluted in water. A negative control without RNA template was also included. PCR cycling conditions were used as follows: [1 cycle 95°C for 10 min -- 40 cycles 95°C for 15 s, 60°C 60 s] using a real-time PCR system (Bio-Rad, DNA Engine Opticon System) and Opticon Monitor Software (Bio-Rad, RRID:SCR_014241), version 1.4. BCL2 transcript levels were normalized to B2M levels in each sample and I-BET151 treated samples were compared to DMSO treated samples for each cell line using the ∆∆Ct method.

Animals

All animal procedures were approved by the University of Pennsylvania Stem Cell and Xenograft Core IACUC# 803506 and were in accordance with the University of Pennsylvania policies on the care, welfare, and treatment of laboratory animals. Four-week old female NOD-SCID mice (Jackson Laboratory, Stock No: 001303, RRID:IMSR_JAX:001303) were housed in sterile conditions using high-efficiency particulate arrestance (HEPA)-filtered micro-isolator with 12 hr light/dark cycles, and fed with sterile rodent chow and acidified water ad libitum. The mice were housed for approximately 3 weeks before being enrolled in the study.

I-BET151 maximum tolerated dose (MTD) study

After acclimation, 20 female NOD-SCID mice were conditioned with 30 mg/kg Busulfan (Otsuka America Pharmaceutical, Inc., cat# NDC 59148-070-90) by intraperitoneal (IP) injection 24 hr prior to injection of MV4;11 cells. Conditioned mice were then intravenously injected via the tail vein with 1 × 107 MV4;11 cells in 0.2 ml sterile vehicle (PBS) (Gibco, Life Technologies, cat# 14190–136). Mice were inspected daily for signs of distress and the 'Mouse Health Scoring System' (Cooke et al., 1996) was used to record scores for each mouse over the course of the study. At day 21 post-injection, retro-orbital bleeds were collected and analyzed for leukaemia burden by flow cytometry. Detectable disease was not achieved on day 21 and daily inspection continued until day 34 when disease was detected. Animals were then ranked according to disease burden (percent HLA-A,B,C+ cells) and randomly assigned to one of four groups using an alternating serpentine method. Group assignment was done by simple randomization. All mice had detectable leukaemia at time of randomization and variability of disease burden was evenly distributed among the four conditions (One-way ANOVA; F(3,16) = 0.019, p=0.996). After randomization, mice received daily IP injections of vehicle control, 10 mg/kg I-BET151, 20 mg/kg I-BET151, or 30 mg/kg I-BET151 in a volume of 10 ml/kg body weight. Injection volume and dosing was kept constant throughout the treatment period. Mice were inspected daily for the duration of the 21 day drug treatment for decreased spontaneous activity (grooming and ambulation), ruffled fur, weight loss, and loss of hind limb motility. Mice were sacrificed if moribund, received a health-monitoring score of three, or within three days of the last treatment. Upon sacrifice, cardiac puncture was performed to collect the peripheral blood, and then the spleen and both tibias and femora were removed. Splenic cell suspension was made in cold PBS by gently mushing the spleen between two microscopic glass slides. Bones were flushed with cold PBS to isolate bone marrow cells. Red blood cells from samples were lysed using ammonium chloride solution (STEMCELL Technologies, cat# 07850) according to manufacturer’s instructions prior to flow cytometry analysis.

I-BET151 efficacy study

After one week of acclimation, 28 female NOD-SCID mice were conditioned with Busulfan and injected with MV4;11 cells as described in the MTD study. Five mice died unexpectedly before the first retro-orbital bleed to assess leukaemia burden. The remaining 23 mice were analyzed on day 21 post-injection, with no engraftment detected. Similar to the MTD study, mice were analyzed again two weeks later, with leukaemia burden detected. Animals were then ranked according to disease burden (percent HLA-A,B,C+ cells) and randomly assigned to a group using an alternating serpentine method. Designation of vehicle control or I-BET151 to group was done by a simple randomization. All mice had detectable leukaemia at time of randomization and variability of disease burden was evenly distributed among the two conditions (Student’s t-test; t(21) = 0.028, p=0.978). After randomization, mice received daily IP injections of vehicle control or 20 mg/kg I-BET151 in a volume of 10 ml/kg body weight. Injection volume and dosing was kept constant throughout the treatment period. Mice were inspected daily and peripheral blood, spleen, and bone marrow cells were isolated upon sacrifice as described for the MTD study.

Flow cytometry analysis

To assess leukaemia burden in peripheral blood, spleen, and bone marrow cells, a maximum of 106 cells were stained in a sample volume of 50 µl each. 20 µl of APC conjugated anti-human HLA-A,B,C (Biolegend, cat# 311410, RRID:AB_314879) or APC conjugated isotype control (Biolegend, cat# 400220, RRID:AB_326468) was added and incubated at room temperature for 15 min. CountBright absolute counting beads (Life Technology, cat# C36950) in 1X FACS lysing solution were added and incubated at room temperature for an additional 15 min. Flow cytometry analysis was performed on a Canto or LSRII (Becton-Dickinson) and analyzed with FlowJo software (Tree Star, Inc., RRID:SCR_008520), version 10.1. Gating strategy was described in the Registered Report with a representative example reported in Figure 4—figure supplement 3A.

To assess cell death, a maximum of 106 cells were stained in a sample volume of 50 µl each. 5 µl of APC conjugated anti-human HLA-A,B,C or APC conjugated isotype control was added and incubated at 4°C for 30 min. After washing twice with 1 ml of 1X Binding Buffer and collecting cells by spinning at 300xg for 10 min, cells were resuspended in 100 µl of 1X Binding Buffer. 10 µl of Annexin V-FITC (Miltenyi Biotech Ltd, cat# 130-092-052) was added and incubated at room temperature for 15 min in the dark. After washing twice with 1 ml of 1X Binding Buffer and collecting cells by spinning at 300xg for 10 min, cells were resuspended in 500 µl of 1X Binding Buffer. Prior to flow cytometry analysis 5 µl of propidium iodide (PI) (Invitrogen, cat# P3566) was added. An initial attempt with 7-AAD, which was pre-specified in the Registered Report, was unsuccessful because of an inability to properly compensate, thus PI was substituted. Gating strategy was described in the Registered Report with a representative example reported in Figure 4—figure supplement 3B. Flow cytometry data for this work is also available via Flow Repository (RRID:SCR_013779; Spidlen et al., 2012), where it is directly accessible at https://flowrepository.org/id/FR-FCM-ZY68.

Immunohistochemistry

Tissues were fixed in 10% buffered formalin (Fisher Scientific, cat# 23–245685) and subsequently embedded in paraffin. The paraffin blocks were sectioned (5 µm) and stained with mouse anti-human Class 1 HLA-A,B,C antibody (Abcam, cat# ab70328, RRID:AB_1269092) and counterstained with hematoxylin. Staining was performed on a BOND-IIITM instrument (Leica Biosystems, Wetzlar, Germany) using the Bond Polymer Refine Detection System (Leica Biosystems, cat# DS9800). Heat-induced epitope retrieval was done for 20 min with ER1 solution (Leica Biosystems, cat# AR9961). Incubation with the HLA-A,B,C antibody was at 1:1200 dilution for 15 min followed by 8 min post-primary step and 8 min incubation with polymer HRP. Then endogenous peroxidase was blocked for 5 min followed by a 10 min incubation with DAB substrate. All staining steps were performed at room temperature with slides washed three times between each step with Bond Wash Solution (Leica Biosystems, cat# AR9590) or water. A section of human tonsil tissue was processed in parallel as a positive control. Immunostaining was performed by the Pathology Clinical Service Center at the University of Pennsylvania, Perelman School of Medicine.

Statistical analysis

Statistical analysis was performed with R software (RRID:SCR_001905), version 3.3.2 (R Core Team, 2016). All data, csv files, and analysis scripts are available on the OSF (https://osf.io/hcqqy/). Confirmatory statistical analysis was pre-registered (https://osf.io/kfx9p/) before the experimental work began as outlined in the Registered Report (Fung et al., 2015). Data were checked to ensure assumptions of statistical tests were met. A meta-analysis of a common original and replication effect size was performed with a random effects model and the metafor R package (Viechtbauer, 2010) (https://osf.io/vfp47/). The original study data was extracted a priori from the published figures by determining the mean and upper/lower error values for each data point. The extracted data were published in the Registered Report (Fung et al., 2015) and were used in the power calculations to determine the sample size for this study.

Deviations from registered report

The range of I-BET151 doses used in cellular dose response assay was altered from 0.01 nM to 10 mM listed in the Registered Report to 1 pM to 1 mM reported in this replication attempt. This was due to the solubility of I-BET151 restricting the highest dose that could be tested in this replication attempt (1 mM), which was the same dose tested in the original study. For the in vivo experimentation, only female mice were used in this replication attempt rather than a male female split as outlined in the Registered Report. This deviation was caused by an inability to obtain male and female mice from the supplier when the MTD study was performed. To maintain consistency with the MTD study only female mice were utilized in the efficacy study. Furthermore, the time between MV4;11 injection and animal randomization was extended from the 21 days outlined in the Registered Report (Fung et al., 2015) due to undetectable disease burden at this time. Mice were checked for disease burden 2 weeks later to allow additional time for disease manifestation. This occurred in both the MTD and efficacy studies, with disease burden only becoming detectable after this additional time, after which the remaining aspects of the protocol were carried out.

For the cell death flow cytometry analysis an initial attempt using 7-AAD was unsuccessful because of an inability to properly compensate, thus PI, a suitable viability stain, was substituted. Additional materials and instrumentation not listed in the Registered Report, but needed during experimentation are also listed above.

Acknowledgements

The Reproducibility Project: Cancer Biology would like to thank Dr. Mark Dawson (Peter MacCallum Cancer Centre) for sharing critical information and the following companies for generously donating reagents to the Reproducibility Project: Cancer Biology; American Type and Tissue Collection (ATCC), Applied Biological Materials, BioLegend, Charles River Laboratories, Corning Incorporated, DDC Medical, EMD Millipore, Harlan Laboratories, LI-COR Biosciences, Mirus Bio, Novus Biologicals, Sigma-Aldrich, and System Biosciences (SBI).

Funding Statement

The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Karen Adelman, Harvard University, United States.

Reproducibility Project: Cancer Biology:

Elizabeth Iorns, Alexandria Denis, Stephen R Williams, Nicole Perfito, and Timothy M Errington

Funding Information

This paper was supported by the following grant:

  • Laura and John Arnold Foundation to .

Additional information

Competing interests

XS: Stem Cell and Xenograft Core, University of Pennsylvania, Perelman School of Medicine is a Science Exchange associated lab.

JJF: ProNovus Bioscience, LLC is a Science Exchange associated lab.

AK: ProNovus Bioscience, LLC is a Science Exchange associated lab.

GD-D: Stem Cell and Xenograft Core, University of Pennsylvania, Perelman School of Medicine is a Science Exchange associated lab.

RP:CB: EI, NP: Employed by and hold shares in Science Exchange Inc. The other authors declare that no competing interests exist.

Author contributions

XS, Acquisition of data, Drafting or revising the article.

JJF, Acquisition of data, Drafting or revising the article.

AK, Acquisition of data, Drafting or revising the article.

GD-D, Acquisition of data, Drafting or revising the article.

RP:CB, Analysis and interpretation of data, Drafting or revising the article.

Ethics

Animal experimentation: All animal procedures were approved by the University of Pennsylvania Stem Cell and Xenograft Core IACUC# 803506 and were in accordance with the University of Pennsylvania policies on the care, welfare, and treatment of laboratory animals.

References

  1. Amorim S, Stathis A, Gleeson M, Iyengar S, Magarotto V, Leleu X, Morschhauser F, Karlin L, Broussais F, Rezai K, Herait P, Kahatt C, Lokiec F, Salles G, Facon T, Palumbo A, Cunningham D, Zucca E, Thieblemont C. Bromodomain inhibitor OTX015 in patients with lymphoma or multiple myeloma: a dose-escalation, open-label, pharmacokinetic, phase 1 study. The Lancet Haematology. 2016;3:e196–e204. doi: 10.1016/S2352-3026(16)00021-1. [DOI] [PubMed] [Google Scholar]
  2. Asangani IA, Wilder-Romans K, Dommeti VL, Krishnamurthy PM, Apel IJ, Escara-Wilke J, Plymate SR, Navone NM, Wang S, Feng FY, Chinnaiyan AM. BET bromodomain inhibitors enhance efficacy and disrupt resistance to AR antagonists in the treatment of Prostate Cancer. Molecular Cancer Research. 2016;14:324–331. doi: 10.1158/1541-7786.MCR-15-0472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bid HK, Phelps DA, Xaio L, Guttridge DC, Lin J, London C, Baker LH, Mo X, Houghton PJ. The Bromodomain BET Inhibitor JQ1 suppresses tumor angiogenesis in models of Childhood Sarcoma. Molecular Cancer Therapeutics. 2016;15:1018–1028. doi: 10.1158/1535-7163.MCT-15-0567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chaidos A, Caputo V, Karadimitris A. Inhibition of bromodomain and extra-terminal proteins (BET) as a potential therapeutic approach in haematological malignancies: emerging preclinical and clinical evidence. Therapeutic Advances in Hematology. 2015;6:128–141. doi: 10.1177/2040620715576662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chen C, Liu Y, Lu C, Cross JR, Morris JP, Shroff AS, Ward PS, Bradner JE, Thompson C, Lowe SW. Cancer-associated IDH2 mutants drive an acute myeloid leukemia that is susceptible to Brd4 inhibition. Genes & Development. 2013;27:1974–1985. doi: 10.1101/gad.226613.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Clayton JA, Collins FS. Policy: nih to balance sex in cell and animal studies. Nature. 2014;509:282–283. doi: 10.1038/509282a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cooke KR, Kobzik L, Martin TR, Brewer J, Delmonte J, Crawford JM, Ferrara JL. An experimental model of idiopathic pneumonia syndrome after bone marrow transplantation: I. The roles of minor H antigens and endotoxin. Blood. 1996;88:3230–3239. [PubMed] [Google Scholar]
  8. Coudé MM, Braun T, Berrou J, Dupont M, Bertrand S, Masse A, Raffoux E, Itzykson R, Delord M, Riveiro ME, Herait P, Baruchel A, Dombret H, Gardin C. BET inhibitor OTX015 targets BRD2 and BRD4 and decreases c-MYC in acute leukemia cells. Oncotarget. 2015;6:17698–17712. doi: 10.18632/oncotarget.4131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. da Motta LL, Ledaki I, Purshouse K, Haider S, De Bastiani MA, Baban D, Morotti M, Steers G, Wigfield S, Bridges E, Li JL, Knapp S, Ebner D, Klamt F, Harris AL, McIntyre A. The BET inhibitor JQ1 selectively impairs tumour response to hypoxia and downregulates CA9 and angiogenesis in triple negative breast cancer. Oncogene. 2017;36:122–132. doi: 10.1038/onc.2016.184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Dawson MA, Gudgin EJ, Horton SJ, Giotopoulos G, Meduri E, Robson S, Cannizzaro E, Osaki H, Wiese M, Putwain S, Fong CY, Grove C, Craig J, Dittmann A, Lugo D, Jeffrey P, Drewes G, Lee K, Bullinger L, Prinjha RK, Kouzarides T, Vassiliou GS, Huntly BJ. Recurrent mutations, including NPM1c, activate a BRD4-dependent core transcriptional program in acute myeloid leukemia. Leukemia. 2014;28:311–320. doi: 10.1038/leu.2013.338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Dawson MA, Prinjha RK, Dittmann A, Giotopoulos G, Bantscheff M, Chan WI, Robson SC, Chung CW, Hopf C, Savitski MM, Huthmacher C, Gudgin E, Lugo D, Beinke S, Chapman TD, Roberts EJ, Soden PE, Auger KR, Mirguet O, Doehner K, Delwel R, Burnett AK, Jeffrey P, Drewes G, Lee K, Huntly BJ, Kouzarides T. Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia. Nature. 2011;478:529–533. doi: 10.1038/nature10509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dorn DC, Kou CA, Png KJ, Moore MA. The effect of cantharidins on leukemic stem cells. International Journal of Cancer. 2009;124:2186–2199. doi: 10.1002/ijc.24157. [DOI] [PubMed] [Google Scholar]
  13. Errington TM, Iorns E, Gunn W, Tan FE, Lomax J, Nosek BA. An open investigation of the reproducibility of cancer biology research. eLife. 2014;3:e04333. doi: 10.7554/eLife.04333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fiskus W, Sharma S, Qi J, Valenta JA, Schaub LJ, Shah B, Peth K, Portier BP, Rodriguez M, Devaraj SG, Zhan M, Sheng J, Iyer SP, Bradner JE, Bhalla KN. Highly active combination of BRD4 antagonist and histone deacetylase inhibitor against human acute myelogenous leukemia cells. Molecular Cancer Therapeutics. 2014;13:1142–1154. doi: 10.1158/1535-7163.MCT-13-0770. [DOI] [PubMed] [Google Scholar]
  15. Fong CY, Gilan O, Lam EY, Rubin AF, Ftouni S, Tyler D, Stanley K, Sinha D, Yeh P, Morison J, Giotopoulos G, Lugo D, Jeffrey P, Lee SC, Carpenter C, Gregory R, Ramsay RG, Lane SW, Abdel-Wahab O, Kouzarides T, Johnstone RW, Dawson SJ, Huntly BJ, Prinjha RK, Papenfuss AT, Dawson MA. BET inhibitor resistance emerges from leukaemia stem cells. Nature. 2015;525:538–542. doi: 10.1038/nature14888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. French CA. Advances in Cancer Research. Elsevier; 2016. Small-Molecule Targeting of BET Proteins in Cancer; pp. 21–58. [DOI] [PubMed] [Google Scholar]
  17. Fu L, Kettner NM. Progress in Molecular Biology and Translational Science. Elsevier; 2013. The circadian clock in cancer development and therapy; pp. 221–282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fung JJ, Kosaka A, Shan X, Danet-Desnoyers G, Gormally M, Owen K, Reproducibility Project: Cancer Biology Registered report: Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukemia. eLife. 2015;4:e08997. doi: 10.7554/eLife.08997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ganzel C, Manola J, Douer D, Rowe JM, Fernandez HF, Paietta EM, Litzow MR, Lee JW, Luger SM, Lazarus HM, Cripe LD, Wiernik PH, Tallman MS. Extramedullary Disease in adult acute myeloid leukemia is common but lacks Independent significance: analysis of patients in ECOG-ACRIN Cancer research Group Trials, 1980-2008. Journal of Clinical Oncology. 2016;34:3544–3553. doi: 10.1200/JCO.2016.67.5892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gilan O, Lam EY, Becher I, Lugo D, Cannizzaro E, Joberty G, Ward A, Wiese M, Fong CY, Ftouni S, Tyler D, Stanley K, MacPherson L, Weng CF, Chan YC, Ghisi M, Smil D, Carpenter C, Brown P, Garton N, Blewitt ME, Bannister AJ, Kouzarides T, Huntly BJ, Johnstone RW, Drewes G, Dawson SJ, Arrowsmith CH, Grandi P, Prinjha RK, Dawson MA. Functional interdependence of BRD4 and DOT1L in MLL leukemia. Nature Structural & Molecular Biology. 2016;23:673–681. doi: 10.1038/nsmb.3249. [DOI] [PubMed] [Google Scholar]
  21. Hafner M, Niepel M, Chung M, Sorger PK. Growth rate inhibition metrics correct for confounders in measuring sensitivity to Cancer drugs. Nature Methods. 2016;13:521–527. doi: 10.1038/nmeth.3853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hardwicke MA, Oleykowski CA, Plant R, Wang J, Liao Q, Moss K, Newlander K, Adams JL, Dhanak D, Yang J, Lai Z, Sutton D, Patrick D. GSK1070916, a Potent Aurora B/C kinase inhibitor with Broad antitumor activity in tissue culture cells and Human tumor xenograft models. Molecular Cancer Therapeutics. 2009;8:1808–1817. doi: 10.1158/1535-7163.MCT-09-0041. [DOI] [PubMed] [Google Scholar]
  23. Herrmann H, Blatt K, Shi J, Gleixner KV, Cerny-Reiterer S, Müllauer L, Vakoc CR, Sperr WR, Horny H-P, Bradner JE, Zuber J, Valent P. Small-molecule inhibition of BRD4 as a new potent approach to eliminate leukemic stem- and progenitor cells in acute myeloid leukemia (AML) Oncotarget. 2013;3:1588–1599. doi: 10.18632/oncotarget.733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hughes P, Marshall D, Reid Y, Parkes H, Gelber C. The costs of using unauthenticated, over-passaged cell lines: how much more data do we need? BioTechniques. 2007;43:575–586. doi: 10.2144/000112598. [DOI] [PubMed] [Google Scholar]
  25. Kannt A, Wieland T. Managing risks in drug discovery: reproducibility of published findings. Naunyn-Schmiedeberg's Archives of Pharmacology. 2016;389:353–360. doi: 10.1007/s00210-016-1216-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kleensang A, Vantangoli MM, Odwin-DaCosta S, Andersen ME, Boekelheide K, Bouhifd M, Fornace AJ, Li HH, Livi CB, Madnick S, Maertens A, Rosenberg M, Yager JD, Zhaog L, Hartung T. Genetic variability in a frozen batch of MCF-7 cells invisible in routine authentication affecting cell function. Scientific Reports. 2016;6:28994. doi: 10.1038/srep28994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kokolus KM, Capitano ML, Lee CT, Eng JW, Waight JD, Hylander BL, Sexton S, Hong CC, Gordon CJ, Abrams SI, Repasky EA. Baseline tumor growth and immune control in laboratory mice are significantly influenced by subthermoneutral housing temperature. PNAS. 2013;110:20176–20181. doi: 10.1073/pnas.1304291110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kumar K, Raza SS, Knab LM, Chow CR, Kwok B, Bentrem DJ, Popovic R, Ebine K, Licht JD, Munshi HG. GLI2-dependent c-MYC upregulation mediates resistance of pancreatic Cancer cells to the BET bromodomain inhibitor JQ1. Scientific Reports. 2015;5:9489. doi: 10.1038/srep09489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kurimchak AM, Shelton C, Duncan KE, Johnson KJ, Brown J, O'Brien S, Gabbasov R, Fink LS, Li Y, Lounsbury N, Abou-Gharbia M, Childers WE, Connolly DC, Chernoff J, Peterson JR, Duncan JS. Resistance to BET Bromodomain inhibitors is mediated by Kinome Reprogramming in ovarian Cancer. Cell Reports. 2016;16:1273–1286. doi: 10.1016/j.celrep.2016.06.091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lopes de Menezes DE, Peng J, Garrett EN, Louie SG, Lee SH, Wiesmann M, Tang Y, Shephard L, Goldbeck C, Oei Y, Ye H, Aukerman SL, Heise C. CHIR-258: a potent inhibitor of FLT3 kinase in experimental tumor xenograft models of human acute myelogenous leukemia. Clinical Cancer Research. 2005;11:5281–5291. doi: 10.1158/1078-0432.CCR-05-0358. [DOI] [PubMed] [Google Scholar]
  31. Ma S, Yang LL, Niu T, Cheng C, Zhong L, Zheng MW, Xiong Y, Li LL, Xiang R, Chen LJ, Zhou Q, Wei YQ, Yang SY. SKLB-677, an FLT3 and wnt/β-catenin signaling inhibitor, displays potent activity in models of FLT3-driven AML. Scientific Reports. 2015;5:15646. doi: 10.1038/srep15646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Macpherson AJ, McCoy KD. Standardised animal models of host microbial mutualism. Mucosal Immunology. 2015;8:476–486. doi: 10.1038/mi.2014.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Mertz JA, Conery AR, Bryant BM, Sandy P, Balasubramanian S, Mele DA, Bergeron L, Sims RJ. Targeting MYC dependence in Cancer by inhibiting BET bromodomains. PNAS. 2011;108:16669–16674. doi: 10.1073/pnas.1108190108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Montenegro RC, Clark PG, Howarth A, Wan X, Ceroni A, Siejka P, Nunez-Alonso GA, Monteiro O, Rogers C, Gamble V, Burbano R, Brennan PE, Tallant C, Ebner D, Fedorov O, O'Neill E, Knapp S, Dixon D, Müller S. BET inhibition as a new strategy for the treatment of gastric Cancer. Oncotarget. 2016;7:43997–44012. doi: 10.18632/oncotarget.9766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Nakagawa A, Adams CE, Huang Y, Hamarneh SR, Liu W, Von Alt KN, Mino-Kenudson M, Hodin RA, Lillemoe KD, Fernández-Del Castillo C, Warshaw AL, Liss AS. Selective and reversible suppression of intestinal stem cell differentiation by pharmacological inhibition of BET bromodomains. Scientific Reports. 2016;6:20390. doi: 10.1038/srep20390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. O'Farrell AM, Abrams TJ, Yuen HA, Ngai TJ, Louie SG, Yee KW, Wong LM, Hong W, Lee LB, Town A, Smolich BD, Manning WC, Murray LJ, Heinrich MC, Cherrington JM. SU11248 is a novel FLT3 tyrosine kinase inhibitor with potent activity in vitro and in vivo. Blood. 2003;101:3597–3605. doi: 10.1182/blood-2002-07-2307. [DOI] [PubMed] [Google Scholar]
  37. Prochazka M, Gaskins HR, Shultz LD, Leiter EH. The nonobese diabetic scid mouse: model for spontaneous thymomagenesis associated with immunodeficiency. PNAS. 1992;89:3290–3294. doi: 10.1073/pnas.89.8.3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. R Core Team . R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2016. [Google Scholar]
  39. Rathert P, Roth M, Neumann T, Muerdter F, Roe JS, Muhar M, Deswal S, Cerny-Reiterer S, Peter B, Jude J, Hoffmann T, Boryń ŁM, Axelsson E, Schweifer N, Tontsch-Grunt U, Dow LE, Gianni D, Pearson M, Valent P, Stark A, Kraut N, Vakoc CR, Zuber J. Transcriptional plasticity promotes primary and acquired resistance to BET inhibition. Nature. 2015;525:543–547. doi: 10.1038/nature14898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Robert-Richard E, Ged C, Ortet J, Santarelli X, Lamrissi-Garcia I, de Verneuil H, Mazurier F. Human cell engraftment after busulfan or irradiation conditioning of NOD/SCID mice. Haematologica. 2006;91:1384. [PubMed] [Google Scholar]
  41. Robinson BW, Behling KC, Gupta M, Zhang AY, Moore JS, Bantly AD, Willman CL, Carroll AJ, Adamson PC, Barrett JS, Felix CA. Abundant anti-apoptotic BCL-2 is a molecular target in leukaemias with t(4;11) translocation. British Journal of Haematology. 2008;141:827–839. doi: 10.1111/j.1365-2141.2008.07100.x. [DOI] [PubMed] [Google Scholar]
  42. Saland E, Boutzen H, Castellano R, Pouyet L, Griessinger E, Larrue C, de Toni F, Scotland S, David M, Danet-Desnoyers G, Vergez F, Barreira Y, Collette Y, Récher C, Sarry JE. A robust and rapid xenograft model to assess efficacy of chemotherapeutic agents for human acute myeloid leukemia. Blood Cancer Journal. 2015;5:e297. doi: 10.1038/bcj.2015.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Sebaugh JL. Guidelines for accurate EC50/IC50 estimation. Pharmaceutical Statistics. 2011;10:128–134. doi: 10.1002/pst.426. [DOI] [PubMed] [Google Scholar]
  44. Shan X, Fung JJ, Kosaka A, Danet-Desnoyers G, Iorns E, Denis A, Williams SR, Perfito N, Errington TM. 2017. Study 29: Replication of Dawson, et al., 2011 (Nature) Open Science Framework. [DOI]
  45. Shu S, Lin CY, He HH, Witwicki RM, Tabassum DP, Roberts JM, Janiszewska M, Huh SJ, Liang Y, Ryan J, Doherty E, Mohammed H, Guo H, Stover DG, Ekram MB, Peluffo G, Brown J, D'Santos C, Krop IE, Dillon D, McKeown M, Ott C, Qi J, Ni M, Rao PK, Duarte M, Wu SY, Chiang CM, Anders L, Young RA, Winer EP, Letai A, Barry WT, Carroll JS, Long HW, Brown M, Liu XS, Meyer CA, Bradner JE, Polyak K. Response and resistance to BET bromodomain inhibitors in triple-negative breast Cancer. Nature. 2016;529:413–417. doi: 10.1038/nature16508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Spidlen J, Breuer K, Rosenberg C, Kotecha N, Brinkman RR. FlowRepository: a resource of annotated flow cytometry datasets associated with peer-reviewed publications. Cytometry Part A. 2012;81:727–731. doi: 10.1002/cyto.a.22106. [DOI] [PubMed] [Google Scholar]
  47. Stathis A, Zucca E, Bekradda M, Gomez-Roca C, Delord JP, de La Motte Rouge T, Uro-Coste E, de Braud F, Pelosi G, French CA. Clinical response of Carcinomas harboring the BRD4-NUT oncoprotein to the targeted Bromodomain Inhibitor OTX015/MK-8628. Cancer Discovery. 2016;6:492–500. doi: 10.1158/2159-8290.CD-15-1335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Valentine JC, Biglan A, Boruch RF, Castro FG, Collins LM, Flay BR, Kellam S, Mościcki EK, Schinke SP. Replication in prevention science. Prevention Science. 2011;12:103–117. doi: 10.1007/s11121-011-0217-6. [DOI] [PubMed] [Google Scholar]
  49. Viechtbauer W. Conducting Meta-Analyses in R with the metafor Package. Journal of Statistical Software. 2010;36:1–48. doi: 10.18637/jss.v036.i03. [DOI] [Google Scholar]
  50. Wadhwa E, Nicolaides T. Bromodomain Inhibitor Review: bromodomain and Extra-terminal Family protein inhibitors as a Potential New therapy in Central Nervous System tumors. Cureus. 2016;8:e620. doi: 10.7759/cureus.620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Wang QF, Wu G, Mi S, He F, Wu J, Dong J, Luo RT, Mattison R, Kaberlein JJ, Prabhakar S, Ji H, Thirman MJ. MLL fusion proteins preferentially regulate a subset of wild-type MLL target genes in the leukemic genome. Blood. 2011;117:6895–6905. doi: 10.1182/blood-2010-12-324699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Yao W, Yue P, Khuri FR, Sun SY. The BET bromodomain inhibitor, JQ1, facilitates c-FLIP degradation and enhances TRAIL-induced apoptosis independent of BRD4 and c-Myc inhibition. Oncotarget. 2015;6:34669–34679. doi: 10.18632/oncotarget.5785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zhang Z, Ma P, Jing Y, Yan Y, Cai MC, Zhang M, Zhang S, Peng H, Ji ZL, Di W, Gu Z, Gao WQ, Zhuang G. BET bromodomain inhibition as a therapeutic strategy in ovarian Cancer by downregulating FoxM1. Theranostics. 2016;6:219–230. doi: 10.7150/thno.13178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Zuber J, Shi J, Wang E, Rappaport AR, Herrmann H, Sison EA, Magoon D, Qi J, Blatt K, Wunderlich M, Taylor MJ, Johns C, Chicas A, Mulloy JC, Kogan SC, Brown P, Valent P, Bradner JE, Lowe SW, Vakoc CR. RNAi screen identifies Brd4 as a therapeutic target in acute myeloid leukaemia. Nature. 2011;478:524–528. doi: 10.1038/nature10334. [DOI] [PMC free article] [PubMed] [Google Scholar]
eLife. 2017 Jun 27;6:e25306. doi: 10.7554/eLife.25306.014

Decision letter

Editor: Karen Adelman1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Replication Study: Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Charles Sawyers as the Senior Editor. Two of the reviewers, Mark Dawson and M Dawn Teare, have agreed to share their names; a third reviewer remains anonymous.

The following issues need to be addressed before acceptance, as outlined below:

1) The Abstract and text needs to make clear that (and how) the in vivo protocol employed deviated from the original protocol and acknowledge how this might alter the outcome. It seems very important to not give the wrong impression that the xenograft experiments of the original study could not be reproduced, which would easily be the take-home-message after a "superficial read" of the current manuscript. To avoid this impression, the authors should clearly highlight the most relevant deviations from the original study (i.e. the alternative conditioning regimen and the reduced I-BET151 dose) in the Abstract.

2) Complications to interpretation due to other animal health issues should be clearly described. Here, it is not necessary to describe all potential issues in detail in the main text (although perhaps in supplement), but to address the following possibility, raised by reviewer #1. "The fact that the reduction in disease burden following IBET treatment closely paralleled the findings reported in the original study yet a survival benefit was not observed here raises the possibility that leukaemia related mortality was not the primary cause for the mortality observed in at least some animals in the IBET treatment arm."

Specific examples of places where clarification is in order are given below, as well as some additional suggestions, which have been extracted from the reviews:

Reviewer #1:

From a reader's perspective there is too much emphasis on the statistical testing performed and not enough discussion about the results and how variations to the original study should be interpreted.

For experiments 1) and 2) the general results obtained very much led to the same conclusion. In experiment 1) however, although there was, as previously, a marked difference in sensitivity between MV4;11 (IC50 9 nM) and K562 (unmeasurable), the meta-analysis demonstrated poor reproducibility between the concentrations in the original and reproduced experiments for MV4:11 (IC50 of 26 nM previously).

The main area of discrepancy relates to 3) the in vivo mouse experiment and specifically the efficacy study assessing overall survival. There are a number of methodological aspects of this study that differ substantially to the original study and these are not raised in the Abstract nor discussed adequately in the manuscript. Therapeutically, the dose of the drug was different (less, at 20 vs 30 mg/kg) as was the duration of therapy, with treatment starting at d35 instead of d21 as in the previous study. As disease activity was determined by examination of peripheral blood only and we know that this xenograft model gives disseminated disease, one interpretation could be that the results differed as a lower dose was used in the face of an increased disease bulk. Most importantly technically, the conditioning regime used was markedly different. 30mg/kg of Busulfan is a very different approach to the more conventional strategy of sub-lethal irradiation performed in the original study. The conditioning regime can dramatically alter engraftment kinetics and also the natural history of the disease. This is reflected in this particular case by the fact that one quarter of the mice on this study developed 'solid malignancies', which I assume to be extramedullary leukaemia. Furthermore 3/12 mice treated with I-BET developed gastrointestinal bloating and (whilst not explicitly described) presumably died for reasons that were independent of the leukaemia burden. This is alluded to by the authors when they state that "animal health issues were observed beyond the anticipated disease presentation of this model" but could be stressed more. These important issues impact significantly on an efficacy study that is essentially measuring all cause mortality. The fact that the reduction in disease burden following IBET treatment closely paralleled the findings reported in the original study yet a survival benefit was not observed here raises the possibility that leukaemia related mortality was not the primary cause for the mortality observed in at least some animals in the IBET treatment arm.

All of the above differences would also explain the apparently contradictory finding that, although survival was not impacted, disease bulk was consistently less in the IBET treated arm. A significant issue with the manuscript as currently presented is that the reader is left unaware of these critical details when reading the reproduction. It is essential to communicate this both in the Abstract, in the Results section and when discussing the discrepant results.

The results from the MTD study show that the mice in the 30mg/kg arm had the greatest variation in starting weight amongst all the cohorts. There is a 25% variation in weight between the heaviest and lightest mouse in this cohort whereas the starting weight of the mice in the other cohort's vary very little. The initial weight of the mice can contribute substantially to the observed toxicity of any intervention especially conditioning. This variation may be one of the reasons why more toxicity was observed in this cohort.

Reviewer #2:

The summary assessment of each experiment is not consistently written it states 'did not find a statistically significant' but the converse is not stated, i.e. 'we did see a statistically significant'. The summary needs to be consistent. This replication study has been powered from a statistical perspective so this needs to be clear in the Abstract.

Reviewer #3:

Most experiments reproduce key results of the original study, with the exception of in-vivo treatment studies in the MV4;11 AML xenograft model, where the authors only reproduce the impact on disease burden while impact on overall survival has been observed. For the interpretation of these results it should be noted that the replication study diverged from the original work in several experimental details that might be of relevance.

Overall, the following major points should be considered prior to publication:

In the replication of MV4;11 xenograft experiments several experimental conditions deviate from the original study. Most likely due to a lack of access to an irradiation unit, the authors conditioned NOD/SCID mice with a single dose of busulfan (30 mg/kg i.p. 24 hrs prior to MV4;11 transplantation), which has been reported to lead to comparable engraftment of human CD34+ cord blood cells. Apart from the difference in cell type, the original study by Robert-Richard et al. mainly used other dosing regimens (35 mg/kg 36 hrs prior to injection; or 2 x 25 mg 48/24 hrs prior to injection), which they found to be less toxic. While under these conditions, busulfan and irradiation performed similar in human CD34+ HSC engraftment, one cannot conclude that the used regimen has no impact on the in-vivo growth of MV4;11 AML cells (which, for example, are likely to have a much higher proliferation rate than CD34+ HSCs). In fact, another study (Saland et al., 2015) has documented significant differences in the growth of xenografted MV4;11 cells following irradiation vs. busulfan (20 mg/kg). Although the use of NSG and a lower busulfan dose does not allow for direct conclusions for the present study, these results illustrate how sensitive xenograft models are to conditioning regimens. As most prominent difference, after busulfan conditioning the MV4;11 model turned out to be much delayed compared to the original study, which might be explained by insufficient immunosuppression, toxicity of busulfan or its metabolites on MV4;11 cells, or other experimental differences (which the authors partially summarize in their Discussion). Beyond reducing the overall aggressiveness of the model (and, potentially, promoting a high incidence of xenograft-derived solid tumors, which has been observed by the authors), these altered experimental conditions may also impact the in-vivo responsiveness of MV4;11 cells to I-BET151, which should be discussed. This seems particularly important, since busulfan conditioning is still quite uncommon and the used regimen (30 mg/kg 24 hrs prior to injection) has neither been tested in MV4;11 cells, nor has it been evaluated regarding its impact on treatment responsiveness.

Beyond changing the conditioning protocol, the authors (based on general toxicity studies) also decided to reduce the I-BET151 dose from 30 mg/kg to 20 mg/kg. While this seems reasonable for the specific experiment, the cause for the observed toxicity remains unclear and may (again) be related to the conditioning regimen. Most importantly, the use of a different dosing regimen on its own provides a possible explanation for reduced drug effects in the MV4;11 xenograft model. In fact, in recent work the authors of the original study show that reducing I-BET151 to 10 mg/kg has no impact on survival in this model (Gilan et al., Figure 3A). Overall, it cannot be assumed that the drug batch was simply more active and, therefore, needed to be reduced to achieve comparable in-vivo activities. While differences in drug batches should always be considered, the reduced tolerability might be due to the used conditioning regimen, and the differences in survival might simply be a consequence of the reduced I-BET151 dose used in this study.

Overall, changes in the conditioning regimen and the use of a lower dose of I-BET151 may substantially alter the in-vivo response of MV4;11 cells and provide (independent) possible explanations for the observed reduction in in-vivo drug activity. In light of these concerns, it seems very important to not give the wrong impression that the xenograft experiments of the original study could not be reproduced, which would easily be the take-home-message after a "superficial read" of the current manuscript. To avoid this impression, the authors should clearly highlight the most relevant deviations from the original study (i.e. the alternative conditioning regimen and the reduced I-BET151 dose) in the Abstract. Moreover, these experimental differences should also be discussed more extensively in the Discussion. In the current Discussion, the authors first raise some uncertainties due to missing experimental details, then summarize consistent conditions, and then transition into various possible differences that are hard to control for (which are certainly relevant; differences in MV4;11 culture conditions such as different FBS batches could be added). However, for a comprehensive discussion it seems very adequate to first summarize consistent conditions, then highlight obvious and important differences in experimental design (i.e. different conditioning and different drug dose), and only then discuss additional factors that are not easy to control for.

In summary, the design and execution of this replication study is of high quality, and all necessary changes to the original protocol have been implemented in the best possible way. However, the changes in the design of the xenograft study are substantial and might fully explain the observed differences in in-vivo effects of I-BET151. Due to these differences, the results of the performed xenograft experiments are not fully conclusive with respect to the reproduction of the original study, which should be highlighted more clearly.

eLife. 2017 Jun 27;6:e25306. doi: 10.7554/eLife.25306.015

Author response


The following issues need to be addressed before acceptance, as outlined below:

1) The Abstract and text needs to make clear that (and how) the in vivo protocol employed deviated from the original protocol and acknowledge how this might alter the outcome. It seems very important to not give the wrong impression that the xenograft experiments of the original study could not be reproduced, which would easily be the take-home-message after a "superficial read" of the current manuscript. To avoid this impression, the authors should clearly highlight the most relevant deviations from the original study (i.e. the alternative conditioning regimen and the reduced I-BET151 dose) in the Abstract.

We have revised the Abstract to reflect the most relevant deviations between this replication attempt and the original study. We have also included these deviations and expanded discussion on how they might alter the outcome in the main text.

2) Complications to interpretation due to other animal health issues should be clearly described. Here, it is not necessary to describe all potential issues in detail in the main text (although perhaps in supplement), but to address the following possibility, raised by reviewer #1. "The fact that the reduction in disease burden following IBET treatment closely paralleled the findings reported in the original study yet a survival benefit was not observed here raises the possibility that leukaemia related mortality was not the primary cause for the mortality observed in at least some animals in the IBET treatment arm."

We have revised the manuscript to include many of the potential areas of clarification and issues raised in the reviews. Specifically, for the possibility raised by reviewer #1, we included this possibility in an additional paragraph after describing the results of the in vivo experiment.

Specific examples of places where clarification is in order are given below, as well as some additional suggestions, which have been extracted from the reviews:

Reviewer #1:

From a reader's perspective there is too much emphasis on the statistical testing performed and not enough discussion about the results and how variations to the original study should be interpreted.

For experiments 1) and 2) the general results obtained very much led to the same conclusion. In experiment 1) however, although there was, as previously, a marked difference in sensitivity between MV4;11 (IC50 9 nM) and K562 (unmeasurable), the meta-analysis demonstrated poor reproducibility between the concentrations in the original and reproduced experiments for MV4:11 (IC50 of 26 nM previously).

The main area of discrepancy relates to 3) the in vivo mouse experiment and specifically the efficacy study assessing overall survival. There are a number of methodological aspects of this study that differ substantially to the original study and these are not raised in the Abstract nor discussed adequately in the manuscript. Therapeutically, the dose of the drug was different (less, at 20 vs 30 mg/kg) as was the duration of therapy, with treatment starting at d35 instead of d21 as in the previous study. As disease activity was determined by examination of peripheral blood only and we know that this xenograft model gives disseminated disease, one interpretation could be that the results differed as a lower dose was used in the face of an increased disease bulk. Most importantly technically, the conditioning regime used was markedly different. 30mg/kg of Busulfan is a very different approach to the more conventional strategy of sub-lethal irradiation performed in the original study. The conditioning regime can dramatically alter engraftment kinetics and also the natural history of the disease. This is reflected in this particular case by the fact that one quarter of the mice on this study developed 'solid malignancies', which I assume to be extramedullary leukaemia. Furthermore 3/12 mice treated with I-BET developed gastrointestinal bloating and (whilst not explicitly described) presumably died for reasons that were independent of the leukaemia burden. This is alluded to by the authors when they state that "animal health issues were observed beyond the anticipated disease presentation of this model" but could be stressed more. These important issues impact significantly on an efficacy study that is essentially measuring all cause mortality. The fact that the reduction in disease burden following IBET treatment closely paralleled the findings reported in the original study yet a survival benefit was not observed here raises the possibility that leukaemia related mortality was not the primary cause for the mortality observed in at least some animals in the IBET treatment arm.

All of the above differences would also explain the apparently contradictory finding that, although survival was not impacted, disease bulk was consistently less in the IBET treated arm. A significant issue with the manuscript as currently presented is that the reader is left unaware of these critical details when reading the reproduction. It is essential to communicate this both in the Abstract, in the Results section and when discussing the discrepant results.

We agree that there are many aspects a reader needs to consider when interpreting this study, just like any individual study. We have revised the manuscript to include the points raised here, specifically including discussion on the deviation in dose and conditioning used and the impact this could potentially have had on engraftment kinetics, disease progression, and determining survival benefit. We have also clarified in the text that the gastrointestinal bloating was observed at the time of sacrifice and expanding discussion about the solid tumors, which as this review suggests is likely extramedullary disease.

The results from the MTD study show that the mice in the 30mg/kg arm had the greatest variation in starting weight amongst all the cohorts. There is a 25% variation in weight between the heaviest and lightest mouse in this cohort whereas the starting weight of the mice in the other cohort's vary very little. The initial weight of the mice can contribute substantially to the observed toxicity of any intervention especially conditioning. This variation may be one of the reasons why more toxicity was observed in this cohort.

We agree that variation in the initial weight of the mice could be a reason for observed toxicity, but in the case of the 30 mg/kg cohort, the lightest mouse did not have any observed weight loss throughout the course of the MTD study or have a health monitoring score that justified euthanasia. The remaining four mice had weights with variation similar to the other cohorts; however two of those mice had to be euthanized, which defined the MTD for the study.

Reviewer #2:

The summary assessment of each experiment is not consistently written it states 'did not find a statistically significant' but the converse is not stated, i.e. 'we did see a statistically significant'. The summary needs to be consistent. This replication study has been powered from a statistical perspective so this needs to be clear in the Abstract.

We have revised the Abstract to provide a balanced summary assessment. We included ‘statistically significant’ for the BCL2 expression analysis, since we followed through on the confirmatory analysis that was powered from a statistical perspective. However, we did not revise the analysis of IC50 values since this was an exploratory analysis.

Reviewer #3:

Most experiments reproduce key results of the original study, with the exception of in-vivo treatment studies in the MV4;11 AML xenograft model, where the authors only reproduce the impact on disease burden while impact on overall survival has been observed. For the interpretation of these results it should be noted that the replication study diverged from the original work in several experimental details that might be of relevance.

Overall, the following major points should be considered prior to publication:

In the replication of MV4;11 xenograft experiments several experimental conditions deviate from the original study. Most likely due to a lack of access to an irradiation unit, the authors conditioned NOD/SCID mice with a single dose of busulfan (30 mg/kg i.p. 24 hrs prior to MV4;11 transplantation), which has been reported to lead to comparable engraftment of human CD34+ cord blood cells. Apart from the difference in cell type, the original study by Robert-Richard et al. mainly used other dosing regimens (35 mg/kg 36 hrs prior to injection; or 2 x 25 mg 48/24 hrs prior to injection), which they found to be less toxic. While under these conditions, busulfan and irradiation performed similar in human CD34+ HSC engraftment, one cannot conclude that the used regimen has no impact on the in-vivo growth of MV4;11 AML cells (which, for example, are likely to have a much higher proliferation rate than CD34+ HSCs). In fact, another study (Saland et al., 2015) has documented significant differences in the growth of xenografted MV4;11 cells following irradiation vs. busulfan (20 mg/kg). Although the use of NSG and a lower busulfan dose does not allow for direct conclusions for the present study, these results illustrate how sensitive xenograft models are to conditioning regimens. As most prominent difference, after busulfan conditioning the MV4;11 model turned out to be much delayed compared to the original study, which might be explained by insufficient immunosuppression, toxicity of busulfan or its metabolites on MV4;11 cells, or other experimental differences (which the authors partially summarize in their Discussion). Beyond reducing the overall aggressiveness of the model (and, potentially, promoting a high incidence of xenograft-derived solid tumors, which has been observed by the authors), these altered experimental conditions may also impact the in-vivo responsiveness of MV4;11 cells to I-BET151, which should be discussed. This seems particularly important, since busulfan conditioning is still quite uncommon and the used regimen (30 mg/kg 24 hrs prior to injection) has neither been tested in MV4;11 cells, nor has it been evaluated regarding its impact on treatment responsiveness.

We have revised the manuscript to include the points raised in here, specifically including discussion on the deviation in the conditioning used and the impact this could potentially have had on engraftment kinetics, disease progression, and responsiveness of the MV4;11 cells to I-BET151, especially considering the impact busulfan has on this model is not fully understood.

Beyond changing the conditioning protocol, the authors (based on general toxicity studies) also decided to reduce the I-BET151 dose from 30 mg/kg to 20 mg/kg. While this seems reasonable for the specific experiment, the cause for the observed toxicity remains unclear and may (again) be related to the conditioning regimen. Most importantly, the use of a different dosing regimen on its own provides a possible explanation for reduced drug effects in the MV4;11 xenograft model. In fact, in recent work the authors of the original study show that reducing I-BET151 to 10 mg/kg has no impact on survival in this model (Gilan et al., Figure 3A). Overall, it cannot be assumed that the drug batch was simply more active and, therefore, needed to be reduced to achieve comparable in-vivo activities. While differences in drug batches should always be considered, the reduced tolerability might be due to the used conditioning regimen, and the differences in survival might simply be a consequence of the reduced I-BET151 dose used in this study.

We agree that changing the dose of I-BET151 could impact the efficacy of the drug independent of the impact other experimental factors might have as well. We have revised the manuscript to highlight this and discuss it in context of other work by authors of the original study who recently reported I-BET151 administered at 10 mg/kg had no impact on survival in a MLL-AF9 leukemia mouse model (Gilan et al., 2016), while I-BET151 administered at 15 mg/kg did provide a survival benefit in a mouse model of NPM1c AML, albeit with a smaller effect size (Dawson et al., 2014).

Overall, changes in the conditioning regimen and the use of a lower dose of I-BET151 may substantially alter the in-vivo response of MV4;11 cells and provide (independent) possible explanations for the observed reduction in in-vivo drug activity. In light of these concerns, it seems very important to not give the wrong impression that the xenograft experiments of the original study could not be reproduced, which would easily be the take-home-message after a "superficial read" of the current manuscript. To avoid this impression, the authors should clearly highlight the most relevant deviations from the original study (i.e. the alternative conditioning regimen and the reduced I-BET151 dose) in the Abstract. Moreover, these experimental differences should also be discussed more extensively in the Discussion. In the current Discussion, the authors first raise some uncertainties due to missing experimental details, then summarize consistent conditions, and then transition into various possible differences that are hard to control for (which are certainly relevant; differences in MV4;11 culture conditions such as different FBS batches could be added). However, for a comprehensive discussion it seems very adequate to first summarize consistent conditions, then highlight obvious and important differences in experimental design (i.e. different conditioning and different drug dose), and only then discuss additional factors that are not easy to control for.

We have revised the Abstract to reflect the most relevant deviations between this replication attempt and the original study. We have also reordered the experimental differences as suggested and included these deviations and expanded discussion on how they might alter the outcome in the main text.

In summary, the design and execution of this replication study is of high quality, and all necessary changes to the original protocol have been implemented in the best possible way. However, the changes in the design of the xenograft study are substantial and might fully explain the observed differences in in-vivo effects of I-BET151. Due to these differences, the results of the performed xenograft experiments are not fully conclusive with respect to the reproduction of the original study, which should be highlighted more clearly.

We agree that there are many aspects a reader needs to consider when interpreting this study, just like any individual study and have revised the manuscript to address the points raised in these reviews.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Shan X, Fung JJ, Kosaka A, Danet-Desnoyers G, Iorns E, Denis A, Williams SR, Perfito N, Errington TM. 2017. Study 29: Replication of Dawson, et al., 2011 (Nature) Open Science Framework. [DOI]

    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES