Highlights
-
•
We identified two sub-groups of infant MLL-AF4-driven ALL, iALL-HOXA9, and iALL-IRX1.
-
•
The subgroups exhibit mutually exclusive expression of HOXA9/HOXA10 and IRX1.
-
•
The transcriptional profile of iALL-IRX1 patients revealed a more aggressive disease.
-
•
The two subgroups exhibit different expression of potential therapeutic targets.
Abstract
Infant t(4;11) acute lymphoblastic leukemia is the most common leukemia in infant patients and has a highly aggressive nature. The patients have a dismal prognosis, which has not improved in more than a decade, suggesting that a better understanding of this disease is required. In the study described here, we analyzed two previously published RNA-sequencing data sets and gained further insights into the global transcriptomes of two known subgroups of this disease, which are characterized by the presence or absence of a homeobox gene expression signature. Specifically, we identified a remarkable mutually exclusive expression of the HOXA9/HOXA10 and IRX1 genes and termed the two subgroups iALL-HOXA9 and iALL-IRX1. This expression pattern is critical as it suggests that there is a fundamental difference between the two subgroups. Investigation of the transcriptomes of the two subgroups reveals a more aggressive nature for the iALL-IRX1 group, which is further supported by the fact that patients within this group have a worse prognosis and are also diagnosed at a younger age. This could be reflective of a developmentally earlier cell of origin for iALL-IRX1. Our analysis further uncovered critical differences between the two groups that may have an impact on treatment strategies. In summary, after a detailed investigation into the transcriptional profiles of iALL-HOXA9 and iALL-IRX1 patients, we highlight the importance of acknowledging that these two subgroups are different and that this is of clinical importance.
In the age of molecular medicine, transcriptional profiling of patients’ samples has become a vital component in improving our understanding of diseases. We now have the potential to dissect transcriptional variations among patients, identify unique components of their disease and provide customized treatment. This approach is particularly valuable for diseases that are rare and difficult to model as there is a scarcity of available information. A prominent example of such a disease is infant MLL-AF4-driven acute lymphoblastic leukemia (ALL). This devastating disease is known to arise in utero, and the patients have a poor prognosis 1, 2, 3, 4. With only a handful of patients diagnosed each year, a unique underlying biology, and a lack of accurate disease models, our understanding of this disease remains limited, which is reflected in the lack of progress in treating these patients [2]. Currently, we know that infant MLL-AF4-driven ALL can be divided further into two subgroups. Two classification systems can be found in the literature: one is based on the expression levels of the gene HOXA9, and the other on separating the patients based on the expression of genes in the HOXA and IRX family of proteins 5, 6, 7. This has been reported to be of clinical relevance as patients with HOXA9high/IRXneg/low expression have a better prognosis than those with HOXA9low/IRXpos expression 5, 6, 7. In this study, we set out to better understand these two subgroups of patients. To do this, we analyzed two previously published RNA-sequencing data sets derived from infant/pediatric patients with MLL-AF4-driven ALL [8,9].
Methods
RNA sequencing analysis pipelines
Raw reads were aligned with Kallisto (version 0.43.1) to GRCh38. The Bioconductor package Tximport was used to import transcript-level abundance, estimated counts, and transcript lengths (version 3.5) [10]. We initially performed batch correction using limma and filtered the samples for genes with low counts across samples. After filtering, determination of the expression level of each gene and differential expression analysis were performed using the DESeq2 pipeline (version 3.5) [11,12]. Genes were considered differentially expressed if they had an adjusted p value ≤0.1. Library pcaExplorer was used for PCA analysis [13]. Gene set enrichment analysis (GSEA) was performed using the GSEA Jana Desktop tool (version 4.1) [14,15]. R version 3.4.3 was used. It should be noted that before processing of the Andersson et al. [9] data set, Bam files were converted to Fastq with Samtools. GraphPad Prism version 7.0 was used.
Results and discussion
We analyzed the RNA-sequencing data set of Andersson et al. [9], which contains data from 17 infant (<1 year) and 5 pediatric (11–18 years) patients with MLL-AF4-driven ALL. Principal component analysis (PCA) revealed that infant blasts formed two clusters (pink and green in Figure 1A). Intriguingly, blasts from pediatric patients (blue) clustered closely with one of the infant clusters. Investigation into the genes driving the clustering revealed HOXA9, HOXA10 and IRX1, IRX2 to be among the top PC1 loadings—in opposite directions. This confirmed previous publications reporting that clustering of the patients was driven by genes of the HOXA and IRX families [5,6]. To further investigate the expression pattern of these genes we performed Spearman's correlation test, which revealed an inverse correlation in the expression of HOXA9, HOXA10, and IRX1, but not IRX2 (Figure 1B,C; Supplementary Table E1, online only, available at www.exphem.org). Although IRX2 was one of the top differentially expressed genes, it was not uniformly upregulated in the HOXA9low/IRXpos patients (Figure 1C). Furthermore, Fisher's exact test confirmed that the observed mutually exclusive expression was statistically significant (Figure 1D,E). These data suggest that the previously described infant ALL (iALL) HOXA9low/IRXpos subgroup would be more accurately described as iALL-IRX1. It was also noteworthy that all pediatric patients expressed HOXA9 and clustered closely with iALL-HOXA9. This may hint at the age of the patients at diagnosis as being another contributing factor to the division of infant patients into two subgroups, especially because a previous study suggested that expression patterns in infants change noticeably around the age of 90 days [5]. However, although patients in the iALL-IRX1 group appeared to be diagnosed at an earlier age, this did not reach statistical significance (Figure 1F). Investigation of the expression of all HOXA cluster genes revealed that they were uniquely upregulated in the iALL-HOXA9 subgroup, in line with previous reports of their coordinated expression (Figure 1G) [5,6,16].
GSEA of the genes differentially expressed between iALL-HOXA9 and iALL-IRX1 (Supplementary Table E2, online only, available at www.exphem.org) revealed an enrichment in MYC targets, as well as oxidative phosphorylation in the iALL-IRX1 patients (Figure 1H). The same subgroup also exhibited enrichment for proliferation pathways as exemplified by G2M checkpoints, E2F targets, and MTORC1 signaling upregulation (Figure 1H). This signature is indicative of the more aggressive nature of the blasts derived from these patients, which could explain their worse prognosis compared with iALL-HOXA9 patients. This is further supported by an additional enrichment of IL2 STAT5 signaling, a key component of core cancer pathways [17]. The top enriched pathway in iALL-HOXA9 patients was Allograft rejection indicative of an immune system-related response (Figure 1I).
To further validate our data, we performed the same analysis with the data set of Agraz-Doblas et al. [8], which contains the transcriptome sequences of the blasts of 27 infant patients, and obtained similar results (Supplementary Figure E1A–F, Supplementary Tables E1 and E3, online only, available at www.exphem.org) [8]. To identify genes common to both data sets, we compared the genes differentially expressed between iALL-HOXA9 and iALL-IRX1 in both experiments (i.e., genes common between Supplementary Tables E2 and E3), which identified a total of 342 common genes (Figure 2A; Supplementary Table E4, online only, available at www.exphem.org). To obtain a general idea about these genes, we performed GSEA. There was an enrichment in Estrogen response late, which had been previously correlated with aggressive cancers, in the iALL-IRX1 group [18,19]. HOXA9-expressing blasts, on the other hand, exhibited an enrichment for Interferon gamma response, cementing our previous observation of an immune system response in these patients (Figure 2B,C). Both these signatures were present in the two individual RNA-sequencing experiments (Supplementary Figure 2A,B, online only, available at www.exphem.org).
It is intriguing that Homeobox genes HOXA9, HOXA10, and IRX1 are inversely correlated in the two subgroups of infant patients with MLL-AF4-driven ALL. This mutually exclusive expression could be the result of the two subgroups having a different cell of origin. To investigate the expression pattern of these genes in human hematopoietic cells, we looked into previously published single-cell RNA-sequencing experiments with adult bone marrow and fetal liver-derived hematopoietic cells [20,21]. While HOXA9 and HOXA10 were expressed in hematopoietic stem and progenitor cells (both adult and fetal), IRX1 exhibited very little expression in the hematopoietic system (Supplementary Figure E3A,B, online only, available at www.exphem.org) [20,21]. Interrogation of murine gastrulation and early organogenesis data sets revealed that Irx1 was expressed predominantly in mesoderm, whereas Hoxa9 and Hoxa10 were expressed in hematoendothelial progenitors (Supplementary Figure E3C) [22].
The IRX1 expression pattern could be indicative of iALL-IRX1 arising in a developmentally earlier cell type than iALL-HOXA9, which is supported by upregulation of genes such as PDGFRB and PDGFD in the iALL-IRX1 data set (Figure E3A; Supplementary Figure E4A, online only, available at www.exphem.org). Contrary to this, hematopoiesis-associated genes such as AFF1 (AF4), CD96, SPN, and PROM1 are upregulated in the iALL-HOXA9 set (Figure 3B,C; Supplementary Figure E4B,C, Supplementary Table E4). Furthermore, as discussed above, patients with iALL-IRX1 appear to be diagnosed at a younger age as compared with iALL-HOXA9 patients (Figure 3E). As mesoderm has multiple progeny, including stromal cell components, it would not be surprising if MLL-AF4 was expressed in the bone marrow microenvironment of patients with iALL-IRX1. In fact, Menendez et al. [23] reported that a subset of bone marrow mesenchymal stromal cells of infant patients with MLL-AF4-driven ALL express the fusion gene. Although they do not specify whether the patients expressed HOXA9 or IRX1, they do suggest that the disease could arise from a pre-hematopoietic precursor.
The differences between the two groups may also have an impact on treatment options for these patients. For example, PROM1 (CD133), which has recently been suggested to be a target for MLL-AF4+ patients, was specifically upregulated in the iALL-HOXA9 subgroup, with lower expression in the iALL-IRX1 group of patients (Figure 3C; Supplementary Figure E4C) [24,25]. Treating iALL-IRX1 patients with CD133-directed CAR-T cells might therefore not be as effective as for iALL-HOXA9 patients. Another recently described therapeutic target for MLL-rearranged patients is GSPG4 (NG2), which is expressed at similar levels in both iALL-HOXA9 and iALL-IRX1 patients—albeit at lower levels than PROM1, suggesting that the outcome of this treatment could be similar for both subgroups (Figure 3D; Supplementary Figure 4D) [26].
One key point about the two subgroups of patients with MLL-AF4-driven ALL is that the majority of information we have gathered about this disease to date is derived from mouse models and cell lines, such as SEM, that express HOXA9. Therefore, it is important to replicate experiments in models and cell lines that mirror the iALL-IRX1 disease. We believe this would be critical when selecting therapeutic regimes for these patients, as exemplified by PROM1. We believe that future studies should therefore consider the HOXA9 and IRX1 expression status of infant ALL patients.
Conflict of interest disclosure
The authors declare no competing interests.
Acknowledgments
We thank the St. Jude Children's Research Hospital–Washington University Pediatric Cancer Genome Project and Agraz-Doblas et al. [8] for providing RNA-Seq data from MLL-AF4+ infant and pediatric patients. This work was supported by a Cancer Research UK Programme Foundation Award (KO) and an MRC PhD studentship (VS).
Footnotes
VS conceived the study, analyzed the data and wrote the article. KO supervised the study and wrote the article.
Supplementary material associated with this article can be found in the online version at https://doi.org/10.1016/j.exphem.2020.10.002.
Appendix. Supplementary materials
References
- 1.Meyer C, Burmeister T, Groger D. The MLL recombinome of acute leukemias in 2017. Leukemia. 2018;32:273–284. doi: 10.1038/leu.2017.213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Pieters R, De Lorenzo P, Ancliffe P. Outcome of infants younger than 1 year with acute lymphoblastic leukemia treated with the Interfant-06 Protocol: results From an international phase III randomized study. J Clin Oncol. 2019;37:2246–2256. doi: 10.1200/JCO.19.00261. [DOI] [PubMed] [Google Scholar]
- 3.Gale KB, Ford AM, Repp R. Backtracking leukemia to birth: identification of clonotypic gene fusion sequences in neonatal blood spots. Proc Natl Acad Sci USA. 1997;94:13950–13954. doi: 10.1073/pnas.94.25.13950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ford AM, Ridge SA, Cabrera ME. In utero rearrangements in the trithorax-related oncogene in infant leukaemias. Nature. 1993;363:358–360. doi: 10.1038/363358a0. [DOI] [PubMed] [Google Scholar]
- 5.Kang H, Wilson CS, Harvey RC. Gene expression profiles predictive of outcome and age in infant acute lymphoblastic leukemia: a Children's Oncology Group study. Blood. 2012;119:1872–1881. doi: 10.1182/blood-2011-10-382861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Stam RW, Schneider P, Hagelstein JA. Gene expression profiling-based dissection of MLL translocated and MLL germline acute lymphoblastic leukemia in infants. Blood. 2010;115:2835–2844. doi: 10.1182/blood-2009-07-233049. [DOI] [PubMed] [Google Scholar]
- 7.Trentin L, Giordan M, Dingermann T, Basso G, Te Kronnie G, Marschalek R. Two independent gene signatures in pediatric t(4;11) acute lymphoblastic leukemia patients. Eur J Haematol. 2009;83:406–419. doi: 10.1111/j.1600-0609.2009.01305.x. [DOI] [PubMed] [Google Scholar]
- 8.Agraz-Doblas A, Bueno C, Bashford-Rogers R. Unraveling the cellular origin and clinical prognostic markers of infant B-cell acute lymphoblastic leukemia using genome-wide analysis. Haematologica. 2019;104:1176–1188. doi: 10.3324/haematol.2018.206375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Andersson AK, Ma J, Wang J. The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias. Nat Genet. 2015;47:330–337. doi: 10.1038/ng.3230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research. 2015;4:1521. doi: 10.12688/f1000research.7563.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ritchie ME, Phipson B, Wu D. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Marini F, Binder H. pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components. BMC Bioinformatics. 2019;20:331. doi: 10.1186/s12859-019-2879-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mootha VK, Lindgren CM, Eriksson KF. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34:267–273. doi: 10.1038/ng1180. [DOI] [PubMed] [Google Scholar]
- 15.Subramanian A, Tamayo P, Mootha VK. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Krivtsov AV, Feng Z, Lemieux ME. H3K79 methylation profiles define murine and human MLL-AF4 leukemias. Cancer Cell. 2008;14:355–368. doi: 10.1016/j.ccr.2008.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wingelhofer B, Maurer B, Heyes EC. Pharmacologic inhibition of STAT5 in acute myeloid leukemia. Leukemia. 2018;32:1135–1146. doi: 10.1038/s41375-017-0005-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Deeb KK, Michalowska AM, Yoon CY. Identification of an integrated SV40 T/t-antigen cancer signature in aggressive human breast, prostate, and lung carcinomas with poor prognosis. Cancer Res. 2007;67:8065–8080. doi: 10.1158/0008-5472.CAN-07-1515. [DOI] [PubMed] [Google Scholar]
- 19.Jagannathan V, Robinson-Rechavi M. Meta-analysis of estrogen response in MCF-7 distinguishes early target genes involved in signaling and cell proliferation from later target genes involved in cell cycle and DNA repair. BMC Syst Biol. 2011;5:138. doi: 10.1186/1752-0509-5-138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hay SB, Ferchen K, Chetal K, Grimes HL, Salomonis N. The Human Cell Atlas bone marrow single-cell interactive web portal. Exp Hematol. 2018;68:51–61. doi: 10.1016/j.exphem.2018.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Popescu DM, Botting RA, Stephenson E. Decoding human fetal liver haematopoiesis. Nature. 2019;574:365–371. doi: 10.1038/s41586-019-1652-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Pijuan-Sala B, Griffiths JA, Guibentif C. A single-cell molecular map of mouse gastrulation and early organogenesis. Nature. 2019;566:490–495. doi: 10.1038/s41586-019-0933-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Menendez P, Catalina P, Rodriguez R. Bone marrow mesenchymal stem cells from infants with MLL-AF4+ acute leukemia harbor and express the MLL-AF4 fusion gene. J Exp Med. 2009;206:3131–3141. doi: 10.1084/jem.20091050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bueno C, Velasco-Hernandez T, Gutiérrez-Agüera F. CD133-directed CAR T-cells for MLL leukemia: on-target, off-tumor myeloablative toxicity. Leukemia. 2019;33:2090–2125. doi: 10.1038/s41375-019-0418-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Godfrey L, Crump NT, O'Byrne S. H3K79me2/3 controls enhancer–promoter interactions and activation of the pan-cancer stem cell marker PROM1/CD133 in MLL-AF4 leukemia cells. Leukemia. 2020 doi: 10.1038/s41375-020-0808-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lopez-Millan B, Sanchéz-Martínez D, Roca-Ho H. NG2 antigen is a therapeutic target for MLL-rearranged B-cell acute lymphoblastic leukemia. Leukemia. 2019;33:1557–1569. doi: 10.1038/s41375-018-0353-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.