These 2 independently performed studies by Passet et al and Kimura et al describe a novel genetic subtype of B-progenitor acute lymphoblastic leukemia (B-ALL) characterized by 2 separate focal genomic deletions leading to upregulation of CDX2 on chromosome 13 and to the formation of a chimeric gene fusion (UBTF::ATXN7L3) on chromosome 17. This novel subtype confers high clinical risk and is found in 1-2% of patients with B-ALL, with higher representation in adolescents and young adults.
Key Points
CDX2 deregulation and UBTF fusion define a B-ALL subtype with distinct immunophenotype, expression profile, and high-risk feature.
Somatic 13q12.2 deletions spanning FLT3 promoter lead to upregulation of CDX2 through a mechanism of enhancer retargeting.
Visual Abstract
Abstract
Transcriptome sequencing has identified multiple subtypes of B-progenitor acute lymphoblastic leukemia (B-ALL) of prognostic significance, but a minority of cases lack a known genetic driver. Here, we used integrated whole-genome (WGS) and -transcriptome sequencing (RNA-seq), enhancer mapping, and chromatin topology analysis to identify previously unrecognized genomic drivers in B-ALL. Newly diagnosed (n = 3221) and relapsed (n = 177) B-ALL cases with tumor RNA-seq were studied. WGS was performed to detect mutations, structural variants, and copy number alterations. Integrated analysis of histone 3 lysine 27 acetylation and chromatin looping was performed using HiChIP. We identified a subset of 17 newly diagnosed and 5 relapsed B-ALL cases with a distinct gene expression profile and 2 universal and unique genomic alterations resulting from aberrant recombination-activating gene activation: a focal deletion downstream of PAN3 at 13q12.2 resulting in CDX2 deregulation by the PAN3 enhancer and a focal deletion of exons 18-21 of UBTF at 17q21.31 resulting in a chimeric fusion, UBTF::ATXN7L3. A subset of cases also had rearrangement and increased expression of the PAX5 gene, which is otherwise uncommon in B-ALL. Patients were more commonly female and young adult with median age 35 (range,12-70 years). The immunophenotype was characterized by CD10 negativity and immunoglobulin M positivity. Among 16 patients with known clinical response, 9 (56.3%) had high-risk features including relapse (n = 4) or minimal residual disease >1% at the end of remission induction (n = 5). CDX2-deregulated, UBTF::ATXN7L3 rearranged (CDX2/UBTF) B-ALL is a high-risk subtype of leukemia in young adults for which novel therapeutic approaches are required.
Introduction
B-cell acute lymphoblastic leukemia (B-ALL) is the most common childhood tumor and an important cause of morbidity in adults. Despite advances in understanding the biology of B-ALL and new treatment approaches, prognosis is substantially worse in adolescents and adults than in younger children, with cure rates of <40% in patients 40 years old and older.1,2 Conventional genetic analyses including karyotyping, fluorescence in situ hybridization, and targeted molecular analyses detect subtypes defined by aneuploidy, chromosomal rearrangements, and/or known gene fusions,3 but a residual minority of cases lacking a genomic driver remain, commonly termed “B-other.” Sequencing, particularly whole-transcriptome sequencing (RNA-seq), has identified multiple previously unrecognized subtypes,4 including those with rearrangements of DUX4,5-8 MEF2D,9 NUTM1,10 and ZNF384,11-14 cases with gene expression profiles that phenocopy established subtypes (eg, Ph-like,15 ETV6::RUNX1–like,5,16 KMT2A-like16), and cases driven by single point mutations PAX5 P80R and IKZF1 N159Y.16,17 These subtypes are associated with varying outcomes in children and adults,18,19 and comprehensive genomic analysis has become an increasingly widely adopted clinical diagnostic approach.20,21 However, prior studies have shown that a subset of cases lack a driver despite careful analysis of RNA-seq data, including cases with distinct gene expression profile, suggesting an unidentified, subtype-defining driver lesion(s). Integrated analysis of RNA-seq with whole-genome sequencing (WGS) has been informative in ALL of T- or ambiguous lineage in identifying genomic alterations and rearrangements leading to enhancer hijacking, for example cases with deregulation of BCL11B and GATA2.22,23 Using a similar integrated approach in B-ALL, we report the genomic basis of a subtype of B-ALL presenting with a distinct immunophenotype and gene expression profile, driven by genetic alterations resulting in deregulation of CDX2 (caudal type homeobox 2) and deletion of the 3′ region of UBTF (upstream binding transcription factor) resulting in an in-frame UBTF::ATXN7L3 fusion.
Methods
Patients and sample collection
We studied patients with B-ALL at diagnosis or relapse and available diagnostic material from the following studies and institutions: St. Jude Children’s Research Hospital (clinicaltrials.gov identifier #NCT00549848),16,19 the Eastern Cooperative Oncology Group (ECOG) and the American College of Radiology Imaging Network (ACRIN) Cancer Research Group (ECOG-ACRIN),18 the Children's Oncology Group,16 the INotuzumab Ozogamicin trial to inVestigAte Tolerability and Efficacy (INO-VATE),24 City of Hope,25 Peter MacCallum Cancer Centre/Royal Melbourne Hospital, South Australian Health and Medical Research Institute, MD Anderson Cancer Center (#NCT00623870 [R05045337; MDM2 inhibitor] and #NCT01134575 [inotuzumab ozogamicin]), and the Munich Leukemia Laboratory (MLL)26 (supplemental Table 1). Patients and/or their guardians provided written informed consent in accordance with the Declaration of Helsinki. The study was approved by the Institutional Review Board of St. Jude Children’s Research Hospital.
Transcriptome sequencing
We used publicly available RNA-seq data from our previous study.16 Additional RNA-seq was performed as previously described.16 Sequencing reads were mapped to the GRCh37 human genome reference by STAR (version 2.4.2a),27 using the suggested 2-pass mapping pipeline. FusionCatcher (version 1.10; https://github.com/ndaniel/fusioncatcher) and STAR-Fusion (version 1.5.0)28 were used to detect fusions. To quantitate gene expression, we used HTSeq (version 0.13.5)29 to calculate read counts for each transcript and the ComBat function in the sva30 R package to correct for batch effects as we previously described.16 The DESeq2 Bioconductor R package31 was used for gene expression normalization and differential expression analysis. For a 2-dimensional t-distributed stochastic neighbor embedding (tSNE) plot and hierarchical clustering, the Rtsne and pheatmap R packages were used with the top 1000 most variable genes based on median absolute deviation and perplexity 30. Gene set enrichment and pathway analyses were performed using Metascape (version 3.5)32 and gene set enrichment analysis (version 4.1.0)33,34 as previously described.16
WGS
WGS data were processed as described in our previous study.22 Sequencing reads were mapped to the GRCh37 human genome reference by Burrows-Wheeler Alignment (version 0.7.12).35 For the detection of mutations, we used the combination of GenomonFisher (version 0.2.1), GenomonMutationFilter (version 0.2.9), and EBFilter (version 0.2.1; https://genomon-project.github.io/GenomonPagesR/). For the detection of structural variants (SVs), 3 SV callers were implemented including GenomonSV (version 0.8.0), Delly (version 0.7.7),36 and Manta (version 1.6.0).37 All variant calls were further manually reviewed for read depth and to remove artifacts using the Integrated Genome Viewer38 as described in our previous studies.16,22 CNVkit (version 0.9.1)39 was used for copy number alteration detection from WGS data. The prediction of cryptic recombination signal sequences (RSS) was screened using RSS database (https://www.itb.cnr.it/rss/analyze.html).40
HiChIP and ChIP-seq
HiChIP41 data were processed and analyzed using MAPS software (version 2.0)42 as described in our previous study.22 Super-enhancers ranked by H3K27ac enrichment were identified using the Ranking of Super Enhancers43 software with default parameters. Chromatin immunoprecipitation (ChIP)-seq data of GM12878 cell line for UBTF were downloaded from the Gene Expression Omnibus using the following accession (GSE91779; Moore et al44) and analyzed as described previously.22
Statistical analysis
Categorical variables were examined with the use of 2-sided Fisher’s exact test and the Holm-Bonferroni method was used for multiple testing. Statistical significance of gene expression level and variant allele frequency (VAF) were assessed using the Wilcoxon rank-sum test. Analyses were performed using Prism version 7.0 (GraphPad Software) and R version 3.6.1 (http://www.r-project.org).
Results
Identification of the high-risk B-ALL subtype with unique characteristics
We analyzed RNA-seq data from 1988 B-ALL cases from our previous study16 to explore new subtypes without a known leukemic driver (B-other). Among 125 cases (6.4%) of B-other, we identified 6 cases, which by transcriptome analysis clustered together with a distinct expression profile compared with other B-ALL cases.16 In contrast, the remaining B-other cases did not have a specific expression pattern and clustered with other known subtypes. We next collected and analyzed additional RNA-seq data from children and adults from independent studies and institutions. In total, we analyzed 3398 cases and identified an additional 16 cases (total 22 cases) that clustered with the distinct expression group, including 6 out of 731 (0.82%) from ECOG-ACRIN, 3 out of 92 (3.3%) from INO-VATE study (relapse cohort; supplemental Figure 1),24 2 out of 862 (0.23%) from the Children's Oncology Group, 4 out of 280 (1.4%) from MLL, and 7 from other cohorts (Table 1; supplemental Table 1). All patients were older than 10 years (median age 35, range 12-70), mostly adolescents and young adults (AYA) (54.5%) and female (77.3%) (supplemental Figure 2; Table 1). Incidence was highest in AYA (1.4%), especially females (2.2%), not common in adults (0.33%), and rare in children (0.17%, P = .011; supplemental Figure 2; supplemental Table 2). There was enrichment (2.8%; 5/177, P = .015) of the subtype in relapse cases compared with diagnosis cases (0.53%; 17/3221) and a high incidence of relapse or minimal residual disease (MRD) >1% at the end of induction therapy (56.3%; 9/16; Table 1).
Table 1.
Immunophenotype | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sample ID | Sample status | Gender | Age group | Outcome | Relapse | MRD >1% at EOI | Karyotype | CD10 | IgM | CD19 | CD20 | CD22 | CD34 |
ADU_0788_ 201543 | Diagnosis | Female | Adult | Alive (CR1) | No | NA | G-banding: failure. | Dim | NA | Posi | Posi | Posi | Posi |
ADI_0252 | Diagnosis | Female | Adult | Dead | Yes | NA | 46,XX,t(1;7)(q25;p12),add (6)(q13),der(11)add(11)(p?13) add(11)(q?22), del(13) (q12q22),?add(18)(q23)[23] | Neg | NA | Posi | Neg | Posi | Posi |
MLL_13382 | Diagnosis | Female | Adult | Alive (CR1) | No | NA | 46,XX[20] | Neg | NA | Posi | Neg | Neg | Posi |
MLL_137117 | Diagnosis | Female | Adult | Alive (CR1) | No | NA | 46,XX[20] | Neg | NA | Posi | Neg | Dim | Posi |
SJALL060150 | Diagnosis | Female | Adult | Alive (Refractory post cycle 2 of Induction) | Yes | NA | NA | Neg | Posi | Posi | Neg | Dim | Posi |
AYAII_0107 | Diagnosis | Female | AYA | Alive (CR1) | No | NA | NA | Posi | NA | Posi | Neg | Neg | Posi |
AYAII_0369 | Diagnosis | Female | AYA | Alive (CR1) | No | Positive | Hyperdiploid. 82-86 (near tetraploid) gain of 1q in 7/20 | Neg | NA | Posi | NA | NA | Posi |
MLL_13360 | Diagnosis | Female | AYA | Dead (CR1; hepatorenal syndrome with acute renal failure, sepsis) | No | NA | 46,XX[15] | Neg | NA | Posi | Neg | Dim | Posi |
MLL_202248- M228 | Diagnosis | Female | AYA | Alive (CR1) | No | NA | 46,XX[22] | Neg | NA | Posi | Neg | Dim | Posi |
SJALL054476 | Diagnosis | Male | AYA | Alive | Yes | NA | 46,XY,t(12;13)(p11.2;q14)[30]/ 46,idem,del(6)(q21q25)[7]/ 46,XY[14];abnormal;del(6q) | Neg | Posi | Posi | Neg | Posi | Posi |
SJALL054518 | Diagnosis | Male | AYA | NA | NA | NA | Inv(9)(p13q13) | Neg | Posi | Posi | Neg | Dim | Posi |
SJALL066367 | Diagnosis | Male | AYA | Alive (CR1) | Yes | NA | NA | Dim | Neg | Weak | Neg | Posi | Posi |
SJBALL016287 | Diagnosis | Female | AYA | Alive (CR1) | No | Positive | 46,XX[20] | Neg | Posi | NA | NA | NA | NA |
SJBALL020165 | Diagnosis | Female | AYA | NA | No | Positive | 46,XX,del(9)(q13q22) [9]/46,XX[11] | Neg | Posi | Posi | Neg | Posi | Posi |
SJBALL020169 | Diagnosis | Female | AYA | NA | No | Positive | 46,XX,del(9)(q13q22) [6]/46,XX[14] | Neg | Posi | Posi | Neg | NA | Posi |
SJBALL020889 | Diagnosis | Female | Childhood HR | Alive (CR1) | No | Positive | 46,XX,del(9)(q12q32) [10]/92,idemx2,+ add(1)(p11),-20[8]/46,XX[20] | Neg | Posi | NA | NA | NA | NA |
SJBALL022051 | Diagnosis | Male | Childhood HR | Alive (CR1) | No | NA | 46,XY,der(6)t(1;6)(q21.1;p25.3), add(14)(q32.3)[13]/46,XY[7] | Dim | Posi | Posi | Dim | Posi | NA |
SJALL058818_R1 | Relapse | Male | Adult | Dead | Yes | Positive | Complex, near tetraploidy | Neg | Posi | Posi | NA | Posi | Posi |
SJALL073397_R1 | Relapse | Female | Adult | Refractory after INO | Yes | Positive | NA | Neg | NA | Posi | NA | Posi | Posi |
SJBALL042249_R1 | Relapse | Female | Adult | Dead (refractory disease) | Yes | Positive | 46,XX,dup(1)(q21q42),der (?2)t(2;13)(p11.2;q14), -3,add(6)(p23),?inv (7)(p13p22),der(?8)t (8;13)(q22;q22), del(9) (q13q22),add(12) (q24.3),add(13)(q14),del (13)(q12),del(?15)(q15q24), add(20)(q13.1),+mar | Dim | NA | Posi | Neg | Neg | Posi |
SJALL073380_R1 | Relapse | Female | AYA | Alive (CR2) | Yes | NA | NA | Neg | NA | Posi | NA | Posi | Posi |
SJALL073389_R1 | Relapse | Female | AYA | Alive (CR2) | Yes | NA | NA | Neg | NA | Posi | NA | Dim | Posi |
CR, complete remission; EOI, end of induction; HR, high-risk; NA, not available; Neg, negative; Posi, positive.
RNA-seq–based gene expression profile data from our previous study (1988 cases)16 and additional 16 cases were analyzed using hierarchical clustering and tSNE analysis, confirming the distinct expression profile of this group (Figure 1A; supplemental Figure 3). Gene expression analysis identified 1216 differentially expressed genes (fold change >2, adjusted P < .001) including CDX2, NTRK3, PREX2, and LRRC4C and low expression of FLT3 and MME (CD10) (Figure 1B; supplemental Table 3). Immunophenotyping showed negativity of CD10 for most cases (78.9%) and positivity for cytoplasmic immunoglobulin M (IgM) (Figure 1C; supplemental Table 1). High expression of NTRK3 was observed in almost half of cases (45.5%, 10/22; Figure 1D), which is rare in B-ALL. We also observed high expression of histone genes, with enrichment of histone and ribosomal RNA–related pathways (Figure 1E; supplemental Table 4).
Deletion of UBTF resulting in UBTF::ATXN7L3 fusion
We next examined the genomic features of this subtype with integrated analysis of RNA-seq (n = 22) and WGS (n = 11; 5 tumor-germline paired cases and 6 tumor-only cases). All 22 cases exhibited expression of a chimeric fusion involving the C-terminus of UBTF and the downstream gene ataxin 7 like 3 (ATXN7L3) (Figure 2A; supplemental Tables 1 and 5). The fusions involved juxtaposition of UBTF exon 15 (n = 1), exon 17 (n = 19), exon 18 (n = 1), or exon 20 (n = 1) to the 5′ untranslated region of ATXN7L3 exon 1 on chromosome 17q21.31 (supplemental Figure 4A; supplemental Table 1). All fusions encoded an in-frame chimeric fusion protein utilizing 60 nucleotides upstream of the canonical translation start site of ATXN7L3 (Figure 2A), which was confirmed by immunoblotting in transfected human embryonic kidney 293T cells (Figure 2B). Both UBTF and ATXN7L3 are on chromosome 17q21.31, ~7000 bases apart. Analysis of WGS data of 11 available UBTF::ATXN7L3-rearranged cases identified the presence of a genomic deletion of ~10 kb between the 2 genes (Figure 2C; supplemental Table 1) with genomic breakpoints corresponding to the UBTF fusion junction (introns 15, 17, 18, or 20) and upstream of the ATXN7L3 coding sequence (Figure 2C; supplemental Figure 4B; supplemental Table 1). Notably, the breakpoints in 9 out of 11 cases were observed at approximately the same location (chr17:42,276,135 ± 6 and chr17:42,278,627 ± 3; GRCh37), and cryptic RSS were identified at these sites, with addition of nontemplated nucleotides, suggesting that the deletion may result from aberrant recombination-activating gene (RAG)-mediated recombination (Figure 2D; supplemental Table 6) as previously described for deletions observed for other genes in B-ALL.45-48 Collectively, the structural variants involving UBTF resulted in the generation of an in-frame chimeric UBTF::ATXN7L3 fusion protein, which is a universal genomic feature of this subtype.
Deletion proximal of FLT3 results in enhancer retargeting and deregulation of CDX2
In addition to UBTF alterations, recurrent 13q12.2 (FLT3/PAN3) deletions were detected by WGS analysis from all analyzed cases in this subtype, in which the involvement of aberrant RAG-mediated recombination was also estimated (Figure 3A; supplemental Tables 1 and 6). Deletions in this region have been recently described in B-ALL with breakpoints occurring upstream of the FLT3 coding sequence, but sparing the FLT3 promoter, and associated with increased expression of FLT3 (type I deletion; Figure 3A).49 In contrast, in UBTF-rearranged cases, all 13q12.2 deletions included the promoter region and exon 1 of the FLT3 gene and the promoter of the adjacent gene PAN3 (type II deletions), which is oriented in head-to-head configuration with FLT3 (Figure 3A). In samples with type I deletion, upregulation of FLT3 is caused by an enhancer hijacking mechanism, in which an enhancer in the PAN3 gene interacts with the FLT3 promoter instead of the PAN3 promoter because the latter is deleted (Figure 3A).49 In contrast, in UBTF-rearranged cases with type II deletion, we observed low FLT3 expression but ectopic upregulation of CDX2 (Figures 1B and 3B), a transcription factor normally expressed in intestinal tissue, which is located ~30 kb downstream of FLT3. Furthermore, allele-specific expression of CDX2 and FLT3 was confirmed (Figure 3C; supplemental Figure 5), suggesting that the type II deletion at 13q12.2 deregulates CDX2 expression through hijacking and retargeting of the PAN3 coding enhancer. Based on the unique and universal UBTF rearrangements and CDX2 overexpression in this subtype, we herein termed this new subtype “CDX2/UBTF.” Neither of the 2 types of SVs, UBTF and 13q12.2 type II deletion, were observed in 2754 samples of leukemia with WGS/Single Nucleotide Polymorphism array DNA copy number data or leukemia cell lines (https://www.stjude.cloud/); however, high CDX2 expression and amplification of the region including the enhancer in PAN3 were observed in the hypodiploid cell line Nalm-16 (Figure 3A; supplemental Figure 6).
The enhancer region in PAN3 is ubiquitously active in hematopoietic and immune cell types but not in other cell types or tissues (supplemental Figure 7A-D), indicating that the enhancer itself is not aberrantly activated in B-ALL and suggesting that the type II deletion in CDX2/UBTF B-ALL may result in CDX2 deregulation through enhancer retargeting as type I deletions have been described to do for FLT3. To investigate this, we performed HiChIP to profile acetylation of histone H3 at lysine 27 (H3K27ac) as a mark of enhancers, simultaneously with 3-dimensional chromatin architecture in 2 CDX2/UBTF samples with type II deletions, 1 B-other ALL sample with type I deletion leading to FLT3 overexpression (SJBALL113), the hypodiploid cell line Nalm-16 with amplification of the enhancer in PAN3, and the ETV6::RUNX1 cell line Reh with no alteration in this region (Figure 3D). The CDX2/UBTF B-ALL cases (SJALL060150 and SJBALL020169) harboring the type II deletion, which removes both the PAN3 and FLT3 promoters, showed evidence of chromatin looping between the enhancer in PAN3 and the CDX2 gene located ~280 kb away (Figure 3D; supplemental Figure 8A), supporting an enhancer retargeting mechanism for CDX2 expression. In contrast, in the B-other case SJBALL113 that harbors a type I deletion that removes the promoter of PAN3 but not FLT3, no interactions were observed between the enhancer in PAN3 and CDX2 (Figure 3D; supplemental Figure 8B); rather, multiple significant chromatin loops were identified between the PAN3 coding enhancers and the FLT3 gene, consistent with previous reports that type I deletions result in FLT3 overexpression.49 H3K27ac HiChIP data from Nalm-16 cells, which harbor amplification of the region containing the enhancer in PAN3, showed long-range interactions with the CDX2 gene that did not reach statistical significance (Figure 3D; supplemental Figure 8C; supplemental Table 7). Finally, Reh cells that lack FLT3/PAN3 deletions showed interactions between the PAN3 and FLT3 promoters, albeit to a lesser extent compared with the case with a type I deletion (SJBALL113; Figure 3D; supplemental Figure 8D). Collectively, these data functionally connect different classes of FLT3-PAN3 SVs with changes in local gene expression and provide a mechanistic basis for high CDX2 expression in the CDX2/UBTF B-ALL subtype.
Other genomic abnormalities in CDX2/UBTF B-ALL
We identified 2 unique and universal SVs involving UBTF and the FLT3/PAN3 region in all analyzed CDX2/UBTF B-ALL cases. We estimated the order of acquiring these SVs during clonal leukemic evolution by VAF of each SV. Most cases showed similar VAFs for both SVs, whereas 2 cases (SJBALL020169 and SJBALL042249) had higher VAFs for FLT3-PAN3 deletion (VAF above 0.35) than UBTF deletion (VAF below 0.3), suggesting aberrant CDX2 expression is the founder alteration (supplemental Figure 9; supplemental Table 8).
Additional co-alterations detected by WGS (n = 11) included the heterozygous deletion of ETV6 (36.3%, 4/11) and rearrangements of PAX5/ZCCHC7 (36.3%; 3 cases with genomic deletion and 1 due to inversion between the 2 genes) (Figure 4A; supplemental Tables 9 and 10). Aberrant RAG activity was suggested in PAX5/ZCCHC7 rearrangements (supplemental Table 6). The rearrangements of PAX5 included exon 1a of PAX5 gene but spared the alternate first exon of PAX5, exon 1b (Figure 4A). Although the use of alternative exon 1b was observed in cases without PAX5 rearrangements, these PAX5 rearranged cases showed higher exon 1b utilization compared with CDX2/UBTF cases without PAX5/ZCCHC7 rearrangements, leading to extremely high PAX5 expression, especially for the case with inversion (Figure 4B-C).
Gain of chromosome 1q (>20 Mb; n = 6, 54.5%) was common in CDX2/UBTF B-ALL cases (supplemental Tables 1 and 10). Notably, frequent gain of the histone gene cluster region (chr1: 149,260,000-149,900,000) was observed in >80% (9/11) of CDX2/UBTF cases with gains of 1q of any size, which were otherwise rare in B-ALL (1.83%, 22/1202) except for MEF2D-rearranged cases (25.9%, 7/27) (supplemental Figure 10A-C; supplemental Table 11). Histone genes were overexpressed in CDX2/UBTF cases, and notably, ChIP-seq analysis has shown UBTF to bind at histone gene promoters in the Epstein-Barr virus B lymphoid cell line GM12878 (ref. 44; supplemental Figure 10D). Finally, ranking of super-enhancers (as identified by H3K27ac HiChiP data) demonstrated stronger super-enhancer signals for the histone cluster on chromosome 6 and PAX5 regions in CDX2/UBTF cases compared with non-CDX2/UBTF B-ALL (supplemental Figure 10E; supplemental Table 12). These data support a role for upregulation of histone genes and PAX5 in CDX2/UBTF B-ALL in the case of histone gene deregulation either by chromosome 1 gain and/or the direct action of UBTF::ATXN7L3 (Figure 1B; supplemental Figure 10C).
Discussion
This study describes the entity and driver genomic alterations that define a high-risk B-ALL subtype with deregulation of CDX2 and UBTF. Increased expression of CDX2 has been described in B-ALL,50 but the mechanistic basis has been unknown. Here, we show the power of integrated RNA-seq and WGS to identify 2 new genomic alterations involving intergenic/noncoding regions that are not directly identifiable by RNA-seq,22 and by coupling these analyses with H3K27ac HiChIP analysis, we have directly demonstrated an enhancer retargeting mechanism at 13q12.2 leading to aberrant expression of CDX2 by HiChiP. Importantly, we demonstrated the different interaction of the same enhancer with nearby genes in 3 genomic alterations found in this region, type I deletion (FLT3 upregulation), type II deletion (unique in CDX2/UBTF B-ALL; CDX2 upregulation), and amplification (FLT3 and CDX2 upregulation). These different functional engagements between the enhancer and its target promoters can be explained by the recently described concept of enhancer release and retargeting, in which functional loss of a preferred promoter results in release of its normal enhancer partner, resulting in looping to, and activation of, an alternative promoter(s) within the same topologically associated domain.51 This was also observed in the PAX5 and ZCCHC7 region,51 one of the genomic features of CDX2/UBTF B-ALL that may drive the aberrant PAX5 expression observed in these cases. Notably, all these genomic deletions or inversions were estimated to have cryptic RSS sites at breakpoints, suggesting the involvement of aberrant RAG activity in the etiology of CDX2/UBTF B-ALL subtype.
The 2 distinct but universal genomic events observed in all CDX2/UBTF B-ALL cases are striking and reminiscent of DUX4-rearranged B-ALL. In that subtype, enhancer hijacking events drive deregulation of DUX4 that is followed by DUX4 binding to ERG, resulting in its deregulation and deletion, with expression of truncated, functionally aberrant ERG proteins.6 We estimated that the CDX2 abnormality arose first by comparing the VAFs of the 2 structural variants. Ectopic expression of CDX2 in stem and progenitor hematopoietic cells results in self-renewal, myeloid lineage skewing, and myelodysplasia and acute monoblastic leukemia, with suppression of lymphoid pathways.52,53 Additional alterations are likely required for the leukemogenesis of B-ALL, and accordingly, Ikzf1 alterations are associated with lymphoid phenotype in these mouse models. UBTF is one of the main downstream targets of MYC and regulates ribosome biogenesis by binding ribosomal RNA genes54 and deregulates gene expression, including those regulating B-cell differentiation, by reorganizing ribosomal DNA-genome contacts in a MYC-driven model of lymphoma.55 Thus, we hypothesize that aberrant expression of CDX2 may directly drive the genesis of the UBTF::ATXN7L3 deletion/rearrangement, and cooperativity of these alterations skews the phenotype of transformed cells to a B lymphoid lineage.
Identification of the CDX2/UBTF subtype of B-ALL at diagnosis is clinically important in that while uncommon in B-ALL overall (0.53%), this subtype was most frequently identified in AYA patients (1.4%), especially females (2.2%), and exhibits high-risk features including high incidence of relapse or refractory disease and elevated levels of MRD. Although the outcome of AYA with B-ALL has improved with the use of intensive, pediatric-inspired therapy,56-58 outcomes remain suboptimal. Several distinctive clinical and genomic features may be used to detect CDX2/UBTF B-ALL in clinical practice: (1) the distinctive immunophenotype, negativity for CD10, and positivity for cytoplasmic IgM; (2) the detection of UBTF::ATXN7L3 fusion by reverse transcription polymerase chain reaction, as the recurrent breakpoints (95%, 21/22) involve UBTF exon 17 or later and ATXN7L3 exon 1; (3) the use of genomic polymerase chain reaction to detect UBTF (17q21.31) deletions (10 kb) as the breakpoints were at approximately the same location (chr17:42,276,135 ± 6 and chr17:42,278,627 ± 3; GRCh37) in most cases (82%, 9/11); (4) the use of RNA-seq to confirm UBTF::ATXN7L3 fusion and high expression of CDX2; or (5) the use of WGS to detect the 2 universal genomic deletions on FLT3/PAN3 (13q12.2) and UBTF (17q21.31). In conclusion, the unique genomic and clinical characteristics of CDX2/UBTF B-ALL described here may help to guide refined risk stratification at diagnosis for improved outcome of AYA leukemia patients.
Supplementary Material
The online version of this article contains a data supplement.
Acknowledgments
The authors thank the Haematology Tissue Bank (Peter MacCallum Cancer Centre/Royal Melbourne Hospital) and Children’s Cancer Centre Tissue Bank (Murdoch Children’s Research Institute) for assistance with sample collection and the Genomics Core Facility and Genomics Platform Group (University of Melbourne Centre for Cancer Research) for sequencing and analysis support.
The authors are supported by the American and Lebanese Syrian Associated Charities of St. Jude Children’s Research Hospital; the St. Jude Children’s Research Hospital Chromatin Collaborative (C.G.M.); National Institutes of Health National Cancer Institute (NCI) grants P30 CA021765, R35 CA197695 (C.G.M.), U10 CA180820 (ECOG-ACRIN), UG1 CA232760 (M.L.), and UG1 CA189859 (E.M.P.); the Henry Schueler 41&9 Foundation (C.G.M.); a St. Baldrick’s Foundation Robert J. Arceci Innovation Award (to C.G.M.); a Garwood Postdoctoral Fellowship of the Hematological Malignancies Program of the St Jude Children’s Research Hospital Comprehensive Cancer Center (to S.K.); grants from the Wilson Centre for Blood Cancer Genomics (P.B.); the Snowdome Foundation (P.B.); the Peter MacCallum Cancer Foundation (G.L.R.); a SCOR Grant (7015-18); the Lymphoma and Leukemia Society (P.G.E.); the Perpetual Trustees and the Samuel Nissen Foundation (P.G.E.); and the National Health and Medical Research Council of Australia (A.O., GNT1140626).
Footnotes
RNA-seq, WGS, and HiChIP data have been deposited at the EGAS00001005863 (human primary samples), GSE190690 (cell lines), and housed in a protected Cloud environment in accordance with European General Data Protection regulations. Other legacy data used in this study have been deposited in the European Genome-Phenome Archive (EGA) in previous projects under accession numbers EGAS00001003266, EGAS00001000654, EGAS00001001952, EGAS00001001923, EGAS00001002217, and EGAS00001000447. The TARGET genomic data used in this study are available through the TARGET website (https://ocg.cancer.gov/programs/target) and also in the database of Genotypes and Phenotypes (dbGaP; http://www.ncbi.nlm.nih.gov/gap) under accession number phs000218 (TARGET).
Requests for data sharing may be submitted to Charles G. Mullighan (charles.mullighan@stjude.org).
The online version of this article contains a data supplement.
There is a Blood Commentary on this article in this issue.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Authorship
Contribution: S.K., L.M., I.I., and C.G.M. were responsible for conception and design of the study; S.K. and L.M. performed genomic data analysis; S.K. and P.E.M. performed experiments; Y.Z., Q.G., Z.G., G.L.R., M.M., W.W., S.B., J.Z., and K.G.R. assembled data; E.M.P., C.H., A.D.L., W.S., M.L., J.M.R., S.M.L., S.P.H., G.L.R., B.S., P.G.E., A.O., S.M.G., D.L.W., J.B., I.A., E.J.J., C.-H.P., W.K., T.H., and P.B. collected samples; S.K., L.M., I.I., and C.G.M. wrote the manuscript; and all authors critically revised the article and approved the final version.
Conflict-of-interest disclosure: I.I. received honoraria from Amgen and Mission Bio. P.G.E. received royalty distributions through Commercial Income Distribution from the Walter and Eliza Hall Institute. C.G.M. received research funding from Loxo Oncology, Pfizer, and AbbVie; honoraria from Amgen and Illumina; and holds stock in Amgen. There are no conflicts of interest with the work presented in this manuscript. A.D.L reports he is an employee of and has stock and/or other ownership interests in Pfizer. M.M. and W.W. report employment by Munich Leukemia Laboratory (MLL). C.H., W.K., and T.H. report equity ownership of MLL. The remaining authors declare no competing financial interests.
The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Correspondence: Charles G. Mullighan, Department of Pathology, Hematological Malignancies Program, St. Jude Children's Research Hospital, 262 Danny Thomas Place, Mail Stop 342, Memphis, TN 38105; e-mail: charles.mullighan@stjude.org.
REFERENCES
- 1.Iacobucci I, Kimura S, Mullighan CG. Biologic and therapeutic implications of genomic alterations in acute lymphoblastic leukemia. J Clin Med. 2021;10(17):3792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hunger SP, Mullighan CG. Acute lymphoblastic leukemia in children. N Engl J Med. 2015;373(16):1541-1552. [DOI] [PubMed] [Google Scholar]
- 3.Iacobucci I, Mullighan CG. Genetic basis of acute lymphoblastic leukemia. J Clin Oncol. 2017;35(9):975-983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kimura S, Mullighan CG. Molecular markers in ALL: clinical implications. Best Pract Res Clin Haematol. 2020;33(3):101193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lilljebjörn H, Henningsson R, Hyrenius-Wittsten A, et al. Identification of ETV6-RUNX1-like and DUX4-rearranged subtypes in paediatric B-cell precursor acute lymphoblastic leukaemia. Nat Commun. 2016;7(1):11790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhang J, McCastlain K, Yoshihara H, et al. ; St. Jude Children’s Research Hospital–Washington University Pediatric Cancer Genome Project . Deregulation of DUX4 and ERG in acute lymphoblastic leukemia. Nat Genet. 2016;48(12):1481-1489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yasuda T, Tsuzuki S, Kawazu M, et al. Recurrent DUX4 fusions in B cell acute lymphoblastic leukemia of adolescents and young adults [published correction appears in Nat Genet. 2016;48(12):1591]. Nat Genet. 2016;48(5):569-574. [DOI] [PubMed] [Google Scholar]
- 8.Li JF, Dai YT, Lilljebjörn H, et al. Transcriptional landscape of B cell precursor acute lymphoblastic leukemia based on an international study of 1,223 cases. Proc Natl Acad Sci USA. 2018;115(50):E11711-E11720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gu Z, Churchman M, Roberts K, et al. Genomic analyses identify recurrent MEF2D fusions in acute lymphoblastic leukaemia. Nat Commun. 2016;7:13331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Boer JM, Valsecchi MG, Hormann FM, et al. Favorable outcome of NUTM1-rearranged infant and pediatric B cell precursor acute lymphoblastic leukemia in a collaborative international study. Leukemia. 2021;35(10):2978-2982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hirabayashi S, Butler ER, Ohki K, et al. Clinical characteristics and outcomes of B-ALL with ZNF384 rearrangements: a retrospective analysis by the Ponte di Legno Childhood ALL Working Group. Leukemia. 2021;35(11):3272-3277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hirabayashi S, Ohki K, Nakabayashi K, et al. ; Tokyo Children’s Cancer Study Group (TCCSG) . ZNF384-related fusion genes define a subgroup of childhood B-cell precursor acute lymphoblastic leukemia with a characteristic immunotype. Haematologica. 2017;102(1):118-129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu YF, Wang BY, Zhang WN, et al. Genomic profiling of adult and pediatric B-cell acute lymphoblastic leukemia. EBioMedicine. 2016;8:173-183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Alexander TB, Gu Z, Iacobucci I, et al. The genetic basis and cell of origin of mixed phenotype acute leukaemia. Nature. 2018;562(7727):373-379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Iacobucci I, Roberts KG. Genetic alterations and therapeutic targeting of Philadelphia-like acute lymphoblastic leukemia. Genes (Basel). 2021;12(5):687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gu Z, Churchman ML, Roberts KG, et al. PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia. Nat Genet. 2019;51(2):296-307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Passet M, Boissel N, Sigaux F, et al. ; Group for Research on Adult ALL (GRAALL) . PAX5 P80R mutation identifies a novel subtype of B-cell precursor acute lymphoblastic leukemia with favorable outcome. Blood. 2019;133(3):280-284. [DOI] [PubMed] [Google Scholar]
- 18.Paietta E, Roberts KG, Wang V, et al. Molecular classification improves risk assessment in adult BCR-ABL1-negative B-ALL. Blood. 2021;138(11):948-958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jeha S, Choi J, Roberts KG, et al. Clinical significance of novel subtypes of acute lymphoblastic leukemia in the context of minimal residual disease-directed therapy. Blood Cancer Discov. 2021;2(4):326-337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jobanputra V, Wrzeszczynski KO, Buttner R, et al. Clinical interpretation of whole-genome and whole-transcriptome sequencing for precision oncology. Semin Cancer Biol. 2021;S1044-579X(21)00197-8. [DOI] [PubMed] [Google Scholar]
- 21.Rosenquist R, Cuppen E, Buettner R, et al. Clinical utility of whole-genome sequencing in precision oncology. Semin Cancer Biol. 2021;S1044-579X(21)00189-9. [DOI] [PubMed] [Google Scholar]
- 22.Montefiori LE, Bendig S, Gu Z, et al. Enhancer hijacking drives oncogenic BCL11B expression in lineage-ambiguous stem cell leukemia. Cancer Discov. 2021;11(11):2846-2867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Yamazaki H, Suzuki M, Otsuki A, et al. A remote GATA2 hematopoietic enhancer drives leukemogenesis in inv(3)(q21;q26) by activating EVI1 expression. Cancer Cell. 2014;25(4):415-427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kantarjian HM, DeAngelo DJ, Stelljes M, et al. Inotuzumab ozogamicin versus standard therapy for acute lymphoblastic leukemia. N Engl J Med. 2016;375(8): 740-753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zhao Y, Aldoss I, Qu C, et al. Tumor-intrinsic and -extrinsic determinants of response to blinatumomab in adults with B-ALL. Blood. 2021;137(4):471-484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Walter W, Shahswar R, Stengel A, et al. Clinical application of whole transcriptome sequencing for the classification of patients with acute lymphoblastic leukemia. BMC Cancer. 2021;21(1):886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dobin A, Davis CA, Schlesinger F, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Stransky N, Cerami E, Schalm S, Kim JL, Lengauer C. The landscape of kinase fusions in cancer. Nat Commun. 2014;5(1):4846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Anders S, Pyl PT, Huber W. HTSeq – a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31(2):166-169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882-883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zhou Y, Zhou B, Pache L, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545-15550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mootha VK, Lindgren CM, Eriksson KF, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267-273. [DOI] [PubMed] [Google Scholar]
- 35.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754-1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28(18):i333-i339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chen X, Schulz-Trieglaff O, Shaw R, et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. 2016;32(8):1220-1222. [DOI] [PubMed] [Google Scholar]
- 38.Robinson JT, Thorvaldsdóttir H, Winckler W, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Talevich E, Shain AH, Botton T, Bastian BC. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLOS Comput Biol. 2016;12(4):e1004873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Merelli I, Guffanti A, Fabbri M, et al. RSSsite: a reference database and prediction tool for the identification of cryptic recombination signal sequences in human and murine genomes. Nucleic Acids Res. 2010;38(Web Server issue):W262-W267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Mumbach MR, Rubin AJ, Flynn RA, et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat Methods. 2016;13(11):919-922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Juric I, Yu M, Abnousi A, et al. MAPS: Model-based analysis of long-range chromatin interactions from PLAC-seq and HiChIP experiments. PLOS Comput Biol. 2019;15(4):e1006982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lovén J, Hoke HA, Lin CY, et al. Selective inhibition of tumor oncogenes by disruption of super-enhancers. Cell. 2013;153(2): 320-334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Moore JE, Pratt HE, Purcaro MJ, Weng Z. A curated benchmark of enhancer-gene interactions for evaluating enhancer-target gene prediction methods. Genome Biol. 2020; 21(1):17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mullighan CG, Su X, Zhang J, et al. ; Children’s Oncology Group . Deletion of IKZF1 and prognosis in acute lymphoblastic leukemia. N Engl J Med. 2009;360(5): 470-480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Papaemmanuil E, Rapado I, Li Y, et al. RAG-mediated recombination is the predominant driver of oncogenic rearrangement in ETV6-RUNX1 acute lymphoblastic leukemia. Nat Genet. 2014;46(2):116-125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Iacobucci I, Li Y, Roberts KG, et al. Truncating erythropoietin receptor rearrangements in acute lymphoblastic leukemia. Cancer Cell. 2016;29(2):186-200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Iacobucci I, Storlazzi CT, Cilloni D, et al. Identification and molecular characterization of recurrent genomic deletions on 7p12 in the IKZF1 gene in a large cohort of BCR-ABL1-positive acute lymphoblastic leukemia patients: on behalf of Gruppo Italiano Malattie Ematologiche dell’Adulto Acute Leukemia Working Party (GIMEMA AL WP). Blood. 2009;114(10):2159-2167. [DOI] [PubMed] [Google Scholar]
- 49.Yang M, Safavi S, Woodward EL, et al. 13q12.2 deletions in acute lymphoblastic leukemia lead to upregulation of FLT3 through enhancer hijacking. Blood. 2020;136(8):946-956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Yasuda T, Sanada M, Kawazu M, et al. Two novel high-risk adult B-cell acute lymphoblastic leukemia subtypes with high expression of CDX2 and IDH1/2 mutations. Blood. 2021;blood.2021011921. [DOI] [PubMed] [Google Scholar]
- 51.Oh S, Shao J, Mitra J, et al. Enhancer release and retargeting activates disease-susceptibility genes. Nature. 2021; 595(7869):735-740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Galland A, Gourain V, Habbas K, et al. CDX2 expression in the hematopoietic lineage promotes leukemogenesis via TGFβ inhibition. Mol Oncol. 2021;15(9):2318-2329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Vu T, Straube J, Porter AH, et al. Hematopoietic stem and progenitor cell-restricted Cdx2 expression induces transformation to myelodysplasia and acute leukemia. Nat Commun. 2020;11(1):3021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.van Riggelen J, Yetil A, Felsher DW. MYC as a regulator of ribosome biogenesis and protein synthesis. Nat Rev Cancer. 2010; 10(4):301-309. [DOI] [PubMed] [Google Scholar]
- 55.Diesch J, Bywater MJ, Sanij E, et al. Changes in long-range rDNA-genomic interactions associate with altered RNA polymerase II gene programs during malignant transformation. Commun Biol. 2019;2(1):39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Stock W, Luger SM, Advani AS, et al. A pediatric regimen for older adolescents and young adults with acute lymphoblastic leukemia: results of CALGB 10403. Blood. 2019;133(14):1548-1559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Huguet F, Leguay T, Raffoux E, et al. Pediatric-inspired therapy in adults with Philadelphia chromosome-negative acute lymphoblastic leukemia: the GRAALL-2003 study [published correction appears in J Clin Oncol. 2009;27(15):2574]. J Clin Oncol. 2009;27(6):911-918. [DOI] [PubMed] [Google Scholar]
- 58.Siegel SE, Stock W, Johnson RH, et al. Pediatric-inspired treatment regimens for adolescents and young adults with Philadelphia chromosome-negative acute lymphoblastic leukemia: a review. JAMA Oncol. 2018;4(5):725-734. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.