Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Apr 1.
Published in final edited form as: Nat Genet. 2013 Sep 8;45(10):1226–1231. doi: 10.1038/ng.2754

Identification of a recurrent germline PAX5 mutation and susceptibility to pre-B cell acute lymphoblastic leukemia

Sohela Shah 1,2,#, Kasmintan A Schrader 1,2,#, Esmé Waanders 3,4,#, Andrew E Timms 5,#, Joseph Vijai 1,2,#, Cornelius Miething 1,#, Jeremy Wechsler 5, Jun Yang 6, James Hayes 1, Robert J Klein 1,2, Jinghui Zhang 7, Lei Wei 7,3, Gang Wu 7, Michael Rusch 7, Panduka Nagahawatte 7, Jing Ma 8, Shann-Ching Chen 3, Guangchun Song 3, Jinjun Cheng 3,8, Paul Meyers 9, Deepa Bhojwani 10, Suresh Jhanwar 11, Peter Maslak 12, Martin Fleisher 13, Jason Littman 2, Lily Offit 2, Rohini Rau-Murthy 2, Megan Harlan Fleischut 2, Marina Corines 2, Rajmohan Murali 11, Xiaoni Gao 1, Christopher Manschreck 2, Thomas Kitzing 1, Vundavalli V Murty 14, Susana Raimondi 3, Roland P Kuiper 4, Annet Simons 4, Joshua D Schiffman 15, Kenan Onel 16, Sharon E Plon 17, David Wheeler 17, Deborah Ritter 17, David S Ziegler 18, Kathy Tucker 19, Rosemary Sutton 20, Georgia Chenevix-Trench 21, Jun Li 21, David G Huntsman 22, Samantha Hansford 22, Janine Senz 22, Thomas Walsh 23, Ming Lee 23, Christopher N Hahn 24, Kathryn Roberts 3, Mary-Claire King 23, Sarah M Lo 25, Ross L Levine 26, Agnes Viale 27, Nicholas D Socci 28, Katherine L Nathanson 29, Hamish S Scott 24, Mark Daly 30, Steven M Lipkin 31, Scott W Lowe 1, James R Downing 3, David Altshuler 30, John T Sandlund 10,#, Marshall S Horwitz 5,#, Charles G Mullighan 3,#, Kenneth Offit 1,2,#
PMCID: PMC3919799  NIHMSID: NIHMS525604  PMID: 24013638

Somatic alterations of the lymphoid transcription factor gene PAX5 are a hallmark of B-progenitor acute lymphoblastic leukemia (B-ALL)1-3, but inherited mutations of PAX5 have not previously been described. Here, we report a novel heterozygous germline variant, c.547G>A (p.Gly183Ser), in the octapeptide domain of PAX5 that was found to segregate with disease in two unrelated kindreds with autosomal dominant B-ALL. Leukemic cells from all patients in both families exhibited 9p deletion, with loss-of-heterozygosity and retention of the mutant PAX5 allele at 9p13. Two additional sporadic ALL cases with 9p loss harbored somatic PAX5 Gly183 substitutions. Functional and gene expression analysis of the PAX5 mutation demonstrated relatively reduced transcriptional activity. These data extend the role of PAX5 alterations in the pathogenesis of pre-B ALL, and implicate PAX5 in a novel syndrome of susceptibility to pre-B cell neoplasia.

B-cell precursor acute lymphoblastic leukemia (ALL) is the most common pediatric malignancy. There is a 2-4 fold increased risk of developing the disease in children of affected siblings4, and in occasional cases ALL is inherited as a Mendelian disorder5. PAX5, encoding the B-cell lineage transcription factor paired box 5, is somatically deleted, rearranged or otherwise mutated in approximately 30% of sporadic B-progenitor ALL cases1,3,6-9. In PAX5-deficient mice, B-cell development is arrested at the pro-B-cell stage, and in vitro these cells can differentiate into other lymphoid and myeloid lineages10. PAX5 is also essential for maintaining the identity and function of mature B-cells11, and its deletion in mature B cells results in dedifferentiation to pro-B cells and aggressive lymphomagenesis12.

A heterozygous germline PAX5 variant, c.547G>A (NM_016734), p.Gly183Ser (NP_057953), was identified by exome sequencing in two families, one of Puerto Rican ancestry (family 1; Figure 1a) and the other of African-American ancestry (family 2; Figure 1b, Supplementary Note). This variant had not been previously described in public databases (Exome Variant Server, 1000 Genomes and dbSNP 137), or previous sequencing analyses of ALL and cancer genomes1,2,9. All affected family members had B-cell precursor ALL and all available diagnostic and relapse leukemic samples from both families demonstrated loss of 9p through i(9)(q10) or dicentric chromosomes involving 9q, both of which resulted in loss of the wild-type PAX5 allele and retention of PAX5 p.Gly183Ser (Figure 1c, Supplementary Figure 1 and Supplementary Table 1).

Figure 1. Familial Pre-B cell ALL associated with i(9)(q10) and dic(9;v) in two families harboring a novel, recurrent germline p.Gly183Ser variant.

Figure 1

a. Family 1 of Puerto Rican ancestry. The proband is denoted by an arrow. Exome sequencing was undertaken in germline DNA from all available affected (IV1, IV5, IV6, III5) and unaffected (IV9, III3, III4) individuals as well as the diagnostic leukemic sample from IV6. p.Gly183Ser variant status denoted by (+/−). b. Family 2 of African-American ancestry. The proband is denoted by an arrow. Exome sequencing was undertaken in diagnostic, remission and relapse leukemic samples from individuals III4, IV1, and IV2. p.Gly183Ser variant status denoted by (+/−). c. Chromosome 9 copy number heat map for SNP6.0 microarray data of germline and tumor samples from three members of Family 2. These data demonstrate the common feature of loss of 9p in the tumor specimens. Note the focal dark blue band denoting homozygous loss of CDKN2A/B in all samples. Blue indicates deletions and red indicates gains. G, germline; D, diagnosis; R, relapse sample. d. The haplotype flanking the p.Gly183Ser mutation. A five SNP haplotype rs7850825 to rs7020413 (Chr9:36.997-37.002 Mb) proximal to the mutation was concordant in both family 1 and family 2. However, the distal end flanking the mutation rs6476606 was discordant.

The germline PAX5 p.Gly183Ser mutation segregated with the leukemia in both kindreds, however, several unaffected obligate carriers (Family 1: II3, III2, III3 and Family 2: I1, I2, II2, II3) were also observed, suggesting incomplete penetrance. Unaffected mutation carriers and patients at the time of ALL diagnosis had normal immunoglobulin levels and no laboratory or clinical evidence of impaired B-cell function. Sanger sequencing of cDNA from peripheral blood of unaffected carriers indicated biallelic transcription of PAX5 (data not shown). The only mutated gene common to both families was PAX5 and no germline copy number aberrations were found to be shared between patients (Supplementary tables 2 and 3).

To determine whether p.Gly183Ser arose independently in each kindred or instead reflects common ancestry, the risk haplotypes of each family were compared. The families share a 4.7kb haplotype, spanning five SNPs (Figure 1d, Supplementary Note). The relatively small size of the shared haplotype and principal component analysis of genome-wide single nucleotide polymorphism genotype data (Supplementary Figure 2) together imply that the two families are not recently related and differ in ethnicity. Moreover, given the reduced fitness due to increased susceptibility for childhood ALL, it is unlikely that such a lethal mutation could be propagated over time. Because this haplotype is relatively frequent worldwide (Supplementary Table 4), it is likely that each family's mutation arose independently.

Genomic profiling of tumor samples demonstrated expression of the p.Gly183Ser mutant PAX5 in diagnostic and relapse tumor specimens from affected members of family 2, an average of 1 chimeric fusion and 9 non-silent sequence variants per case, and homozygous deletion of CDKN2A/CDKN2B in all cases due to loss of 9p and focal deletion of the second allele. Apart from loss of 9p, no other somatic sequence mutations or structural rearrangements were shared by the affected family members (Supplementary Tables 1, 5-12)

As somatic i(9)(q10)/dic(9;v) abnormalities were seen in all of the familial leukemias, we sequenced PAX5 in 44 additional sporadic pre-B ALL cases with i(9)(q10)/dic(9;v) aberrations to assess whether PAX5 mutations frequently co-occur with loss of 9p. Two leukemic samples revealed octapeptide mutations p.Gly183Ser, and p.Gly183Val, and in others previously reported variants including p.Pro80Arg and p.Val26Gly1 were observed (Table 1, Figure 2a). We examined the frequency of non-silent PAX5 somatic sequence mutations in a cohort of B-ALL cases with i(9)(q10)/dic(9;v) (n=28) and two cohorts of B-ALL without i(9)(q10)/dic(9;v) (n=183) and n=221.1,2. We observed a significantly higher frequency of PAX5 mutations in the i(9)/dic(9) cohort (p=0.0001). No novel germline PAX5 mutations were detected in 39 families with a history of two or more cases of cancer including at least one hematological cancer, although one familial case of ALL harbored a dic(9;20)(p11;q11.1) and a somatic p.Pro80Arg variant (Table 1 and Supplementary Materials).

Table 1.

PAX5 mutations found in familial and sporadic i(9)(q10)/dic(9;v) pre-B cell ALL samples

Inheritance Patient Mutation Tumor status Germline status
Family 1 IV6 c.547G>A = p.Gly183Ser Homozygous Heterozygous
Family 2 III4 c.547G>A = p.Gly183Ser Homozygous Heterozygous
Family 2 IV1 D c.547G>A = p.Gly183Ser Homozygous Heterozygous
Family 2 IV1 R c.547G>A = p.Gly183Ser Homozygous
Family 2 IV2 D c.547G>A = p.Gly183Ser Homozygous Heterozygous
Family 2 IV2 R c.547G>A = p.Gly183Ser Homozygous
Familial1 c.239C>G = p.Pro80Arg (tumor shows dic(9;20)(p11;q11.1)) Homozygous Wildtype
Sporadic c.77T>G = p.Val26Gly Heterozygous Wildtype
Sporadic c.77T>G = p.Val26Gly Heterozygous Wildtype
Sporadic c.77T>G = p.Val26Gly Heterozygous Wildtype
Sporadic c.197G>A = p.Ser66Asn Homozygous ND
Sporadic c.239C>G = p.Pro80Arg Homozygous Wildtype
Sporadic c.239C>G = p.Pro80Arg Homozygous Wildtype
Sporadic c.239C>G = p.Pro80Arg Homozygous Wildtype
Sporadic c.547G>A = p.Gly183Ser Homozygous ND
Sporadic c.548G>T = p.Gly183Val Heterozygous Wildtype
Sporadic c.1012G>T = p.Gly338Trp Heterozygous Wildtype
Sporadic c.1049-1051delAGTinsGTCCG = p.Gln350fs Heterozygous Wildtype
Sporadic c.1100_1100+15 del16bp (IVS9 splice) heterozygous ND
1

Familial case as reported in main text. ND = Not Determinable. Germline DNA either not tested or not available. PAX5 accession No: NM_016734

Figure 2. Recurrent PAX5 mutations in ALL.

Figure 2

a. Gene schematic of PAX5 showing the exons (upper grey numbers), amino acid residues (lower grey numbers), protein domains (as denoted by colored legend) and position of the germline p.Gly183Ser variant (in red) in relation to the somatic PAX5 mutations described in this study (n=13, arrows) and somatic mutations described previously in B-ALL1,2,20. Primary leukemic samples with confirmed retention of the germline p.Gly183Ser variant denoted by the square shape (Family 1) and diamond shape (Family 2). In one case of i(9)/dic(9) ALL, we found both a heterozygous Val26Gly and a heterozygous Gln350fs mutation, indicating polyclonality of the tumor. b. Conservation of the octapeptide domain in selected PAX family members.

Previously identified PAX5 somatic mutations commonly result in significant reduction in transcriptional activation mediated by PAX5. Downstream targets of PAX5 include CD19 and CD79A (Igα or mb-1)13. We examined the transactivating activity of wild type and mutant PAX5 alleles using a PAX5-dependent reporter gene assay, containing copies of a high-affinity PAX5-binding site derived from the CD19 promoter14. Both p.Gly183Ser and Val mutations resulted in partial but significant reduction in transcriptional activation (P<0.0001 for both alleles, Figure 3a). Additionally, there was no detectable differences in subcellular localization of wild type and p.Gly183Ser PAX5 (Supplementary Figure 3). To study the effect of this mutation on CD79A expression, we expressed mutant and wild type PAX5 in J558LμM, a mouse plasmacytoma cell line that does not express PAX5 or CD79A. Enforced expression of PAX5 results in expression of CD79A and assembly of the surface immunoglobulin (sIgM) complex. The amount of sIgM expression may be used to assess the transcriptional activity of PAX5 alleles on the CD79A promoter1. Both p.Gly183 alleles resulted in a significant reduction in sIgM expression compared to wild-type PAX5 (P<0.0001, Figure 3b). These results suggest that the PAX5 G183 mutations result in partial loss of PAX5 activity.

Figure 3. Attenuated transcriptional activity of PAX5 p.Gly183Ser.

Figure 3

a. Transcriptional activity of PAX5 variants compared to wild-type using a Pax5-dependent reporter gene assay in 293T cells. Bars show mean (± s.e.m.) luciferase activity of six individual experiments with triplicate measurements (PAX5 p.Gly183Val and PAX5 d2-6 four experiments with triplicate measurements). Asterisks indicate significant difference calculated by Dunnett's test (P<0.0001). MIR, MSCV-IRES-mRFP empty vector. b. Transcriptional activity of PAX5 variants using CD79A-dependent sIgM expression in the murine J558LμM plastocytoma cell line. Percentages indicate proportion of mRFP positive cells that show sIgM expression. Bars show mean (± s.e.m.) sIgM expression in two individual experiments with triplicate replicates each. Asterisks indicate significant difference calculated by Dunnett's test (P<0.0001). MIR, MSCV-IRES-mRFP empty vector. c. PAX5-dependent reporter gene assay of PAX5 wild-type and PAX5 p.Gly183Ser run in triplicate as above, with or without co-transfection of 0.05μg of Grg4 as indicated. A PAX5 p.Tyr179Glu (Y179E) mutant that is deficient in binding to Grg4 and empty vector were used as controls. Asterisks indicate significant differences by two-tailed t-test (p<0.0001). d. Heatmap of PAX5 activated genes in mature B cells. Four samples from family 2 (diagnosis and relapse samples from individuals IV1 and IV2) show differential expression of PAX5 activated genes when compared to a group of 139 sporadic B-ALL cases. This indicates an effect of the pGly183Ser mutation on PAX5 function. Red indicates high expression, blue represents low expression. PAX5 mutation status is indicated by the colors above the samples. Green indicates wild type PAX5, yellow indicates heterozygosity for a PAX5 mutation, and pink indicates biallelic PAX5 mutations.

The identified missense variant, p.Gly183Ser, is located at a conserved residue in the octapeptide (OP) domain of PAX5 that mediates interaction with Groucho transcriptional corepressors15 (Figure 2b). Previous studies have shown that GRG4 (also known as TLE4 in humans) represses PAX5-dependent luciferase activity in cells expressing wild type PAX5 but not in cells expressing PAX5 octapeptide domain mutants15. We observed GRG4-mediated repression of the transcriptional activity of PAX5 wild-type and p.Gly183Ser (Figure 3c), suggesting the effect of this mutation is not mediated by an altered interaction with GRG4.

To further explore the effect of the variant on downstream targets, we performed genome-wide transcriptional profiling of J558LμM cells transduced with empty vector, PAX5 wild-type or mutant alleles (either all transduced cells marked by RFP expression, or the subset of cells expressing sIgM), and examined the expression of PAX5 activated and repressed genes previously defined in Pax5−/− murine pro- and mature B cells16-19 and in human ETV6-RUNX1 B-ALL1. Examining all PAX5 expressing cells, we observed profound deregulation of PAX5 activated and repressed genes in J558LμM cells expressing known loss of function (e.g. the common exon 2-6 deletion that results in a truncating frameshift PAX5 allele) or strongly hypomorphic alleles (PAX5 p.Pro80Arg) and less marked deregulation in p.Gly183Ser or Val expressing cells (P values for each mutant allele versus wild-type were all P<0.001 (Supplementary Figures 4 and 5). Comparing sorted sIgM positive PAX5 p.Gly183Ser to PAX5 wild-type cells, we observed reduced expression of genes activated by PAX5 in pro-B and mature B-cells (P =1.4 × 10−4 and P =3.8 × 10−4 respectively, Supplementary Tables 13-15)

We next examined the transcriptional consequences of PAX5 p.Gly183Ser by performing transcriptome sequencing (mRNA-seq) of diagnosis and relapse samples obtained from two patients in kindred 2, and of 139 sporadic childhood B-progenitor ALL samples. We performed gene set enrichment analysis incorporating gene sets of PAX5-mutated ETV6-RUNX1 ALL (one third of which harbor focal PAX5 deletions)1, PAX5 regulated genes in Pax5−/− mice16-19 and genes regulated during murine B lymphoid development20. As a limited set of genes are known to be regulated in both murine pro- and mature B cells, and as the overlap between mouse and human PAX5 regulated genes are unknown, we used all previously published PAX5 regulated genes, and genes regulated during murine B cell development 16-20 in an unbiased approach to explore the effects of the PAX5 p.Gly183 mutations on direct and indirect transcriptional targets of PAX5. This showed striking enrichment of genes deregulated in ETV6-RUNX1/PAX5-mutated ALL, PAX5 activated and repressed genes (including CD19, CD72 and CD79a), and genes regulated during murine B lymphoid development in the signature of familial PAX5 p.Gly183Ser versus sporadic B-ALL (Figure 3d and Supplementary Figures 6 and 7. We also analyzed the overlap of previously published data and the expression differences between the familial ALL tumor samples and other B-ALL cases stratified by PAX5 mutation status (Supplementary file 6; Supplementary Figure 8). Together, these results suggest that PAX5 p.Gly183Ser results in attenuation of PAX5 function and deregulation of PAX5 target genes that is less severe than the previously reported p.Pro80Arg and D2-6 alleles that result in marked or complete loss of PAX5 activity.

The PAX5 deletions, translocations and sequence mutations identified as somatic events in B-ALL commonly affect the DNA-binding and transactivation domains and result in complete loss or marked attenuation of PAX5 transcriptional activity, but are rarely homozygous and not observed as inherited variants. Moreover, Pax5 loss promotes the development of B-ALL in experimental models which are commonly accompanied by the acquisition of second hits in Pax5 21, indicating that profound loss of PAX5 activity is commonly a central event in leukemogenesis. In contrast, the inherited PAX5 p.Gly183Ser mutation results in modest attenuation of PAX5 activity , and is accompanied by somatic loss of the wild-type PAX5 allele due to 9p alterations during leukemogenesis. This model is also consistent with the finding of a significant association of somatic PAX5 hypomorphic mutations coincident with complete loss of the normal PAX5 allele in leukemic cells absent 9p. These observations suggest that a severe reduction in PAX5 activity is incompatible with normal B lymphoid development and is deleterious in carriers, but by contrast, the partial hypomorphic p.Gly183Ser allele is tolerated as a germline allele but additional genetic events further reducing PAX5 activity are required to establish the leukemic clone. The universal finding of deletion of wild-type PAX5 in all familial ALL cases, rather than the acquisition of additional hypomorphic PAX5 mutations, suggests that a complete loss of wild-type PAX5 activity is required for developmental arrest and loss of maturation. This is supported by our transcriptional profiling of J558LμM p.Gly183Ser cells and familial leukemias showing deregulation of PAX5 target gene expression that is significant but less marked than known loss-of-function mutations. The differences in transcriptional profiles of some target gene panels were not as robustly observed as in mouse model systems, presumably due to inherent germline and somatic genetic and epigenetic variability in human leukemias. In addition, ongoing studies will be of interest to fully characterize the functional consequences of PAX5 octapeptide domain mutations.

Our findings have clinical implications with regard to options for pre-implantation genetic diagnosis, and the possible significance of somatic 9p alterations as a harbinger of a germline PAX5 mutation. The recent identification of germline TP53 mutations in familial ALL20,22 and the data presented here strongly implicating PAX5 mutations in a novel syndrome of inherited susceptibility to pre-B cell ALL, indicate that further sequencing of affected kindreds is required to define the full spectrum of germline variations contributing to ALL pathogenesis.

METHODS

Methods and any associated references are available in the online version of the paper

Supplementary Material

Main supplementary document
Supplementary Table 16
Supplementary Table 17
Supplementary Table 18
Supplementary Tables 19-21
Supplementary Tbales 9-12

ACKNOWLEDGMENTS

We thank M.A.S. Moore and S. Jae-Hung for their contributions to ongoing tumor studies; G. Dressler for the murine Grg4 construct; M. Busslinger for the luc-CD19 reporter construct; J. Hagman for providing a PAX5 vector and the J558LμM cell line; D. Payne-Turner for technical assistance; Dr. Wenjian Yang and Dr. Colton Smith (Pharmaceutical Sciences) for their assistance in the haplotype analyses and the Tissue Resources Core Facility, Pediatric Cancer Genome Project Core Facility and Flow Cytometry and Cell Sorting Core Facility of St Jude Children's Research Hospital. We thank the families for their generous participation in these studies. This project was supported by grant I5-A523 from the Starr Cancer Consortium, the Robert and Kate Niehaus Clinical Cancer Genetics Initiative, the Sabin Family Research Fund, the Lymphoma Foundation, Geoffrey Beene Cancer Research Center grant 78730, the Cancer Prevention and Research Institute of Texas Grant RP101089, NSW Priory of the Knights of the Order of Saint John, Matthew Bell Foundation, US National Cancer Institute of the National Institutes of Health (NIH) Comprehensive Cancer Center Core Grant CA21765, the American Lebanese Syrian Associated Charities of St Jude Children's Research Hospital, and grant R01DK58161. R.P.K. is funded by a grant from the Dutch Cancer Society (KUN2009-4298). T.K. is supported by the German Research Foundation Postdoctoral Fellowship (KI1605/1-1). C.G.M. is a Pew Scholar in the Biomedical Sciences, and a St Baldrick's Scholar. K.G.R. is supported by a National Health and Medical Research Council (NHMRC, Australia) CJ Martin Postdoctoral Fellowship. K.A.S. is funded by the Canadian Institutes of Health Research. A.T. is supported by T32GM007454. G.C.-T. is a Senior Principal Research Fellow of the NHMRC. E.W. is funded by the Dutch Cancer Society, project number KUN2012-5366.

Footnotes

AUTHOR CONTRIBUTIONS

K.O., C.G.M., M.S.H., J.T.S., S.S., K.A.S, E.W., A.T., J.V., C.M., S. L., R.J.K., M.D., D.A., and S.W.L. conceived and designed the experiment. S.S., K.A.S., E.W., A.T., J.V., C.M., J.W., J.Y., X.G., C.M., R.J.K., A.V., N.D.S., D.A., M.S.H., C.G.M., K.O., M.K., T.W., K.L., T.K., D.B., J.L., L.O., S.C.R., P.M., M.F., G.R., and J.C. performed the experiments. S.S., K.A.S., E.W., A.T., J.V., C.M., J.W., J.Y., R.J.K., N.D.S, M.S.H., C.G.M., K.O., L.W., J.Z., G.W., M.R., P.N., J.M., S.C., G.S., and J.C. performed statistical analysis. S.S., K.A.S., E.W., A.T., J.V., C.M., J.W., J.Y., C.M., R.R., M.C., R.M., M.H., S.L., R.J.K., A.V., N.D.S., D.A., C.H., H.S.S., S.W.L., M.S.H., C.G.M., K.O., L.W., J.H., J.Z., G.W., M.R., P.N., J.M., S.C., G.S., and J.C. analyzed the data. E.W., C.M., J.T.S, S.J., M.H., J.S., V.M., S.P., D.G.H., D.Z., G.C., S.L., S.M.L., R.L., A.V., K.L.N., M.D., D.A., C.H., H.S.S., S.W.L., M.S.H., C.G.M., K.O., R.P.K., A.S., J.J.L., K.T., R.S., S.H., J.S., D.W., D.R., P.M., J.Z., G.W., J.M., S.C., J.D., and K.O. contributed reagents, materials, and analysis tools. K.O., C.G.M., M.S.H., S.S., K.A.S., E.W., A.T., J.V., C.M., J.Y., R.J.K., and S.W.L. wrote the paper. K.O., C.G.M., M.S.H., J.T.S., R.J.K., M.D., D.A., and S.W.L. jointly supervised the research.

URLS. dbSNP 137, http://www.ncbi.nlm.nih.gov/projects/SNP/; NHLBI exome sequencing project, http://evs.gs.washington.edu/EVS/; European Genome-phenome Archive, http://www.ebi.ac.uk/ega/. Public data portal for results from the St. Jude–Washington University Pediatric Cancer Genome Project, http://explore.pediatriccancergenomeproject.org/.

Accession codes.

Transcriptome and whole exome sequencing data and SNP microarray data have been deposited in the European Genome-phenome Archive (EGA) which is hosted by the European Bioinformatics Institute (EBI) under EGAS00001000447. Mouse Affymetrix gene expression data are deposited in the Gene Expression Omnibus (GEO) under accession code GSE45260 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE45260)

Supplementary information is available in the online version of the paper.

Supplementary information contains Supplementary Note, Supplementary Figures 1-8, Supplementary Tables 1-15 and Supplementary files 1-6.

COMPETING FINANCIAL INTERESTS

The authors declare no competing financial interests

REFERENCES

  • 1.Mullighan CG, et al. Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia. Nature. 2007;446:758–64. doi: 10.1038/nature05690. [DOI] [PubMed] [Google Scholar]
  • 2.Mullighan CG, et al. Deletion of IKZF1 and prognosis in acute lymphoblastic leukemia. N Engl J Med. 2009;360:470–80. doi: 10.1056/NEJMoa0808253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kuiper RP, et al. High-resolution genomic profiling of childhood ALL reveals novel recurrent genetic lesions affecting pathways involved in lymphocyte differentiation and cell cycle progression. Leukemia. 2007;21:1258–66. doi: 10.1038/sj.leu.2404691. [DOI] [PubMed] [Google Scholar]
  • 4.Hemminki K, Jiang Y. Risks among siblings and twins for childhood acute lymphoid leukaemia: results from the Swedish Family-Cancer Database. Leukemia. 2002;16:297–8. doi: 10.1038/sj.leu.2402351. [DOI] [PubMed] [Google Scholar]
  • 5.Pui CH, Robison LL, Look AT. Acute lymphoblastic leukaemia. Lancet. 2008;371:1030–43. doi: 10.1016/S0140-6736(08)60457-2. [DOI] [PubMed] [Google Scholar]
  • 6.Mullighan CG, Downing JR. Global genomic characterization of acute lymphoblastic leukemia. Semin Hematol. 2009;46:3–15. doi: 10.1053/j.seminhematol.2008.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mullighan CG, et al. BCR-ABL1 lymphoblastic leukaemia is characterized by the deletion of Ikaros. Nature. 2008;453:110–4. doi: 10.1038/nature06866. [DOI] [PubMed] [Google Scholar]
  • 8.Nebral K, et al. Incidence and diversity of PAX5 fusion genes in childhood acute lymphoblastic leukemia. Leukemia. 2009;23:134–43. doi: 10.1038/leu.2008.306. [DOI] [PubMed] [Google Scholar]
  • 9.Zhang J, et al. Key pathways are frequently mutated in high-risk childhood acute lymphoblastic leukemia: a report from the Children's Oncology Group. Blood. 2011;118:3080–7. doi: 10.1182/blood-2011-03-341412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Nutt SL, Heavey B, Rolink AG, Busslinger M. Commitment to the B-lymphoid lineage depends on the transcription factor Pax5. Nature. 1999;401:556–62. doi: 10.1038/44076. [DOI] [PubMed] [Google Scholar]
  • 11.Horcher M, Souabni A, Busslinger M. Pax5/BSAP maintains the identity of B cells in late B lymphopoiesis. Immunity. 2001;14:779–90. doi: 10.1016/s1074-7613(01)00153-4. [DOI] [PubMed] [Google Scholar]
  • 12.Cobaleda C, Jochum W, Busslinger M. Conversion of mature B cells into T cells by dedifferentiation to uncommitted progenitors. Nature. 2007;449:473–7. doi: 10.1038/nature06159. [DOI] [PubMed] [Google Scholar]
  • 13.Maier H, Colbert J, Fitzsimmons D, Clark DR, Hagman J. Activation of the early B-cell-specific mb-1 (Ig-alpha) gene by Pax-5 is dependent on an unmethylated Ets binding site. Mol Cell Biol. 2003;23:1946–60. doi: 10.1128/MCB.23.6.1946-1960.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Czerny T, Busslinger M. DNA-binding and transactivation properties of Pax-6: three amino acids in the paired domain are responsible for the different sequence recognition of Pax-6 and BSAP (Pax-5). Mol Cell Biol. 1995;15:2858–71. doi: 10.1128/mcb.15.5.2858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Eberhard D, Jimenez G, Heavey B, Busslinger M. Transcriptional repression by Pax5 (BSAP) through interaction with corepressors of the Groucho family. EMBO J. 2000;19:2292–303. doi: 10.1093/emboj/19.10.2292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pridans C, et al. Identification of Pax5 target genes in early B cell differentiation. J Immunol. 2008;180:1719–28. doi: 10.4049/jimmunol.180.3.1719. [DOI] [PubMed] [Google Scholar]
  • 17.Revilla IDR, et al. The B-cell identity factor Pax5 regulates distinct transcriptional programmes in early and late B lymphopoiesis. EMBO J. 2012;31:3130–46. doi: 10.1038/emboj.2012.155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Delogu A, et al. Gene repression by Pax5 in B cells is essential for blood cell homeostasis and is reversed in plasma cells. Immunity. 2006;24:269–81. doi: 10.1016/j.immuni.2006.01.012. [DOI] [PubMed] [Google Scholar]
  • 19.Schebesta A, et al. Transcription factor Pax5 activates the chromatin of key genes involved in B cell signaling, adhesion, migration, and immune function. Immunity. 2007;27:49–63. doi: 10.1016/j.immuni.2007.05.019. [DOI] [PubMed] [Google Scholar]
  • 20.Holmfeldt L, et al. The genomic landscape of hypodiploid acute lymphoblastic leukemia. Nat Genet. 2013;45:242–52. doi: 10.1038/ng.2532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Dang J, Mullighan CG, Phillips LA, Mehta P, Downing JR. Retroviral and Chemical Mutagenesis Identifies Pax5 as a Tumor Suppressor in B-Progenitor Acute Lymphoblastic Leukemia. Blood (ASH Annual Meeting Abstracts) 2008;112:1789. [Google Scholar]
  • 22.Powell BC, et al. Identification of TP53 as an acute lymphocytic leukemia susceptibility gene through exome sequencing. Pediatr Blood Cancer. 2012 doi: 10.1002/pbc.24417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.DePristo MA, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature genetics. 2011;43:491–8. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Pounds S, et al. Reference alignment of SNP microarray signals for copy number analysis of tumors. Bioinformatics. 2009;25:315–21. doi: 10.1093/bioinformatics/btn624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Olshen AB, Venkatraman ES, Lucito R, Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004;5:557–72. doi: 10.1093/biostatistics/kxh008. [DOI] [PubMed] [Google Scholar]
  • 28.Venkatraman ES, Olshen AB. A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics. 2007;23:657–63. doi: 10.1093/bioinformatics/btl646. [DOI] [PubMed] [Google Scholar]
  • 29.Mullighan CG. Single nucleotide polymorphism microarray analysis of genetic alterations in cancer. Methods in molecular biology. 2011;730:235–58. doi: 10.1007/978-1-61779-074-4_17. [DOI] [PubMed] [Google Scholar]
  • 30.Lin M, et al. dChipSNP: significance curve and clustering of SNP-array-based loss-ofheterozygosity data. Bioinformatics. 2004;20:1233–40. doi: 10.1093/bioinformatics/bth069. [DOI] [PubMed] [Google Scholar]
  • 31.Cai Y, Brophy PD, Levitan I, Stifani S, Dressler GR. Groucho suppresses Pax2 transactivation by inhibition of JNK-mediated phosphorylation. EMBO J. 2003;22:5522–9. doi: 10.1093/emboj/cdg536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lundblad A, et al. Immunochemical studies on mouse myeloma proteins with specificity for dextran or for levan. Immunochemistry. 1972;9:535–44. doi: 10.1016/0019-2791(72)90063-8. [DOI] [PubMed] [Google Scholar]
  • 33.Sitia R, Neuberger MS, Milstein C. Regulation of membrane IgM expression in secretory B cells: translational and post-translational events. EMBO J. 1987;6:3969–77. doi: 10.1002/j.1460-2075.1987.tb02739.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Maier H, et al. Requirements for selective recruitment of Ets proteins and activation of mb-1/Ig-alpha gene transcription by Pax-5 (BSAP). Nucleic Acids Res. 2003;31:5483–9. doi: 10.1093/nar/gkg785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hombach J, Tsubata T, Leclercq L, Stappert H, Reth M. Molecular components of the B-cell antigen receptor complex of the IgM class. Nature. 1990;343:760–2. doi: 10.1038/343760a0. [DOI] [PubMed] [Google Scholar]
  • 36.Team RDC. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2009. [Google Scholar]
  • 37.Gentleman RC, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5:R80. doi: 10.1186/gb-2004-5-10-r80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Irizarry RA, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4:249–64. doi: 10.1093/biostatistics/4.2.249. [DOI] [PubMed] [Google Scholar]
  • 39.Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27. doi: 10.1093/biostatistics/kxj037. [DOI] [PubMed] [Google Scholar]
  • 40.Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3. 2004 doi: 10.2202/1544-6115.1027. Article3. [DOI] [PubMed] [Google Scholar]
  • 41.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57:289–300. [Google Scholar]
  • 42.Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhang J, et al. The genetic basis of early T-cell precursor acute lymphoblastic leukaemia. Nature. 2012;481:157–63. doi: 10.1038/nature10725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Trapnell C, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–5. doi: 10.1038/nbt.1621. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Main supplementary document
Supplementary Table 16
Supplementary Table 17
Supplementary Table 18
Supplementary Tables 19-21
Supplementary Tbales 9-12

RESOURCES