Abstract
The aetiology of childhood acute lymphoblastic leukaemia (ALL) is unclear. Genetic abnormalities have been identified in a number of ALL cases, although these alone are not sufficient for leukaemic transformation. Various in utero and post-natal environmental exposures have been suggested to alter risk of childhood ALL. DNA methylation patterns can be influenced by environmental exposures, and are reported to be altered in ALL, suggesting a potential mediating mechanism between environment and ALL disease risk. To investigate this, we used a ‘meet in the middle’ approach, investigating the overlap between exposure-associated and disease-associated methylation change. Genome-wide DNA methylation changes in response to possible ALL-risk exposures (i.e. breast feeding, infection history, day care attendance, maternal smoking, alcohol, caffeine, folic acid, iron, and radiation exposure) were investigated in a sub-population of the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort using an epigenome-wide association study (EWAS) approach (n=861-927), and compared to a list of ALL disease-associated methylation changes compiled from published data. Hypergeometric probability tests suggested that the number of directionally concordant gene methylation changes observed in ALL disease and in response to the following exposures; maternal radiation exposure (p=0.001), alcohol intake (p=0.006); sugary caffeinated drink intake during pregnancy (p=0.045); and infant day care attendance (p=0.003), were not due to chance. Data presented suggests that DNA methylation may be one mediating mechanism in the multiple hit pathway needed for ALL disease manifestation.
Keywords: radiation, alcohol, smoking, caffeine, ALSPAC
Introduction
Acute lymphoblastic leukaemia (ALL) is the most common form of childhood cancer [1]. Although survival rates have improved, with 5-year survival rate increasing from 60-90% [2], overall incidence of childhood ALL has been slowly increasing [3]. Genetic aberrations such as chromosomal translocations and hyperdiploidy have been suggested to play a role in ALL disease development, yet these events alone are not sufficient for disease manifestation. For example, a translocation resulting in the TEL-AML1 fusion gene associated with B-cell lineage ALL [4] has been found to be present in roughly 1% of newborns at birth [5], which is approximately 100-fold higher incidence rate than that of childhood B-cell lineage ALL. The peak incidence of childhood ALL, between 2-5 years of age, suggests that in utero and early life events, including environmental factors, may contribute to disease risk. Blood chimerism in twins, occurring via the transfer of hematopoietic stem cells between the foetuses, results in concordant twin cases of leukaemia [6]; these have been identified as having the same fusion-gene sequence or IGH/TCR sequences, supporting the notion of ALL originating in utero. Concordance in identical twins is estimated to be 10%, yet 70% of these share the same placenta, suggesting secondary events are required, and have a major impact on the likelihood of leukaemia development [7]. Furthermore, a recent study observed spatial clustering of ALL cases, using addresses of mothers during pregnancy, suggesting that environmental factors may play a role in disease development [8]. Various in utero and early childhood exposures have been investigated in the aetiology of childhood ALL [9, 10]. These include breast feeding [11], infection history [12], childcare/day care attendance [11, 12], maternal smoking [13, 14], alcohol [15, 16], caffeine [15, 17], folic acid [18, 19], iron [20], radiation [8], household paints [21], chemicals [22], pesticides [23, 24], and herbicides [25]. The weight of supporting evidence for the role of each exposure in the aetiology of childhood ALL varies. The evidence for exposure to pesticides and paints demonstrate consistently positive associations with increased risk of childhood leukaemia (including data from a recent meta-analysis for paint) [21]. Evidence for a role of maternal caffeine and folate intake is weaker, but emerging, with recent meta-analysis suggesting high caffeine intake may be a risk factor [17], with folate supplementation suggested to be protective [26]. Despite some evidence from earlier studies suggesting maternal smoking and alcohol intake may influence risk, more recent meta-analysis found no association between these exposures and childhood leukaemia risk [27]. Studies investigating iron intake during pregnancy and leukaemia risk have been small scale and therefore are limited in their ability to provide robust evidence [28–30]. Likewise, the number of studies investigating maternal and infant radiation exposure is limited, and thus despite associations being found between maternal and infant radiation exposure and childhood cancer risk [31, 32], further investigations are warranted. A number of studies, including a recent meta-analysis [11], suggest that breast feeding and exposure to infection in early life may be protective against leukaemia development.
Whilst the strength of the associations for some of these environmental factors and ALL risk is weak and evidence limited, this may be due to lack of available environmental data, limited number of cases for this rare disease (i.e. studies underpowered), and inaccuracies associated with collecting environmental exposure data retrospectively. The investigation of intermediate biomarkers linking an exposure and the disease outcome may therefore help to strengthen the evidence for the association of a given exposure and disease outcome, but also may offer novel insights into the underlying causal pathway to disease.
Epigenetics play a key role in human health and disease [33–35]. Epigenetic modifications such as DNA methylation, the most extensively studied epigenetic modification [36], and alterations to chromatic structure are able to modulate gene expression [37]. Methylation of cytosine residues located within CpG dinucleotides across the genome play a crucial role in cellular identity by inhibiting the binding ability of transcriptomic machinery to DNA, thus influencing gene expression [38, 39]. Each individual’s epigenome is subject to change induced via genetic, environmental, and stochastic factors throughout the life course [40]. However the epigenome is particularly susceptible to dysregulation via environmental influences during gestation, neonatal development, puberty, and old age [41]. The establishment of DNA methylation patterns occurs during embryonic development, therefore this may be an acutely critical and vulnerable period during which environmental influences are of importance for resultant DNA methylation profiles, and possible health consequences in later life [42].
Aberrant DNA methylation in ALL is well established [39, 43–45], particularly at promoter associated CpG islands, which results in hundreds of genes involved in critical molecular pathways being silenced [46]. ALL is a heterogeneous disease and is subtyped based on its cellular phenotype and cytogenetic aberrations. A core set of CpG sites have been identified as aberrantly methylated throughout all subtypes, as well as subtype specific DNA methylation patterns [39, 47].
As DNA methylation can be altered by environmental exposures, and is known to be altered in ALL itself, it is plausible that DNA methylation may act as one potential mediating mechanism via which in utero and early life exposures are able to increase the risk of developing ALL. We therefore hypothesised that if DNA methylation were a mediating mechanism between environmental risk factors and ALL, there would be common methylation changes associated with both a given exposure and ALL. To investigate this hypothesis we employed a ‘meet in the middle’ approach [48]. Firstly, an epigenome-wide association study (EWAS) was carried out to identify associations between individual CpG site methylation and ALL risk exposures [49]. Exposure-associated CpG sites were mapped to genes and compared with previously identified genes from the literature which have reported altered DNA methylation in ALL compared to healthy individuals. Hypergeometric testing was used to assess the probability that the resultant overlap between exposure-associated and disease-associated methylation was not due to chance. The resultant data suggests that DNA methylation may play a mediating role in the causal pathway linking ALL risk exposures with disease risk.
Materials and Methods
Study population
The ALSPAC cohort is a prospective birth cohort comprising 14541 pregnancies recruited from the Bristol area in England with expected delivery dates between 1st April 1991 and 31st December 1992 [50, 51]. A large amount of data has been collected including genetic, epigenetic, biological, social and other environmental exposures in relation to health and developmental outcomes. Blood samples were collected from all consenting mothers (during pregnancy and follow up clinic) and their children (birth; childhood, age seven years; and adolescence, age 17 years) at several clinics. The study website contains details of all the data that is available through a fully searchable data dictionary (http://www.bristol.ac.uk/alspac/researchers/data-access/data-dictionary/). The study was observational and aimed to intervene as little as possible in the normal course of pregnancy and childhood [52]. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees.
The Accessible Resource for Integrated Epigenomic Studies (ARIES) is a sub-population of the ALSPAC cohort [53]. ARIES comprises of 1018 mother child pairs who were selected based on the availability of matched-samples at multiple time points required for the study. Genome-wide DNA methylation analysis was carried out on blood samples (cord/peripheral) from these individuals at birth and age seven.
Identification of environmental factors associated with an increased risk of ALL
Literature searches were previously carried out to identify exposures associated with ALL risk and their potential to influence changes in DNA methylation using the PubMed and ScienceDirect databases (1st January 1987- 31st March 2018). This was carried out in three stages, firstly, reviews discussing ALL aetiology were screened for environmental exposures potentially associated with increased ALL risk. From this initial screen a list of key words were compiled (Box 1). Secondly, a more rigorous literature search was carried out using the key words to identify evidence for an association between environmental exposures and ALL risk. Thirdly, the identified potential ALL risk exposures were used to further investigate the literature to find any relationships between the identified exposures and changes in DNA methylation patterns (Box 1). References from these papers, including the reviews were also screened. No language restrictions were imposed. To be as inclusive as possible in our investigations, we examined all potential risk factors for which there has been at least one study providing evidence (even if limited) of an association between an environmental factor and ALL risk.
Box 1. Literature searches carried out, and key words used to identify ALL risk exposures and potential exposure associated changes in DNA methylation.
Literature search | Key words |
---|---|
1. Identification of ALL risk exposures | Leukemia, acute lymphoblastic, leukaemia |
2. Identification of evidence supporting an association between ALL and an exposure | Leukemia, acute lymphoblastic, leukaemia, radiation, smoking, alcohol, folate, folic acid, iron, coffee, caffeine, herbicides, pesticides, household chemicals, chemicals, household paints, paints, child care, day care, breast feeding, birth weight, infection history, infection, virus, and bacterial |
3. Identification of relationships between ALL risk exposures and changes in DNA methylation | DNA methylation, and exposures identified in literature search 2. |
Identification of genes with altered methylation in ALL disease
Methylation changes in ALL were identified from previously published literature [39, 54], the significant CpG sites identified were mapped to genes (see [39, 54] for details). Studies which utilised the Illumina Infinium® HumanMethylation450k BeadChip assay (450K BeadChip) to measure DNA methylation, were used as they directly correlated with the measurements carried out in the ALSPAC cohort. This resulted in a total of 2173 genes which had reported significant differential methylation in ALL disease compared to healthy controls (Table S1).
Additionally, for comparison, a list of gene loci with altered methylation in ALL disease from all large published studies [43–45, 55–58], i.e. with no exclusions based on methodologies used to measure methylation was also compiled and assessed in relation to variation in methylation observed in relation to environmental factors in the ALSPAC cohort.
Environmental exposure data available in the ALSPAC cohort
The environmental data used in this study was selected from nine questionnaires carried out as part of the ALSPAC study, and these ranged from early gestation, throughout pregnancy, and throughout the life of the offspring (still ongoing) (Figure 1). The impact of in utero environmental exposures including maternal alcohol intake, smoking, radiation exposure, folic acid supplementation, coffee consumption, sugary caffeinated drink consumption and iron supplementation during pregnancy on DNA methylation status in offspring cord blood were assessed (Table S2 for cohort characteristics and details of measure of exposure variables). No data were available for pesticide exposure, so this exposure was not assessed in this analysis. The influence of early life post-natal exposures i.e. day care attendance at 8, 38, and 54 months, breast feeding, and offspring reported illness by the mothers on DNA methylation in the blood at age seven were also assessed (Table S3). The data used in this study was selected to best represent each exposure under investigation (where data were available), and in line with previously published data (see Table S4 for summary of evidence and indication of risk level, and Table S5 for justification of variable selection). An additional search was carried out to identify larger methylation studies (450K BeadChip) which identified exposure associated methylation changes (FDR corrected and cell-type adjusted).
Figure 1. Study design.
1a) 450K BeadChip was used to measure DNA methylation in cord blood at birth (n = 861); and 1b) in blood age 7 (n = 927); 2) altered DNA methylation in ALL disease identified in previous studies [39, 54]; 3a) questionnaires carried out for the ALSPAC study were used to identify in utero exposures (8-28 weeks gestation) previously associated with ALL risk (Table S2); and 3b) post-natal exposures previously associated with ALL risk (8-58 months of age) (Table S3).
The Illumina Infinium® HumanMethylation450k BeadChip assay, data pre-processing and normalisation
Genome-wide DNA methylation data was previously generated for the ARIES sub-population using the 450K BeadChip. The arrays were scanned using an Illumina iScan and reviewed using GenomeStudio (version 2011.1). To minimise the possibility of confounding by batch effects, samples were semi-randomly arranged on the chip, making sure samples from each time point were represented on each chip. Data were pre-processed using R (version 3.0.1) at the MRC Integrative Epidemiology Unit, University of Bristol [53]. The WateRmelon package [59] was used according to the subset quantile normalisation approach described by Touleimat & Tost [60] to reduce non-biological differences between probes. Data were normalised using functional normalization [61], resulting in a total of 861 cord blood samples and 927 age seven samples for analyses.
In the cord blood, cell-type proportion estimation was used to adjust for cell type effects. Publicly available expression signature data were used, this data can be found in the NCBI Gene Expression Omnibus repository, GSE68456 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE68456) [62]. For the age seven analyses the estimateCellCounts function in the minfi R package [63] was used, this is based on the model developed by Houseman et al [64]. The estimated cell-type proportion was added to the regression model to adjust for the potential effect of this variable.
Multiple linear regression analyses of DNA methylation at individual CpG sites and environmental factors
Multiple linear regression analysis was used to investigate associations between DNA methylation at individual CpG sites and environmental exposures of interest. For each exposure, the resulting p-values were corrected for multiple testing by controlling the false discovery rate (FDR) at 5% (p ≤ 0.05), first described by Benjamini and Hochberg [65]. DNA methylation was modelled as a continuous variable (outcome), in a multivariate regression model accounting for potential confounders (sex, parity, gestation, cell type, and batch). These were selected as relevant covariates, as well as being in line with previous multiple linear regression models [66]. FDb.InfiniumMethylation.hg19 R package (FDb.InfiniumMethylation.hg19: Annotation package for Illumina Infinium DNA methylation probes) was used annotate the methylation data by mapping CpG sites to genes for further analyses. If CpG sites were mapped to multiple genes, both genes were included in the analyses, and unmappable CpGs were excluded.
Integration of lists of gene loci with altered methylation due to an ALL risk exposure and a list of gene loci with altered methylation in ALL
Molbiotools (http://www.molbiotools.com/listcompare.html) was used to integrate the list of gene loci with altered methylation in ALL, with the lists of gene loci with significantly altered methylation (FDR-adjusted p ≤ 0.05) for each ALL risk exposure analysed using the gene symbol for comparison. Molbiotools was also used to identify gene loci which overlapped between multiple exposures, as well as gene loci which were altered in ALL and multiple exposures. Hypergeometric tests were carried out using the GeneProf calculator (http://www.geneprof.org) to assess the probability that the observed overlapping changes in methylation found in relation to a given exposure and in ALL were likely to be significant (p ≤ 0.05) and not due to a chance observation. An n of 21231 (i.e. the number of genes represented on the array) was used as the ‘population size’, with 2173 (i.e. number of genes with altered methylation in ALL) as the ‘number of successes in the population’. N used for ‘sample sizes’ and ‘number of successes in sample’ were the number of genes found to have altered methylation in response to a given exposure and the number of genes found to overlap between methylation change associated with a given exposure and in ALL respectively. Where larger data sets with exposure-associated methylation changes (FDR corrected and cell type adjusted) and disease-associated methylation changes were available the same integration process discussed above was also carried out.
Gene Ontology enrichment and pathway analysis
The Database for Annotation, Visualization and Integrated Discovery (DAVID) 6.8 Beta (https://david-d.ncifcrf.gov/) [67] was used to carry out Gene Ontology (GO) and Kyoto Encyclopaedia of genes and genomes (KEGG) pathway analysis to investigate the biological pathways and processes that may be affected through DNA methylation changes associated with individual in utero and post-natal exposures (FDR-adjusted p ≤ 0.05). GO and pathway analysis were carried out for each individual exposure using the resultant list of significant genes found to have altered methylation in relation to the given exposure. Furthermore, for those exposures where genes were also found to have altered methylation in ALL [39, 54], this analysis was also carried out using the list of overlapping genes.
Results
Epigenome-wide association study investigating the influence of ALL risk exposures on DNA methylation
For the majority of in utero exposures examined, methylation was found to be significantly altered across a range of gene loci in cord blood at birth (FDR-adjusted p ≤ 0.05) (Table 1), with most changes observed to be associated with maternal radiation exposure (n CpGs=288), alcohol intake (n CpGs=192), and consumption of sugary caffeinated drinks (n CpGs=66) during pregnancy. Only maternal iron intake whilst pregnant had no significantly associated methylation changes. The split between direction of methylation change in response to radiation and smoking (throughout pregnancy, at 3 months, and pre-pregnancy) exposures were roughly equal (i.e. 51% and 55%, 53%, 43% hypo- and 49% and 45%, 47%, 57% hypermethylated CpGs respectively). Whilst alcohol intake, sugary caffeinated drinks, and coffee consumption had larger proportions of hypermethylated compared to hypomethylated CpGs (82%, 73% and 60% respectively) (Table 1 and Table S6). Only folic acid supplementation resulted in a higher percentage (89%) of hypomethylated CpGs (Table 1 and Table S6). Gene ontology and pathway analysis carried out for each of the gene lists generated for individual in utero exposures only identified one potentially significantly affected pathway (‘Amoebiasis’) in response to the methylation changes which occurred in response to radiation exposure (Table 2). No other biological processes or pathways were suggested to be affected by methylation changes associated with other in utero exposures. Two additional studies were identified with larger data sets for maternal smoking (n CpGs=6073 and genes=3176) [68] and maternal plasma folate (n CpGs=443 and genes=229) [69] associated methylation changes. Both maternal smoking and maternal plasma folate studies had a significant overlap of gene loci with exposure associated- and ALL-associated methylation changes (4.95 x 10-11, 4.32 x 10-13, respectively) (Table 3).
Table 1.
Number of genes with altered methylation due to an ALL risk exposure (FDR ≤ 0.05) and the overlapping number of genes also observed to have altered methylation in ALL disease.
Exposure | Number of CpGs (hyper/hypo-methylated) with altered methylation in association with exposure (FDR ≤ 0.05) | Number of gene loci with altered methylation in association with exposure (FDR ≤ 0.05) | Number of overlapping gene loci with exposure-associated methylation change and altered methylation in ALLa | Number of overlapping gene loci with the same direction of methylation change (i.e. hypo/hypermethylated) for exposure and ALL (% concordance)b | Number of overlapping gene loci with exposure-associated methylation change and altered methylation in ALL from all published literaturea,c |
---|---|---|---|---|---|
In utero exposures | |||||
Radiation exposure | 288 (140/148) | 239 | 54 (1.69 x 10-8) | 40 (74%; 0.001) | 156 (0.001) |
Alcohol intake | 192 (158/34) | 175 | 33 (3.98 x 10-4) | 29 (88%; 0.006) | 94 (0.702) |
Sugary caffeinated drinks | 66 (48/18) | 54 | 12 (7.42 x 10-3) | 10 (83%; 0.045) | 30 (0.548) |
Smoking throughout pregnancy | 22 (10/12) | 13 | 5 (7.11 x 10-3) | 3 (60%; 0.140) | 10 (0.097) |
Smoking at 3 months pregnancy | 15 (7/8) | 7 | 3 (2.73 x 10-2) | 1 (33%; 0.530) | 5 (0.324) |
Smoking pre-pregnancy | 7 (4/3) | 4 | 2 (5.45 x 10-2) | 1 (50%; 0.350) | 4 (0.094) |
Folic acid supplementation | 9 (1/8) | 7 | 2 (0.155) | 1 (50%; 0.155) | 5 (0.324) |
Coffee consumption | 15 (9/6) | 13 | 1 (0.754) | 0 | 4 (0.981) |
Post-natal exposures | |||||
Reported a cold (6 months) | 75 (20/55) | 60 | 9 (0.156) | 3 (33%; 0.952) | 39 (0.085) |
Attended day nursery (8 months) | 11 (6/5) | 11 | 6 (2.65 x 10-5) | 5 (83%; 0.003) | 9 (0.068) |
P-value for Hypergeometric probability test for number of overlapping gene loci.
P-value for Hypergeometric probability test for number of overlapping gene loci with same direction of methylation change.
Table 2.
KEGG pathway which may be altered in response to altered methylation observed in children who were exposed in utero to radiation exposure (Total number of genes recognised by DAVID n = 79). - = hypomethylated, + hypermethylated.
KEGG pathway term | Pathway name | Total genes altered on pathway | Genes altered (- /+ methylation) | Total genes on pathway | P-value |
---|---|---|---|---|---|
hsa05146 | Amoebiasis | 5 | LAMA3 (+), LAMA5 (-), NFKB1 (+), IL12B (+), COL5A1(-) | 106 | 0.031399 |
Table 3.
Number of genes with altered methylation due to ALL risk exposure, reported in published 450K studies (FDR corrected and cell type adjusted) and the overlapping number of genes also observed to have altered methylation in ALL.
Exposure | Study (reference) | Number of CpGs (hyper/hypo-methylated) with altered methylation in association with exposure (FDR ≤ 0.05) | Number of gene loci with altered methylation in association with exposure (FDR ≤ 0.05) | Number of overlapping gene loci with exposure-associated methylation change and altered methylation in ALL* |
---|---|---|---|---|
In utero exposures | ||||
Smoking throughout pregnancy | [8] | 6073 (3185/2888) | 3176 | 431 (4.95 x 10-11) |
Maternal plasma folate | [9] | 443 (27/416) | 229 | 62 (4.32 x 10-13) |
P-value for Hypergeometric probability test for number of overlapping gene loci.
At age seven, post-natal exposures used as proxy measures for infection i.e. day care attendance (8 months of age) and maternally reported cold (6 months of age) were found to be significantly associated with altered methylation of 11 and 60 gene loci respectively (Table 1). Post-natal exposure to day care attendance at 38 months and 54 months, and breast feeding had no significantly associated methylation changes. Whilst the direction of methylation change in response to day care attendance was approximately equal i.e. 45% hypo- vs 55% hypermethylated CpGs; the majority of CpGs with altered methylation in response to having cold symptoms were hypomethylated (73%) (Table 1 and Table S6). GO and pathway analysis revealed that methylation changes associated with recorded cold exposure may significantly affect the MAPK signalling pathway (Table 4), but changes associated with day care attendance and breast feeding were not found to be associated with any biological processes or pathways.
Table 4.
KEGG pathway which may be altered in response to altered methylation observed in children who were postnatally recorded as having a cold (6 months) (Total number of genes recognised by DAVID n = 12). - = hypomethylated, + hypermethylated.
KEGG pathway term | Pathway name | Total genes altered on pathway | Genes altered (- /+ methylation) | Total genes on pathway | P-value |
---|---|---|---|---|---|
hsa04010 | MAPK signalling pathway | 4 | MAP3K7 (-), MAPK1 (-), ATF4 (-), ARRB1(-) | 267 | 0.017247 |
DNA methylation changes in ALL and integration with potential ALL risk exposure associated changes in methylation
Hypergeometric probability tests were used to ascertain if the degree of overlap between methylation changes observed in ALL and those associated with a potential risk exposure were statistically significant or likely to be due to chance. For over half of the exposures investigated (Table 1) there was a significant overlap between genes with both exposure-associated and disease-associated methylation change.
The concordance of the change in direction of methylation influenced by exposures and in ALL was not uniform across exposures. Whilst concordance was high (i.e. over 70%) for radiation exposure, alcohol intake, sugary caffeinated drinks consumption and day care attendance, for all other exposures concordance rates were intermediate (smoking and folate supplementation) or even low (reported cold symptoms and coffee consumption) (Table 1). We therefore carried out further hypergeometric tests to ascertain if the overlap between concordant methylation changes observed in ALL and those associated with a potential risk exposure were a statistically significant finding or likely to be due to chance (Table S7). These analyses suggested there was a statistically significant overlap between genes with methylation change in ALL and those found to have the same direction of methylation change in response to in utero exposure to radiation exposure, alcohol intake, sugary caffeinated drinks consumption, and to day care attendance at 8 months (Table 1). An unexpected disparity between findings for maternal coffee and sugary caffeinated drinks consumption have been reported in this study. This may allude to something other than caffeine driving the larger number of concordant changes identified for sugary caffeinated drinks consumption. The final column in Table 1 shows the overlap between genes with exposure-associated changes in methylation and changes in methylation in ALL disease from all published studies with no restrictions on the methods used to measure methylation. There is a lack of overlap (except for radiation exposure associated gene loci) compared to that observed in studies only using the 450K BeadChip array. The disparities between studies are likely to be partially or largely driven by the differences in technologies used to measure DNA methylation [43–45, 55–58], making the comparisons between data sets difficult, therefore this additional analysis should be interpreted with care.
Comparison of the data sets found that 7 gene loci (KCNQ1DN, MAD1L1, ROBO3, SEPT9, SNX8, TBX2 and ZC3H3) had altered methylation in ALL and in response to more than one potential ALL risk exposure, although the direction of methylation change was only found to be conserved between ALL and exposures for one gene (TBX2). Pathway analysis did not identify any significantly affected pathways when using the lists of genes which had both disease and exposure-associated altered methylation, most likely due to an inadequate number of genes overlapping between each exposure and ALL disease for this type of analysis or limitation of relying on gene mapping approaches.
Discussion
Due to the rarity of childhood ALL it is very difficult to investigate causal pathways in disease aetiology. The early onset of disease, with peak incidence between 2-5 years of age, in utero and early life periods are likely to be crucial for disease development. Therefore, to improve understanding of this complex disease pathway, better knowledge of the differences between individuals who possess/experience a first ‘hit’ (e.g. ALL-associated genetic translocation) and develop disease, compared to those that do, yet do not develop ALL will be key. However, as such genetic abnormalities are rare in the general population (1% for the TEL-AML1 fusion gene [5]), and fresh whole blood samples are required to measure such abnormalities, to the best of our knowledge, there is currently no single cohort possessing the numbers, data and samples required to investigate ALL disease development in this manner. We therefore applied a ‘meet in the middle’ approach [48] to investigate one plausible mechanism in the disease pathway to link possible risk factors to ALL cases i.e. DNA methylation changes.
We hypothesised that DNA methylation may act as a potential mediating mechanism between environmental exposures and childhood ALL risk. The majority of ALL risk exposures investigated in this study were shown to influence DNA methylation at multiple gene loci, with some degree of overlap found with disease-associated methylation changes. In demonstrating that methylation changes associated with these potential risk exposures occur in genes which also have altered methylation in overt disease, we provide evidence for a mediating role of DNA methylation in the causal disease pathway. Hypergeometeric tests suggest that for over half of the exposures investigated (i.e. in utero radiation exposure, maternal alcohol intake, maternal sugary caffeinated drinks intake, maternal smoking and infant day care attendance at 8 month), these concomitant gene methylation changes were not likely to be due to chance. Moreover, 4/5 of those exposures were found to have significant concordant changes in direction of methylation associated with exposure and ALL disease (with the exception of maternal smoking). In addition, we also compared our ALL-disease associated methylation signature with two previously published studies that investigated variation in methylation in association with maternal smoking and plasma folate [68, 69] (Table 3). This analysis provided additional evidence demonstrating that maternal smoking and plasma folate were associated with a large number of methylation changes, and that a significant number of these changes were also found in ALL disease. We would therefore suggest that for radiation exposure, maternal alcohol intake, maternal sugary caffeinated drinks intake, smoking, and folic acid intake, it is plausible that methylation changes which occur as a result of these exposures may contribute to the aetiology of ALL.
Whilst, in this context significant associations were also observed for day care attendance, the causal pathway towards disease may be less clear. It has been suggested that infections during early life (<1yr) ‘prime’ the immune system, enabling it to elicit an appropriate immune response later in childhood and adult life [70]. Without this ‘priming’, it is thought that exposure to infection later in life is likely to lead to increased proliferation and error which may lead to the development of leukaemia in individuals harbouring genetic aberrations. Greaves [71] suggested a two-hit model for role of infections, specifically in B cell precursor ALL. The proposed mechanism is multifactorial, involving inherited genetics, modulators of risk (i.e. diet), and exposure to infections in early-life. He further proposes that developed societies restriction of early-life microbial exposure has led to an increase in cases, and may therefore, in-part, be preventable. As such, day care attendance has been utilised as a proxy measure for infection, with inverse associations reported between attendance and ALL risk [12]. If day care attendance is associated with lower risk of ALL, we would hypothesise that DNA methylation changes associated with day care attendance would be protective, and therefore show the opposite direction of methylation change to that observed in disease. However, here we observed concordant changes across 5/6 genes (i.e. day care related methylation changes were associated with ALL risk not protection). Whilst the reason for this perhaps unexpected outcome is unclear, there may be several rational explanations for this finding. In this study recording of day care attendance was taken at ~8 months. Therefore individuals who attended day care thereafter would have been identified as non-attenders. There are therefore likely to have been a reasonable proportion of infants who attended day care between the ages of 8 month and 1 year i.e. after data collection and prior to usual state nursery attendance schedule, which may have influenced DNA methylation patterns of those individual, and may have skewed the outcome. There is also the possibility that other/additional exposures modify the effect of day care exposure, for example breast feeding has been shown to modify the effect of day care attendance on risk of type 1 diabetes, where day care attendance was only associated with a decreased risk with increasing breast feeding duration, and increased risk with non-breastfed children [72]. Furthermore, the use of a proxy measure of infection, may mean that the observed differences in DNA methylation may be due to other confounding factors, rather than infection itself i.e. children who attend day care may belong to different socio-demographic groups than those who do not, and also have a higher degree of social interaction [73]. A selective increase in ALL incidence in children 1-4 years of age has been shown during socioeconomic transition [74]. The authors further suggested that difference in socioeconomic status (SES) may lead to an increase in a genetically distinct subtype of ALL susceptibility, suggesting hyperdiploid ALL as a candidate. Additional studies have proposed that DNA methylation may act as a modifiable mechanism by which factors such as SES become physically embodied [75], resulting in altered disease risk between SES groups. A spurious result must also be considered, given that for day care there were only a small number of individuals exposed i.e. only 46/871 were reported to attended day care at 8 month of age. This may also be true for maternal radiation exposure were only 29/849 were reported to be exposed. Whilst we concede that it is possible that these observations may be due to chance, we would argue that the perinatal window represents a period of a high degree of plasticity for the developing epigenome compared to later time points during childhood and therefore it is plausible that some of the observations reported here may be real due to the timing of the exposure. In particular, the magnitude and significance of change identified for at least two of the six genes whose methylation were influenced by day care attendance (and are altered in ALL) (TMEM132C, -7% (p=0.003); and GLB1L, -2% (p=0.03)), implies a true association with this exposure, and potential candidates for future studies. Indeed, when using reported cold symptoms in the first 6 months of life as a measure of infection, which perhaps may be a less biased, although imperfect measure, we observed a larger number of methylation changes (i.e. 75 CpGs as opposed to just 11 for day care attendance). Despite a lack of statistically significant overlap between ALL-associated genes, there was a lower degree of concordant changes (33%), whereby 6 genes displayed hypomethylation in response to cold symptoms and hypermethylation in ALL, supporting the notion that early immune stimulation has a protective effect against ALL.
There are several plausible explanations as to why we did not observe a significant overlap between disease associated methylation change and exposure associated methylation changes for all exposures assessed. The simplest explanation, that DNA methylation may not be a key mechanism linking those particular exposures (i.e. folate supplementation, coffee consumption, and reported cold symptoms) and ALL risk. Whilst this is possible, we suggest alternative explanations for these findings. In this study we applied the ‘meet in the middle’ approach, due to rarity and difficulty obtaining appropriate data and samples within one cohort. The integration and overlap of the data sets analysed therefore originated from a combination of prospective and retrospective study designs. Moreover, in using data from different studies, the exposed and disease populations are likely to be different, which may have influenced study outcomes. The sample size may not have been sufficient to detect small exposure-associated methylation changes, thus a larger cohort may be needed to detect these small effects, for instance when investigating these associations in larger cohorts [54, 55] (Table 3) significant associations were found.
Here, ALL-associated exposures were measured via questionnaires completed by mothers of the offspring from whom DNA methylation data were available. This method of exposure measurement is not the most accurate due to reporting bias [76], and therefore it is plausible that we were unable to detect some methylation changes associated with some of these exposures due to the inaccuracy of the exposure data. All of the questions had categorised answers and mothers had to select the most appropriate, and no extra/unique data were collected regarding these exposures. In particular, the measure used to assess folate status was based on use of supplements. The utility of this measure can therefore easily be questioned, as dietary intakes of folate are likely to vary widely between individuals. Therefore, there is likely to be a high degree of variation between individuals both taking and not taking supplements, which would not have been captured using this method of assessment, and thus influence our outcome measure. In addition, the use of cold reporting in the first 6 month, whilst an objective measure, does not take into account other infections that may have been encountered during early life and consequently influence methylation patterns. Therefore, future studies using biomarkers or medical records to measure exposures may help improve accuracy of detecting methylation changes in response to some environmental exposures.
Also, whilst the use of hypergeometric tests gives a statistically based rationale to suggest that a higher degree of overlap between ALL and exposure associated genes in not due to a chance finding and therefore is likely to be a significant finding, it also could be argued that a biological rationale may be equally likely, whereby key methylation changes which result in a functional biological outcome are more important than a plethora of changes. For example, whilst the overlap for concordant genes for both smoking exposure and ALL was not statistically significant, it is plausible that hypermethylation of the CYP1A1 gene caused by smoking during or pre-pregnancy (and observed in ALL) has a biologically functional outcome (detoxification of compounds from tobacco smoke), which may be in the causal pathway towards disease. The protein encoded by the SEPT9 gene is involved in cytokinesis, and is crucial for the separation of daughter cells via mediation of midbody abscission [77], it has previously been implicated in cancers, including ALL, as well as potentially contributing to drug resistance [78]. Hypomethylation, as found in ALL and associated with maternal alcohol consumption in this study, could lead to over expression and enhanced cell survival. The TBX2 gene encodes a transcriptional repressor involved in the determination/maintenance of cell fate [79], found negatively dysregulated in B-cell acute lymphoblastic leukaemia [80], as well as in this study shown via hypomethylation in ALL, associated with maternal alcohol intake, and sugary caffeinated drink consumption. Using this rationale, it is possible that any or all of the concordant changes in methylation we have observed between risk factors and ALL, may be implicated in disease progression if functionally relevant in the causal pathway towards ALL manifestation.
Due to the difficulties in being able to obtain data and biological samples from childhood ALL cases prior to diagnosis there has been limited research investigating the potential mechanistic implications of risk exposures on disease development and progression. Here we provide evidence to suggest that a range of exposures which have been investigated in the context of ALL risk are associated with epigenetic variation. Further evaluation of this variation is required using more reliable exposure measurements, through replication in additional cohorts, and implementing causal analysis methods. International collaborations will also be key in generating the numbers required to capture sufficient ALL cases for a meaningful analysis. Furthermore, the analyses carried out here provides supporting evidence for the ability of environmental exposures to drive disease-associated changes in DNA methylation. If these changes occur pre-disease (possibly via multiple exposures), and in combination with other potential ALL-associated risk factors or ‘hits’, they could explain one contributing mechanistic pathway driving ALL disease incidence. Further work aiming to assess methylation patterns in ALL cases prior to disease diagnosis is warranted to provide additional evidence for a role of DNA methylation in the causal pathway towards disease, and with a view to develop predictive biomarkers of disease for high risk groups, and to strengthen the causal association between exposures and ALL risk to inform public health policy.
Supplementary Material
Brief description – “Novelty and Impact”.
Childhood acute lymphoblastic leukaemia (ALL) is a multi “hit” disease, whereby additional “hits” are required to transform pre-malignant cells carrying ALL-associated genetic translocations. We investigated the potential that environmental exposures associated with increased ALL risk influenced changes in DNA methylation patterns in offspring, and their concordance with methylation changes identified in ALL disease. We found statistically significant directionally concordant changes in methylation for maternal radiation exposure, alcohol, sugary caffeinated drinks; and childhood day care attendance.
Acknowledgements
We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses.
Financial support: Funding was provided via a studentship from the Institute of Health and Society, Newcastle University, UK, and from the North of England Children's Cancer Research charity (NECCR). CL Relton is supported by the UK Medical Research Council Integrative Epidemiology Unit (MC_UU_12013_2) and Cancer Research UK (C18281/A19169). The UK Medical Research Council and Wellcome (Grant ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors and Jessica A Timms, Caroline L Relton, Gemma C Sharp, Judith Rankin, Gordon Strathdee, and Jill A McKay will serve as guarantors for the contents of this paper. The Accessible Resource for Integrated Epigenomics Studies (ARIES) was funded by the UK Biotechnology and Biological Sciences Research Council (BB/I025751/1; and BB/I025263/1). No writing assistance was utilized in the production of this manuscript.
Footnotes
author contributions
Concept and design: Jessica A Timms, Caroline L Relton, Gemma C Sharp, Judith Rankin, Gordon Strathdee, Jill A McKay
Development of methodology: Jessica A Timms, Jill A McKay, Gemma C Sharp
Acquisition of data: Jessica A Timms, Caroline L Relton, Gemma C Sharp, Jill A McKay
Analysis and interpretation of data: Jessica A Timms, Gordon Strathdee, Jill A McKay
Writing, review, and/or revision of the manuscript: Jessica A Timms, Caroline L Relton, Judith Rankin, Gordon Strathdee, Jill A McKay
Conflict of interest
The authors have no other relevant affiliations or financial involvement with any organisation or entity with a financial interest in or financial conflict with subject matter or materials discussed in the manuscript apart from those disclosed.
References
- 1.Siegel DA, Henley SJ, Li J, Pollack LA, Van Dyne EA, White A. Rates and Trends of Pediatric Acute Lymphoblastic Leukemia - United States, 2001-2014. MMWR Morb Mortal Wkly Rep. 2017;66(36):950–954. doi: 10.15585/mmwr.mm6636a3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Board, P.D.Q.P.T.E. PDQ Cancer Information Summaries 2002-2017. National Cancer Institute (US); Bethesda (MD): Childhood Acute Lymphoblastic Leukemia Treatment (PDQ(R)): Health Professional Version. [Google Scholar]
- 3.Linet MS, Brown LM, Mbulaiteye SM, Check D, Ostroumova E, Landgren A, Devesa SS. International long-term trends and recent patterns in the incidence of leukemias and lymphomas among children and adolescents ages 0-19 years. Int J Cancer. 2016;138(8):1862–74. doi: 10.1002/ijc.29924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Alm SJ, Engvall C, Asp J, Palmqvist L, Abrahamsson J, Fogelstrand L. Minimal residual disease monitoring in childhood B lymphoblastic leukemia with t(12;21)(p13;q22); ETV6-RUNX1: concordant results using quantitation of fusion transcript and flow cytometry. Int J Lab Hematol. 2017;39(2):121–128. doi: 10.1111/ijlh.12593. [DOI] [PubMed] [Google Scholar]
- 5.Pui C-H, Robison LL, Look AT. Acute lymphoblastic leukaemia. The Lancet. 2008;371(9617):1030–1043. doi: 10.1016/S0140-6736(08)60457-2. [DOI] [PubMed] [Google Scholar]
- 6.Greaves MF, Wiemels J. Origins of chromosome translocations in childhood leukaemia. Nat Rev Cancer. 2003;3(9):639–49. doi: 10.1038/nrc1164. [DOI] [PubMed] [Google Scholar]
- 7.Greaves MF, Maia AT, Wiemels JL, Ford AM. Leukemia in twins: lessons in natural history. Blood. 2003;102(7):2321–33. doi: 10.1182/blood-2002-12-3817. [DOI] [PubMed] [Google Scholar]
- 8.Carceles-Alvarez A, Ortega-Garcia JA, Lopez-Hernandez FA, Orozco-Llamas M, Espinosa-Lopez B, Tobarra-Sanchez E, Alvarez L. Spatial clustering of childhood leukaemia with the integration of the Paediatric Environmental History. Environ Res. 2017;156:605–612. doi: 10.1016/j.envres.2017.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wiemels J. Perspectives on the causes of childhood leukemia. Chemico-Biological Interactions. 2012;196(3):59–67. doi: 10.1016/j.cbi.2012.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Whitehead TP, Metayer C, Wiemels JL, Singer AW, Miller MD. Childhood Leukemia and Primary Prevention. Curr Probl Pediatr Adolesc Health Care. 2016;46(10):317–352. doi: 10.1016/j.cppeds.2016.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rudant J, Lightfoot T, Urayama KY, Petridou E, Dockerty JD, Magnani C, Milne E, Spector LG, Ashton LJ, Dessypris N, Kang AY, et al. Childhood acute lymphoblastic leukemia and indicators of early immune stimulation: a Childhood Leukemia International Consortium study. Am J Epidemiol. 2015;181(8):549–62. doi: 10.1093/aje/kwu298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ajrouche R, Rudant J, Orsi L, Petit A, Baruchel A, Lambilliotte A, Gambart M, Michel G, Bertrand Y, Ducassou S, Gandemer V, et al. Childhood acute lymphoblastic leukaemia and indicators of early immune stimulation: the Estelle study (SFCE) Br J Cancer. 2015;112(6):1017–26. doi: 10.1038/bjc.2015.53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.de Smith AJ, Kaur M, Gonseth S, Endicott A, Selvin S, Zhang L, Roy R, Shao X, Hansen HM, Kang AY, Walsh KM, et al. Correlates of Prenatal and Early-Life Tobacco Smoke Exposure and Frequency of Common Gene Deletions in Childhood Acute Lymphoblastic Leukemia. Cancer Research. 2017;77(7):1674. doi: 10.1158/0008-5472.CAN-16-2571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Rumrich IK, Viluksela M, Vahakangas K, Gissler M, Surcel HM, Hanninen O. Maternal Smoking and the Risk of Cancer in Early Life - A Meta-Analysis. PLoS One. 2016;11(11):e0165040. doi: 10.1371/journal.pone.0165040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Menegaux F, Steffen C, Bellec S, Baruchel A, Lescoeur B, Leverger G, Nelken B, Philippe N, Sommelet D, Hemon D, Clavel J. Maternal coffee and alcohol consumption during pregnancy, parental smoking and risk of childhood acute leukaemia. Cancer Detect Prev. 2005;29(6):487–93. doi: 10.1016/j.cdp.2005.06.008. [DOI] [PubMed] [Google Scholar]
- 16.Karalexi MA, Dessypris N, Thomopoulos TP, Ntouvelis E, Kantzanou M, Diamantaras AA, Moschovi M, Baka M, Hatzipantelis E, Kourti M, Polychronopoulou S, et al. Parental alcohol consumption and risk of leukemia in the offspring: a systematic review and meta-analysis. Eur J Cancer Prev. 2017;26(5):433–441. doi: 10.1097/CEJ.0000000000000350. [DOI] [PubMed] [Google Scholar]
- 17.Thomopoulos TP, Ntouvelis E, Diamantaras AA, Tzanoudaki M, Baka M, Hatzipantelis E, Kourti M, Polychronopoulou S, Sidi V, Stiakaki E, Moschovi M, et al. Maternal and childhood consumption of coffee, tea and cola beverages in association with childhood leukemia: a meta-analysis. Cancer Epidemiol. 2015;39(6):1047–59. doi: 10.1016/j.canep.2015.08.009. [DOI] [PubMed] [Google Scholar]
- 18.Kałużna EM, Strauss E, Świątek-Kościelna B, Zając-Spychała O, Gowin E, Nowak JS, Rembowska J, Januszkiewicz-Lewandowska D. The methylenetetrahydrofolate reductase 677T-1298C haplotype is a risk factor for acute lymphoblastic leukemia in children. Medicine. 2017;96(51):e9290. doi: 10.1097/MD.0000000000009290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dessypris N, Karalexi MA, Ntouvelis E, Diamantaras AA, Papadakis V, Baka M, Hatzipantelis E, Kourti M, Moschovi M, Polychronopoulou S, Sidi V, et al. Association of maternal and index child's diet with subsequent leukemia risk: A systematic review and meta analysis. Cancer Epidemiol. 2017;47:64–75. doi: 10.1016/j.canep.2017.01.003. [DOI] [PubMed] [Google Scholar]
- 20.El-Rashedi FH, El-Hawy MA, El-Hefnawy SM, Mohammed MM. HFE gene mutation and iron overload in Egyptian pediatric acute lymphoblastic leukemia survivors: a single-center study. Hematology. 2017;22(7):398–404. doi: 10.1080/10245332.2017.1289324. [DOI] [PubMed] [Google Scholar]
- 21.Bailey HD, Metayer C, Milne E, Petridou ET, Infante-Rivard C, Spector LG, Clavel J, Dockerty JD, Zhang L, Armstrong BK, Rudant J, et al. Home paint exposures and risk of childhood acute lymphoblastic leukemia: findings from the Childhood Leukemia International Consortium. Cancer Causes Control. 2015;26(9):1257–70. doi: 10.1007/s10552-015-0618-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Heck JE, Park AS, Qiu J, Cockburn M, Ritz B. Risk of leukemia in relation to exposure to ambient air toxics in pregnancy and early childhood. Int J Hyg Environ Health. 2014;217(6):662–8. doi: 10.1016/j.ijheh.2013.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gunier RB, Kang A, Hammond SK, Reinier K, Lea CS, Chang JS, Does M, Scelo G, Kirsch J, Crouse V, Cooper R, et al. A task-based assessment of parental occupational exposure to pesticides and childhood acute lymphoblastic leukemia. Environmental Research. 2017;156:57–62. doi: 10.1016/j.envres.2017.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bailey HD, Infante-Rivard C, Metayer C, Clavel J, Lightfoot T, Kaatsch P, Roman E, Magnani C, Spector LG, Th Petridou E, Milne E, et al. Home pesticide exposures and risk of childhood leukemia: Findings from the childhood leukemia international consortium. Int J Cancer. 2015 doi: 10.1002/ijc.29631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Metayer C, Colt JS, Buffler PA, Reed HD, Selvin S, Crouse V, Ward MH. Exposure to herbicides in house dust and risk of childhood acute lymphoblastic leukemia. J Expo Sci Environ Epidemiol. 2013;23(4):363–70. doi: 10.1038/jes.2012.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Metayer C, Milne E, Dockerty JD, Clavel J, Pombo-de-Oliveira MS, Wesseling C, Spector LG, Schuz J, Petridou E, Ezzat S, Armstrong BK, et al. Maternal supplementation with folic acid and other vitamins and risk of leukemia in offspring: a Childhood Leukemia International Consortium study. Epidemiology. 2014;25(6):811–22. doi: 10.1097/EDE.0000000000000141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Orsi L, Rudant J, Ajrouche R, Leverger G, Baruchel A, Nelken B, Pasquet M, Michel G, Bertrand Y, Ducassou S, Gandemer V, et al. Parental smoking, maternal alcohol, coffee and tea consumption during pregnancy, and childhood acute leukemia: the ESTELLE study. Cancer Causes Control. 2015;26(7):1003–17. doi: 10.1007/s10552-015-0593-5. [DOI] [PubMed] [Google Scholar]
- 28.Dorak MT, Burnett AK, Worwood M, Sproul AM, Gibson BES. The C282Y Mutation of HFE Is Another Male-Specific Risk Factor for Childhood Acute Lymphoblastic Leukemia. Blood. 1999;94(11):3957–3958. [PubMed] [Google Scholar]
- 29.Thompson JR, Gerald PF, Willoughby MLN, Armstrong BK. Maternal folate supplementation in pregnancy and protection against acute lymphoblastic leukaemia in childhood: a case-control study. The Lancet. 2001;358(9297):1935–1940. doi: 10.1016/S0140-6736(01)06959-8. [DOI] [PubMed] [Google Scholar]
- 30.Kennedy AE, Kamdar KY, Lupo PJ, Okcu MF, Scheurer ME, Baum MK, Dorak MT. Examination of HFE associations with childhood leukemia risk and extension to other iron regulatory genes. Leuk Res. 2014;38(9):1055–60. doi: 10.1016/j.leukres.2014.06.016. [DOI] [PubMed] [Google Scholar]
- 31.Brent RL. Carcinogenic risks of prenatal ionizing radiation. Semin Fetal Neonatal Med. 2014;19(3):203–13. doi: 10.1016/j.siny.2013.11.009. [DOI] [PubMed] [Google Scholar]
- 32.Pearce MS, Salotti JA, Little MP, McHugh K, Lee C, Kim KP, Howe NL, Ronckers CM, Rajaraman P, Sir Craft AW, Parker L, et al. Radiation exposure from CT scans in childhood and subsequent risk of leukaemia and brain tumours: a retrospective cohort study. Lancet. 2012;380(9840):499–505. doi: 10.1016/S0140-6736(12)60815-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.McKay JA, Mathers JC. Diet induced epigenetic changes and their implications for health. Acta Physiol (Oxf) 2011;202(2):103–18. doi: 10.1111/j.1748-1716.2011.02278.x. [DOI] [PubMed] [Google Scholar]
- 34.Vickers MH. Early life nutrition, epigenetics and programming of later life disease. Nutrients. 2014;6(6):2165–78. doi: 10.3390/nu6062165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bakulski KM, Fallin MD. Epigenetic epidemiology: promises for public health research. Environ Mol Mutagen. 2014;55(3):171–83. doi: 10.1002/em.21850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hattori N, Ushijima T. Compendium of aberrant DNA methylation and histone modifications in cancer. Biochem Biophys Res Commun. 2014;455(1-2):3–9. doi: 10.1016/j.bbrc.2014.08.140. [DOI] [PubMed] [Google Scholar]
- 37.Mill J, Heijmans BT. From promises to practical strategies in epigenetic epidemiology. Nat Rev Genet. 2013;14(8):585–94. doi: 10.1038/nrg3405. [DOI] [PubMed] [Google Scholar]
- 38.Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13(7):484–492. doi: 10.1038/nrg3230. [DOI] [PubMed] [Google Scholar]
- 39.Nordlund J, Backlin C, Wahlberg P, Busche S, Berglund E, Eloranta M-L, Flaegstad T, Forestier E, Frost B-M, Harila-Saari A, Heyman M, et al. Genome-wide signatures of differential DNA methylation in pediatric acute lymphoblastic leukemia. Genome Biology. 2013;14(9) doi: 10.1186/gb-2013-14-9-r105. r105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Xu C-J, Bonder MJ, Söderhäll C, Bustamante M, Baïz N, Gehring U, Jankipersadsing SA, van der Vlies P, van Diemen CC, van Rijkom B, Just J, et al. The emerging landscape of dynamic DNA methylation in early childhood. BMC Genomics. 2017;18(1):25. doi: 10.1186/s12864-016-3452-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Dolinoy DC, Das R, Weidman JR, Jirtle RL. Metastable epialleles, imprinting, and the fetal origins of adult diseases. Pediatr Res. 2007;61(5 Pt 2):30r–37r. doi: 10.1203/pdr.0b013e31804575f7. [DOI] [PubMed] [Google Scholar]
- 42.Mathers JC, McKay JA. Epigenetics - potential contribution to fetal programming. Adv Exp Med Biol. 2009;646:119–23. doi: 10.1007/978-1-4020-9173-5_13. [DOI] [PubMed] [Google Scholar]
- 43.Wong NC, Ashley D, Chatterton Z, Parkinson-Bates M, Ng HK, Halemba MS, Kowalczyk A, Bedo J, Wang Q, Bell K, Algar E, et al. A distinct DNA methylation signature defines pediatric pre-B cell acute lymphoblastic leukemia. Epigenetics. 2012;7(6):535–41. doi: 10.4161/epi.20193. [DOI] [PubMed] [Google Scholar]
- 44.Dunwell T, Hesson L, Rauch TA, Wang L, Clark RE, Dallol A, Gentle D, Catchpoole D, Maher ER, Pfeifer GP, Latif F. A genome-wide screen identifies frequently methylated genes in haematological and epithelial cancers. Mol Cancer. 2010;9:44. doi: 10.1186/1476-4598-9-44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Davidsson J, Lilljebjorn H, Andersson A, Veerla S, Heldrup J, Behrendtz M, Fioretos T, Johansson B. The DNA methylome of pediatric acute lymphoblastic leukemia. Hum Mol Genet. 2009;18(21):4054–65. doi: 10.1093/hmg/ddp354. [DOI] [PubMed] [Google Scholar]
- 46.Garcia-Manero G, Yang H, Kuang S-Q, O’Brien S, Thomas D, Kantarjian H. Epigenetics of acute lymphocytic leukemia. Seminars in hematology. 2009;46(1) doi: 10.1053/j.seminhematol.2008.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Nordlund J, Backlin CL, Zachariadis V, Cavelier L, Dahlberg J, Ofverholm I, Barbany G, Nordgren A, Overnas E, Abrahamsson J, Flaegstad T, et al. DNA methylation-based subtype prediction for pediatric acute lymphoblastic leukemia. Clin Epigenetics. 2015;7(1):11. doi: 10.1186/s13148-014-0039-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Vineis P, Perera F. Molecular epidemiology and biomarkers in etiologic cancer research: the new in light of the old. Cancer Epidemiol Biomarkers Prev. 2007;16(10):1954–65. doi: 10.1158/1055-9965.EPI-07-0457. [DOI] [PubMed] [Google Scholar]
- 49.Timms JA, Relton CL, Rankin J, Strathdee G, McKay JA. DNA methylation as a potential mediator of environmental risks in the development of childhood acute lymphoblastic leukemia. Epigenomics. 2016;8(4):519–36. doi: 10.2217/epi-2015-0011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Boyd A, Golding J, Macleod J, Lawlor DA, Fraser A, Henderson J, Molloy L, Ness A, Ring S, Davey Smith G. Cohort Profile: the 'children of the 90s'--the index offspring of the Avon Longitudinal Study of Parents and Children. Int J Epidemiol. 2013;42(1):111–27. doi: 10.1093/ije/dys064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Fraser A, Macdonald-Wallis C, Tilling K, Boyd A, Golding J, Davey Smith G, Henderson J, Macleod J, Molloy L, Ness A, Ring S, et al. Cohort Profile: the Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int J Epidemiol. 2013;42(1):97–110. doi: 10.1093/ije/dys066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Golding J, Pembrey M, Jones R. ALSPAC--the Avon Longitudinal Study of Parents and Children. I. Study methodology. Paediatr Perinat Epidemiol. 2001;15(1):74–87. doi: 10.1046/j.1365-3016.2001.00325.x. [DOI] [PubMed] [Google Scholar]
- 53.Relton CL, Gaunt T, McArdle W, Ho K, Duggirala A, Shihab H, Woodward G, Lyttleton O, Evans DM, Reik W, Paul YL, et al. Data Resource Profile: Accessible Resource for Integrated Epigenomic Studies (ARIES) Int J Epidemiol. 2015 doi: 10.1093/ije/dyv072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Chatterton Z, Morenos L, Mechinaud F, Ashley DM, Craig JM, Sexton-Oates A, Halemba MS, Parkinson-Bates M, Ng J, Morrison D, Carroll WL, et al. Epigenetic deregulation in pediatric acute lymphoblastic leukemia. Epigenetics. 2014;9(3):459–67. doi: 10.4161/epi.27585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Figueroa ME, Chen SC, Andersson AK, Phillips LA, Li Y, Sotzen J, Kundu M, Downing JR, Melnick A, Mullighan CG. Integrated genetic and epigenetic analysis of childhood acute lymphoblastic leukemia. J Clin Invest. 2013;123(7):3099–111. doi: 10.1172/JCI66203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sandoval J, Heyn H, Mendez-Gonzalez J, Gomez A, Moran S, Baiget M, Melo M, Badell I, Nomdedeu JF, Esteller M. Genome-wide DNA methylation profiling predicts relapse in childhood B-cell acute lymphoblastic leukaemia. Br J Haematol. 2013;160(3):406–9. doi: 10.1111/bjh.12113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Almamun M, Levinson BT, van Swaay AC, Johnson NT, McKay SD, Arthur GL, Davis JW, Taylor KH. Integrated methylome and transcriptome analysis reveals novel regulatory elements in pediatric acute lymphoblastic leukemia. Epigenetics. 2015;10(9):882–90. doi: 10.1080/15592294.2015.1078050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Schafer E, Irizarry R, Negi S, McIntyre E, Small D, Figueroa ME, Melnick A, Brown P. Promoter hypermethylation in MLL-r infant acute lymphoblastic leukemia: biology and therapeutic targeting. Blood. 2010;115(23):4798–809. doi: 10.1182/blood-2009-09-243634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Pidsley R, CC YW, Volta M, Lunnon K, Mill J, Schalkwyk LC. A data-driven approach to preprocessing Illumina 450K methylation array data. BMC Genomics. 2013;14:293. doi: 10.1186/1471-2164-14-293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Touleimat N, Tost J. Complete pipeline for Infinium((R)) Human Methylation 450K BeadChip data processing using subset quantile normalization for accurate DNA methylation estimation. Epigenomics. 2012;4(3):325–41. doi: 10.2217/epi.12.21. [DOI] [PubMed] [Google Scholar]
- 61.Fortin JP, Labbe A, Lemire M, Zanke BW, Hudson TJ, Fertig EJ, Greenwood CM, Hansen KD. Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome Biol. 2014;15(12):503. doi: 10.1186/s13059-014-0503-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.de Goede OM, Razzaghian HR, Price EM, Jones MJ, Kobor MS, Robinson WP, Lavoie PM. Nucleated red blood cells impact DNA methylation and expression analyses of cord blood hematopoietic cells. Clin Epigenetics. 2015;7:95. doi: 10.1186/s13148-015-0129-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, Irizarry RA. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9. doi: 10.1093/bioinformatics/btu049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, Wiencke JK, Kelsey KT. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13:86. doi: 10.1186/1471-2105-13-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Methodological) 1995;57(1):289–300. [Google Scholar]
- 66.Sharp GC, Lawlor DA, Richmond RC, Fraser A, Simpkin A, Suderman M, Shihab HA, Lyttleton O, McArdle W, Ring SM, Gaunt TR, et al. Maternal pre-pregnancy BMI and gestational weight gain, offspring DNA methylation and later offspring adiposity: findings from the Avon Longitudinal Study of Parents and Children. Int J Epidemiol. 2015;44(4):1288–304. doi: 10.1093/ije/dyv042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Sherman BT, Huang da W, Tan Q, Guo Y, Bour S, Liu D, Stephens R, Baseler MW, Lane HC, Lempicki RA. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis. BMC Bioinformatics. 2007;8:426. doi: 10.1186/1471-2105-8-426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Joubert Bonnie R, Felix Janine F, Yousefi P, Bakulski Kelly M, Just Allan C, Breton C, Reese SE, Markunas Christina A, Richmond Rebecca C, Xu C-J, Küpers Leanne K, et al. DNA Methylation in Newborns and Maternal Smoking in Pregnancy: Genome-wide Consortium Meta-analysis. The American Journal of Human Genetics. 2016;98(4):680–696. doi: 10.1016/j.ajhg.2016.02.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Joubert BR, den Dekker HT, Felix JF, Bohlin J, Ligthart S, Beckett E, Tiemeier H, van Meurs JB, Uitterlinden AG, Hofman A, Haberg SE, et al. Maternal plasma folate impacts differential DNA methylation in an epigenome-wide meta-analysis of newborns. Nat Commun. 2016;7 doi: 10.1038/ncomms10577. 10577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Greaves M. Infection, immune responses and the aetiology of childhood leukaemia. Nat Rev Cancer. 2006;6(3):193–203. doi: 10.1038/nrc1816. [DOI] [PubMed] [Google Scholar]
- 71.Greaves M. A causal mechanism for childhood acute lymphoblastic leukaemia. Nat Rev Cancer. 2018 doi: 10.1038/s41568-018-0015-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Hall K, Frederiksen B, Rewers M, Norris JM. Daycare Attendance, Breastfeeding, and the Development of Type 1 Diabetes: The Diabetes Autoimmunity Study in the Young. BioMed Research International. 2015;2015:203947. doi: 10.1155/2015/203947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Crosnoe R, Purtell KM, Davis-Kean P, Ansari A, Benner AD. The Selection of Children from Low-Income Families into Preschool. Developmental psychology. 2016;52(4):599–612. doi: 10.1037/dev0000101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Hrušák O, Trka J, Zuna J, Poloučková A, Kalina T, Starý J. Acute lymphoblastic leukemia incidence during socioeconomic transition: selective increase in children from 1 to 4 years. Leukemia. 2002;16:720. doi: 10.1038/sj.leu.2402422. [DOI] [PubMed] [Google Scholar]
- 75.Needham BL, Smith JA, Zhao W, Wang X, Mukherjee B, Kardia SL, Shively CA, Seeman TE, Liu Y, Diez Roux AV. Life course socioeconomic status and DNA methylation in genes related to stress reactivity and inflammation: The multi-ethnic study of atherosclerosis. Epigenetics. 2015;10(10):958–69. doi: 10.1080/15592294.2015.1085139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Connor Gorber S, Schofield-Hurwitz S, Hardt J, Levasseur G, Tremblay M. The accuracy of self-reported smoking: a systematic review of the relationship between self-reported and cotinine-assessed smoking status. Nicotine Tob Res. 2009;11(1):12–24. doi: 10.1093/ntr/ntn010. [DOI] [PubMed] [Google Scholar]
- 77.Estey MP, Di Ciano-Oliveira C, Froese CD, Bejide MT, Trimble WS. Distinct roles of septins in cytokinesis: SEPT9 mediates midbody abscission. J Cell Biol. 2010;191(4):741–9. doi: 10.1083/jcb.201006031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Costa O, Schneider P, Coquet L, Chan P, Penther D, Legrand E, Jouenne T, Vasse M, Vannier J-P. Proteomic profile of pre - B2 lymphoblasts from children with acute lymphoblastic leukemia (ALL) in relation with the translocation (12; 21) Clinical Proteomics. 2014;11(1):31–31. doi: 10.1186/1559-0275-11-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Bilican B, Goding CR. Cell cycle regulation of the T-box transcription factor tbx2. Experimental Cell Research. 2006;312(12):2358–2366. doi: 10.1016/j.yexcr.2006.03.033. [DOI] [PubMed] [Google Scholar]
- 80.Lee S-T, Muench MO, Fomin ME, Xiao J, Zhou M, de Smith A, Martín-Subero JI, Heath S, Houseman EA, Roy R, Wrensch M, et al. Epigenetic remodeling in B-cell acute lymphoblastic leukemia occurs in two tracks and employs embryonic stem cell-like signatures. Nucleic Acids Research. 2015;43(5):2590–2602. doi: 10.1093/nar/gkv103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 1.Figueroa ME, et al. Integrated genetic and epigenetic analysis of childhood acute lymphoblastic leukemia. J Clin Invest. 2013;123(7):3099–111. doi: 10.1172/JCI66203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sandoval J, et al. Genome-wide DNA methylation profiling predicts relapse in childhood B-cell acute lymphoblastic leukaemia. Br J Haematol. 2013;160(3):406–9. doi: 10.1111/bjh.12113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Almamun M, et al. Integrated methylome and transcriptome analysis reveals novel regulatory elements in pediatric acute lymphoblastic leukemia. Epigenetics. 2015;10(9):882–90. doi: 10.1080/15592294.2015.1078050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Schafer E, et al. Promoter hypermethylation in MLL-r infant acute lymphoblastic leukemia: biology and therapeutic targeting. Blood. 2010;115(23):4798–809. doi: 10.1182/blood-2009-09-243634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wong NC, et al. A distinct DNA methylation signature defines pediatric pre-B cell acute lymphoblastic leukemia. Epigenetics. 2012;7(6):535–41. doi: 10.4161/epi.20193. [DOI] [PubMed] [Google Scholar]
- 6.Dunwell T, et al. A genome-wide screen identifies frequently methylated genes in haematological and epithelial cancers. Mol Cancer. 2010;9:44. doi: 10.1186/1476-4598-9-44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Davidsson J, et al. The DNA methylome of pediatric acute lymphoblastic leukemia. Hum Mol Genet. 2009;18(21):4054–65. doi: 10.1093/hmg/ddp354. [DOI] [PubMed] [Google Scholar]
- 8.Joubert Bonnie R, et al. DNA Methylation in Newborns and Maternal Smoking in Pregnancy: Genome-wide Consortium Meta-analysis. The American Journal of Human Genetics. 2016;98(4):680–696. doi: 10.1016/j.ajhg.2016.02.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Joubert BR, et al. Maternal plasma folate impacts differential DNA methylation in an epigenome-wide meta-analysis of newborns. Nat Commun. 2016;7:10577. doi: 10.1038/ncomms10577. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.