Abstract
Recent associations between age-related differentially methylated sites and bivalently marked chromatin domains have implicated a role for these genomic regions in aging and age-related diseases. However, the overlap between such epigenetic modifications has so far only been identified with respect to age-associated hyper-methylated sites in blood. In this study, we observed that age-associated differentially methylated sites characterized in the human brain were also highly enriched in bivalent domains. Analysis of hyper- vs. hypo-methylated sites partitioned by age (fetal, child, and adult) revealed that enrichment was significant for hyper-methylated sites identified in children and adults (child, fold difference = 2.28, P = 0.0016; adult, fold difference = 4.73, P = 4.00×10−5); this trend was markedly more pronounced in adults when only the top 100 most significantly hypo- and hyper-methylated sites were considered (adult, fold difference = 10.7, P = 2.00×10−5). Interestingly, we found that bivalently marked genes overlapped by age-associated hyper-methylation in the adult brain had strong involvement in biological functions related to developmental processes, including neuronal differentiation. Our findings provide evidence that the accumulation of methylation in bivalent gene regions with age is likely to be a common process that occurs across tissue types. Furthermore, particularly with respect to the aging brain, this accumulation might be targeted to loci with important roles in cell differentiation and development, and the closing off of these developmental pathways. Further study of these genes is warranted to assess their potential impact upon the development of age-related neurological disorders.
Introduction
Epigenetic modifications, such as DNA methylation and post-translational modifications of histone proteins, are indispensable for many aspects of genome function, including gene expression. DNA methylation patterns at specific CpG-sites can vary over time within an individual and correspondingly, age-related methylation changes have been identified in multiple tissues and organisms [1]–[8]. A recent study of changes in DNA methylation in blood cells associated with aging revealed an enrichment of differentially-methylated sites, specifically those exhibiting increased methylation with age (hyper-methylation), in bivalently marked regions of the genome marked by the presence of both trimethylated lysine 4 and 27 in histone H3 (H3K4me3 and H3K27me3 respectively)[3]. Due to the predominant effect of H3K27me3, bivalent domains are known to take on repressed states while remaining ‘poised’ for active gene expression, a feature important to cell pluripotency and differentiation [9], [10].
Interestingly, the accumulation of DNA methylation at bivalent domains has been proposed to impact gene expression programs associated with both cell differentiation and carcinogenesis [11]. Given the relationship between cancer and aging, the finding that age-associated hyper-methylation has been shown to preferentially occur within bivalent domains is intriguing and warrants further study [3], and also suggests that the build up of DNA methylation at bivalent domains with age may occur in additional tissues with potential roles in the pathogenesis of other age-related disorders. In the current study, we sought to build upon our understanding of age-associated DNA methylation by investigating whether recently characterized age-associated DNA hyper- and hypo-methylated loci in the human brain [8] had a similar relationship with bivalently marked chromatin domains in human embryonic stem cells [12].
Methods
Data acquisition and mining
Raw association data, including P-values for 27,578 genome-wide CpG sites from [8], were kindly provided by the authors of the study; these methylation data were generated from human brain samples (dorsolateral prefrontal cortex) of 108 subjects using the Illumina Infinium HumanMethylation27 BeadChip [8]. For the current study, a Bonferonni corrected P-value (1.81304E-06) was applied to determine CpG sites with the most significant age-associated methylation changes within each of three age groups, as defined by Numata et al. [8]: fetal, child (0–10 years of age), and adult (>10 years of age). After applying this correction, sites were first partitioned into hypo-vs. hyper-methylated sites; hyper-methylated sites were defined as those with slopes >0.0 (i.e., those at which methylation increased with age), whereas hypo-methylated sites were defined as those with slopes <0.0. Data were then further partitioned into additional subsets: (1) all significant sites for each age group (using P-value <1.81304E-06), (2) significant sites unique to each age group (i.e., those sites with significant P-values <1.81304E-06 in a given age group that were not also significant (P-value >1.81304E-06) in either of the other two age groups), and (3) the top 100 most significant non-overlapping sites per age group. In addition, CpG sites with reported age-related changes in methylation identified in blood were also used [3], [4], [7], including aging-associated differentially methylated sites from samples of patients with type-1 diabetes [4]. When necessary, differentially methylated sites from these studies were also designated as either hyper- or hypo-methylated. Coordinates of CpG probe IDs from brain and blood studies were considered as single points in analyses below, based on the first base pair position of each probe. For bivalent chromatin sites, gene coordinates (+−2 KB) of 1,792 previously identified genes associated with co-methylation of H3K4 and H3K27 in human hES3 cells [12] were used. Probe and gene coordinates were based on the hg18 (NCBI 36) human genome reference build.
Data Analysis
We used the Genomic Hyperbrowser (http://hyperbrowser.uio.no/hb/)[13] to perform all analyses on genome coordinates. Enrichment was defined as the ratio of the proportion of probe coordinates covered by bivalent gene regions, to the proportion of non-probe coordinates covered by bivalent gene regions. The significance of overlap was tested at a whole genome scale, by asking whether age-associated probe coordinates fell within bivalent gene coordinates more often than expected by chance; for this, we defined a null model in which probe coordinates were preserved exactly, and where the positions of bivalent gene coordinates were randomized while preserving the observed distribution of lengths and distances between these bivalent gene coordinates. The base pair overlap between the two tracks was calculated for the real data, as well as for 50,000 Monte Carlo samples from the null model. P-values were calculated as the proportion of Monte Carlo samples being equal to or more extreme than the observed overlap.
For comparisons of the degree of overlap observed between bivalent gene loci and different subsets of age-associated differentially methylated sites (e.g. adult brain hyper- vs. hypo-methylated sites), case-control tracks were created for each pair of subsets. In each instance, case and control tracks consisted of probe coordinates unique to either of the two subsets being compared. For this analysis, Monte Carlo simulations (n = 50,000) were performed in which case-control labels for each of the coordinates were permuted randomly. P-values were calculated by comparing the observed base pair overlap between case tracks and bivalent regions against the corresponding distribution for 50,000 Monte Carlo samples. The fold enrichment difference in overlap between case and control tracks was calculated as the ratio between the proportions of case and control tracks that overlapped with bivalent regions.
We used DAVID [14], [15] to assess the biological function of genes associated with hyper-methylated sites unique to adult brain [8], considering those that overlapped bivalent sites compared to those that did not [12]. Results for gene ontology were assessed using the Functional Annotation Tool (GOTERM_BP_FAT).
Results
Enrichment of age-associated differentially methylated sites in bivalent chromatin domains
Of the 27,578 CpG sites investigated across three age groups, fetal, child, and adult, Numata et al. [8] identified 865, 5,506, and 10,578 loci, respectively, that exhibited age-associated changes in DNA methylation based on a false discovery rate of 5%. To further refine this data set, we applied a Bonferonni correction to the uncorrected data, and compiled a list of 125 fetal, 888 child, and 2,756 adult sites, with the most shared sites between children and adults (child/adult, n = 439; child/fetal, n = 12; fetal/adult, n = 14). After removing shared loci between age groups, there were a total of 2,851 non-overlapping sites (fetal, n = 103; child, n = 441; adult, n = 2,307; Table 1).
Table 1. Global enrichment of differentially methylated loci in bivalent chromatin domains.
Global Enrichment | ||
bivalent sites (n = 1792) | P value | |
Brain Adult (n = 2756) [8] | 3.063 | 0.00002 |
Brain Child (n = 888) [8] | 2.814 | 0.00002 |
Brain Fetal (n = 125) [8] | 0.7914 | 0.7809 |
Brain Adult only (n = 2307) [8] | 3.066 | 0.00002 |
Brain Child only (n = 441) [8] | 2.616 | 0.00002 |
Brain Fetal only (n = 103) [8] | 0.8008 | 0.7491 |
Rakyan et al. [3] (n = 131) | 12.86 | 0.00002 |
Alisch et al. [7] (n = 2026) | 2.09 | 0.00002 |
Teschendorff et al. ‘S3’ [4] (n = 583) | 3.495 | 0.00002 |
Teschendorff et al. ‘S5’ [4] , T1D (n = 683) | 5.294 | 0.00002 |
We tested for enrichment of brain DNA methylation sites identified in each age group with bivalent chromatin domains. We did this by first considering all age-associated sites in each age group (Table 1); both adult and child sites showed significant enrichment in bivalent gene regions (adult, enrichment = 3.06, P = 0.00002; child, enrichment = 2.81, P = 0.00002), whereas the enrichment of fetal sites in bivalent gene loci was non-significant (enrichment = 0.79, P = 0.78). Global enrichment values changed only slightly when sites unique to each age group were tested (adult, enrichment = 3.07, P = 0.00002; child, enrichment = 2.62, P = 0.00002; fetal, enrichment = 0.80, P = 0.75; Table 1).
As a point of comparison, we also tested for overlap between bivalent gene regions and age-associated differentially methylated sites identified in blood samples from three independent studies [3], [4], [7]. Two of these used samples collected from adults [3], [4], and the third was based on sampling from two child cohorts [7]. Data from Rakyan et al. [3] included only hyper-methylated sites identified in blood that overlapped those identified in CD4+ T cells and CD14+ monocytes. In all cases, sites identified from each of these three studies showed significant overlap with bivalent gene regions (Table 1). The highest enrichment value (12.86; P = 0.00002) was observed for sites identified in Rakyan et al. [3], which had previously been analyzed with respect to their overlap with bivalent sites identified in Zhao et al. [12], and was the motivation for this study. We also tested for overlap of bivalent gene regions and 683 differentially methylated sites identified in a cohort of T1D patients [4] and observed a significant global enrichment value of 5.29 (P = 0.00002).
Increased enrichment of hyper- vs. hypo-methylated sites in bivalent genes
Given the high degree of overlap between bivalent loci and sites identified by Rakyan et al. [3](Table 1), which consisted of the top 131 most significantly differentiated hyper-methylated sites associated with age in their samples, we attempted to further partition age-associated sites in brain to better match this dataset. We first partitioned data from Numata et al. [8] by hyper- versus hypo-methylated sites unique to each age group and tested these for enrichment in bivalent gene regions. In child and adult age groups, hyper-methylated sites were enriched in bivalent gene regions, whereas hypo-methylated sites were not (Table 2). In addition, the overlap of hyper-methylated sites with bivalent gene regions was significantly greater than hypo-methylated sites in both children and adults (child, fold difference = 2.28, P = 0.0016; adult, fold difference = 4.73, P = 0.00004; Table 2). A similar trend was noted for fetal-only sites, although the difference between hypo- and hyper-methylated sites was not significant (fold difference = 4.44, P = 0.099). A significant difference between the overlap of bivalent genes observed for hyper- and hypo-methylated sites in adult blood [4] was also observed (hyper>hypo, fold difference = 2.57, P = 0.00002; Table 2). We next analyzed only the top 100 methylated brain sites unique to child and adult age groups, partitioned by hyper- and hypo-methylated sites. Again, hyper-methylated sites showed a greater overlap with bivalent genes, but this difference was only significant for adult sites (child, fold difference = 2.75, P = 0.052; adult, fold difference = 10.67, P = 0.00002; Table 2). In addition, the top 100 adult hyper-methylated sites in brain had a higher enrichment value than when all unique sites were considered (enrichment = 8.09, P = 0.00002; Table 2), more similar to that observed for the Rakyan et al. [3] dataset (enrichment = 12.86, P = P = 0.00002; Table 1). Also striking, was that when aging associated T1D sites [4] were partitioned based on hypo- vs. hyper-methylated changes, hyper-methylated sites also exhibited a much higher enrichment compared to hypo-methylated sites (fold difference = 4.64, P = 0.00002; Table 2).
Table 2. Comparing global enrichment of hypo- vs. hyper-methylated loci in bivalent chromatin domains.
Global Enrichment | Is overlap for hyper sites > hypo sites? | |||
bivalent sites (n = 1792) | P value | Fold difference | P value | |
Brain Adult only, hypo (n = 440) [8] | 0.9055 | 0.752 | ||
Brain Adult only, hyper (n = 1867) [8] | 3.659 | 0.00002 | 4.727 | 0.00004 |
Brain Child only, hypo (n = 154) [8] | 1.326 | 0.2618 | ||
Brain Child only, hyper (n = 287) [8] | 3.392 | 0.00002 | 2.28 | 0.00164 |
Brain Fetal only, hypo (n = 83) [8] | 0.3875 | 0.9572 | ||
Brain Fetal only, hyper (n = 20) [8] | 2.77 | 0.1299 | 4.446 | 0.09896 |
Brain Adult only, hypo (top 100) [8] | 0.654 | 0.8946 | ||
Brain Adult only, hyper (top 100) [8] | 8.085 | 0.00002 | 10.67 | 0.00002 |
Brain Child only, hypo (top 100) [8] | 1.365 | 0.2754 | ||
Brain Child only, hyper (top 100) [8] | 4.427 | 0.00006 | 2.75 | 0.05192 |
Blood Adult hypo (n = 360) [4] | 2.017 | 0.00094 | ||
Blood Adult hyper (n = 223) [4] | 6.498 | 0.00002 | 2.571 | 0.00002 |
Adult T1D, hypo (n = 278) [4] | 1.365 | 0.1393 | ||
Adult T1D, hyper (n = 405) [4] | 9.255 | 0.00002 | 4.637 | 0.00002 |
Functional characterization of hyper-methylated adult brain/bivalent genes
We next sought to determine whether there were differences in biological function of adult brain hyper-methylated genes that overlapped bivalent loci compared to those that did not overlap bivalent loci. Of the 1,867 brain hyper-methylated sites unique to adults, 353 overlapped bivalent loci, corresponding to 304 non-redundant genes. The 1,514 hyper-methylated sites that did not overlap bivalent loci corresponded to 1,410 non-redundant genes. A qualitative comparison of the top ten most significant biological functions of these two groups of genes is shown in Table 3. Not surprisingly, given that bivalent domains are known to associate with genes involved in development, eight of the ten biological functions identified for adult brain hyper-methylated/bivalent sites shared the parent GO term “developmental process” (GO:0032502). Notably, among these was the term “neuron differentiation” (4.90×10−13; GO:0030182), which is defined as “the process in which a relatively unspecialized cell acquires specialized features of a neuron”. In contrast, none of the biological functions identified for those sites outside of bivalent domains were associated with GO:0032502; in addition, only four of these had P-values <0.05.
Table 3. Gene ontology results for brain hyper-methylated sites inside and outside bivalent chromatin domains.
Top ten biological functions for brain hyper-methylated adult-only sites overlapping bivalent genes | ||
GO Term (biological function) | # of genes | Benjamini P value |
pattern specification process | 41 | 1.30×10−21 |
embryonic morphogenesis | 41 | 1.40×10−19 |
regionalization | 31 | 2.90×10−16 |
regulation of transcription, DNA-dependent | 89 | 2.90×10−16 |
regulation of RNA metabolic process | 90 | 2.80×10−16 |
skeletal system development | 37 | 8.50×10−16 |
embryonic organ development | 28 | 3.50×10−15 |
embryonic organ morphogenesis | 25 | 7.70×10−15 |
neuron differentiation | 39 | 4.90×10−13 |
anterior/posterior pattern formation | 23 | 3.00×10−12 |
Discussion
Age-associated DNA hyper-methylated sites in blood have been shown to overlap bivalent chromatin domains [3], which involve genes with functions in development and cellular differentiation [10]. Here we have confirmed the occurrence of such an overlap in additional datasets from blood, and revealed that genomic regions associated with age-related changes in DNA methylation in the human brain are also enriched in bivalent domains, suggesting that this phenomenon is likely ubiquitous, occurring across different tissue types.
Regarding the relationship between brain age-associated DNA methylated sites and bivalent domains, we noted two primary trends: (1) that overlap of these sites was largely driven by DNA hyper-methylation, as hyper-methylated sites were more enriched in bivalent domains than hypo-methylated sites; and (2) that this enrichment was more pronounced for sites associated with DNA hyper-methylation that occurred in adulthood. Numata et al. [8] noted that DNA hyper-methylated sites represented a larger proportion of age-associated changes in adults compared to children, and discussed the fact that such trends may be consistent with the observation that stochastic changes in DNA methylation are known to increase with age [16]. Importantly, stochastic changes in DNA methylation have also been implicated in disease discordance between monozygotic twins [17], [18], including Alzheimer's disease [18].
But why DNA hyper-methylated sites show preferential overlap with bivalently marked genes, and whether these sites have roles in disease pathogenesis remains to be determined. Rakyan et al. [3] posited that increased DNA hyper-methylation at bivalent domains with age may be associated with a decrease in cellular and developmental pluripotency, in line with observations from in vitro studies using human cell lines [19]. Interestingly, with respect to the brain data analyzed here, we found that a subset of genes characterized by both age-associated DNA hyper-methylation and bivalent chromatin marks were enriched for the biological function “neuron differentiation”. Under a model such as that proposed by Rakyan et al. [3], and considering the two primary trends noted above, it could be speculated that the accumulation of methylation (typically associated with decreased gene expression) at bivalent loci in the aging brain could prevent neuronal cell regeneration. Thus, given that one of the primary pathologic features of age-related neurological disorders such as Alzheimer's disease and Parkinson's disease includes the loss of neuronal cell plasticity and cell death [20], [21], the relationship between the epigenetic alterations investigated in this study could have involvement in the onset or the progression of such disorders, and warrants further study.
Acknowledgments
The authors are grateful to Tianzhang Ye and colleagues for kindly providing the brain methylation data used in this study.
Funding Statement
Dr. Disanto is funded by research fellowship FISM-Fondazione Italiana Sclerosi Multipla-Cod.: 2010/B/5. Dr. Kjetil is supported by EMBIO at the University of Oslo. Dr. Breden is supported by the Natural Sciences and Engineering Research Council of Canada. Dr. Ramagopalan receives research support from the Multiple Sclerosis Society of Canada Scientific Research Foundation and the Multiple Sclerosis Society of Great Britain and Northern Ireland. This work was supported by the Medical Research Council (G0801976), the Wellcome Trust (075491/Z/04), and by a research fellowship FISM Fondazione Italiana Sclerosi Multipla-Cod.:2010/B/5. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Maegawa S, Hinkal G, Kim HS, Shen L, Zhang L, et al. (2010) Widespread and tissue specific age-related DNA methylation changes in mice. Genome Res 20(3): 332–340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, et al. (2009) Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet 5(8): e1000602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Rakyan VK, Down TA, Maslau S, Andrew T, Yang TP, et al. (2010) Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res 20(4): 434–439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Weisenberger DJ, et al. (2010) Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res 20(4): 440–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Hernandez DG, Nalls MA, Gibbs JR, Arepalli S, van der Brug M, et al. (2011) Distinct DNA methylation changes highly correlated with chronological age in the human brain. Hum Mol Genet 20(6): 1164–1172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Koch CM, Suschek CV, Lin Q, Bork S, Goergens M, et al. (2011) Specific age-associated DNA methylation changes in human dermal fibroblasts. PLoS One 6(2): e16679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Alisch RS, Barwick BG, Chopra P, Myrick LK, Satten GA, et al. (2012) Age-associated DNA methylation in pediatric populations. Genome Res 22(4): 623–632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Numata S, Ye T, Hyde TM, Guitart-Navarro X, Tao R, et al. (2012) DNA methylation signatures in development and aging of the human prefrontal cortex. Am J Hum Genet 90(2): 260–272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Azuara V, Perry P, Sauer S, Spivakov M, Jørgensen HF, et al. (2006) Chromatin signatures of pluripotent cell lines. Nat Cell Biol 8(5): 532–538. [DOI] [PubMed] [Google Scholar]
- 10. Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, et al. (2006) A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125(2): 315–26. [DOI] [PubMed] [Google Scholar]
- 11. Ohm JE, Baylin SB (2007) Stem cell chromatin patterns: an instructive mechanism for DNA hypermethylation? Cell Cycle 6(9): 1040–1043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Zhao XD, Han X, Chew JL, Liu J, Chiu KP, et al. (2007) Whole-genome mapping of histone H3 Lys4 and 27 trimethylations reveals distinct genomic compartments in human embryonic stem cells. Cell Stem Cell 1(3): 286–298. [DOI] [PubMed] [Google Scholar]
- 13. Sandve GK, Gundersen S, Rydbeck H, Glad IK, Holden L, et al. (2010) The Genomic HyperBrowser: inferential genomics at the sequence level. Genome Biol 11(12): R121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Huang da W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4(1): 44–57. [DOI] [PubMed] [Google Scholar]
- 15. Huang da W, Sherman BT, Lempicki RA (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37(1): 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, et al. (2005) Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci U S A 102(30): 10604–10609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Javierre BM, Fernandez AF, Richter J, Al-Shahrour F, Martin-Subero JI, et al. (2010) Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. Genome Res 20(2): 170–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Mastroeni D, McKee A, Grover A, Rogers J, Coleman PD (2009) Epigenetic differences in cortical neurons from a pair of monozygotic twins discordant for Alzheimer's disease. PLoS One 4(8): e6617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Meissner A, Mikkelsen TS, Gu H, Wernig M, Hanna J, et al. (2008) Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature. 454(7205): 766–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Crews L, Masliah E (2010) Molecular mechanisms of neurodegeneration in Alzheimer's disease. Hum Mol Genet 19(R1): R12–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Enciu AM, Nicolescu MI, Manole CG, Mureşanu DF, Popescu LM, et al. (2011) Neuroregeneration in neurodegenerative disorders. BMC Neurol 11: 75. [DOI] [PMC free article] [PubMed] [Google Scholar]