Abstract
A new mitotic clock and mathematical approach that incorporates DNA methylation biology common among human cell types provides a new tool for cancer epigenetics research.
Introduction
The ability to accurately and efficiently detect the acceleration in the stem cell mitotic “tick rate”—a so-called mitotic clock—could be a useful tool for predicting cancer risk. It has long been recognized that mitosis itself is highly mutagenic [1, 2] and enhanced mitosis in a stem cell may be an important factor in cancer risk. To date, mitotic clock models that use genetic approaches such as telomere length have not consistently predicted risk, potentially in part because of differences in functional programming among cell types. Previous epigenetic mitotic clock models included ones that used random replication errors at CpG sites as signals of mitotic activity, allowing stem cells to be followed retrospectively [3]. Although these epigenetic mitotic clocks were ingenious in design, their practical application in human tissues has been limited by the need to sample stem cells from multiple tissues directly. Hence, while it is clear that a quantitative estimate of mitotic activity in stem cells is likely to be strongly associated with cancer risk, knowledge of biomarkers specific to stem cells that signal mitotic activity has been lacking. A new, biologically based approach presented in the current issue of Genome Biology [4], called epiTOC, uses an integrative methodology that makes use of previous work on estimation of tissue-specific stem cell division rates and devises a model for an epigenetic mitotic clock that overcomes these challenges.
In formulating the new epiTOC tool, Yang and colleagues [4] take on the task of identifying putative phenotypically important variation in DNA methylation that is related to both stem cell alterations and disease risk. It can be extremely challenging to differentiate cell- or tissue type-specific events that are associated with disease risk from effects that are common across cell types because differences in patterns of DNA methylation among normal cells and tissue types are incompletely characterized. Epigenetic events that occur at loci related to “stemness”, lineage-specific differentiation events, or cell-specific responses to transcription factors can depend on cell or tissue type, whereas events that occur at loci associated with metabolic and genetic regulation may be shared among cell types. At the same time, much progress has been made in the search for epigenetically important cancer disease risk loci. While early candidate gene studies comparing tumors with normal cells identified gene-specific hypermethylation (primarily in promoter regions) and DNA nucleotide repeat element hypomethylation (genome-wide hypomethylation), recent high-resolution approaches [5] have shown promise for assessing epigenetic variation in multiple normal and abnormal cells and tissues. Subsequent work from experimental studies has given us better maps relating the genomic context of CpG DNA methylation to functional gene regulation. This sets the stage for accelerated development and testing of potentially useful non-genetic, DNA-based biomarker tools in healthy and diseased cells. More specifically, we are becoming better positioned to recognize signals that are informative for specific types of questions. The integration of cell type data and an epigenetic approach to “telling time” has improved the coordinated universal model of keeping mitotic time by adding guidelines for adjusting to the right “time zone”. Indeed, here Yang and colleagues [4] apply knowledge of the stem cell functional phenotype of polycomb-related genes and integrate this with variation over calendar time to discover loci that are putatively related to mitosis.
The epigenetic clock as a tool for cancer risk prediction
In this work, Yang and colleagues [4] select specific Polycomb target loci that are both unmethylated in multiple fetal tissues and show age-associated hypermethylation and hypothesize that methylation at these sites reflects relative mitotic activity. They then construct a model that shows that cancer and pre-cancer tissues have increased DNA methylation relative to relevant normal tissues. This, they posit, reflects enhanced stem cell activity and increased cancer risk.
Modeling assumptions are limited by current knowledge
Yang and colleagues [4] are to be congratulated for combining cutting-edge biological knowledge with state-of-the-art bioinformatics in building a cancer prediction model. Scrutiny of this provocative model is certain to result in modifications and refinements to it as the underlying assumptions (of both the model and past experiments) are challenged and the understanding of the underlying biology improves. At the outset, we note that there are a few important assumptions and limitations in this work.
First, the stem cell division rates applied in this work are derived from those presented in Tomasetti and Vogelstein [6]. While this is reasonable, as Tomasetti and Vogelstein indicate in their work [6] there is room for improvement in the estimates they present. In addition, the current model is tested in cancer tissues and shows universal increases, but the authors have not yet shown evidence of prediction of risk in a prospective setting, where availability of data is still very limited. The current model does not include or apply estimates of the contribution that somatic alterations in non-stem cells (in any tissue or tissue environment) may make to tick rate. Altered somatic cells, particularly in tissues with higher levels of carcinogen exposure, may have non-stem cells that propagate alterations and increase the estimated tick rate of the mitotic clock. We also note here that the potential contribution of immunity and inflammation, which are particularly important in many solid tumors, is not yet specifically included in the model. The current model is also built, appropriately, using data from just one tissue source, and additional methylation data from normal tissues in healthy subjects are needed to expand and further examine the predictions of the model. Finally, reference-free and reference-based approaches would have to be properly applied to adjust for cellular heterogeneity in the setting of various other normal tissue types.
We also highlight that, as the authors note, this model necessarily assumes that methylation at the informative loci occurs only in stem cells. This is novel biology for which there is little to no experimental evidence. If true, it would imply that locus specificity in methylation is differentially determined in numerous distinct cellular and tissue-specific compartments.
Implications of epiTOC
An important aspect of the work presented by Yang and colleagues [4] is that it highlights the differences between genome-wide association studies (GWAS) and epigenome-wide association studies (EWAS). In their work on this issue, the authors define additional building blocks of the DNA methylome, adding loci that putatively act in coordination and display a novel “mitotic clock” phenotype. These loci, as well as others that have been previously defined (e.g., the Horvath “aging” loci), represent the beginning of our ability to organize the methylome into distinct loci-driven phenotypic units [7]. Importantly, future discovery-based interrogation seeking to define the nature of differences in tissues or of tissues within population groups can now begin to group loci for testing rather than treating them independently. These groups can be compared for differences in the locus-associated phenotype, offering potential for better interpretation of some of the results of these studies. This is not unlike applying the now-standard techniques for delineating cell types within tissues [8–10], which represented the initial attempts to arrange epigenetic data into organizational, phenotypically defined units.
Adding these new building blocks allows us to imagine new approaches to future studies, including examination of the dynamic changes to the epigenome over the life course. For example, does epiTOC provide a window into variation in the extent to which chronological age contributes to cancer risk? The ability to estimate mitotic tick-rate acceleration has value for researchers collecting DNA methylation data, as it provides a directed approach to investigating age-correlated cancer risk and may inform on the biology of cancer risk factors studied across the life course.
Conclusions
Tests of epiTOC’s association with increased cancer risk require case control approaches or, ideally, prospective studies. Initially, such work will be limited to cell types from the type of biological specimens that are more routinely collected in these studies, such as peripheral blood, and as epiTOC was developed using peripheral blood as the model tissue it may show initial success in that setting. In the future, other relatively accessible biospecimens could be collected in medical and research settings with a prospective design, so that cancer risk for the relevant organ could be tested. These tissues may include epithelial cells from colorectal tissue, the bladder, sputum, the cervix, and the oral cavity or ductal epithelial cells from the breast. Thus, epiTOC is a new tool of the best kind—one that immediately brings new questions to the fore while also providing the potential means to find answers to pressing old ones. It is an excellent demonstration of the power of incorporating biology into a computationally sophisticated analytic framework.
Acknowledgements
BCC received funding from NIH: NIGMS (P20GM104416/8189) and NIH: NIDCR (R01DE022772) and KTK received funding from NIH: NCI (1R01CA207110).
Authors’ contributions
KTK and BCC conceived and wrote this work together. Both authors read and approved the final manuscript.
Competing interests
The authors declare that they have no competing interests.
Abbreviations
- epiTOC
Epigenetic timer of cancer
References
- 1.Gupta PK, Sahota A, Boyadjiev SA, Bye S, Shao C, O'Neill JP, et al. High frequency in vivo loss of heterozygosity is primarily a consequence of mitotic recombination. Cancer Res. 1997;57(6):1188–1193. [PubMed] [Google Scholar]
- 2.Finette BA, Homans AC, Rivers J, Messier T, Albertini RJ. Accumulation of somatic mutations in proliferating T cell clones from children treated for leukemia. Leukemia. 2001;15(12):1898–1905. doi: 10.1038/sj.leu.2402306. [DOI] [PubMed] [Google Scholar]
- 3.Shibata D, Tavaré S. Stem cell chronicles: autobiographies within genomes. Stem Cell Rev. 2007;3(1):94–103. doi: 10.1007/s12015-007-0022-6. [DOI] [PubMed] [Google Scholar]
- 4.Yang Z, Wong A, Kuh D, Paul DS, Rakyan VK, Leslie RD, et al. Correlation of an epigenetic mitotic clock with cancer risk. Genome Biol. 2016;17:205. doi: 10.1186/s13059-016-1064-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McCartney DL, Walker RM, Morris SW, McIntosh AM, Porteous DJ, Evans KL. Identification of polymorphic and off-target probe binding sites on the Illumina Infinium MethylationEPIC BeadChip. Genomics Data. 2016;9:22–24. doi: 10.1016/j.gdata.2016.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tomasetti C, Vogelstein B. Cancer etiology. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science. 2015;347(6217):78–81. doi: 10.1126/science.1260825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14(10):R115. doi: 10.1186/gb-2013-14-10-r115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13(1):86. doi: 10.1186/1471-2105-13-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Reinius LE, Acevedo N, Joerink M, Pershagen G, Dahlén SE, Greco D, et al. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PLoS One. 2012;7(7) doi: 10.1371/journal.pone.0041361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Koestler DC, Jones MJ, Usset J, Christensen BC, Butler RA, Kobor MS, et al. Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL) BMC Bioinformatics. 2016;17(1):120. doi: 10.1186/s12859-016-0943-7. [DOI] [PMC free article] [PubMed] [Google Scholar]