Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2025 Sep 23:2025.09.18.677221. [Version 2] doi: 10.1101/2025.09.18.677221

Intermediately Methylated Regions in Normal Cells Are Epimutation Hotspots in Cancer

Mohamed Mahgoub 1, Haley Abel 1, Nidhi Davarapalli 1, Heidi Struthers 1, Emilee Kotnik 1, Brittany Johnson 1, Christopher Markovic 2, Catrina Fronick 2, Robert Fulton 2, Michael P Meers 3, David H Spencer 1,2,4,*
PMCID: PMC12485763  PMID: 41040346

Abstract

DNA methylation is altered in all cancers, but the mechanisms responsible for these changes are not well understood. Using data from 100 primary samples, we show that regions with intermediate methylation in normal hematopoietic cells are hotspots for stable, clonal epimutations in acute myeloid leukemia. Analysis of hematopoietic stem cell clones demonstrated that intermediate methylation at epimutation hotspots reflects random allele-specific methylation and expression at the single cell level, representing a previously unrecognized form of somatically acquired imprinting. We identified somatically imprinted regions in other tissues and show they are epimutation hotspots in other cancer types. This demonstrates that random allele-specific methylation is both a general property of normal cells, and a vulnerability that renders them susceptible to cooperating events and clonal selection during cancer.


Cancer is caused by heritable somatic events that endow neoplastic cells with a selective advantage. While the most extensively studied somatic events in cancer are genetic mutations, cytosine methylation of DNA is also heritable through cell divisions and exhibits tissue-specific changes in malignant vs. normal cells. These include genome-wide methylation patterns present in most cancer types(13) and signatures that reflect a tumor’s cell of origin, subtype and the effects of somatic mutations(49). Genome-wide studies have shown that abnormally methylated CpGs in cancer frequently cluster to form differentially methylated regions (DMRs)(4, 5). However, the factors that determine why certain regions acquire these changes are poorly understood. This has made it difficult to define the mechanisms responsible for cancer-associated methylation changes and distinguish early events that could initiate cancer from alterations that are consequences of it(4, 10).

Epimutations are spontaneous, heritable epigenetic abnormalities that can affect gene activity and cause human disease, including cancer(11, 12). The best-characterized epimutations are inherited events that cause the imprinting disorders Beckwith-Wiedemann, Angelman, and Prader-Willi syndromes(13, 14), or predispose patients to cancer via silencing of tumor suppressors(15, 16). Epimutations can also be somatically acquired, most notably in cancer. These can also silence tumor suppressors, producing a second epigenetic ‘hit’ in tumors with a co-occurring somatic mutation(1719). Although the mechanisms and timing of somatic epimutations are not well understood, there is evidence they may pre-exist in normal tissues(2024) and could therefore be selected for during cancer development.

Although methylation in cancer has been extensively studied, most analysis has focused on changes associated with clinical or molecular features(4, 6, 2529). Such approaches may be less sensitive to detecting spontaneous epimutations, which could be independent of these variables. Here, we used an unbiased analysis to identify somatically acquired DNA methylation epimutations in acute myeloid leukemia (AML). Surprisingly, these events frequently occurred in regions with intermediate methylation in normal hematopoietic cells, which harbor blocks of CpGs that exhibit random allele-specific methylation in single cells. We demonstrate that methylation at these blocks affects chromatin and gene expression, and that “somatically imprinted” regions are epimutation hotspots in AML and other cancers.

Intermediately methylated regions in normal hematopoietic cells are hotspots for altered methylation in AML

To identify DNA methylation changes in AML that could be somatically acquired epimutations, we compared whole-genome bisulfite sequencing (WGBS) data from 100 primary AML samples to WGBS from 16 purified hematopoietic cell populations from 7 healthy donors(4) (Figure 1A, S1A). Pairwise comparisons were performed between each AML sample to pooled data from stem/progenitor and mature myeloid cells (see Methods) to detect common and rare events and account for maturation-related methylation changes in AML. Between 752 and 19,965 autosomal differentially methylated regions (DMRs) with a methylation difference >0.3 were identified in each sample, for a total of 575,869 DMRs (Figure 1B, S1B). Merging of all DMRs produced 76,752 unique regions, of which 33,181 (43.2%) were “private DMRs” in a single AML sample, and 43,571 (56.8%) regions that overlapped DMRs identified in at least two samples (Figure 1C). Applying a threshold of ≥25 AML samples resulted in 6,076 regions overlapping recurrent DMRs that we defined as DMR “hotspots”, which included regions near genes with important roles in hematopoiesis and AML (e.g., HOXA3, Figure 1D).

Fig. 1. Recurrent differentially methylated regions in AML are enriched for intermediately methylated regions in normal hematopoietic cells.

Fig. 1.

(A) Analysis of differentially methylated regions (DMRs) in 100 primary AML samples. WGBS data from each AML was compared to a pooled dataset of 16 hematopoietic cell populations purified from bone marrow aspirates from 7 distinct healthy donors. (B) Scatter plot showing the methylation level of each DMR identified in all AML samples versus mean methylation in normal hematopoietic cells. (C) Recurrence of DMRs across AMLs after merging overlapping regions from all samples. Private DMRs are regions with differential methylation in one AML sample, while DMR hotspots are regions overlapping highly recurrent DMRs identified in at least 25 AML samples. (D) DNA methylation at a DMR hotspot in the HOXA3 locus from normal hematopoietic cells and selected AML samples. (E) Heatmap showing methylation in normal hematopoietic stem/progenitors (HSPCs) at regions overlapping private DMRs identified in only one AML sample (left) and DMR hotspots (25+ samples) (right). DMRs are length-normalized and shown with 1 kb of flanking sequence. (F) Methylation levels at intermediately methylated regions in normal hematopoietic cells that are hotspots for differential methylation in AML samples (IMR hotspots). Each point shows the mean methylation of IMR hotspot in the indicated normal hematopoietic cell population (green), AMLs with no DMR called at the hotspot (blue), and AMLs in which a DMR was called (red). AML samples with a DMR at the hotspots are grouped by the presence of mutations in selected genes (minimum of 2 AML samples with the DMR hotspot per mutation group).

We next investigated the normal epigenetic features at hotspots by annotating them with chromatin states(30) from hematopoietic stem/progenitor cells (HSPCs) (Figure S1C, see Methods). Hotspots were >10-fold enriched for bivalent chromatin and transcriptional initiation compared to private DMRs, which were evenly distributed across all states (Figure S1D-F). Interestingly, states with the highest enrichment for hotspots had intermediate methylation in normal HSPCs, an uncommon genomic feature that marks tissue-specific regulatory elements(31). Direct analysis showed that 48% of hotspots had intermediate methylation in normal HPSCs, and there was a 5-fold enrichment for intermediately methylated CpGs in hotspots compared to their genome-wide frequency (Figure 1E).

We next defined hematopoietic-specific intermediately methylated regions (IMRs) with methylation between 0.3 and 0.7 in normal hematopoietic cells and evaluated the hotspots overlapping these IMRs (i.e., “IMR hotspots”) for features of somatic epimutations. Interestingly, methylation at IMR hotspots remained consistently intermediate throughout myeloid maturation and was present in promyelocytes, monocytes, and neutrophils (Figure 1F), and it was also stable between presentation and relapse in 6 AML patients (Figure S1H). Further, AML samples with DMRs at IMR hotspots showed consistent hyper- or hypomethylation at each hotspot, even in samples with mutations in DNMT3A, IDH1, IDH2, or TET2 (Figures 1F, S1I) that cause directional methylation changes in AML(46). This implies that DMRs at IMR hotspots are not simply driven by these mutations. Finally, there was a modest but significant negative correlation between methylation at promoter-associated IMR hotspots and expression of nearby genes, supporting their functional role in gene regulation (Figure S1J). Together this demonstrates that intermediately methylated regions in normal cells are hotspots for methylation changes in AML that resemble somatically acquired epimutations.

Intermediately methylated regions contain CpG blocks with allelic methylation in AML

Intermediate methylation could be due to cellular heterogeneity, discordant methylation across CpGs, or allele-specific methylation (Figure 2A). To distinguish between these possibilities, we analyzed fragment-level methylation (32, 33) at IMR hotspots in WGBS data from purified normal hematopoietic cells. This showed that methylation at consecutive CpGs was highly correlated at IMR hotspots compared to non-hotspot DMRs with intermediate methylation (p<10−10, Figure 2B, S2A). We next tested for allele-specific methylation (ASM) at IMR hotspots with heterozygous SNPs in the normal and AML samples. Surprisingly, ASM was rare in normal samples but more common in AML, especially in samples that retained intermediate methylation (Figure 2C, S2B-C). To further explore this observation, we used Oxford Nanopore sequencing (ONT) to obtain haplotype-resolved methylation from 10 primary AML samples and CD34+ HSPCs or immature (lineage marker-negative) bone marrow cells from four healthy donors (Figure 2D). We identified 1,264–4,497 autosomal regions of ASM per AML sample, after excluding regions in imprinted genes, near structural variants, or with high SNP density; in comparison, ~300 ASM regions were identified in normal samples (Figure S2D). Interestingly, 30% of the IMR hotspots overlapped an ASM region in the ONT data from AML samples, supporting our findings from WGBS.

Fig. 2. Intermediately methylated regions in normal hematopoietic cells display allelic methylation patterns in AML.

Fig. 2.

(A) Models of possible fragment-level methylation patterns at intermediately methylated regions (IMRs). (B) Comparison of WGBS fragment-level methylation metrics at IMRs overlapping DMR hotspots vs. non-hotspot DMRs in normal hematopoietic cells. Each point represents a different metric, with the x-axis indicating the effect size (standardized difference in means between hotspots and non-hotspots) and the y-axis indicating statistical significance as –log₁₀(p-value) from a t-test comparing the two groups. Metrics include PDR (Proportion of Discordant Reads), Entropy (Shannon entropy), CHALM (Cell Heterogeneity-Adjusted cLoNal Methylation), R² (linkage disequilibrium R squared), MHL (Methylation Haplotype Load), MCR (Methylation Concordance Ratio), and MBS (Methylation Block Score). (C) Fraction of IMR hotspots exhibiting allele-specific methylation (ASM) per sample from WGBS in normal hematopoietic cells and AML samples, calculated among all IMR hotspots with a heterozygous SNP and sufficient coverage. Only samples with at least 5 informative IMR hotspots are included. (D) ONT sequencing and haplotype-resolved methylation analysis of normal HSPCs (N=4) and AML samples (N=10). (E) Example of haplotype-resolved methylation in an IMR hotspot (at the promoter of CD1D), showing mosaic methylation in normal hematopoietic cells, allele-specific methylation (ASM) in AML sample 335640, and full methylation (epimutation) in AML sample 740266. Tracks show the DMR region identified in each AML sample vs. normal cells using DSS R package and the “IMR block” defined by correlation of fragment-level CpG methylation using ONT data(34). (F) ONT haplotype-level methylation at IMR blocks in all normal HSPCs (N=4) and AML samples (N=10). Top panels show the distribution haplotype-level methylation at IMR blocks. Bottom panels show the average methylation of haplotype 1 versus haplotype 2 for each IMR block. (G and H) Aggregated H3K4me3 CUT&Tag signal (G) and chromatin accessibility (Fiber-seq) (H) at IMR blocks in AML samples, with blocks grouped by methylation status (hypomethylated, intermediately methylated, or hypermethylated). (I) Chromatin accessibility (Fiber-seq) at IMR blocks with intermediate methylation (green in panel H) and exhibiting ASM in AML samples, comparing accessibility between the methylated and unmethylated haplotypes. (J) Example of haplotype-resolved methylation and chromatin accessibility at an IMR block in an intergenic region on chromosome 10 (chr10:61,866,122–61,869,606).

We next used the long-read ONT methylation data from all AML samples and normal HSPCs to identify blocks of consecutive, highly correlated CpGs(34) that may be driving allelic methylation at IMR hotspots. We also included published IMRs(31) and ASM regions from the ONT data in this analysis, which we reasoned could share similar genomic features. This resulted in 11,534 total methylation haplotype blocks, of which 1,575 had consistently intermediate methylation in normal HSPCs (between 0.3 and 0.7), which we defined as “IMR blocks”. These blocks contained a mean of 12 CpGs, spanned 186 bp, and showed highly correlated—but mosaic—patterns in HSPCs (Figure 2E). Analysis of haplotype-level methylation at all IMR blocks from HSPCs showed that both haplotypes had intermediate methylation, matching the aggregate methylation levels from ONT and WGBS (Figure 2F). In contrast, most IMR blocks in AML samples either had ASM (Figure 2E and 2F: indicated in blue) or were fully methylated/unmethylated on both alleles, reflecting an epimutation (Figure 2E and 2F: indicated in yellow). These findings demonstrate the unique properties of IMR blocks, which have mosaic haplotype-level methylation in normal cells that converts to ASM or fully methylated/unmethylated epimutations in AML.

Methylation at IMR blocks is associated with differential chromatin activity

We next investigated whether methylation at IMR blocks is associated with differential histone modifications using CUT&Tag and haplotype-level chromatin accessibility via Fiber-seq(35). IMR blocks in HSPCs possessed both H3K27 and H3K4 trimethylation, consistent with the bivalent chromatin states at hotspots from the HSPC-defined chromHMM model (Figure S2E). In AML samples, both modifications were negatively correlated with methylation at IMR blocks, with ASM blocks having intermediate signal consistent with allelic histone modification differences (Figure 2G, S2F). Likewise, a negative correlation between methylation and accessibility at IMR blocks was observed via Fiber-seq in two AML samples (Figure 2H). This was also apparent at the haplotype level, with unmethylated haplotypes having higher accessibility in aggregate and at individual regions (Figure 2I, 2J). Overall, these findings are consistent with previous analysis of histone modifications at IMRs(31) and support a functional relationship where unmethylated alleles at IMR blocks have greater accessibility and increased histone methylation compared to methylated blocks.

Allelic methylation at IMR blocks is stable over time and marks clonal hematopoiesis

To determine the stability of allelic methylation at IMR blocks, we used ONT sequencing to analyze paired presentation/relapse samples from 3 AML patients who relapsed after chemotherapy with a related AML clone (Figure 3A, S3A). We compared IMR blocks with allelic methylation (i.e., ASM or a methylated/unmethylated epimutation) between presentation and relapse samples for each patient (using synchronized haplotype designations), which demonstrated that >92.5% of the ASM blocks and >98.9% of the blocks with epimutations were stable at relapse. We next tested whether allelic methylation at IMR blocks is specific to leukemic cells by analyzing remission samples from two of the AML patients. Whole-genome sequencing(36) confirmed a molecular remission in patient 895870, while patient 627311 had a persistent DNMT3A mutation and several noncoding exome variants, indicating the presence of clonal hematopoiesis (Figure S3B). Patient 895870 had allelic methylation at IMR blocks at presentation that nearly completely normalized in the remission sample before returning at relapse (Figure 3C). Interestingly, allelic methylation only partially normalized in the remission sample from patient 627311 that had clonal hematopoiesis, which instead retained a pattern closer to the presentation and relapse samples (Figure 3D).

Fig. 3. IMR blocks are stable, clonal events in hematopoietic cell populations.

Fig. 3.

(A) Longitudinal analysis of methylation at IMR blocks in 3 AML patients (112200, 895870, and 627311) who achieved a complete morphologic remission with standard “7+3” induction chemotherapy. ONT sequencing was performed on the samples at the timepoints indicated by the open circles. (B) Heatmap representation of haplotype-level methylation from ONT sequencing of paired presentation/relapse sample pairs. IMR blocks were classified as either having ASM (left) or a fully methylated/unmethylated epimutation (right) via Fisher’s exact test on methylation read counts either between haplotypes (for ASM) or compared to normal HSPCs (for epimutations). Methylated and unmethylated IMR blocks are shown in red and blue, respectively. Only IMR blocks with statistically significant classifications at both presentation and relapse are shown. The percent of concordant ASM and epimutation blocks between presentation and relapse is shown on the right. (C) Haplotype-level methylation at IMR blocks from AML patient 895870 at presentation, complete morphologic and molecular remission (i.e., blasts <1% and no leukemia-associated mutations detected), and relapse. Points show the methylation of IMR blocks in haplotype 2 vs. haplotype 1 and are colored by their methylation status in the presentation sample. Bar plots show the variant allele fraction (VAF) of selected recurrent mutations in each sample. Bottom panel shows haplotype-level methylation at the same IMR blocks from a representative CD34+ HSPC sample and are colored based on the methylation status of the block in the presentation AML sample. (D) Haplotype-level methylation in AML patient 627311 at presentation, complete morphologic remission (bone marrow blasts <1%), but with a persistent DNMT3A, and relapse. Scatter plots are shown as in (C). Bar plots show mutation VAFs in each sample and indicate the presence of a DNMT3AR729W variant in the remission sample, consistent with persistent clonal hematopoiesis. (E) Correlation between clonal hematopoiesis in AML remission and the presence of ASM and epimutations at IMR blocks from ONT sequencing. Top panels show mutation VAFs at presentation and complete remission for 6 patients, 3 with clonal hematopoiesis in remission (left) and 3 with nonclonal hematopoiesis. Bottom panel shows the fraction of all IMR blocks (N=1,575) that met criteria for being ASM or fully methylated/unmethylated epimutations compared to normal HSPCs in each sample using a Fisher’s exact test on haplotype-level methylation read counts. All remission samples were bone marrow aspirates with normal trilineage hematopoiesis and <1% blasts. See reference (36).

To further examine the relationship between allelic methylation at IMR blocks and clonal hematopoiesis, we used ONT sequencing to measure methylation in additional remission samples from 4 patients who had either complete clearance of somatic mutations or multiple persistent mutations by exome sequencing, indicating clonal hematopoiesis(36) (Figure 3E, S3C). Quantification of IMR blocks with allelic methylation showed that patients with clonal hematopoiesis had a significantly higher fraction of allelic blocks compared to samples with complete mutation clearance (41% vs. 22%; p=0.034, Student’s t), supporting an association of ASM and epimutations with clonal hematopoiesis.

Random allele-specific methylation at IMR blocks is somatically acquired in normal hematopoietic stem cells

We reasoned that intermediate methylation at IMR blocks could reflect random ASM that is developmentally established in HSPCs at the single cell level and thus can be observed in clonal AML samples and clonal hematopoiesis but not in bulk cell populations. To test this, we examined IMR block methylation in ONT data from human embryonic stem cells. This showed that most blocks were fully methylated or unmethylated on both haplotypes (Figure S4A), implying that intermediate methylation emerges during hematopoietic development. IMR blocks also reverted to being fully methylated or unmethylated in induced pluripotent stem cells derived from CD34+ HSPCs(37) (Figure S4B), indicating these states are intrinsic to pluripotent stem cell identity.

To determine whether mosaic methylation at IMR blocks reflects random ASM, we sorted CD34+/CD38-/CD45RA-HSCs from primary cord blood and expanded bulk HSCs and 11 single-cell HSC clones for analysis via ONT sequencing (Figure 4A). This showed that methylation at most IMR blocks with intermediate methylation in bulk HSCs (684 of 1,575 blocks) did not have mosaic methylation at the haplotype level in single HSC clones: 69% of blocks had ASM in at least one clone and most clones had either ASM or were fully methylated/unmethylated on both haplotypes (Figure 4B). Further, 83% of IMR blocks with ASM in ≥4 clones (N=164) showed ‘switching’ of the methylated haplotype, implying stochastic selection of the methylated state (Figure 4C, 4D). In comparison, haplotype-level methylation was consistent in 98% of regions with ASM in bulk HSCs (Figure 4C), indicating that the random methylation is unlikely to be due to technical or biologic variability. IMR blocks with hyper or hypomethylated epimutations in the HSC clones were also random, but to a lesser extent, with 44% of blocks switching between full hyper and hypomethylated epimutations (Figure S4C, S4D). Most of the epimutation blocks had increased methylation, an effect previously observed in cells forced to rapidly divide in culture(4, 38).

Fig. 4. Random allele-specific methylation at IMR blocks is established in single hematopoietic stem cells and is associated with allele-specific gene expression.

Fig. 4.

(A) HSC expansion and sequencing to analyze haplotype-level methylation in single-cell-derived HSC clones. (B) Haplotype-level methylation at IMR blocks in normal donor HSPCs, bulk cord blood HSCs, and single-cell-derived HSC clones. Each point corresponds to the average methylation of one IMR block within a single clone. Only IMR blocks with intermediate methylation (0.3–0.7) in the bulk HSCs were included. (C) Haplotype-level methylation patterns in single-cell-derived HSC clones at IMR blocks exhibiting intermediate methylation in bulk HSCs (top, labelled “IMR Blocks”) and regions showing allele-specific methylation (ASM) in bulk HSCs (bottom left, “Bulk ASM”). In both panels, line plots display the average methylation of haplotype 1 (orange) and haplotype 2 (purple) across all clones for each region. The dot plots below indicate the number of individual clones in which Haplotype 1 or Haplotype 2 was methylated, which is indicated by the Y-axis and the dot color. The bar plot (bottom right) quantifies the percentage of regions where the methylated allele is concordant across clones versus switched (i.e., random) for both IMR blocks and bulk ASM regions. Only regions with ASM observed in at least four clones were included in this analysis. (D) Example of an IMR block exhibiting random methylation patterns in HSC clones. The tracks display haplotype-level methylation at the block that shows intermediate methylation in bulk HSCs. Individual clones demonstrate random allele-specific methylation (ASM) on either haplotype 1 (orange) or haplotype 2 (purple), or an epimutation showing either fully hypomethylated or hypermethylated states. (E) Allele-specific methylation is associated with allele-specific expression in HSC clones at an IMR block located within an internal promoter of the SCHIP1 gene. For several representative clones, tracks display DNA coverage, haplotype 1 methylation (orange), haplotype 2 methylation (purple), and RNA coverage. The bar plots quantify normalized, haplotype-specific expression in Transcripts Per Million (TPM). RNA Reads are assigned to haplotypes based on a heterozygous SNP, which is indicated by the vertical blue/grey (haplotype 1) and red (haplotype 2) lines in the coverage tracks. While DNA coverage is consistently biallelic, the level of RNA expression from each haplotype is inversely correlated with its methylation status, demonstrating that higher methylation is linked to transcriptional repression.

We next tested whether random ASM could be linked to allele-specific expression (ASE) by performing bulk RNAseq on 6 HSC clones. Although we were limited to examining only genes with coding SNPs near IMR blocks, we identified a block in an internal promoter of SCHIP1 with intermediate methylation in bulk HSCs that was either fully methylated/unmethylated or had ASM in all 6 clones (Figure 4E). Haplotype-level expression of SCHIP1 isoforms using this promoter was variable and correlated with IMR block haplotype methylation. Similar effects were seen for GATA2, a critical gene in hematopoietic development previously shown to have ASE in AML(17, 22); several ASM regions were present at the GATA2 locus in our data, but these did not meet all IMR block criteria. However, GATA2 had striking monoallelic expression in two clones, and the expression level correlated with haplotype-level methylation in a block-like region (Figure S4E), indicating that both ASM and ASE at GATA2 are pre-existing states in normal HSCs. Together these observations imply that IMR blocks like those at the SCHIP1 and GATA2 loci undergo a form of somatically acquired “imprinting” during hematopoietic development.

Intermediately methylated blocks in multiple normal tissues are epimutation hotspots in cancer

We were next interested in whether IMR blocks in other tissues displayed epigenetic features like those in hematopoietic cells and were associated with abnormal methylation in tumors. For this analysis, we used data from a recent study of methylation in normal tissues that defined tissue-specific CpG blocks with bimodal methylation (i.e., IMR blocks) (39, 40), as well as separate studies of primary tumors using WGBS or ONT(4144) (Figure 5A). Interestingly, the published set of IMR blocks were enriched for bivalent chromatin and transcriptional start sites in all tissues, similar to the blocks we identified in hematopoietic cells (Figure 5B, see also Figure S1). Likewise, analysis of haplotype-level methylation from primary tumors showed that these blocks lost intermediate methylation and frequently had ASM. Haplotype-level analysis showed that 25–71% of IMR blocks with retained intermediate methylation had ASM across tumor types, further indicating that ASM in these regions is a property of clonal cell populations (Figure 5C, 5D).

Fig. 5. Tissue-specific IMRs in normal cells are hotspots for epimutations in cancer.

Fig. 5.

(A) Identifying methylation changes at tissue-specific intermediately methylated regions (IMRs) in multiple cancers. IMRs for hematopoietic cells were identified using ONT data from this study, while IMRs for other tissues were defined using a normal tissue WGBS atlas(39). For each tissue, epimutations (hyper- or hypo-methylation) were identified by comparing tumor samples to their normal counterparts. Allele-specific methylation (ASM) was called using haplotype-phased ONT data. This analysis integrated data for AML generated in this study with published datasets for other cancers, including ONT(41) and WGBS(42, 44) data. For details, see the Methods section. (B) Heatmap showing the fold enrichment of tissue-specific IMRs within tissue-specific chromatin states. For each tissue (columns), the enrichment of its specific IMRs (n = number of blocks) was calculated for each chromatin state (rows) relative to a genomic background. Values displayed represent the fold enrichment for regions with an enrichment of ≥5. Chromatin state annotations were obtained from the Roadmap Epigenomics Consortium(45). (C) Comparison of Haplotype 1 versus Haplotype 2 methylation at tissue-specific IMRs in cancer samples using ONT data. (D) Percentage of tissue-specific IMRs with retained intermediate methylation (0.3–0.7) that exhibit allele-specific methylation (ASM) per sample in various cancer types, using phased ONT data. Each point represents an individual tumor sample. The box plots summarize the distribution of ASM percentages for each cancer type shown. (E) Distribution of average methylation levels at tissue-specific IMRs in normal tissues (blue) and their corresponding primary tumors (red) from ONT data. The numbers at the top of the plot indicate the number of samples included in each group. Each point represents the average methylation of a single IMR block within a sample, with data from multiple samples pooled for each tissue type. For visualization, 1,000 points were randomly sampled for each group. Data for tumor samples and normal blood were derived from ONT, while data for other normal tissues were from WGBS. (F) Percentage of various genomic features exhibiting epimutations across different cancer types from ONT data for individual primary tumor samples. Epimutations were defined as a significant difference in aggregate methylation via a Fisher’s exact test on methylation counts for each region between tumor and its respective normal. Boxplots show the percentage of tissue-specific IMRs (IMR Blocks), CpG islands, tissue-specific transcriptional start sites (TSS), and tissue-specific enhancers that have significant count-based differential methylation in each tumor sample compared to its corresponding normal tissue.

Finally, we sought to determine whether IMR blocks are hotspots for altered methylation in solid tumors and could therefore represent epimutations. Indeed, methylation at tissue-specific IMR blocks was frequently altered in both ONT and WGBS tumor data (Figure 5E, S5A). We then quantified the frequency of altered methylation at IMR blocks by comparing block methylation in each tumor sample to its respective normal tissue. For comparison, the same analysis was performed using other genomic regions commonly associated with differential methylation in cancer, including CpG islands, gene promoters and tissue-specific enhancers defined via chromatin states(30). Surprisingly, the fraction of IMR blocks with differential methylation was the highest among these regions, with a mean of 40% to 65% having statistically significant differential methylation vs. normal tissue (Figure 5F, S5B). By comparison, 3.6% to 22% of the other regions had differential methylation across tumor types. This demonstrates that tissue-specific CpG blocks with intermediate methylation are “hotspots” for altered methylation in multiple tumor types.

Discussion

The factors that predispose certain regions to DNA methylation changes in cancer are not well understood. Here we showed that discrete blocks of CpGs with intermediate methylation in normal cells are a priori hotspots for methylation changes in AML and multiple solid tumors. In AML, methylation changes at intermediately methylated regions (IMRs) resemble epimutations: they are stable, clonal events that are less dependent on mutation-driven phenotypes and can be observed in non-leukemic cells. Further, intermediate methylation at these blocks in normal hematopoietic cells is caused by random allele-specific methylation at the single cell level. The highly tissue-specific and stable nature of methylation at these regions implies they normally undergo a type of developmental “somatic imprinting”, and epimutations in AML are due to the absence of this allele-specific state. Further, these regions possess active and bivalent chromatin features and occur near important tissue-specific genes, which suggests they are involved in gene regulation and could be subject to selection during AML pathogenesis.

The concept of allele-specific somatic imprinting is consistent with previous epigenetic studies in primary human tissues. In hematopoietic cells, clonal cell populations exhibit heritable, random monoallelic gene expression(21, 22), and separate studies have used methylation signatures as clonal markers(46, 47). The blocks of random allele-specific methylation described here provide a direct link between these observations by identifying discrete loci where methylation can both contribute to gene regulation and encode stable markers of clonal identity. While it is possible that the mosaic methylation pattern of these regions means they are less functionally relevant, their relationship to chromatin activity and expression suggests otherwise. The striking consistency and stability of IMRs in HSCs also implies that establishing intermediate methylation via somatic imprinting could be a defining feature of HSC identity; epimutations could then be caused by a failure or spontaneous loss of imprinting. While the forces that make ASM advantageous are unknown, it could enable precise gene dosage or simultaneous transcription of two closely spaced genes from opposite alleles. Notably, the GATA2, SCHIP1 and HOXA loci with ASM all encode multiple overlapping transcripts and could be subject to such constraints.

One consequence of somatic imprinting is that it generates a substrate for selection during cancer development. For example, genes with ASM and monoallelic expression would be vulnerable to somatic mutations as a second hit(17, 18). Likewise, the presence of epimutations is a source of epigenetic heterogeneity that could confer an advantage to some cells. We found that IMR blocks were differentially methylated more often than CpG islands, promoters, and enhancers in multiple cancers, and in AML, individual IMR blocks tended to become either fully methylated or unmethylated, hinting that certain epimutations may be under selection. The discrete nature of these events means they may be more amenable to direct measurements and functional characterization to understand how pre-existing epigenetic variability contributes to the somatic processes that lead to cancer.

Materials and Methods

Primary AML and normal hematopoietic cell samples

Primary AML and normal hematopoietic cell samples used in this study were collected from consented subjects using IRB-approved protocols (WashU HRPO#201011766, FHCC #985.03). AML samples were either peripheral blood or bone marrow aspirate specimens obtained from patients with active disease at presentation or relapse and were processed without fractionation, purification, or enrichment via buffy coat preparation into cryopreserved cell suspensions or directly used for DNA extraction (see Tables S1 and S4). Remission samples were unfractionated whole bone marrow aspirates or peripheral blood samples obtained from patients in morphologic complete remission at the timepoints indicated in Figure 3. Normal adult CD34+ HSPCs, promyelocytes, neutrophils, and monocytes from healthy donor bone marrow aspirate samples as described previously(4, 5). Additional HSPC data were generated from two sources. First, we used peripheral blood mobilized CD34+ HSPCs from 3 donors from the Fred Hutchinson Cancer Center (FHCC) Cooperative Centers of Excellence in Hematology Cell Procurement & Processing Core. Second, we enriched for lineage-negative cells from bone marrow aspirates from two healthy adult donors via bead-based depletion of lineage-positive cells (Miltenyi cat# 130–092-211) on an autoMACS instrument (Miltenyi). Finally, commercially available purified CD34+ HSPCs from cord blood samples (STEMCELL) were used for the clonal expansion experiment.

Cord blood HSC sorting and expansion

Cryopreserved umbilical cord blood CD34+ cells (106 cells from a single donor purchased from STEMCELL Technologies) were thawed in a ThawSTAR instrument (BioLife) and then cultured overnight in 10 wells of a 96-well U-bottom plate in StemSpan media with CD34+ expansion supplement, UM729 (STEMCELL) and 1% pen/strep antibiotic. After overnight culture, cells were collected and stained with CD34-APC (BD Pharmingen, #555824), CD38-PE (BD Pharmingen, #560981), CD45RA-FITC (BD Pharmingen, #561882) and Near-IR LIVE/DEAD stain (Invitrogen) for FACS on a MoFlo instrument. Single CD34+/CD38-/CD45RA-cells were sorted into two 96-well flat-bottom plates seeded with 25,000 cells/well of irradiated (2000 cGy) AFT024 stroma (ATCC) and containing 100 uL of prepared StemSpan media. An additional 30,000 cells with the same immunophenotype were sorted into an additional well for expansion of a bulk (nonclonal) cell population. On days 4 and 10 after sorting 50 uL of prepared StemSpan media was added to all wells. The bulk HSCs were harvested in day 15 of culture and yielded approximately 107 cells. Expanded single-cell clones were transferred to 24-well plates without AFT024 stroma on days 21 and 23 of culture and then expanded in 0.5 mL of prepared StemSpan media for an additional 4 days before harvesting for DNA and RNA extraction.

Human embryonic stem cell and iPS cell culture

Human WA0, WA09/H9 ESCs and BM9 iPSCs were cultured in mTESR media (STEMCELL) and passaged using dispase before harvesting for DNA extraction.

Whole-genome bisulfite sequencing data production

WGBS data from 51 AML samples and 15 normal hematopoietic cell populations were repurposed from prior studies(4, 5). Additional datasets were generated as described previously(5) using the Zymo Gold Bisulfite conversion (Zymo Research) and Accel DNA methylation kit (IDT) kits to generate WGBS libraries. QC was performed on Agilent Bioanalyzer or Tapestation instruments prior to sequencing on NovaSeq 6000 or X plus instruments to generate 2×150 paired end reads to a target genome coverage of 10x. All WGBS data were processed using the DRAGEN software suite (v4.2.4) to obtain genome-wide 5mC read counts and methylation fractions.

ONT sequence data production and processing

All DNA for ONT sequencing was extracted using either the NEB Monarch or QIAmp DNA mini kit. DNA was fragmented and size-selected to eliminate small (<5 kbp) fragments and then prepared for sequencing via the LSK114 kit and sequenced using R10.4.1 flowcells on a PromethION sequencer (Oxford Nanopore Technologies, Oxford, UK) with up to two wash/reload cycles. For data obtained from expanded HSCs, adaptive sampling was performed for all IMRs and ASM regions identified in AML samples. Data processing and analysis, including basecalling, was performed via a Nextflow workflow (https://github.com/dhslab/nf-core-wgsnano, commit 8fe4655) using dorado (v0.5.2+7969fab) for basecalling and read mapping, and the PEPPER-Margin-Deepvariant pipeline for phased germline variant calling of SNV and small indels (v0.8.0)(54). Structural variants were identified after data processing using Sniffles(55). Haplotagging was performed using Whatshap (v2.2)(56), which used a common phased VCF file for presentation/remission/relapse samples from AML patients and the bulk HSC data for the HSC expansion experiment to synchronize the haplotype designations. Mosdepth (v.0.3.3)(57) was used to calculate coverage. Combined and haplotype-specific 5mC counts were generated with modkit (https://github.com/nanoporetech/modkit).

Fiber-seq on normal HSPCs and primary AML samples

Fiber-seq was performed on cryopreserved mobilized CD34+ HSPCs from two donors (obtained from FHCC) and presentation/relapse sample pairs from two AML patients using the published protocol(41) with slight modifications. Cells were thawed in a ThawSTAR instrument and 2×106 cells were collected at 250xg for 5 minutes, washed once in 1mL cold PBS and resuspended in 120 uL CUT&TAG nuclear extraction buffer (20 mM HEPES-KOH, pH 7.9, 10 mM KCl, 0.1% Triton X-100, 20% glycerol, and 0.5mM spermidine) and incubated for 10 minutes on ice. Nuclei were then collected at 350xg for 5 minutes and resuspended in 60 uL Buffer A (15mM Tris-HCl, 15 mM NaCl, 60 mM KCl, and 1 mM EDTA, pH 8.0) and transferred to a PCR tube before adding 1.5 uL 32 mM s-adenosyl methionine (SAM) and 0.5 uL Hia5 enzyme (200U/uL, a gift from A. Stergachis). Reactions were incubated at 25C in a thermocycler for 10 min and then stopped with 6uL of 10% SDS (final concentration of 1%). The lysate was transferred to a 1.5mL tube with wide bore tips and stored at −80C prior to DNA extraction and sequencing. DNA was extracted with a MagAttract DNA extraction kit, fragmented with a megaruptor and size-selected on a HTpippen to 18–22kbp prior to SMRTbell library preparation. Libraries were sequenced on a PacBio Revio instrument to generate CCS reads containing 5mC and kinetic data for prediction of 6mA via Fibertools(58). The FIRE pipeline was then used to obtain haplotype-level accessibility for all samples.

Histone CUT&Tag on CD34+ HSPCs and AML samples

Histone profiling used a CUT&Tag-direct protocol(59) with modifications relating to cryopreserved primary AML cells and the release of tagmented DNA from Concanavalin A coated (Con-A) magnetic beads. Cryopreserved HSPC and AML cells were thawed at 37°C, washed once in cold PBS, then pelleted for 3 minutes at 600g and resuspended in hypotonic NE1 buffer (20 mM HEPES-KOH pH 7.9, 10 mM KCl, 0.5 mM spermidine, 10% Triton X-100 and 20% glycerol) followed by incubation on ice for between 45 seconds and 2 minutes for nuclei extraction. The mixture was pelleted for 4 minutes at 1,300g, resuspended in 1× PBS and nuclei were counted using the Trypan blue staining or a ViCell Automated Cell Counter (Beckman Coulter). CUT&Tag-Direct was then carried out as previously described (https://doi.org/10.17504/protocols.io.bcuhiwt6). In brief, nuclei were bound to washed paramagnetic concanavalin A (ConA) beads (Bangs Laboratories) at a concentration of 5 μL washed beads per 50K nuclei and incubated with primary antibody (H3K27me3: Cell Signaling 9733S, H3K4me3: Millipore CS200580) at 4°C overnight in Wash Buffer (10 mM HEPES pH 7.5, 150 mM NaCl, 0.5 mM spermidine and Roche Complete Protease Inhibitor Cocktail) with 2 mM EDTA. Bound nuclei were washed and incubated with 1:100 secondary antibody for 1 hour at room temperature and then washed 2x and incubated in Wash-300 Buffer (Wash Buffer with 300 mM NaCl) with 1:50 5 μM loaded pA–Tn5 for 1 hour at room temperature. Nuclei were washed 2x and tagmented in Wash-300 Buffer with 10 mM MgCl2 for 1 hour at 37 °C and then resuspended sequentially in 50 μl of 10 mM TAPS and 5 μl of 10 mM TAPS with 0.1% SDS and incubated for 1 hour at 58 °C. The resulting suspension was mixed well with 16 μl of 0.9375% Triton X-100, and then 2 μL each of F and R primers and 25 μL 2× NEBNext Master Mix (New England Biolabs) were added for direct amplification with the following conditions: (1) 58 °C for 5 minutes, (2) 72 °C for 5 minutes, (3) 98 °C for 30 seconds, (4) 98 °C 15 seconds, (5) 60 °C for 15 seconds, (6) repeat steps 4–5 13 times, (7) 72 °C for 1 minute and (8) hold at 8 °C. DNA from amplified product was purified via AMPure bead cleanup and resuspended in 25 μl of 10 mM Tris-HCl with 1 mM EDTA. Libraries were assessed using the Qubit (Thermo Fisher Scientific) and TapeStation (Agilent) systems and sequencing was performed on an Illumina NovaSeq Xplus to obtain a minimum of 2 million 2×150 bp paired end reads. All data were analyzed via the nf-core(60) CUT&Run analysis pipeline to obtain normalized signal in bigwig format.

Bulk RNAseq on AML samples and HSC clones

RNA was extracted from 106 cells using QIAquick RNA micro kit (Zymo Research). RNA integrity was assessed by RIN scores by Agilent TapeStation 4150 (Agilent Technologies). Quantification of RNA eluates obtained by Qubit RNA BR Assay kit (Invitrogen-Thermo-Fisher). RNAseq libraries were generated via rRNA/Globin HMR depletion followed by ligation-based library preparation (Watchmaker Genomics). Libraries were assessed using Agilent TapeStation 4150 with High Sensitivity D1000 screen tapes (Agilent Technologies) and Qubit dsDNA HS Assays (Invitrogen-Thermo-Fisher). Sequencing was performed on Illumina NovaSeq Xplus to obtain a minimum of 50 million paired end 2×150 bp reads. Data processing used the Illumina DRAGEN RNA pipeline to obtain normalized gene and transcript-level expression values in transcripts per million (TPM).

Differentially methylated region (DMR) identification from WGBS data

The DSS(61) and bsseq(62) packages in R (version 4.3) were used to identify DMRs between each AML sample (N=100) and 17 normal hematopoietic cell methylation datasets (9 CD34+ HSPC samples, 3 promyelocyte samples, 3 neutrophil samples, and 2 monocyte samples, all purified as described above from normal bone marrow aspirates from adults aged 20–50). An FDR cutoff of 0.05 and minimum methylation difference of 0.3 were used within DSS without additional filtering. Analysis of DMR recurrence used bedtools(63) to merge all DMRs and count overlaps with the DMR set from each AML sample; initial merging and overlapping operations used default settings with no minimum overlap. DMR hotspots were then defined using bedtools multiintersect to obtain intervals that had >=25 overlapping DMRs across the 100 AML samples and had at least 5 CpG. Coverage-weighted mean methylation of all ‘multi-intersect’ DMRs was calculated for each AML sample and the pooled normal hematopoietic cell data using methfast, a custom c program for aggregating CpG methylation counts over bed intervals (code available upon request). These unmanipulated methylation fractions were used for all figures, statistical calculations, and comparisons, except for the heatmaps in Figure 1E where smoothing was performed to impute missing data for visualization (smoothed values were not used for sorting or determining methylation levels).

DNA fragment-level and allele-specific methylation analysis from WGBS data

Fragment-level methylation metrics were calculated in private and recurrent DMRs using all normal hematopoietic cell datasets using mHapSuite(38). Allele-specific methylation (ASM) analysis was performed by first identifying common single-nucleotide polymorphisms (SNPs) from the 1000 Genomes Project (1000G_phase1.snps.high_confidence.hg38.vcf.gz) that overlapped with IMR hotspots. Sequencing aligned reads covering these “hotspot SNPs” were then partitioned into two groups: those matching the reference allele and those matching the specific alternate allele defined in the 1000 Genomes Project. For the subsequent analysis of an IMR hotspot in an individual sample, each allele group was required to have a minimum of 8 reads, with each read spanning at least 4 CpGs. Individual reads were classified as methylated (≥70% of CpGs methylated) or unmethylated (≤30% of CpGs methylated); reads falling between these thresholds were designated “undetermined”. Finally, Fisher’s exact test was used to compare the number of methylated and unmethylated reads between the reference and alternate allele groups at each IMR hotspot. P-values were adjusted for multiple testing using the Benjamini-Hochberg method, and hotspots with an FDR < 0.05 were considered significant ASM sites. The code used for this analysis is available at https://github.com/dhslab/methtools.

ASM analysis from ONT data

ASM regions were identified from ONT data by first importing haplotype-resolved 5mC counts into bsseq objects in R and then calling DMRs between haplotypes with DSS using a minimum FDR of 0.05 and methylation difference of 0.4. ASM regions were then further filtered to exclude intervals meeting any of the following criteria: 1) on the X or Y chromosome, 2) overlaps a known imprinted region (N=782, from (64, 65)), 3) occurs within 1,000 bp of an identified structural variant in the ONT data, or 4) overlaps a 10,000 bp region with a SNP density that is in the top 1% across the genome.

Identification and analysis of IMR blocks in ONT data from normal HSPCs and AML samples

To define IMR blocks we pooled three sets of genomic coordinates: DMR hotspots from our AML WGBS data, ASM regions from our AML ONT data, and known IMRs in normal tissues(37). Coordinates from these sources were merged to create a unified set of intervals. Within these intervals, we identified methylation haplotype blocks (MHBs) by adapting a previously described method(40) for long read ONT data. Specifically, per-read CpG methylation calls were extracted from the ONT BAM files of four normal donors and 10 AML samples using modkit (https://github.com/nanoporetech/modkit). From this read-level methylation data, we calculated the methylation linkage disequilibrium (mLD) between adjacent CpG sites, measured as the squared Pearson correlation coefficient (r2), requiring a minimum of 10 reads covering a CpG pair for the calculation. An MHB was then defined as a contiguous region containing at least four consecutive CpG sites where the r2 value between any two adjacent CpGs was ≥ 0.5. The code used for this analysis is available at https://github.com/dhslab/methtools. To identify allele-specific methylation (ASM) and epimutations within IMR blocks using ONT data, we applied Fisher’s exact test. For the ASM analysis, we compared the counts of methylated and unmethylated CpGs between the two haplotypes. For the epimutation analysis, we compared the aggregated counts of methylated and unmethylated CpGs (regardless of haplotype) between AML or HSC samples (for data from expanded HSC bulk cells and clones) and pooled HSPCs from normal donors (n=4). For a block to be included, we required a minimum of four covered CpGs. Furthermore, we mandated a minimum adjusted CpG coverage (defined as total CpG counts divided by the number of CpGs in the block) of 5 per haplotype for ASM and 5 per sample for epimutation. P-values for both analyses were adjusted for multiple testing using the Benjamini-Hochberg method across all evaluable blocks. A significant ASM call was defined by an FDR < 0.05 and an absolute methylation difference between haplotypes > 0.4. A significant epimutation was defined by an FDR < 0.05 and an absolute methylation difference between the test and normal samples > 0.3.

Histone analysis at IMR blocks

Histone analysis at IMR blocks was performed using H3K4me3 and H3K27me3 CUT&Tag data from 7 AML samples and 3 mobilized CD34+ HSPC samples. All analysis used normalized signal from bigwig files produced by the nf-core pipeline. Mean methylation at each block was classified as being intermediate (i.e., having ASM) or being hyper or hypomethylated on both haplotypes. Aggregate normalized signal for each modification in 200 equally spaced bins was then calculated for a 5 kbp window centered on the block location for analysis by methylation-defined groups.

Fiber-seq FIRE analysis

Haplotype-resolved chromatin accessibility was assessed using the Fiber-seq Inferred Regulatory Elements (FIRE) analysis pipeline(66), available at https://github.com/fiberseq/FIRE. The workflow was run using aligned BAM files as input, and accessibility was quantified as the percentage of accessible fibers per region. Analysis of FIRE signal at IMR blocks used either combined or haplotype-level estimates of percent accessibility and were performed at IMR blocks grouped by methylation status (see histone analysis above).

Short read genome and exome sequencing of AML samples

Whole-genome sequencing (WGS) was performed on representation and relapse AML samples from patients 112200, 627311, and 895870 with matched normal tissue from skin biopsies. All sequencing was performed using 500ng of DNA input to generate libraries with the Illumina DNA library preparation kit, which were quantified by qPCR and sequenced to obtain a target genome coverage of 60x in 2×150 bp paired end reads on an Illumina NovaSeq Xplus sequencer. Data were analyzed using paired somatic mode with the Illumina DRAGEN software. Identified somatic variants were filtered to retain single nucleotide variants with PASS filter flag and total coverage between 20x and 90x to minimize sequencing artifacts. Whole-exome data for the remission samples was described previously(42); variants and their associated variant allele fractions were used directly from that publication.

IMR block identification in WGBS from normal tissues

To identify IMR blocks in normal tissues (brain, lung, colon, breast, and pancreas), we began with a set of previously defined regions of bimodal methylation “bimodal blocks” from WGBS data(44). We then applied a multi-step filtering process to refine these regions and isolate those specific to a single tissue type. First, to ensure consistency within a given tissue, we only retained bimodal blocks that were present in all normal samples for each tissue type. Next, these consistent blocks from all tissue types were merged to create a non-redundant block list (minimum block size of 50 bp; minimum distance between blocks of 50 bp). To isolate truly tissue-specific blocks and exclude germline imprinted regions, this non-redundant list was subjected to a second round of filtering. For each block, we calculated the average methylation within its matched-tissue samples and the average methylation across all other (non-matched) normal samples. A block was kept only if it exhibited intermediate methylation specifically in its matched tissue, defined by two criteria: 1) an average methylation level between 0.3 and 0.7 with a coefficient of variation (CV) ≤ 0.5 within the matched tissue, and 2) an absolute difference in average methylation of ≥ 0.2 between the matched and non-matched tissues. This process resulted in a final set of tissue-specific IMR blocks in hg19 coordinates. For compatibility with the solid-tumor ONT data, a second version of these block coordinates was generated in hg38 using liftOver.

Analysis of IMR blocks and other regions in WGBS and ONT data from solid tumors

To analyze methylation patterns in solid tumor WGBS and ONT data, we first defined several sets of genomic regions for analysis. These included the IMR blocks described above, CpG islands downloaded from the UCSC genome browser, and tissue-specific TSS and enhancers extracted from Roadmap Epigenomics Consortium ChromHMM annotations (model identifiers: E035 for blood, E073 for brain, E096 for lung, E075 for colon, E027 for breast, E098 for pancreas)(50). For the tumor methylation data, we used datasets from GEO: GSE121721 (WGBS: brain), GSE212391 (WGBS: other solid tumors), and GSE270257 (ONT: all solid tumors). For the matched normal samples, we downloaded beta value files from the DNA methylation atlas(45). All datasets were processed into a unified CpG-level bedmethyl format; for the normal samples, this conversion was performed using wgbstools(67). We then used methfast (https://github.com/dhslab/methfast) to aggregate CpG methylation counts from these bedmethyl files over all genomic regions of interest. For the blood cell type analyses, we used the same WGBS and ONT data for normal and AML samples as previously described. In Figure 5 and Figure S5, these normal hematopoietic cells and AML tumor samples are represented as the ‘blood’ normal and tumor category, respectively. Finally, to identify epimutations (in WGBS and ONT) and ASM (in ONT), we applied the same Fisher’s exact test-based approaches used for the hematopoietic cells, which involved comparing aggregated counts between tumor and normal samples for epimutation, and between the two haplotypes for ASM

Supplementary Material

Supplement 1

Acknowledgements:

We thank the Miga laboratory at University of California-Santa Cruz for assistance generating the ONT data, M. Jain at Northeastern University for consulting on ONT data analysis, and A. Stergachis at the University of Washington for reagents and assistance with Fiber-seq. Hematopoietic stem cells were purchased from the Cooperative Centers of Excellence in Hematology at the Fred Hutchinson Cancer Center, which is supported in part by NIDDK Grant # DK106829. We are grateful to the patients and research subjects at Alvin J. Siteman Cancer Center at Washington University School of Medicine and Barnes-Jewish Hospital in St. Louis for their participation in the Genomics of AML banking study, which is supported by WashU AML SPORE (P50CA171963, Daniel C. Link, PI), and thank the Siteman Cancer Center for the use of the Siteman Tissue Procurement and Flow Cytometry Cores. The Siteman Cancer Center is supported in part by an NCI Cancer Center Support Grant P30CA091842. We also thank D. Russler-Germain, A. Young, and J. Edwards at Wash U for helpful comments when preparing this manuscript.

Funding:

This work was supported by National Institutes of Health grants R00GM140251 to MPM and R37CA259359 to DHS.

Data and materials availability:

Raw and processed genomic data has been deposited to dbGaP (PHS000159) and GEO (accession pending), respectively.

Footnotes

Competing interests:

None

References and Notes:

  • 1.Feinberg A. P., Vogelstein B., Hypomethylation distinguishes genes of some human cancers from their normal counterparts. Nature 301, 89–92 (1983). [DOI] [PubMed] [Google Scholar]
  • 2.Feinberg A. P., Vogelstein B., Hypomethylation of ras oncogenes in primary human cancers. Biochem Bioph Res Co 111, 47–54 (1983). [Google Scholar]
  • 3.Gama-Sosa M. A., Slagel V. A., Trewyn R. W., Oxenhandler R., Kuo K. C., Gehrke C. W., Ehrlich M., The 5-methylcytosine content of DNA from human tumors. Nucleic Acids Res. 11, 6883–6894 (1983). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Spencer D. H., Russler-Germain D. A., Ketkar S., Helton N. M., Lamprecht T. L., Fulton R. S., Fronick C. C., O’Laughlin M., Heath S. E., Shinawi M., Westervelt P., Payton J. E., Wartman L. D., Welch J. S., Wilson R. K., Walter M. J., Link D. C., DiPersio J. F., Ley T. J., CpG Island Hypermethylation Mediated by DNMT3A Is a Consequence of AML Progression. Cell 168, 801–816.e13 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wilson E. R., Helton N. M., Heath S. E., Fulton R. S., Payton J. E., Welch J. S., Walter M. J., Westervelt P., DiPersio J. F., Link D. C., Miller C. A., Ley T. J., Spencer D. H., Focal disruption of DNA methylation dynamics at enhancers in IDH-mutant AML cells. Leukemia 36, 935–945 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Glass J. L., Hassane D., Wouters B. J., Kunimoto H., Avellino R., Garrett-Bakelman F. E., Guryanova O. A., Bowman R., Redlich S., Intlekofer A. M., Meydan C., Qin T., Fall M., Alonso A., Guzman M. L., Valk P. J. M., Thompson C. B., Levine R., Elemento O., Delwel R., Melnick A., Figueroa M. E., Epigenetic Identity in AML Depends on Disruption of Nonpromoter Regulatory Elements and Is Affected by Antagonistic Effects of Mutations in Epigenetic Modifiers. Cancer Discov 7, 868–883 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Capper D., Jones D. T. W., Sill M., Hovestadt V., Schrimpf D., Sturm D., Koelsche C., Sahm F., Chavez L., Reuss D. E., Kratz A., Wefers A. K., Huang K., Pajtler K. W., Schweizer L., Stichel D., Olar A., Engel N. W., Lindenberg K., Harter P. N., Braczynski A. K., Plate K. H., Dohmen H., Garvalov B. K., Coras R., Hölsken A., Hewer E., Bewerunge-Hudler M., Schick M., Fischer R., Beschorner R., Schittenhelm J., Staszewski O., Wani K., Varlet P., Pages M., Temming P., Lohmann D., Selt F., Witt H., Milde T., Witt O., Aronica E., Giangaspero F., Rushing E., Scheurlen W., Geisenberger C., Rodriguez F. J., Becker A., Preusser M., Haberler C., Bjerkvig R., Cryan J., Farrell M., Deckert M., Hench J., Frank S., Serrano J., Kannan K., Tsirigos A., Brück W., Hofer S., Brehmer S., Seiz-Rosenhagen M., Hänggi D., Hans V., Rozsnoki S., Hansford J. R., Kohlhof P., Kristensen B. W., Lechner M., Lopes B., Mawrin C., Ketter R., Kulozik A., Khatib Z., Heppner F., Koch A., Jouvet A., Keohane C., Mühleisen H., Mueller W., Pohl U., Prinz M., Benner A., Zapatka M., Gottardo N. G., Driever P. H., Kramm C. M., Müller H. L., Rutkowski S., von Hoff K., Frühwald M. C., Gnekow A., Fleischhack G., Tippelt S., Calaminus G., Monoranu C.-M., Perry A., Jones C., Jacques T. S., Radlwimmer B., Gessi M., Pietsch T., Schramm J., Schackert G., Westphal M., Reifenberger G., Wesseling P., Weller M., Collins V. P., Blümcke I., Bendszus M., Debus J., Huang A., Jabado N., Northcott P. A., Paulus W., Gajjar A., Robinson G. W., Taylor M. D., Jaunmuktane Z., Ryzhova M., Platten M., Unterberg A., Wick W., Karajannis M. A., Mittelbronn M., Acker T., Hartmann C., Aldape K., Schüller U., Buslei R., Lichter P., Kool M., Herold-Mende C., Ellison D. W., Hasselblatt M., Snuderl M., Brandner S., Korshunov A., von Deimling A., Pfister S. M., DNA methylation-based classification of central nervous system tumours. Nature 555, 469–474 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Koelsche C., Schrimpf D., Stichel D., Sill M., Sahm F., Reuss D. E., Blattner M., Worst B., Heilig C. E., Beck K., Horak P., Kreutzfeldt S., Paff E., Stark S., Johann P., Selt F., Ecker J., Sturm D., Pajtler K. W., Reinhardt A., Wefers A. K., Sievers P., Ebrahimi A., Suwala A., Fernández-Klett F., Casalini B., Korshunov A., Hovestadt V., Kommoss F. K. F., Kriegsmann M., Schick M., Bewerunge-Hudler M., Milde T., Witt O., Kulozik A. E., Kool M., Romero-Pérez L., Grünewald T. G. P., Kirchner T., Wick W., Platten M., Unterberg A., Uhl M., Abdollahi A., Debus J., Lehner B., Thomas C., Hasselblatt M., Paulus W., Hartmann C., Staszewski O., Prinz M., Hench J., Frank S., Versleijen-Jonkers Y. M. H., Weidema M. E., Mentzel T., Griewank K., de Álava E., Martín J. D., Gastearena M. A. I., Chang K. T.-E., Low S. Y. Y., Cuevas-Bourdier A., Mittelbronn M., Mynarek M., Rutkowski S., Schüller U., Mautner V. F., Schittenhelm J., Serrano J., Snuderl M., Büttner R., Klingebiel T., Buslei R., Gessler M., Wesseling P., Dinjens W. N. M., Brandner S., Jaunmuktane Z., Lyskjær I., Schirmacher P., Stenzinger A., Brors B., Glimm H., Heining C., Tirado O. M., Sáinz-Jaspeado M., Mora J., Alonso J., del Muro X. G., Moran S., Esteller M., Benhamida J. K., Ladanyi M., Wardelmann E., Antonescu C., Flanagan A., Dirksen U., Hohenberger P., Baumhoer D., Hartmann W., Vokuhl C., Flucke U., Petersen I., Mechtersheimer G., Capper D., Jones D. T. W., Fröhling S., Pfister S. M., von Deimling A., Sarcoma classification by DNA methylation profiling. Nat. Commun. 12, 498 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Giacopelli B., Wang M., Cleary A., Wu Y.-Z., Schultz A. R., Schmutz M., Blachly J. S., Eisfeld A.-K., Mundy-Bosse B., Vosberg S., Greif P. A., Claus R., Bullinger L., Garzon R., Coombes K. R., Bloomfield C. D., Druker B. J., Tyner J. W., Byrd J. C., Oakes C. C., DNA methylation epitypes highlight underlying developmental and disease pathways in acute myeloid leukemia. Genome Res 31, gr.269233.120 (2021). [Google Scholar]
  • 10.Baylin S., Bestor T. H., Altered methylation patterns in cancer cell genomes: Cause or consequence? Cancer Cell 1, 299–305 (2002). [DOI] [PubMed] [Google Scholar]
  • 11.Hitchins M. P., Constitutional epimutation as a mechanism for cancer causality and heritability? Nat. Rev. Cancer 15, 625–634 (2015). [DOI] [PubMed] [Google Scholar]
  • 12.Horsthemke B., Heritable germline epimutations in humans. Nat. Genet. 39, 573–574 (2007). [Google Scholar]
  • 13.Poole R. L., Docherty L. E., Sayegh A. A., Caliebe A., Turner C., Baple E., Wakeling E., Harrison L., Lehmann A., Temple I. K., Mackay D. J. G., O. behalf of the I. C. I. Consortium, Targeted methylation testing of a patient cohort broadens the epigenetic and clinical description of imprinting disorders. Am. J. Méd. Genet. Part A 161, 2174–2182 (2013). [Google Scholar]
  • 14.Buiting K., Groβ S., Lich C., Gillessen-Kaesbach G., El-Maarri O., Horsthemke B., Epimutations in Prader-Willi and Angelman Syndromes: A Molecular Study of 136 Patients with an Imprinting Defect. Am. J. Hum. Genet. 72, 571–577 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hitchins M. P., Ward R. L., Constitutional (germline) MLH1 epimutation as an aetiological mechanism for hereditary non-polyposis colorectal cancer. J. Méd. Genet. 46, 793 (2009). [DOI] [PubMed] [Google Scholar]
  • 16.Suter C. M., Martin D. I. K., Ward R. L., Germline epimutation of MLH1 in individuals with multiple cancers. Nat. Genet. 36, 497–501 (2004). [DOI] [PubMed] [Google Scholar]
  • 17.Mulet-Lazaro R., van Herk S., Erpelinck C., Bindels E., Sanders M. A., Vermeulen C., Renkens I., Valk P., Melnick A. M., de Ridder J., Rehli M., Gebhard C., Delwel R., Wouters B. J., Allele-specific expression of GATA2 due to epigenetic dysregulation in CEBPA double-mutant AML. Blood 138, 160–177 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Simpkins S. B., Bocker T., Swisher E. M., Mutch D. G., Gersell D. J., Kovatich A. J., Palazzo J. P., Fishel R., Goodfellow P. J., MLH1 Promoter Methylation and Gene Silencing is the Primary Cause of Microsatellite Instability in Sporadic Endometrial Cancers. Hum. Mol. Genet. 8, 661–666 (1999). [DOI] [PubMed] [Google Scholar]
  • 19.O’Neill K., Pleasance E., Fan J., Akbari V., Chang G., Dixon K., Csizmok V., MacLennan S., Porter V., Galbraith A., Grisdale C. J., Culibrk L., Dupuis J. H., Corbett R., Hopkins J., Bowlby R., Pandoh P., Smailus D. E., Cheng D., Wong T., Frey C., Shen Y., Paulin L. F., Sedlazeck F. J., Nelson J. M. T., Chuah E., Mungall K. L., Moore R. A., Coope R., Mungall A. J., McConechy M. K., Williamson L. M., Schrader K. A., Yip S., Marra M. A., Laskin J., Jones S. J. M., Long-read sequencing of an advanced cancer cohort resolves rearrangements, unravels haplotypes, and reveals methylation landscapes. medRxiv, 2024.02.20.24302959 (2024). [Google Scholar]
  • 20.Gimelbrant A., Hutchinson J. N., Thompson B. R., Chess A., Widespread Monoallelic Expression on Human Autosomes. Science 318, 1136–1140 (2007). [DOI] [PubMed] [Google Scholar]
  • 21.Stewart O., Gruber C., Randolph H. E., Patel R., Ramba M., Calzoni E., Huang L. H., Levy J., Buta S., Lee A., Sazeides C., Prue Z., van Konijnenburg D. P. H., Chinn I. K., Pedroza L. A., Lupski J. R., Schmitt E. G., Cooper M. A., Puel A., Peng X., Boisson-Dupuis S., Bustamante J., Okada S., Martin-Fernandez M., Orange J. S., Casanova J.-L., Milner J. D., Bogunovic D., Monoallelic expression can govern penetrance of inborn errors of immunity. Nature, 1–12 (2025). [Google Scholar]
  • 22.Seraihi A. F. A., Rio-Machin A., Tawana K., Bödör C., Wang J., Nagano A., Heward J. A., Iqbal S., Best S., Lea N., McLornan D., Kozyra E. J., Wlodarski M. W., Niemeyer C. M., Scott H., Hahn C., Ellison A., Tummala H., Cardoso S. R., Vulliamy T., Dokal I., Butler T., Smith M., Cavenagh J., Fitzgibbon J., GATA2 monoallelic expression underlies reduced penetrance in inherited GATA2-mutated MDS/AML. Leukemia 32, 2502–2507 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Marion-Poll L., Forêt B., Zielinski D., Massip F., Attia M., Carter A. C., Syx L., Chang H. Y., Gendrel A.-V., Heard E., Locus specific epigenetic modalities of random allelic expression imbalance. Nat. Commun. 12, 5330 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Deng Q., Ramsköld D., Reinius B., Sandberg R., Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells. Science 343, 193–196 (2014). [DOI] [PubMed] [Google Scholar]
  • 25.Noushmehr H., Weisenberger D. J., Diefes K., Phillips H. S., Pujara K., Berman B. P., Pan F., Pelloski C. E., Sulman E. P., Bhat K. P., Verhaak R. G. W., Hoadley K. A., Hayes D. N., Perou C. M., Schmidt H. K., Ding L., Wilson R. K., Berg D. V. D., Shen H., Bengtsson H., Neuvial P., Cope L. M., Buckley J., Herman J. G., Baylin S. B., Laird P. W., Aldape K., Network T. C. G. A. R., Identification of a CpG Island Methylator Phenotype that Defines a Distinct Subgroup of Glioma. Cancer Cell 17, 510–522 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Ley T. J., Miller C., Ding L., Raphael B. J., Mungall A. J., Robertson A. G., Hoadley K., Triche T. J., Laird P. W., Baty J. D., Fulton L. L., Fulton R., Heath S. E., Kalicki-Veizer J., Kandoth C., Klco J. M., Koboldt D. C., Kanchi K.-L., Kulkarni S., Lamprecht T. L., Larson D. E., Lin L., Lu C., McLellan M. D., McMichael J. F., Payton J., Schmidt H., Spencer D. H., Tomasson M. H., Wallis J. W., Wartman L. D., Watson M. A., Welch J., Wendl M. C., Ally A., Balasundaram M., Birol I., Butterfield Y., Chiu R., Chu A., Chuah E., Chun H.-J., Corbett R., Dhalla N., Guin R., He A., Hirst C., Hirst M., Holt R. A., Jones S., Karsan A., Lee D., Li H. I., Marra M. A., Mayo M., Moore R. A., Mungall K., Parker J., Pleasance E., Plettner P., Schein J., Stoll D., Swanson L., Tam A., Thiessen N., Varhol R., Wye N., Zhao Y., Gabriel S., Getz G., Sougnez C., Zou L., Leiserson M. D. M., Vandin F., Wu H.-T., Applebaum F., Baylin S. B., Akbani R., Broom B. M., Chen K., Motter T. C., Nguyen K., Weinstein J. N., Zhang N., Ferguson M. L., Adams C., Black A., Bowen J., Gastier-Foster J., Grossman T., Lichtenberg T., Wise L., Davidsen T., Demchok J. A., Shaw K. R. M., Sheth M., Sofia H. J., Yang L., Downing J. R., Eley G., Genomic and Epigenomic Landscapes of Adult De Novo Acute Myeloid Leukemia. New Engl J Med 368, 2059–2074 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mazor T., Pankov A., Johnson B. E., Hong C., Hamilton E. G., Bell R. J. A., Smirnov I. V., Reis G. F., Phillips J. J., Barnes M. J., Idbaih A., Alentorn A., Kloezeman J. J., Lamfers M. L. M., Bollen A. W., Taylor B. S., Molinaro A. M., Olshen A. B., Chang S. M., Song J. S., Costello J. F., DNA Methylation and Somatic Mutations Converge on the Cell Cycle and Define Similar Evolutionary Histories in Brain Tumors. Cancer Cell 28, 307–317 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Network T. C. G. A. R., Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. New Engl J Medicine 372, 2481–2498 (2015). [Google Scholar]
  • 29.Figueroa M. E., Lugthart S., Li Y., Erpelinck-Verschueren C., Deng X., Christos P. J., Schifano E., Booth J., van Putten W., Skrabanek L., Campagne F., Mazumdar M., Greally J. M., Valk P. J. M., Löwenberg B., Delwel R., Melnick A., DNA Methylation Signatures Identify Biologically Distinct Subtypes in Acute Myeloid Leukemia. Cancer Cell 17, 13–27 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ernst J., Kellis M., Chromatin-state discovery and genome annotation with ChromHMM. Nat Protoc 12, 2478–2492 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Elliott G., Hong C., Xing X., Zhou X., Li D., Coarfa C., Bell R. J. A., Maire C. L., Ligon K. L., Sigaroudinia M., Gascard P., Tlsty T. D., Harris R. A., Schalkwyk L. C., Bilenky M., Mill J., Farnham P. J., Kellis M., Marra M. A., Milosavljevic A., Hirst M., Stormo G. D., Wang T., Costello J. F., Intermediate DNA methylation is a conserved signature of genome regulation. Nat. Commun. 6, 6363 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ding Y., Cai K., Liu L., Zhang Z., Zheng X., Shi J., mHapTk: a comprehensive toolkit for the analysis of DNA methylation haplotypes. Bioinformatics 38, 5141–5143 (2022). [DOI] [PubMed] [Google Scholar]
  • 33.Feng Y., Zhang Z., Hong Y., Ding Y., Liu L., Gao S., Fang H., Shi J., A DNA methylation haplotype block landscape in human tissues and preimplantation embryos reveals regulatory elements defined by comethylation patterns. Genome Res. 33, 2041–2052 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Guo S., Diep D., Plongthongkum N., Fung H.-L., Zhang K., Zhang K., Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA. Nat. Genet. 49, 635–642 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Stergachis A. B., Debo B. M., Haugen E., Churchman L. S., Stamatoyannopoulos J. A., Single-molecule regulatory architectures captured by chromatin fiber sequencing. Science 368, 1449– 1454 (2020). [DOI] [PubMed] [Google Scholar]
  • 36.Slade M. J., Ghasemi R., O’Laughlin M., Burton T., Fulton R. S., Abel H. J., Duncavage E. J., Ley T. J., Jacoby M. A., Spencer D. H., Persistent Molecular Disease in Adult Patients With AML Evaluated With Whole-Exome and Targeted Error-Corrected DNA Sequencing. JCO Precis. Oncol. 7, e2200559 (2023). [Google Scholar]
  • 37.Hu K., Yu J., Suknuntha K., Tian S., Montgomery K., Choi K.-D., Stewart R., Thomson J. A., Slukvin I. I., Efficient generation of transgene-free induced pluripotent stem cells from normal and neoplastic bone marrow and cord blood mononuclear cells. Blood 117, e109–e119 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wilson V. L., Jones P. A., DNA Methylation Decreases in Aging But Not in Immortal Cells. Science 220, 1055–1057 (1983). [DOI] [PubMed] [Google Scholar]
  • 39.Rosenski J., Peretz A., Magenheim J., Loyfer N., Shemer R., Glaser B., Dor Y., Kaplan T., Atlas of imprinted and allele-specific DNA methylation in the human body. Nat. Commun. 16, 2141 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Loyfer N., Magenheim J., Peretz A., Cann G., Bredno J., Klochendler A., Fox-Fisher I., Shabi-Porat S., Hecht M., Pelet T., Moss J., Drawshy Z., Amini H., Moradi P., Nagaraju S., Bauman D., Shveiky D., Porat S., Dior U., Rivkin G., Or O., Hirshoren N., Carmon E., Pikarsky A., Khalaileh A., Zamir G., Grinbaum R., Gazala M. A., Mizrahi I., Shussman N., Korach A., Wald O., Izhar U., Erez E., Yutkin V., Samet Y., Golinkin D. R., Spalding K. L., Druid H., Arner P., Shapiro A. M. J., Grompe M., Aravanis A., Venn O., Jamshidi A., Shemer R., Dor Y., Glaser B., Kaplan T., A DNA methylation atlas of normal human cell types. Nature, 1– 10 (2023). [Google Scholar]
  • 41.O’Neill K., Pleasance E., Fan J., Akbari V., Chang G., Dixon K., Csizmok V., MacLennan S., Porter V., Galbraith A., Grisdale C. J., Culibrk L., Dupuis J. H., Corbett R., Hopkins J., Bowlby R., Pandoh P., Smailus D. E., Cheng D., Wong T., Frey C., Shen Y., Lewis E., Paulin L. F., Sedlazeck F. J., Nelson J. M. T., Chuah E., Mungall K. L., Moore R. A., Coope R., Mungall A. J., McConechy M. K., Williamson L. M., Schrader K. A., Yip S., Marra M. A., Laskin J., Jones S. J. M., Long-read sequencing of an advanced cancer cohort resolves rearrangements, unravels haplotypes, and reveals methylation landscapes. Cell Genom. 4, 100674 (2024). [Google Scholar]
  • 42.Zhang Z., Hong Y., Zhang S., Zhu X., Liu L., Liao X., Gu H., Fang H., Shi J., Toward the DNA methylation haplotype map of 11 common solid cancers. Cell Rep. 44, 116197 (2025). [Google Scholar]
  • 43.Costa A. L., Doncevic D., Wu Y., Yang L., Man K. H., Spreng A.-S., Winter H., Fletcher M. N. C., Radlwimmer B., Herrmann C., A new IDH-independent hypermethylation phenotype is associated with astrocyte-like cell state in glioblastoma. Genome Biol. 26, 192 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Wu Y., Fletcher M., Gu Z., Wang Q., Costa B., Bertoni A., Man K.-H., Schlotter M., Felsberg J., Mangei J., Barbus M., Gaupel A.-C., Wang W., Weiss T., Eils R., Weller M., Liu H., Reifenberger G., Korshunov A., Angel P., Lichter P., Herrmann C., Radlwimmer B., Glioblastoma epigenome profiling identifies SOX10 as a master regulator of molecular tumour subtype. Nat. Commun. 11, 6434 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kundaje A., Meuleman W., Ernst J., Bilenky M., Yen A., Heravi-Moussavi A., Kheradpour P., Zhang Z., Wang J., Ziller M. J., Amin V., Whitaker J. W., Schultz M. D., Ward L. D., Sarkar A., Quon G., Sandstrom R. S., Eaton M. L., Wu Y.-C., Pfenning A., Wang X., Liu M. C., Coarfa C., Harris R. A., Shoresh N., Epstein C. B., Gjoneska E., Leung D., Xie W., Hawkins R. D., Lister R., Hong C., Gascard P., Mungall A. J., Moore R., Chuah E., Tam A., Canfield T. K., Hansen R. S., Kaul R., Sabo P. J., Bansal M. S., Carles A., Dixon J. R., Farh K.-H., Feizi S., Karlic R., Kim A.-R., Kulkarni A., Li D., Lowdon R., Elliott G., Mercer T. R., Neph S. J., Onuchic V., Polak P., Rajagopal N., Ray P., Sallari R. C., Siebenthall K. T., Sinnott-Armstrong N. A., Stevens M., Thurman R. E., Wu J., Zhang B., Zhou X., Abdennur N., Adli M., Akerman M., Barrera L., Antosiewicz-Bourget J., Ballinger T., Barnes M. J., Bates D., Bell R. J. A., Bennett D. A., Bianco K., Bock C., Boyle P., Brinchmann J., Caballero-Campo P., Camahort R., Carrasco-Alfonso M. J., Charnecki T., Chen H., Chen Z., Cheng J. B., Cho S., Chu A., Chung W.-Y., Cowan C., Deng Q. A., Deshpande V., Diegel M., Ding B., Durham T., Echipare L., Edsall L., Flowers D., Genbacev-Krtolica O., Gifford C., Gillespie S., Giste E., Glass I. A., Gnirke A., Gormley M., Gu H., Gu J., Hafler D. A., Hangauer M. J., Hariharan M., Hatan M., Haugen E., He Y., Heimfeld S., Herlofsen S., Hou Z., Humbert R., Issner R., Jackson A. R., Jia H., Jiang P., Johnson A. K., Kadlecek T., Kamoh B., Kapidzic M., Kent J., Kim A., Kleinewietfeld M., Klugman S., Krishnan J., Kuan S., Kutyavin T., Lee A.-Y., Lee K., Li J., Li N., Li Y., Ligon K. L., Lin S., Lin Y., Liu J., Liu Y., Luckey C. J., Ma Y. P., Maire C., Marson A., Mattick J. S., Mayo M., McMaster M., Metsky H., Mikkelsen T., Miller D., Miri M., Mukame E., Nagarajan R. P., Neri F., Nery J., Nguyen T., O’Geen H., Paithankar S., Papayannopoulou T., Pelizzola M., Plettner P., Propson N. E., Raghuraman S., Raney B. J., Raubitschek A., Reynolds A. P., Richards H., Riehle K., Rinaudo P., Robinson J. F., Rockweiler N. B., Rosen E., Rynes E., Schein J., Sears R., Sejnowski T., Shafer A., Shen L., Shoemaker R., Sigaroudinia M., Slukvin I., Stehling-Sun S., Stewart R., Subramanian S. L., Suknuntha K., Swanson S., Tian S., Tilden H., Tsai L., Urich M., Vaughn I., Vierstra J., Vong S., Wagner U., Wang H., Wang T., Wang Y., Weiss A., Whitton H., Wildberg A., Witt H., Won K.-J., Xie M., Xing X., Xu I., Xuan Z., Ye Z., Yen C., Yu P., Zhang X., Zhang X., Zhao J., Zhou Y., Zhu J., Zhu Y., Ziegler S., Beaudet A. E., Boyer L. A., Jager P. L. D., Farnham P. J., Fisher S. J., Haussler D., Jones S. J. M., Li W., Marra M. A., McManus M. T., Sunyaev S., Thomson J. A., Tlsty T. D., Tsai L.-H., Wang W., Waterland R. A., Zhang M. Q., Chadwick L. H., Bernstein B. E., Costello J. F., Ecker J. R., Hirst M., Meissner A., Milosavljevic A., Ren B., Stamatoyannopoulos J. A., Wang T., Kellis M., Kundaje A., Meuleman W., Ernst J., Bilenky M., Yen A., Heravi-Moussavi A., Kheradpour P., Zhang Z., Wang J., Ziller M. J., Amin V., Whitaker J. W., Schultz M. D., Ward L. D., Sarkar A., Quon G., Sandstrom R. S., Eaton M. L., Wu Y.-C., Pfenning A. R., Wang X., Claussnitzer M., Liu Y., Coarfa C., Harris R. A., Shoresh N., Epstein C. B., Gjoneska E., Leung D., Xie W., Hawkins R. D., Lister R., Hong C., Gascard P., Mungall A. J., Moore R., Chuah E., Tam A., Canfield T. K., Hansen R. S., Kaul R., Sabo P. J., Bansal M. S., Carles A., Dixon J. R., Farh K.-H., Feizi S., Karlic R., Kim A.-R., Kulkarni A., Li D., Lowdon R., Elliott G., Mercer T. R., Neph S. J., Onuchic V., Polak P., Rajagopal N., Ray P., Sallari R. C., Siebenthall K. T., Sinnott-Armstrong N. A., Stevens M., Thurman R. E., Wu J., Zhang B., Zhou X., Beaudet A. E., Boyer L. A., Jager P. L. D., Farnham P. J., Fisher S. J., Haussler D., Jones S. J. M., Li W., Marra M. A., McManus M. T., Sunyaev S., Thomson J. A., Tlsty T. D., Tsai L.-H., Wang W., Waterland R. A., Zhang M. Q., Chadwick L. H., Bernstein B. E., Costello J. F., Ecker J. R., Hirst M., Meissner A., Milosavljevic A., Ren B., Stamatoyannopoulos J. A., Wang T., Kellis M., Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chen M., Fu R., Chen Y., Li L., Wang S.-W., High-resolution, noninvasive single-cell lineage tracing in mice and humans based on DNA methylation epimutations. Nat. Methods, 1–11 (2025). [DOI] [PubMed] [Google Scholar]
  • 47.Scherer M., Singh I., Braun M. M., Szu-Tu C., Sanchez P. S., Lindenhofer D., Jakobsen N. A., Körber V., Kardorff M., Nitsch L., Kautz P., Rühle J., Bianchi A., Cozzuto L., Frömel R., Beneyto-Calabuig S., Lareau C., Satpathy A. T., Beekman R., Steinmetz L. M., Raffel S., Ludwig L. S., Vyas P., Rodriguez-Fraticelli A., Velten L., Clonal tracing with somatic epimutations reveals dynamics of blood ageing. Nature 643, 478–487 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES