Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2024 Jan 23;52(4):1602–1612. doi: 10.1093/nar/gkad1232

Reconstructing DNA methylation maps of ancient populations

Arielle Barouch 1,2, Yoav Mathov 3,4, Eran Meshorer 5,6, Benjamin Yakir 7, Liran Carmel 8,
PMCID: PMC10939417  PMID: 38261973

Abstract

Studying premortem DNA methylation from ancient DNA (aDNA) provides a proxy for ancient gene activity patterns, and hence valuable information on evolutionary changes in gene regulation. Due to statistical limitations, current methods to reconstruct aDNA methylation maps are constrained to high-coverage shotgun samples, which comprise a small minority of available ancient samples. Most samples are sequenced using in-situ hybridization capture sequencing which targets a predefined set of genomic positions. Here, we develop methods to reconstruct aDNA methylation maps of samples that were not sequenced using high-coverage shotgun sequencing, by way of pooling together individuals to obtain a DNA methylation map that is characteristic of a population. We show that the resulting DNA methylation maps capture meaningful biological information and allow for the detection of differential methylation across populations. We offer guidelines on how to carry out comparative studies involving ancient populations, and how to control the rate of falsely discovered differentially methylated regions. The ability to reconstruct DNA methylation maps of past populations allows for the development of a whole new frontier in paleoepigenetic research, tracing DNA methylation changes throughout human history, using data from thousands of ancient samples.

Graphical Abstract

Graphical Abstract.

Graphical Abstract

Introduction

The ability to computationally reconstruct premortem DNA methylation maps in ancient humans (1–3) provided a useful tool to identify evolutionary changes in gene regulation. This opened up the field of paleoepigenetics (4,5) that was used to study regulatory changes that underlie phenotypic adaptations in humans. For example, comparing DNA methylation across human groups led to the identification of roughly 3000 differentially methylated regions (DMRs) separating Neanderthals, Denisovans, and modern humans (3). Analysis of these DMRs revealed that the vocal and facial anatomy is derived in modern humans and differs from that in Neanderthals and Denisovans (3). These DMRs were also used to predict anatomical features of the Denisovan, whose physical appearance has not yet been recovered in the fossil record (6).

The reconstruction method utilizes the fact that deamination, the predominant chemical decay process in ancient DNA (aDNA), works differently on methylated and unmethylated cytosines, turning the former into thymines and the latter into uracils (7). aDNA typically consists of double-stranded fragments with short single-stranded overhangs. The deamination rate varies substantially between these two regions. Single-stranded DNA can exhibit deamination rates as high as tens of percentages, while double-stranded DNA shows rates around 1% or lower (8). The overall average deamination rate across a fragment, when considering both regions, typically falls within the range of 1–3% (1,3). aDNA library preparation protocols that use full or partial USER (User Specific Excision Reagent) treatment trim the molecules at uracil positions but do not process thymines (7,9). This enables the fraction of thymines in positions where cytosines can be potentially methylated (CpG positions, where cytosines are followed by a guanine), known as the CInline graphicT ratio, to be used as a proxy for premortem DNA methylation level (1,2).

Paleoepigenetics holds great promise to shed new light on human evolution, but its applicability is limited because the reconstruction of aDNA methylation maps requires shotgun sequencing with a coverage of at least ∼15x. Such high-coverage samples, which also went through USER treatment, are relatively rare and comprise a Neanderthal (10), a Denisovan (11), and a few anatomically modern humans (AMHs) (3,12,13). In particular, reconstructing aDNA methylation is infeasible for the thousands of individuals sequenced using in-solution hybridization capture, which is the most widely used technology in the aDNA field. Hybridization capture probes target pre-determined loci, usually a large set of single-nucleotide polymorphic sites (SNPs). The most popular probe set used with aDNA is known as the 1240K set, targeting 1233013 SNPs and designed as a merger of the 390k set (targeting 394 577 SNPs) (14) and the 840k set (targeting 842 630 SNPs) (15). Hybridization capture enabled low-coverage sequencing of thousands of ancient individuals and was shown to provide much information on demographic events in past populations (16–18). Many key populations have been deeply investigated using 1240K hybridization capture, including the Neolithic and Bronze Age Europe (19), pre-contact Caribbeans (20,21), and Bronze Age Levant (22).

Here we show that although samples sequenced to low coverage using hybridization capture are unamenable to the reconstruction of DNA methylation, a combination of several samples from the same population can still provide a useful DNA methylation map. The key to our approach is the observation that DNA methylation patterns of individuals from the same population are more similar than DNA methylation patterns of individuals from different populations (23,24). This means that populations have characteristic DNA methylation profiles, in a similar way by which genetic profiles of populations can be defined based on genotypes. The differences in DNA methylation profiles between populations represent a combination of genotype differences, differences in shared environment and lifestyle, and random effects (25–28).

Instead of reconstructing individual DNA methylation maps, we develop here a method that pools samples together to obtain DNA methylation maps that represent entire populations. Combining the facts that each position may be sampled by multiple individuals, that each probe targets a region of approximately 100 base pairs around the target site (14,15), and that off-target sequences that capture non-target genomic regions are common, we show that the number of sampled ancient individuals in current studies is sufficient to provide a reliable reconstruction of genome-wide DNA methylation. We demonstrate the method on seven ancient populations and show that population-wise comparisons provide meaningful information and can be used to trace changes in DNA methylation throughout human history.

Materials and methods

Computation of Inline graphic ratio

Reconstruction of DNA methylation proceeded according to the method described in Gokhman et al. (3), which is based on evaluating the Inline graphicratio in a position Inline graphic as Inline graphic, where Inline graphic and Inline graphic are the counts of thymines and cytosines, respectively, in that position.

Filtering out CpG positions

To remove possible PCR duplicates, coverage histogram was computed per chromosome per sample. After smoothing the histogram using a running window of length 5, we set the coverage threshold as the first coverage for which the histogram count was one or less, and then removed from the analysis all positions with coverage equal to or greater than the threshold.

CpG positions with a high rate of Inline graphicratio are likely to represent mutations rather than postmortem degradation. To filter them out, we removed positions whose Inline graphicratio exceeded 0.25 (1).

Determining window size

We collect statistics on the Inline graphic’s and Inline graphic’s in a window of size Inline graphic. Let Inline graphic be the effective coverage.

We require that the probability of observing a total of zero Inline graphic’s in the window for a minimum methylation level Inline graphic is less than Inline graphic. This translates into

graphic file with name M00016.gif

Taking log of both sides we get

graphic file with name M00017.gif

meaning that we have to have a minimum accumulated coverage of

graphic file with name M00018.gif

in the window. If the window is covered by the average effective coverage, then Inline graphic. This translates into the following window size:

graphic file with name M00020.gif

Methylation in pooled samples

We tested two pooling procedures. For both, let us have Inline graphic samples, indexed by Inline graphic, and let us index the CpG positions by Inline graphic.

Naïve pooling: Let Inline graphic and Inline graphic count the number of cytosines and thymines, respectively, at position Inline graphic in sample Inline graphic. After pooling, the counts in this position are defined as the sum of counts in all samples,

graphic file with name M00028.gif

Note that each sample has the same weight, regardless of potential difference in deamination rates or sequencing quality. After pooling, processing continues as if it were a regular single sample (3).

We used Inline graphic ratio values to estimate methylation through the application of histogram matching (29). In essence, histogram matching is a technique that takes two datasets, called the signal and the reference, and devises a nonlinear transformation that—once applied to the signal—adjusts its histogram to be as close as possible to that of the reference. Let Inline graphic, Inline graphic be the cumulative histogram of the reference and Inline graphic, Inline graphic be the original cumulative histogram of the signal. Let Inline graphic be the values of the bins for histogram Inline graphic, and Inline graphic be the values of the bins for histogram Inline graphic. Then, we would like to find a transformation Inline graphic such that Inline graphic, and the cumulative histogram of the transformed signal, Inline graphic, is as similar as possible to Inline graphic. For each Inline graphic, we find the index Inline graphic where Inline graphic is most similar to Inline graphic, namely

graphic file with name M00046.gif

Then, the transformation Inline graphic is defined as

graphic file with name M00048.gif

To this end, we computed the nonlinear transformation of the Inline graphic values that would produce a histogram that is as close as possible to a target histogram of measured DNA methylation in modern human bone (Bone2).

Advanced pooling: Let us consider a window around position Inline graphic, which contains in individual Inline graphic a set Inline graphic of informative CpG sites. We assume that Inline graphic, otherwise this position is not further considered. We also assume the window is sufficiently small, such that the methylation level of its CpG sites in each individual Inline graphic are similar. Finally, we assume the population is homogeneous, and consequently denote the common methylation value in this window as Inline graphic.

We assume that the observed number of thymine bases in window Inline graphic for sample Inline graphicis binomially distributed, Inline graphic, where

graphic file with name M00059.gif

is the total number of reads covering the CpG positions in the window (Inline graphic is the number of reads that cover position Inline graphic in individual Inline graphic), and Inline graphic is the deamination rate of individual Inline graphic. The likelihood of individual Inline graphic is

graphic file with name M00066.gif

and the log-likelihood

graphic file with name M00067.gif

where Inline graphic is a term that is independent of Inline graphic. The total log-likelihood of all individuals is

graphic file with name M00070.gif

where Inline graphic is a term independent of Inline graphic. The score function with respect to Inline graphic is:

graphic file with name M00074.gif

where

graphic file with name M00075.gif

To obtain a maximum likelihood estimator of Inline graphic we equate the score function to zero and solve using Newton–Raphson. For this,

graphic file with name M00077.gif

Given Inline graphic is the approximate solution at iteration Inline graphic, the solution at iteration Inline graphic is given by

graphic file with name M00081.gif

In order to get an initial guess, we may obtain an approximated solution using

graphic file with name M00082.gif

Hence,

graphic file with name M00083.gif

This simplifies into

graphic file with name M00084.gif

Further approximating by neglecting terms of the order of Inline graphic, we get

graphic file with name M00086.gif

This can be written as

graphic file with name M00087.gif

where

graphic file with name M00088.gif

The approximate solution is therefore

graphic file with name M00089.gif

The Fisher information for estimating Inline graphic is equal to the expectation of the negative second derivative of the log-likelihood function,

graphic file with name M00091.gif

The empirical Fisher information is the evaluation of this negative second derivative at the estimated value of the parameter, and may serve as an approximation of the Fisher information,

graphic file with name M00092.gif

where Inline graphic is the estimator obtained from the iterations of the Newton–Raphson algorithm. The empirical Fisher information is computed as part of the implementation of the algorithm. Finally, we may approximate the variance of the estimator via the inverse of the empirical Fisher information:

graphic file with name M00094.gif

Calculating pooled sample deamination rate

A reference modern bone sample was used for this calculation, as done in Gokhman et al. (3). Only CpG positions in which methylation in the reference was one were included.

Methylation in CpG islands and housekeeping genes promoters

We used the cpgIslandExt database for CpG islands (30,31), and a list of housekeeping genes (32), from which we created a list of promoter intervals (defined from 5000 upstream to 1000 downstream of the transcription start site).

Predicting population effective coverage

We created a simple linear regression model that uses the number of pooled samples and their mean number of autosomal SNP hits as explaining variables, to predict the effective coverage of the pooled cohort. To create the model, we used data from the seven cohorts examined in this study. The resulting equation, obtained with Inline graphic and Inline graphic, is

graphic file with name M00097.gif

A reasonable reconstruction can be obtained if the predicted effective coverage is above a threshold of 12 (Figure 2B).

Figure 2.

Figure 2.

(A) C→T ratio in the pooled samples versus DNA methylation measured in modern human bone. CpGs were binned according to their measured methylation level in modern human bone. For each bin, an average C→T ratio was computed. Ust ‘Ishim (shotgun sequencing 42×) and the Altai Neanderthal (shotgun sequencing 52×) were added for comparison. ‘r’ denotes the Pearson correlation. (B) Mean methylation as a function of the effective coverage of the sample. Horizontal line shows mean methylation in Bone2.

Computing correlations between DNA methylation maps

We computed correlations between DNA methylation maps as was suggested by Loyfer et al. (33) To this end, the genome was segmented to regions with roughly constant methylation and then correlations were computed based on the average methylation in these regions.

DMRs filtration model

To assess the level of false DMRs between two populations, we generated 10 permutations over the population labels. DMRs detected for the permutated data are considered false discoveries. We scanned different values of Inline graphic and min_CpGs, and calculated the FDR per set of parameters by dividing the average number of DMRs found in the permutations by the observed number (Supplementary Table S12). We chose for further analysis parameter sets for which FDR < 0.05.

Results

Datasets

To test our approach to pool together different individuals to reconstruct DNA methylation maps of ancient populations, we used seven ancient cohorts, each consisting of at least 20 individuals sequenced using the 1240K hybridization capture. Each cohort represents a relatively homogeneous population, with the individuals coming from similar times and geographic locations. To avoid intra-population variability due to the tissue specificity of DNA methylation, the cohorts predominantly include individuals whose DNA was extracted from petrous bones. Individuals with <30 000 autosomal SNP hits were filtered out, as previous work showed that lower coverage might compromise inference reliability (22).

The seven cohorts are Baqah_BA (21 samples) from the Middle-to-Late Bronze Age in Baq’ah, Jordan (22); Cuba_ARC_CER (35 samples) from the Archaic and Ceramic Ages in Cuba (20,21); British_N (47 samples) from Neolithic England, Scotland and Wales (19); British_BA (51 samples) from Early, Middle and Late Bronze Age England, Scotland and Wales (19); Eu_BellBeakers (54 samples) from the Bell Beakers culture of central Europe (19); Mongolia_BA_IA (63 samples) from Bronze and Iron Ages Mongolia (34); and the Caribbean_CER population (142 samples) from the Ceramic Age in the Caribbeans (20) (Table 1, Supplementary Table S1).

Table 1.

Basic information on the cohorts used in the current study

Population Number of individuals Period Region Average autosomal SNP hits Average total CpG coverage Effective Coverage; average total CpG coverage (excluding non covered CpGs) Petrous to total ratio Female to total ratio References
Caribbean_CER 142 100–1500 CE Caribbean 680 901 33.74 35.69 1.00 0.45 20
Mongolia_BA_IA 63 3000 BCE-100 CE Mongolia 788 240 22.37 24.16 1.00 0.35 34
Eu_BellBeakers 54 2500–2000 BCE Central Europe 671 619 12.02 14.29 0.98 0.30 19
British_N 47 4000–2500 BCE UK 632 179 10.25 12.79 0.91 0.36 19
British_BA 51 2300–800 BCE UK 605 833 9.47 12.00 0.96 0.49 19
Cuba_ARC_CER 35 1000 BCE - 1500 CE Caribbean 656 432 6.80 9.44 1.00 0.43 20,21
Baqah_BA 21 1550–1150 BCE Southern Levant 685 503 3.56 6.99 1.00 0.48 22

Generating population-level ancient methylation maps

We generate a population methylation map by pooling together individuals from that population. To carry out pooling, we tested two approaches. The first, denoted here as the naïve approach, simply sums up the counts of cytosines and thymines in each CpG position from all samples. This approach is computationally fast but ignores potential differences in deamination rate across the pooled individuals. The second, denoted here as the advanced approach, is computationally more involved and does account for differences in deamination rates between individuals (see Methods). Comparing the two methods, we found that the naïve approach performs better, or at least equally as well (Supplementary Text T1). Therefore, hereinafter we will only present results using the naïve approach.

Following pooling, filtration proceeds similarly to the original reconstruction algorithm (1), with the main step being the removal of CpG positions with a CInline graphicT ratio above a threshold of 0.25, as they likely result from mutations and not deamination. Unlike the original reconstruction algorithm, the mapping from CInline graphicT ratios to methylation was performed by matching the histogram of the CInline graphicT ratios to the histogram of DNA methylation measured in modern human bone using whole-genome bisulfite sequencing (WGBS).

Effective coverage of a population

We first wished to examine whether pooling produces CInline graphicT counts comparable to those obtained in high-coverage shotgun sequences. To this end, we looked at the fraction of CpG positions whose coverage is above a certain threshold (Figure 1A). Given the similar average number of autosomal SNP hits across the cohorts (Table 1), we expect that for a fixed threshold this fraction would increase with the number of samples in the cohort. Indeed, in Caribbean_CER, the largest cohort, 92.1% of the CpG positions are covered by at least one read, and 31.2% are covered by >15 reads. In contrast, in Baqah_BA, the smallest cohort, only 50.6% of the CpG positions are covered by at least one read, and 5.1% are covered by more than 15 reads.

Figure 1.

Figure 1.

(A) Percentage of CpG sites with coverage larger than a certain threshold, for each of the pooled cohorts. (B) Percentage of windows (genomic regions spanning 31 consecutive CpG positions) with coverage larger than a certain threshold, for each of the pooled cohorts. (C) Average coverage as a function of the distance to the nearest target SNP.

For shotgun sequencing, DNA methylation in a particular CpG position is reconstructed using information from a window comprising neighboring CpG positions (1). Assuming a shotgun sample with low coverage of 15×, the use of window size of 31 CpG positions—the maximum value allowed in our reconstruction algorithm – achieves an average total coverage in the window of 465×. We therefore wanted to assess to what extent pooling achieves similar coverage levels, by computing the average coverage in windows of 31 consecutive CpG positions (Figure 1B). For the two largest cohorts, Caribbean_CER and Mongolia_BA_IA, 68% and 58% of CpG positions, respectively, are amenable to DNA methylation reconstruction using a window of size 31. This number drops to 2% for Baqah_BA (Supplementary Table S2). In comparison, the standard methylation arrays of 450K and EPIC (850K) provide information on 1.71% and 3.06% of CpG positions, respectively (35–37). Thus, in all cohorts, except Baqah_BA the number of CpG positions whose coverage allows for reliable reconstruction of aDNA methylation is larger than in these popular commercial arrays. A recent study claims that even shotgun coverage as low as 10x might suffice to reconstruct DNA methylation, using windows of up to 50 consecutive CpG positions (38). Using this estimate, we get that 75% of Caribbean_CER windows are amendable to reconstruction, and 1% of Baqah_BA (Supplementary Table S2). The two largest cohorts are the only ones for which the computed optimal window size is lower than 31. The optimal window sizes are 21 for Cermaic_CER and 29 for Mongolia_BA_IA, making 63% and 52% of the windows amenable to reconstruction, respectively (Supplementary Table S2).

To capture the coverage differences between cohorts, we define the effective coverage as the average coverage of all CpG positions in the genome that are covered at least once (Table 1). Effective coverage should increase with the number of samples in a population, and with the average autosomal SNP hits. In the current study, effective coverage changes monotonically with the number of samples, from 35.69 in Caribbean_CER to 6.99 in Baqah_BA. For comparison, we examined two high-coverage shotgun sequenced samples, Ust ‘Ishim (39) and the Altai Neanderthal (10), for which we computed effective coverages of 29.73 and 41.5, respectively.

As hybridization capture targets specific SNPs, we expect to see the coverage drops when we move away from the target site. To test this, we looked at the mean coverage as a function of the distance to the nearest target SNP in chromosome 1 (Figure 1C). Indeed, we observed a sharply decreasing curve, reaching a low plateau for distances above 100 bp, consistent with the fact that the hybridization probes target a sequence of roughly a hundred bases around each target site. The approximately constant coverage at distances higher than a hundred base pairs represents information coming from off-target sequences.

Reconstructing DNA methylation of ancient populations

C→T ratio

The CInline graphicT ratio serves as a proxy for premortem DNA methylation, and has been used for the reconstruction of methylation in ancient samples sequenced using shotgun sequencing. We wanted to confirm that the CInline graphicT ratio computed on pooled samples may also serve as a proxy for aDNA methylation. To this end, we divided the CpG positions in the genome into ten bins, based on their methylation level in modern human bone (Bone2 (3)). Then, we computed the average CInline graphicT ratio across the positions in a bin, for each of the cohorts. As a reference, we repeated these computations for Ust ‘Ishim and the Altai Neanderthal. With the exception of the smallest cohort, Baqah_BA, the CInline graphicT ratio was monotonically increasing as a function of modern DNA methylation, attesting to its appropriateness to serve as the basis for reconstructing premortem DNA methylation (Figure 2A, Supplementary Text 2). The nonlinearity between the Inline graphic ratio and the reference methylation is accounted for by a histogram matching procedure (see Methods). Moreover, again with the exception of Baqah_BA, the correlation between the CInline graphicT ratio and the binned methylation was in the range 0.86–0.88, similar to the correlation of 0.92 for the Altai Neanderthal and 0.93 for Ust ‘Ishim. Note that the range of CInline graphicT values varies across samples, due to variations in deamination rates and effective coverages. The outlying characteristics of Baqah_BA are a result of the small size of this cohort, suggesting that a cohort with an effective coverage as low as 6.99 provides insufficient information on premortem DNA methylation. In the next section, we will determine a more accurate bound on the minimum effective coverage required for a reliable reconstruction of DNA methylation in pooled cohorts.

The reconstruction algorithm

As we did for shotgun sequenced samples, we take advantage of the high correlation in methylation levels between neighboring CpGs (40) to reduce the standard error of our estimated methylation values. To this end, we define a window of a fixed number of consecutive CpG positions and estimate the DNA methylation in a CpG position using information from all CpG positions in the window centered around it. We set the size of the window adaptively, based on the effective coverage of the cohort, but bounding it from above by 31 CpG positions (see Materials and methods). As mentioned above, all cohorts reached this upper bound, except for the two with the highest effective coverage – Caribbean_CER with a window size of 21, and Mongolia_BA_IA with a window size of 29. The number of informative CpGs within a window tends to be smaller in the low-quality samples (Supplementary Figure S1). Premortem DNA methylation was computed from the CInline graphicT ratio using histogram-matching (29) to obtain a reconstructed DNA methylation histogram as similar as possible to that of a reference histogram in modern bone (Supplementary Figure S2).

About 75% of CpG positions are methylated throughout the mammalian genome (41). Specifically, the average methylation in our reference modern human bone is 74.15%. To test how the different cohorts replicate this characteristic value, we measured the average methylation per cohort (Figure 2B). All populations showed average methylation between 70% and 75%, with the exception of Baqah_BA and Cuba_ARC_CER, which have the smallest number of samples and the lowest effective coverage. More generally, the average DNA methylation of the population is a monotonic increasing function of the effective coverage, which allowed us to set an effective coverage of 12 as a minimum threshold for reconstructing population DNA methylation (Figure 2B).

To obtain a quick way to evaluate whether the effective coverage of a cohort is sufficient to allow for the reconstruction of aDNA methylation, we created a linear model that predicts the effective coverage of a population based on the number of individuals in the cohort and the average number of autosomal SNP hits (Supplementary Table S3), two simple and readily accessible variables. The linear model has Inline graphic and Inline graphic (see Materials and methods).

To further validate the accuracy of our reconstructed DNA methylation, we conducted four analyses to assess the similarity between our reconstructed DNA methylation maps and the modern human bone map. First, we measured the mean methylation in the promoters of housekeeping genes and within CpG islands (CGIs), two genomic regions that are known to be strongly hypomethylated (42). We computed the average DNA methylation in these regions in our cohorts, as well as in a modern bone sample and the high-coverage shotgun sample Ust'-Ishim (Figure 3A). As expected, the DNA methylation within CpG islands and housekeeping gene promoters is significantly lower than the genomic average in all populations (Supplementary Table S4), with Caribbean_CER showing trends that most resemble modern DNA methylation. Similarly, we measured the mean methylation within Alu elements, which are transposable elements known for their high density of CpG positions and high methylation (43). As expected, the average DNA methylation levels withing these elements are high, consistent with the levels observed across the entire genome (Figure 3A).

Figure 3.

Figure 3.

(A) DNA methylation in CGIs, housekeeping genes promoters, ALU elements and across the genome. (B) Pearson correlations between the reconstructed DNA methylation of Caribbean_CER and Mongolia_BA_IA, and modern DNA methylation in human tissues and cell types. The ancient populations cluster with osteoblast. Correlations were computed based on genome segmentation, see Materials and methods. (C) Heatmap of Bone2 methylation versus Caribbean_CER methylation with 50 bins per axis. Match between these two methylation maps is manifested by the hot regions around high and low methylation levels. (D) Ribbon plots showing measured and reconstructed DNA methylation patterns in two random genomic regions. Methylation is color coded from low methylation in green to high methylation in red.

Next, we computed correlations between the reconstructed pooled DNA methylation maps of the cohorts and the measured maps in several human tissues and cell types, including osteoblasts (33). As expected, given the skeletal origin of aDNA, our populations showed the greatest resemblance to osteoblast (Figure 3B). We then further demonstrated the match between the reconstructed pooled methylation and bone methylation by generating heatmaps that compare the histograms of Bone2 and the reconstructed cohort methylation (Figure 3C). Finally, we produced ribbon plots for random genomic segments, offering a visual testament of the high quality of the reconstructed methylation, particularly for cohorts with sufficiently high effective coverage (Figure 3D).

DMRs detection

Methylation in known DMRs

In a previous study based on reconstructed DNA methylation in high-coverage shotgun samples, we identified 873 differentially methylated regions (DMRs) that are derived in modern humans, meaning that the methylation change occurred in the lineage leading to modern humans after they split from Neanderthals and Denisovans (archaic humans) (3). To further validate the quality of the reconstructed DNA methylation in the seven cohorts, we tested whether the average methylation in these DMRs in each of the cohorts is consistent with the methylation patterns observed in modern humans. For each DMR, we measured the distance between its average methylation in each of the cohorts and its average methylation in Ust'-Ishim. For comparison, we measured these distances also for the Altai Neanderthal. Reassuringly, the median of the distances was around 7% for all cohorts, whereas it was around 53% for the Altai Neanderthal (Figure 4).

Figure 4.

Figure 4.

Absolute value of the difference between the average methylation in the cohort and the average methylation in Ust ‘Ishim, for the 873 modern human-derived DMRs.

De-novo detection of DMRs

We next wished to use the cohort DNA methylation profiles to detect DMRs separating modern and archaic humans and compare them to DMRs detected using high-coverage shotgun samples (3). For this, we first ran the DMR detection algorithm for each of the cohorts, excluding Cuba_ARC_CER and Baqah_BA, which have the lowest effective coverage, against a group of archaic humans consisting of the Altai Neanderthal (10) and a Denisovan (11). We then reran the DMR detection algorithm comparing a group of high-coverage shotgun samples of anatomically modern humans to the same group of archaic humans. The group of anatomically modern humans included SF12 (12) (Sweden, 9033–8757 before present (BP), femur, 73.0×), I1 583 (3) (Turkey, ∼8500 BP, petrous, 32.3×), Ballito Bay A (13) (South Africa, 2000–1960 BP, petrous, 14.3×) and Eland Cave (13) (South Africa, 510–450 BP, tibia, 11.8×). Based on the high quality of the shotgun samples, we treated the DMRs detected using them as a reference set of DMRs, to which we compared the DMRs detected using the pooled cohorts.

In all comparisons, DMR detection was carried out using the same set of parameters, including a methylation difference threshold (Inline graphic) of 0.4 and a minimum number of CpG positions per DMR of 50 (3). We divide the DMRs into three categories: Shared-DMRs are DMRs detected in both analyses; cohort-DMRs are DMRs detected only using the pooled cohorts and are thus potentially false positives; and shotgun-DMRs are DMRs detected only using the shotgun samples and are thus potentially false negatives. Using these approximations, we computed for each cohort the recall and precision, as well as the F1-score that combines both measures (Supplementary Table S5). The performance of the DMR detection algorithm highly depends on the quality of the samples, with Caribbean_CER and Mongolia_BA_IA having the highest F1-score. The lower-quality cohorts exhibit lower recall and precision due to a large number of false positives and false negatives, likely due to higher levels of noise in the methylation reconstruction process. Repeating the DMR detection using all five cohorts pooled together as representing modern humans, we obtained a mild improvement in recall, precision and F1-score (Supplementary Table S5).

To reduce the number of false positives, we sought to further filter the DMRs detected using the pooled cohorts. To this end, we characterized each DMR by six features: its total length in bases, its total number of CpG positions, the average methylation difference between the compared groups (Inline graphic), its maximum Inline graphic, its mean coverage and its genomic context (whether it is within a CpG island, gene promoter, or gene body). For all cohorts and for each feature but Inline graphic, the shared-DMRs differed significantly from the cohort-DMRs, suggesting that these features might be useful in discriminating shared-DMRs from cohort-DMRs (Supplementary Table S6). Indeed, we were able to develop a classifier that filters out DMRs that are likely cohort-DMRs, and by improving precision on the expense of recall, we were able to considerably reduce false detection of DMRs (Supplementary Text T3, Supplementary Tables S7 and S8).

Detecting DMRs between populations

Finally, we wanted to apply the DMR detection pipeline developed above to identify DMRs separating the different cohorts. We ran the DMR detection algorithm between pairs of populations, using Inline graphic and a minimum of 40 CpG positions within a DMR. These parameters are less strict than in the previous section, to allow the detection of subtler DNA methylation changes (Supplementary Tables S9 and S10; Supplementary Text T4).

When comparing two populations, we reasoned that a better way to filter falsely detected DMRs is to carry out permutations of the population labels, and choosing parameter values that guarantee a false discovery rate less than a certain threshold (see Materials and methods, Supplementary Table S11). Comparing Mongolia_BA_IA to European populations, we detected 13 DMRs, which are present in the promoter or gene body of 13 genes. One of the genes is MEIS1, which is a major regulator of limb development, and is associated with myeloid leukemia. It was shown to be downregulated by promoter methylation in leukemia (44), and was also found to have gone through methylation changes separating modern from archaic humans (1). Another example is RUNX2, which plays a role in osteoblast differentiation (45). Comparing Mongolia_BA_IA to Caribbean_CER, we detected two DMRs, and another two were detected when comparing Caribbean_CER to European populations. Finally, comparing European populations to populations of Asian origin (Mongolia_BA_IA and Caribbean_CER), we found eight DMRs, including one inside the promoter region of IFITM3, which encodes an antiviral protein, and which shows lower methylation in its promoter in EV71-HFMD (Hand, foot and mouth disease induced by enterovirus 71) patients (46). Notably, polymorphism in the SNP rs12252 within the gene was found to be associated with susceptibility to COVID-19, and its mutant allele was found to be highly frequent in East Asian patients and rare in European patients (47).

Variation in bone type composition and sex ratio between cohorts have the potential to introduce confounding factors affecting the detection of DMRs between populations. Bone type composition is not a source of concern in our data, as the vast majority of individuals were sampled from petrous bones (Table 1). However, sex ratio differs between cohorts, with three showing a bias toward males (Table 1).

Discussion

Our study has demonstrated that it is possible to reconstruct aDNA methylation patterns even when high-coverage samples are not available. Our work is based on the premise that variability in DNA methylation is lower within, compared to between, populations. We therefore reasoned that we might reconstruct DNA methylation profile characteristic of a past population by pooling together the DNA sequences of a sufficient number of low-coverage ancient individuals from that population. Indeed, we were able to generate high-quality methylation maps of past populations that were later used to detect differential methylation. As the vast majority of ancient samples are sequenced to low-coverage, our study expands the scope of aDNA methylation studies, allowing them to consider populations that have so far not been used.

The ability to reconstruct DNA methylation of past populations, and the constantly increasing density of ancient samples from diverse locations and times, would allow for spatiotemporal mapping of changes in DNA methylation. Combined with the high responsiveness of DNA methylation to environmental cues, such a spatiotemporal mapping would enable the use of aDNA methylation to study past environments (25).

Here, we analyzed seven populations, whose members were sequenced using the popular 1240K in-situ hybridization capture. However, the methodology we have developed is completely general and should work also on samples sequenced with different types of hybridization capture sets, as well as on low-coverage shotgun samples.

As expected, a DNA methylation map reconstructed using many low-quality samples inherently contains more noise compared to a map reconstructed from a high-coverage sample. We concluded that to achieve a reliable reconstruction, the effective coverage of the cohort should be at least 12, but preferably exceeding 15. We presented a straightforward regression model that enables the prediction of the effective coverage of a cohort prior to downloading the data, based solely on the number of samples and the mean number of SNP hits. This model has the potential for refinement with the future inclusion of additional populations.

The increase in noise is manifested by an increased rate of falsely detected DMRs. In the current study, we used two setups to test the quality of DMR detection using population DNA methylation maps, and considered means to reduce the number of falsely detected DMRs in both. First, we tried to compare a population with high-coverage individuals. In such a setup label permutation is infeasible, and we have therefore devised a machine-learning algorithm that filters out DMRs that are likely false positives. Whereas this allowed for a considerable improvement in the rate of falsely detected DMRs, this rate may still be high in some applications, and we generally do not recommend using such a setup unless the population is characterized by very high effective coverage. Second, we compared populations to each other, in an attempt to identify DMRs separating the different populations. Here, label permutation can be used, and DMR detection parameters can be adjusted to achieve a low false discovery rate. We highly recommend using population DNA methylation maps in such a setup, and the resulting small number of DMRs testify for the efficiency by which the number of falsely detected DMRs is reduced. For example, contrasting European and Asian populations revealed a DMR within the promoter of IFITM3, potentially related to differential viral exposure of these populations, and reflecting different frequencies of IFITM3 allele in East Asians compared to Europeans.

Whereas the majority of population studies are based on petrous bones, high-coverage samples came from a variety of skeletal parts, including tibia and femur. This provides further advantage to detecting DMRs separating different populations, rather than DMRs separating populations from high-coverage samples, the latter being potentially affected by biases introduced by the use of different skeletal parts.

It is interesting to note that despite of the fact that naïve pooling does not take into account potential differences in deamination rate across individuals, it still performs better than the advance pooling technique (Supplementary Text T1). This suggests that differences in deamination rate are small compared to other sources of variance between the samples, especially in light of the fact that individuals from approximately the same place and time should have gone through similar deamination processes.

It is worth highlighting that in-situ hybridization capture sequencing, while designed to target specific genomic positions, exhibits notable presence of off-target reads. In fact, after filtering out reads with low mapping quality, approximately 65% of the covered CpG positions originate from off-target reads (Supplementary Figure S3).

Supplementary Material

gkad1232_Supplemental_Files

Acknowledgements

We thank David Reich for useful discussions. L.C. is the Snyder Granadar chair in Genetics. E.M. is the Arthur Gutterman Family chair for Stem Cell Research.

Contributor Information

Arielle Barouch, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel; School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Yoav Mathov, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel; Edmond and Lily Safra Center for Brain Sciences (ELSC), The Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Eran Meshorer, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel; Edmond and Lily Safra Center for Brain Sciences (ELSC), The Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Benjamin Yakir, Department of Statistics and Data Science, The Hebrew University of Jerusalem, Jerusalem 9190500, Israel.

Liran Carmel, Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Data availability

No new data were generated or analyzed in support of this research.

Supplementary data

Supplementary Data are available at NAR Online.

Funding

Israel Science Foundation [ISF grant 2436/22 to L.C. and B.Y.]; John Templeton Foundation [grant #61739 to L.C. and E.M.]; Israel Ministry of Innovation, Science & Technology [grant #1001584586 to L.C. and E.M]. Funding for open access charge: Israel Science Foundation [ISF grant 2436/22 to L.C. and B.Y.]; John Templeton Foundation [grant #61739 to L.C. and E.M.]; Israel Ministry of Innovation, Science & Technology [grant #1001584586 to L.C. and E.M.].

Conflict of interest statement. None declared.

References

  • 1. Gokhman  D., Lavi  E., Prüfer  K., Fraga  M.F., Riancho  J.A., Kelso  J., Pääbo  S., Meshorer  E., Carmel  L.  Reconstructing the DNA methylation maps of the Neandertal and the Denisovan. Science. 2014; 344:523–527. [DOI] [PubMed] [Google Scholar]
  • 2. Pedersen  J.S., Valen  E., Velazquez  A.M.V., Parker  B.J., Rasmussen  M., Lindgreen  S., Lilje  B., Tobin  D.J., Kelly  T.K., Vang  S.  et al.  Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome. Genome Res.  2014; 24:454–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Gokhman  D., Nissim-Rafinia  M., Agranat-Tamir  L., Housman  G., García-Pérez  R., Lizano  E., Cheronet  O., Mallick  S., Nieves-Colón  M.A., Li  H.  et al.  Differential DNA methylation of vocal and facial anatomy genes in modern humans. Nat. Commun.  2020; 11:1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Mathov  Y., Batyrev  D., Meshorer  E., Carmel  L.  Harnessing epigenetics to study human evolution. Curr. Opin. Genet. Dev.  2020; 62:23–29. [DOI] [PubMed] [Google Scholar]
  • 5. Gokhman  D., Meshorer  E., Carmel  L.  Epigenetics: it's getting old. Past meets future in paleoepigenetics. Trends Ecol. Evol.  2016; 31:290–300. [DOI] [PubMed] [Google Scholar]
  • 6. Gokhman  D., Mishol  N., de Manuel  M., de Juan  D., Shuqrun  J., Meshorer  E., Marques-Bonet  T., Rak  Y., Carmel  L.  Reconstructing Denisovan anatomy using DNA methylation maps. Cell. 2019; 179:180–192. [DOI] [PubMed] [Google Scholar]
  • 7. Briggs  A.W., Stenzel  U., Meyer  M., Krause  J., Kircher  M., Pääbo  S.  Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic Acids Res.  2010; 38:e87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Briggs  A.W., Stenzel  U., Johnson  P.L.F., Green  R.E., Kelso  J., Prüfer  K., Meyer  M., Krause  J., Ronan  M.T., Lachmann  M.  et al.  Patterns of damage in genomic DNA sequences from a Neandertal. Proc. Natl. Acad. Sci. U.S.A.  2007; 104:14616–14621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Rohland  N., Harney  E., Mallick  S., Nordenfelt  S., Reich  D  Partial uracil–DNA–glycosylase treatment for screening of ancient DNA. Philos. Trans. Roy. Soc. B: Biol. Sci.  2015; 370:20130624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Prüfer  K., Racimo  F., Patterson  N., Jay  F., Sankararaman  S., Sawyer  S., Heinze  A., Renaud  G., Sudmant  P.H., de Filippo  C.  et al.  The complete genome sequence of a Neanderthal from the Altai Mountains. Nature. 2014; 505:43–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Meyer  M., Kircher  M., Gansauge  M.-T., Li  H., Racimo  F., Mallick  S., Schraiber  J.G., Jay  F., Prüfer  K., de Filippo  C.  et al.  A high-coverage genome sequence from an archaic Denisovan individual. Science. 2012; 338:222–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Günther  T., Malmström  H., Svensson  E.M., Omrak  A., Sánchez-Quinto  F., Kılınç  G.M., Krzewińska  M., Eriksson  G., Fraser  M., Edlund  H.  et al.  Population genomics of Mesolithic Scandinavia: investigating early postglacial migration routes and high-latitude adaptation. PLoS Biol.  2018; 16:e2003703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Schlebusch  C.M., Malmström  H., Günther  T., Sjödin  P., Coutinho  A., Edlund  H., Munters  A.R., Vicente  M., Steyn  M., Soodyall  H.  et al.  Southern African ancient genomes estimate modern human divergence to 350,000 to 260,000 years ago. Science. 2017; 358:652–655. [DOI] [PubMed] [Google Scholar]
  • 14. Haak  W., Lazaridis  I., Patterson  N., Rohland  N., Mallick  S., Llamas  B., Brandt  G., Nordenfelt  S., Harney  E., Stewardson  K.  et al.  Massive migration from the steppe was a source for Indo-European languages in Europe. Nature. 2015; 522:207–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Fu  Q., Hajdinjak  M., Moldovan  O.T., Constantin  S., Mallick  S., Skoglund  P., Patterson  N., Rohland  N., Lazaridis  I., Nickel  B.  et al.  An early modern human from Romania with a recent Neanderthal ancestor. Nature. 2015; 524:216–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Liu  Y., Mao  X., Krause  J., Fu  Q.  Insights into human history from the first decade of ancient human genomics. Science. 2021; 373:1479–1484. [DOI] [PubMed] [Google Scholar]
  • 17. Skoglund  P., Mathieson  I.  Ancient genomics of modern humans: the first decade. Annu. Rev. Genomics Hum. Genet.  2018; 19:381–404. [DOI] [PubMed] [Google Scholar]
  • 18. Le  M.K., Smith  O.S., Akbari  A., Harpak  A., Reich  D., Narasimhan  V.M.  1,000 ancient genomes uncover 10,000 years of natural selection in Europe. 2022; bioRxiv doi:26 August 2022, preprint: not peer reviewed 10.1101/2022.08.24.505188. [DOI]
  • 19. Olalde  I., Brace  S., Allentoft  M.E., Armit  I., Kristiansen  K., Booth  T., Rohland  N., Mallick  S., Szécsényi-Nagy  A., Mittnik  A.  et al.  The Beaker phenomenon and the genomic transformation of northwest Europe. Nature. 2018; 555:190–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Fernandes  D.M., Sirak  K.A., Ringbauer  H., Sedig  J., Rohland  N., Cheronet  O., Mah  M., Mallick  S., Olalde  I., Culleton  B.J.  et al.  A genetic history of the pre-contact Caribbean. Nature. 2021; 590:103–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Nägele  K., Posth  C., Iraeta Orbegozo  M., Chinique de Armas  Y., Hernández Godoy  S.T., González Herrera  U.M., Nieves-Colón  M.A., Sandoval-Velasco  M., Mylopotamitaki  D., Radzeviciute  R.  et al.  Genomic insights into the early peopling of the Caribbean. Science. 2020; 369:456–460. [DOI] [PubMed] [Google Scholar]
  • 22. Agranat-Tamir  L., Waldman  S., Martin  M.A.S., Gokhman  D., Mishol  N., Eshel  T., Cheronet  O., Rohland  N., Mallick  S., Adamski  N.  et al.  The Genomic History of the Bronze Age Southern Levant. Cell. 2020; 181:1146–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Fraser  H.B., Lam  L.L., Neumann  S.M., Kobor  M.S.  Population-specificity of human DNA methylation. Genome Biol.  2012; 13:R8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Carja  O., MacIsaac  J.L., Mah  S.M., Henn  B.M., Kobor  M.S., Feldman  M.W., Fraser  H.B.  Worldwide patterns of human epigenetic variation. Nat. Ecol. Evol.  2017; 1:1577–1583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Gokhman  D., Malul  A., Carmel  L.  Inferring Past Environments from Ancient Epigenomes. Mol. Biol. Evol.  2017; 34:2429–2438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Chatterjee  S., Ouidir  M., Tekola-Ayele  F.  Genetic and in utero environmental contributions to DNA methylation variation in placenta. Hum. Mol. Genet.  2021; 30:1968–1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Lim  I.Y., Lin  X., Karnani  N.  Implications of Genotype and Environment on Variation in DNA Methylation. Handbook of Nutrition, Diet, and Epigenetics. 2017; Cham: Springer International Publishing; 1–20. [Google Scholar]
  • 28. Lam  L.L., Emberly  E., Fraser  H.B., Neumann  S.M., Chen  E., Miller  G.E., Kobor  M.S.  Factors underlying variable DNA methylation in a human community cohort. Proc. Natl. Acad. Sci.  2012; 109:17253–17260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Gonzalez  R.C.W.R.E.  Digital Image Processing. 1992; 3rd edn. [Google Scholar]
  • 30. Gardiner-Garden  M., Frommer  M.  CpG islands in vertebrate genomes. J. Mol. Biol.  1987; 196:261–282. [DOI] [PubMed] [Google Scholar]
  • 31. Kent  W.J., Sugnet  C.W., Furey  T.S., Roskin  K.M., Pringle  T.H., Zahler  A.M., Haussler  D.  The Human Genome Browser at UCSC. Genome Res.  2002; 12:996–1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Eisenberg  E., Levanon  E.Y.  Human housekeeping genes, revisited. Trends Genet.  2013; 29:569–574. [DOI] [PubMed] [Google Scholar]
  • 33. Loyfer  N., Magenheim  J., Peretz  A., Cann  G., Bredno  J., Klochendler  A., Fox-Fisher  I., Shabi-Porat  S., Hecht  M., Pelet  T.  et al.  A DNA methylation atlas of normal human cell types. Nature. 2023; 613:355–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Wang  C.-C., Yeh  H.-Y., Popov  A.N., Zhang  H.-Q., Matsumura  H., Sirak  K., Cheronet  O., Kovalev  A., Rohland  N., Kim  A.M.  et al.  Genomic insights into the formation of human populations in East Asia. Nature. 2021; 591:413–419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Sandoval  J., Heyn  H., Moran  S., Serra-Musach  J., Pujana  M.A., Bibikova  M., Esteller  M.  Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome. Epigenetics. 2011; 6:692–702. [DOI] [PubMed] [Google Scholar]
  • 36. Moran  S., Arribas  C., Esteller  M.  Validation of a DNA methylation microarray for 850,000 CpG sites of the human genome enriched in enhancer sequences. Epigenomics. 2016; 8:389–399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Pidsley  R., Zotenko  E., Peters  T.J., Lawrence  M.G., Risbridger  G.P., Molloy  P., van Djik  S., Muhlhausler  B., Stirzaker  C., Clark  S.J.  Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol.  2016; 17:208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Niiranen  L., Leciej  D., Edlund  H., Bernhardsson  C., Fraser  M., Quinto  F.S., Herzig  K.-H., Jakobsson  M., Walkowiak  J., Thalmann  O.  Epigenomic modifications in modern and ancient genomes. Genes (Basel). 2022; 13:178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Fu  Q., Li  H., Moorjani  P., Jay  F., Slepchenko  S.M., Bondarev  A.A., Johnson  P.L.F., Aximu-Petri  A., Prüfer  K., de Filippo  C.  et al.  Genome sequence of a 45,000-year-old modern human from western Siberia. Nature. 2014; 514:445–449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Affinito  O., Palumbo  D., Fierro  A., Cuomo  M., de Riso  G., Monticelli  A., Miele  G., Chiariotti  L., Cocozza  S.  Nucleotide distance influences co-methylation between nearby CpG sites. Genomics. 2020; 112:144–150. [DOI] [PubMed] [Google Scholar]
  • 41. Tost  J.  DNA methylation: an introduction to the biology and the disease-associated changes of a promising biomarker. Mol. Biotechnol.  2010; 44:71–81. [DOI] [PubMed] [Google Scholar]
  • 42. Jeziorska  D.M., Murray  R.J.S., de Gobbi  M., Gaentzsch  R., Garrick  D., Ayyub  H., Chen  T., Li  E., Telenius  J., Lynch  M.  et al.  DNA methylation of intragenic CpG islands depends on their transcriptional activity during differentiation and disease. Proc. Natl. Acad. Sci. U.S.A.  2017; 114:E7526–E7535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Deininger  P.  Alu elements: know the SINEs. Genome Biol.  2011; 12:236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Lasa  A., Carnicer  M.J., Aventín  A., Estivill  C., Brunet  S., Sierra  J., Nomdedéu  J.F.  MEIS 1 expression is downregulated through promoter hypermethylation in AML1-ETO acute myeloid leukemias. Leukemia. 2004; 18:1231–1237. [DOI] [PubMed] [Google Scholar]
  • 45. Schroeder  T.M., Jensen  E.D., Westendorf  J.J.  Runx2: a master organizer of gene transcription in developing and maturing osteoblasts. Birth Defects Res. C Embryo Today. 2005; 75:213–225. [DOI] [PubMed] [Google Scholar]
  • 46. Li  M., Li  Y.-P., Deng  H.-L., Wang  M.-Q., Chen  Y., Zhang  Y.-F., Wang  J., Dang  S.-S.  DNA methylation and SNP in IFITM3 are correlated with hand, foot and mouth disease caused by enterovirus 71. Int. J. Infect. Dis.  2021; 105:199–208. [DOI] [PubMed] [Google Scholar]
  • 47. Li  Y., Wei  L., He  L., Sun  J., Liu  N.  Interferon-induced transmembrane protein 3 gene polymorphisms are associated with COVID-19 susceptibility and severity: a meta-analysis. J. Infect.  2022; 84:825–833. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkad1232_Supplemental_Files

Data Availability Statement

No new data were generated or analyzed in support of this research.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES