Skip to main content
eLife logoLink to eLife
. 2022 Jul 5;11:e77898. doi: 10.7554/eLife.77898

Genome-wide detection of imprinted differentially methylated regions using nanopore sequencing

Vahid Akbari 1,2, Jean-Michel Garant 1, Kieran O'Neill 1, Pawan Pandoh 1, Richard Moore 1, Marco A Marra 1,2, Martin Hirst 1,3, Steven JM Jones 1,2,
Editors: Jessica K Tyler4, Jessica K Tyler5
PMCID: PMC9255983  PMID: 35787786

Abstract

Imprinting is a critical part of normal embryonic development in mammals, controlled by defined parent-of-origin (PofO) differentially methylated regions (DMRs) known as imprinting control regions. Direct nanopore sequencing of DNA provides a means to detect allelic methylation and to overcome the drawbacks of methylation array and short-read technologies. Here, we used publicly available nanopore sequencing data for 12 standard B-lymphocyte cell lines to acquire the genome-wide mapping of imprinted intervals in humans. Using the sequencing data, we were able to phase 95% of the human methylome and detect 94% of the previously well-characterized, imprinted DMRs. In addition, we found 42 novel imprinted DMRs (16 germline and 26 somatic), which were confirmed using whole-genome bisulfite sequencing (WGBS) data. Analysis of WGBS data in mouse (Mus musculus), rhesus monkey (Macaca mulatta), and chimpanzee (Pan troglodytes) suggested that 17 of these imprinted DMRs are conserved. Some of the novel imprinted intervals are within or close to imprinted genes without a known DMR. We also detected subtle parental methylation bias, spanning several kilobases at seven known imprinted clusters. At these blocks, hypermethylation occurs at the gene body of expressed allele(s) with mutually exclusive H3K36me3 and H3K27me3 allelic histone marks. These results expand upon our current knowledge of imprinting and the potential of nanopore sequencing to identify imprinting regions using only parent-offspring trios, as opposed to the large multi-generational pedigrees that have previously been required.

Research organism: Human

Introduction

The addition of a methyl group to the fifth carbon of cytidine is the most prevalent and stable epigenetic modification of human DNA (Laurent et al., 2010). DNA methylation is involved in gene regulation and influences a vast array of biological mechanisms, including embryonic development and cell fate, genome imprinting, X-chromosome inactivation, and transposon silencing (Moore et al., 2013; Smith and Meissner, 2013). In mammals, there are two copies or alleles of a gene, one inherited from each parent. Most gene transcripts are expressed from both alleles. However, a subset of genes are only expressed from a single allele; this allele can be selected either randomly, as seen in X-chromosome inactivation in females, or based upon the parent-of-origin (PofO), referred to as imprinting (Chess, 2013; Khamlichi and Feil, 2018).

In imprinting, mono-allelic expression of a gene or cluster of genes is controlled by a cis-acting imprinting control region (ICR) (Bartolomei and Ferguson-Smith, 2011). The main mechanism by which this occurs is PofO-defined differential methylation at ICRs, also known as imprinted differentially methylated regions (DMRs) (Bartolomei and Ferguson-Smith, 2011; Maupetit-Méhouas et al., 2016). Imprinted DMRs are classified as germline (primary) or somatic (secondary), hereinafter referred to as gDMR and sDMR. gDMRs are established during the first wave of methylation reprogramming in germ cell development and escape the second methylation reprogramming after fertilization (Zink et al., 2018). sDMRs are established de novo after fertilization during somatic development, usually under the control of a nearby gDMR (Zink et al., 2018). Imprinted clusters of genes may span up to ~4 Mb, by acting through a CCCTC-binding factor (CTCF)-binding site or by allelic expression of a long non-coding RNA (Bartolomei and Ferguson-Smith, 2011; da Rocha and Gendrel, 2019). By contrast, individually imprinted genes are typically regulated by PofO-derived differential methylation at the gene promoter (Bartolomei and Ferguson-Smith, 2011).

Imprinting is implicated in various genetic disorders, either from aberrations in imprinted methylation or from deleterious variants affecting the ICR and imprinted genes. Aberrant imprinted methylation is also detected in several human cancers (Goovaerts et al., 2018; Jelinic and Shaw, 2007; Tomizawa and Sasaki, 2012). Thus, accurate mapping and characterization of imprinting in humans is key to the treatment and actionability of genetic disorders, and to personalized oncogenomonics.

To detect imprinted methylation, accurate assignment of methylation data to paternal and maternal alleles is required. Achieving this with traditional bisulfite sequencing or arrays is challenging. Several studies have used samples with large karyotypic abnormalities, such as uniparental disomies, teratomas, and hydatidiform moles, to infer regions of imprinting (Court et al., 2014; Hernandez Mora et al., 2018; Joshi et al., 2016). This approach relies not only on rare structural variants, but also on the assumption that both normal methylation and the imprinted state remain intact in spite of substantial genomic aberrations. A study by Zink et al., 2018, leveraged a genotyped, multi-generation pedigree spanning nearly half the population of Iceland (n=150,000), in combination with whole-genome oxidative bisulfite sequencing, to phase methylation and infer PofO (Zink et al., 2018). However, despite being able to phase nearly every SNP in that cohort, they were only able to phase 84% of the human autosomal methylome in over 200 samples due to the short length of reads. Furthermore, the study was based on a single, genetically isolated population, which may not be representative of the wider human population. A comprehensive mapping of imprinted methylation using a technology more suited to phasing reads, based on individuals more representative of the human population, could greatly advance our understanding of imprinting, with direct benefits for human health.

Previously, we have shown that nanopore sequencing can detect allelic methylation in a single sample and accurately determine PofO using only trio data. We also previously developed the software NanoMethPhase for this purpose (Akbari et al., 2021). Here, we applied NanoMethPhase to public nanopore data from a diverse set of 12 lymphoblastoid cell lines (LCLs) from the 1000 Genomes Project (1KGP) and Genome in a Bottle (GIAB) to investigate genome-wide allele-specific methylation (ASM) and detect novel imprinted DMRs (Figure 1A; Auton et al., 2015; De Coster et al., 2019; Jain et al., 2018; Shafin et al., 2020; Zook et al., 2016; Zook et al., 2019). Using trio data from GIAB and 1KGP for these cell lines, we phased nanopore long reads to their PofO and inferred allelic methylation (Akbari et al., 2021; Auton et al., 2015; Zook et al., 2019). We were able to detect haplotype and methylation status for 26.5 million autosomal CpGs comprising 95% of the human autosomal methylome (GRCh38 main chromosomes). We further used public whole-genome bisulfite sequencing (WGBS) data to confirm the presence of the detected DMRs in other tissues and to classify the novel DMRs as germline or somatic. We captured 94% of the well-characterized DMRs and detected 42 novel DMRs (16 gDMRs and 26 sDMRs). Of these novel DMRs, 40.5% show evidence of conservation. We also detected seven blocks of PofO methylation bias at seven imprinted clusters with mutual exclusive allelic H3K36me3 and H3K27me3 histone marks. Collectively, our results extend the set of known imprinted intervals in humans and demonstrate a major contribution in our ability to characterize imprinting by ASM, brought about by the capabilities of long-read nanopore sequencing.

Figure 1. Detection of allelic methylation using nanopore sequencing.

(a) Flowchart of the study representing all the analysis steps. (b) Pearson correlation matrix of the nanopore CpG methylation frequencies for the 12 cell lines and NA12878 whole-genome bisulfite sequencing (WGBS) from ENCODE (ENCFF835NTC). (c) Upset plot of the number of differentially methylated regions (DMRs) detected in our study and previous studies, including overlaps.

Figure 1.

Figure 1—figure supplement 1. Number of allelic differentially methylated regions (DMRs) overlapped to the reported DMRs and parent-of-origin (PofO)-defined phased CpG methylation in each sample examined for differential methylation analysis by DSS R package.

Figure 1—figure supplement 1.

To calculate coverage, the genome length is considered as 3.2G.

Results

Assessing the effectiveness of nanopore methylation calling and detection of known imprinted DMRs

Using the set of 12 LCLs for which we called methylation data, we conducted correlation analysis among nanopore-called methylation data and another WGBS dataset for NA12878 cell line (ENCFF835NTC) to confirm the reliability of methylation calling (Figure 1B). We observed high correlation across cell lines (r=0.75–0.93), as expected as they were the same cell type. NA12878 nanopore-called methylation also showed the highest correlation (r=0.89) with NA12878 WGBS, as expected (Figure 1B). To assess the use of nanopore long reads in detecting known DMRs, we identified previously reported DMRs, including 383 imprinted intervals (Court et al., 2014; Hernandez Mora et al., 2018; Joshi et al., 2016; Zink et al., 2018). Of these 383, we classified 68 as ‘well-characterized’ as they were reported by at least two genome-wide mapping studies or were previously known to be imprinted (Supplementary file 1). Subsequently, we haplotyped the methylome in each cell line, and performed differential methylation analysis (DMA) between alleles across cell lines; 95% (26.5M) of the human autosomal CpGs could be assigned to a haplotype. We detected 200 allelic DMRs (p-value <0.001, |methylation difference|>0.20, and detected in at least four cell lines in each haplotype) (Supplementary file 2). Out of the 200 detected DMRs, 101 overlapped with 103 previously reported DMRs with consistent PofO (Supplementary file 3), while the remaining 99 were novel (Figure 1C). Of the well-characterized DMRs, 64/68 (94%) were detected in our study (Figure 1C; Supplementary file 3).

Similarly, we assessed methylation haplotyping and detection of imprinted DMRs within a single sample. On average, 90% (M ± SD = 25 M ± 1.61 M) of the human methylome could be assigned to a parental haplotype in each cell line (Figure 1—figure supplement 1). Among the well-characterized DMRs, ~73% (M ± SD = 49.5 ± 4.5) could be detected in a single cell line. An additional 33 DMRs (SD = 9.6) reported by only one previous study were detected in each cell line (Figure 1—figure supplement 1).

Confirmation of novel imprinted DMRs

We detected 99 imprinted DMRs that did not overlap with previously reported imprinted DMRs (Court et al., 2014; Hernandez Mora et al., 2018; Joshi et al., 2016; Zink et al., 2018). In order to confirm these DMRs in human tissues and detect potential novel imprinted regions, we investigated WGBS datasets for partial methylation at nanopore-detected DMRs (Materials and methods). We used 60 WGBS datasets from 29 tissue types and 119 blood samples from 87 individuals (Bernstein et al., 2010; ENCODE Project Consortium, 2012; Stunnenberg et al., 2016). We first examined the 68 well-characterized DMRs, 91% of them demonstrated partial methylation (more than 60% of the CpGs at the DMR having between 0.35 and 0.65 methylation) in at least one tissue and individual blood samples (Figure 2A and B). As controls, we used 100 randomly selected 1, 2, 3 kb bins, and CpG islands (CGIs) in 100 resampling iterations. Of these, 0.65%, 0.74%, 2.28%, and 4.83% of the randomly selected 3, 2, 1 kb, and CGIs, respectively, demonstrated partial methylation (Figure 2—figure supplement 1). Applying this approach to the 99 previously unreported DMRs, the WGBS data supported 42 of the novel DMRs (Figure 2, Table 1). In agreement with previous studies reporting a higher number of maternally methylated intervals (Court et al., 2014; Hernandez Mora et al., 2018; Joshi et al., 2016), 74% of the novel DMRs were maternally methylated. Overall, we detected 143 imprinted DMRs of which 101 were found to overlap with previously reported DMRs while 42 were novel DMRs detected by nanopore and confirmed using WGBS data (Figure 2C, Supplementary file 4).

Figure 2. Confirmation of nanopore-detected differentially methylated regions (DMRs) using whole-genome bisulfite sequencing (WGBS) data.

(a) and (b) Violin plots representing the average methylation of each DMR in WGBS tissue and blood samples. (c) Idiogram of the 101 DMRs overlapping to reported intervals and 42 novel DMRs which were confirmed by WGBS. Paternally methylated DMRs are labelled on the left side of each chromosome while maternally methylated DMRs are on the right. Red labels represent germline DMRs while blue labels represent somatic DMRs. Novel DMRs are boxed and named based on their nearest gene (Ensembl Gene 103 GRCh38.p13).

Figure 2—source data 1. The average methylation of each DMR in human WGBS tissue samples.
Figure 2—source data 2. The average methylation of each DMR in human WGBS blood samples.

Figure 2.

Figure 2—figure supplement 1. Violin plots representing methylation in whole-genome bisulfite sequencing (WGBS) blood (left) and tissue (right) samples at randomly selected CpG islands (CGIs), 1, 2, and 3 kb intervals.

Figure 2—figure supplement 1.

Table 1. Forty-two detected imprinted differentially methylated regions (DMRs) from nanopore data and confirmed using whole-genome bisulfite sequencing (WGBS) data.

DMRs are named after the nearest gene (EnsemblGene 103 GRCh38.p13).

ID DMR name Origin Type Distance to nearest imprinted gene % Individuals with partial methylation % Tissues with partial methylation
22 AC024940.1 Maternal Germline 0 15.9 3.8
35 DDA1 Maternal Germline 0 7.3 15.4
38 ACTL10;NECAB3 Maternal Germline 0 3.7 8.7
42 SYCE1 Maternal Germline 3.2 kb 4 9.1
12 FAM83H Maternal Germline 149 kb 48.8 12
20 OPCML Maternal Germline 744.1 kb 45.1 25
19 YBX2P1 Maternal Germline >2 Mb 3.7 7.7
26 NALCN-AS1 Maternal Germline >2 Mb 30.5 10
28 PTGDR Maternal Germline >2 Mb 8.4 3.4
32 AATK Maternal Germline >2 Mb 23.2 9.1
34 MIR7-3HG Maternal Germline >2 Mb 8.1 3.6
2 AC007391.3 Maternal Germline >2 Mb 37.2 60.7
5 C4orf50 Maternal Germline >2 Mb 14.5 22.2
7 CPLANE1 Maternal Germline >2 Mb 2.3 7.1
14 SPTLC1 Maternal Germline >2 Mb 5.8 48.1
1 BMP8A Maternal Somatic 0 4.5 26.1
24 LPAR6;RB1 Maternal Somatic 0 2.3 10.3
36 ZNF714 Maternal Somatic 0 43.9 29.6
17 BTBD7P1 Maternal Somatic >2 Mb 73.5 55.6
18 ANKRD2 Maternal Somatic >2 Mb 34.1 3.8
23 GJA3 Maternal Somatic >2 Mb 25.6 21.4
27 RANBP20P Maternal Somatic >2 Mb 28 32
33 NFE2L3P1 Maternal Somatic >2 Mb 29.3 44.4
39 AL096828.3 Maternal Somatic >2 Mb 50 7.7
41 SLC5A1 Maternal Somatic >2 Mb 56.1 25.9
8 TMEM232 Maternal Somatic >2 Mb 25.6 37.9
9 NME5 Maternal Somatic >2 Mb 22.1 10.7
11 AC092634.8 Maternal Somatic >2 Mb 6.1 3.8
13 CDRT15P6 Maternal Somatic >2 Mb 12.7 5.6
15 TMOD1 Maternal Somatic >2 Mb 35.4 14.8
16 LHX6 Maternal Somatic >2 Mb 44.6 25
30 MIR6085 Paternal Germline >2 Mb 27.1 25.9
3 PAX8;PAX8-AS1 Paternal Somatic 0 24.4 32.1
4 ZDBF2 Paternal Somatic 0 53.6 58.6
29 SNHG14 Paternal Somatic 3 kb 4.7 37.9
37 AC092296.3 Paternal Somatic 90 kb 1.2 7.7
40 RIMBP3 Paternal Somatic 296 kb 17.4 11.1
10 RNU6-293P Paternal Somatic 1.03 Mb 65.1 37
21 PARP11 Paternal Somatic >2 Mb 10.8 22.2
25 UBAC2 Paternal Somatic >2 Mb 1.2 6.9
31 CFAP161 Paternal Somatic >2 Mb 22.4 6.9
6 MIR4456 Paternal Somatic >2 Mb 11.3 22.2

Novel imprinted DMRs display inter-individual variation

Although imprinted methylation is generally regarded as consistent between individuals and resistant to environmental factors, there are examples of polymorphic imprinting where imprinted methylation is not consistently observed across individuals. In order to assess the inter-individual variation of the novel imprinted DMRs, we examined partial methylation in the 119 blood samples from 87 individuals. Some imprinted DMRs such as VTRNA2-1, IGF2, RB1, PARD6G, CHRNE, and IGF2R are known to be polymorphic (Joshi et al., 2016; Zink et al., 2018). The detected DMRs that mapped to these imprinted regions displayed partial methylation in 2–65% of the individuals in our analysis (M ± SD = 40% ± 22%; Supplementary file 5). ZNF331 DMR is known to be consistently imprinted across individuals (Zink et al., 2018). In our analysis, the DMR that mapped to ZNF331 reported interval displayed partial methylation in 99% of the individuals (Supplementary file 5). We then examined inter-individual variation across the 42 novel DMRs. Imprinted methylation at all the novel DMRs demonstrated variation ranging from 1.2% to 73.5% of the individuals (M ± SD = 23.6% ± 19.2%; Table 1). Among the novel DMRs, maternal sDMR near BTBD7P1 is the most consistent with partial methylation in 73.5% of the individuals (Table 1). On the other hand, the novel paternal sDMRs within AC092296.3 and UBAC2 are the most variable with partial methylation in 1.2% of the individuals (Table 1). Among the individuals, four displayed hypermethylation at several of the well-characterized and novel DMRs (Figure 2B), in line with a previous study that identified rare individuals with consistent hyper- or hypomethylation at dozens of imprinted loci, indicative of a generalized imprinting disruption (Joshi et al., 2016).

As demonstrated in Figure 1C, a considerable number of imprinted DMRs detected in different studies are not overlapping between studies. Different studies used different samples and individuals, therefore, we examined inter-individual variation at DMRs detected in two or more studies (including the current work) and those that detected in one study (Supplementary file 5). The DMRs that detected in at least two studies demonstrated more consistency across individuals (M ± SD = 41.2% ± 33%) while DMRs detected in a single study showed more variability (M ± SD = 10.6% ± 15.4%) (Supplementary file 5). These results suggest that polymorphic imprinting can explain this non-overlapping DMRs across studies.

Determination of germline versus somatic status of novel imprinted DMRs

We investigated the methylation status of the detected novel DMRs in sperm and oocyte to determine if they are germline or somatic imprinted intervals. Maternally methylated gDMRs must display high methylation in oocyte and very low or no methylation in sperm with partial methylated after fertilization. Paternally methylated gDMRs must show high methylation in sperm and very low or no methylation in oocyte with partial methylated after fertilization. For the novel DMRs, 16 were detected as germline (more than 70% methylation in oocyte and less than 20% in sperm and vice versa) from which 15 were maternally methylated and one was paternally methylated (Figure 3A and B). Moreover, novel candidate gDMRs showed partial methylation in the blastocyst and fetal samples, indicating the gDMRs escaped de-methylation after fertilization. Meanwhile, sDMRs displayed partial methylation in fetal tissues, indicating their establishment during somatic development (Figure 3A and B). Overall, 16 of the novel DMRs were found to be germline while 26 were sDMRs.

Figure 3. Detection of novel germline and somatic differentially methylated regions (DMRs).

Figure 3.

(a) Heatmap displaying average methylation of the 42 nanopore-detected DMRs confirmed by whole-genome bisulfite sequencing (WGBS). DMRs are named based on their nearest gene (Ensembl Gene 103 GRCh38.p13). (b) Dot plots representing the methylation of novel germline and somatic DMRs in each sample with respect to other samples. (c) IGV screenshots from six novel germline DMRs overlapping with ZNF445 and/or ZFP57 chromatin immunoprecipitation sequencing (ChIP-seq) peaks. The range for all methylation tracks is 0–1.

During germ cell development, gDMRs are bound by proteins critical for their methylation maintenance during post-fertilization reprogramming. ZFP57 and ZNF445 have been identified as imprinting maintenance proteins (Takahashi et al., 2019). Using ZFP57 and ZNF445 ChIP-seq peak calling information from human embryonic stem cells and the HEK 293T cell line (Imbeault et al., 2017; Takahashi et al., 2019), 44% of the novel gDMRs and 49% of the reported gDMRs were bound by ZFP57 and/or ZNF445 (Figure 3C; Supplementary file 4). Of these gDMRs, 89% had a ZFP57 peak and 45% had a ZNF445 peak. This highlights the importance of ZFP57 as an important factor for the maintenance of imprinted methylation at gDMRs. 5′-TGC(5mC)GC-3′ is the canonical binding motif for ZFP57 (Quenneville et al., 2011). Eighty-eight percent of the gDMRs with a ZFP57 peak had at least one 5′-TGCCGC-3′ motif, while 40% of the gDMRs without ZFP57 peak had at least one 5′-TGCCGC-3′ motif in the human genome (GRCh38; Supplementary file 4). Moreover, at gDMRs the number of 5′-TGCCGC-3′ motifs demonstrated a significant positive correlation with the number of individuals demonstrating partial methylation (Pearson = 0.54, p-value = 3.6e−07; Appendix 1—figure 1). This suggests that a greater number of motifs provide more functional binding opportunities for ZFP57 and also less likelihood that all ZFP57 motifs could be perturbed through polymorphism or DNA sequence variation resulting in the imprinted methylation being less polymorphic.

Allelic H3K4me3 histone mark at detected DMRs

The H3K4me3 histone mark is protective against DNA methylation. At imprinted DMRs, the unmethylated allele is usually enriched for this histone modification (Court et al., 2014; John and Lefebvre, 2011). We used H3K4me3 chromatin immunoprecipitation sequencing (ChIP-seq) data for six LCLs and their heterozygous single-nucleotide variant (SNV) calls from 1KGP. Fifty of the DMRs mapped to reported intervals and 19 of the novel DMRs could be examined. Of these, 47 previously reported and 16 novel DMRs showed a significant allelic count in ChIP-seq data (Fisher’s combined p-value binomial <0.05 with at least 80% of the reads on one allele) (Figure 4a; Supplementary file 6). We also examined if the allelic H3K4me3 and methylation are in opposite alleles in NA12878 and NA19240. Forty of the previously reported DMRs and 10 of the novel DMRs with significant allelic H3K4me3 could be examined in NA12878 and/or NA19240. Thirty-seven previously reported and seven novel DMRs showed opposite allelic states between H3K4me3 and methylation (Figure 4b; Supplementary file 6).

Figure 4. Allelic H3K4me3 histone mark at detected imprinted differentially methylated regions (DMRs).

Figure 4.

(a) The plots representing reference and alternative alleles H3K4me3 read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the detected DMRs for the six examined samples. Each point represents an SNV. Blue color displays SNVs with Fisher’s combined p-value binomial <0.05 and at least 80% of the reads on one allele and red color represent those SNVs that did not satisfy either of these thresholds. (b) The plots representing paternal and maternal H3K4me3 read counts for the heterozygous SNVs at DMRs in NA12878 and NA19240. Each point represents an SNV. The ‘Status’ indicates the methylation origin of the DMR and if the DMR is novel or reported.

Figure 4—source data 1. H3K4me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the detected DMRs.
Figure 4—source data 2. H3K4me3 allelic read counts for the paternal and maternal heterozygous single-nucleotide variants (SNVs) mapped to the detected DMRs.

Overall, gDMRs were enriched more with the H3K4me3 mark. Sixty-three percent of the gDMRs and 48% of the sDMRs with at least one heterozygous SNV demonstrated an allelic H3K4me3 mark (Supplementary file 4). This is consistent with previous studies demonstrating the protective role of H3K4me3 against DNA methylation, specifically at germline ICRs in the second round of re-methylation during implantation and somatic development (Chen and Zhang, 2020; Hanna and Kelsey, 2014).

Conservation of detected imprinted DMRs across mammals

To investigate the conservation of detected DMRs and determine if any of the novel DMRs are conserved in mammals, we used WGBS data from mouse (Mus musculus), rhesus macaque (Macaca mulatta), and chimpanzee (Pan troglodytes) (Hon et al., 2013; Jeong et al., 2021; Tung et al., 2012). In determining whether any of the orthologous regions in these mammals displayed partial methylation, we found that 81 of the detected intervals which overlapped with previously reported DMRs and 17 of the novel imprinted DMRs displayed partial methylation in at least one tissue sample in one or more mammals (Figure 5A; Supplementary file 4). In the mouse, orthologs of the 33 detected DMRs were partially methylated, 20 of these were previously reported to be imprinted in mice (Gigante et al., 2019; Xie et al., 2012). Most (88%) of the partially methylated DMRs in the mouse were also partially methylated in rhesus macaque and/or chimpanzee suggesting conservation across species. These shared DMRs mapped to well-known imprinted clusters including KCNQ1, H19, GNAS, SNURF/SNRPN, PLAGL, SGCE, BLCAP, PEG3, PEG10, PEG13, GRB10, BLCAP, NAP1L5, INPP5F, and MEG3 where their allelic PofO expression has already been reported in mouse and other mammals (Geneimprint, 2021; Morison et al., 2001).

Figure 5. Conservation of detected differentially methylated regions (DMRs).

Figure 5.

(a) Upset plot representing the number of previously reported and novel DMRs with evidence of conservation (partial methylation) in each of the mammals. (b) Heatmap representing human DMRs (DMR names on the left) and average methylation of their orthologous intervals in mouse and macaque in different tissues and also in sperm, oocyte, and embryonic samples. Gray regions represent NA values that either did not have an ortholog or enough CpG in the whole-genome bisulfite sequencing (WGBS) data.

Sperm, oocyte, and embryo WGBS data for mouse and rhesus macaque were used to investigate if DMRs classified as germline or somatic in humans were still germline or somatic in other mammals and vice versa (Figure 5B; Dahlet et al., 2020; Gao et al., 2017; Jung et al., 2017; Saenz-de-Juano et al., 2019). Overall, imprinted DMRs preserved their identity as germline or somatic in the two other mammals examined (Figure 5B). However, in a few cases, the type of imprinted DMR was not consistent between humans and other mammals (Figure 5B). This finding is supported by an earlier study indicated that imprinting is largely conserved in mammals while the identity of ICR at the germline stage is not completely conserved (Cheong et al., 2015).

Novel DMRs within known imprinted gene domains

To examine the vicinity of novel DMRs to known imprinted genes, we assembled a list of 259 imprinted genes identified in previous studies (Supplementary file 7; Babak et al., 2015; Baran et al., 2015; Geneimprint, 2021; Jadhav et al., 2019; Morison et al., 2001; Zink et al., 2018). Fifteen of the novel DMRs (six germline and nine somatic) identified in our study could be mapped nearby (<1.03 Mb) to known imprinted genes (Table 1; Supplementary file 4).

Novel sDMRs close to known imprinted genes were mostly paternal of origin. Five of them mapped within known imprinted genes including ZDBF2, PAX8/PAX8-AS1, LPAR6/RB1, BMP8A, and ZNF714 while four mapped close to imprinted genes including PWAR1, LINC00665, DGCR6, and IGF2R (Figure 6; Figure 6—figure supplements 17). For ZNF714 and PAX8/PAX8-AS1, there are no reported imprinted DMRs within the gene or very close to them that explain their imprinted expression. Two of the novel sDMRs mapped to the promoters of these genes with a reverse relation between origin of methylation and expression (Figure 6), suggesting these DMRs could directly suppress paternal and maternal alleles in PAX8-AS1 and ZNF714, respectively.

Figure 6. IGV screenshots of two novel somatic differentially methylated regions (DMRs).

(a) Novel maternally methylated somatic DMR overlapping with the promoter of paternally expressed ZNF714 gene. (b) Novel paternally methylated somatic DMR overlapping with the promoter of maternally expressed PAX8-AS1gene. Highlighted box regions represent the DMRs. Parent-of-origin (PofO) allele-specific expression (ASE) track is created using publicly available ASE data from Zink et al., 2018 (see Materials and methods). Positive vertical bars (upward) represent paternal expression and negative bars (downward) represent maternal expression. The range for all methylation tracks is 0–1.

Figure 6.

Figure 6—figure supplement 1. Novel somatic paternally methylated differentially methylated region (DMR) in paternally expressed ZDBF2 gene.

Figure 6—figure supplement 1.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 2. Novel somatic maternally methylated differentially methylated region (DMR) in maternally expressed RB1 gene and isoform dependent (or in some studies paternally) expressed LPAR6 gene.

Figure 6—figure supplement 2.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 3. Novel somatic maternally methylated differentially methylated region (DMR) in paternally expressed BMP8A gene.

Figure 6—figure supplement 3.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 4. Novel somatic paternally methylated differentially methylated region (DMR) 3 kb away from paternally expressed PWAR1 gene.

Figure 6—figure supplement 4.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 5. Novel somatic paternally methylated differentially methylated region (DMR) 90 kb away from maternally expressed LINC00665 gene.

Figure 6—figure supplement 5.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 6. Novel somatic paternally methylated differentially methylated region (DMR) 296 kb away from randomly/maternally expressed DGCR6 gene.

Figure 6—figure supplement 6.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 6—figure supplement 7. Novel somatic paternally methylated differentially methylated region (DMR) 1.03 Mb away from maternally expressed IGF2R.

Figure 6—figure supplement 7.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

All novel gDMRs close to imprinted genes were maternal of origin. Three of them mapped within known imprinted genes including ACTL10/NECAB3, DDA1, and AC024940.1 while three of them mapped close to imprinted genes including SYCE1, NAPRT, and NTM (Figure 7; Figure 7—figure supplements 14). Three of the germline DMRs mapped within or very close to three known imprinted genes without reported ICR including AC024940.1 (OVOS2), ACTL10/NECAB3, and SYCE1. A novel maternal gDMR mapped to the promoter of the paternally expressed ACTL10 (Zink et al., 2018; Figure 7A). In a previous study, a CpG site located ~130 bp away from the DMR we detected was demonstrated to be a cis-methylation quantitative trait loci with PofO association (Cuellar Partida et al., 2018). Thus, the novel gDMR might be the ICR of this gene and directly suppress the maternal allele. Another novel maternal gDMR mapped to the promoter of SYCE1, which demonstrates paternal expression bias in the allele-specific expression (ASE) track (Zink et al., 2018; Figure 7B). Nakabayashi et al., 2011, also observed two array probes consistent with an imprinted DMR at this region, but were unable to validate them because of the difficulty in designing bisulfite PCR primers (Nakabayashi et al., 2011). The novel maternal gDMR at the promoter of SYCE1 could be the ICR for this gene and directly suppress the maternal allele.

Figure 7. IGV screenshots of two novel maternal germline differentially methylated regions (DMRs).

(a) Novel maternally methylated germline DMR overlapping with the promoter of the paternally expressed ACTL10 gene. (b) Novel maternally methylated germline DMR overlapping with the promoter of the SYCE1 gene, which demonstrates paternal expression bias from parent-of-origin (PofO) allele-specific expression (ASE) track. Highlighted box regions represent the DMRs. PofO ASE track is created using publicly available ASE data from Zink et al., 2018 (see Materials and methods). Positive vertical bars (upward) represent paternal expression and negative bars (downward) represent maternal expression. The range for all methylation tracks is 0–1.

Figure 7.

Figure 7—figure supplement 1. Novel germline maternally methylated differentially methylated region (DMR) in maternally expressed DDA1 gene.

Figure 7—figure supplement 1.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 7—figure supplement 2. Novel germline maternally methylated differentially methylated region (DMR) in paternally expressed AC024940.1 (OVOS2) gene.

Figure 7—figure supplement 2.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 7—figure supplement 3. Novel germline maternally methylated differentially methylated region (DMR) 149 kb away from isoform dependent expressed NAPRT gene.

Figure 7—figure supplement 3.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 7—figure supplement 4. Novel germline maternally methylated differentially methylated region (DMR) 745 kb away from maternally expressed NTM gene.

Figure 7—figure supplement 4.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

Contiguous blocks of parental methylation bias

Previous studies demonstrated two paradigms of imprinting at the PWS/AS imprinted cluster, either PofO methylation confined to particular regulatory regions such as CGIs or subtle paternal bias across this cluster with spikes of maternal methylation (Court et al., 2014; Joshi et al., 2016; Sharp et al., 2010; Zink et al., 2018). Probes with paternal methylation bias at the SNORD116 cluster have been reported, spanning about a 95 kb region, and paternal deletion of this cluster results in PWS phenotypes (Hernandez Mora et al., 2018; Joshi et al., 2016; Matsubara et al., 2019). Slight hypomethylation of SNORD116 cluster in cases with PWS phenotype and hypermethylation in the cases with AS phenotype have been reported (Matsubara et al., 2019). We did not observe paternal methylation bias across the whole PWS/AS cluster; however, we detected a paternal methylation block spanning ~200 kb, immediately downstream of the known, maternally methylated PWS SNURF/SNRPN ICR (Figure 8). This block encompasses the SNORD116 cluster genes and several other genes such as PWAR1, 5 and 6, PWARSN and IPW. In addition to the PWS/AS block, we detected six other PofO methylation bias blocks ranging from 35 to 65 kb in size, were located within ZNF331, KCNQ1OT1, GNAS/GNAS-AS1, L3MBTL1, ZNF597/NAA60, and GPR1-AS/ZDBF2 imprinted clusters (Figure 8—figure supplements 16).

Figure 8. IGV screenshot of 200 kb paternally methylated biased methylation block in the PWS/AS imprinted cluster.

The range for all methylation tracks is 0–1. The histone mark tracks represent allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles.

Figure 8.

Figure 8—figure supplement 1. IGV screenshot of ~65 kb paternally methylated biased methylation block in GPR1-AS/ZDBF2 imprinted cluster.

Figure 8—figure supplement 1.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. Purple highlight demonstrates the whole block. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Black box represents known somatic differentially methylated region (DMR) in the block while red box shows known germline DMR in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 8—figure supplement 2. IGV screenshot of ~58 kb paternally methylated biased methylation block in ZNF331/ZNF813 imprinted cluster.

Figure 8—figure supplement 2.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Purple highlight demonstrates the whole block. Red box shows known germline differentially methylated region (DMR) in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 8—figure supplement 3. IGV screenshot of ~56 kb paternally methylated biased methylation block in KCNQ1/KCNQ1OT1 imprinted cluster.

Figure 8—figure supplement 3.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Purple highlight demonstrates the whole block. Red box shows known germline differentially methylated region (DMR) in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 8—figure supplement 4. IGV screenshot of ~44 kb paternally methylated biased methylation block in GNAS/GNAS-AS1 imprinted cluster.

Figure 8—figure supplement 4.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Purple highlight demonstrates the whole block. Black box represents known somatic differentially methylated region (DMR) in the block while red box shows known germline DMR in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 8—figure supplement 5. IGV screenshot of ~42 kb paternally methylated biased methylation block in L3MBTL1/SGK2 imprinted cluster.

Figure 8—figure supplement 5.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Purple highlight demonstrates the whole block. Red box shows known germline differentially methylated region (DMR) in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 8—figure supplement 6. IGV screenshot of ~35 kb maternally methylated biased methylation block in ZNF597/NAA60 imprinted cluster.

Figure 8—figure supplement 6.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Purple highlight demonstrates the whole block. Black box represents known somatic differentially methylated region (DMR) in the block. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

As mentioned in the ‘Confirmation of novel imprinted DMRs’ section, only 42 out of 99 detected novel DMRs in the nanopore data could be confirmed in the WGBS data as partially methylated. Forty of the novel nanopore-detected DMRs that did not show partial methylation in the WGBS data mapped to the seven PofO-biased blocks. At imprinted intervals one allele is methylated and the other one is not. Therefore, at these intervals aggregated methylation from both alleles demonstrate partial methylation (~50% methylation) in WGBS data. However, in the subtle PofO bias blocks both alleles are methylated with a subtle hypomethylation on one of the alleles. Therefore, in contrast to imprinted intervals, aggregated methylation at these blocks usually do not show partial methylation in WGBS data. The weaker or subtle differential methylation can therefore explain why several novel DMRs detected in the nanopore data did not show partial methylation in the WGBS data and demonstrates the utility of nanopore sequencing in detecting subtle ASM differences.

Enriched allelic H3K36me3 and H3K27me3 histone marks at contiguous blocks

RNA polymerase II recruits SETD2 during elongation which results in the deposition of the H3K36me3 mark in the gene body. In turn, H3K36me3 recruits de novo DNA methyltransferases through their PWWP domain which results in DNA methylation in the gene body (Wagner and Carpenter, 2012).

Within the seven PofO methylation-biased blocks, parentally expressed or active allele demonstrated hypermethylation suggesting that subtle methylation is linked to parental ASE. Except ZNF597/NAA60, all the blocks demonstrated hypermethylation and ASE on the paternal allele. ZNF597/NAA60 demonstrated hypermethylation and ASE on the maternal allele. Therefore, to assess allelic H3K36me3, we used ChIP-seq data from six LCLs (Kasowski et al., 2013). H3K36me3 and H3K27me3 histone marks are mutually exclusive (Yuan et al., 2011). Moreover, DNA methylation and H3K27me3 shown to be mutually exclusive at CGIs (Brinkman et al., 2012). Thus, we also examined allelic H3K27me3 in the same cell line samples (Kasowski et al., 2013).

To analyze allelic histone modifications and detect blocks of allelic histone marks at large blocks of PofO bias, we binned the genome into 10 kb intervals and performed a binomial test with Fisher’s combined p-value test to determine the significance of allelic read counts at 10 kb intervals with >3 informative heterozygous SNVs (having at least five mapped reads) within each block in each sample. A 10 kb bin considered as significant for allelic histone mark if it had an adjusted p-value <0.001 and if at least 70% of the SNVs within the 10 kb bin having ≥80% of the reads mapped to one allele. In total, 174 bins for H3K36me3 and 132 bins for H3K27me3 could be examined. Of these, 147 bins for H3K36me3 and 51 bins for H3K27me3 were significant. Thirty-eight bins were significant for both histone marks in the same sample. All the seven blocks demonstrated multiple significant bins for H3K36me3 at almost all the samples. L3MBTL1, GPR1-AS/ZDBF2, GNAS/GNAS-AS1, and ZNF597/NAA60 demonstrated multiple significant H3K27me3 bins in majority of the samples and KCNQ1OT1, PWS/AS, and ZNF331 had significant H3K27me3 bins at 3, 2, and 1 of the samples, respectively. H3K36me3 and H3K27me3 demonstrated mutual exclusive pattern and H3K36me3 appeared on the hypermethylated allele while H3K27me3 on the hypomethylated allele (Figure 8; Figure 8—figure supplements 16; Figure 9; Supplementary file 8).

Figure 9. Mutual exclusive allelic H3K36me3 and H3K27me3 histone marks at seven parent-of-origin (PofO) methylation-biased blocks.

All blocks demonstrate allelic H3K36me3 on hypermethylated allele and H3K27me3 on hypomethylated allele. For NA12878 and NA19240, allele1 is the paternal and allele2 is maternal. For sake of visualization in other four cell lines without parental information, allele1 for H3K36me3 mark demonstrates the allele with more mapped reads at all blocks except ZNF597/NAA60. Therefore, for H3K36me3 we swapped the reference allele read count with the alternative allele read count if the reference allele count was less than the alternative allele count. At ZNF597/NAA60, we swapped the reference allele read count with the alternative allele read count if the reference had higher read count. We also swapped the reference and the alternative allele counts for the same SNVs for H3K27me3. Each point represents a heterozygous SNV. Lines are connecting SNVs that have mapped reads for both histone modifications.

Figure 9—source data 1. H3K36me3 and H3K27me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the detected PofO-biased methylation blocks.

Figure 9.

Figure 9—figure supplement 1. Allelic H3K36me3 and H3K27me3 histone marks read count at seven test blocks.

Figure 9—figure supplement 1.

Test blocks do not display allelic bias for H3K36me3 or H3K27me3 histone modifications. For NA12878 and NA19240, allele1 is the paternal and allele2 is maternal. For sake of visualization in other four cell lines without parental information, allele1 for H3K36me3 mark demonstrates the allele with more mapped reads. Therefore, for H3K36me3 we swapped reference allele read count with alternative allele read count if reference allele count was less than alternative allele count. We also swapped reference and alternative allele counts for the same SNVs for H3K27me3. Each point represents a heterozygous SNV. Lines are connecting SNVs that have mapped reads for both histone modifications.
Figure 9—figure supplement 1—source data 1. H3K36me3 and H3K27me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the test blocks.
Figure 9—figure supplement 2. IGV screenshot of the PPIEL imprinted cluster.

Figure 9—figure supplement 2.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Red box shows known germline DMR. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 3. IGV screenshot of the MEG3 imprinted cluster.

Figure 9—figure supplement 3.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Black box shows known somatic differentially methylated region (DMR). The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 4. IGV screenshot of the MEST imprinted cluster.

Figure 9—figure supplement 4.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Red box shows known germline differentially methylated region (DMR). The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 5. IGV screenshot of the DIRAS3 imprinted cluster.

Figure 9—figure supplement 5.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Red box shows known germline differentially methylated region (DMR). The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 6. IGV screenshot of the IGF2 imprinted cluster.

Figure 9—figure supplement 6.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Black box represents known somatic differentially methylated region (DMR) and red box shows known germline DMR. The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 7. IGV screenshot of the MTRNR2L4 imprinted cluster.

Figure 9—figure supplement 7.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Black box represents known somatic differentially methylated region (DMR). The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.
Figure 9—figure supplement 8. IGV screenshot of the ADNP2 imprinted cluster.

Figure 9—figure supplement 8.

The histone mark tracks are representing allelic read counts for H3K36me3 and H3K27me3 modifications. The range for all histone mark tracks is 0–20. In H3K27 and H3K36 tracks, for NA12878 and NA19240 the parent-of-origin (PofO) could be determined and specified by maternal (Mat) and paternal (Pat) alleles. While the other samples are specified by reference (Ref) and alternative (Alt) alleles. Red box shows known germline differentially methylated region (DMR). The range for all methylation tracks is 0–1. In PofO ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

To determine if allelic histone marks are unique to the PofO methylation-biased blocks, we examined allelic histone marks on several other imprinted clusters with strong ASE which did not display PofO bias methylation. For this, we examined PPIEL, MEG3, MEST, DIRAS3, IGF2, MTRNR2L4, and ADNP2/PARD6G-AS1 clusters. Eighty-three bins for H3K36me3 and 138 bins for H3K27me3 could be examined at the seven test blocks. Of these, only five bins for H3K36me3 and seven bins for H3K27me3 were significant and none of the bins were significant for both histone marks (Figure 9—figure supplements 18; Supplementary file 9). These results suggest that the blocks of PofO methylation bias in the gene body of active alleles are mediated by transcription and histone marks at their gene bodies.

Discussion

Here, we describe the first genome-wide map of human ASM intended to detect novel imprinted intervals using nanopore sequencing. Leveraging long reads and parental SNVs allowed us to phase methylation for ~26.5 million autosomal CpGs representing 95% of the CpGs in the human autosomal genome (GRCh38) across 12 LCLs. This effort achieves a much higher resolution than previous studies aimed at capturing allelic methylation using bisulfite sequencing or methylation arrays (Court et al., 2014; Hernandez Mora et al., 2018; Joshi et al., 2016; Zink et al., 2018). Fourteen of our novel DMRs did not have any phased CpG from previous WGBS or array studies (Supplementary file 4), illustrating the utility of longer reads for imprinted methylation calling. DMRs that are detected in only a single study displayed higher variations across individuals compared to those detected by at least two studies. Therefore, lack of phasing at some novel DMRs in previous studies and higher variation in imprinted methylation at novel DMRs could explain the reason they were not detected previously. We also demonstrated that germline DMRs with a greater number of ZFP57 motif tend to be more consistently imprinted across individuals suggesting motifs redundancy increases ZFP57 recruitment and tolerance to any DNA sequence variation. However, due to the availability of DNA sequence in a limited number of samples, we were not able to examine sequence variation at the DMRs and the ZFP57-binding motifs for any possible association with polymorphic imprinted methylation which will require further study.

Even though we detected methylation for all the CpGs in the human genome (GRCh38), we were not able to phase 5% of the human methylome (Kent et al., 2002). We used SNVs detected from short-reads data in the 1KGP and GIAB databases for phasing (Auton et al., 2015; Zook et al., 2019). Seventy-five percent of the unphased CpGs mapped to the ENCODE blacklist, regions with low mappability, indicative of lack of SNVs to phase reads (Amemiya et al., 2019). Improvement in base calling and variant calling from nanopore reads could enable the phasing of a complete genome-wide methylome using nanopore-detected SNVs.

We detected 16 novel gDMRs and 26 novel sDMRs. These novel DMRs were supported by several lines of evidence in our analyses. (1) They displayed significant PofO methylation bias in nanopore-sequenced cell line samples. (2) They were partially methylated in WGBS data. (3) gDMRs demonstrated establishment of methylation in sperm or oocyte and escape from the second de-methylation step. (4) Eighty-four percent of those for which H3K4me3 ChIP-seq data could be phased and examined showed significant allelic H3K4me3. (5) Forty percent showed evidence of conservation. (6) Eighty-three percent mapped to at least one regulatory region including CGI, CTCF-binding site, and enhancer (Supplementary file 4). These novel DMRs represent a substantial and well-confirmed expansion of known regions of imprinting, which may aid future research and diagnosis in the fields of genetic medicine and oncology.

We detected seven blocks of allelic methylation bias (Figure 8; Figure 8—figure supplements 16). All of the blocks represented several common features. (1) They were detected in imprinted genes that appeared in a cluster. (2) There was at least one well-characterized and conserved gDMR in each block (except ZNF597/NAA60 block with a conserved sDMR). (3) The well-characterized DMRs in these blocks displayed significant allelic H3K4me3 (except the DMR in the L3MBTL1 block, which could not be examined due to the lack of an SNV). (4) The well-characterized DMRs in these blocks overlapped with the promoters of genes with subtle PofO methylation bias at the gene body and DMR itself displayed opposite PofO methylation (except for GPR1-AS/ZDBF2 block where DMR did not map to the promoter and had the same PofO with the gene body). (5) All the blocks were accompanied by a strong allelic expression and H3K36me3 histone mark on the subtle hypermethylated allele and H3K27me3 on the hypomethylated allele. This represents a novel facet of imprinting biology and suggests a link between allelic expression and histone modifications with biased PofO methylation at these blocks. However, the mechanism regulating such blocks and the rule of these PofO-biased methylation remain to be determined. One possible explanation could be that the subtle parental methylation bias is used by cells to express important genes (genes that can regulate other genes in the cluster or have regulatory roles) in an imprinted cluster with higher fidelity through its gene body methylation on the active allele. For example, at the KCNQ1OT1 and GNAS/GNAS-AS1 clusters, the methylation blocks overlap with KCNQ1OT1 and GNAS-AS1 gene bodies, both of which encode antisense RNA transcripts that regulate other genes in the imprinted cluster (Chiesa et al., 2012; Turan and Bastepe, 2013).

Orthologous regions of ~40% of the detected DMRs demonstrated partial methylation in one or more of the three mammals including chimpanzee, rhesus macaque, and mouse, suggesting their conservation. There were a considerably higher number of orthologous sites and partially methylated orthologous DMRs in chimpanzee and rhesus macaque, in agreement with more similarities and less distance to these primates compared to the mouse in human evolution. Previously, Court et al., 2014, detected 14 novel DMRs, and did not detect any imprinted orthologs of their novel DMRs in mice (Court et al., 2014). All 14 DMRs also overlapped with our detected DMRs and six of them had orthologous regions in mm10 using the UCSC liftover file (Kent et al., 2002). Two of the orthologs displayed partial methylation in mouse; the first is MEG8 human DMR with its orthologous Rian gene in the mouse, which was not examined by Court et al., 2014, and the other is found in the Htr5a gene, which was previously reported as not conserved in mouse (Court et al., 2014). After reviewing their analysis, Court et al., 2014, seem to have examined different orthologous region (Appendix 2—figure 1). For Htr5a, they examined the CGI (CpG:_102) ~50 kb away from the gene, while we examined the region spanning the first or second exon (two transcripts) of Htr5a which was partially methylated while CpG:_102 was also unmethylated in our study.

Using reported imprinted genes, 36% of the novel DMRs mapped close to known imprinted genes (Babak et al., 2015; Baran et al., 2015; Geneimprint, 2021; Jadhav et al., 2019; Morison et al., 2001; Zink et al., 2018). Five of our novel DMRs could be potential ICRs for reported imprinted genes. Specifically, imprinted DMRs overlapping the promoters of ZNF714, PAX8-AS1, ACTL10, and SYCE1 genes (Figures 6 and 7). ZNF714 is a member of the zinc finger family of proteins which have several imprinted genes with developmental roles (Babak et al., 2015; Baran et al., 2015; Camargo et al., 2012; Jadhav et al., 2019; Zink et al., 2018). ZNF714 has been reported to be associated with non-syndromic cleft lip (Camargo et al., 2012). Aberrant methylation of multiple CpGs overlapping with the novel DMR at PAX8-AS1 has been implicated in thyroid disorders (Candler et al., 2021). SYCE1 and ACTL10 are also implicated in human diseases (Bak et al., 2016; Maor-Sagie et al., 2015). Thus, these imprinted DMRs could be of potential clinical value.

In addition to the aforementioned novel DMRs, two of the reported DMRs in PTCHD3 and FANCC are also interesting. Paternal expression of PTCHD3 and maternal expression for FANCC were previously detected by Zink et al., 2018, though they could not detect any associated DMR due to the lack of phased CpG (Zink et al., 2018). Hernandez Mora et al., 2018, detected three maternally methylated probes at the promoter of PTCHD3 and one maternally methylated probe in intron 1 of FANCC, but were unable to examine the parental expression (Hernandez Mora et al., 2018). We also detected two maternally methylated gDMRs overlapping with the promoter of PCTHD3 and intron 1 of FANCC (Appendix 3—figure 1; Appendix 3—figure 2). Therefore, these gDMRs could be the ICRs for these genes. The maternal gDMR at the PTCHD3 promoter can directly suppress the maternal allele and results in paternal expression. FANCC gDMR overlaps with a CGI and CTCF-binding site. CTCF is a methylation-sensitive DNA-binding protein and CpG methylation can inhibit CTCF binding (Hashimoto et al., 2017; Renda et al., 2007). Moreover, CTCF binding to the first intron of major immediate early (MIE) gene of the human cytomegalovirus (HCMV) in HCMV-infected cells resulted in repression of this gene (Puerta et al., 2014). Therefore, the maternally methylated DMR in intron 1 of maternally expressed FANCC suggests a mechanism through which the paternal allele is suppressed by CTCF binding at DMR while DNA methylation inhibits CTCF binding at the maternal allele.

Overall, our study demonstrates a near-complete genome-wide map of human ASM by leveraging long-read nanopore technology. The use of nanopore technology allowed us to expand the set of known imprinted DMRs using 12 LCLs with parental SNPs. Moreover, we detected seven large PofO bias methylation blocks with enriched allelic expression and histone modifications. We showed that nanopore sequencing has the ability to achieve a higher resolution of phased CpGs using a small sample size and allows for the calling of imprinted methylation in a single sample, potentially reducing the cost by reducing the sample size.

Materials and methods

Nanopore sequencing data and detection of ASM

We used publicly available nanopore sequencing data for 12 LCLs with trio data available. Raw and base-called nanopore data for HG002, HG005, HG00733, HG01109, HG01243, HG02055, HG02080, HG02723, HG03098, and HG03492 were obtained from the Human Pangenomics and GIAB (Shafin et al., 2020; Zook et al., 2016). NA19240 data (ERR3046934 and ERR3046935 raw nanopore and their base-called reads ERR3219853 and ERR3219854) were obtained from De Coster et al., 2019. Raw and base-called nanopore data for NA12878 were obtained from rel6 nanopore WGS consortium (Jain et al., 2018). Reads were mapped to the GRCh38 human reference genome using Minimap2 with the setting minimap2 –ax map-ont (Kent et al., 2002; Li, 2018). For all the cell lines and their parents, except HG002 and HG005, high-quality SNVs were called using Strelka2 with default parameters from alignment files in the 1KGP GRCh38 (Auton et al., 2015; Kim et al., 2018). High-quality SNVs for HG002 and HG005 and their parents were obtained from GIAB v.3.3.2 high confidence variant calls (Zook et al., 2019). CpG methylations were called from nanopore data using nanopolish with default parameters (Simpson et al., 2017). Methylation calls for each sample were preprocessed using the NanoMethPhase methyl_call_processor default setting (Akbari et al., 2021). Subsequently, haplotyping and PofO methylation detection were performed using NanoMethPhase and trio (mother, father, and child) variant call data with the setting nanomethphase phase –mbq 0. Finally, DMRs between haplotypes were called using the default setting of NanoMethPhase dma module that uses Dispersion Shrinkage for Sequencing data R package for DMA (Park and Wu, 2016). To avoid the confounding effects of X-chromosome inactivation, and because previous studies demonstrated no evidence of imprinting at sex chromosomes, we only examined autosomal chromosomes (Court et al., 2014; Joshi et al., 2016; Zink et al., 2018).

WGBS data and detection of novel DMRs

To confirm allelic methylation in other tissues and also detect potential novel imprinted DMRs, we used 60 public WGBS data records for 29 tissue type samples from the Epigenomics Roadmap and ENCODE projects (Supplementary file 10) and 119 blood WGBS datasets for 87 individuals from the Blueprint project (Bernstein et al., 2010; ENCODE Project Consortium, 2012; Stunnenberg et al., 2016; Supplementary file 10). CpGs with at least five mapped reads were used for further analysis. At imprinted DMRs, only one allele is methylated and we expect to observe partial methylation (~50%) at such regions. Therefore, we investigated the partial methylation of nanopore-detected DMRs in WGBS data (code is available on https://github.com/vahidAK/NanoMethPhase/tree/master/scripts (Akbari, 2022): PartialMethylation_AtDMR.sh). As controls, we examined 100 randomly selected CGIs: 1, 2, and 3 kb intervals with more than 15 CpGs each resampled 100 times.

Detection of gDMRs and sDMRs

To discriminate gDMRs from sDMRs, we used publicly available WGBS data for three sperms, two oocytes, and one blastocyst first published by Okae et al., 2014, and three fetal tissue libraries (GSM1172595 thymus, GSM1172596 muscle, and GSM941747 brain) from the Roadmap project (Bernstein et al., 2010; Okae et al., 2014).

Allelic H3K4me3, H3K36me3, and H3K27me3 analysis

H3K4me3, H3K36me3, and H3K27me3 ChIP-seq fastq files were obtained for NA12878, NA12891, NA12892, NA19238, NA19239, and NA19240 (SRP030041) (Kasowski et al., 2013). ChIP-seq data were aligned to the GRCh38 reference genome using the bwa-mem default setting (Kent et al., 2002; Li and Durbin, 2009). High-quality SNVs were called for these samples from 1KGP GRCh38 alignment files using strelka2 (Auton et al., 2015; Kim et al., 2018). We then counted the number of reads with a minimum mapping quality of 20 and base quality of 10 at each heterozygous SNV and kept those with at least five mapped reads. The reference allelic counts and total counts at each heterozygous SNV were used to detect significant allelic bias using a two-sided binomial test under the default probability of p=0.5 in python SciPy package (codes are available on GitHub https://github.com/vahidAK/NanoMethPhase/tree/master/scripts: CountReadsAtSNV.py & Binomial_test.py) (Virtanen et al., 2020).

ASE track

ASE data from Zink et al., 2018 (PofO_ASE.tsv; https://doi.org/10.6084/m9.figshare.6816917) were used to create ASE track for IGV. In PofO_ASE.tsv file from Zink et al., they have calculated lor_paternal_maternal across individuals which is (lor_ref_alt_pref - lor_ref_alt_palt)/2. lor_ref_alt_pref is log(#reads with ref allele/#reads with alt allele) when paternal homologue has ref allele and lor_ref_alt_palt is log(#reads with ref allele/#reads with alt allele) when paternal homologue has alt allele. For visualization in IGV, we converted the PofO_ASE.tsv file from Zink et al., to a bigwig format file using the UCSC tool bedGraphToBigWig version 4 and we kept lor_paternal_maternal as ASE value (Kent et al., 2010).

Mammalian conservation of DMRs

We used 16 WGBS datasets for mouse (GSM1051150-60 and GSM1051162-66), 34 WGBS datasets for rhesus macaque (GSE34128 and GSE151768), and 22 WGBS datasets for chimpanzee (GSE151768) to examine partial methylation in orthologous intervals (Hon et al., 2013; Jeong et al., 2021; Tung et al., 2012). Mouse, macaque, and chimpanzee coordinates lifted over to mm10, RheMac8, and PanTro5 coordinates using CrossMap and the appropriate liftover file from the UCSC genome browser. The list of detected human DMRs were also converted to the orthologous regions for each mammal using CrossMap and the appropriate liftover file (Kent et al., 2002; Zhao et al., 2014). Since many coordinates in the human splitted to several orthologs in other mammals, we merged orthologs that were ≤200 bp apart.

To examine the somatic and germline ortholog DMRs, we used embryo (GSM3752614, GSM4558210), sperm (GSE79226), and oocyte (GSM3681773, GSM3681774, GSM3681775) WGBS libraries from mouse; and embryo (GSM1466814), sperm (GSM1466810), and oocyte (GSM1466811) WGBS libraries from rhesus macaque (Dahlet et al., 2020; Gao et al., 2017; Jung et al., 2017; Saenz-de-Juano et al., 2019).

Acknowledgements

SJMJ and MAM acknowledge funding from the Canada Research Chairs program and the Canadian Foundation for Innovation. VA acknowledges funding from the University of British Columbia with a Four-Year Doctoral Fellowship.

Appendix 1

Appendix 1—figure 1. Pearson correlation for the number of ZFP57-binding motif (TGCCGC) at differentially methylated regions (DMRs) and percent of individuals that demonstrated partial methylation in their whole-genome bisulfite sequencing data at the DMRs.

Appendix 1—figure 1.

Appendix 1—figure 1—source data 1. Number of ZFP57-binding motif (TGCCGC) at differentially methylated regions (DMRs).

Appendix 2

Appendix 2—figure 1. Conserved differentially methylated region (DMR) at HTR5A in human and mouse.

Appendix 2—figure 1.

The known DMR at HTR5A reported to be not conserved in mouse by Court et al. (PMID: 24402520; see supplementary figure S6 from Court et al., 2014), however we detected it as conserved due to a different orthologous examined in our study. Court et al. examined CGI 102 which is also not imprinted in our analysis, however the ortholog we examined spans beginning of the HTR5A and is partially methylated which suggests the region is imprinted. Red boxes are showing germline DMRs. The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

Appendix 3

Appendix 3—figure 1. Germline maternally methylated differentially methylated region (DMR) in the promoter of paternally expressed PTCHD3 gene.

Appendix 3—figure 1.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

Appendix 3—figure 2. Germline maternally methylated differentially methylated region (DMR) in the intron 1 of maternally expressed FANCC gene.

Appendix 3—figure 2.

The range for all methylation tracks is 0–1. In parent-of-origin (PofO) ASE track, positive or upward bars represent paternal expression bias and negative or downward bars represent maternal expression bias.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Steven JM Jones, Email: sjones@bcgsc.ca.

Jessica K Tyler, Weill Cornell Medicine, United States.

Jessica K Tyler, Weill Cornell Medicine, United States.

Funding Information

This paper was supported by the following grants:

  • The University of British Columbia, 4-Year Doctoral Fellowship to Vahid Akbari.

  • Canada Research Chairs to Marco A Marra, Steven JM Jones.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Validation, Visualization, Writing - original draft, Writing – review and editing.

Conceptualization, Data curation, Software, Writing – review and editing.

Conceptualization, Data curation, Formal analysis, Investigation, Software, Writing – review and editing.

Conceptualization, Formal analysis, Investigation, Writing – review and editing.

Conceptualization, Formal analysis, Writing – review and editing.

Conceptualization, Formal analysis, Resources, Writing – review and editing.

Conceptualization, Formal analysis, Resources, Writing – review and editing.

Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review and editing.

Additional files

Supplementary file 1. List of the reported imprinted differentially methylated regions (DMRs).
elife-77898-supp1.xlsx (32.4KB, xlsx)
Supplementary file 2. The list of 200 detected imprinted differentially methylation regions from nanopore data.

This list demonstrates the results of differential methylation analysis (DMA) using nanomethphase dma module that uses DSS R package for DMA.

elife-77898-supp2.xlsx (31.9KB, xlsx)
Supplementary file 3. Mapping detected differentially methylated regions (DMRs) using nanopore sequencing across 12 lymphoblastoid cell lines (LCLs) to reported DMRs.
elife-77898-supp3.xlsx (29.6KB, xlsx)
Supplementary file 4. Detected differentially methylated regions (DMRs) using nanopore sequencing across 12 lymphoblastoid cell lines (LCLs) that mapped to reported DMRs or confirmed in whole-genome bisulfite sequencing (WGBS) data.
elife-77898-supp4.xlsx (48.6KB, xlsx)
Supplementary file 5. The results for examining inter-individual variations of the detected differentially methylated regions (DMRs) in whole-genome bisulfite sequencing (WGBS) datasets from blood.
elife-77898-supp5.xlsx (38.1KB, xlsx)
Supplementary file 6. The allelic read counts of H3K4me3 chromatin immunoprecipitation sequencing (ChIP-seq) for each heterozygous single-nucleotide variant (SNV) at the detected differentially methylated regions (DMRs) in six lymphoblastoid cell line (LCL) samples.
elife-77898-supp6.xlsx (61.6KB, xlsx)
Supplementary file 7. List of the known imprinted genes.
elife-77898-supp7.xlsx (28KB, xlsx)
Supplementary file 8. The allelic read counts of H3K36me3 and H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) for the seven parent-of-origin (PofO)-biased methylation blocks in six lymphoblastoid cell line (LCL) samples.
elife-77898-supp8.xlsx (250KB, xlsx)
Supplementary file 9. The allelic read counts of H3K36me3 and H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) for the seven test blocks in six lymphoblastoid cell line (LCL) samples.
elife-77898-supp9.xlsx (190.6KB, xlsx)
Supplementary file 10. List of the whole-genome bisulfite sequencing (WGBS) tissues and WGBS blood samples used in our study.
Transparent reporting form

Data availability

The current manuscript is a computational study, so no new datasets have been generated for this manuscript. The source of each dataset is provided under the ‘“Materials and methods’” section under the appropriate subsection. Genomic tracks generated in this study including DNA methylation and histone modification tracks are deposited in the Mendeley data repository (https://doi.org/10.17632/f4k2gytbh5.1). Codes are uploaded to GitHub https://github.com/vahidAK/NanoMethPhase/tree/master/scripts (copy archived at swh:1:rev:1657f7aed60604aa7c7f3e77d992d76bee6bf6d3): PartialMethylation_AtDMR.sh, CountReadsAtSNV.py and Binomial_test.py.

The following dataset was generated:

Akbari V. 2022. Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing_Akbari-etal. Mendeley Data.

The following previously published datasets were used:

Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, Armstrong J, Tigyi K, Maurer N, Koren S, Sedlazeck FJ, Marschall T, Mayes S, Costa V, Zook JM, Liu KJ, Kilburn D, Sorensen M, Munson KM, Vollger MR, Monlong J, Garrison E, Eichler EE, Salama S, Haussler D, Green RE, Akeson M, Phillippy A, Miga KH, Carnevali P, Jain M, Paten B. 2020. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Human Pangenome Reference Consortium. hpgp-data

Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, Tyson JR, Beggs AD, Dilthey AT, Fiddes IT, Malla S, Marriott H, Nieto T, O'Grady J, Olsen HE, Pedersen BS, Rhie A, Richardson H, Quinlan AR, Snutch TP, Tee L, Paten B, Phillippy AM, Simpson JT, Loman NJ, Loose M. 2018. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nanopore WGS Consortium. NA12878

De Coster W, De Rijk P, De Roeck A, De Pooter T, D'Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. 2019. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. European Nucleotide Archive. PRJEB26791

Zook JM, Catoe D, McDaniel J, Vang L, Spies N, Sidow A, Weng Z, Liu Y, Mason CE, Alexander N, Henaff E, McIntyre ABR, Chandramohan D, Chen F, Jaeger E, Moshrefi A, Pham K, Stedman W, Liang T, Saghbini M, Dzakula Z, Hastie A, Cao H, Deikus G, Schadt E, Sebra R, Bashir A, Truty RM, Chang CC, Gulbahce N, Zhao K, Ghosh S, Hyland F, Fu Y, Chaisson M, Xiao C, Trow J, Sherry ST, Zaranek AW, Ball M, Bobe J, Estep P, Church GM, Marks P, Kyriazopoulou-Panagiotopoulou S, Zheng GXY, Schnall-Levin M, Ordonez HS, Mudivarti PA, Giorda K, Sheng Y, Rypdal KB, Salit M. 2016. Extensive sequencing of seven human genomes to characterize benchmark reference materials. NCBI Genome in a Bottle FTP. FTP

The 1000 Genomes Project Consortium 2015. A global reference for human genetic variation. The International Genome Sample Resource. 30x-grch38

Stunnenberg HG, Abrignani S, Adams D, de Almeida M, Altucci L, Amin V, Amit I, Antonarakis SE, Aparicio S, Arima T, Arrigoni L, Arts R, Asnafi V, Badosa ME, Bae JB, Bassler K, Beck S, Berkman B, Bernstein BE, Hirst M. 2016. The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery. Blueprint Epigenome. blueprint

Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA. 2010. The NIH Roadmap Epigenomics Mapping Consortium. NCBI Gene Expression Omnibus. epigenomics

ENCODE Project Consortium 2012. An Integrated Encyclopedia of DNA Elements in the Human Genome. ENCODE. encodeproject

Okae H, Chiba H, Hiura H, Hamada H, Sato A, Utsunomiya T, Kikuchi H, Yoshida H, Tanaka A, Suyama M, Arima T. 2014. Genome-wide analysis of DNA methylation dynamics during early human development. DNA Data Bank of Japan. DRA003802

Steinmetz LM, Hogenesch JB, Kellis M, Batzoglou S, Snyder M. 2013. Extensive Variation in Chromatin States Across Humans. NCBI Sequence Read Archive. SRP030041

Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. 2013. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. NCBI Gene Expression Omnibus. GSE42836

Tung J, Barreiro LB, Johnson ZP, Hansen KD, Michopoulos V, Toufexis D, Michelini K, Wilson ME, Gilad Y. 2012. Social environment is associated with gene regulatory variation in the rhesus macaque immune system. NCBI Gene Expression Omnibus. GSE34128

Jeong H, Mendizabal I, Berto S, Chatterjee P, Layman T, Usui N, Toriumi K, Douglas C, Singh D, Huh I, Preuss TM, Konopka G, Yi S V. 2021. Evolution of DNA methylation in the human brain. NCBI Gene Expression Omnibus. GSE151768

Dahlet T, Argüeso Lleida A, Al Adhami H, Dumas M, Bender A, Ngondo RP, Tanguy M, Vallet J, Auclair G, Bardet AF, Weber M. 2020. Genome-wide analysis in the mouse embryo reveals the importance of DNA methylation for transcription integrity. NCBI Gene Expression Omnibus. GSE130735

Jung YH, Sauria MEG, Lyu X, Cheema MS, Ausio J, Taylor J, Corces VG. 2017. Chromatin States in Mouse Sperm Correlate with Embryonic and Adult Regulatory Landscapes. NCBI Gene Expression Omnibus. GSE79226

Saenz-de-Juano MD, Ivanova E, Billooye K, Herta A-C, Smitz J, Kelsey G, Anckaert E. 2019. Genome-wide assessment of DNA methylation in mouse oocytes reveals effects associated with in vitro growth, superovulation, and sexual maturity. NCBI Gene Expression Omnibus. GSE128656

Gao F, Niu Y, Sun YE, Lu H, Chen Y, Li S, Kang Y, Luo Y, Si C, Yu J, Li C, Sun N, Si W, Wang H, Ji W, Tan T. 2017. De novo DNA methylation during monkey pre-implantation embryogenesis. NCBI Gene Expression Omnibus. GSE60166

Florian Z, Droplaug NM, Olafur TM, Nicolas JW, Tiffany JM, Asgeir S, Gisli HH, Sigurjon AG, Pall M, Helga I, Snædis K, Kristjan. Kristjan FA, Anna H, Julius G, Thorunn R, Ingileif J, Hilma H, Gudmundur IE, Olof S, Isleifur O, Gisli M, Daniel FG, Unnur T, Bjarni VH, Simon NS, Kari S. 2018. Methylation and expression data for whole-genome human imprinting study. figshare.

Imbeault M, Helleboid P, Trono D. 2017. ChIP-exo of human KRAB-ZNFs transduced in HEK 293T cells and KAP1 in hES H1 cells. NCBI Gene Expression Omnibus. GSE78099

Takahashi N, Coluccio A, Thorball C. W, Planet E, Shi H, Offner S, Turelli P, Imbeault M, Ferguson-Smith A. C, Trono D. 2019. ZNF445 is a primary regulator of genomic imprinting. NCBI Gene Expression Omnibus. GSE115387

References

  1. Akbari V, Garant J-M, O’Neill K, Pandoh P, Moore R, Marra MA, Hirst M, Jones SJM. Megabase-scale methylation phasing using nanopore long reads and NanoMethPhase. Genome Biology. 2021;22:68. doi: 10.1186/s13059-021-02283-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Akbari V. NanoMethPhase. swh:1:rev:1657f7aed60604aa7c7f3e77d992d76bee6bf6d3Software Heritage. 2022 https://archive.softwareheritage.org/swh:1:dir:544b123669c23565347328349f18f04a7fe06c41;origin=https://github.com/vahidAK/NanoMethPhase;visit=swh:1:snp:1f3158491f82b8a966e5344319e883a3742bee01;anchor=swh:1:rev:1657f7aed60604aa7c7f3e77d992d76bee6bf6d3
  3. Amemiya HM, Kundaje A, Boyle AP. The ENCODE Blacklist: Identification of Problematic Regions of the Genome. Scientific Reports. 2019;9:9354. doi: 10.1038/s41598-019-45839-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR, 1000 Genomes Project Consortium A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Babak T, DeVeale B, Tsang EK, Zhou Y, Li X, Smith KS, Kukurba KR, Zhang R, Li JB, van der Kooy D, Montgomery SB, Fraser HB. Genetic conflict reflected in tissue-specific maps of genomic imprinting in human and mouse. Nature Genetics. 2015;47:544–549. doi: 10.1038/ng.3274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bak M, Boonen SE, Dahl C, Hahnemann JMD, Mackay DJDG, Tümer Z, Grønskov K, Temple IK, Guldberg P, Tommerup N. Genome-wide DNA methylation analysis of transient neonatal diabetes type 1 patients with mutations in ZFP57. BMC Medical Genetics. 2016;17:29. doi: 10.1186/s12881-016-0292-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baran Y, Subramaniam M, Biton A, Tukiainen T, Tsang EK, Rivas MA, Pirinen M, Gutierrez-Arcelus M, Smith KS, Kukurba KR, Zhang R, Eng C, Torgerson DG, Urbanek C, GTEx Consortium. Li JB, Rodriguez-Santana JR, Burchard EG, Seibold MA, MacArthur DG, Montgomery SB, Zaitlen NA, Lappalainen T. The landscape of genomic imprinting across diverse adult human tissues. Genome Research. 2015;25:927–936. doi: 10.1101/gr.192278.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bartolomei MS, Ferguson-Smith AC. Mammalian genomic imprinting. Cold Spring Harbor Perspectives in Biology. 2011;3:a002592. doi: 10.1101/cshperspect.a002592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA. The NIH Roadmap Epigenomics Mapping Consortium. Nature Biotechnology. 2010;28:1045–1048. doi: 10.1038/nbt1010-1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Brinkman AB, Gu H, Bartels SJJ, Zhang Y, Matarese F, Simmer F, Marks H, Bock C, Gnirke A, Meissner A, Stunnenberg HG. Sequential ChIP-bisulfite sequencing enables direct genome-scale investigation of chromatin and DNA methylation cross-talk. Genome Research. 2012;22:1128–1138. doi: 10.1101/gr.133728.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Camargo M, Rivera D, Moreno L, Lidral AC, Harper U, Jones M, Solomon BD, Roessler E, Vélez JI, Martinez AF, Chandrasekharappa SC, Arcos-Burgos M. GWAS reveals new recessive loci associated with non-syndromic facial clefting. European Journal of Medical Genetics. 2012;55:510–514. doi: 10.1016/j.ejmg.2012.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Candler T, Kessler NJ, Gunasekara CJ, Ward KA, James P, Laritsky E, Baker MS, Dyer R, Elango R, Jeffries D, Waterland RA, Moore SE, Ludgate M, Prentice AM, Silver MJ. DNA methylation at a nutritionally sensitive region of the PAX8 gene is associated with thyroid volume and function in Gambian children. Science Advances. 2021;7:eabj1561. doi: 10.1126/sciadv.abj1561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chen Z, Zhang Y. Maternal H3K27me3-dependent autosomal and X chromosome imprinting. Nature Reviews. Genetics. 2020;21:555–571. doi: 10.1038/s41576-020-0245-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cheong CY, Chng K, Ng S, Chew SB, Chan L, Ferguson-Smith AC. Germline and somatic imprinting in the nonhuman primate highlights species differences in oocyte methylation. Genome Research. 2015;25:611–623. doi: 10.1101/gr.183301.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chess A. Random and non-random monoallelic expression. Neuropsychopharmacology. 2013;38:55–61. doi: 10.1038/npp.2012.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chiesa N, De Crescenzo A, Mishra K, Perone L, Carella M, Palumbo O, Mussa A, Sparago A, Cerrato F, Russo S, Lapi E, Cubellis MV, Kanduri C, Cirillo Silengo M, Riccio A, Ferrero GB. The KCNQ1OT1 imprinting control region and non-coding RNA: new properties derived from the study of Beckwith-Wiedemann syndrome and Silver-Russell syndrome cases. Human Molecular Genetics. 2012;21:10–25. doi: 10.1093/hmg/ddr419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Court F, Tayama C, Romanelli V, Martin-Trujillo A, Iglesias-Platas I, Okamura K, Sugahara N, Simón C, Moore H, Harness JV, Keirstead H, Sanchez-Mut JV, Kaneki E, Lapunzina P, Soejima H, Wake N, Esteller M, Ogata T, Hata K, Nakabayashi K, Monk D. Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment. Genome Research. 2014;24:554–569. doi: 10.1101/gr.164913.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cuellar Partida G, Laurin C, Ring SM, Gaunt TR, McRae AF, Visscher PM, Montgomery GW, Martin NG, Hemani G, Suderman M, Relton CL, Davey Smith G, Evans DM. Genome-wide survey of parent-of-origin effects on DNA methylation identifies candidate imprinted loci in humans. Human Molecular Genetics. 2018;27:2927–2939. doi: 10.1093/hmg/ddy206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. da Rocha ST, Gendrel A-V. The influence of DNA methylation on monoallelic expression. Essays in Biochemistry. 2019;63:663–676. doi: 10.1042/EBC20190034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Dahlet T, Argüeso Lleida A, Al Adhami H, Dumas M, Bender A, Ngondo RP, Tanguy M, Vallet J, Auclair G, Bardet AF, Weber M. Genome-wide analysis in the mouse embryo reveals the importance of DNA methylation for transcription integrity. Nature Communications. 2020;11:3153. doi: 10.1038/s41467-020-16919-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. De Coster W, De Rijk P, De Roeck A, De Pooter T, D’Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. Genome Research. 2019;29:1178–1187. doi: 10.1101/gr.244939.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gao F, Niu Y, Sun YE, Lu H, Chen Y, Li S, Kang Y, Luo Y, Si C, Yu J, Li C, Sun N, Si W, Wang H, Ji W, Tan T. De novo DNA methylation during monkey pre-implantation embryogenesis. Cell Research. 2017;27:526–539. doi: 10.1038/cr.2017.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Geneimprint Geneimprint. 2021. [March 3, 2021]. http://www.geneimprint.com
  25. Gigante S, Gouil Q, Lucattini A, Keniry A, Beck T, Tinning M, Gordon L, Woodruff C, Speed TP, Blewitt ME, Ritchie ME. Using long-read sequencing to detect imprinted DNA methylation. Nucleic Acids Research. 2019;47:e46. doi: 10.1093/nar/gkz107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Goovaerts T, Steyaert S, Vandenbussche CA, Galle J, Thas O, Van Criekinge W, De Meyer T. A comprehensive overview of genomic imprinting in breast and its deregulation in cancer. Nature Communications. 2018;9:1–14. doi: 10.1038/s41467-018-06566-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hanna CW, Kelsey G. The specification of imprints in mammals. Heredity. 2014;113:176–183. doi: 10.1038/hdy.2014.54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hashimoto H, Wang D, Horton JR, Zhang X, Corces VG, Cheng X. Structural Basis for the Versatile and Methylation-Dependent Binding of CTCF to DNA. Molecular Cell. 2017;66:711–720. doi: 10.1016/j.molcel.2017.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hernandez Mora JR, Tayama C, Sánchez-Delgado M, Monteagudo-Sánchez A, Hata K, Ogata T, Medrano J, Poo-Llanillo ME, Simón C, Moran S, Esteller M, Tenorio J, Lapunzina P, Kagami M, Monk D, Nakabayashi K. Characterization of parent-of-origin methylation using the Illumina Infinium MethylationEPIC array platform. Epigenomics. 2018;10:941–954. doi: 10.2217/epi-2017-0172. [DOI] [PubMed] [Google Scholar]
  30. Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nature Genetics. 2013;45:1198–1206. doi: 10.1038/ng.2746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Imbeault M, Helleboid PY, Trono D. KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks. Nature. 2017;543:550–554. doi: 10.1038/nature21683. [DOI] [PubMed] [Google Scholar]
  32. Jadhav B, Monajemi R, Gagalova KK, Ho D, Draisma HHM, van de Wiel MA, Franke L, Heijmans BT, van Meurs J, Jansen R, GoNL Consortium. BIOS Consortium. ’t Hoen PAC, Sharp AJ, Kiełbasa SM. RNA-Seq in 296 phased trios provides a high-resolution map of genomic imprinting. BMC Biology. 2019;17:50. doi: 10.1186/s12915-019-0674-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, Tyson JR, Beggs AD, Dilthey AT, Fiddes IT, Malla S, Marriott H, Nieto T, O’Grady J, Olsen HE, Pedersen BS, Rhie A, Richardson H, Quinlan AR, Snutch TP, Tee L, Paten B, Phillippy AM, Simpson JT, Loman NJ, Loose M. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nature Biotechnology. 2018;36:338–345. doi: 10.1038/nbt.4060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Jelinic P, Shaw P. Loss of imprinting and cancer. The Journal of Pathology. 2007;211:261–268. doi: 10.1002/path.2116. [DOI] [PubMed] [Google Scholar]
  35. Jeong H, Mendizabal I, Berto S, Chatterjee P, Layman T, Usui N, Toriumi K, Douglas C, Singh D, Huh I, Preuss TM, Konopka G, Yi SV. Evolution of DNA methylation in the human brain. Nature Communications. 2021;12:2021. doi: 10.1038/s41467-021-21917-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. John RM, Lefebvre L. Developmental regulation of somatic imprints. Differentiation. 2011;81:270–280. doi: 10.1016/j.diff.2011.01.007. [DOI] [PubMed] [Google Scholar]
  37. Joshi RS, Garg P, Zaitlen N, Lappalainen T, Watson CT, Azam N, Ho D, Li X, Antonarakis SE, Brunner HG, Buiting K, Cheung SW, Coffee B, Eggermann T, Francis D, Geraedts JP, Gimelli G, Jacobson SG, Le Caignec C, de Leeuw N, Liehr T, Mackay DJ, Montgomery SB, Pagnamenta AT, Papenhausen P, Robinson DO, Ruivenkamp C, Schwartz C, Steiner B, Stevenson DA, Surti U, Wassink T, Sharp AJ. DNA Methylation Profiling of Uniparental Disomy Subjects Provides a Map of Parental Epigenetic Bias in the Human Genome. American Journal of Human Genetics. 2016;99:555–566. doi: 10.1016/j.ajhg.2016.06.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Jung YH, Sauria MEG, Lyu X, Cheema MS, Ausio J, Taylor J, Corces VG. Chromatin States in Mouse Sperm Correlate with Embryonic and Adult Regulatory Landscapes. Cell Reports. 2017;18:1366–1382. doi: 10.1016/j.celrep.2017.01.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kasowski M, Kyriazopoulou-Panagiotopoulou S, Grubert F, Zaugg JB, Kundaje A, Liu Y, Boyle AP, Zhang QC, Zakharia F, Spacek DV, Li J, Xie D, Olarerin-George A, Steinmetz LM, Hogenesch JB, Kellis M, Batzoglou S, Snyder M. Extensive variation in chromatin states across humans. Science (New York, N.Y.) 2013;342:750–752. doi: 10.1126/science.1242510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Research. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kent W J, Zweig AS, Barber G, Hinrichs AS, Karolchik D. BigWig and BigBed: enabling browsing of large distributed datasets. Bioinformatics (Oxford, England) 2010;26:2204–2207. doi: 10.1093/bioinformatics/btq351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Khamlichi AA, Feil R. Parallels between Mammalian Mechanisms of Monoallelic Gene Expression. Trends in Genetics. 2018;34:954–971. doi: 10.1016/j.tig.2018.08.005. [DOI] [PubMed] [Google Scholar]
  43. Kim S, Scheffler K, Halpern AL, Bekritsky MA, Noh E, Källberg M, Chen X, Kim Y, Beyter D, Krusche P, Saunders CT. Strelka2: fast and accurate calling of germline and somatic variants. Nature Methods. 2018;15:591–594. doi: 10.1038/s41592-018-0051-x. [DOI] [PubMed] [Google Scholar]
  44. Laurent L, Wong E, Li G, Huynh T, Tsirigos A, Ong CT, Low HM, Kin Sung KW, Rigoutsos I, Loring J, Wei CL. Dynamic changes in the human methylome during differentiation. Genome Research. 2010;20:320–331. doi: 10.1101/gr.101907.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (Oxford, England) 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics (Oxford, England) 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Maor-Sagie E, Cinnamon Y, Yaacov B, Shaag A, Goldsmidt H, Zenvirt S, Laufer N, Richler C, Frumkin A. Deleterious mutation in SYCE1 is associated with non-obstructive azoospermia. Journal of Assisted Reproduction and Genetics. 2015;32:887–891. doi: 10.1007/s10815-015-0445-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Matsubara K, Itoh M, Shimizu K, Saito S, Enomoto K, Nakabayashi K, Hata K, Kurosawa K, Ogata T, Fukami M, Kagami M. Exploring the unique function of imprinting control centers in the PWS/AS-responsible region: finding from array-based methylation analysis in cases with variously sized microdeletions. Clinical Epigenetics. 2019;11:36. doi: 10.1186/s13148-019-0633-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Maupetit-Méhouas S, Montibus B, Nury D, Tayama C, Wassef M, Kota SK, Fogli A, Cerqueira Campos F, Hata K, Feil R, Margueron R, Nakabayashi K, Court F, Arnaud P. Imprinting control regions (ICRs) are marked by mono-allelic bivalent chromatin when transcriptionally inactive. Nucleic Acids Research. 2016;44:621–635. doi: 10.1093/nar/gkv960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Moore LD, Le T, Fan G. DNA methylation and its basic function. Neuropsychopharmacology. 2013;38:23–38. doi: 10.1038/npp.2012.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Morison IM, Paton CJ, Cleverley SD. The imprinted gene and parent-of-origin effect database. Nucleic Acids Research. 2001;29:275–276. doi: 10.1093/nar/29.1.275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Nakabayashi K, Trujillo AM, Tayama C, Camprubi C, Yoshida W, Lapunzina P, Sanchez A, Soejima H, Aburatani H, Nagae G, Ogata T, Hata K, Monk D. Methylation screening of reciprocal genome-wide UPDs identifies novel human-specific imprinted genes. Human Molecular Genetics. 2011;20:3188–3197. doi: 10.1093/hmg/ddr224. [DOI] [PubMed] [Google Scholar]
  53. Okae H, Chiba H, Hiura H, Hamada H, Sato A, Utsunomiya T, Kikuchi H, Yoshida H, Tanaka A, Suyama M, Arima T. Genome-wide analysis of DNA methylation dynamics during early human development. PLOS Genetics. 2014;10:e1004868. doi: 10.1371/journal.pgen.1004868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Park Y, Wu H. Differential methylation analysis for BS-seq data under general experimental design. Bioinformatics (Oxford, England) 2016;32:1446–1453. doi: 10.1093/bioinformatics/btw026. [DOI] [PubMed] [Google Scholar]
  55. Puerta MF, Ruth C, Fang L, Robert P, Zhong D, R.-M. Y. A, B. M. S, L. P. M, Qiyi T, H.-F. L. CTCF Binding to the First Intron of the Major Immediate Early (MIE) Gene of Human Cytomegalovirus (HCMV. Negatively Regulates MIE Gene Expression and HCMV Replication. Journal of Virology. 2014;88:7389–7401. doi: 10.1128/JVI.00845-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Quenneville S, Verde G, Corsinotti A, Kapopoulou A, Jakobsson J, Offner S, Baglivo I, Pedone PV, Grimaldi G, Riccio A, Trono D. In embryonic stem cells, ZFP57/KAP1 recognize a methylated hexanucleotide to affect chromatin and DNA methylation of imprinting control regions. Molecular Cell. 2011;44:361–372. doi: 10.1016/j.molcel.2011.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Renda M, Baglivo I, Burgess-Beusse B, Esposito S, Fattorusso R, Felsenfeld G, Pedone PV. Critical DNA Binding Interactions of the Insulator Protein CTCF. Journal OF Biological Chemistry. 2007;282:33336–33345. doi: 10.1074/jbc.M706213200. [DOI] [PubMed] [Google Scholar]
  58. Saenz-de-Juano MD, Ivanova E, Billooye K, Herta AC, Smitz J, Kelsey G, Anckaert E. Genome-wide assessment of DNA methylation in mouse oocytes reveals effects associated with in vitro growth, superovulation, and sexual maturity. Clinical Epigenetics. 2019;11:197. doi: 10.1186/s13148-019-0794-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, Armstrong J, Tigyi K, Maurer N, Koren S, Sedlazeck FJ, Marschall T, Mayes S, Costa V, Zook JM, Liu KJ, Kilburn D, Sorensen M, Munson KM, Vollger MR, Monlong J, Garrison E, Eichler EE, Salama S, Haussler D, Green RE, Akeson M, Phillippy A, Miga KH, Carnevali P, Jain M, Paten B. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Nature Biotechnology. 2020;38:1044–1053. doi: 10.1038/s41587-020-0503-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Sharp AJ, Migliavacca E, Dupre Y, Stathaki E, Sailani MR, Baumer A, Schinzel A, Mackay DJ, Robinson DO, Cobellis G, Cobellis L, Brunner HG, Steiner B, Antonarakis SE. Methylation profiling in individuals with uniparental disomy identifies novel differentially methylated regions on chromosome 15. Genome Research. 2010;20:1271–1278. doi: 10.1101/gr.108597.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Simpson JT, Workman RE, Zuzarte PC, David M, Dursi LJ, Timp W. Detecting DNA cytosine methylation using nanopore sequencing. Nature Methods. 2017;14:407–410. doi: 10.1038/nmeth.4184. [DOI] [PubMed] [Google Scholar]
  62. Smith ZD, Meissner A. DNA methylation: roles in mammalian development. Nature Reviews. Genetics. 2013;14:204–220. doi: 10.1038/nrg3354. [DOI] [PubMed] [Google Scholar]
  63. Stunnenberg HG, International Human Epigenome Consortium. Hirst M. The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery. Cell. 2016;167:1145–1149. doi: 10.1016/j.cell.2016.11.007. [DOI] [PubMed] [Google Scholar]
  64. Takahashi N, Coluccio A, Thorball CW, Planet E, Shi H, Offner S, Turelli P, Imbeault M, Ferguson-Smith AC, Trono D. ZNF445 is a primary regulator of genomic imprinting. Genes & Development. 2019;33:49–54. doi: 10.1101/gad.320069.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Tomizawa S, Sasaki H. Genomic imprinting and its relevance to congenital disease, infertility, molar pregnancy and induced pluripotent stem cell. Journal of Human Genetics. 2012;57:84–91. doi: 10.1038/jhg.2011.151. [DOI] [PubMed] [Google Scholar]
  66. Tung J, Barreiro LB, Johnson ZP, Hansen KD, Michopoulos V, Toufexis D, Michelini K, Wilson ME, Gilad Y. Social environment is associated with gene regulatory variation in the rhesus macaque immune system. PNAS. 2012;109:6490–6495. doi: 10.1073/pnas.1202734109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Turan S, Bastepe M. The GNAS complex locus and human diseases associated with loss-of-function mutations or epimutations within this imprinted gene. Hormone Research in Paediatrics. 2013;80:229–241. doi: 10.1159/000355384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy 1.0 Contributors SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wagner EJ, Carpenter PB. Understanding the language of Lys36 methylation at histone H3. Nature Reviews. Molecular Cell Biology. 2012;13:115–126. doi: 10.1038/nrm3274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Xie W, Barr CL, Kim A, Yue F, Lee AY, Eubanks J, Dempster EL, Ren B. Base-resolution analyses of sequence and parent-of-origin dependent DNA methylation in the mouse genome. Cell. 2012;148:816–831. doi: 10.1016/j.cell.2011.12.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Yuan W, Xu M, Huang C, Liu N, Chen S, Zhu B. H3K36 methylation antagonizes PRC2-mediated H3K27 methylation. The Journal of Biological Chemistry. 2011;286:7983–7989. doi: 10.1074/jbc.M110.194027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Zhao H, Sun Z, Wang J, Huang H, Kocher JP, Wang L. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics (Oxford, England) 2014;30:1006–1007. doi: 10.1093/bioinformatics/btt730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Zink F, Magnusdottir DN, Magnusson OT, Walker NJ, Morris TJ, Sigurdsson A, Halldorsson GH, Gudjonsson SA, Melsted P, Ingimundardottir H, Kristmundsdottir S, Alexandersson KF, Helgadottir A, Gudmundsson J, Rafnar T, Jonsdottir I, Holm H, Eyjolfsson GI, Sigurdardottir O, Olafsson I, Masson G, Gudbjartsson DF, Thorsteinsdottir U, Halldorsson BV, Stacey SN, Stefansson K. Insights into imprinting from parent-of-origin phased methylomes and transcriptomes. Nature Genetics. 2018;50:1542–1552. doi: 10.1038/s41588-018-0232-7. [DOI] [PubMed] [Google Scholar]
  74. Zook JM, Catoe D, McDaniel J, Vang L, Spies N, Sidow A, Weng Z, Liu Y, Mason CE, Alexander N, Henaff E, McIntyre ABR, Chandramohan D, Chen F, Jaeger E, Moshrefi A, Pham K, Stedman W, Liang T, Saghbini M, Dzakula Z, Hastie A, Cao H, Deikus G, Schadt E, Sebra R, Bashir A, Truty RM, Chang CC, Gulbahce N, Zhao K, Ghosh S, Hyland F, Fu Y, Chaisson M, Xiao C, Trow J, Sherry ST, Zaranek AW, Ball M, Bobe J, Estep P, Church GM, Marks P, Kyriazopoulou-Panagiotopoulou S, Zheng GXY, Schnall-Levin M, Ordonez HS, Mudivarti PA, Giorda K, Sheng Y, Rypdal KB, Salit M. Extensive sequencing of seven human genomes to characterize benchmark reference materials. Scientific Data. 2016;3:160025. doi: 10.1038/sdata.2016.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Zook JM, McDaniel J, Olson ND, Wagner J, Parikh H, Heaton H, Irvine SA, Trigg L, Truty R, McLean CY, De La Vega FM, Xiao C, Sherry S, Salit M. An open resource for accurately benchmarking small variant and reference calls. Nature Biotechnology. 2019;37:561–566. doi: 10.1038/s41587-019-0074-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Jessica K Tyler 1

This work uses nanowire sequencing to detect genome-Wide imprinted differentially methylated regions. It will be of broad interest to DNA methylation researchers.

Decision letter

Editor: Jessica K Tyler1
Reviewed by: Gavin Kelsey2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Decision letter after peer review:

[Editors’ note: the authors submitted for reconsideration following the decision after peer review. What follows is the decision letter after the first round of review.]

Thank you for submitting the paper "Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and a Senior Editor, with a fair bit of internal discussion among all reviewing parties. The following individual involved in review of your submission has agreed to reveal their identity: Gavin Kelsey (Reviewer #1).

We appreciate that you have put a lot of effort into an important long-standing question and of specific interest here is the application of a newer technology, nanopore sequencing, to that problem. The reviewers agree that your study provides a new perspective with good quality data (exceptions where noted). However, while the report confirms the power of long-read sequencing technology, there is also a general sense that the current manuscript is lacking in biological novelty and the new DMRs would benefit from further investigation. One reviewer suggested examining the DMRs in relation to the possibility of polymorphic imprinting. Alternatively, if the technical advantage including costs / ease of use could be strengthened per comment of another reviewer. As the manuscript stands currently, neither of these two potential selling points are fully utilized. Thus, we cannot offer to publish your manuscript as it stands. We hope that you and your colleagues will benefit from the excellent comments of the reviewers appended to this letter for resubmission elsewhere, or if substantial progress can be made with new experiments, we could potentially reconsider your manuscript with the above comments in mind.

Reviewer #1:

The study builds upon a recent report from the authors on the application of long-read nanopore sequencing to the detection of parent-of-origin (PofO) allelic methylation in the human genome (Akbari et al. 2021), with the potential for identification of potentially novel imprinted genes. Nanopore sequencing has previously been shown to be effective in imprinted DMR identification in hybrid mouse crosses (e.g., Gigante et al. 2019). Long-read sequencing has the potential to provide for methylation calls at CpGs to be phased over extended distances with genetic haplotypes and PofO if parent-offspring trios or pedigrees are available. Here, the authors use as discovery datasets 12 LCLs from the 1K Genomes Project and other projects. Candidate PofO DMRs are then parsed through various filters, including methylation level in conventional short-read whole-genome bisulphite sequencing (WGBS) datasets from human gametes, preimplantation embryos as well as somatic tissues, in accordance with the predicted properties of germline DMRs and somatic (or secondary) DMRs. In addition, DMR predictions are compared with well-characterised germline or somatic DMRs (as a test of sensitivity), as well as predictions from a number of other studies that have deployed WGBS or array-based assays of various informative samples (Court et al. 2014, Joshi et al. 2016, Hernandez Mora et al. 2018; Zink et al. 2018). Finally, the authors seek conservation of DMR status in chimpanzee, rhesus macaque, and mouse. This experimental design and implementation look well thought-out and robust.

There have, of course, been numerous attempts to derive a comprehensive list of imprinted loci in the human genome over the last couple of decades. The recent large-scale WGoxBS study of the Icelandic population (Zink et al. 2018) also applied nanopore sequencing, but only in a limited capacity to validate DMRs identified from WGoxBS rather than as a discovery tool as in the current study. As well as demonstrating the power of nanopore sequencing, I think it is important that the current study shows something new, either in relation to undiscovered imprinted loci with potential importance in human development, physiology or disease, or new concepts in imprinting regulation. With some limited further analysis, this level of novelty could be achieved.

The authors report the detection of 76 allelic DMRs not overlapping with previously reported DMRs, of which 28 were determined to be high confidence novel DMRs based on methylation status in the WGBS datasets interrogated. They were able to assign 12 as gDMRs, all but one maternally methylated consistent with the predominance of oocyte methylation of gDMRs. One of the criteria for gDMR status is that the DMR should have "more than 40% methylation in oocyte and less than 20% in sperm and vice versa with methylation difference > 0.25 (lines 188-189)". A threshold of only 40% methylation in one gamete looks rather low for gDMR status. What does 40% methylation mean in molecular terms: that in a population of oocytes there are fully methylated and fully unmethylated alleles; or is there mosaic methylation across the gDMR? From inspection of the heatmap in Figure 3, it seems that very few novel 'gDMRs' have such low methylation in oocytes, so it would be valid to increase the minimum level of methylation required in oocytes to >70%, which would better meet with accepted norms within the imprinting community.

A key determinant of a gDMR is binding of ZFP57 and/or ZNF455 (e.g., Takahashi et al. Genes Dev. 2019 PMID: 30602440), which ensure methylation maintenance especially during methylation reprogramming in preimplantation embryos. The binding motif for ZFP57 at least is known, so the authors should be able to determine whether this motif is present within their candidate gDMRs.

What is striking from the screenshots of many of the novel DMRs (Figure 5 and Suppl. Figures S2, S4, S5, S12, S13, S14, S16) is that they appear to be 'polymorphic', i.e., PofO differential methylation is present in some but not all of the 12 LCLs. I think this needs to be explored in some greater depth. Polymorphic imprinting has been discussed almost since the discovery of imprinting (as in the WT1 locus), but still remains poorly characterised and understood. Recent examples include VTRNA2 (Zink et al. 2018, as well as Silver et al. Genome Biol. 2015 PMID: 26062908). Does the inter-individual variation reflect variation in sequences required for gametic methylation maintenance? For example, is there variation within sequence motifs for ZFP57 binding (if present within the DMR)? This information should be available to the authors. An alternative possibility is that it could reflect variation in establishment of the methylation state in one or other gamete; this cannot be directly addressed but might be a useful speculation to add.

The identification of extended regions of PofO methylation bias in some known imprinted domains (the authors observe this at seven loci) is important and interesting, and extends findings of Zink et al. on the chromosome 15 PWS/AS domain. The authors point to cases in which there is maternal methylation of a promoter-associated gDMR and paternal methylation bias over the expressed gene body. It is possible to offer mechanistic explanation for this, which the authors attempt, speculating that (lines 315-318): "the subtle parental methylation bias is used by cells to express important genes (genes which can regulate other genes in the cluster or have regulatory roles) in an imprinted cluster with higher fidelity through its gene body methylation on active allele." But the authors should consider other explanations for which precedents exist. Thus, the effect may relate to the presence of allelic histone modification bias over the gene body, given that some histone modifications will tend to occupy mutually exclusive locations to DNA methylation; e.g., the transcribed allele is likely to be enriched in H3K36me3 (which attracts de novo DNA methylation), while the non-expressed allele may be enriched in the mutually exclusive modification H3K27me3. For example, at the Kcnq1ot1 and Airn/Igf2r imprinted domains in mouse placental trophoblast, allelic H3K27me3 can span several Megabases (Schertzer et al. Mol Cell 2019 PMID: 31256989; Hanna et al. Genome Biol. 2019 PMID: 31665063; Andergassen et al. PLoS Genet. 2019 PMID: 31329595). There may be ENCODE ChIP-seq datasets that could be informative here. Alternatively, allelic histone modification and consequential DNA methylation bias may be curtailed in regions with sense:antisense transcription on the opposing alleles.

1. More analysis of the possibility of polymorphic imprinting of newly discovered gDMRs.

2. Analysis of ZFP57 binding site motifs in new DMRs: they would be expected to be present in gDMRs with robust maintenance.

3. Apply a more stringent threshold than >40% methylation in oocytes/sperm to call a gDMR.

4. Consideration in the Discussion of mechanisms that could account for extended PofO methylation bias along the lines suggested above.

5. A clear summary table/scheme on DMR status, including: germline or somatic; proximity to known imprinted cluster; consistency/variability of DMR status in the LCLs. The idiogram in Figure 2C has this information but not in a way that it is easy to extract the numbers.

6. The Discussion is on the long side – it would also benefit from some English language editing, particularly in the latter parts.

Other corrections:

Lines 39-40 (and the following sentences): "ICRs are classified as germline (or primary) or somatic (or secondary), hereinafter referred to as gDMR and sDMR." This definition is incorrect. ICRs (imprinting control regions) are by definition germline DMRs in that they are required and sufficient for imprinting of clusters of imprinted genes and sDMRs depend on the existence of the ICR which corresponds to a gDMR.

Lines 50-51: "Loss of imprinting is also widely observed in human cancers." This is not strictly true, as it has been shown that apparent imprinted disruption in tumours is more often a consequence of copy number variation rather than loss of imprinting per se, see Martin-Trujillo et al. 2017 PMID: 28883545.

Line 56: reference given numerically [14-16] rather than by author name.

Reviewer #2:

The study is comprehensive in making use of multiple publicly available datasets. It also provides new evolutionary insight into the conservation of DMRs, as it compares data between different mammalian species. As the study discriminates between germline as well as somatic imprints, it therefore provides useful data for the imprinting community to study questions between primary and secondary imprinting marks within imprinted gene clusters. Overall, the study is a comprehensive imprinting resource and Nanopore technical paper, even if it does not seem to provide major conceptual advances over previous studies. The study appears generally well performed and the conclusions are backed up by the data, except in the Discussion section (see below), which could be streamlined and shortened to focus on the main conclusions from this paper. Some improvements could be made in the presentation, when data are only provided in supplementary data table form and not presented in the figures as outlined below.

General issues:

1. The number of acronyms used in this article is very high. To make the article more accessible to a general readership audience, I would recommend reducing their use if possible, or alternatively providing a glossary.

2. The Discussion section seems overly long and speculative and could be substantially shortened, without impacting the overall message of the study.

Specific issues:

1. Figure 1a: "WGBS validation and byond" – should say beyond

2. Line 102-103: "All DMRs which overlapped with previously-reported DMRs displayed consistent PofO with those studies."

Where is this shown? If not, it would be good to show this somewhere.

3. Figure 3b: In order to more clearly identify sDMRs, which are either paternally or maternally methylated, it would be helpful to also add a panel comparing maternal with paternal LCLs.

4. Line 152: "The H3K4me3 histone mark is protective to DNA methylation." This sentence could be easily misinterpreted. It could be rephrased to "The H3K4me3 histone mark is protective against DNA methylation."

5. Section on H3K4me3 and DNA-methylation (line 151-167): A figure panel depicting the relationship between H3K4me3 and DNA-methylation would be helpful instead of only referring to the supplementary data tables.

6. Line 174-175: "Orthologs of the 77/107 detected DMRs showed significant partial methylated in at least one of the three mammals." should be rephrased to something like: "77 of the 107 detected orthologous DMRs showed significant partial methylation in at least one of the three mammals."

Reviewer #3:

Akbari, et al. demonstrate the utility of nanopore sequencing data for identification of imprinted genes and differentially methylated regions (DMRs) in lymphocyte cell line (LCL) genomes sequenced by the HPRC and others. Their analysis replicates a large number of imprinted genes and DMRs (96/172) identified by short-read sequencing of bisulfite-converted genomic DNA, but also identifies 76 of novel DMRs and genes in LCLs missed in prior studies. Conversely, a large number (283) of imprinted DMRs missed some of statistical cutoffs in the nanopore sequencing data, although >90% of well-characterized loci do meet these cutoffs. The authors then distinguish somatic from germline imprints found in published gametic and blastocyst DNA methylation datasets, and query corresponding non-human primate and rodent datasets to assess the evolutionary conservation of their novel imprints. Finally, they demonstrate concordant allelic representation of H3K4me3 and mRNA-seq reads of their imprinted genes.

Among the strengths of this study are the comparative analyses to imprinting catalogues from other datasets, and the integration of allelic chromatin marks and gene expression with their DNA methylation analysis.

One drawback of the study in its present form is that it does not address why a relatively large number of imprinted DMRs were not replicated across studies. Cursory analysis of supplementary tables suggests that many of the prior DMRs missed in the nanopore data do demonstrate expected parent-of-origin bias in DNA methylation, but were not picked up in multiple samples likely due to the chosen cutoffs (-/+ 25% methylation difference in at least 4 samples and p < 0.001). As there is no explanation for the chosen nanopore vs. oxBS-seq cutoffs, the reader is left with the impression that the concordance with prior studies is lower than it really is. Conversely, many of the novel DMRs fall near known imprinted DMRs. Among the possible explanations for the large fraction (44%) novel DMRs and missed known DMRs, the authors focus on technical differences, but only briefly mention that some DMRs are cell-type-specific, which may explain why DMRs identified in LCLs may not match DMRs identified in peripheral blood monocytes better, or miss arbitrary statistical cutoffs. One way to address this issue would be to use more sophisticated methods to combine p-values across studies to quantify concordance of imprinted DMRs without applying arbitrary "hard" cutoffs (that don't match between nanopore and oxBS-seq anyway). This is a relatively major point, because while truly novel imprinted DMRs that can only be identified by nanopore are relatively rare, the take-home message of this study could be that while there is very good concordance across methods and most novel (possibly cell-type specific) DMRs fall near known DMRs, there are additional benefits to DNA methylation profiling by nanopore-sequencing.

A second limitation is that the study attributes these differences to long vs. short read technologies without assessing concordance between nanopore and another long-read technology. Because many of the HPRC samples include Pacbio data, one possible way to address this would be to include imprinted DMR analysis in the corresponding Pacbio samples in HPRC.

Overall, the study clearly demonstrates that nanopore-sequencing performs well in profiling DNA methylation, and can identify novel yet conserved imprints, but doesn't highlight its strengths (long-range, single-molecule phased DNA methylation patterns) or its possible weaknesses (single-nucleotide resolution).

I think the authors should consider not applying hard cutoffs in their analysis and instead combine p-values or use rank-based metrics to assess concordance/novelty as suggested above. The concordance appears better than described by the authors, and in general, I feel the authors don't need to draw as many distinctions to the prior work to "sell" the reader on nanopore sequencing.

There are other benefits they could use to draw these points of distinction however, for example by looking at concordance of methylation patterns across single nanopore molecules, or including Pacbio reads and presenting a more informative cost/benefit analysis to guide readers in choosing DNA methylation profiling approaches (oxBS-seq, nanopore, Pacbio). See: "a cheap and easy way to call ICRs".

Apart from the important cutoff-agnostic analysis, and possibly including Pacbio samples, the manuscript could be improved mainly in tone and attention to detail. I noticed some sentence structure errors, but more importantly felt some sentences served to unnecessarily "sell" this study, either by sounding too critical with prior studies or or providing little support, e.g. "cheap and easy" above, or "individuals more representative of the human population" (with only 12 samples). The authors could also re-assess the first sentence of their discussion, and should maybe cite: https://pubmed.ncbi.nlm.nih.gov/33230324/

I think these could be improved without taking anything away from what is an interesting and informative study in its own right. eLife would be a good fit for this study, which could be accepted with relatively minor revisions and only limited added analysis.

[Editors’ note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing" for further consideration by eLife. Your revised article has been evaluated by Jessica Tyler (Senior Editor) and a Reviewing Editor.

The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:

Everyone felt that the manuscript has been significantly improved. However, the following issues remain:

1. The processed annotated tracks must be deposited in a data repository and released at the time of publication.

2. Comment on the high incidence of polymorphic imprinting in the discussion.

3. Test for sequence variation in ZFP57 binding sites in polymorphically methylated gDMRs if sequence information is available for the LCLs and other informative samples for which DNA methylation data exist.

4. More rigor is required in the analysis of allelic enrichment of H3K36me3 and H3K27me3 at domains of extended PofO DNA methylation bias compared with those imprinted regions that do not, and compute the allelic read scores and provide an appropriate summary plot.

More details on these points are given below, as well as minor points that need to be addressed also.

Reviewer #2 (Recommendations for the authors):

The authors addressed some (not all) of the concerns raised in the first round of review, often differently than suggested (which is acceptable).

The data availability however, is not acceptable in my view; given that the bioinformatic approach to DNA methylation phasing is computationally sophisticated and was published elsewhere already (Akbari, et al., 2021), it should be expected that the processed nanometh tracks (shown in Figure 3c, 6-8 and the extensive supplement) are made available via a data repository (e.g. GEO or Mendeley data). Inclusion of processed data tracks is obligatory for GEO submissions, and recreating these data tracks from the raw data places an undue burden on labs wishing to query these data for their loci of interest. These data tracks should include all chromosomes, including sex chromosomes, to enable others to re-run statistical analyses with their own (p-value and DNA methyation change) cutoffs.

Reviewer #3 (Recommendations for the authors):

In my review of the original manuscript, I suggested that it was important that the study showed something new, either in relation to undiscovered imprinted loci with potential importance in human development, physiology or disease, or new concepts in imprinting regulation. Specifically, I recommended further analysis of:

– polymorphic imprinting

– analysis of ZFP57/ZNF445 binding sites within candidate gDMRs

– stricter threshold for methylation levels in gametes to call candidate gDMRs

and additional improvements including:

– more focussed discussion

– clear summary table

In general, these points have been addressed well in the revised manuscript.

"Novel Imprinted DMRs Display Inter-Individual Variation"

I think this is an important addition to the manuscript that demonstrates the majority of the novel DMRs identified in the current study could represent 'polymorphic' imprinting and they are not consistently methylated in blood samples from 87 individuals, in comparison to 'well-characterised' DMRs. Some of the novel DMRs exhibited partial methylation consistent with imprinted status in as few as 1-2% of individuals, which could explain why they had not been detected in previous studies. The relevant analysis is performed well.

On the other hand, it is an omission that the authors do not comment on this high incidence of polymorphic imprinting in the discussion. The authors do need to return to this finding in their discussion. Although at this point, they are not able to provide much speculation for why these loci exhibit inter-individual variation, there are multiple implications of the finding.

Regarding the opening line of this section, "Imprinted methylation can display variation across individuals due to environmental and genetic factors", I think this statement could be modified, as generally speaking imprinting (i.e., well-characterised imprints) is consistent between individuals and resistant to environmental factors (with procedures associated with assisted reproduction techniques possibly the most likely 'environment' to lead to instability of methylation at gDMRs). Therefore, rephrase to something like: "Although imprinted methylation is generally regarded is consistent between individuals and resistant to environmental factors, there are examples of polymorphic imprinting……."

"Determination of Germline versus Somatic Status of Novel Imprinted DMRs"

This section now includes the analysis of ZFP57/ZNF445 binding data – published ChIP-seq datasets from hESCs and HEK293 cells. Interestingly, 44% of the novel gDMRs have binding for one or both of these proteins in these cells, compared with 49% of characterised gDMRs. A difference in binding by these factors does not therefore appear to relate to the high incidence of polymorphic imprinting amongst novel DMRs, but the possibility that there is sequence variation in the ZFP57 binding sites between the sequenced samples has not been addressed.

It is also welcomed that the authors apply the more stringent thresholds for gDMR classification in this section.

"Enriched Allelic H3K36me3 and H3K27me3 Histone Marks at Contiguous Blocks"

It is interesting to see the differential allelic enrichment for these two histone modifications across the seven extended domains that display biased parent-of-origin DNA methylation. The authors draw a distinction with other imprinted domains without this extended PofO bias. This conclusion is based entirely on inspection of screenshots of ChIP-seq data, examples of which are provided in Figure 8 and Supplementary Figures 14-26. I think the authors could be a little more rigorous and compute the allelic read scores and provide an appropriate summary plot.

1. It is an omission that the authors do not comment on this high incidence of polymorphic imprinting in the discussion. The authors do need to return to this finding in their discussion.

2. Test for sequence variation in ZFP57 binding sites in polymorphically methylated gDMRs if sequence information is available for the LCLs and other informative samples for which DNA methylation data exist.

3. The authors should be more rigorous in their analysis of allelic enrichment of H3K36me3 and H3K27me3 at domains of extended PofO DNA methylation bias compared with those imprinted regions that do not, and compute the allelic read scores and provide an appropriate summary plot.

4. Rephrase statement: "Imprinted methylation can display variation across individuals due to environmental and genetic factors".

eLife. 2022 Jul 5;11:e77898. doi: 10.7554/eLife.77898.sa2

Author response


[Editors’ note: the authors resubmitted a revised version of the paper for consideration. What follows is the authors’ response to the first round of review.]

Thank you for considering our manuscript entitled "Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing". We greatly appreciate the reviewers’ constructive comments and positive evaluation of the presented work. We performed a new set of analyses to address the comments regarding our work. We believe the new experiments address the concerns raised by the reviewers and we would like to resubmit our manuscript for publication within eLife journal. We summarize the main features of our revision which addresses all of the reviewers’ concerns:

  • We set a more relaxed threshold to detect differentially methylated regions (DMRs) within the nanopore data.

  • We added more WGBS tissue and blood samples to further improve the confirmation of the detected DMRs.

  • We have now included WGBS data from 87 individuals and examined the evidence for polymorphic imprinting.

  • For the detected germline DMRs we set a more stringent threshold of 70%.

  • H3K36me3 and H3K27me3 ChIP-seq data are now included to examine allelic status of these histone marks in the detected imprinted blocks.

  • We used ChIP-seq data for ZFP57 and ZNF445 to determine if the germline DMRs are protected using these proteins during the second reprogramming step of human development.

  • We also edited the manuscript and addressed requested changes and modifications in the figures and the manuscript body by the reviewers.

Reviewer #1:

The study builds upon a recent report from the authors on the application of long-read nanopore sequencing to the detection of parent-of-origin (PofO) allelic methylation in the human genome (Akbari et al. 2021), with the potential for identification of potentially novel imprinted genes. Nanopore sequencing has previously been shown to be effective in imprinted DMR identification in hybrid mouse crosses (e.g., Gigante et al. 2019). Long-read sequencing has the potential to provide for methylation calls at CpGs to be phased over extended distances with genetic haplotypes and PofO if parent-offspring trios or pedigrees are available. Here, the authors use as discovery datasets 12 LCLs from the 1K Genomes Project and other projects. Candidate PofO DMRs are then parsed through various filters, including methylation level in conventional short-read whole-genome bisulphite sequencing (WGBS) datasets from human gametes, preimplantation embryos as well as somatic tissues, in accordance with the predicted properties of germline DMRs and somatic (or secondary) DMRs. In addition, DMR predictions are compared with well-characterised germline or somatic DMRs (as a test of sensitivity), as well as predictions from a number of other studies that have deployed WGBS or array-based assays of various informative samples (Court et al. 2014, Joshi et al. 2016, Hernandez Mora et al. 2018; Zink et al. 2018). Finally, the authors seek conservation of DMR status in chimpanzee, rhesus macaque, and mouse. This experimental design and implementation look well thought-out and robust.

There have, of course, been numerous attempts to derive a comprehensive list of imprinted loci in the human genome over the last couple of decades. The recent large-scale WGoxBS study of the Icelandic population (Zink et al. 2018) also applied nanopore sequencing, but only in a limited capacity to validate DMRs identified from WGoxBS rather than as a discovery tool as in the current study. As well as demonstrating the power of nanopore sequencing, I think it is important that the current study shows something new, either in relation to undiscovered imprinted loci with potential importance in human development, physiology or disease, or new concepts in imprinting regulation. With some limited further analysis, this level of novelty could be achieved.

The authors report the detection of 76 allelic DMRs not overlapping with previously reported DMRs, of which 28 were determined to be high confidence novel DMRs based on methylation status in the WGBS datasets interrogated. They were able to assign 12 as gDMRs, all but one maternally methylated consistent with the predominance of oocyte methylation of gDMRs. One of the criteria for gDMR status is that the DMR should have "more than 40% methylation in oocyte and less than 20% in sperm and vice versa with methylation difference > 0.25 (lines 188-189)". A threshold of only 40% methylation in one gamete looks rather low for gDMR status. What does 40% methylation mean in molecular terms: that in a population of oocytes there are fully methylated and fully unmethylated alleles; or is there mosaic methylation across the gDMR? From inspection of the heatmap in Figure 3, it seems that very few novel 'gDMRs' have such low methylation in oocytes, so it would be valid to increase the minimum level of methylation required in oocytes to >70%, which would better meet with accepted norms within the imprinting community.

A key determinant of a gDMR is binding of ZFP57 and/or ZNF455 (e.g., Takahashi et al. Genes Dev. 2019 PMID: 30602440), which ensure methylation maintenance especially during methylation reprogramming in preimplantation embryos. The binding motif for ZFP57 at least is known, so the authors should be able to determine whether this motif is present within their candidate gDMRs.

What is striking from the screenshots of many of the novel DMRs (Figure 5 and Suppl. Figures S2, S4, S5, S12, S13, S14, S16) is that they appear to be 'polymorphic', i.e., PofO differential methylation is present in some but not all of the 12 LCLs. I think this needs to be explored in some greater depth. Polymorphic imprinting has been discussed almost since the discovery of imprinting (as in the WT1 locus), but still remains poorly characterised and understood. Recent examples include VTRNA2 (Zink et al. 2018, as well as Silver et al. Genome Biol. 2015 PMID: 26062908). Does the inter-individual variation reflect variation in sequences required for gametic methylation maintenance? For example, is there variation within sequence motifs for ZFP57 binding (if present within the DMR)? This information should be available to the authors. An alternative possibility is that it could reflect variation in establishment of the methylation state in one or other gamete; this cannot be directly addressed but might be a useful speculation to add.

The identification of extended regions of PofO methylation bias in some known imprinted domains (the authors observe this at seven loci) is important and interesting, and extends findings of Zink et al. on the chromosome 15 PWS/AS domain. The authors point to cases in which there is maternal methylation of a promoter-associated gDMR and paternal methylation bias over the expressed gene body. It is possible to offer mechanistic explanation for this, which the authors attempt, speculating that (lines 315-318): "the subtle parental methylation bias is used by cells to express important genes (genes which can regulate other genes in the cluster or have regulatory roles) in an imprinted cluster with higher fidelity through its gene body methylation on active allele." But the authors should consider other explanations for which precedents exist. Thus, the effect may relate to the presence of allelic histone modification bias over the gene body, given that some histone modifications will tend to occupy mutually exclusive locations to DNA methylation; e.g., the transcribed allele is likely to be enriched in H3K36me3 (which attracts de novo DNA methylation), while the non-expressed allele may be enriched in the mutually exclusive modification H3K27me3. For example, at the Kcnq1ot1 and Airn/Igf2r imprinted domains in mouse placental trophoblast, allelic H3K27me3 can span several Megabases (Schertzer et al. Mol Cell 2019 PMID: 31256989; Hanna et al. Genome Biol. 2019 PMID: 31665063; Andergassen et al. PLoS Genet. 2019 PMID: 31329595). There may be ENCODE ChIP-seq datasets that could be informative here. Alternatively, allelic histone modification and consequential DNA methylation bias may be curtailed in regions with sense:antisense transcription on the opposing alleles.

1. More analysis of the possibility of polymorphic imprinting of newly discovered gDMRs.

We have addressed this using 119 blood samples from 87 individuals. The detailed analysis and results are in the new section of the manuscript entitled “Novel Imprinted DMRs Display Inter-Individual Variation”. Relying on the fact that imprinted regions display partial methylation of around 50%, we calculated the proportion of individuals displaying partial methylation. We accurately confirmed DMRs that have already been reported to be polymorphic or not. For example, VTRNA2-1, IGF2, RB1, PARD6G, CHRNE, and IGF2R are known to be polymorphic 1,2. We have detected partial methylation in 2-65% of the individuals in our analysis for these regions (M ± SD = 40% ± 22%; Supplementary Table 5) indication of their interindividual variability. ZNF331 DMR is known to be consistently imprinted across individuals 2 and we also detected partial methylation in 99% of the individuals at this region (Supplementary Table 5).

We performed the same analysis on the novel DMRs. Novel DMRs demonstrated interindividual variation ranging from 1-73% (M ± SD=23.6% ± 19.2%; Table 1). The novel maternal sDMR near BTBD7P1 is the most consistent with partial methylation in 73% of the individuals. On the other hand, the novel paternal somatic DMRs within AC092296.3 and UBAC2 are the most variable with partial methylation in 1% of the individuals (Table 1). However, because of the unavailability of the genomic data from the 87 individuals and the small sample size of 12 cell lines we cannot establish an association of the polymorphic methylation and possible underlying genetic factors.

2. Analysis of ZFP57 binding site motifs in new DMRs: they would be expected to be present in gDMRs with robust maintenance.

We used ChIP-seq data for ZFP57 and ZNF445. 44% of the novel germline DMRs and 49% of the reported DMRs overlapped with the ChIP-seq peak calls for ZFP57 and/or ZNF445 (Figure 3C; Supplementary Table 4). The results have been included in the “Determination of Germline versus Somatic Status of Novel Imprinted DMRs” section of the manuscript.

3. Apply a more stringent threshold than >40% methylation in oocytes/sperm to call a gDMR.

More stringent threshold (>70%) was used to determine germline DMRs. Using this threshold, two of the novel DMRs including NFE2L3P1 and ZNF714 that previously detected as germline now tagged as somatic. The results of this analysis are provided in the “Determination of Germline versus Somatic Status of Novel Imprinted DMRs” section of the manuscript.

4. Consideration in the Discussion of mechanisms that could account for extended PofO methylation bias along the lines suggested above.

To address this, we used H3K36me3 and H3K27me3 ChIP-seq data. At the large imprinted blocks, we observed allelic H3K36me3 marking on the hypermethylated and expressed allele and H3K27me3 on the hypomethylated and silenced allele. We did not observe such allelic histone marks on other imprinted clusters without large blocks of PofO methylation bias. These results suggest high expression of active allele and allelic deposition of histone marks at these blocks are involve in the regulation of subtle PofO methylation biases. The detailed results of these analyses are included in the “Enriched Allelic H3K36me3 and H3K27me3 Histone Marks at Contiguous Blocks” section of the manuscript.

5. A clear summary table/scheme on DMR status, including: germline or somatic; proximity to known imprinted cluster; consistency/variability of DMR status in the LCLs. The idiogram in Figure 2C has this information but not in a way that it is easy to extract the numbers.

We added a new table (Table 1) which includes a summary information of the detected DMRs.

6. The Discussion is on the long side – it would also benefit from some English language editing, particularly in the latter parts.

The manuscript has been edited to improve clarity and correctness of language.

Reviewer #2:

The study is comprehensive in making use of multiple publicly available datasets. It also provides new evolutionary insight into the conservation of DMRs, as it compares data between different mammalian species. As the study discriminates between germline as well as somatic imprints, it therefore provides useful data for the imprinting community to study questions between primary and secondary imprinting marks within imprinted gene clusters. Overall, the study is a comprehensive imprinting resource and Nanopore technical paper, even if it does not seem to provide major conceptual advances over previous studies. The study appears generally well performed and the conclusions are backed up by the data, except in the Discussion section (see below), which could be streamlined and shortened to focus on the main conclusions from this paper. Some improvements could be made in the presentation, when data are only provided in supplementary data table form and not presented in the figures as outlined below.

General issues:

1. The number of acronyms used in this article is very high. To make the article more accessible to a general readership audience, I would recommend reducing their use if possible, or alternatively providing a glossary.

We provided a glossary at the end of the manuscript and reduced the frequency of acronyms in the main text as well.

“Glossary

LCLs: Lymphoblastoid cell lines (LCLs) are immortalized cell lines derived from the Blymphocytes in the peripheral blood.

WGBS: Whole-genome bisulfite sequencing (WGBS) is an approach for detection of DNA methylation. In this strategy, DNA is treated with bisulfite to convert cytosine to uracil, with 5mC and 5hmC remaining intact. Sequencing and mapping of this converted DNA allows the detection of 5mC.

DMA and DMR: Differential methylation analysis (DMA) is the process of comparing two samples or, in case of imprinting, two alleles to detect differentially methylated regions (DMRs). DMRs are regions that show methylation difference either between two samples or, in case of imprinting, between the paternal and maternal alleles.

PofO: Parent-of-origin (PofO) refers to the maternal or paternal origin of an allele.

ASE: Allele-specific expression (ASE) refers to the preferential expression of one allele, either maternal or paternal allele.

ICR: Imprinting control region (ICR) is region of usually a few kilobases that determines allelespecific expression of the imprinted genes based upon the parent-of-origin. ICR is usually controlled through DNA methylation and is a DMR in which only paternal or maternal allele is methylated. ICR is a cis-acting regulatory region which means it regulate the expression of gene(s) only on the same chromosomal allele.

UPD: Uniparental disomies (UPD). During normal human development, each individual inherit one paternal and one maternal copy of each chromosome. In some abnormalities, both copy of a chromosome, or of part of a chromosome, are coming from one parent, it is called UPD.

CGI: CpG islands (CGIs) are stretch of DNA, usually 500–1500 bp long, with more than 50% GC content and a CG: GC ratio of more than 0.6.”

2. The Discussion section seems overly long and speculative and could be substantially shortened, without impacting the overall message of the study.

Discussion section has been modified accordingly.

Specific issues:

1. Figure 1a: "WGBS validation and byond" – should say beyond

Figure has been corrected.

2. Line 102-103: "All DMRs which overlapped with previously-reported DMRs displayed consistent PofO with those studies."

Where is this shown? If not, it would be good to show this somewhere.

This information is available in the supplementary table 3. In the manuscript, we have also added the text to refer readers to the supplementary table 3.

“101 overlapped with 103 previously reported DMRs with consistent PofO, while the remaining 99 were novel (Figure 1C; Supplementary Table 3)”

3. Figure 3b: In order to more clearly identify sDMRs, which are either paternally or maternally methylated, it would be helpful to also add a panel comparing maternal with paternal LCLs.

The panel is added to the figure 3b with the maternal methylation of LCLs on the X axis and the paternal methylation on the Y axis.

4. Line 152: "The H3K4me3 histone mark is protective to DNA methylation." This sentence could be easily misinterpreted. It could be rephrased to "The H3K4me3 histone mark is protective against DNA methylation."

The sentence has been modified as suggested by the reviewer.

5. Section on H3K4me3 and DNA-methylation (line 151-167): A figure panel depicting the relationship between H3K4me3 and DNA-methylation would be helpful instead of only referring to the supplementary data tables.

A new figure (Figure 4) has been added to the manuscript to represent the results from the allelic H3K4me3 analysis at each heterozygous single nucleotide variant.

6. Line 174-175: "Orthologs of the 77/107 detected DMRs showed significant partial methylated in at least one of the three mammals." should be rephrased to something like: "77 of the 107 detected orthologous DMRs showed significant partial methylation in at least one of the three mammals."

The manuscript has been edited as suggested by the reviewer.

Reviewer #3:

Akbari, et al. demonstrate the utility of nanopore sequencing data for identification of imprinted genes and differentially methylated regions (DMRs) in lymphocyte cell line (LCL) genomes sequenced by the HPRC and others. Their analysis replicates a large number of imprinted genes and DMRs (96/172) identified by short-read sequencing of bisulfite-converted genomic DNA, but also identifies 76 of novel DMRs and genes in LCLs missed in prior studies. Conversely, a large number (283) of imprinted DMRs missed some of statistical cutoffs in the nanopore sequencing data, although >90% of well-characterized loci do meet these cutoffs. The authors then distinguish somatic from germline imprints found in published gametic and blastocyst DNA methylation datasets, and query corresponding non-human primate and rodent datasets to assess the evolutionary conservation of their novel imprints. Finally, they demonstrate concordant allelic representation of H3K4me3 and mRNA-seq reads of their imprinted genes.

Among the strengths of this study are the comparative analyses to imprinting catalogues from other datasets, and the integration of allelic chromatin marks and gene expression with their DNA methylation analysis.

One drawback of the study in its present form is that it does not address why a relatively large number of imprinted DMRs were not replicated across studies. Cursory analysis of supplementary tables suggests that many of the prior DMRs missed in the nanopore data do demonstrate expected parent-of-origin bias in DNA methylation, but were not picked up in multiple samples likely due to the chosen cutoffs (-/+ 25% methylation difference in at least 4 samples and p < 0.001). As there is no explanation for the chosen nanopore vs. oxBS-seq cutoffs, the reader is left with the impression that the concordance with prior studies is lower than it really is. Conversely, many of the novel DMRs fall near known imprinted DMRs. Among the possible explanations for the large fraction (44%) novel DMRs and missed known DMRs, the authors focus on technical differences, but only briefly mention that some DMRs are cell-type-specific, which may explain why DMRs identified in LCLs may not match DMRs identified in peripheral blood monocytes better, or miss arbitrary statistical cutoffs. One way to address this issue would be to use more sophisticated methods to combine p-values across studies to quantify concordance of imprinted DMRs without applying arbitrary "hard" cutoffs (that don't match between nanopore and oxBS-seq anyway). This is a relatively major point, because while truly novel imprinted DMRs that can only be identified by nanopore are relatively rare, the take-home message of this study could be that while there is very good concordance across methods and most novel (possibly cell-type specific) DMRs fall near known DMRs, there are additional benefits to DNA methylation profiling by nanopore-sequencing.

A second limitation is that the study attributes these differences to long vs. short read technologies without assessing concordance between nanopore and another long-read technology. Because many of the HPRC samples include Pacbio data, one possible way to address this would be to include imprinted DMR analysis in the corresponding Pacbio samples in HPRC.

Overall, the study clearly demonstrates that nanopore-sequencing performs well in profiling DNA methylation, and can identify novel yet conserved imprints, but doesn't highlight its strengths (long-range, single-molecule phased DNA methylation patterns) or its possible weaknesses (single-nucleotide resolution).

I think the authors should consider not applying hard cutoffs in their analysis and instead combine p-values or use rank-based metrics to assess concordance/novelty as suggested above. The concordance appears better than described by the authors, and in general, I feel the authors don't need to draw as many distinctions to the prior work to "sell" the reader on nanopore sequencing.

There are other benefits they could use to draw these points of distinction however, for example by looking at concordance of methylation patterns across single nanopore molecules, or including Pacbio reads and presenting a more informative cost/benefit analysis to guide readers in choosing DNA methylation profiling approaches (oxBS-seq, nanopore, Pacbio). See: "a cheap and easy way to call ICRs".

Apart from the important cutoff-agnostic analysis, and possibly including Pacbio samples, the manuscript could be improved mainly in tone and attention to detail. I noticed some sentence structure errors, but more importantly felt some sentences served to unnecessarily "sell" this study, either by sounding too critical with prior studies or or providing little support, e.g. "cheap and easy" above, or "individuals more representative of the human population" (with only 12 samples). The authors could also re-assess the first sentence of their discussion, and should maybe cite: https://pubmed.ncbi.nlm.nih.gov/33230324/

We have used more relaxed cut-off for the detection of the DMRs (0.20 as methylation difference cut-off instead of 0.25). This resulted in including 27 more DMRs of which 4 mapped to reported intervals. Because imprinted methylation is variable across individuals and also tissues, DMRs which reported by multiple studies are more likely to be consistent across individuals while other DMRs are likely to be more variable. We have included 119 WGBS datasets from 87 individuals and assessed polymorphic imprinting. As demonstrated in figure 1C, considerable number of imprinted DMRs detected in different studies are not overlapping between studies. Our analysis suggest polymorphic imprinting can explain this non-overlapping DMRs. The DMRs that detected by at least two studies demonstrated more consistency across individuals with average DMR frequency of 41.2% (SD = 33%) while DMRs detected in a single study showed more variability (M ± SD = 10.6% ± 15.4%) (Supplementary Table 5).

In the white paper published by PacBio technology, a 250x coverage per-strand is needed for PacBio to detect methylation reliably 3. Therefore, in a mammalian size genome this technology is prohibitively expensive. Moreover, in the current study our goal is to detect imprinted methylation genome-wide using nanopore long-reads that we and others previously showed could detect allelic methylation at even shallow coverage 4,5. We do not necessarily want to promote nanopore sequencing but our results indicate that nanopore sequencing represents a modality suitable for this task. We have tried to present a comparison with previous studies that we think can be useful for the readers and the community. However, we have modified sentences that might imply we are trying to overly promote nanopore sequencing.

Regarding the “randomly selected regions” We have modified the analysis and also included 100 random resampling. This has improved the confirmation of DMRs and the accuracy of the approach. The detailed analysis is provided in the “Confirmation of Novel Imprinted DMRs” section in the manuscript.

Regarding the "cheap and easy", Nanopore can phase significantly higher number of CpGs in a single sample compare to short-read. Therefore, in cases where we need large cohort of samples to achieve higher resolution of phased CpGs using short-reads, by using nanopore we can achieve this with mush smaller cohort size. By "cheap and easy" we meant that it will help to reduce the cost by reducing the sample size. However, we have modified the respective sentences in the last paragraph of the Discussion section to not be misleading.

“We also showed that nanopore sequencing has the ability to achieve a higher resolution of phased CpGs using a small sample size and allows for the calling of imprinted methylation in a single sample, potentially reducing the cost by reducing the sample size.”

I think these could be improved without taking anything away from what is an interesting and informative study in its own right. eLife would be a good fit for this study, which could be accepted with relatively minor revisions and only limited added analysis.

We thank the reviewer for this endorsement.

References

Joshi, R. S. et al. DNA Methylation Profiling of Uniparental Disomy Subjects Provides a Map of Parental Epigenetic Bias in the Human Genome. Am. J. Hum. Genet. 99, 555–566 (2016).

Zink, F. et al. Insights into imprinting from parent-of-origin phased methylomes and transcriptomes. Nat. Genet. 50, 1542–1552 (2018).

Biosciences, P. Detecting DNA base modifications using single molecule, real-time sequencing. White Pap. Base Modif. (2015) doi:https://www.pacb.com/wpcontent/uploads/2015/09/WP_Detecting_DNA_Base_Modifications_Using_SMRT_Seque ncing.pdf.

Gigante, S. et al. Using long-read sequencing to detect imprinted DNA methylation. Nucleic Acids Res. 47, e46–e46 (2019).

Akbari, V. et al. Megabase-scale methylation phasing using nanopore long reads and NanoMethPhase. Genome Biol. 22, 68 (2021).

[Editors’ note: what follows is the authors’ response to the second round of review.]

The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:

Everyone felt that the manuscript has been significantly improved. However, the following issues remain:

1. The processed annotated tracks must be deposited in a data repository and released at the time of publication.

We have deposited methylation and histone modification tracks to Mendeley data repository (https://data.mendeley.com/datasets/f4k2gytbh5/1; doi: 10.17632/f4k2gytbh5.1). We have also provided the “Data availability” section:

“Data availability

In this study we have used various nanopore sequencing, WGBS, and ChIP-seq datasets. The source of each dataset is provided under the “Materials and methods” section under the appropriate subsection. We also used PofO allele-specific expression data from Zink et al. (2018; https://doi.org/10.6084/m9.figshare.6816917) to generate ASE track for IGV. Phased DNA methylation tracks for the 12 human LCL samples and other tracks including ASE, histone modifications and WGBS methylation tracks generated in this study are publicly available through the Mendeley data repository (https://data.mendeley.com/datasets/f4k2gytbh5/1; DOI: 10.17632/f4k2gytbh5.1). Codes used for partial methylation analysis of DMRs in WGBS datasets (PartialMethylation_AtDMR.sh) and binomial test analysis of histone modifications

(CountReadsAtSNV.py and Binomial_test.py) are available on GitHub

(https://github.com/vahidAK/NanoMethPhase/tree/master/scripts).”

2. Comment on the high incidence of polymorphic imprinting in the discussion.

We have revisited these results and also the new results from ZFP57 motif analysis (See the next comment/point) and commented in the first paragraph of the Discussion section:

“DMRs that are detected in only a single study displayed higher variations across individuals compared to those detected by at least two studies. Therefore, lack of phasing at some novel DMRs in previous studies and higher variation in imprinted methylation at novel DMRs could explain the reason they were not detected previously. We also demonstrated that germline DMRs with a greater number of ZFP57 motif tend to be more consistently imprinted across individuals suggesting motifs redundancy increases ZFP57 recruitment and tolerance to any DNA sequence variation. However, due to the availability of DNA sequence in a limited number of samples, we were not able to examine sequence variation at the DMRs and the ZFP57 binding motifs for any possible association with polymorphic imprinted methylation which will require further study.”

3. Test for sequence variation in ZFP57 binding sites in polymorphically methylated gDMRs if sequence information is available for the LCLs and other informative samples for which DNA methylation data exist.

We analysed DMRs for presence of the ZFP57 binding motif (TGCCGC) and observed a significant correlation between number of binding motifs and number of individuals with partial methylation in their whole-genome bisulfite sequencing data at germline DMRs. However, as sequence variation information was only available for the 12 cell lines and very few DMRs had DNA sequence variation in these cell lines at a ZFP57 motif, we were not able to derive any association between gDMR sequence variation and polymorphic imprinted methylation:

“Using ZFP57 and ZNF445 ChIP-seq peak calling information from human embryonic stem cells and the HEK 293T cell line (Imbeault et al., 2017; Takahashi et al., 2019), 44% of the novel gDMRs and 49% of the reported gDMRs were bound by ZFP57 and/or ZNF445 (Figure 3C; Supplementary File 4). Of these gDMRs, 89% had a ZFP57 peak and 45% had a ZNF445 peak. This highlights the importance of ZFP57 as an important factor for the maintenance of imprinted methylation at gDMRs. 5′-TGC(5mC)GC-3′ is the canonical binding motif for ZFP57 (Quenneville et al., 2011). 88% of the gDMRs with a ZFP57 peak had at least one 5′-TGCCGC3′ motif, while 40% of the gDMRs without ZFP57 peak had at least one 5′-TGCCGC-3′ motif in the human genome (GRCh38; Supplementary File 4). Moreover, at gDMRs the number of 5′TGCCGC-3′ motifs demonstrated a significant positive correlation with the number of individuals demonstrating partial methylation (Pearson = 0.54, p-value = 3.6e−07; Appendix 1 – figure 1). This suggests that a greater number of motifs provides more functional binding opportunities for ZFP57 and also less likelihood that all ZFP57 motifs could be perturbed through polymorphism or DNA sequence variation resulting in the imprinted methylation being less polymorphic.”

4. More rigor is required in the analysis of allelic enrichment of H3K36me3 and H3K27me3 at domains of extended PofO DNA methylation bias compared with those imprinted regions that do not, and compute the allelic read scores and provide an appropriate summary plot.

We have performed a binning-based binomial analysis of the allelic histone read counts to detect bins with significant allelic histone modifications at detected PofO methylated biased blocks and test blocks. We also included an appropriate plot to represent histone marks and the relation between H3K36me3 and H3K27me3 marks:

“To analyze allelic histone modifications and detect blocks of allelic histone marks at large blocks of PofO bias, we binned the genome into 10Kb intervals and performed a binomial test with Fisher’s combined p-value test to determine the significance of allelic read counts at 10Kb intervals with >3 informative heterozygous SNVs (Having at least 5 mapped reads) within each block in each sample. A 10Kb bin considered as significant for allelic histone mark if it had an adjusted P value <0.001 and if at least 70% of the SNVs within the 10Kb bin having ≥80% of the reads mapped to one allele. In total, 174 bins for H3K36me3 and 132 bins for H3K27me3 could be examined. Of these, 147 bins for H3K36me3 and 51 bins for H3K27me3 were significant. Thirty-eight bins were significant for both histone marks in the same sample. All the seven blocks demonstrated multiple significant bins for H3K36me3 at almost all the samples. L3MBTL1, GPR1-AS/ZDBF2, GNAS/GNAS-AS1, and ZNF597/NAA60 demonstrated multiple significant H3K27me3 bins in majority of the samples and KCNQ1OT1, PWS/AS, and ZNF331 had significant H3K27me3 bins at 3, 2, and 1 of the samples, respectively. H3K36me3 and H3K27me3 demonstrated mutual exclusive pattern and H3K36me3 appeared on the hypermethylated allele while H3K27me3 on the hypomethylated allele (Figure 8; Figure 8—figure supplement 1-6; Figure 9; Supplementary File 8).

To determine if allelic histone marks are unique to the PofO methylation-biased blocks, we examined allelic histone marks on several other imprinted clusters with strong ASE which did not display PofO bias methylation. For this, we examined PPIEL, MEG3, MEST, DIRAS3, IGF2, MTRNR2L4, and ADNP2/PARD6G-AS1 clusters. Eighty-three bins for H3K36me3 and 138 bins for H3K27me3 could be examined at the seven test blocks. Of these, only 5 bins for H3K36me3 and 7 bins for H3K27me3 were significant and none of the bins were significant for both histone marks (Figure 9—figure supplement 1-8; Supplementary File 9). These results suggest that the blocks of PofO methylation bias in the gene body of active alleles are mediated by transcription and histone marks at their gene bodies.”

More details on these points are given below, as well as minor points that need to be addressed also.

Reviewer #2 (Recommendations for the authors):

The authors addressed some (not all) of the concerns raised in the first round of review, often differently than suggested (which is acceptable).

The data availability however, is not acceptable in my view; given that the bioinformatic approach to DNA methylation phasing is computationally sophisticated and was published elsewhere already (Akbari, et al., 2021), it should be expected that the processed nanometh tracks (shown in Figure 3c, 6-8 and the extensive supplement) are made available via a data repository (e.g. GEO or Mendeley data). Inclusion of processed data tracks is obligatory for GEO submissions, and recreating these data tracks from the raw data places an undue burden on labs wishing to query these data for their loci of interest. These data tracks should include all chromosomes, including sex chromosomes, to enable others to re-run statistical analyses with their own (p-value and DNA methyation change) cutoffs.

We appreciate the reviewer’s positive evaluation of our effort to address the comments.

We have deposited methylation and histone modification tracks to Mendeley data repository (https://data.mendeley.com/datasets/f4k2gytbh5/1; doi: 10.17632/f4k2gytbh5.1).

Reviewer #3 (Recommendations for the authors):

In my review of the original manuscript, I suggested that it was important that the study showed something new, either in relation to undiscovered imprinted loci with potential importance in human development, physiology or disease, or new concepts in imprinting regulation. Specifically, I recommended further analysis of:

– polymorphic imprinting

– analysis of ZFP57/ZNF445 binding sites within candidate gDMRs

– stricter threshold for methylation levels in gametes to call candidate gDMRs

and additional improvements including:

– more focussed discussion

– clear summary table

In general, these points have been addressed well in the revised manuscript.

"Novel Imprinted DMRs Display Inter-Individual Variation"

I think this is an important addition to the manuscript that demonstrates the majority of the novel DMRs identified in the current study could represent 'polymorphic' imprinting and they are not consistently methylated in blood samples from 87 individuals, in comparison to 'well-characterised' DMRs. Some of the novel DMRs exhibited partial methylation consistent with imprinted status in as few as 1-2% of individuals, which could explain why they had not been detected in previous studies. The relevant analysis is performed well.

On the other hand, it is an omission that the authors do not comment on this high incidence of polymorphic imprinting in the discussion. The authors do need to return to this finding in their discussion. Although at this point, they are not able to provide much speculation for why these loci exhibit inter-individual variation, there are multiple implications of the finding.

Regarding the opening line of this section, "Imprinted methylation can display variation across individuals due to environmental and genetic factors", I think this statement could be modified, as generally speaking imprinting (i.e., well-characterised imprints) is consistent between individuals and resistant to environmental factors (with procedures associated with assisted reproduction techniques possibly the most likely 'environment' to lead to instability of methylation at gDMRs). Therefore, rephrase to something like: "Although imprinted methylation is generally regarded is consistent between individuals and resistant to environmental factors, there are examples of polymorphic imprinting……."

"Determination of Germline versus Somatic Status of Novel Imprinted DMRs"

This section now includes the analysis of ZFP57/ZNF445 binding data – published ChIP-seq datasets from hESCs and HEK293 cells. Interestingly, 44% of the novel gDMRs have binding for one or both of these proteins in these cells, compared with 49% of characterised gDMRs. A difference in binding by these factors does not therefore appear to relate to the high incidence of polymorphic imprinting amongst novel DMRs, but the possibility that there is sequence variation in the ZFP57 binding sites between the sequenced samples has not been addressed.

It is also welcomed that the authors apply the more stringent thresholds for gDMR classification in this section.

"Enriched Allelic H3K36me3 and H3K27me3 Histone Marks at Contiguous Blocks"

It is interesting to see the differential allelic enrichment for these two histone modifications across the seven extended domains that display biased parent-of-origin DNA methylation. The authors draw a distinction with other imprinted domains without this extended PofO bias. This conclusion is based entirely on inspection of screenshots of ChIP-seq data, examples of which are provided in Figure 8 and Supplementary Figures 14-26. I think the authors could be a little more rigorous and compute the allelic read scores and provide an appropriate summary plot.

We appreciate reviewer’s positive evaluation of our revised manuscript.

1. It is an omission that the authors do not comment on this high incidence of polymorphic imprinting in the discussion. The authors do need to return to this finding in their discussion.

This comment has been addressed above under the “responses to the main points” point number 2.

2. Test for sequence variation in ZFP57 binding sites in polymorphically methylated gDMRs if sequence information is available for the LCLs and other informative samples for which DNA methylation data exist.

This comment has been addressed above under the “responses to the main points” point number 3.

3. The authors should be more rigorous in their analysis of allelic enrichment of H3K36me3 and H3K27me3 at domains of extended PofO DNA methylation bias compared with those imprinted regions that do not, and compute the allelic read scores and provide an appropriate summary plot.

This comment has been addressed above under the “responses to the main points” point number 4.

4. Rephrase statement: "Imprinted methylation can display variation across individuals due to environmental and genetic factors".

We have rephrased this sentence as suggested by reviewer:

“Although imprinted methylation is generally regarded as consistent between individuals and resistant to environmental factors, there are examples of polymorphic imprinting where imprinted methylation is not consistently observed across individuals.”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Akbari V. 2022. Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing_Akbari-etal. Mendeley Data. [DOI] [PMC free article] [PubMed]
    2. Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, Armstrong J, Tigyi K, Maurer N, Koren S, Sedlazeck FJ, Marschall T, Mayes S, Costa V, Zook JM, Liu KJ, Kilburn D, Sorensen M, Munson KM, Vollger MR, Monlong J, Garrison E, Eichler EE, Salama S, Haussler D, Green RE, Akeson M, Phillippy A, Miga KH, Carnevali P, Jain M, Paten B. 2020. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Human Pangenome Reference Consortium. hpgp-data [DOI] [PMC free article] [PubMed]
    3. Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, Tyson JR, Beggs AD, Dilthey AT, Fiddes IT, Malla S, Marriott H, Nieto T, O'Grady J, Olsen HE, Pedersen BS, Rhie A, Richardson H, Quinlan AR, Snutch TP, Tee L, Paten B, Phillippy AM, Simpson JT, Loman NJ, Loose M. 2018. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nanopore WGS Consortium. NA12878 [DOI] [PMC free article] [PubMed]
    4. De Coster W, De Rijk P, De Roeck A, De Pooter T, D'Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. 2019. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. European Nucleotide Archive. PRJEB26791 [DOI] [PMC free article] [PubMed]
    5. Zook JM, Catoe D, McDaniel J, Vang L, Spies N, Sidow A, Weng Z, Liu Y, Mason CE, Alexander N, Henaff E, McIntyre ABR, Chandramohan D, Chen F, Jaeger E, Moshrefi A, Pham K, Stedman W, Liang T, Saghbini M, Dzakula Z, Hastie A, Cao H, Deikus G, Schadt E, Sebra R, Bashir A, Truty RM, Chang CC, Gulbahce N, Zhao K, Ghosh S, Hyland F, Fu Y, Chaisson M, Xiao C, Trow J, Sherry ST, Zaranek AW, Ball M, Bobe J, Estep P, Church GM, Marks P, Kyriazopoulou-Panagiotopoulou S, Zheng GXY, Schnall-Levin M, Ordonez HS, Mudivarti PA, Giorda K, Sheng Y, Rypdal KB, Salit M. 2016. Extensive sequencing of seven human genomes to characterize benchmark reference materials. NCBI Genome in a Bottle FTP. FTP [DOI] [PMC free article] [PubMed]
    6. The 1000 Genomes Project Consortium 2015. A global reference for human genetic variation. The International Genome Sample Resource. 30x-grch38
    7. Stunnenberg HG, Abrignani S, Adams D, de Almeida M, Altucci L, Amin V, Amit I, Antonarakis SE, Aparicio S, Arima T, Arrigoni L, Arts R, Asnafi V, Badosa ME, Bae JB, Bassler K, Beck S, Berkman B, Bernstein BE, Hirst M. 2016. The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery. Blueprint Epigenome. blueprint [DOI] [PubMed]
    8. Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA. 2010. The NIH Roadmap Epigenomics Mapping Consortium. NCBI Gene Expression Omnibus. epigenomics [DOI] [PMC free article] [PubMed]
    9. ENCODE Project Consortium 2012. An Integrated Encyclopedia of DNA Elements in the Human Genome. ENCODE. encodeproject [DOI] [PMC free article] [PubMed]
    10. Okae H, Chiba H, Hiura H, Hamada H, Sato A, Utsunomiya T, Kikuchi H, Yoshida H, Tanaka A, Suyama M, Arima T. 2014. Genome-wide analysis of DNA methylation dynamics during early human development. DNA Data Bank of Japan. DRA003802 [DOI] [PMC free article] [PubMed]
    11. Steinmetz LM, Hogenesch JB, Kellis M, Batzoglou S, Snyder M. 2013. Extensive Variation in Chromatin States Across Humans. NCBI Sequence Read Archive. SRP030041
    12. Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. 2013. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. NCBI Gene Expression Omnibus. GSE42836 [DOI] [PMC free article] [PubMed]
    13. Tung J, Barreiro LB, Johnson ZP, Hansen KD, Michopoulos V, Toufexis D, Michelini K, Wilson ME, Gilad Y. 2012. Social environment is associated with gene regulatory variation in the rhesus macaque immune system. NCBI Gene Expression Omnibus. GSE34128 [DOI] [PMC free article] [PubMed]
    14. Jeong H, Mendizabal I, Berto S, Chatterjee P, Layman T, Usui N, Toriumi K, Douglas C, Singh D, Huh I, Preuss TM, Konopka G, Yi S V. 2021. Evolution of DNA methylation in the human brain. NCBI Gene Expression Omnibus. GSE151768 [DOI] [PMC free article] [PubMed]
    15. Dahlet T, Argüeso Lleida A, Al Adhami H, Dumas M, Bender A, Ngondo RP, Tanguy M, Vallet J, Auclair G, Bardet AF, Weber M. 2020. Genome-wide analysis in the mouse embryo reveals the importance of DNA methylation for transcription integrity. NCBI Gene Expression Omnibus. GSE130735 [DOI] [PMC free article] [PubMed]
    16. Jung YH, Sauria MEG, Lyu X, Cheema MS, Ausio J, Taylor J, Corces VG. 2017. Chromatin States in Mouse Sperm Correlate with Embryonic and Adult Regulatory Landscapes. NCBI Gene Expression Omnibus. GSE79226 [DOI] [PMC free article] [PubMed]
    17. Saenz-de-Juano MD, Ivanova E, Billooye K, Herta A-C, Smitz J, Kelsey G, Anckaert E. 2019. Genome-wide assessment of DNA methylation in mouse oocytes reveals effects associated with in vitro growth, superovulation, and sexual maturity. NCBI Gene Expression Omnibus. GSE128656 [DOI] [PMC free article] [PubMed]
    18. Gao F, Niu Y, Sun YE, Lu H, Chen Y, Li S, Kang Y, Luo Y, Si C, Yu J, Li C, Sun N, Si W, Wang H, Ji W, Tan T. 2017. De novo DNA methylation during monkey pre-implantation embryogenesis. NCBI Gene Expression Omnibus. GSE60166 [DOI] [PMC free article] [PubMed]
    19. Florian Z, Droplaug NM, Olafur TM, Nicolas JW, Tiffany JM, Asgeir S, Gisli HH, Sigurjon AG, Pall M, Helga I, Snædis K, Kristjan. Kristjan FA, Anna H, Julius G, Thorunn R, Ingileif J, Hilma H, Gudmundur IE, Olof S, Isleifur O, Gisli M, Daniel FG, Unnur T, Bjarni VH, Simon NS, Kari S. 2018. Methylation and expression data for whole-genome human imprinting study. figshare. [DOI]
    20. Imbeault M, Helleboid P, Trono D. 2017. ChIP-exo of human KRAB-ZNFs transduced in HEK 293T cells and KAP1 in hES H1 cells. NCBI Gene Expression Omnibus. GSE78099
    21. Takahashi N, Coluccio A, Thorball C. W, Planet E, Shi H, Offner S, Turelli P, Imbeault M, Ferguson-Smith A. C, Trono D. 2019. ZNF445 is a primary regulator of genomic imprinting. NCBI Gene Expression Omnibus. GSE115387 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Figure 2—source data 1. The average methylation of each DMR in human WGBS tissue samples.
    Figure 2—source data 2. The average methylation of each DMR in human WGBS blood samples.
    Figure 4—source data 1. H3K4me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the detected DMRs.
    Figure 4—source data 2. H3K4me3 allelic read counts for the paternal and maternal heterozygous single-nucleotide variants (SNVs) mapped to the detected DMRs.
    Figure 9—source data 1. H3K36me3 and H3K27me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the detected PofO-biased methylation blocks.
    Figure 9—figure supplement 1—source data 1. H3K36me3 and H3K27me3 allelic read counts for the heterozygous single-nucleotide variants (SNVs) mapped to the test blocks.
    Supplementary file 1. List of the reported imprinted differentially methylated regions (DMRs).
    elife-77898-supp1.xlsx (32.4KB, xlsx)
    Supplementary file 2. The list of 200 detected imprinted differentially methylation regions from nanopore data.

    This list demonstrates the results of differential methylation analysis (DMA) using nanomethphase dma module that uses DSS R package for DMA.

    elife-77898-supp2.xlsx (31.9KB, xlsx)
    Supplementary file 3. Mapping detected differentially methylated regions (DMRs) using nanopore sequencing across 12 lymphoblastoid cell lines (LCLs) to reported DMRs.
    elife-77898-supp3.xlsx (29.6KB, xlsx)
    Supplementary file 4. Detected differentially methylated regions (DMRs) using nanopore sequencing across 12 lymphoblastoid cell lines (LCLs) that mapped to reported DMRs or confirmed in whole-genome bisulfite sequencing (WGBS) data.
    elife-77898-supp4.xlsx (48.6KB, xlsx)
    Supplementary file 5. The results for examining inter-individual variations of the detected differentially methylated regions (DMRs) in whole-genome bisulfite sequencing (WGBS) datasets from blood.
    elife-77898-supp5.xlsx (38.1KB, xlsx)
    Supplementary file 6. The allelic read counts of H3K4me3 chromatin immunoprecipitation sequencing (ChIP-seq) for each heterozygous single-nucleotide variant (SNV) at the detected differentially methylated regions (DMRs) in six lymphoblastoid cell line (LCL) samples.
    elife-77898-supp6.xlsx (61.6KB, xlsx)
    Supplementary file 7. List of the known imprinted genes.
    elife-77898-supp7.xlsx (28KB, xlsx)
    Supplementary file 8. The allelic read counts of H3K36me3 and H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) for the seven parent-of-origin (PofO)-biased methylation blocks in six lymphoblastoid cell line (LCL) samples.
    elife-77898-supp8.xlsx (250KB, xlsx)
    Supplementary file 9. The allelic read counts of H3K36me3 and H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) for the seven test blocks in six lymphoblastoid cell line (LCL) samples.
    elife-77898-supp9.xlsx (190.6KB, xlsx)
    Supplementary file 10. List of the whole-genome bisulfite sequencing (WGBS) tissues and WGBS blood samples used in our study.
    Transparent reporting form
    Appendix 1—figure 1—source data 1. Number of ZFP57-binding motif (TGCCGC) at differentially methylated regions (DMRs).

    Data Availability Statement

    The current manuscript is a computational study, so no new datasets have been generated for this manuscript. The source of each dataset is provided under the ‘“Materials and methods’” section under the appropriate subsection. Genomic tracks generated in this study including DNA methylation and histone modification tracks are deposited in the Mendeley data repository (https://doi.org/10.17632/f4k2gytbh5.1). Codes are uploaded to GitHub https://github.com/vahidAK/NanoMethPhase/tree/master/scripts (copy archived at swh:1:rev:1657f7aed60604aa7c7f3e77d992d76bee6bf6d3): PartialMethylation_AtDMR.sh, CountReadsAtSNV.py and Binomial_test.py.

    The following dataset was generated:

    Akbari V. 2022. Genome-Wide Detection of Imprinted Differentially Methylated Regions Using Nanopore Sequencing_Akbari-etal. Mendeley Data.

    The following previously published datasets were used:

    Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, Armstrong J, Tigyi K, Maurer N, Koren S, Sedlazeck FJ, Marschall T, Mayes S, Costa V, Zook JM, Liu KJ, Kilburn D, Sorensen M, Munson KM, Vollger MR, Monlong J, Garrison E, Eichler EE, Salama S, Haussler D, Green RE, Akeson M, Phillippy A, Miga KH, Carnevali P, Jain M, Paten B. 2020. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Human Pangenome Reference Consortium. hpgp-data

    Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, Tyson JR, Beggs AD, Dilthey AT, Fiddes IT, Malla S, Marriott H, Nieto T, O'Grady J, Olsen HE, Pedersen BS, Rhie A, Richardson H, Quinlan AR, Snutch TP, Tee L, Paten B, Phillippy AM, Simpson JT, Loman NJ, Loose M. 2018. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nanopore WGS Consortium. NA12878

    De Coster W, De Rijk P, De Roeck A, De Pooter T, D'Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. 2019. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. European Nucleotide Archive. PRJEB26791

    Zook JM, Catoe D, McDaniel J, Vang L, Spies N, Sidow A, Weng Z, Liu Y, Mason CE, Alexander N, Henaff E, McIntyre ABR, Chandramohan D, Chen F, Jaeger E, Moshrefi A, Pham K, Stedman W, Liang T, Saghbini M, Dzakula Z, Hastie A, Cao H, Deikus G, Schadt E, Sebra R, Bashir A, Truty RM, Chang CC, Gulbahce N, Zhao K, Ghosh S, Hyland F, Fu Y, Chaisson M, Xiao C, Trow J, Sherry ST, Zaranek AW, Ball M, Bobe J, Estep P, Church GM, Marks P, Kyriazopoulou-Panagiotopoulou S, Zheng GXY, Schnall-Levin M, Ordonez HS, Mudivarti PA, Giorda K, Sheng Y, Rypdal KB, Salit M. 2016. Extensive sequencing of seven human genomes to characterize benchmark reference materials. NCBI Genome in a Bottle FTP. FTP

    The 1000 Genomes Project Consortium 2015. A global reference for human genetic variation. The International Genome Sample Resource. 30x-grch38

    Stunnenberg HG, Abrignani S, Adams D, de Almeida M, Altucci L, Amin V, Amit I, Antonarakis SE, Aparicio S, Arima T, Arrigoni L, Arts R, Asnafi V, Badosa ME, Bae JB, Bassler K, Beck S, Berkman B, Bernstein BE, Hirst M. 2016. The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery. Blueprint Epigenome. blueprint

    Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA. 2010. The NIH Roadmap Epigenomics Mapping Consortium. NCBI Gene Expression Omnibus. epigenomics

    ENCODE Project Consortium 2012. An Integrated Encyclopedia of DNA Elements in the Human Genome. ENCODE. encodeproject

    Okae H, Chiba H, Hiura H, Hamada H, Sato A, Utsunomiya T, Kikuchi H, Yoshida H, Tanaka A, Suyama M, Arima T. 2014. Genome-wide analysis of DNA methylation dynamics during early human development. DNA Data Bank of Japan. DRA003802

    Steinmetz LM, Hogenesch JB, Kellis M, Batzoglou S, Snyder M. 2013. Extensive Variation in Chromatin States Across Humans. NCBI Sequence Read Archive. SRP030041

    Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. 2013. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. NCBI Gene Expression Omnibus. GSE42836

    Tung J, Barreiro LB, Johnson ZP, Hansen KD, Michopoulos V, Toufexis D, Michelini K, Wilson ME, Gilad Y. 2012. Social environment is associated with gene regulatory variation in the rhesus macaque immune system. NCBI Gene Expression Omnibus. GSE34128

    Jeong H, Mendizabal I, Berto S, Chatterjee P, Layman T, Usui N, Toriumi K, Douglas C, Singh D, Huh I, Preuss TM, Konopka G, Yi S V. 2021. Evolution of DNA methylation in the human brain. NCBI Gene Expression Omnibus. GSE151768

    Dahlet T, Argüeso Lleida A, Al Adhami H, Dumas M, Bender A, Ngondo RP, Tanguy M, Vallet J, Auclair G, Bardet AF, Weber M. 2020. Genome-wide analysis in the mouse embryo reveals the importance of DNA methylation for transcription integrity. NCBI Gene Expression Omnibus. GSE130735

    Jung YH, Sauria MEG, Lyu X, Cheema MS, Ausio J, Taylor J, Corces VG. 2017. Chromatin States in Mouse Sperm Correlate with Embryonic and Adult Regulatory Landscapes. NCBI Gene Expression Omnibus. GSE79226

    Saenz-de-Juano MD, Ivanova E, Billooye K, Herta A-C, Smitz J, Kelsey G, Anckaert E. 2019. Genome-wide assessment of DNA methylation in mouse oocytes reveals effects associated with in vitro growth, superovulation, and sexual maturity. NCBI Gene Expression Omnibus. GSE128656

    Gao F, Niu Y, Sun YE, Lu H, Chen Y, Li S, Kang Y, Luo Y, Si C, Yu J, Li C, Sun N, Si W, Wang H, Ji W, Tan T. 2017. De novo DNA methylation during monkey pre-implantation embryogenesis. NCBI Gene Expression Omnibus. GSE60166

    Florian Z, Droplaug NM, Olafur TM, Nicolas JW, Tiffany JM, Asgeir S, Gisli HH, Sigurjon AG, Pall M, Helga I, Snædis K, Kristjan. Kristjan FA, Anna H, Julius G, Thorunn R, Ingileif J, Hilma H, Gudmundur IE, Olof S, Isleifur O, Gisli M, Daniel FG, Unnur T, Bjarni VH, Simon NS, Kari S. 2018. Methylation and expression data for whole-genome human imprinting study. figshare.

    Imbeault M, Helleboid P, Trono D. 2017. ChIP-exo of human KRAB-ZNFs transduced in HEK 293T cells and KAP1 in hES H1 cells. NCBI Gene Expression Omnibus. GSE78099

    Takahashi N, Coluccio A, Thorball C. W, Planet E, Shi H, Offner S, Turelli P, Imbeault M, Ferguson-Smith A. C, Trono D. 2019. ZNF445 is a primary regulator of genomic imprinting. NCBI Gene Expression Omnibus. GSE115387


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES