Abstract
Region-based epigenome-wide association studies have demonstrated improved statistical power and biological interpretability compared with probe-wise analyses of DNA methylation data. However, most existing region-based methods characterize methylation dysregulation primarily through changes in mean methylation levels associated with a phenotype of interest. Substantial evidence indicates that phenotype-associated methylation alterations may also manifest through changes in methylation variability or through joint shifts in mean and variability. Despite this, no existing statistical framework jointly models mean–variance methylation changes in a region-based manner. We propose ReMeDy, a flexible statistical framework that uses a hierarchical likelihood approach within a generalized linear model setting to identify differentially methylated regions, variably methylated regions, and regions exhibiting joint differential and variable methylation at a genome-wide scale. Unlike existing models, ReMeDy operates directly on biologically defined co-methylated regions, allowing it to naturally capture spatial correlation inherent in DNA methylation array data, while avoiding reliance on heuristic, user-defined tuning parameters such as smoothing spans and kernel bandwidths that can substantially influence results and introduce subjectivity. Through extensive simulation studies and comprehensive benchmarking against popular models, we demonstrate that ReMeDy maintains false discovery and type-I error rates at nominal levels while achieving consistently higher statistical power across a wide range of realistic scenarios. Application to population-level DNA methylation data further shows that ReMeDy identifies biologically meaningful regions and pathways implicated in complex human diseases that are not captured by conventional mean-based analyses alone. ReMeDy is implemented as an open-source R package and is freely available at https://github.com/SChatLab/ReMeDy.
Keywords: DNA methylation, region-based analyses, methylation dysregulation, variable methylation, differential methylation, epigenome-wide association studies
1. Introduction
DNA methylation (DNAm) is a central epigenetic mechanism involved in the regulation of gene expression, chromatin organization, and cellular identity, serving as a dynamic interface between genetic variation, environmental exposures, and disease susceptibility1–3. Over the past decade, array-based technologies, particularly the Illumina HumanMethylation450 and MethylationEPIC platforms, have enabled epigenome-wide association studies (EWAS) at unprecedented scale, facilitating systematic, genome-wide investigations of DNAm patterns across diverse populations and disease contexts4–6. These studies have shown that disease-associated methylation changes are often not restricted to individual Cytosine-phosphate-Guanine (CpG) sites but instead occur across clusters of spatially correlated CpGs spanning regulatory regions such as promoters, enhancers, and CpG islands7–13. Consequently, region-based analyses of methylation data have emerged as a powerful alternative to single-site testing, offering increased statistical power, a reduced multiple-testing burden, and improved robustness to technical noise11;14;15. Genomic regions identified using this region-based approach are referred to as differentially methylated regions (DMRs), defined as regions in which the mean methylation levels of CpGs differ significantly between groups or treatment conditions. Importantly, substantial evidence indicates that DMRs tend to exhibit greater biological interpretability and improved reproducibility across cohorts, making them more suitable for downstream functional and translational analyses and for identifying potential therapeutic biomarkers16–18.
Despite substantial advances in study design, preprocessing, and statistical methodology, a major conceptual bias persists in EWAS, where epigenetic dysregulation is still predominantly characterized by differences in mean DNAm between phenotypic groups or exposure categories18–20. Most widely used EWAS pipelines are explicitly optimized to detect shifts in average methylation levels at individual CpG sites or aggregated genomic regions21;22, implicitly assuming that within-group variability is homogeneous or incidental to the primary scientific question, and rarely treating variance effects as explicit targets of inference23–25. Although mean-based frameworks have proven effective for identifying robust and reproducible associations, they impose inherent limitations on the classes of epigenetic dysregulation that can be detected26. Recent studies highlight that methylation dysregulation is more appropriately conceptualized as comprising three distinct components, mean, variance, and joint mean–variance changes, and that disease-associated alterations may manifest through not only mean methylation changes but also through increased dispersion, heterogeneity, or epigenetic instability even in the absence of substantial mean shifts27–29. Such variability-driven effects are increasingly recognized in complex and heterogeneous conditions, including cancer, autoimmune diseases, and aging-related phenotypes28;30–34. Consistent with recent studies, we provide additional corroborative evidence in this study demonstrating that all three components represent integral components of the methylation landscape (Figure 1), and therefore must be jointly investigated to avoid an incomplete and potentially biased understanding of epigenetic mechanisms underlying disease susceptibility and progression23;26.
Figure 1: This figure illustrates distinct components of methylation dysregulation observed across the genome.
Randomly selected regions are shown to demonstrate four patterns of methylation change between cases and controls, visualized using beta value density distributions. In each panel, the three density curves per group correspond to the three CpG sites within the region. Panel A shows a DMR, characterized by a location shift, where the density peaks are displaced between groups, indicating differences in mean methylation levels. Panel B shows a VMR, where the mean methylation levels are similar between groups but the spread of the distributions differs, reflecting changes in methylation variability. Panel C shows a DVMR, where both a location shift and changes in variability are observed. Panel D shows a region with no methylation dysregulation, where the distributions overlap between cases and controls.
Existing methods for detecting methylation dysregulation, including both differential methylation (DM) and variable methylation (VM) can be best viewed as a collection of methodological design choices rather than cohesive inferential frameworks, resulting in a fragmented analytical landscape35–37. Region-level DM approaches typically rely on outcome-guided region discovery, aggregating probe-wise association statistics across neighboring CpGs using smoothing, clustering, or p-value combination methods15;38;14. While these strategies leverage local spatial correlation to improve power, they are sensitive to tuning parameters, prone to outcome-driven boundary overfitting, and often identify regions that do not reflect intrinsic epigenomic organization, thereby reducing generalizability and interpretability14;39;40. In addition, these summary-statistic based approaches inherits the assumptions, biases, and calibration properties of underlying probe-wise tests and depends on accurate estimation of spatial autocorrelation, which can be unstable in modest sample sizes or under irregular probe spacing common to methylation data39;41. Methodological limitations are even more pronounced for VM analysis. Firstly, existing VM models are mostly site-based approaches and ignore the spatial dependence structure inherent to DNAm data23;42. Secondly, variance is typically modeled independently of the mean, despite fundamental mean–variance coupling in methylation data which can lead to mis-specified variance estimates and fragmented biological interpretation43. Thirdly, VM methods have restricted modeling flexibility, as most frameworks do not accommodate covariates to adjust for confounding or technical artifacts (such as batch effects, cell-composition shifts) and often limit the primary exposure to binary designs, substantially limiting their applicability to more complex epidemiological studies23;44;45;42. Collectively, this highlights that methylation dysregulation continues to be characterized in a piecemeal fashion, and that the absence of integrated region-level frameworks capable of jointly modeling location and scale effects warrants the development of a unified statistical framework that can capture all components of methylation dysregulation while accounting for the structural and biological complexity of DNAm data.
To address these methodological gaps, we propose ReMeDy (integrated statistical framework for Region-based detection of Methylation Dysregulation), which identifies differentially methylated, variably methylated, and joint differential–variable regions within a single coherent statistical framework. Firstly, ReMeDy is built on a hierarchical likelihood approach within a generalized linear model setting, enabling region-level modeling of methylation data which accounts for correlation among neighboring CpG sites through random effects and providing principled inference under spatial dependence. Secondly, unlike existing region-based methods that rely on outcome-guided region aggregation and are therefore sensitive to tuning parameters and boundary overfitting, ReMeDy operates only on co-methylation regions defined independently of the phenotype, ensuring that regions reflect intrinsic epigenomic organization and improving interpretability and generalizability. Thirdly, the framework performs joint mean–variance modeling, explicitly accounting for the mean–variance dependence inherent to methylation data, and thereby avoids the mis-specified variance estimates and biased inference that can arise when location and scale effects are modeled separately. Lastly, ReMeDy supports covariate/confounder adjustments, incorporation of multiple random effects to capture technical and biological sources of variation, and accommodates binary, continuous, and discrete phenotypic variables. Collectively, these features position ReMeDy as an integrated and statistically principled framework that addresses the fragmented nature of existing approaches and enables comprehensive characterization of methylation dysregulation in array-based EWAS. To our knowledge, this represents the first systematic implementation of a hierarchical generalized linear modeling (HGLM) framework with joint mean–variance modeling for identifying phenotype-associated methylation markers in DNAm data.
The remainder of the manuscript is organized as follows. In Section 2, we first describe the hierarchical likelihood framework within the generalized linear model (GLM) setting and introduce the resulting HGLM in the context of DNAm array data, along with algorithms for estimating model coefficients under a joint mean–variance modeling framework. In Section 3, using both synthetic data and published methylation array datasets, we conduct a systematic benchmarking study of more than ten representative methods for detecting aberrant methylation. We show that our proposed method, ReMeDy, consistently outperforms existing approaches across experimental platforms in terms of statistical power and false discovery rate control. Finally, Section 4 concludes with a brief discussion of our findings. Our open-source software package is available at https://github.com/SChatLab/ReMeDy.
2. Methods
2.1. Current modeling schemes of DNAm array data
DNAm array data exhibits complex dependence structures arising from both biological and technical sources. CpG sites are spatially organized along the genome and frequently display correlated methylation patterns due to shared regulatory mechanisms, chromatin organization, and genomic proximity7;46;47. Several strategies have been proposed to address correlation in DNAm data. A widely used class of models perform CpG-level association testing followed by post hoc aggregation of nearby significant sites using smoothing38, p-value combination and autocorrelation48 based approaches. These approaches leverage spatial proximity to implicitly account for local correlation but do not model dependence directly within the statistical framework. A less commonly used class of methods applies classical models for correlated data, such as linear mixed models (LMMs)49 and generalized estimating equations (GEEs)50, which explicitly model spatial dependence by treating clustered CpGs as repeated measurements within individuals51;52;47. While these approaches represent important progress in handling correlation in DNAm data, a common shortcoming still persists. Despite growing evidence that methylation dysregulation can manifest through changes in mean, variance, or their joint behavior, most existing methods are primarily formulated to assess mean effects and provide limited support for joint mean–variance modeling27–29. This serves as our motivating factor to develop a statistical framework that can jointly capture correlation in DNAm data along with mean-variance changes in methylation patterns at the regional level.
2.2. Hierarchical likelihood approach for modeling correlated methylation data
Likelihood-based inference forms the foundation of many statistical models, but classical likelihood approaches encounter challenges when applied to correlated and hierarchical data. In LMMs, random effects are introduced to account for correlation, and under Gaussian assumptions the random effects can be integrated out analytically to yield a closed-form marginal likelihood49. However, they are primarily designed to model mean structure, with variance components treated as nuisance parameters rather than quantities directly linked to covariates49. As a result, LMMs offer limited flexibility for modeling systematic changes in variability across phenotypic groups. Generalized linear mixed models (GLMMs)53 extend mixed-effects modeling to non-Gaussian responses, but likelihood evaluation requires integration over random effects, which is analytically intractable and typically relies on numerical approximations such as Laplace methods or quadrature54. While these approximations are often adequate for modeling mean effects, they become less reliable and increasingly computationally demanding when variance parameters are allowed to depend on covariates, limiting their practical use for joint mean–variance modeling54. GEEs work with a marginal likelihood approach by specifying a working correlation structure, but they do not define a full likelihood, limiting inference on variance components and likelihood-based model comparison50. To overcome these limitations, Lee and Nelder introduced the hierarchical likelihood (h-likelihood) approach55;56. Rather than integrating out random effects, the h-likelihood defines a joint likelihood for the observed data and the unobserved random effects, treating the random effects as additional unknown quantities to be estimated alongside the model parameters. This formulation explicitly represents correlation through shared random effects while retaining a coherent likelihood-based framework. By avoiding high-dimensional integration, the h-likelihood enables simultaneous estimation of fixed effects, random effects and dispersion parameters, making it suitable for modeling correlated biological data such as DNAm.
2.3. Hierarchical generalized linear modeling scheme for DNAm data
The h-likelihood framework leads to HGLMs which extends GLMMs by allowing both the mean and dispersion components of the response distribution to be modeled through separate but linked linear predictors, each of which may include fixed effects and random effects57. This structure enables joint inference on mean and variability while explicitly accounting for correlation. Random effects are introduced at the subject level to account for correlation among CpGs measured within the same individual, treating CpG sites within a region as repeated measurements. At the same time, dispersion model allow variability to depend on biological outcomes, enabling direct modeling of variable methylation effects. Together, this framework not only accounts for the correlation in the measurements but also allows methylation differences linked to a phenotype of interest to be characterized through changes in mean methylation, methylation variance, or both, within an integrated statistical framework.
2.4. Proposed Model Specification for Region-Based DNAm Analysis
In this study, we propose to model region-level DNAm data using the h-likelihood approach under the HGLM framework. Let denote the methylation level for sample at CpG site within genomic region . Since methylation levels are analyzed on the logit-transformed scale (M-values), which approximately follow a Gaussian distribution, we assume a Gaussian likelihood with an identity link, i.e.,
| (1) |
where denotes the conditional mean of the methylation level for CpG site in sample modeled through the mean model, and denotes the residual variance, which is modeled through the variance model described in the following sections. These two models are linked within the HGLM framework. The mean model characterizes changes in mean methylation levels while the variance model captures changes in methylation variability that is linked to the phenotypic variable of interest which can be binary, continuous or discrete in nature. Together, these components enable joint inference on mean and variance within an integrated h-likelihood formulation.
2.4.1. Mean Model
For a given genomic region , the conditional mean of the methylation level at CpG site for sample , given the subject-specific random effect , is modeled as
| (2) |
where denotes the identity link and . The covariate vector includes an intercept and any additional adjustment variables, with corresponding region-specific coefficients . The parameter represents the region-specific phenotype-associated effect on mean methylation, with denoting the phenotype variable of interest. Statistical inference on therefore provides a direct region-level assessment of DM, and regions with significant are identified as DMRs. The subject-specific random intercept accounts for correlation among CpG sites measured within the same individual, treating CpG sites within region as repeated measurements.
2.4.2. Variance Model
To model methylation variability across samples and phenotype of interest, the residual variance is modeled using a GLM with a log link,
| (3) |
where denotes the residual variance of the methylation level at CpG site for sample within region . The covariate vector includes an intercept and any additional adjustment variables, with corresponding region-specific coefficients . The parameter represents the phenotype-associated effect on methylation variability for region , with denoting the phenotype of interest. Statistical inference on therefore provides a direct region-level assessment of VM and regions with significant are identified as VMRs.
This two-part mean–variance formulation allows for simultaneous modeling of DM (mean differences) and VM (variance differences) across genomic regions. In this study, although our focus has been on a binary (case-control) phenotype variable, however, this proposed framework can trivially accommodate continuous and discrete phenotypic variable of interest.
2.5. Estimation via h-likelihood
Estimation of the model defined in equations (1)–(3) requires inference on the region-specific fixed effects and , the phenotype of interest effects and , the subject-level variance component , and the subject-specific random effects . Let denote the observed CpG-level M-values within genomic region . Typically, classical likelihood-based inference is based on the marginal likelihood obtained by integrating out the random effects,
| (4) |
where . Given the high-dimensional nature of DNAm data, direct evaluation of the marginal likelihood in (4) can be computationally unstable and demanding due to the presence of random effects and the explicit modeling of variance.
The h-likelihood framework55;57 avoids explicit integration by treating the random effects as additional unknown quantities to be estimated jointly with the fixed effects. The h-likelihood for region is defined as
| (5) |
which combines the conditional likelihood of the data given the random effects with the log-density of the random effects implied by their assumed Gaussian distribution. For the Gaussian HGLM considered here, the conditional log-likelihood contribution from observation is
| (6) |
where and are defined by the mean and variance models in equations (2) and (3), respectively.
Maximization of the h-likelihood proceeds through an iterative procedure that alternates between updating the mean and variance components. Conditional on current values of the variance parameters, maximization of (6) with respect to the mean structure leads to weighted estimating equations equivalent to fitting a linear mixed model with weights . This step yields updated estimates of , and the subject-specific random effects . Conditional on the updated mean structure, information about the variance parameters is contained in the squared residuals . Writing , differentiation of (6) with respect to shows that the score for the variance model depends on the quantity , which motivates treating the squared, standardized residuals from the mean model as the response for updating the variance model. These mean and variance updates are alternated until convergence.
2.6. Hypothesis testing and multiple testing adjustment
Inference for DM and VM is conducted at the region level using Wald-type test statistics derived from the fitted HGLM. For each genomic region , let and denote the estimated region-specific phenotypic effects in the mean and variance model, respectively. Let and denote the corresponding variance estimates obtained from the h-likelihood–based estimation procedure. DM is assessed by testing the null hypothesis using the Wald statistic , which is asymptotically distributed as under . The resulting -value is denoted by . Similarly, VM is assessed by testing using the Wald statistic , which is also asymptotically , yielding a variability-related -value . Here the superscript ‘M’ denotes mean model and ‘V’ denotes variance model. To assess joint evidence for regions exhibiting both DM and VM, we combine the two marginal -values using the Cauchy Combination Test (CCT)58. For each region , we define the test statistic
| (7) |
and compute the corresponding joint -value as
| (8) |
The CCT yields valid combined -values under arbitrary dependence between and .
To account for multiple testing across genomic regions, the sets and are adjusted separately using the Benjamini–Hochberg (BH) procedure59 to control the false discovery rate (FDR). Regions with adjusted ( denotes a pre-specified statistical significance threshold such as .) are declared DMRs, regions with adjusted are declared VMRs, and regions with adjusted are declared differentially and variably methylated regions (DVMRs).
3. Results
First, we describe the practical implementation details of ReMeDy, including model formulation and input requirements (Section 3.1). Second, we provide details about our simulation framework and the different simulation scenarios considered in our simulation framework (Section 3.2). Third, we present extensive simulation studies designed to evaluate the performance of ReMeDy under realistic methylation array data settings, comparing its ability to detect DMRs, VMRs, and DVMRs against existing methods (Section 3.3). Finally, we apply ReMeDy to analyze a publicly available DNAm array dataset to demonstrate its practical utility and to uncover biologically relevant methylation dysregulation that are not readily captured by existing approaches (Section 3.4).
3.1. Implementation Details
ReMeDy operates on user-supplied co-methylated regions and assumes that DNAm beta-values within these regions have been appropriately preprocessed using standard, well-established pipelines. Specifically, in our workflow, raw intensity data are processed using the minfi Bioconductor package60, which performs sample- and probe-level quality control. This includes filtering probes with poor detection p-values, removing probes with known single-nucleotide polymorphisms or cross-reactive genomic mapping, and excluding sex-chromosome probes when appropriate. This is followed by normalization and for batch effect correction we use ComBat61, an empirical Bayes method that models and adjusts for technical variation across batches, thereby removing batch-associated effects from the data. These correction for technical variation ensures that downstream analyses primarily reflect biological signal rather than technical noise. Following preprocessing, methylation levels are transformed to M-values and supplied to SACOMA62, which identifies co-methylated regions. SACOMA utilizes a spatially-constrained hierarchical clustering algorithm to obtain clusters of contiguous CpG sites exhibiting correlated methylation patterns in a genome-wide manner. ReMeDy is agnostic to the specific normalization or batch-effects correction strategy used prior to co-methylation region construction, provided that major technical effects are adequately controlled. Although we use minfi for preprocessing and SACOMA for obtaining co-methylated regions, alternative preprocessing pipelines such as ChAMP63 and co-methylated region construction methods such as coMethDMR47 can be used.
3.2. Simulation Strategy and Settings
We evaluated the capability of ReMeDy in identifying DMRs, VMRs, and DVMRs in comparison to a broad variety of existing methods. For DM, comparisons were made with region-based approaches including bumphunter38, comb-p48, dmrff64, and DMRcate65, while VM performance was assessed against existing site-level methods such as JLSsc42, iEVORA66, DiffVar24, Bartlett’s test67, and Levene’s test68 due to lack of region-based models. For joint differential and variable methylation, comparisons were made against JLSsc and iEVORA due to their ability to return CpGs exhibiting both effects. Although no published methods are explicitly designed to identify VMRs or DVMRs at the regional level, DMRcate was included in all three comparisons as its software documentation module mentions about its ability in detecting VMRs and DVMRs. Null (type-1 error) simulations were performed using datasets derived from two population-level methylation array datasets (See supplementary section 1.3 for details) representing low (N = 40) and high (N = 142) sample-size settings in which phenotypic labels of the samples were randomly permuted to remove any inherent associations and create new datasets with no true biological signals. Power simulations were performed using realistic synthetic DNAm datasets generated from multivariate normal distributions with compound symmetry region-specific correlation structure and mean and variance effects to represent DMRs, VMRs, and DVMRs, thereby providing known ground truth. Details of our simulation strategy and the compared models are provided in supplementary section 1.1 and 1.2 respectively. Model performance was evaluated using FDR, statistical power, Matthews correlation coefficient (MCC), and area under the receiver operating characteristic curve (AUROC) across all simulation scenarios (See supplementary section 1.5 for details). Simulation scenarios comprised of 10,000 regions with systematically varying sample sizes (N = 50, 100, and 150), mean effect sizes (0.4, 0.7, and 1.0), variance effects (with group 2 variability 1.5-, 2.5-, or 3.5-fold greater than group 1), and group proportions in 50%/50% or 30%/70% case–control splits (See STable 1 for summary). Results presented in this section focus on scenarios with a mean effect of 0.7, a variance effect of 2.5, and imbalanced design (30%/70% case-control split). Findings from all remaining simulation settings are reported in supplementary section 2.
3.3. Simulation Results
Under null simulations with 10,000 regions and no true biological signals present in the simulated data, most methods exhibited reasonable type-I error control, including ReMeDy, which maintained error rates below the nominal 5% threshold for both sample sizes (Figure 2). DiffVar was the only method to show mild type-I error inflation in the smaller sample size (N = 40), while achieving appropriate error control in the larger sample size (N = 142) setting. Specifically DMRcate, comb-p, bumphunter, and iEVORA were highly conservative across both sample sizes, with median type-I error rates ranging from 0 to 0.1% while Levene’s test, ReMeDy, JLSsc, and Bartlett’s test demonstrated moderate conservatism to balanced error control, with median type-I error rates ranging from 2 to 5% across both sample size settings.
Figure 2: Type-I error rate evaluation.
This panel plot illustrates the type-I error (y-axis) for ReMeDy and all competing methods under two different scenarios: Panel A shows results from a low sample size (N = 40) setting and Panel B shows results from a higher sample size (N = 142). Each boxplot represents 100 simulation runs. The dashed red line marks the nominal type-I error rate of 0.05. Methods with boxplots near or below this line are controlling false positives well. Approaches with no visible box have a type-I error rate of zero.
Beyond type-I error control, ReMeDy consistently outperformed existing DMR detection methods across all evaluation metrics under the DMR-only simulation setting, which comprised 10,000 regions, 10% true DMRs with a mean difference effect of 0.7, no variance effect, and unequal group proportions (Figure 3). ReMeDy achieved the highest statistical power across all sample sizes, with performance increasing steadily as sample size grew (Figure 3A). In comparison, DMRcate, comb-p, and bumphunter showed modest power gains, while dmrff consistently exhibited low power, consistent with its documented limitation arising from additional sub-region testing and the resulting increased multiple-testing burden64. ReMeDy, along with most other methods, maintained reliable FDR control at the nominal 5% level across all sample sizes (Figure 3B), whereas DMRcate exhibited inflated FDR at a sample size of 150. Consistent with these findings, ReMeDy achieved the highest MCC scores across all sample sizes, indicating the most balanced classification of DMRs and non-DMRs (Figure 3C). Similarly, ReMeDy attained the highest AUROC values across all sample sizes, demonstrating superior discrimination between true and non-DMRs (Figure 3D). In contrast, comb-p, DMRcate, and bumphunter showed AUROC values near 0.5 at the smallest sample size (N = 50), with improvement at larger sample sizes but remaining consistently below ReMeDy, while dmrff performed no better than random guessing. ReMeDy’s favorable performance was preserved under equal group sizes, and across other mean effect settings, demonstrating its ability to capture a variety of mean effects and robustness to group imbalance (SFigure 1–5).
Figure 3: Performance of ReMeDy and competing methods in the DMR only scenario.
This panel plot compares ReMeDy with other competing methods in DMR only scenario across three sample sizes (50, 100, and 150) with mean effect of 0.7 and unequal group proportions. Panel A shows statistical power, Panel B shows FDR, Panel C shows the MCC score and Panel D shows AUROC scores. Each bar reflects median of 100 simulation runs. The dashed red line in Panel B marks the nominal FDR threshold of 0.05. Higher values in Panels A, C, and D indicate better performance. Methods without a visible bar achieved value of zero.
Building on the DMR-only simulation results, we next evaluated performance under a VMR-only scenario using the same simulation framework of 10,000 regions and 10% true signals, but with no mean effect and a variance in group 2 that was 2.5 times larger than in group 1 (Figure 4). Across all metrics, ReMeDy again demonstrated favorable performance. In terms of power, ReMeDy showed the highest ability to detect true VMRs, with power increasing consistently with sample size (Figure 4A). JLSsc, despite being a site-level method, performed comparably to ReMeDy, while DiffVar and iEVORA showed moderate power that improved with sample size. The comparatively weaker performance of iEVORA reflects its two-step design, in which detection of variability is followed by filtering based on mean differences, resulting in limited signal when only variance effects are present. The VMR detection implementation of DMRcate showed little to no improvement in power across sample sizes, while Bartlett’s test consistently exhibited the lowest power. ReMeDy along with JLSsc, iEVORA, Bartlett’s test and Levene’s test maintained FDR control near the nominal 5% level across all sample sizes (Figure 4B), whereas DMRcate displayed inflated FDR across all sample sizes. In terms of MCC, JLSsc achieved the highest values at some sample sizes (N = 100 and 150), with ReMeDy remaining a close second (Figure 4C). A similar pattern was observed for AUROC, where ReMeDy and JLSsc consistently showed the strongest discrimination between true and non-VMRs (Figure 4D). These results were robust to equal group sizes, with ReMeDy again performing favorably across metrics and across settings with varying variance effects (SFigure 6–10).
Figure 4: Performance of ReMeDy and competing methods in the VMR only scenario.
This panel plot compares ReMeDy with other competing methods in VMR only scenario across three sample sizes (50, 100, and 150) with variance effect of 2.5 and unequal group proportions. Panel A shows statistical power, Panel B shows FDR, Panel C shows the MCC score and Panel D shows AUROC scores. Each bar reflects median of 100 simulation runs. The dashed red line in Panel B marks the nominal FDR threshold of 0.05. Higher values in Panels A, C, and D indicate better performance. Methods without a visible bar achieved value of zero.
Finally, in a similar vein, we assessed model performances under the DVMR scenario which incorporated both a mean difference of 0.7 and a variance effect of 2.5, while retaining the same simulation design of 10,000 regions, 10% true signals, and imbalanced group proportions (Figure 5). Across all evaluation metrics, ReMeDy continued to demonstrate superior performance. ReMeDy achieved the highest statistical power across all sample sizes, with steady improvement as sample size increased (Figure 5A). JLSsc and DMRcate also showed increasing power, while iEVORA exhibited limited improvement, reflecting again the inefficiency arising from its sequential mean-filtering step when both effects coexist. Similar to our previous findings, ReMeDy along with JLSsc and iEVORA consistently maintained FDR near the nominal 5% level across all sample sizes (Figure 5B), whereas DMRcate showed inflated FDR at moderate and large sample sizes. MCC and AUROC analyses further highlighted ReMeDy’s favorable performance, with ReMeDy achieving the highest scores across sample sizes and JLSsc consistently ranking second (Figure 5C–D) while DMRcate and iEVORA demonstrated only moderate discrimination capabilities, indicating limited adaptability to combined mean–variance effects. Consistent performance was observed under equal group sizes, with ReMeDy maintaining favorable results across evaluation metrics and under varying mean and variance effects (SFigure 11–27).
Figure 5: Performance of ReMeDy and competing methods in the DVMR scenario.
This panel plot compares ReMeDy with other competing methods in DVMR scenario across three sample sizes (50, 100, and 150) with mean effect of 0.7, variance effect of 2.5 and unequal group proportions. Panel A shows statistical power, Panel B shows FDR, Panel C shows the MCC score and Panel D shows AUROC scores. Each bar reflects median of 100 simulation runs. The dashed red line in Panel B marks the nominal FDR threshold of 0.05. Higher values in Panels A, C, and D indicate better performance. Methods without a visible bar achieved value of zero.
3.4. Re-analysis of population-level DNAm array dataset
We used ReMeDy to analyze a publicly available DNAm array dataset from the Gene Expression Omnibus database (GEO). The dataset (GSE306095) was generated using the Illumina EPIC platform, and quality control and batch effect correction were performed using the ChAMP63 pipeline and ComBat61 (See supplementary section 1.3 for more details). This dataset investigates DNAm changes in B-cell acute lymphoblastic leukemia (B-ALL) and comprises of 57 samples, including 53 B-ALL patients and 4 non-leukemia controls, with methylation measured at 719,234 CpG sites69. Childhood ALL is a blood and bone marrow malignancy characterized by uncontrolled proliferation of immature lymphoid cells and represents the most common pediatric cancer, with symptoms including fatigue, bruising, fever, bone pain, and recurrent infections70. Standard treatment involves multi-phase chemotherapy, often combined with targeted or immunotherapy, resulting in high long-term survival rates71;72. We selected this dataset because it represents a very recent EWAS that focused exclusively on identifying mean methylation differences associated with B-ALL, thereby overlooking other important components of methylation dysregulation, such as differential variability and joint mean–variance effects69. The original study reported a predominantly hypermethylated profile in B-ALL and identified 1,056 DMRs enriched in pathways related to cell proliferation and neuronal signaling. Strong associations were also observed at CpGs within MAD1L1 and RPTOR. However, this analysis primarily focused on mean differences, providing an incomplete view of the B-ALL associated methylation landscape69. By applying ReMeDy, we aim to recover additional variance-driven and joint mean–variance methylation signals that may further refine the epigenetic characterization of B-ALL and reveal complementary disease-associated patterns not captured by the current standard of EWAS.
In addition to analyzing the B-ALL dataset with ReMeDy, we performed comparative analyses using DMR-cate, JLSsc, and iEVORA, as these methods can also identify all three components of methylation dysregulation. ReMeDy identified the largest number of CpGs in the B-ALL dataset (35,833 CpGs grouped into 7,246 regions), compared with DMRcate (6,134 CpGs), JLSsc (1,920 CpGs), and iEVORA (3 CpGs) (Figure 6). Among the regions detected by ReMeDy, 383 were classified as DMRs (1,680 CpGs), 3 as VMRs (12 CpGs), and 6,860 as DVMRs (34,141 CpGs). To ensure fair comparison across methods that output either individual CpGs or regions containing multiple CpGs, overlaps were assessed at the CpG level. ReMeDy and DMRcate shared 1,380 CpGs, corresponding to an overlap of approximately 22.5%, indicating concordance between the two region-based approaches. Notably, ReMeDy uniquely identified 34,426 CpGs across 7,000 regions, the majority of which were DVMRs (6,622 regions comprising 32,774 CpGs), along with additional unique DMRs (375 regions, 1,640 CpGs) and VMRs (3 regions, 12 CpGs). Overall, these results demonstrate that ReMeDy achieves both agreement with established methods and, consistent with the simulation studies, exhibits substantially broader detection power by capturing additional methylation signals that are not identified by existing approaches.
Figure 6: Upset plot of CpG-level overlap across four methylation analysis methods in the pediatric B-ALL dataset.
Using four methods, we summarize the overlap of CpGs within the regions identified as differentially methylated, variably methylated or both between B-ALL patients (N = 53) and non-leukemia controls (N = 4) in the pediatric B-ALL dataset generated using the Illumina EPIC array. Numbers in parentheses indicate the total number of CpGs detected by each method. Intersections are evaluated at the CpG level to allow fair comparison between region-based and site-based approaches. CpGs with adjusted p-values smaller than were considered significant by each method.
Following data analysis, we performed functional bioinformatics analyses to assess the biological relevance of CpGs located in regions uniquely identified by ReMeDy (See supplementary section 1.4 for more details). CpG island annotation showed that a substantial proportion of these CpGs mapped to CpG islands (55.2%), with additional enrichment in open sea (20.2%) and shore regions, indicating that ReMeDy captures both promoter-proximal and distal regulatory methylation changes (Figure 7B). Genic annotation further revealed that most CpGs were located within gene-associated regions, including promoters (34.1%), regions 1–5 kb upstream of transcription start sites (24.7%), introns (26.2%), and exons (10.1%), highlighting their potential regulatory impact on gene expression (Figure 7C). Among regions uniquely identified by ReMeDy, 18.3% exhibited low effect sizes, while the majority (81.7%) showed moderate to high effect sizes, indicating sensitivity to both subtle and strong methylation changes. The top 10 significant regions and their corresponding effect sizes are shown in Figure 7D. Pathway-level analyses identified enrichment of transmembrane receptor protein tyrosine phosphatase signaling among the regions uniquely identified by ReMeDy (Figure 7A), driven in part by PTPRD, a receptor-type protein tyrosine phosphatase frequently disrupted or deleted in pediatric and adult ALL, including at relapse73;74. Given the central role of receptor-type protein tyrosine phosphatases in intracellular signal transduction, whose loss promotes aberrant signaling and leukemic progression75, these results support the biological plausibility of regions uniquely identified by ReMeDy. Consistent with this, Gene Ontology (GO)76 enrichment analysis further highlighted Wnt signaling as a key pathway associated with regions uniquely identified by ReMeDy (Figure 7E), with multiple genes (WNT3A, WNT7A, ROR2, FZD10, CSNK1E) involved in canonical and non-canonical Wnt cascades. Dysregulation of Wnt signaling is known to influence hematopoietic stem cell self-renewal, leukemic cell survival, aberrant -catenin activation, and treatment resistance in ALL77;78. Together, these findings indicate that ReMeDy captures methylation dysregulation linked to biologically meaningful pathways central to disease pathogenesis.
Figure 7:
This figure summarizes the genomic context, functional annotation and effect sizes of regions uniquely identified by ReMeDy. Panel A shows GO biological process enrichment derived from region-based analysis using goregion()88, where genomic regions were directly used to identify enriched functional terms; point size indicates the number of genes associated with each term and the x-axis shows −log10( – values). Panel B shows the distribution of these regions relative to CpG islands, including islands, shores, shelves and open sea regions. Panel C shows the genic annotation of CpGs within these regions, categorized by promoters, regions 1–5 kb upstream of transcription start sites, exons, introns, untranslated regions and intergenic regions. Panel D shows the log2FC of the top 10 regions ranked by effect size, with positive and negative values indicating upregulated and downregulated regions, respectively. Panel E shows GO biological process enrichment based on genes associated with ReMeDy-specific regions, obtained using genelevel enrichment analysis (enrichGO89), with bar lengths representing the number of genes contributing to each pathway.
We further mapped CpGs in regions uniquely identified by ReMeDy to their nearest genes, revealing several biologically relevant genes associated with B-ALL. Among the 7,000 regions uniquely identified by ReMeDy, one of the strongest signals (chr22:49826097–49826265) mapped to BRD1, an epigenetic regulator reported as a rare fusion partner of PAX5 in childhood B-cell precursor ALL, supporting its functional relevance in leukemogenesis79;80. Two additional regions on chromosome 21 (chr21:34886125–34886300 and chr21:34886769–34887501) mapped to RUNX1, a key hematopoietic transcription factor frequently involved in childhood B-ALL through the ETV6::RUNX1 fusion, which drives leukemic transformation in a major clinical subtype81. Recent studies have also reported rare pathogenic RUNX1 point mutations in relapsed B-ALL, including a novel p.Leu148Gln variant associated with poor clinical outcome82. Another region (chr21:38499362–38499580) mapped to ERG, an ETS-family transcription factor implicated in a distinct subtype of B-cell precursor ALL. Recurrent ERG deletions occur predominantly in DUX4-rearranged B-ALL, where DUX4 deregulation promotes expression of a dominant-negative ERG isoform or predisposes the locus to deletion, contributing to leukemogenesis and defining a biologically distinct subgroup83;84. The presence of these well-established B-ALL associated genes among regions uniquely detected by ReMeDy supports its ability to identify biologically meaningful epigenetic signals. At the same time, a subset of regions uniquely identified ReMeDy mapped to genes with no currently known links to B-ALL, suggesting the potential discovery of novel regulatory elements or candidate biomarkers that warrant further functional and clinical investigation.
4. Discussion
In this study, we introduced ReMeDy, an integrated statistical framework for region-based identification of methylation dysregulation, encompassing differential methylation, differential variability, and their joint effects. Methodologically, ReMeDy is built on a hierarchical generalized linear modeling framework that naturally incorporates correlation among CpG sites and enables joint mean–variance modeling with respect to the phenotype of interest and covariates under one statistical framework, thereby addressing key shortcomings of current EWAS pipelines (Table 1). With a computationally efficient hierarchical likelihood–based estimation framework and region-wise model fitting, ReMeDy is scalable to DNAm datasets comprising tens of thousands of genomic regions measured across large sample cohorts. While the primary use case presented here focuses on univariate analysis with a binary phenotype, extension to continuous or discrete phenotypes, multivariable modeling with multiple covariates or confounders, and incorporation of additional random effects is straightforward. Beyond array-based methylation data, the ReMeDy framework is readily extensible to sequencing-based methylation data and other high-dimensional modalities with correlated structure, including histone modification profiles, ATAC-seq, copy number variation, and microRNA expression data. Collectively, these features position ReMeDy as a flexible and powerful statistical framework that expands the scope of methylation dysregulation detectable in EWAS and other related genomic analyses.
Table 1:
This table summarizes key methodological characteristics and performance of ReMeDy and other methods for identifying methylation dysregulation. The first column lists the methods. The second and third columns indicate whether each method explicitly models mean and variance effects, respectively. The fourth column specifies whether CpG-level correlation within regions is accounted for. The fifth column indicates whether the method provides estimated effect sizes and associated standard errors. The sixth column summarizes empirical power, and the final column reports FDR control at the nominal 5% level. The methods in italics perform region-based analysis.
|
feature supported,
feature not supported.
● high, ◐ moderate, ○ low.
Operates on precomputed CpG-level p-values rather than raw methylation data.
Mean effects used in a secondary weighting step rather than a joint model.
Given the substantial body of existing methodological work in this area, it is important to clearly articulate the practical strengths and implementation advantages of ReMeDy. First, ReMeDy operates directly on biologically defined co-methylated regions, allowing it to naturally account for the inherent correlation structure present in DNAm array data, rather than reconstructing regions post hoc from site-level association statistics using smoothing, clustering, or p-value combination approaches. Second, unlike many existing region-based methods, ReMeDy does not rely on heuristic tuning parameters such as smoothing spans, kernel bandwidths, autocorrelation windows, or CpG-level significance thresholds that can substantially influence results and introduce subjectivity65;38;48;64. For example, DMRcate requires users to specify kernel bandwidth and scaling parameters, where different choices can lead to markedly different region sizes and numbers of discoveries. In contrast, ReMeDy performs inference within a single, coherent statistical model, yielding results that are robust to arbitrary user-defined settings. Third, ReMeDy achieves computational efficiency by replacing full marginal likelihood evaluation with hierarchical likelihood optimization, thereby avoiding high-dimensional numerical integration and enabling stable and efficient parameter estimation without the need for repeated resampling or intensive permutation procedures (SFigure 28). Fourth, ReMeDy can incorporate multiple random effects, allowing it to model subject-level variation alongside additional sources of correlation, such as tissue-specific effects85, thereby making it well suited for emerging multi-tissue methylation studies86;87. Finally, ReMeDy has been implemented in R in a parallelizable manner and is freely available through our GitHub repository, facilitating scalable computation, accessibility, reproducibility, and future methodological development by the broader research community.
The extensive null and power simulations conducted in this study provide strong empirical support for the statistical validity of ReMeDy. Under null scenarios, ReMeDy consistently maintained type-I error rates close to the nominal level across a range of sample sizes, demonstrating well-calibrated inference despite the added complexity of joint mean–variance modeling and region-level correlation structures. In contrast, several existing methods exhibited either excessive conservatism or mild inflation, particularly in small-sample settings, highlighting the challenges of error control in spatially correlated methylation data40;39. Across power simulations, ReMeDy showed robust and often superior performance in detecting true differential, variable, and joint mean–variance methylation signals, with gains becoming more pronounced as sample size increased. Notably, ReMeDy retained strong performance in scenarios driven purely by variance effects or by combined mean–variance changes, settings in which many competing approaches showed limited sensitivity due to their reliance on mean-only or sequential testing strategies. These results demonstrate that ReMeDy achieves a favorable balance between statistical power and false discovery control across a broad range of realistic EWAS conditions.
Application of ReMeDy to a analyze a recent pediatric ALL data demonstrated the practical value of moving beyond mean-centric analyses. While the original study focused exclusively on DM, ReMeDy identified a substantially broader set of ALL-associated genomic regions, the majority of which exhibited joint mean–variance dysregulation. Functional annotation showed that genomic regions uniquely identified by ReMeDy but missed by existing models were enriched in regulatory genomic contexts and displayed moderate-to-large effect sizes, supporting their biological relevance. Pathway enrichment analyses further identified signaling processes, including receptor-type protein tyrosine phosphatase and Wnt signaling, which are well established in B-ALL biology but were not fully captured by mean-based or variance-based models alone. Consistent with these findings, gene-level mapping revealed well-characterized B-ALL-associated genes such as BRD1, RUNX1, and ERG, reinforcing the biological plausibility of the detected signals. At the same time, a subset of regions mapped to loci without known links to B-ALL, highlighting ReMeDy’s potential to identify novel regulatory elements and candidate biomarkers. Although further experimental validation is required to establish functional relevance, the ability of ReMeDy to recover both known and previously uncharacterized signals underscores its utility for hypothesis generation and illustrates how integrated region-level joint mean–variance modeling enables a more complete and nuanced characterization of methylation dysregulation in complex diseases.
Supplementary Material
Acknowledgments
This research was supported in part by Lilly Endowment, Inc., through its support for the Indiana University Pervasive Technology Institute.
Footnotes
Competing Interests
The authors declare that they have no competing interests.
Availability of Data and Materials
The 450K datasets are accessible from the GEO database under accession number GSE44667 and GSE80970. The EPIC dataset is accessible from the GEO database under accession number GSE306095. Code for ReMeDy, simulations, and comparisons of methods identifying methylation dysregulation on the various datasets explored are available on GitHub at https://github.com/SChatLab/ReMeDy. The source code is licensed under an MIT License.
References
- [1].Jaenisch Rudolf and Bird Adrian. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nature Genetics, 33(3):245–254, 2003. [DOI] [PubMed] [Google Scholar]
- [2].Jones Peter A. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nature Reviews Genetics, 13(7):484–492, 2012. [Google Scholar]
- [3].Moore Lisa D, Le Thuc, and Fan Guoping. DNA methylation and its basic function. Neuropsychopharmacology, 38(1):23–38, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Sandoval Juan, Heyn Holger, Moran Sebastian, Serra-Musach Jordi, Pujana Miguel A, Bibikova Marina, and Esteller Manel. Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome. Epigenetics, 6(6):692–702, 2011. [DOI] [PubMed] [Google Scholar]
- [5].Moran Sebastian, Arribas Carles, and Esteller Manel. Validation of a DNA methylation microarray for 850,000 CpG sites of the human genome enriched in enhancer sequences. Epigenomics, 8(3):389–399, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Pidsley Ruth, Zotenko Elena, Peters Timothy J, Lawrence Mitchell G, Risbridger Gail P, Molloy Peter, Van Djik Susan, Muhlhausler Beverly, Stirzaker Clare, and Clark Susan J. Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biology, 17(1):208, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Eckhardt Florian, Lewin Joern, Cortese Rene, Rakyan Vardhman K, Attwood John, Burger Matthias, Burton John, Cox Tony V, Davies Rob, Down Thomas A, et al. DNA methylation profiling of human chromosomes 6, 20 and 22. Nature Genetics, 38(12):1378–1385, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Bell Jordana T, Tsai Pei-Chien, Yang Tsun-Po, Pidsley Ruth, Nisbet James, Glass Daniel, Mangino Massimo, Zhai Guangju, Zhang Feng, Valdes Ana, et al. Epigenome-wide scans identify differentially methylated regions for age and age-related phenotypes in a healthy ageing population. PLoS Genetics, 8(4):e1002629, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Shafi Adib, Mitrea Cristina, Nguyen Tin, and Draghici Sorin. A survey of the approaches for identifying differential methylation using bisulfite sequencing data. Briefings in Bioinformatics, 19(5):737–753, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Irizarry Rafael A, Ladd-Acosta Christine, Wen Bo, Wu Zhijin, Montano Carolina, Onyango Patrick, Cui Hengmi, Gabo Kevin, Rongione Michael, Webster Maree, et al. The human colon cancer methylome shows similar hypo-and hypermethylation at conserved tissue-specific CpG island shores. Nature Genetics, 41(2):178–186, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Jaffe Andrew E and Irizarry Rafael A. Accounting for cellular heterogeneity is critical in epigenome-wide association studies. Genome Biology, 15(2):R31, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Teschendorff Andrew E, Marabita Francesco, Lechner Matthias, Bartlett Thomas, Tegner Jesper, Gomez-Cabrero David, and Beck Stephan. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics, 29(2):189–196, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Feinberg Andrew P, Ohlsson Rolf, and Henikoff Steven. The epigenetic progenitor origin of human cancer. Nature Reviews Genetics, 7(1):21–33, 2006. [Google Scholar]
- [14].Peters Timothy J., Buckley Matthew J., Statham Aaron L., Pidsley Ruth, Samaras Kate, Lord Robert V., Clark Susan J., and Molloy Paul L.. De novo identification of differentially methylated regions in the human genome. Epigenetics & Chromatin, 8:6, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Lent Samantha, Xu Hanfei, Wang Lan, Wang Zhe, Sarnowski Chloé, Hivert Marie-France, and Dupuis Josée. Comparison of novel and existing methods for detecting differentially methylated regions. BMC Genetics, 19(Suppl 1):84, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Slieker Roderick C, Bos Steffan D, Goeman Jelle J, Bovée Judith VMG, Talens Rudolf P, Van Der Breggen Ruud, Suchiman H Eka D, Lameijer Eric-Wubbo, Putter Hein, Van Den Akker Erik B, et al. Identification and systematic annotation of tissue-specific differentially methylated regions using the Illumina 450k array. Epigenetics & Chromatin, 6(1):26, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Rakyan Vardhman K, Beyan Huriya, Down Thomas A, Hawa Mohammed I, Maslau Siarhei, Aden Deeqo, Daunay Antoine, Busato Florence, Mein Charles A, Manfras Burkhard, et al. Identification of type 1 diabetes-associated DNA methylation variable positions that precede disease diagnosis. PLoS Genetics, 7(9):e1002300, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Michels Karin B, Binder Alexandra M, Dedeurwaerder Sarah, Epstein Charles B, Greally John M, Gut Ivo, Houseman E Andres, Izzi Benedetta, Kelsey Karl T, Meissner Alexander, et al. Recommendations for the design and analysis of epigenome-wide association studies. Nature Methods, 10(10):949–955, 2013. [DOI] [PubMed] [Google Scholar]
- [19].Rakyan Vardhman K, Down Thomas A, Balding David J, and Beck Stephan. Epigenome-wide association studies for common human diseases. Nature Reviews Genetics, 12(8):529–541, 2011. [Google Scholar]
- [20].Relton Caroline L and Smith George Davey. Epigenetic epidemiology of common complex disease: prospects for prediction, prevention, and treatment. PLoS Medicine, 7(10):e1000356, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Ritchie Matthew E, Phipson Belinda, Wu DI, Hu Yifang, Law Charity W, Shi Wei, and Smyth Gordon K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research, 43(7):e47–e47, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Maksimovic Jovana, Phipson Belinda, and Oshlack Alicia. A cross-package bioconductor workflow for analysing methylation array data. F1000Research, 5:1281, 2017. [Google Scholar]
- [23].Saddiki Hachem, Colicino Elena, and Lesseur Corina. Assessing differential variability of high-throughput DNA methylation data. Current Environmental Health Reports, 9(4):625–630, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Phipson Belinda and Oshlack Alicia. DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging. Genome Biology, 15(9):465, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Teschendorff Andrew E and Relton Caroline L. Statistical and integrative system-level analysis of DNA methylation data. Nature Reviews Genetics, 19(3):129–147, 2018. [Google Scholar]
- [26].Kuan Pei Fen, Wang Sijian, Zhou Xin, and Chu Haitao. A statistical framework for Illumina DNA methylation arrays. Bioinformatics, 26(22):2849–2855, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Teschendorff Andrew E and Widschwendter Martin. Differential variability improves the identification of cancer risk markers in DNA methylation studies profiling precursor cancer lesions. Bioinformatics, 28(11):1487–1494, 2012. [DOI] [PubMed] [Google Scholar]
- [28].Slieker Roderick C, van Iterson Maarten, Luijk René, Beekman Marian, Zhernakova Daria V, Moed Matthijs H, Mei Hailiang, Van Galen Michiel, Deelen Patrick, Bonder Marc Jan, et al. Age-related accrual of methylomic variability is linked to fundamental ageing mechanisms. Genome Biology, 17(1):191, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Feinberg Andrew P and Irizarry Rafael A. Stochastic epigenetic variation as a driving force of development, evolutionary adaptation, and disease. Proceedings of the National Academy of Sciences, 107(suppl_1):1757–1764, 2010. [Google Scholar]
- [30].Webster Amy P, Plant Darren, Ecker Simone, Zufferey Flore, Bell Jordana T, Feber Andrew, Paul Dirk S, Beck Stephan, Barton Anne, Williams Frances MK, et al. Increased DNA methylation variability in rheumatoid arthritis-discordant monozygotic twins. Genome Medicine, 10(1):64, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].Hansen Kasper Daniel, Timp Winston, Bravo Héctor Corrada, Sabunciyan Sarven, Langmead Benjamin, McDonald Oliver G, Wen Bo, Wu Hao, Liu Yun, Diep Dinh, et al. Increased methylation variation in epigenetic domains across cancer types. Nature Genetics, 43(8):768–775, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Suman Medha, Dugué Pierre-Antoine, Wong Ee Ming, Joo JiHoon Eric, Hopper John L, Nguyen-Dumont Tu, Giles Graham G, Milne Roger L, McLean Catriona, and Southey Melissa C. Association of variably methylated tumour DNA regions with overall survival for invasive lobular breast cancer. Clinical Epigenetics, 13(1):11, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Huo Zhiguang, Zhu Yun, Yu Lei, Yang Jingyun, De Jager Philip, Bennett David A, and Zhao Jinying. DNA methylation variability in Alzheimer’s disease. Neurobiology of Aging, 76:35–44, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Kiltschewskij Dylan J, Reay William R, and Cairns Murray J. Schizophrenia is associated with altered DNA methylation variance. Molecular Psychiatry, 30(4):1383–1395, 2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Pia Campagna Maria, Xavier Alexandre, Lechner-Scott Jeannette, Maltby Vicky, Scott Rodney J, Butzkueven Helmut, Jokubaitis Vilija G, and Lea Rodney A. Epigenome-wide association studies: current knowledge, strategies and recommendations. Clinical Epigenetics, 13(1):214, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Lussier Alexandre A, Zhu Yiwen, Smith Brooke J, Simpkin Andrew J, Smith Andrew DAC, Suderman Matthew J, Walton Esther, Ressler Kerry J, and Dunn Erin C. Updates to data versions and analytic methods influence the reproducibility of results from epigenome-wide association studies. Epigenetics, 17(11):1373–1388, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Sahoo Karishma and Sundararajan Vino. Methods in DNA methylation array dataset analysis: A review. Computational and Structural Biotechnology Journal, 23:2304–2325, 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Jaffe Andrew E, Murakami Peter, Lee Hwajin, Leek Jeffrey T, Fallin M Daniele, Feinberg Andrew P, and Irizarry Rafael A. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. International Journal of Epidemiology, 41(1):200–209, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Zheng Yuanchao, Lunetta Kathryn L, Liu Chunyu, Katrinli Seyma, Smith Alicia K, Miller Mark W, and Logue Mark W. An evaluation of the genome-wide false positive rates of common methods for identifying differentially methylated regions using illumina methylation arrays. Epigenetics, 17(13):2241–2258, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Mallik Saurav, Odom Gabriel J, Gao Zhen, Gomez Lissette, Chen Xi, and Wang Lily. An evaluation of supervised methods for identifying differentially methylated regions in illumina methylation arrays. Briefings in Bioinformatics, 20(6):2224–2235, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Alhassan Daniel, Olbricht Gayla R, and Adekpedjou Akim. Differential methylation region detection via an array-adaptive normalized kernel-weighted model. PLoS One, 19(6):e0306036, 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Staley James R, Windmeijer Frank, Suderman Matthew, Lyon Matthew S, Smith George Davey, and Tilling Kate. A robust mean and variance test with application to high-dimensional phenotypes. European Journal of Epidemiology, 37(4):377–387, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Kruppa Jochen, Sieg Miriam, Richter Gesa, and Pohrt Anne. Estimands in epigenome-wide association studies. Clinical Epigenetics, 13, 042021. [Google Scholar]
- [44].Hannon Eilis, Mansell Georgina, Walker Emma, Nabais Marta, Burrage Joe, Kepa Agnieszka, Best-Lane Janis, Rose Anna, Heck Suzanne, Moffitt Terrie, Caspi Avshalom, Arseneault Louise, and Mill Jonathan. Assessing the co-variability of DNA methylation across peripheral cells and tissues: Implications for the interpretation of findings in epigenetic epidemiology. PLOS Genetics, 17:e1009443, 032021. [Google Scholar]
- [45].Freydenzon Anna, Nabais Marta, Lin Tian, Williams Kelly, Wallace Leanne, Henders Anjali, Blair Ian, Wray Naomi, Pamphlett Roger, and McRae Allan. Association between DNA methylation variability and self-reported exposure to heavy metals. Scientific Reports, 12:10582, 062022. [Google Scholar]
- [46].Deaton Aimée M and Bird Adrian. CpG islands and the regulation of transcription. Genes & Development, 25(10):1010–1022, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [47].Gomez Lissette, Odom Gabriel J, Young Juan I, Martin Eden R, Liu Lizhong, Chen Xi, Griswold Anthony J, Gao Zhen, Zhang Lanyu, and Wang Lily. coMethDMR: accurate identification of co-methylated and differentially methylated regions in epigenome-wide association studies with continuous phenotypes. Nucleic Acids Research, 47(17):e98–e98, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Pedersen Brent S, Schwartz David A, Yang Ivana V, and Kechris Katerina J. Comb-p: software for combining, analyzing, grouping and correcting spatially correlated p-values. Bioinformatics, 28(22):2986–2988, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].Laird Nan M and Ware James H. Random-effects models for longitudinal data. Biometrics, pages 963–974, 1982. [PubMed] [Google Scholar]
- [50].Liang Kung-Yee and Zeger Scott L. Longitudinal data analysis using generalized linear models. Biometrika, 73(1):13–22, 1986. [Google Scholar]
- [51].Sofer Tamar, Schifano Elizabeth, Hoppin Jane, Hou Lifang, and Baccarelli Andrea. A-clustering: A Novel Method for the Detection of Co-regulated Methylation Regions, and Regions Associated with Exposure. Bioinformatics (Oxford, England), 29, 082013. [Google Scholar]
- [52].Oluwayiose Oladele A, Wu Haotian, Gao Feng, Baccarelli Andrea A, Sofer Tamar, and Pilsner J Richard. Aclust2.0: a revamped unsupervised R tool for Infinium methylation beadchips data analyses. Bioinformatics, 38(20):4820–4822, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].McCulloch Charles E. An introduction to generalized linear mixed models. 1996.
- [54].Breslow Norman E and Clayton David G. Approximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88(421):9–25, 1993. [Google Scholar]
- [55].Lee Youngjo and Nelder John A. Hierarchical generalized linear models. Journal of the Royal Statistical Society Series B: Statistical Methodology, 58(4):619–656, 1996. [Google Scholar]
- [56].Lee Youngjo and Nelder John A. Hierarchical generalised linear models: a synthesis of generalised linear models, random-effect models and structured dispersions. Biometrika, 88(4):987–1006, 2001. [Google Scholar]
- [57].Lee Youngjo, Nelder John A., and Pawitan Yudi. Generalized Linear Models with Random Effects: Unified Analysis via H-likelihood, volume 106 of Monographs on Statistics and Applied Probability. Chapman & Hall/CRC, Boca Raton, FL, 2006. [Google Scholar]
- [58].Liu Yaowu and Xie Jun. Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures. Journal of the American Statistical Association, 115(529):393–402, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Benjamini Yoav and Hochberg Yosef. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological), 57:289–300, 011995. [Google Scholar]
- [60].Aryee Martin J, Jaffe Andrew E, Corrada-Bravo Hector, Ladd-Acosta Christine, Feinberg Andrew P, Hansen Kasper D, and Irizarry Rafael A. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics, 30(10):1363–1369, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [61].Johnson W Evan, Li Cheng, and Rabinovic Ariel. Adjusting batch effects in microarray expression data using empirical bayes methods. Biostatistics, 8(1):118–127, 2007. [DOI] [PubMed] [Google Scholar]
- [62].Meshram Siddhant, Fadikar Arindam, Arunkumar Ganesan, and Chatterjee Suvo. A spatially-aware unsupervised pipeline to identify co-methylation regions in DNA methylation data. bioRxiv, pages 2025–11, 2025. [Google Scholar]
- [63].Morris Tiffany J, Butcher Lee M, Feber Andrew, Teschendorff Andrew E, Chakravarthy Ankur R, Wojdacz Tomasz K, and Beck Stephan. ChAMP: 450k chip analysis methylation pipeline. Bioinformatics, 30(3):428–430, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Suderman Matthew, Staley James R, French Robert, Arathimos Ryan, Simpkin Andrew, and Tilling Kate. Dmrff: Identifying differentially methylated regions efficiently with power and control. bioRxiv, page 508556, 2018. [Google Scholar]
- [65].Peters Timothy J, Buckley Michael J, Statham Aaron L, Pidsley Ruth, Samaras Katherine, Lord Reginald V, Clark Susan J, and Molloy Peter L. De novo identification of differentially methylated regions in the human genome. Epigenetics & Chromatin, 8(1):6, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [66].Teschendorff Andrew E, Gao Yang, Jones Allison, Ruebner Matthias, Beckmann Matthias W, Wachter David L, Fasching Peter A, and Widschwendter Martin. DNA methylation outliers in normal breast tissue identify field defects that are enriched in cancer. Nature Communications, 7(1):10478, 2016. [Google Scholar]
- [67].Bartlett Maurice. Properties of Sufficiency an Statistical Tests. Proceedings of The Royal Society A: Mathematical, Physical and Engineering Sciences, 160:268–282, 011937. [Google Scholar]
- [68].Levene H.. Robust tests for equality of variances. Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, 2:278–292, 011960. [Google Scholar]
- [69].Fong-López Alan Alberto, Núñez-Enríquez Juan Carlos, Bekker-Méndez Vilma Carolina, Flores-Lujano Janet, Mata-Rocha Minerva, Jiménez-Hernández Elva, Patricia Ortíz-Maganda Mónica, Xavier Guerra-Castillo Francisco, Medina-Sanson Aurora, Alfonso Martín-Trejo Jorge, et al. Comprehensive Epigenome-Wide Profiling Reveals Distinctive DNA Methylation Signatures and Potential Prognostic Biomarkers in Mexican Pediatric B-ALL. International Journal of Molecular Sciences, 26(21):10261, 2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [70].Cancer Research UK. Acute lymphoblastic leukaemia (all). https://www.cancerresearchuk.org/about-cancer/acute-lymphoblastic-leukaemia-all, 2024. Accessed: 2025-12-04.
- [71].National Cancer Institute (PDQ) Adult Treatment Editorial Board. Adult acute lymphoblastic leukemia treatment (pdq)-patient version. https://www.cancer.gov/types/leukemia/patient/adult-all-treatment-pdq, 2025. Accessed: 2025-12-04.
- [72].St. Jude Children’s Research Hospital. Acute lymphoblastic leukemia (all) – care and treatment for childhood cancer. https://www.stjude.org/care-treatment/treatment/childhood-cancer/leukemia-lymphoma/acute-lymphoblastic-leukemia-all.html, 2025. Accessed: 2025-12-04.
- [73].Kawamata Norihiko, Ogawa Seishi, Seeger Karl, Kirschner-Schwabe Renate, Huynh Thien, Chen John, Megrabian Nairi, Harbott Jochen, Zimmermann Martin, Henze Günter, et al. Molecular allelokaryotyping of relapsed pediatric acute lymphoblastic leukemia. International Journal of Oncology, 34(6):1603–1612, 2009. [DOI] [PubMed] [Google Scholar]
- [74].Okamoto Ryoko, Ogawa Seishi, Nowak Daniel, Kawamata Norihiko, Akagi Tadayuki, Kato Motohiro, Sanada Masashi, Weiss Tamara, Haferlach Claudia, Dugas Martin, et al. Genomic profiling of adult acute lymphoblastic leukemia by single nucleotide polymorphism oligonucleotide microarray and comparison to pediatric acute lymphoblastic leukemia. Haematologica, 95(9):1481, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [75].Tonks Nicholas K. Protein tyrosine phosphatases: from genes, to function, to disease. Nature Reviews Molecular Cell Biology, 7(11):833–846, 2006. [DOI] [PubMed] [Google Scholar]
- [76].Thomas Paul D., Ebert Dustin, Muruganujan Anushya, Mushayahama Tharshana, Albou Laurent P., and Mi Huaiyu. PANTHER: Making genome-scale phylogenetics accessible to all. Protein Science, 31(1):8–22, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [77].Lento William, Congdon Kendra, Voermans Carlijn, Kritzik Marcie, and Reya Tannishtha. Wnt signaling in normal and malignant hematopoiesis. Cold Spring Harbor Perspectives in Biology, 5(2):a008011, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [78].Ruan Yongsheng, Kim Hye Na, Ogana Heather, and Kim Yong-Mi. Wnt signaling in leukemia and its bone marrow microenvironment. International Journal of Molecular Sciences, 21(17):6247, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [79].Atlas of Genetics and Cytogenetics in Oncology and Haematology. t(9;22)(p13;q13) — pax5–brd1 fusion. https://atlasgeneticsoncology.org/haematological/1562/t(9;22)(p13;q13)-pax5-brd1, 2025. Accessed: 2025-12-04.
- [80].Nebral Karin, Denk Dagmar, Attarbaschi Andishe, König Margit, Mann Georg, Haas Oskar A, and Strehl Sabine. Incidence and diversity of PAX5 fusion genes in childhood acute lymphoblastic leukemia. Leukemia, 23(1):134–143, 2009. [DOI] [PubMed] [Google Scholar]
- [81].Sun Congcong, Chang Lixian, and Zhu Xiaofan. Pathogenesis of ETV6/RUNX1-positive childhood acute lymphoblastic leukemia and mechanisms underlying its relapse. Oncotarget, 8(21):35445, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [82].Qipa Egzona, Acar Muradiye, Bozkurt Sureyya, Buyukdogan Murat, Sonmez Hazal B, Sayitoglu Muge, Erbilgin Yucel, Karakaş Zeynep, and Hançer Veysel S. Novel RUNX1 variation in B-cell acute lymphoblastic leukemia. Mediterranean Journal of Hematology and Infectious Diseases, 15(1):e2023036, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [83].Zaliova Marketa, Potuckova Eliska, Hovorkova Lenka, Musilova Alena, Winkowska Lucie, Fiser Karel, Stuchly Jan, Mejstrikova Ester, Starkova Julia, Zuna Jan, et al. ERG deletions in childhood acute lymphoblastic leukemia with dux4 rearrangements are mostly polyclonal, prognostically relevant and their detection rate strongly depends on screening method sensitivity. Haematologica, 104(7):1407, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [84].Zhang Jinghui, McCastlain Kelly, Yoshihara Hiroki, Xu Beisi, Chang Yunchao, Churchman Michelle L, Wu Gang, Li Yongjin, Wei Lei, Iacobucci Ilaria, et al. Deregulation of DUX4 and ERG in acute lymphoblastic leukemia. Nature Genetics, 48(12):1481–1489, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [85].Nishitani Shota, Isozaki Makoto, Yao Akiko, Higashino Yoshifumi, Yamauchi Takahiro, Kidoguchi Masamune, Kawajiri Satoshi, Tsunetoshi Kenzo, Neish Hiroyuki, Imoto Hirochika, et al. Cross-tissue correlations of genome-wide DNA methylation in japanese live human brain and blood, saliva, and buccal epithelial tissues. Translational Psychiatry, 13(1):72, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [86].Jacques Macsue, Seale Kirsten, Voisin Sarah, Lysenko Anna, Grolaux Robin, Jones-Freeman Bernadette, Lamon Severine, Levinger Itamar, Bauer Carlie, Sharples Adam P, et al. DNA methylation ageing atlas across 17 human tissues. bioRxiv, pages 2025–07, 2025. [Google Scholar]
- [87].Mendonça Vanessa, Soares-Lima Sheila Coelho, and Martins Moreira Miguel Angelo. Exploring cross-tissue DNA methylation patterns: blood–brain CpGs as potential neurodegenerative disease biomarkers. Communications Biology, 7(1):904, 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [88].Phipson Belinda, Maksimovic Jelena, and Oshlack Alicia. missmethyl: an R package for analyzing data from Illumina’s HumanMethylation450 platform. Bioinformatics, 32(2):286–288, 2016. [DOI] [PubMed] [Google Scholar]
- [89].Yu Guangchuang, Wang Li-Gen, Han Yan, and He Qing-Yu. clusterprofiler: an R package for comparing biological themes among gene clusters. OMICS, 16(5):284–287, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The 450K datasets are accessible from the GEO database under accession number GSE44667 and GSE80970. The EPIC dataset is accessible from the GEO database under accession number GSE306095. Code for ReMeDy, simulations, and comparisons of methods identifying methylation dysregulation on the various datasets explored are available on GitHub at https://github.com/SChatLab/ReMeDy. The source code is licensed under an MIT License.







