Summary
Cells store information by means of chromatin modifications that persist through cell divisions and can hold gene expression silenced over generations. However, how these modifications may maintain other gene expression states has remained unclear. This study shows that chromatin modifications can maintain a wide range of gene expression levels over time, thus uncovering analog epigenetic memory. By engineering a genomic reporter and epigenetic effectors, we tracked the gene expression dynamics following targeted perturbations to the chromatin state. We found that distinct grades of DNA methylation led to corresponding, persistent gene expression levels. Altering the DNA methylation grade, in turn, resulted in permanent loss of gene expression memory. Consistent with experiments, our chromatin modification model indicates that analog memory arises when the positive feedback between DNA methylation and repressive histone modifications is lacking. This discovery will lead to a deeper understanding of epigenetic memory and to new tools for synthetic biology.
Keywords: epigenetics, synthetic biology, systems biology, computational biology, genome engineering
Graphical abstract

Highlights
-
•
A system was engineered to edit chromatin marks and study gene expression memory
-
•
Gene expression can be memorized at a spectrum of levels, not just “on”' and “off”
-
•
DNA methylation grade is conserved and mediates gene expression level maintenance
-
•
A model indicates that analog memory emerges in the absence of DNAme-H3K9me3 feedback
Traditional models of epigenetic memory support binary memory. Targeted editing of the chromatin state, coupled with temporal tracking of gene expression with clonal resolution, reveals that epigenetic memory can be analog. DNA methylation serves as the “knob” of a molecular dimmer switch that sets and memorizes gene expression levels. A chromatin modification model capturing histone modifications and DNA methylation indicates that this is possible when positive feedback between DNA methylation and H3K9me3 is lacking.
Introduction
Epigenetic memory is the ability to maintain multiple gene expression states through cell divisions and time without differences in genetic sequence. It is required by many biological processes, including the maintenance of distinct cell identities,1,2 innate and adaptive immunity,3,4,5 memory formation in the brain,6,7 and could become a powerful tool for mammalian cell engineering.8
Chromatin modifications, such as histone modifications and DNA methylation, are critical mediators of epigenetic memory,9,10,11,12,13 owing to their ability to affect gene expression by altering transcription14,15,16,17,18 and because they can self-propagate through DNA replication and cell division.14,19,20 Because of these properties, chromatin modifications mediate memory of gene expression in a variety of biological contexts. DNA methylation and histone H3 lysine 9 tri-methylation (H3K9me3) enable the maintenance of cell identity by inducing and keeping stable silencing of alternate fate genes.1,2 For example, somatic cell types maintain pluripotent genes such as Oct4 in a highly methylated, and hence silenced, state.21 In the context of immunology, the persistence of histone modifications, established after initial exposure to a pathogen, has been linked to the innate memory of the immune system.4 Finally, DNA methylation has been shown to be required for memory formation in the brain,6 although research is still unfolding to determine how that is established.
Despite the critical roles that chromatin modifications play in epigenetic memory across biological processes, the spectrum of gene expression states that these chemical modifications stably maintain remains uncharted.
Heterochromatin is a well-studied state of chromatin with concurrent DNA methylation and H3K9me3, in which gene expression is maintained silenced.19 The long-term temporal stability of this state was assessed in prior studies that performed time-resolved analysis of a reporter gene’s expression.22,23 Related studies have shown that histone modifications mediate a bistable epigenetic switch where gene expression is either silenced or active.24,25 Therefore, current models of epigenetic memory posit that memory is binary, wherein genes are maintained either “on” or “off” based on autocatalytic histone modifications that induce bistability26 (Figure 1A). While informative, the abovementioned studies have been carried out in a small set of contexts limited to artificial chromosomes or to specific endogenous genes. Because the effect of chromatin modifications on gene expression is context dependent,27,28,29,30 it remains unclear whether a model of binary memory generalizes across genetic contexts, particularly for endogenous loci.
Figure 1.
Binary versus analog epigenetic memory of gene expression
(A) A binary memory model posits that only silenced or active gene expression states are maintained long-term, but no other states. Each trajectory can be regarded as one cell. The emergent long-term distribution of gene expression across a cell population has two distinct peaks.
(B) In an analog memory model, any gene expression level is maintained. The resulting long-term distribution of gene expression levels can take any shape.
Here, we investigate whether chromatin modifications can maintain a broader spectrum of gene expression states and, hence, mediate non-binary memory. Additional forms of memory beyond binary include n-ary memory and analog memory. Memory that is n-ary extends the concept of binary memory to n stable states and emerges from multi-stable networks.31 In the limit where the n stable states form a continuum of states, memory is analog. With analog memory, every possible gene expression state is stably maintained in time (Figure 1B). Elucidating the landscape of gene expression states that chromatin modifications maintain requires methods that initialize gene expression at different levels through targeted perturbations to the chromatin state. Although methods using targeted editing of reporters knocked into endogenous genes have been developed before, they can be subject to genetic context effects that are difficult to control and may influence the results.23,28,30,32,33 To mitigate the influence of context, prior studies have constructed reporter genes within chromatin insulators in artificial chromosomes22; however, whether these findings generalize to natural chromosomes is unclear.
To overcome these difficulties, we developed a single-copy genomic reporter system in a natural chromosome at a specific site separated from endogenous genes. We also developed a set of chimeric proteins for targeted editing of the chromatin state and a chromatin modification model that, different from existing models, includes both histone and DNA methylation. We used these tools to map the spectrum of gene expression states that are memorized by chromatin modifications and determined the causal roles of histone modifications versus DNA methylation in long-term epigenetic memory. We show that memory can be analog and that, despite recent emphasis on histone modifications, DNA methylation is the causal determinant of analog epigenetic memory, with H3K9me3 only being an intermediate regulator of gene expression recruited via DNA methylation.
Results
An engineered genomic reporter system to study gene expression dynamics post targeted chromatin editing
We first set out to develop an experimental system to study epigenetic memory in a natural chromosome and the role that histone and DNA methylation play in its establishment. To this end, we constructed a single-copy reporter integrated site-specifically in an endogenous mammalian locus and engineered a set of proteins for targeted epigenetic editing (Figures 2A–2D, S1, S2A, and S2B; STAR Methods; and Tables S1 and S2). We constructed the genomic reporter in Chinese hamster ovary (CHO)-K1 cells, a common model cell line in mammalian synthetic biology. The gene comprises the mammalian elongation factor 1a promoter (EF1a) driving the expression of fluorescent reporter (EBFP2). This enables the study of changes in gene expression and protein level due to the correlation between fluorescent protein level and fluorescence intensity measured using flow cytometry.34 We flanked the gene with cHS4 chromatin insulators for isolation of the reporter from other genes.35 The reporter comprises five binding sites upstream of the promoter for each of the DNA-binding proteins PhlF and rTetR (Figures S1A and S1B and Table S2),36 where the binding of PhlF and rTetR to DNA can be modulated using 2,4-diacetylphloroglucinol (DAPG) and doxycycline, respectively. Moreover, the region upstream of the promoter comprises five guide RNA (gRNA) target sites, which enable targeted chromatin editing using epigenetic effectors fused to catalytically dead Cas9 (dCas9) (Figures 2B and S1C and Tables S2 and S3). Epigenetic editing was performed by expressing epigenetic effectors fused to PhlF, rTetR, or dCas9 (Figures S1A–S1F and Table S2) using transient transfection. The epigenetic effectors were co-transfected with a fluorescent transfection marker (EYFP). Since the expression of two co-transfected vectors is highly correlated,37 this enables sampling of cells with varying levels of epigenetic effector by sorting for cells with varying levels of EYFP using fluorescence-activated cell sorting (Figure 2E).
Figure 2.
The gene expression dynamics in DNMT3A-edited cells indicate a form of memory different from binary
(A) Transient epigenetic editing using KRAB, DNMT3A, or TET1 fused to dCas9, PhlF, or rTetR DNA-binding domains.
(B) Experimental system. A mammalian constitutive promoter (EF1a) drives expression of fluorescent protein EBFP2. Upstream binding sites enable targeted recruitment of epigenetic effectors fused to DNA-binding proteins rTetR, PhlF, or dCas9. The reporter gene is flanked by chromatin insulators (Figures S1, S2A, and S2B; STAR Methods; Tables S1 and S2).
(C) CpGs along the reporter (Table S1).
(D) Copy number of the genomic integration determined using qPCR. Control is the landing pad cell line (STAR Methods).
(E) Transient transfection into cells bearing the reporter, fluorescence-activated cell sorting based on transfection level, and time-course flow cytometry measurements.
(F) Gene expression dynamics following editing with DNMT3A-dCas9 according to the experimental timeline of (E). Shown are single-cell flow cytometry measurements (EBFP2) of DNMT3A-edited cells. DNMT3A-dCas9 was targeted to five target sites upstream of the promoter, and a scrambled gRNA target sequence was used as the control (Figures S2C and S2D; Table S3). Shaded yellow is the time that the transfection marker was detected (Figure S2D). The marker indicates geometric mean. Data shown are from a representative replicate from three independent replicates.
(G) MeDIP-qPCR and ChIP-qPCR analysis 14 days after transfection of DNMT3A-dCas9 and cell sorting for high levels of transfection (Table S4 and STAR Methods). Data are from three independent replicates. Reported are fold changes and their mean using the standard method with respect to the active state (STAR Methods). A non-targeting (NT) scrambled gRNA target sequence (gRNA NT) was used as the control. Error bars are SD. ∗p < 0.05, ∗∗p < 0.01, and ∗∗∗p < 0.001, unpaired two-tailed t test.
(H) Gene expression dynamics post PhlF-KRAB editing according to the experimental timeline of (E). Shown are flow cytometry measurements of the reporter gene (EBFP2) for single cells. The shaded yellow region indicates the time the transfection marker was detected during the time when DAPG was not applied. DAPG was applied in the PhlF-KRAB and PhlF conditions beginning at 6 days. The marker indicates geometric mean. A different independent replicate was measured on each day. Data shown are from three independent replicates.
(I) MeDIP-qPCR and ChIP-qPCR analysis 6 days after transfection of PhlF-KRAB and cell sorting for high levels of transfection (Table S4 and STAR Methods). Shown are fold changes and their mean with respect to the active state from three independent replicates. Fold changes were calculated using the standard ΔΔCt method. Error bars are SD. ∗p < 0.05, ∗∗p < 0.01, and ∗∗∗p < 0.001, unpaired two-tailed t test.
(J) Chromatin modification circuit obtained when KRAB = 0 and TET1 = 0 (Figure S5C).
(K) Top: dose-response curve for the (CpGme and X) pair. Bottom: dose-response curve between the height of a DNMT3A pulse (Equation 7) and the DNA methylation grade (CpGme). See Figures S5D, S7, and S15.
(L) Stationary probability distribution of gene expression for the system in Tables S5 and S8, with parameter values in STAR Methods.
(M) Probability distributions of gene expression for the same system as in (L) after t = 28 days, with parameter values and initial conditions given in STAR Methods. For (L) and (M), the distributions were obtained computationally using the stochastic simulation algorithm (SSA)38; STAR Methods. See Figures S5B and S14. Here, scales the ratio between the basal (recruited) erasure rate and the auto-catalysis rate of each modification. See Figures S5E, S7, and S15. A.U.F., arbitrary units of fluorescence.
Transient chromatin editing with DNMT3A leads to permanent changes in gene expression
We first investigated the gene expression dynamics following epigenetic editing by DNA methylation writer DNA methyltransferase 3 alpha (DNMT3A). To this end, we expressed a fusion of dCas9 and the catalytic domain of DNMT3A (DNMT3A-dCas9)39 (Figure S1C and Table S2), via transient transfection targeted to five target sites upstream of the promoter in the reporter gene using a gRNA (Figures 2F, S2C, and S2D and Table S3). Subsequently, we utilized fluorescence-activated cell sorting to sample cell populations with various transfection levels and proceeded with time-course flow cytometry measurements of the resulting populations. We observed that gene expression was repressed, reaching maximal repression approximately 11 days after transfection (Figures 2F and S2C). After reaching maximal repression, the cell distributions remained stationary for all levels of DNMT3A-dCas9 (Figures 2F and S2C). This, along with no detection of the transfection marker of DNMT3A-dCas9 by day 8 (Figure S2D), indicates memory of the transient DNMT3A editing. No repression was observed for cells where a gRNA with a scrambled target sequence was co-transfected with DNMT3A-dCas9 or in control experiments where dCas9 was targeted to the gene (Figures S2E and S2F).
Although the epigenetic effector dosage increased the proportion of cells in the silenced state, all dosages of DNMT3A-dCas9 editing led to stationary distributions with cells expressing the reporter at all levels of gene expression, ranging from silenced to fully active (Figures 2F and S2C). Experiments where we transiently transfected the same DNMT3A catalytic domain fused to rTetR instead of dCas9 (rTetR-XTEN80-DNMT3A) (Figures S1D and S3 and Table S2) also led to stationary distributions of gene expression spanning the whole spectrum of levels, from silenced to fully active. These observations rule out binary memory (Figure 1). Since maximal repression is reached approximately 11 days post-transfection of DNMT3A-dCas9, we selected day 14 for analyzing DNA and histone methylation to ensure maximal repression has been reached. Methylated DNA immunoprecipitation (MeDIP) and chromatin immunoprecipitation (ChIP) followed by qPCR further showed an increase in DNA methylation and H3K9me3 levels, accompanied by a decrease in H3K4me3 levels in the reporter gene compared to the untransfected (active) cells (Figure 2G; STAR Methods; and Table S4). This is in accordance with reports showing the ability of methylated DNA to recruit writers for H3K9me3,40 indicating that these two chromatin modifications play a role in the temporal maintenance of the gene expression states.
Transient chromatin editing with KRAB leads only to temporary changes in gene expression
In order to discriminate the roles of DNA methylation and H3K9me3 in the maintenance of gene expression states, we investigated the gene expression dynamics following epigenetic editing with writers of H3K9me3. To this end, we fused DNA-binding protein PhlF to KRAB (Figure S1E and Table S2),41 an epigenetic effector that promotes H3K9 trimethylation.42,43 We thus expressed PhlF-KRAB via transient transfection in the cells bearing the reporter gene, sampled cell populations across different transfection levels using fluorescence-activated cell sorting, and performed time-course flow cytometry measurements (Figures 2H and S4A–S4C). We observed rapid repression of gene expression for cells transfected with PhlF-KRAB and no repression for cells transfected with PhlF. Irrespective of the dosages of PhlF-KRAB, gene expression reverted to the active gene state within 4 days after reaching maximal repression. This aligns with previous reports where KRAB-based epigenetic editing resulted in temporary repression while failing to establish permanent silencing.23,39 Further experiments with transient expression of PhlF-KRAB also resulted in temporary repression of the gene while no long-term memory was observed (Figures S4D–S4F).
To determine what chromatin modifications emerged as a result of KRAB-mediated editing, we performed MeDIP and ChIP followed by qPCR on cells repressed with PhlF-KRAB (Figure 2I and STAR Methods). Since gene expression is maximally repressed around day 6 post-transfection and then reactivates, we selected day 6 for this analysis. We observed an increase in H3K9me3 levels in the reporter gene compared to untransfected (active) cells, while DNA methylation levels remained unchanged, and H3K4me3 levels decreased. Taken together, these data indicate that an increase in H3K9me3 in the absence of DNA methylation is unable to permanently downregulate gene expression. Therefore, DNA methylation, but not H3K9me3, emerges as a primary mediator of the temporal maintenance of gene expression states in the cells edited by DNMT3A (Figure 2F).
A model where DNA methylation drives H3K9me3 predicts analog memory
Since de novo DNA methylation leads to H3K9me3 (Figure 2G), we conclude that DNA methylation mediates the establishment of H3K9me3. This is consistent with studies reporting that methylated CpGs are bound by a reader protein (methyl-CpG-binding protein 2) that recruits H3K9me3 writers.40 By contrast, H3K9me3 does not mediate the establishment of DNA methylation in our system (Figure 2I). Taken together, these data suggest a model in which DNA methylation drives H3K9me3. Furthermore, the permanent change in gene expression resulting from transient recruitment of DNA methylation writer (Figure 2F), but not from H3K9me3 writer (Figure 2H), suggests that DNA methylation’s decay is negligible, while H3K9me3 decay is relatively fast. This long-term stability of DNA methylation has been attributed to the fast action of maintenance enzyme DNMT1, which re-writes DNA methylation on the daughter DNA strand upon DNA replication.14,44
We therefore propose a model in which the DNA methylation state remains constant in time in the absence of writer DNMT3A, drives the formation of H3K9me3 and the removal of H3K4me3.12,45,46 These histone marks, in turn, are known to mutually inhibit each other by recruiting each other’s erasers47 and are autocatalytic since they each recruit their own writers.11,19,20,48 Furthermore, H3K4me3 but not H3K9me3 is associated with transcriptionally active promoters,20 leading to our chromatin modification model (Figures 2J and S5; see STAR Methods; and Table S7 for the chemical reaction system).
This model predicts an inverse graded relationship between the DNA methylation grade in the gene, defined as the mean fraction of methylated CpGs, and gene expression level (Figure 2K [top] and STAR Methods). When the mutual inhibition between activating and repressive marks is sufficiently strong, the map between the intensity of a DNMT3A pulse and the DNA methylation grade becomes ultrasensitive (Figures 2K [bottom] and S5D). This, in turn, with noise in the reactions and in the DNMT3A levels, leads to a bimodal distribution of gene expression in response to a transient DNMT3A input (Figures 2L and S5E), which matches well experimental observations (Figures S2C and S2D). The model also recapitulates the rest of the experimental observations (Figure S6).
Most importantly, the model predicts that any initially set DNA methylation grade remains approximately constant in time in the absence of DNMT3A, since the decay rate of DNA methylation is negligible. Thus, any gene expression level initially set with the corresponding equilibrium grade of DNA methylation persists in time, with some variability under the effect of noise (Figure 2M). This demonstrates analog memory. Concerning noise properties, the model also predicts that intermediate DNA methylation grades will result in broader gene expression distributions (compare the orange and yellow distributions to the gray and blue distributions in Figure 2M). This occurs when the basal erasure of histone modifications is sufficiently slow compared to their autocatalysis (small ). In this regime, the histone modification circuit becomes bistable for intermediate DNA methylation grade (see STAR Methods and Figure S7A), which leads to increased gene expression variability under the influence of noise (Figure S7B). In summary, our data-educated model predicts that DNA methylation is primarily responsible for analog memory, with histone modifications following it to modulate the gene expression level and its variability.
Fluorescence-activated cell sorting reveals non-binary epigenetic memory
Following the model predictions, we hypothesized that epigenetic memory can be analog, thereby enabling cells to maintain any gene expression level. To initially test this hypothesis, we first sought to investigate whether cells with the gene in a state different from silenced or active within DNMT3A-edited populations exhibited long-term stability of gene expression. To this end, we performed fluorescence-activated cell sorting of the DNMT3A-edited cells from low, intermediate, and high gene expression levels (Figures 3A and 3B and STAR Methods). We then performed time-course flow cytometry measurements for 29 days to analyze the long-term gene expression trajectories (Figures 3C–3E). The cells with low and high levels of gene expression maintained an approximately stable distribution of gene expression levels after cell sorting, as expected by a binary memory model. Nonetheless, the cells with intermediate levels of expression, after an initial widening of the expression distribution, also kept a stable gene expression distribution across the population for the entire time course. This is incompatible with binary memory (Figure 1A). To characterize the epigenetic state of these cells, we performed ChIP followed by qPCR of the low, intermediate, and active cell populations (Figure 3F and STAR Methods). We found an increase of H3K9me3 in the low population compared to the intermediate and active cell populations. Accordingly, we observed a decrease in H3K4me3 in the low- and intermediate-expressing cells compared to the active cells. These results are consistent with the known role of H3K9me3 as a chromatin modification associated with repression and H3K4me3 as a chromatin modification associated with active transcription at promoters.49
Figure 3.
Gene expression is stably maintained at multiple levels in DNMT3A-edited cells
(A) Experiment overview for (B–I).
(B) Fluorescence-activated cell sorting of DNMT3A-edited cells (DNMT3A-dCas9) according to low, intermediate, and high levels of gene expression (EBFP2). Single-cell gene expression measurements on the day of cell sorting using flow cytometry.
(C) Time-course flow cytometry data from cells obtained as shown in (B).
(D) Density of cells from flow cytometry measurements obtained on the last time point in (C) (29 days after cell sorting). (B–D) show a representative replicate from three independent replicates.
(E) Dynamics of gene expression exhibited by cells shown in (B–D). Shown is the mean of geometric means from three independent replicates. Error bars are the SD of the mean.
(F) ChIP-qPCR for H3K9me3 and H3K4me3 of the promoter in the reporter gene. Shown are fold changes and the corresponding mean from three independent replicates as determined by the method with respect to the active state (parental untransfected cells bearing the reporter gene) (STAR Methods). Error bars are SD. ∗p < 0.05, ∗∗p < 0.01, and ∗∗∗p < 0.001, unpaired two-tailed t test.
(G) Heatmap of targeted bisulfite sequencing of the reporter gene. Shown is the level of DNA methylation as the mean percentage of DNA methylation at each CpG from three independent replicates (STAR Methods).
(H) Probability distribution of the fraction of methylated CpGs (STAR Methods).
(I) Correlation plot between the mean fraction of CpGs methylated and the mean gene expression level. Shown are means and error bars indicating SD. The green line represents the best-fit obtained by linear regression. The p value associated with the slope coefficient is p (STAR Methods).
(J) Experiment overview for (K–N).
(K) Fluorescence-activated cell sub-sorting of DNMT3A-edited cells (DNMT3A-dCas9) exhibiting intermediate levels of gene expression (EBFP2). Data are from gene expression measurements using flow cytometry on the day of cell sorting.
(L) Flow cytometry-based time-course analysis of the dynamics of gene expression exhibited by cells obtained in (K).
(M) Density of cells from flow cytometry measurements obtained on the last time point in (L) (14 days after cell sampling). (K–M) show a representative replicate from three independent replicates.
(N) Dynamics of gene expression exhibited by cells shown in (K–M). Shown is the mean of geometric means from three independent replicates. Error bars are the SD of the mean. A.U.F., arbitrary units of fluorescence.
Because we pinpointed DNA methylation as the critical mediator of the maintenance of gene expression states, we performed targeted bisulfite sequencing of the reporter gene for each of the three cell populations: low, intermediate, and active (Figure 3G and STAR Methods). While the active state is devoid of methylated CpGs, the probability of finding any given CpG methylated in the promoter and dCas9-rTetR binding region in the intermediate state is distinctly smaller than in the low state and higher than in the active state. From these data, we computed the probability distributions of the fraction of methylated CpGs in the promoter and dCas9-rTetR binding region, for each of the low, intermediate, and active states (Figure 3H and STAR Methods). These distributions reinforce that the intermediately expressing cells have intermediate levels of DNA methylation. In particular, the mean value of the fraction of methylated CpGs, which we define as the DNA methylation grade, is intermediate in cells with intermediate expression levels compared to that of the low and active cells and has a log-linear negative correlation (p value < 0.05) with the mean gene expression level (Figure 3I).
We additionally tested the hypothesis that epigenetic memory can operate in an analog manner by further subdividing the DNMT3A-edited cells according to the gene expression levels, followed by time-resolved tracking (Figures 3J–3N and STAR Methods). If epigenetic memory were analog, cells sampled from different gene expression levels should approximately maintain these distinct levels over time. Each of the four resulting cell populations maintained distinct distributions of gene expression levels according to the level on the day of cell sampling (Figures 3K–3N). We also sampled cells that had been edited with DNMT3A fused to rTetR (rTetR-XTEN80-DNMT3A) instead of dCas9, using fluorescence-activated cell sorting (Figure S8 and STAR Methods), and the same temporal stability was observed.
Overall, these observations support the hypothesis that the memory is analog (Figure 1B) in agreement with the model (Figures 2M and S9A–S9H; STAR Methods), and that DNA methylation grade is the primary mediator of the stable maintenance of gene expression levels.
Single-cell analysis reveals that a wide range of gene expression levels is maintained over time, along with the corresponding DNA methylation grade
To further test the hypothesis that epigenetic memory operates on an analog basis, we set out to characterize the long-term trajectories of gene expression with extensive sampling of single cells from DNMT3A-edited cell populations followed by monoclonal expansion and analysis (Figure 4). This approach provides a high-resolution evaluation of gene expression trajectories generated exclusively from one initial DNA methylation profile. We first performed highly dense sampling of single cells, using fluorescence-activated cell sorting, from three distinct cell populations that had been previously sampled from DNMT3A-edited cells (Figures 4A and 4B; STAR Methods). Specifically, we selected eight different consecutive regions of gene expression levels spanning across low (two regions), intermediate (four regions), and high (two regions) gene expression levels in DNMT3A-edited cells (STAR Methods). After sampling single cells from each of those regions, we performed clonal expansion, resulting in 10 monoclonal populations from each of the eight gene expression regions for a total of 93 monoclonal populations. Flow cytometry measurements were performed for each of the cell populations after 15 days of monoclonal expansion (Figure 4C). The monoclonal populations showed a continuum of gene expression levels ranging from background to active gene expression levels. To test whether the gene expression levels from the monoclonal populations could be maintained in time, we selected eight monoclonal populations (highlighted in Figure 4C), spanning the gene expression range from the 93 clones, and performed time-course flow cytometry measurements up to 161 days (> 5 months) since the beginning of monoclonal expansion (Figure 4D). In these measurements, we observed that the gene expression distributions remained approximately stable over time, validating the initial hypothesis that cells can maintain memory of any gene expression level, which defines analog memory.
Figure 4.
Epigenetic memory can be analog, and this is mediated by DNA methylation
(A) Experiment overview.
(B) Single cells were obtained after sampling of DNMT3A-edited cells with low, intermediate, and high levels of gene expression. After monoclonal expansion, the total number of monoclonal populations obtained was N = 93 (STAR Methods).
(C) Violin plots of the monoclonal populations (n = 93) measured 15 days after single-cell sampling from the cell populations shown in (B). Monoclonal populations were positioned on the x axis according to the geometric mean of gene expression. Clones indicated with numbers 1–8 were selected for further analysis. Colors represent the cell population of origin as shown in (B).
(D) Flow cytometry-based time course analysis of gene expression dynamics of the eight monoclonal populations selected in (C).
(E) Methylation level determined using bisulfite sequencing of the reporter gene in the eight monoclonal populations at two different time points (STAR Methods).
(F) Flow cytometry measurements of the reporter gene in the eight monoclonal populations at two different time points corresponding to those shown in (E).
(G) Probability distribution of the fraction of methylated CpGs (STAR Methods). The SD for each clone is: 0.1752, 0.4707, 0.9492, 1.2714, 1.3101, 1.262, 1.1683, and 0.8095, ordered from clone 1 to clone 8.
(H) Simulations of the probability distributions of the system in Tables S10 and S8. The distributions are obtained computationally using the SSA38 (STAR Methods). The SD for each clone is: 0.1346, 0.5472, 1.2585, 1.1189, 1.0431, 1.1117, 0.8512, and 0.8259, ordered from clone 1 to clone 8. The parameter values used for these simulations are listed in STAR Methods.
(I) Correlation between the mean fraction of CpGs methylated and mean gene expression level at 42 days after single-cell sorting. Shown are means and error bars indicating SD. The green line represents the best linear fit . The p value associated with the slope coefficient is p (STAR Methods).
(J) Correlation plot between the mean fraction of CpGs methylated and the mean gene expression level, x, obtained via simulations. Data shown are the mean and error bars indicating SD. The green line represents the best linear fit . The p value associated with the slope coefficient is p (STAR Methods). A.U.F., arbitrary units of fluorescence.
We next investigated DNA methylation grade as a determinant of the maintenance of gene expression level. To this end, we performed targeted bisulfite sequencing of the reporter gene in each of the eight populations at two different time points (Figures 4E and 4F and STAR Methods). The data showed that clones (7 and 8) selected from cells in the high gene expression distribution are largely devoid of methylated CpGs. By contrast, clones (1 and 2) selected from cells within the low distribution have the majority of CpGs methylated. The remaining clones (3, 4, 5, and 6), selected from the intermediate distribution, display a graded decreasing number of CpGs that are methylated with high probability (Figure 4E). These trends inversely correlate with the gene expression levels in the clones (Figure 4F). To acquire a quantitative understanding of the graded nature of this variation, we computed from the bisulfite sequencing data the probability distribution of the fraction of methylated CpGs in the gene’s promoter and rTetR binding sites (STAR Methods). This confirmed a progressive and gradual shift of the distributions through the clonal populations (Figure 4G), in accordance with model predictions (Figure 4H). The correlation between the mean gene expression level of each clonal population and the mean number of methylated CpGs is log-linear (Figures 4I and 4J), confirming a gradual increase in the DNA methylation grade through the clonal populations from high to low gene expression levels. Furthermore, the distribution of the fraction of methylated CpGs remained approximately unchanged between the two time points where bisulfite sequencing was performed (Figure 4G), indicating that the DNA methylation grade is stably maintained.
Further analysis of the clonal distributions of the fraction of methylated CpGs revealed that the variability of the number of methylated CpGs about their mean value, measured by the standard deviation, is larger for intermediate DNA methylation grades compared to low and high methylation grades (Figure 4G). The mathematical model recapitulates this property (Figure 4H) when it includes the two following additional possible molecular interactions: (1) H3K9me3 helps recruit DNA methylation maintenance enzyme (DNMT1) to the gene50 and (2) DNMT1 can cause de novo DNA methylation, although at a very low rate51,52 (see STAR Methods for model derivation). With these interactions, the errors made by DNMT1 in maintaining methylation are less probable for a fully methylated gene due to a higher recruitment rate of DNMT1 via H3K9me3. Similarly, errors made by DNMT1 in de novo methylating additional CpGs are less probable for a gene with no methylation since DNMT1 recruitment is practically absent (see STAR Methods and Figure S10 for model analysis). Overall, these interactions can explain the tighter distributions observed for highly and lowly methylated gene states (Figure 4G).
The genome engineering system that we utilized in this study has been demonstrated to be highly precise.53 Additionally, we have verified that the copy number of the chromosomal integration is one (Figure 2D). Nevertheless, we still sought to verify that the different clones had one genomic copy of the reporter in the correct location to rule out the possibility that the variation of gene expression observed across the clones is due to variability in the genomic integration. To this end, we verified using genomic PCRs that the integration was in the ROSA26 locus for all clones (Figure S11A and STAR Methods). Moreover, we performed copy-number variation analysis using digital PCR (dPCR), which showed that all of the clones have one copy (Figure S11B and STAR Methods). Taken together, these data establish that the DNA methylation grade is maintained by the clonal populations and that this grade fine-tunes the gene expression level.
Finally, we sought to rule out that the observed differences in gene expression could be due to differences in the genome of the cells bearing the chromosomal integration. To this end, we selected the active monoclonal population (monoclonal population 8), targeted DNMT3A-dCas9 to the reporter, and tracked gene expression over time (Figures S11C–S11E). Subsequently, we sorted eight populations with different gene expression levels for temporal tracking (Figures S11F and S11G). Since the monoclonal cell line is composed of identical cells, characterized in detail using dPCR, genomic PCRs, and bisulfite sequencing (Figures 4E, S11A, and S11B), any differences observed in gene expression are due to epigenetic changes. Consistent with all prior observations, the distribution of gene expression in the selected populations initially widened and then remained stable for each gene expression level (Figures S11F and S11G). Bisulfite sequencing data further confirmed grades of DNA methylation that change linearly with gene expression level across the eight populations, consistent with our prior results (Figures S11H–S11J).
DNA methylation drives the maintenance of analog gene expression states
Next, we sought to establish a causal link between the DNA methylation grade and the level of gene expression. To this end, we investigated whether altering the DNA methylation grade in cells maintaining an intermediate level of gene expression would impair this maintenance. To enhance DNA methylation, we utilized rTetR-XTEN80-DNMT3A, as we have shown that DNMT3A enhances DNA methylation (Figure 2G) in accordance with prior reports.39 To remove DNA methylation, we engineered a fusion protein comprising rTetR fused to the catalytic domain of the ten eleven translocation protein 1 (TET1)33,54 to construct rTetR-XTEN80-TET1 (Figure S1F and Table S2). This catalytic domain has been shown to enable targeted DNA demethylation of mammalian genes.33 Moreover, we fused rTeR to KRAB to construct rTetR-KRAB (Figure S1G and Table S2) to evaluate changes to gene expression when enhancing H3K9me3 without changing DNA methylation.
We transiently transfected these epigenetic effectors in cells that had been sampled from intermediate levels of gene expression in DNMT3A-edited cells. As before, we utilized fluorescence-activated cell sorting to obtain transfected cells and subsequently performed time-course flow cytometry measurements (Figures 5 and S11K–S11M). By 10 days after transient transfection with rTetR-XTEN80-DNMT3A, the gene expression was repressed, remained silenced over time, and overlapped with the distribution of cells that had been previously sorted for low levels of gene expression post DNMT3A-editing (Figure 5A). Transient transfection with rTetR-XTEN80-TET1 resulted in nearly complete and permanent reactivation of gene expression (Figure 5B). By contrast, transient transfection with rTetR-KRAB did not result in any permanent change in gene expression. Specifically, 6 days after transfection, we observed levels of repression comparable to those of the low levels in DNMT3A-edited cells, but the expression reverted to the initial intermediate level and remained stable afterward (Figure 5C). These data indicate that transient recruitment of a DNA methylation writer or eraser to an intermediately methylated and partially active gene can permanently silence or reactivate the gene. By contrast, transient recruitment of an H3K9me3 writer does not exert a permanent change in gene expression. All of this is consistent with the model predictions (Figure S12).
Figure 5.
DNA methylation effectors (DNMT3A or TET1) impair the maintenance of analog gene expression states in DNMT3A-edited cells
Time-course flow cytometry measurements after transient editing using (A) rTetR-XTEN80-DNMT3A, (B) rTetR-XTEN80-TET1, or (C) rTetR-KRAB in DNMT3A-edited cells with stable intermediate levels of gene expression. The densities of cells shown are from one representative replicate from three independent replicates. The shaded blue region indicates the time the transfection marker was detected during the time that doxycycline was added (Figures S11K–S11M). Doxycycline was applied for 6 days. Shown is the mean of the geometric means from three independent replicates. Error bars are the SD of the mean. A.U.F., arbitrary units of fluorescence.
We thus hypothesized that distinct levels of gene expression are maintained because their corresponding DNA methylation grade is maintained over time. To test this hypothesis, we sought to inhibit DNMT1 activity since DNMT1 is the enzyme that maintains DNA methylation through cell division.14,44 To this end, we selected clonal populations with intermediate levels of gene expression and DNA methylation from Figure 4D and treated them with DNMT1 inhibitor 5-azacytidine.55 DNMT1 inhibition caused gene expression in these clones to completely reactivate within 4 days, indicating loss of gene expression memory (Figure S13).
Taken together, these results show that each gene expression level is memorized by the corresponding grade of DNA methylation through the activity of the DNA methylation maintenance enzyme DNMT1.
Discussion
Current models posit that epigenetic memory is binary, largely based on autocatalytic histone modifications that stabilize genes in either an “on” or “off” state.26 Although experimental studies support this hypothesis,22,24,25 the context-dependent nature of chromatin modifications27,28,29,30 casts doubts on whether a binary memory model is broadly applicable. With our genomic reporter system and sequence-specific chromatin regulators, we have discovered that gene expression can be memorized at a continuum of levels, encoded by the grade of DNA methylation.
Unlike histone modifications, DNA methylation is not autocatalytic. This feature, when coupled with a negligible decay rate and a lack of positive feedback between DNA methylation and H3K9me3 (Figure 6A), makes any DNA methylation grade persist over time. Since DNA methylation modulates gene expression,20,56 it confers long-term stability to the corresponding gene expression level. In our system, positive feedback between DNA methylation and H3K9me3 is lacking; although DNA methylation recruits writers of H3K9me340 (Figure 2G), H3K9me3 does not recruit writers of DNA methylation (Figure 2I).
Figure 6.
The interactions between DNA methylation and histone modifications explain when analog or binary epigenetic memory emerges
(A) Analog memory. DNA methylation is not under positive feedback.
(B) Binary memory. DNA methylation is under positive feedback because H3K9me3 recruits writer enzymes for DNA methylation (yellow dashed arrow marked by ) and DNA methylation, in turn, mediates the recruitment of H3K9me3. Refer to Figures S5A and S5C for a complete diagram.
Because the crosstalk between DNA methylation and histone modifications is context-dependent,57 different patterns of interactions likely appear in different cellular and genetic contexts. In situations where H3K9me3 also mediates the establishment of DNA methylation,58,59 DNA methylation becomes subject to positive feedback (Figures 6B and S5). This feedback can push a partially methylated gene to be fully methylated. Likewise, the competitive interaction between DNA methylation and H3K4me3 can cause H3K4me3 to win over a low DNA methylation grade and to revert this to the unmethylated state. The result is binary memory (STAR Methods and Figure S14). The strength of the positive feedback ( in Figure 6B), determines the speed at which intermediate states converge to the extreme ones (Figure S14). In practice, the cellular and genetic context will determine the strength of the positive feedback and thus the temporal duration of analog states of gene expression (Figure S14).
Although this study has placed a causal link between the grade of DNA methylation in a gene and the temporal maintenance of the expressed protein’s level, other factors can play a role in modulating this level. These include RNA stability, RNA modifications, and the possible crosstalk between DNA methylation and RNA methylation.60,61,62 Investigating these interactions will enrich our understanding of how specific memory levels can be tuned. Accordingly, our mathematical model can be extended to include RNA stability modulation, such as by covalent modifications and interactions with effectors of the chromatin state.
We have provided a proof-of-concept study showing that analog epigenetic memory is possible and have identified graded, stably maintained DNA methylation as the enabling molecular mechanism. To achieve this, we have mitigated potential genetic context effects by insulating the reporter system from the surrounding gene loci through flanking chromatin insulators and by spacing it away from endogenous coding sequences (Figure 2B) Additional studies are required to investigate analog memory in more native and physiologic genetic contexts and to determine how it generalizes across different cell lines, promoters, and with varying promoter CpG contents. Our hypothesis is that analog memory will emerge in those contexts lacking the positive feedback between DNA methylation and repressive histone modifications (Figure 6).
Patterns of intermediate DNA methylation and gene expression level have been observed in a variety of biological contexts. For example, the hippocampus is composed of spatial gradients of cell types with varying degrees of gene expression63 and DNA methylation.64 Similarly, genome-wide studies have reported an inverse correlation between gene expression level and DNA methylation content in chronic lymphocytic leukemia cells,65 embryonic stem cells,66 and human placenta.67 Intermediately methylated and partially active gene states that appear to be maintained in time were also found in a class of human stem/progenitor cells.68 However, the temporal stability of gene expression and DNA methylation grade at the single cell and gene level will need to be addressed to demonstrate analog epigenetic memory in these contexts.
Analog epigenetic memory can be used as a tool for engineering mammalian cells that store analog information. Programming analog memory through the DNA methylation grade could allow us to create high-resolution spatial gradients of cell identities for future engineered tissues and organoids. To this end, additional research will be required to precisely control DNA methylation grade by user-defined inputs, such as small signaling molecules or drugs.
In summary, this study has revealed that chromatin modifications support analog epigenetic memory and has established tools that can be used for further biological discovery and future mammalian cell engineering applications.
Limitations of the study
We only used one promoter (EF1a), one cell line (CHO-K1), and one integration site. Further experiments using additional promoters, cell lines, and integration sites are required to determine the extent to which analog memory is broadly observed. We utilized single-site recombination for chromosomal integration, instead of recombinase-mediated cassette exchange, which may influence how chromatin regulators and methyl-CpG-binding proteins are recruited to the site due to the presence of the chromosomal integration backbone. We performed H3K9me3 editing indirectly through KRAB and not through the chromatin-modifying enzyme that KRAB recruits. We may find stronger recruitment of H3K9me3 in the latter case, which may enable us to reach a state where H3K9me3 is maintained on its own. We explored this with sequential transfections of PhlF-KRAB but still only observed a transient effect. This will not affect the presence of analog memory, which is established by using DNMT3A as the input.
Resource availability
Lead contact
Requests for reagents and resources should be directed to and will be fulfilled by the lead contact, Domitilla Del Vecchio (ddv@mit.edu).
Materials availability
All plasmids generated in this study are available from the corresponding author upon reasonable request and from Addgene. DNA sequences of all engineered proteins and the genomic reporters are included in the manuscript.
Data and code availability
Code for mathematical modeling is available at https://zenodo.org/records/15345727 (Zenodo, DOI: 15345727) and GitHub at https://github.com/simonbruno100/AnalogEpigeneticMemoryPaper2025/tree/v1.0. Targeted bisulfite sequencing data are available online (NCBI SRA: PRJNA1271686). All original data are available from the lead contact upon reasonable request.
Acknowledgments
We thank Dr. James J. Collins for providing scientific comments and suggestions on the link between the model predictions and the experimental data, Dr. Lacramioara Bintu for support with DNA methylation assays, Kalon Overholt for providing useful references on DNA methylation and providing feedback on some of the figures, and the Biological Engineering Comm Lab for helping us refine some of the diagrams. This study was supported by the National Science Foundation, by NSF-MODULUS (award no. 2027949), and partially by the Vannevar-Bush Faculty Fellowship (Office of Naval Research award no. N00014-25-1-2053).
Author contributions
S.P., S.B., and D.D.V. conceptualized the study. D.D.V. directed and supervised the study. S.P., S.B., R.W., E.S., A.K., and D.D.V. contributed to experimental design and/or data analysis. S.P., E.S., I.G.-M., A.K., and K.I. conducted or assisted in experimental investigation. S.P. performed computational protein structure modeling and analysis. S.B. performed mathematical modeling and data analysis. S.P., S.B., and D.D.V. wrote the manuscript.
Declaration of interests
The authors declare no competing interests.
STAR★Methods
Key resources table
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| Rabbit polyclonal anti-H3K4me3 | Diagenode | C15410003; AB_2924768 |
| Rabbit polyclonal anti-H3K9me3 | Diagenode | C15410056; AB_3674193 |
| Mouse monoclonal anti-5mC | Diagenode | C15200081; 2572207 |
| Chemicals, peptides, and recombinant proteins | ||
| Doxycycline | Sigma-Aldrich | D9891 |
| 2,4-diacetylphloroglucinol (DAPG) | Santa Cruz | c-206518 |
| Penicillin-Streptomycin | Sigma | P4333 |
| Puromycin | InvivoGen | ant-pr-1 |
| Formaldehyde | Thermo Fisher | 28906 |
| Critical commercial assays | ||
| DNeasy Blood and Tissue Kit | Qiagen | 69504 |
| Q5 High-Fidelity 2X Master Mix | NEB | M0492 |
| Power SYBR™ Green PCR Master Mix | Thermo Fisher | 4368577 |
| MagMeDIP Kit | Diagenode | C02010021 |
| MicroChIP-seq kit | Diagenode | C01010132 |
| Deposited data | ||
| Targeted bisulfite sequencing original data | This paper | NCBI SRA: PRJNA1271686 |
| Experimental models: Cell lines | ||
| Hamster: CHO-K1 cell line | ATCC | CCL-61 |
| Hamster: CHO-K1 landing pad cell line | Gaidukov et al.54 | sLP20 |
| Oligonucleotides | ||
| Primers qPCRs, see Table S4E | This paper | N/A |
| Primers for targeted bisulfite sequencing, see Table S4E | This paper | N/A |
| Primers of PCRs, see Table S4E | This paper | N/A |
| Recombinant DNA | ||
| Sequences for genomic reporter, see Table S1E | This paper | N/A |
| Sequences for epigenetic effectors, see Table S2E | This paper | N/A |
| Sequences for gRNAs, see Table S3E | This paper | N/A |
| DNMT3A-dCas9 | O’Geen et al.40 | Addgene 100090 |
| Software and algorithms | ||
| Computational model code | This paper | Zenodo, DOI: 1534572. https://github.com/simonbruno100/AnalogEpigeneticMemorPaper2025/tree/v1.0 |
| MATLAB | MathWorks | https://www.mathworks.com/products/matlab.html |
| Bismark | Babraham Institute | http://www.bioinformatics.babraham.ac.uk/projects/bismark/ |
| Trim Galore | Babraham Institute | https://www.bioinformatics.babraham.ac.uk/projects/trim_galore |
| FastQC | Babraham Institute | https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ |
| Cytoflow | MIT | https://cytoflow.readthedocs.io/en/stable/ |
| LightCycler 96 Application Software | Roche | https://lifescience.roche.com/global/en/products/product-category/lightcycler.html#4 |
| Benching | Benchling | https://www.benchling.com/ |
| AlphaFold 2 (ColabFold) | Mirdita et al.69 | https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/AlphaFold2.ipynb |
| QIAcuity Software Suite | Qiagen | https://www.qiagen.com/us/resources/resourcedetail?id=def90e09-0c22-4dad-ba8d-820fedb5ec0d&lang=en |
| QuantStudio Real-Time PCR Software | Thermo Fisher | https://www.thermofisher.com/us/en/home/global/forms/life-science/quantstudio-6-7-flex-software.html |
| Python | Python Software Foundation | https://www.python.org |
| Prism | Graphpad | https://www.graphpad.com/ |
Experimental model and study participant details
Cell engineering
The reporter gene was integrated using BxB1 site-specific chromosomal integration into an endogenous mammalian locus (Rosa26) in monoclonal CHO-K1 cells (female) bearing a monoallelic landing pad53 (Figure S2A). We selected this method because it is highly efficient and precise in mammalian cells.53,70 This locus had been previously identified by a homology-based computational search (BLAST) using the murine Rosa26 locus as reference. Each insulator consists of two tandem copies of the 5′ 250bp core sequence of the 1.2kb cHS4 insulator sequence.35 The core sequence isolates chromosomal integrations via both enhancer-blocking and barrier activity.30,35,71,72 EYFP is expressed in the cells exclusively before site-specific chromosomal integration into the Rosa26 locus (Figures S2A and S2B). The cells become puromycin resistant exclusively after correct chromosomal integration (Rosa26 locus)(Figures S2A and S2B). For integration, 500ng of the chromosomal integration vector and 500ng of the Bxb1 recombinase expression vector were transfected into the cells. After 24 h, routine cell culture was commenced with the supplementation of growth media with puromycin (8 g/ml) for 2 weeks. Subsequently, a fluorescence-activated cell sorter (BD Aria) was utilized to obtain cells that were positive for EBFP2 expression and negative for EYFP expression. Real-time PCR (qPCR) was used to confirm 1 genomic reporter per genome (Figure 2D). Genomic PCRs were used to confirm integration into the Rosa26 locus (Figure S2B). The cells were also assayed and confirmed to test negative for mycoplasma contamination (MIT’s Koch Institute Preclinical Modeling Core).
Cell culture conditions
Cells were maintained in tissue culture plates (Corning) with growth media comprised of Dulbecco’s Modified Eagle Medium (DMEM) (Corning) supplemented with 10% FBS (Corning), 1X non-essential amino acids (Gibco), 100 U/ml penicillin (Sigma), and 100 g/ml streptomycin (Sigma) at 37°C in a 5% CO2 incubator. The growth media was changed every 2–4 days. For routine passaging, the growth media was removed, followed by the addition of 0.25% trypsin (Corning) and incubation at 37°C in a 5% CO2 incubator for 7 min. Trypsin was then neutralized with growth media at room temperature. The cells were then centrifuged for 3 min at 200g, the supernatant was discarded, and the cell pellet was resuspended in fresh growth media. The cells were then diluted (ratio between 1 and 20) and reseeded into new tissue culture plates for subsequent cell culture.
Copy number determination using real-time PCR
To determine the copy number of the integration in our engineered cell line (Figure 2D), genomic DNA extraction was performed on the cells bearing the chromosomal integration and the cells before chromosomal integration (landing pad cell line) as control.53 The extraction was performed using the DNeasy Blood & Tissue Kit per the manufacturer’s instructions (Qiagen) with approximately 1x cells. After genomic extraction, the DNA samples were submitted to Azenta Life Sciences for performing the rest of the assay. TaqMan probes and primers were designed by Azenta Life Sciences for the chromosomal integration and the endogenous COG1 gene as reference. Real-Time PCRs were performed in triplicates on a QuantStudio Real-Time PCR system and analyzed using QuantStudio Real-Time PCR Software. An amplicon-based standard curve was developed to quantify copies per reaction. The COG1 gene was used as reference gene for normalization to obtain copies per genome (assuming 2 COG1 copies per genome).
Copy number determination using digital PCR
To determine the copy number of the integration in each of the 8 clones (Figure S11B), genomic DNA extraction was performed with the DNeasy Blood & Tissue Kit (Qiagen) using approximately 1x cells, according to the manufacturer’s instructions. Samples were then submitted to Azenta Life Sciences for conducting the rest of the assay. A NanoDrop spectrophotometer was first used to assess sample concentration and confirm purity of the samples. The samples were then loaded in triplicate for analysis on a QIAcuity 8.5k 96-Well Nanoplate using the Qiagen QIAcuity digital PCR system, with primes and probes targeting EBFP2 and COG1. Copy number calculation was conducted using the QIAcuity Software Suite (version 2.2.0.26). The COG1 gene was used as reference gene for normalization (assuming 2 COG1 copies per genome). Each sample was compared to a no-template control to verify the absence of external contamination.
Genomic PCRs
The DNeasy Blood & Tissue Kit (Qiagen) was used to extract genomic DNA according with the manufacturer’s instructions and using approximately 1x cells per sample. PCRs were performed using the Q5 High-Fidelity 2X Master Mix (NEB) for 25–35 cycles using the primers in Table S4.
Method details
Transfections
Cells were cultured in growth media at 37°C in a 5% CO2 incubator. The cells were then treated with 0.25% trypsin (Corning) and incubated for 7 min at 37°C in a 5% CO2 incubator. The cell suspension was subsequently neutralized with growth media at room temperature. Cells were centrifuged for 3 min at 200g followed by removal of the supernatant and cell resuspension in fresh growth media. Cells were then seeded in 12-well tissue culture plates (Corning) at 300,000 cells/well and cultured for 24 h at 37°C in a 5% CO2 incubator. Growth media was replaced with growth media without antibiotics. Transfection was then performed using Lipofectamine LTX (Invitrogen) according to the manufacturer’s instructions. After 24 h of incubation at 37°C in a 5% CO2 incubator, replacement for routine growth media supplemented with penicillin and streptomycin was performed. All transfections with rTetR-XTEN80-DNMT3A, rTetR-XTEN80-DNMT3A, rTetR-XTEN80-TET1 and rTetR-KRAB were performed using 500ng of the fusion protein and 300ng of trasfection marker (EYFP). Transfections with PhlF-KRAB that involved cell sorting were performed using 450ng of either PhlF-KRAB or PhlF, and 450ng of the transfection marker (450ng). Experiments of PhlF-KRAB without cell sorting were perfomed using 450ng of PhlF-KRAB and 450ng of the transfection marker (1:1 stochiometry), 45ng of PhlF-KRAB and 450ng of the transfection marker (0.1:1 stochiometry), or 4.5ng of PhlF-KRAB and 450ng of the transfection marker (0.01:1 stochiometry). An empty vector was used to maintain the DNA amounts constant for the various stoichiometries. Transfections of DNMT3A-dCas9 that involved cell sorting were performed using 500ng of DNMT3A-dCas9, 600ng of the gRNA expression vector, and 300ng of the trasfection marker (EYFP). Transfections of DNMT3A-dCas9 in Figures S11D and S11E used 190ng of DNMT3A-dCas9, 600ng of the gRNA expression vector, and 300ng of the trasfection marker (EYFP). In experiments using dCas9, 450ng of the gRNA expression vector, 300ng of either DNMT3A-dCas9 or dCas9, and 300ng of the transfection marker (EYFP) were used. All the experiments presented use 2 M of dox and 30 M of DAPG.
DNA construction
DNA cloning was performed using a combination of Golden Gate Assembly, Gibson Assembly, restriction cloning, and PCR cloning. Expression vectors for the epigenetic effectors were cloned using Golden Gate Assembly. The cloning was performed in a single reaction with the plasmids comprising a destination vector, cHS4 chromatin insulator, EF1a promoter, 5′ UTR, gene sequence, 3′ UTR, and polyadenylation signal. Gene sequences for rTetR-TET1, rTetR-XTEN80-TET1, rTeTR-DNMT3A, rTeTR-XTEN80-DNMT3A, PhlF-KRAB, rTetR-KRAB, and PhlF were synthesized and cloned into a plasmid with compatible overhangs for Golden Gate Assembly (Azenta Life Sciences). The expression vectors for TET1 and rTetR were cloned by performing PCR cloning using the rTetR-XTEN80-TET1 expression vector as a template. For the reporter gene, the region comprising the PhlF and rTetR binding sites was synthesized (Azenta Life Sciences) and subsequently cloned upstream of a vector containing the EF1a promoter using restriction cloning. Golden Gate Assembly was then performed in a reaction comprising the EF1a promoter with upstream DNA-binding sites, destination vector, upstream cHS4 chromatin insulator, 5′ UTR, EBFP2, 3′ UTR, and polyadenylation signal. The resulting vector was cloned along with the downstream cHS4 chromatin insulator into the chromosomal integration vector using Gibson Assembly. All sequences were sequenced verified via Sanger sequencing (Azenta Life Sciences) or whole plasmid sequencing (Primordium).
Flow cytometry
Cells were trypsinized using 0.25% trypsin (Corning) and incubation at 37°C in a 5% CO2 incubator for 7 min. Following this, trypsin was neutralized by adding growth media at room temperature. Afterward, the cells were centrifuged at 200g for 3 min. The supernatant was removed, and the resulting cell pellet was then resuspended in fresh growth media. The cell suspension was filtered through a filter cap into flow cytometry tubes (Falcon). Flow cytometry data was then collected using a flow cytometer (BD LSRFortessa). EBFP2 was measured using a 405nm laser and a 450/50 filter with 259 V, EYFP was measured using a 488nm laser and 530/30 filter with 190 V, mKO2 was measured using a 561nm laser and 610/20 filter with 250 V, iRFP was measured using a 637nm laser and 670/30 filter with 326 V. For flow cytometry in Figure S11, data was collected on a BD LSRFortessa with EBFP2 measured using a 405nm laser and a 450/40 filter with 332 V, and EYFP was measured using a 488nm laser and a 530/30 filter with 310 V. Cytoflow (version 1.2) was used for analyzing all flow cytometry data and obtaining cell densities, means, standard deviations, and percentages of flow cytometry measurements. Measurements reported in the study refer to arbitrary units of fluorescence. For analysis, measurements were gated for single cells based on forward and side scatter (Figure S13B), and only gated measurements were used for subsequent analysis. CHO-K1 cells (WT) were used to assess background fluorescence, which is comparable to the maximal silencing we observed when the genomic reporter was edited with DNMT3A or KRAB (Figure S13C). URCP-100-2H calibration particles (SPHERO) were used to ensure instrument linearity and no day-to-day variation within experiments (Figure S13D). After processing the flow cytometry data using Cytoflow, the data was plotted using either Cytoflow, GraphPad Prism or Matplotlib. Graphpad Prism was used for generating bar plots. For the correlation plot between mean EBFP2 fluorescence and DNA methylation, Cytoflow’s gated measurements were imported into MATLAB, where the correlation was performed using the polyfit function.
Time-course flow cytometry
A fluorescence-activated cell sorter (BD Aria) was used to obtain cells with specific levels of transfection (EYFP) or gene expression (EBFP2). For transfection experiments that involved cell sorting, cell sorting was performed between 42 and 48 h after transfection. When flow cytometry was performed on the day of cell sorting, a fraction of the cells that were sorted was utilized for flow cytometry measurements (BD LSRFortessa), and the rest of the cells were reseeded on tissue culture plates. For flow cytometry measurements, cells were trypsinized using 0.25% trypsin (Corning) followed by incubation at 37°C in a 5% CO2 incubator. Trypsin neutralization was performed with growth media at room temperature. The cells were then centrifuged for 3 min at 200g. The supernatant was removed followed by resuspension of the cells in fresh in growth media. A fraction of the cells was utilized for the acquisition of flow cytometry measurements (BD LSRFortessa). The rest of the cells were diluted (ratio between 1 and 20) and reseeded into new tissue culture plates for subsequent measurements.
Cell sorting
Cells were trypsinized using 0.25% trypsin (Corning) followed by incubation at 37°C in a 5% CO2 incubator for 7 min. Subsequently, trypsin was neutralized with growth media at room temperature. The cells were then centrifuged at 200g for 3 min. The supernatant was discarded and the cell pellet was resuspended in growth media supplemented with 2.5g/ml BSA and corresponding small molecules according to the experiment. The cell suspension was then filtered through a filter cap into flow cytometry tubes. Cell sorting was then performed using a fluorescence-activated cell sorter (BD Aria) approximately 42–48 h after transfection, and cells were collected in growth media supplemented with penicillin/streptomycin and corresponding small molecules according to the experiment. EBFP2 was measured using a 405nm laser and a 450/50 filter with 332 V. EYFP was measured using a 488nm laser and 515/20 filter with 200 V. For cell sorting in Figure S11, a BD Aria was used with EBFP2 measured using a 405nm laser and a 450/50 filter with 225 V, and EYFP was measured using a 488nm laser and a 530/30 filter with 186 V. Cells were sorted either for transfection level (EYFP) or gene expression level (EBFP2). During cell sorting for EBFP2, a gate was also applied to sort exclusively for EYFP negative cells. Cell sorting to study the gene expression dynamics after epigenetic editing using DNMT3A-dCas9 (Figures 2F, S2C and S2D) used ranges 100–200, 222–445, 500–1000 and 1150–2300 EYFP A.U.F. Figure 2F shows the 222–445 range. Cell sorting to study the gene expression dynamics after epigenetic editing using PhlF-KRAB (Figures 2H and S4A–S4C) used ranges 25–100, 100–1000, 1000–10000, and 10000–100000 corresponding to EYFP A.U.F. Figure 2H shows the 100–1000 range.
Cell sorting to study histone and DNA methylation changes using MeDIP-qPCR and ChIP-qPCR (Figures 2G and 2I) used ranges 3000–100000 EYFP A.U.F. Cell sorting to demonstrate the stability of DNMT3A-edited (DNMT3A-dCas9) cells expressing intermediate levels of gene expression (Figure 3B) was performed according to <100 A.U.F for low expression, 200–500 A.U.F for intermediate expression, and >1000 A.U.F for high expression. Cells had been edited with transfection of DNMT3A-dCas9 targeted to the upstream binding sites of the promoter (Tables S1 and S3), cell sorting for 7000–100000 EYFP A.U.F., and were maintained in culture for 2 weeks. Cell sorting to subdivide cell populations from stable intermediate levels of gene expression (Figure 3K) was performed according to <100 A.U.F for bin 1, 179–224 for bin 2, 400–500 for bin 3, >1000 A.U.F for bin 4. Cells expressing intermediate levels of gene expression had been sorted from DNMT3A-edited cells (DNMT3A-dCas9) from the range 300–400 EBFP A.U.F. and maintained in culture for 2 weeks. Cells had been edited with transfection of DNMT3A-dCas9 targeted to the upstream binding sites (Tables S1 and S3), cell sorting for 3000–100000 EYFP A.U.F. (transfection marker), and maintained in culture for 2 weeks. Cell sorting for rTetR-XTEN80-DNMT3A (Figure 5A) and rTetR-KRAB (Figure 5C) transfections in cells with stable intermediate levels of expression used 3000–100000 EYFP A.U.F. Cell sorting for rTetR-XTEN80-TET1 (Figure 5B) transfections in cells with stable intermediate levels of expression used 410–2800 EYFP A.U.F. Cells expressing low and intermediate levels of gene expression had been sorted from DNMT3A-edited cells (DNMT3A-dCas9) using <100 A.U.F for low expression and 200–500 A.U.F for intermediate expression and maintained in culture for 2 weeks. Cells had been edited with transfection of DNMT3A-dCas9 targeted to the upstream binding sites (Tables S1 and S3), cell sorting for 3000–100000 EYFP A.U.F., and maintained in culture for 2 weeks. Cell sorting to study the gene expression dynamics after epigenetic editing using rTetR-XTEN80-DNMT3A (Figures S3A–S3C) used ranges 60–445, 500–7000, and 7000–100000 EYFP A.U.F. Figure S3A shows the 60–445 range. Cell sorting to demonstrate the stability of DNMT3A-edited (rTetR-XTEN80-DNMT3A) cells expressing intermediate levels of gene expression (Figure S8) was performed according to <80 EBFP2 A.U.F for low expression, and ranges 304–380, 400–500, and 530–662 EBFP2 A.U.F for intermediate expression. The cells had been edited with transfection of rTetR-XTEN80-DNMT3A, supplementation of dox for 8 days, cell sorting for 60–445 EYFP A.U.F., and maintained in culture for 2 weeks. Cell sorting to study the effect of DNMT3A-dCas9 on monoclonal population 8 used ranges 100–200, 210–450, 500–1000, 1100–2200 EYFP A.U.F. (Figures S11C–S11E). The cells from the 1100–2200 EYFP A.U.F. range were used for subsequent subdivision after maintenance in culture for 2 weeks. Cell sorting in Figure S11F used ranges 100–180, 250–310, 480–600, 800–1000, 1610–2000, 3210–4000, 5500–6850, 11000–13700 EBFP2 A.U.F.
Single-cell sampling and expansion
Single cell sorting was performed using a fluorescence-activated cell sorter (BD Aria) for various levels of EBFP2 fluorescence. For the cells exhibiting low levels of fluorescence, single cells were obtained from range 21–30 A.U.F. and range 50–60 A.U.F. For the cells exhibiting intermediate levels of fluorescence, single cells were obtained from range 200–240 A.U.F, range 254–305 A.U.F, range 320–384 A.U.F, and range 417–500 A.U.F. For the cells exhibiting high levels of fluorescence, single cells were obtained from range 1000–1200 A.U.F and range 1750–2100 A.U.F. From each range, 90 single cells well obtained for a total of 720 single cells, each seeded on a separate well of a 96-well tissue culture plate (Corning) with growth media supplemented with penicillin/streptomycin. Single cells were then cultured under standard cell culture conditions at 37°C in a 5% CO2 incubator. Growth media was replaced independently for each well every four days. After 14 days of cell culture, 93 monoclonal populations were observed, representing 10 monoclonal populations for each of the 8 ranges sampled. The cells in Figures 3B–3E were used for single-cell sampling at day 29.
DNMT1 inhibition
Cells were seeded and cultured in growth media at 37°C in a 5% CO2 incubator for 24 h. Subsequently, DNMT1 inhibition was initiated by growth media supplementation with 10μM of 5-azacytidine. Beginning a day after supplementation, the cells were measured by flow cytometry every day, diluted 1:2, and reseeded in fresh media supplemented with 10μM 5-azacytidine. The intermediately DNMT3A-silenced clonal cells had been cultured after single-cell sorting for 232 days.
MeDIP-qPCR and ChIP-qPCR
MeDIP-qPCR was performed using the MagMeDIP-qPCR kit (Diagenode) according to the manufacturer’s instructions. For MeDIP-qPCR, genomic DNA was extracted using the DNeasy Blood & Tissue Kit (Qiagen) according to the manufacture’s instructions using approximately 0.5x cells, followed by dilution to 100ng/ l. ChIP-qPCR was performed using the True MicroChIP-seq kit (C01010132, Diagenode) according to the manufacturer’s instructions using approximately 1x cells per immunoprecipitation reaction, which were cross-linked for 8 min at room temperature with methanol-free formaldehyde (Thermo Fisher) at a concentration of 1%. Sonication was performed using a Bioruptor Pico (Diagenode). We utilized the anti-5mC antibody in the MagMeDIP-qPCR kit (Diagenode) for DNA methylation. We utilized antibodies for H3K4me3 (C15410003, Diagenode) and H3K9me3 (C15410056, Diagenode) for histone methylation. The qPCRs were performed using Power SYBR Green PCR Master Mix (Applied Biosystems) and primers in Table S4 using a LightCycler 96 (Roche). Melt curves were used to confirm a single amplification product and the LightCycler 96 Application Software (Roche) was used to obtain the values. For the reporter gene, the promoter region was amplified. -Actin (H3K4me3) and IgF2 (H3K9me3 and DNA methylation) were used as reference endogenous genes.22
Targeted bisulfite sequencing
Genomic DNA was extracted using the DNeasy Blood & Tissue Kit (Qiagen) according to the manufacture’s instructions. Samples were then submitted to Zymo Research for conducting the rest of the assay. Assays targeting CpG sites along the chromosomal integration region comprising the continuous sequence covering all DNA binding sites, promoter and EBFP2 were developed using the Rosefinch tool from Zymo Research (Table S4). Primers were used to amplify bisulfite-converted DNA from the samples, using the EZ DNA Methylation-Lightning Kit for conversion. Amplified products were pooled, barcoded, and prepared for sequencing on an Illumina MiSeq system with a V2 300bp kit, following paired-end sequencing. Trim Galore was used to trim low quality nucleotides and adapter sequences. FastQC was used to evaluate sequence quality. Sequencing reads were aligned to the chromosomal integration sequence using Bismark. The estimation of methylation levels for each sampled cytosine was calculated by dividing the count of reads identifying a cytosine (C) by the sum of reads identifying either a cytosine (C) or a thymine (T).
Design of gRNA sequences
Sequences for gRNAs were designed in Benchiling using the chromosomal integration sequence and the CHO-K1 genome sequence (National Library of Medicine (NCBI) RefSeq assembly GCF_000223135.1).
Protein structure predictions
Protein structure predictions were performed with AlphaFold273 using MMseqs2 on the Google Colab Platform (ColabFold v1.5.5)74 on GPUs. AlphaFold2 was used to perform structural predictions using 5 models for each protein. The top ranking predicted structure (according to plDDT score for single chains and pTM score for protein complexes) was then relaxed using Assisted Model Building with Energy Refinement (AMBER) relaxation. The protein structures of PhlF-KRAB, rTetR-KRAB, PhlF, and rTetR were predicted as multimers (homodimers). The protein structures of DNMT3A-dCas9, rTetR-XTEN80-DNMT3A, rTetR-XTEN80-TET1 were predicted as single chains. The structures of rTetR-XTEN80-TET1 and rTetR-XTEN80-DNMT3A were then aligned using the predicted homodimer of rTetR as a template in PyMOL (2.5.2). Minimal rotations ( 1 per linker) were performed at the flexible linkers in PyMOL.
Mathematical model and computational analysis
The chromatin modification model of Figure 2J is comprised of the chemical reactions listed below:
in which represents nucleosomes with methylated DNA, and the other species are defined in the STAR Methods subsection “Model of chromatin modification circuit.” The chromatin modification circuit model of Figure 2J is a reduction of the full chromatin modification circuit depicted in Figure S5, which was previously published in Bruno et al.75 This circuit includes both DNA methylation and histone modifications, the positive feedback loop between DNA methylation and H3K9me3, without making any assumptions on the relative time scales between DNA methylation and histone modifications. The chromatin modification circuit model of Figure 2J can be obtained from this circuit by removing the recruitment of DNA methylation by H3K9me3, thereby breaking the positive feedback loop between H3K9me3 and DNA methylation, and by removing the decay rate of DNA methylation, assumed slow compared to that of histone modifications. See STAR Methods subsection “Derivation of model (11)” for the derivation.
For the plots obtained in Figure 2K, we used the reaction rate equation model corresponding to the reactions listed above, using the law of mass action.76 This is the deterministic system of ordinary differential equations (ODEs) given in Equation 11 with gene expression dynamics ODE model given in Equation 10 with . In order to obtain the dose-response curves show in Figure 2K, the system of ODEs was simulated using MATLAB and endpoint simulation quantities were plotted.
The plots of Figures 2L and 2M were obtained by simulating the chemical reaction system in Tables S1 and S4 using Gillespie’s Stochastic Simulation Algorithm (SSA).38 Specifically, we obtained different sample temporal paths using the SSA starting at the indicated initial condition and with parameter values listed at the end of the STAR Methods, and then computed the distributions by obtaining histograms of the final values of the species abundance across the different sample paths. The SSA is a Monte Carlo method that generates temporal trajectories of a chemically reacting system while accounting for the randomness of chemical reactions. We used Gillespie’s first reaction method38 and the MATLAB codes have been deposited on GitHub and archived on Zenodo (DOI: 15345726).
Derivation and characterization of mathematical models
Model of chromatin modification circuit
The chromatin modification circuit reaction model considered in our study, shown in Figure S5A, was constructed starting from the one in Bruno et al.75 and it is based on known molecular interactions from the literature. Here, we give a brief description of the reaction model (see Bruno et al.75 for additional details). The chromatin modifications considered in the circuit are H3K9 methylation (H3K9me3), H3K4 methylation (H3K4me3), and DNA methylation. The basic unit of the model is the nucleosome with DNA wrapped around it, D, that can be modified with H3K4me3, , DNA methylation, , H3K9me3, , or both H3K9me3 and DNA methylation, . The reaction model can be graphically represented by the circuit of Figure S5A, whose associated reactions are given in Table S5. More precisely, writer enzymes have the ability to de novo establish chromatin marks. Additionally, histone modifications recruit writer enzymes for the same modification to nearby modifiable nucleosomes (auto-catalysis),11,19,20,48 and DNA methylation and repressive histone modification cooperate by recruiting each other’s writer enzymes (cross-catalysis).40 Ultimately, these modifications can be passively removed through dilution during DNA replication or by the action of eraser enzymes (basal erasure). Both activating and repressive modifications are involved in the recruitment of each other’s eraser enzymes (recruited erasure).12,45,46,47
Now, defining the number of as , , , , and , let us derive the related ordinary differential equation (ODE) model in terms of the fractions , , , and , with the total number of nucleosomes in a gene of interest. This can be done by assuming that is sufficiently large so that , , , and can be treated as real number. Now, let us introduce where is the reaction volume, and define the normalized inputs: , , , , , and . Furthermore, let us introduce the following dimensionless parameters:
| (Equation 1) |
in which (, ) is the non-dimensional rate constant associated with auto-catalysis (cross-catalysis). Let
| (Equation 2) |
with such that and such that . Then, encapsulates the asymmetry between the erasure rate of repressive histone modifications (the DNA demethylation rate) and the erasure rate of activating marks. Furthermore, given that , , and , is a parameter that scales the ratio between the basal (recruited) erasure rate and the auto-catalysis rate of each modification. Considering the normalized time , the ODEs associated with the chromatin modification circuit are
| (Equation 3) |
with (see Bruno et al.75 for full derivation). It is important to note that in our reporter gene, the expression of fluorescent protein EBFP2 is driven by a constitutive promoter. This means that the promoter remains active continuously under normal conditions, without the need for external activators. This is the reason why, for , we introduce a constant non-basal establishment term . As for the other species, we will describe in the following section how external inputs, such as KRAB, DNMT3A, and TET1, modulate the parameters of the chromatin modification circuit model.
How KRAB, DNMT3A, and TET1 modulate model parameters
In our study, we exploit KRAB, DNMT3A, and TET1 to modulate chromatin modification state and they affect the model parameters as follows. KRAB is a well-known epigenetic effector that mediates the recruitment of H3K9me3 writers.42 As we introduce KRAB through transient transfection, its concentration will gradually decrease due to dilution until it eventually dies out. Furthermore, KRAB is fused to the DNA binding domain (DBD) PhlF. This fusion not only allows us to obtain sequence-specific recruitment, but also to regulate the binding and unbinding of KRAB through DAPG, as DAPG inhibits the ability of PhlF-KRAB to bind to DNA. To incorporate these aspects into our model, we write the expression for , i.e., reaction rate constant for H3K9me3 establishment (see reactions in Table S5), as follows:
| (Equation 4) |
in which is the total amount of PhlF-KRAB, is a parameter independent of PhlF-KRAB, is the dissociation constant related to the binding - unbinding reactions between DAPG and PhlF-KRAB, and is the dilution rate constant. Then, the normalized input in the ODE model (3) can be written as
| (Equation 5) |
in which we define and .
The enzyme DNMT3A is a DNA methyltransferase that catalyzes de novo DNA methylation (Allis et al.,11 Chapter 5) and, as a consequence, it modulates the rate of de novo DNA methylation establishment (see reactions in Table S5). As for KRAB, we introduce DNMT3 through transient transfection, and therefore its concentration will gradually decrease. To incorporate this aspect into our model, we can modify the expression for as follows:
| (Equation 6) |
in which is the total amount of DNMT3A and is a parameter independent of DNMT3A. Then, the normalized input in the ODE model (3) can be written as
| (Equation 7) |
in which we define and .
Finally, TET1 is an enzyme that catalyzes the conversion of DNA methylation into modifications that are not recognized by the maintenance enzyme DNMT1, and hence are passively removed through dilution.12,45,69 This enzyme then modulates the constants and in the reactions of Table S5. As for KRAB, we introduce TET1 through transient transfection and we fuse it to a DBD. More specifically, we employ rTetR, a DBD that binds DNA exclusively in the presence of doxycycline (dox). To incorporate these aspects into our model, we can modify the expressions for and as follows:
| (Equation 8) |
in which is the component of the rate coefficient that does not depend on the external rTetR-TET1 transiently transfected, is the total amount of rTetR-TET1 transiently transfected, and are parameters independent of rTetR-TET1, and is the dissociation constant related to the binding - unbinding reactions between dox and rTetR-TET1. Then, the non-dimensional parameter in the ODE model (3) can be written as
| (Equation 9) |
in which we define , , and . From 9, it is possible to note that, without transfection of rTetR-TET1, , indicating that is non-zero. This is because of the presence of passive DNA demethylation. However, previous experimental studies suggest that passive DNA demethylation, although present, is a slow process.22,44
Model of transcriptional regulation
The gene expression process involves two main steps. The first step is transcription, during which the genetic information is transcribed from DNA into mRNA (m). The second step is translation, during which the mRNA is translated into the gene product (X). Chromatin state affects transcription by modulating nucleosome compaction and then gene expression.11,77 Thus, we assume that transcription is predominantly allowed by , while allowing a low level of transcription to D and to all the species characterized by repressive chromatin modifications. Furthermore, mRNA m is subject to dilution due to cell division and degradation, while the gene product X is subject to dilution only. This is because fluorescent reporters are highly stable and thus not degraded but only diluted.76 The reactions associated with the gene expression model can be found in Table S8. Considering the normalized time and introducing the non-dimensional parameters , , , , , the ODE model for X and m can be written as
| (Equation 10) |
In this paper, we will refer to the model that combines the complete chromatin modification circuit model (Equation 3) along with the transcriptional regulation model outlined in this section as the “4D + X model”.
Effect of and on the behavior of the system
In this section, we analyze the stochastic behavior of the 4D + X model with a specific focus on understanding how the parameters (normalized rate of DNA methylation establishment through repressive histone modifications) and (ratio between the DNA demethylation rate and the activating histone modification erasure rate) affect the probability distribution of gene expression levels and thus the type of memory that can be achieved.
When the rate of DNA demethylation is 0 , achieving analog memory is possible only when repressive histone modification (H3K9me3) does not catalyze the de novo establishment of DNA methylation (Figure S5B). For , the gene expression level shifts either to a low or high level. Furthermore, the higher , the more the distribution tends to shift toward a low gene expression level (Figure S14 - top panel). When is different from 0, only binary memory can be achieved (Figure S14 - intermediate and bottom panels). Moreover, the higher is, the more the gene expression level shifts toward a high level (Figure S14 - bottom panel). Overall, these results suggest that analog memory can be achieved only when DNA demethylation rate is sufficiently small compared to histone modification erasure rate (that is, can be approximated as zero on the time scales of interest) and DNA methylation is not catalyzed by repressive histone modifications .
If these conditions are verified, and external inputs (KRAB, DNMT3A, and TET1) are not applied or are applied transiently (transient transfection), then the number of nucleosomes with methylated DNA stabilizes at a constant value. Introducing the variable , that is, the fraction of nucleosomes with methylated DNA in the gene, the behavior of the original model (3) can be captured by a reduced 2D ordinary differential equation (ODE) model below:
| (Equation 11) |
with and (full derivation below). A representative diagram of the reduced system is depicted in Figure S5C. It is important to point out that in the original model of chromatin modification circuit, it is assumed that the basic unit is a nucleosome with DNA wrapped around it and with only one modifiable CpG.75 There are cases in which this assumption is not verified. However, it is possible to demonstrate that removing this assumption yields a similar reduced model. This indicates that the assumption does not impact the qualitative results regarding the influence of the parameters and on the probability distribution of gene expression levels (see STAR Methods subsection “Modification to the chromatin modification circuit model to include more than one CpG per nucleosome”).
Derivation of model (11)
Let us first merge the rates linked to the enhancement of H3K9me3 establishment by into a single rate. We will assume that this rate is identical to the one associated with the enhancement of H3K9me3 establishment by . Similarly, let us merge the rates linked to the recruited erasure of H3K4me3 by into a single rate and assume that this rate is identical to the one associated with the recruited erasure of H3K4me3 by . These simplifying assumptions do not affect the qualitative results related to the effect of the cooperative and competitive interactions among chromatin modifications on epigenetic cell memory. The ODE system (3) can then be rewritten as
| (Equation 12) |
Now, let us rewrite system (12) by assuming negligible basal de novo establishment and introducing the variable , which corresponds to the fraction of nucleosomes with methylated DNA. We then obtain
| (Equation 13) |
with . Based on previous experimental results22 and our experimental findings (see Figure 2I), we can further simplify our model by accounting for the fact that the enhancement of DNA methylation establishment by H3K9me3 is negligible, i.e., . The ODE system (13) can then be rewritten as
| (Equation 14) |
with . Now, let us rewrite the ODE system (14) introducing the expressions for , , and derived in the previous subsection (Expressions (5), (7), and (9)):
| (Equation 15) |
with and . Let us now consider the parameter regime in which the rate of DNA demethylation, without transfection of rTetR-TET1, is sufficiently low compared to histone modification dynamics, i.e., . This assumption is consistent with experimental data that suggest that the passive DNA demethylation is a slow process.22,44 The system (15) can then be rewritten as follows:
| (Equation 16) |
with and . From 16, it follows that, after a temporary phase during which the externally transfected inputs decrease, and then , and the last two ODEs in 16 depend only on , , and . This implies that, after a transient change, stabilizes at a constant value and the behavior of the original model can be captured by the reduced two-dimensional ODE model given by the last two equations in 16, that corresponds to the ODE model in 11.
Effect of and on the behavior of the system
In this section, we analyze the deterministic and stochastic behavior of our system, with a specific focus on understanding the effect of parameters (parameter that scales the ratio between the basal erasure rate and the auto-catalysis rate of each modification) and (parameter that scales the ratio between the recruited erasure rate and the auto-catalysis rate of each modification) on the probability distribution of gene expression levels.
We first conduct a deterministic analysis by studying the ODE model associated with the simplified chromatin modification circuit, i.e., Equation 11, and determine how and affect the value of at the equilibrium for different fractions of methylated CpGs in the gene, i.e., (Figure S7A). When is large, the system consistently exhibits a unique stable steady state characterized by low (Figure S7A). The value of is smaller for higher . As decreases, the steady-state value of increases, particularly for low values of , where (Figure S7A). Further reduction in results in the system becoming bistable for intermediate values of (Figure S7A). Changes in do not significantly impact these trends, except for when is small. In such cases, larger values of reduce the range of for bistability and diminish the difference in the values of between the two steady states (Figure S7A). We next study the impact of and on the number of methylated CpGs at equilibrium for various initial levels of DNMT3A, represented by as defined in Equation 6 (Figure S15A). In general, increases with higher . However, for large values of , remains low over an extended range of values. As decreases, reaches higher values. In the case of small , an increase in results in a more ultrasensitive curve.
Overall, this analysis suggests that high fractions of nucleosomes with H3K4me3, and then a high level of gene expression, is achievable only when is sufficiently small (Figure S7A). In this parameter regime, when is sufficiently high, has an ultrasensitive response to DNMT3A transient dosage and then different ranges of DNMT3A pulse height would lead to either high or low gene expression levels only (Figures S5D and S15A). To validate these findings, we use the Stochastic Simulation Algorithm (SSA)38 to perform a computational study on the 4D + X model, that is, the model that incorporates the full chromatin modification circuit, whose reactions are listed in Tables S5 and S8 (Figures S5E and S15B). Here, we consider several ranges of the initial amount of DNMT3A transfected , which are uniformly distributed.
For different ranges of initial DNMT3A transfection levels, a bimodal probability distribution of gene expression levels emerges when is sufficiently large (Figure S15B, left-hand side plots). As decreases, the stationary distribution becomes unimodal (Figure S15B, left-hand side plots). When considering ranges that include higher values of , it becomes more evident how the unique peak of the distribution shifts toward lower gene expression levels for sufficiently high values of (Figure S5E, left-hand side plots; Figure S15B, right-hand side plots).
We then conduct a similar computational analysis to assess the impact of and on number of methylated CpGs and gene expression level across various initial levels of TET1 enzyme (Figures S15C and S15D). The obtained results mirror those described in the previous analysis (Figures S15A and S15B).
By looking at our experimental results involving transfection of KRAB (Figure 2H), it is possible to note that the probability distribution of gene expression levels is bimodal. Previous computational study suggests that this behavior can be obtained only for sufficiently small values of 75,78. Furthermore, our experimental results involving transfection of DNMT3A (Figures 2F and S2C) also show a bimodal probability distribution for gene expression levels. Based on our mathematical analysis (Figures S15A and S15B), these outcomes can be observed only when is sufficiently small and is sufficiently large. We validate these hypotheses by successfully replicating our experimental data via simulations considering this parameter regime, i.e., small , large (Figure S6).
Refined chromatin modification circuit model to replicate the bisulfite sequencing data from the monoclonal experiment (Figure 4)
In the study conducted in the previous sections, we observed that in the absence of external stimuli and when and can be assumed equal to zero, the chromatin modification circuit model, whose associated reactions are listed in Table S5, predicts a “frozen” level of DNA methylation, i.e., . However, this model does not perfectly align with the experimental results obtained from the bisulfite sequencing data of the synthetic reporter system in the eight different clones, each once associated with a distinct EBFP2 expression level (Figures 4A, 4B and 4D–4G). In fact, for each clone, we do not observe a single value for methylated CpGs across the entire cell population. Instead, we observe a distribution with mean and standard deviation that remain approximately constant throughout the observed time points (Figure 4G). Notably, the distribution of methylated CpGs appears more spread for clones associated with an intermediate level of methylated CpGs.
In order to recapitulate this observed behavior, we incorporated in the full model known molecular mechanisms involved in the DNA methylation maintenance process that were omitted before. Specifically, in the original full model we have
| (Equation 17) |
in which represents the net passive demethylation rate constant obtained from the balance between the dilution and the DNMT1 maintenance process. In order to recapitulate the experimental results, we added the following reactions:
| (Equation 18) |
It is important to point out that, for the sake of simplicity in the description of the DNA methylation maintenance model above, we used, with abuse of notation, to denote any species with DNA methylation and to denote any species with H3K9me3. The first reaction in 18 represents the possibility that the DNA methylation maintenance enzyme DNMT1 can cause de novo DNA methylation, although at a very low rate (low ).51,52 The second reaction represents the fact that H3K9me3 helps recruit DNMT1 to the gene and then reinforces DNMT1-mediated maintenance.50 Finally, we introduce the third reaction since H3K9me3 helps recruit DNMT1 and DNMT1 can, by mistake, establish DNA methylation on previously unmethylated sites.
Considering the normalized time and introducing the non-dimensional parameters , , , , the ODE describing the dynamics of fraction of nucleosomes with methylated DNA, that is, , can be then rewritten as
| (Equation 19) |
Then, the average level of DNA methylation remains approximately constant in time when , that is, if all the reaction rate constants are sufficiently small .
This is confirmed by the computational study conducted by simulating the reactions of the refined chromatin modification circuit model that incorporates the DNA methylation maintenance model described above, whose reactions are listed in Table S10, using the SSA (Figure S10A). Additionally, this analysis shows that the reactions encapsulating the recruitment mechanism of DNMT1 by H3K9me3 (the second and the third in 18) are crucial for obtaining broader distributions in correspondence of intermediate levels of methylated CpGs (Figure S10B). In fact, as illustrated in the figure, when the rate constants associated with these reactions are low, all distributions have approximately the same height and width. However, higher values of these rates result in broader distributions at intermediate levels of methylated CpGs.
4D + X model: Reactions
Chromatin modification circuit
In this reaction model, it is assumed that the rate constants of the establishment, auto and cross-catalytic, and erasure processes of H3K9me3 (DNA methylation) do not change if the other repressive mark is present on the same modifiable unit. The chromatin modification circuit reaction model can then be written as in Table S5.
When DNA methylation is not included in the system, the only species included in the model are , , and , and the reactions included in the model become only the ones listed in Table S6.
Finally, the model represented in Figure 2J can be obtained starting from the full chromatin modification circuit model (Table S5) and assuming negligible basal de novo establishment , merging the rates linked to the enhancement of H3K9me3 establishment by into a single rate, merging the rates linked to the recruited erasure of H3K4me3 by into a single rate, assuming negligible DNA methylation establishment by H3K9me3, and assuming a DNA demethylation rate, without transfection of rTetR-TET1, sufficiently low compared to histone modification dynamics. The reaction list associated with this model can be written as in Table S7 and the corresponding ODE model can be found in 16.
Transcriptional regulation
The reactions associated with the gene expression model described in the STAR Methods subsection “Model of transcriptional regulation” are listed in Table S8. Here, m is the mRNA, X is the gene product, is the decay rate constant of m due to dilution and degradation, is the decay rate constant of X due to dilution, are the transcription rate constants, with , and is the translation rate constant.
When DNA methylation, and then species and are not included in the chromatin modification circuit, then the gene expression model can be simplified as in Table S9.
DNA methylation maintenance and refined chromatin modification circuit
To recapitulate the experimental results shown in Figure 4G, modifications need to be made to the reaction system that models the DNA methylation maintenance process, as explained in STAR Methods subsection “Refined chromatin modification circuit model to replicate the bisulfite sequencing data from the monoclonal experiment (Figure 4)”. The reaction model for the chromatin modification circuit can then be revised and expressed as detailed in Table S10.
Modification to the chromatin modification circuit model to include more than one CpG per nucleosome
In the original model of the chromatin modification circuit, it is assumed that the basic unit, that is a nucleosome with DNA wrapped around, can have only one modifiable CpG.75 There are cases in which this assumption is not verified. However, it is possible to show that this assumption does not affect the qualitative results related to the effect of the parameters (normalized rate of DNA methylation establishment through repressive histone modifications) and (ratio between the DNA demethylation rate and the activating histone modification erasure rate) on the probability distribution of gene expression levels.
To this end, let us assume that the basic unit of the model, D, is the nucleosome with DNA wrapped around it with two modifiable CpGs. The conclusion obtained in this section would not change if we consider the DNA wrapped around the nucleosome having modifiable CpGs. The basic unit of the model can then be modified with H3K4me3, , one DNA methylation, , H3K9me3, , one DNA methylation and H3K9me3, , two DNA methylations, , and two DNA methylation and H3K9me3, . Now, defining the number of as , , , , , and , let us derive the related ordinary differential equation (ODE) model in terms of the fractions , , , , , and , with the total number of nucleosomes on a gene of interest. This can be done by assuming that is sufficiently large so that , , , and can be treated as real number. Now, considering the dimensionless parameters and normalized inputs introduced in the STAR Methods subsection “Model of chromatin modification circuit” and the normalized time , the ODEs associated with the chromatin modification circuit are
| (Equation 20) |
with .
As we did for the original ODE system (3), let us merge the rates linked to the enhancement of H3K9me3 establishment by , , and into a single rate. We will assume that this rate is identical to the one associated with the enhancement of H3K9me3 establishment by . Similarly, let us merge the rates linked to the recruited erasure of H3K4me3 by , , and into a single rate and assume that this rate is identical to the one associated with the recruited erasure of H3K4me3 by . The ODE system (20) can then be rewritten as
| (Equation 21) |
Now, let us rewrite system (21) by assuming negligible basal establishment and introducing the variable , which corresponds to the fraction of nucleosomes with one methylated CpG, and the variable , which corresponds to the fraction of nucleosomes with at least one methylated CpG. We then obtain
| (Equation 22) |
with . As we did for the ODE system associated with our original chromatin modification circuit in STAR Methods subsection “Model of chromatin modification circuit”, we can further simplify our model by assuming that the enhancement of DNA methylation establishment by H3K9me3 is negligible, i.e., . The ODE system (22) can then be rewritten as
| (Equation 23) |
with . Now, introducing in 23 the expressions for , , and previously derived (Expressions (5), (7), and (9)), we obtain:
| (Equation 24) |
with . Furthermore, if we consider the parameter regime in which the rate of DNA demethylation, without transfection of rTetR-TET1, is sufficiently low compared to histone modification dynamics,i.e., , the system (24) can then be rewritten as follows:
| (Equation 25) |
with . From the fourth equation in 25, it is possible to notice that, following a temporary phase during which the externally transfected inputs decrease, , and then . This implies that, once the transient phase concludes, stabilizes at a constant value and the behavior of the original model can be captured by a reduced 2D model described by the subsequent ODEs:
| (Equation 26) |
with and . It is evident that the ODE model (26) closely resembles the original model, in which we assume to have one CpG per nucleosome, i.e., (11), with the only distinction being the presence of instead of . That is, in this revised model, the representation of the fraction of the number of methylated CpGs in the gene is denoted by . Then, the introduction of the simplifying assumption, having one CpG per nucleosome, does not affect the qualitative results of the analysis that focuses on identifying the impact of the parameters (normalized rate of DNA methylation establishment through repressive histone modifications) and (ratio between the DNA demethylation rate and the activating histone modification erasure rate) on the probability distribution of gene expression levels.
Values of parameters and rationale for parameter selections used in generating figures in the main paper and SI
In our model, we defined the total number of nucleosomes within a gene of interest as . Assuming approximately one nucleosome per 200 bp79 and given that the length of our gene is bp, then in our computational study we set the value of to 15.
With regard to the effect of dilution due to cell growth, we use a standard approach by modeling it as a first-order decay reaction.80 The decay rate constant can then be expressed as , where corresponds to the cell cycle length. Given that in our experiments we use CHO cells, whose cycle length has been estimated to be 20 h,22 in the simulations aimed at replicating our experimental data, we set (corresponding to h).
Additionally, our mathematical study in Methods subsection “Effect of and on the behavior of the system” suggests that our 4D + X model can replicate the experimental data related to the transfection of KRAB and DNMT3A (Figure 2) only within a parameter regime characterized by small and large . The expressions defining these parameters are given in Equation 2. Therefore, we set , , and , since these parameter values, along with the previously listed values of and , allow us to be within the desired parameter regime and accurately replicate our experimental results.
Regarding the other parameters, we considered multiple values and selected those that allow us to accurately replicate all experimental results presented in this paper.
Parameter values used to realize the plots in Figure 2K - top panel
, , , , , , , , , , , As initial conditions, we set , with .
Parameter values used to realize the plots in Figure 2K - bottom panel
, , , , , , , , , , . As initial conditions, we set .
Parameter values used to realize the plots in Figure 2L
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider = (0,0,0,15) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure 2M
, , , , , , , , , , , , , , , , , , , , , . We consider four initial conditions: = (14,0,1,0) (blue), = (6,0,8,1) (red), = (4,0,5,6) (yellow), = (1,0,1,13) (purple). For each case, the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figures 4H and 4J
, , , , , , , , , , , , , , , , , , , , , , , , . We consider eight initial conditions: = (13,0,0,2) (Bin 1), (12,0,0,3) (Bin 2), (8,0,0,7) (Bin 3), (6,0,0,9) (Bin 4), (5,0,0,10) (Bin 5), (4,0,0,11) (Bin 6), (1,0,0,14) (Bin 7), (0,0,0,15) (Bin 8). For each case, the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S5B
, , , , , , , , , , , , , , , , , , , , , . We consider four initial conditions: = (14,0,1,0) (blue), = (6,0,8,1) (red), = (4,0,5,6) (yellow), = (1,0,1,13) (purple). For each case, the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S5D - left hand-side panel
, , , , , , . As initial conditions, we set , with .
Parameter values used to realize the plots in Figure S5D - right hand-side panel
, , , , , , , , , , . As initial conditions, we set .
Parameter values used to realize the plots in Figure S5E
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider = (0,0,0,15) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S6C
, , , , , , , , , , , , , , , , . As initial condition, we consider = (0,15) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S6D
, , , , , , , ,
, , , , , , , , , , , , , . As initial condition, we consider = (0,0,0,15) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S7A
, , , , , , . As initial conditions, we set , with .
Parameter values used to realize the plots in Figure S7B - top panels
, , , , , , , , , , . As initial condition, we consider = (0,15 - ).
Parameter values used to realize the plots in Figure S7B - bottom panels
, , , , , , , , , , , , , , . As initial condition, we consider = (0,15 - ) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S9B
, , , , , , , , , , , , , , , , , , , , . As initial condition, we consider = (0,0,0,15) and the initial values of and at time were set to their steady states of the ODEs. The distribution shown is after 30 days.
Parameter values used to realize the plots in Figure S9C
, , , , , , , , , , , , , , , , , , , , . As initial condition, we consider values of from the simulations shown in Figure S9B and divide them with respect to their value of , following the following intervals: [0.100] (red), [200,800] (orange), [900,10000] (blue).
Parameter values used to realize the plots in Figure S9G
, , , , , , , , , , , , , , , , , , , , . As initial condition, we consider values of from the simulations associated with the orange distribution in Figure S9D and divide them with respect to their value of , following the following intervals: [0,80] (Bin 1), [90,150] (Bin 2), [250,400] Bin 3, [500,10000] (Bin 4).
Parameter values used to realize the plots in Figure S10A
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , . We consider four initial conditions: = (14,0,0,1) (blue), = (6,0,0,9) (red), = (4,0,0,11) (yellow), = (0,0,0,15) (purple). For each case, the initial values of and at time were set to their steady states.
Parameter values used to realize the plots in Figure S10B
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , . We consider four initial conditions: = (14,0,0,1) (blue), = (6,0,0,9) (red), = (4,0,0,11) (yellow), = (0,0,0,15) (purple). For each case, the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S12A
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider values of from the simulations associated with the orange distribution in Figure S9C.
Parameter values used to realize the plots in Figure S12B
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider values of from the simulations associated with the orange distribution in Figure S9C.
Parameter values used to realize the plots in Figure S12C
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider values of from the simulations associated with the orange distribution in Figure S9C.
Parameter values used to realize the plots in Figure S14
, , , , , , , , , , , , , , , , , , , , , . We consider four initial conditions: = (14,0,1,0) (blue), = (6,0,8,1) (red), = (4,0,5,6) (yellow), = (1,0,1,13) (purple). For each case, the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S15A
, , , , , , , , , , . As initial conditions, we set .
Parameter values used to realize the plots in Figure S15B
, , , , , , , (left hand-side panel) and (right hand-side panel), , , , , , , , , , , , , , . As initial condition, we consider = (0,0,0,15) and the initial values of and at time were set to their steady states of the ODEs.
Parameter values used to realize the plots in Figure S15C
, , , , , , , , , , . As initial conditions, we set .
Parameter values used to realize the plots in Figure S15D
, , , , , , , , , , , , , , , , , , , , , . As initial condition, we consider = (15,0,0,0) and the initial values of and at time were set to their steady states of the ODEs.
Quantification and statistical analysis
DNA methylation analysis
To determine the probability of a specific fraction of CpGs being methylated based on bisulfite sequencing data, which offers the probability of a specific CpG being methylated for the cell population, we employ the following method. This approach is applied to both bisulfite sequencing datasets presented in Figures 3G and 4E, resulting in the corresponding plots in Figures 3H and 4H, respectively.
Let us assume that the gene of interest has a total of CpGs. Then, for any , having CpGs methylated corresponds to the event where precisely CpGs are methylated, and all other CpGs (i.e., ) are unmethylated. Now, let us define the probability of the described event as , where . Additionally, let the probability of CpG being methylated be , and the probability of CpG being unmethylated be , where . The bisulfite sequencing data provide us with , and since the event “CpG unmethylated” is the complement of “CpG methylated”, is known and can be expressed as . Then, the probability that exactly CpGs are methylated out of a total of CpGs, where each CpG has its own probability of being methylated or not being methylated , can be calculated by using the following formula:
| (Equation 27) |
More precisely, the sum is taken over all possible subsets of events with exactly elements. For each subset , we calculate the product of the probabilities of the events in occurring . We then calculate the product of the probabilities of the events not in occurring . The overall probability is the sum of these products for all subsets . In simpler terms, the formula considers all possible combinations of events occurring out of events, multiplying the probabilities of those events occurring and the probabilities of the remaining events not occurring in each combination. The sum aggregates these probabilities for all valid combinations, providing the probability that exactly events happen.
As an illustrative example, let us consider the case in which the total number of CpG in the gene of interest are . Then, defining the probability of CpG to be and not to be methylated as and , with , we then can calculate the probability of having CpGs methylated, i.e., , with , using 27, obtaining , , , and . The probability distribution of the fraction of CpGs methyalated is then obtained by plotting for .
Linear regression analysis
Concerning the best-fit linear regression lines shown in Figures 3I, 4I, 4J and S11J they were obtained using MATLAB polyfit function. Furthermore, the p-value associated with the slope coefficient was determined through the MATLAB fitlm function.
Fold change calculation
Fold changes from qPCRs in the study were calculated using a standard method as described below. When calculating , the mean of across independent replicates was used.
Statistical analysis for fold changes and percentages
For statistical tests comparing percentages calculated from flow cytometry measurements or fold changes from MeDIP-qPCR and ChIP-qPCR, unpaired two sample t-tests were performed. Significance was defined using ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001.
Published: September 9, 2025
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.xgen.2025.100985.
Supplemental information
References
- 1.Holmberg J., Perlmann T. Maintaining differentiated cellular identity. Nat. Rev. Genet. 2012;13:429–439. doi: 10.1038/nrg3209. [DOI] [PubMed] [Google Scholar]
- 2.Chen T., Dent S.Y.R. Chromatin modifiers and remodellers: regulators of cellular differentiation. Nat. Rev. Genet. 2014;15:93–106. doi: 10.1038/nrg3607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zhang Q., Cao X. Epigenetic regulation of the innate immune response to infection. Nat. Rev. Immunol. 2019;19:417–432. doi: 10.1038/s41577-019-0151-6. [DOI] [PubMed] [Google Scholar]
- 4.Netea M.G., Joosten L.A.B., Latz E., Mills K.H.G., Natoli G., Stunnenberg H.G., O’Neill L.A.J., Xavier R.J. Trained immunity: A program of innate immune memory in health and disease. Science. 2016;352 doi: 10.1126/science.aaf1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lau C.M., Adams N.M., Geary C.D., Weizman O.-E., Rapp M., Pritykin Y., Leslie C.S., Sun J.C. Epigenetic control of innate and adaptive immune memory. Nat. Immunol. 2018;19:963–972. doi: 10.1038/s41590-018-0176-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Day J.J., Sweatt J.D. DNA methylation and memory formation. Nat. Neurosci. 2010;13:1319–1323. doi: 10.1038/nn.2666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zovkic I.B., Guzman-Karlsson M.C., Sweatt J.D. Epigenetic regulation of memory formation and maintenance. Learn. Mem. 2013;20:61–74. doi: 10.1101/lm.026575.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Thakore P.I., Black J.B., Hilton I.B., Gersbach C.A. Editing the epigenome: technologies for programmable transcription and epigenetic modulation. Nat. Methods. 2016;13:127–137. doi: 10.1038/nmeth.3733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Felsenfeld G. A brief history of epigenetics. Cold Spring Harbor Perspect. Biol. 2014;6 doi: 10.1101/cshperspect.a018200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kouzarides T. Chromatin modifications and their function. Cell. 2007;128:693–705. doi: 10.1016/j.cell.2007.02.005. [DOI] [PubMed] [Google Scholar]
- 11.Allis C.D., Caparros M.-L., Jenuwein T., Reinberg D. Second Edition. Cold Spring Harbor Laboratory Press; 2015. Epigenetics. [Google Scholar]
- 12.Huang S., Litt M., Blakey C.A. Academic Press; 2015. Epigenetic Gene Expression and Regulation. [Google Scholar]
- 13.Allis C.D., Jenuwein T. The molecular hallmarks of epigenetic control. Nat. Rev. Genet. 2016;17:487–500. doi: 10.1038/nrg.2016.59. [DOI] [PubMed] [Google Scholar]
- 14.Li E., Zhang Y. DNA methylation in mammals. Cold Spring Harbor Perspect. Biol. 2014;6 doi: 10.1101/cshperspect.a019133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cheng X. Structural and functional coordination of DNA and histone methylation. Cold Spring Harbor Perspect. Biol. 2014;6 doi: 10.1101/cshperspect.a018747. a018747–705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bannister A.J., Zegerman P., Partridge J.F., Miska E.A., Thomas J.O., Allshire R.C., Kouzarides T. Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain. Nature. 2001;410:120–124. doi: 10.1038/35065138. [DOI] [PubMed] [Google Scholar]
- 17.Jones P.L., Veenstra G.J., Wade P.A., Vermaak D., Kass S.U., Landsberger N., Strouboulis J., Wolffe A.P. Methylated DNA and MeCP2 recruit histone deacetylase to repress transcription. Nat. Genet. 1998;19:187–191. doi: 10.1038/561. [DOI] [PubMed] [Google Scholar]
- 18.Schneider R., Bannister A.J., Myers F.A., Thorne A.W., Crane-Robinson C., Kouzarides T. Histone H3 lysine 4 methylation patterns in higher eukaryotic genes. Nat. Cell Biol. 2004;6:73–77. doi: 10.1038/ncb1076. [DOI] [PubMed] [Google Scholar]
- 19.Hathaway N.A., Bell O., Hodges C., Miller E.L., Neel D.S., Crabtree G.R. Dynamics and memory of heterochromatin in living cells. Cell. 2012;149:1447–1460. doi: 10.1016/j.cell.2012.03.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang T., Cooper S., Brockdorff N. The interplay of histone modifications - writers that read. EMBO Rep. 2015;16:1467–1481. doi: 10.15252/embr.201540945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Pan G.J., Chang Z.Y., Schöler H.R., Pei D. Stem cell pluripotency and transcription factor Oct4. Cell Res. 2002;12:321–329. doi: 10.1038/sj.cr.7290134. [DOI] [PubMed] [Google Scholar]
- 22.Bintu L., Yong J., Antebi Y.E., McCue K., Kazuki Y., Uno N., Oshimura M., Elowitz M.B. Dynamics of epigenetic regulation at the single-cell level. Science. 2016;351:720–724. doi: 10.1126/science.aab2956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Nuñez J.K., Chen J., Pommier G.C., Cogan J.Z., Replogle J.M., Adriaens C., Ramadoss G.N., Shi Q., Hung K.L., Samelson A.J., et al. Genome-wide programmable transcriptional memory by CRISPR-based epigenome editing. Cell. 2021;184:2503–2519.e17. doi: 10.1016/j.cell.2021.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Angel A., Song J., Dean C., Howard M. A Polycomb-based switch underlying quantitative epigenetic memory. Nature. 2011;476:105–108. doi: 10.1038/nature10241. [DOI] [PubMed] [Google Scholar]
- 25.Saxton D.S., Rine J. Distinct silencer states generate epigenetic states of heterochromatin. Mol. Cell. 2022;82:3566–3579.e5. doi: 10.1016/j.molcel.2022.08.002. [DOI] [PubMed] [Google Scholar]
- 26.Dodd I.B., Micheelsen M.A., Sneppen K., Thon G. Theoretical Analysis of Epigenetic Cell Memory by Nucleosome Modification. Cell. 2007;129:813–822. doi: 10.1016/j.cell.2007.02.053. [DOI] [PubMed] [Google Scholar]
- 27.Millán-Zambrano G., Burton A., Bannister A.J., Schneider R. Histone post-translational modifications — cause and consequence of genome function. Nat. Rev. Genet. 2022;23:563–580. doi: 10.1038/s41576-022-00468-7. [DOI] [PubMed] [Google Scholar]
- 28.Policarpi C., Munafò M., Tsagkris S., Carlini V., Hackett J.A. Systematic epigenome editing captures the context-dependent instructive function of chromatin modifications. Nat. Genet. 2024;56:1168–1180. doi: 10.1038/s41588-024-01300-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Isbel L., Grand R.S., Schübeler D. Generating specificity in genome regulation through transcription factor sensitivity to chromatin. Nat. Rev. Genet. 2022;23:728–740. doi: 10.1038/s41576-022-00512-6. [DOI] [PubMed] [Google Scholar]
- 30.Cabrera A., Edelstein H.I., Glykofrydis F., Love K.S., Palacios S., Tycko J., Zhang M., Lensch S., Shields C.E., Livingston M., et al. The sound of silence: Transgene silencing in mammalian cell engineering. Cell Syst. 2022;13:950–973. doi: 10.1016/j.cels.2022.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Huang S., Eichler G., Bar-Yam Y., Ingber D.E. Cell fates as high-dimensional attractor states of a complex gene regulatory network. Phys. Rev. Lett. 2005;94 doi: 10.1103/PhysRevLett.94.128701. [DOI] [PubMed] [Google Scholar]
- 32.Stelzer Y., Shivalila C.S., Soldner F., Markoulaki S., Jaenisch R. Tracing Dynamic Changes of DNA Methylation at Single-Cell Resolution. Cell. 2015;163:218–229. doi: 10.1016/j.cell.2015.08.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Liu X.S., Wu H., Ji X., Stelzer Y., Wu X., Czauderna S., Shu J., Dadon D., Young R.A., Jaenisch R. Editing DNA Methylation in the Mammalian Genome. Cell. 2016;167:233–247.e17. doi: 10.1016/j.cell.2016.08.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Telford W.G., Hawley T., Subach F., Verkhusha V., Hawley R.G. Flow cytometry of fluorescent proteins. Methods. 2012;57:318–330. doi: 10.1016/j.ymeth.2012.01.003. [DOI] [PubMed] [Google Scholar]
- 35.Recillas-Targa F., Pikaart M.J., Burgess-Beusse B., Bell A.C., Litt M.D., West A.G., Gaszner M., Felsenfeld G. Position-effect protection and enhancer blocking by the chicken beta-globin insulator are separable activities. Proc. Natl. Acad. Sci. USA. 2002;99:6883–6888. doi: 10.1073/pnas.102179399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Stanton B.C., Siciliano V., Ghodasara A., Wroblewska L., Clancy K., Trefzer A.C., Chesnut J.D., Weiss R., Voigt C.A. Systematic transfer of prokaryotic sensors and circuits to mammalian cells. ACS Synth. Biol. 2014;3:880–891. doi: 10.1021/sb5002856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gam J.J., DiAndreth B., Jones R.D., Huh J., Weiss R. A ’poly-transfection’ Method for Rapid, One-Pot Characterization and Optimization of Genetic Systems. Nucleic Acids Res. 2019;47 doi: 10.1093/nar/gkz623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Gillespie D.T. Stochastic Simulation of Chemical Kinetics. Annu. Rev. Phys. Chem. 2007;58:35–55. doi: 10.1146/annurev.physchem.58.032806.104637. [DOI] [PubMed] [Google Scholar]
- 39.O’Geen H., Ren C., Nicolet C.M., Perez A.A., Halmai J., Le V.M., Mackay J.P., Farnham P.J., Segal D.J. dCas9-based epigenome editing suggests acquisition of histone methylation is not sufficient for target gene repression. Nucleic Acids Res. 2017;45:9901–9916. doi: 10.1093/nar/gkx578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Fuks F., Hurd P.J., Wolf D., Nan X., Bird A.P., Kouzarides T. The methyl-CpG-binding protein MeCP2 links DNA methylation to histone methylation. J. Biol. Chem. 2003;278:4035–4040. doi: 10.1074/jbc.M210256200. [DOI] [PubMed] [Google Scholar]
- 41.Margolin J.F., Friedman J.R., Meyer W.K., Vissing H., Thiesen H.J., Rauscher F.J., 3rd Krüppel-associated boxes are potent transcriptional repression domains. Proc. Natl. Acad. Sci. 1994;91:4509–4513. doi: 10.1073/pnas.91.10.4509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Schultz D.C., Ayyanathan K., Negorev D., Maul G.G., Rauscher F.J. SETDB1: a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins. Genes Dev. 2002;16:919–932. doi: 10.1101/gad.973302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ecco G., Imbeault M., Trono D. KRAB zinc finger proteins. Development. 2017;144:2719–2729. doi: 10.1242/dev.132605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Jackson M., Krassowska A., Gilbert N., Chevassut T., Forrester L., Ansell J., Ramsahoye B. Severe global DNA hypomethylation blocks differentiation and induces histone hyperacetylation in embryonic stem cells. Mol. Cell Biol. 2004;24:8862–8871. doi: 10.1128/MCB.24.20.8862-8871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rasmussen K.D., Helin K. Role of TET enzymes in DNA methylation, development, and cancer. Genes Dev. 2016;30:733–750. doi: 10.1101/gad.276568.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Brown D.A., Di Cerbo V., Feldmann A., Ahn J., Ito S., Blackledge N.P., Nakayama M., McClellan M., Dimitrova E., Turberfield A.H., et al. The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin. Cell Rep. 2017;20:2313–2327. doi: 10.1016/j.celrep.2017.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Upadhyay A.K., Horton J.R., Zhang X., Cheng X. Coordinated methyl-lysine erasure: structural and functional linkage of a Jumonji demethylase domain and a reader domain. Curr. Opin. Struct. Biol. 2011;21:750–760. doi: 10.1016/j.sbi.2011.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Trerotola M., Relli V., Simeone P., Alberti S. Epigenetic inheritance and the missing heritability. Hum. Genom. 2015;9:17. doi: 10.1186/s40246-015-0041-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Bannister A.J., Kouzarides T. Regulation of chromatin by histone modifications. Cell Res. 2011;21:381–395. doi: 10.1038/cr.2011.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ren W., Fan H., Grimm S.A., Guo Y., Kim J.J., Yin J., Li L., Petell C.J., Tan X.F., Zhang Z.M., et al. Direct readout of heterochromatic H3K9me3 regulates DNMT1-mediated maintenance DNA methylation. Proc. Natl. Acad. Sci. USA. 2020;117:18439–18447. doi: 10.1073/pnas.2009316117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Laird C.D., Pleasant N.D., Clark A.D., Sneeden J.L., Hassan K.M.A., Manley N.C., Vary J.C., Jr., Morgan T., Hansen R.S., Stöger R. Hairpin-bisulfite PCR: assessing epigenetic methylation patterns on complementary strands of individual DNA molecules. Proc. Natl. Acad. Sci. USA. 2004;101:204–209. doi: 10.1073/pnas.2536758100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Riggs A.D., Xiong Z. Methylation and epigenetic fidelity. Proc. Natl. Acad. Sci. USA. 2004;101:4–5. doi: 10.1073/pnas.2536758100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gaidukov L., Wroblewska L., Teague B., Nelson T., Zhang X., Liu Y., Jagtap K., Mamo S., Tseng W.A., Lowe A., et al. A multi-landing pad DNA integration platform for mammalian cell engineering. Nucleic Acids Res. 2018;46:4072–4086. doi: 10.1093/nar/gky216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ito S., Shen L., Dai Q., Wu S.C., Collins L.B., Swenberg J.A., He C., Zhang Y. Tet proteins can convert 5-methylcytosine to 5-formylcytosine and 5-carboxylcytosine. Science. 2011;333:1300–1303. doi: 10.1126/science.1210597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Köhler A.R., Haußer J., Harsch A., Bernhardt S., Häußermann L., Brenner L.-M., Lungu C., Olayioye M.A., Bashtrykov P., Jeltsch A. Modular dual-color BiAD sensors for locus-specific readout of epigenome modifications in single cells. Cell Rep. Methods. 2024;4 doi: 10.1016/j.crmeth.2024.100739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Nan X., Ng H.H., Johnson C.A., Laherty C.D., Turner B.M., Eisenman R.N., Bird A. Transcriptional repression by the methyl-CpG-binding protein MeCP2 involves a histone deacetylase complex. Nature. 1998;393:386–389. doi: 10.1038/30764. [DOI] [PubMed] [Google Scholar]
- 57.Mattei A.L., Bailly N., Meissner A. DNA methylation: a historical perspective. Trends Genet. 2022;38:676–707. doi: 10.1016/j.tig.2022.03.010. [DOI] [PubMed] [Google Scholar]
- 58.Hervouet E., Peixoto P., Delage-Mourroux R., Boyer-Guittaut M., Cartron P.-F. Specific or not specific recruitment of DNMTs for DNA methylation, an epigenetic dilemma. Clin. Epigenet. 2018;10 doi: 10.1186/s13148-018-0450-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lopez-Serra L., Esteller M. Proteins that bind methylated DNA and human cancer: reading the wrong words. Br. J. Cancer. 2008;98:1881–1885. doi: 10.1038/sj.bjc.6604374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Deng S., Zhang J., Su J., Zuo Z., Zeng L., Liu K., Zheng Y., Huang X., Bai R., Zhuang L., et al. RNA m6A Regulates Transcription via DNA Demethylation and Chromatin Accessibility. Nat. Genet. 2022;54:1427–1437. doi: 10.1038/s41588-022-01173-1. [DOI] [PubMed] [Google Scholar]
- 61.Wei J., He C. Chromatin and Transcriptional Regulation by Reversible RNA Methylation. Curr. Opin. Cell Biol. 2021;70:109–115. doi: 10.1016/j.ceb.2020.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Zhuang A., Gu X., Ge T., Wang S., Ge S., Chai P., Jia R., Fan X. Targeting Histone Deacetylase Suppresses Tumor Growth Through Eliciting METTL14-Modified m6A RNA Methylation in Ocular Melanoma. Cancer Commun. 2023;43:1185–1206. doi: 10.1002/cac2.12471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Yao Z., van Velthoven C.T.J., Nguyen T.N., Goldy J., Sedeno-Cortes A.E., Baftizadeh F., Bertagnolli D., Casper T., Chiang M., Crichton K., et al. A taxonomy of transcriptomic cell types across the isocortex and hippocampal formation. Cell. 2021;184:3222–3241.e26. doi: 10.1016/j.cell.2021.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Liu H., Zhou J., Tian W., Luo C., Bartlett A., Aldridge A., Lucero J., Osteen J.K., Nery J.R., Chen H., et al. DNA methylation atlas of the mouse brain at single-cell resolution. Nature. 2021;598:120–128. doi: 10.1038/s41586-020-03182-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Kulis M., Heath S., Bibikova M., Queirós A.C., Navarro A., Clot G., Martínez-Trillos A., Castellano G., Brun-Heath I., Pinyol M., et al. Epigenomic analysis detects widespread gene-body DNA hypomethylation in chronic lymphocytic leukemia. Nat. Genet. 2012;44:1236–1242. doi: 10.1038/ng.2443. [DOI] [PubMed] [Google Scholar]
- 66.Baubec T., Ivánek R., Lienert F., Schübeler D. Methylation-dependent and -independent genomic targeting principles of the MBD protein family. Cell. 2013;153:480–492. doi: 10.1016/j.cell.2013.03.011. [DOI] [PubMed] [Google Scholar]
- 67.Lim Y.C., Li J., Ni Y., Liang Q., Zhang J., Yeo G.S.H., Lyu J., Jin S., Ding C. A complex association between DNA methylation and gene expression in human placenta at first and third trimesters. PLoS One. 2017;12 doi: 10.1371/journal.pone.0181155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Saunderson E.A., Encabo H.H., Devis J., Rouault-Pierre K., Piganeau M., Bell C.G., Gribben J.G., Bonnet D., Ficz G. CRISPR/dCas9 DNA methylation editing is heritable during human hematopoiesis and shapes immune progeny. Proc. Natl. Acad. Sci. USA. 2023;120 doi: 10.1073/pnas.2300224120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Jin C., Lu Y., Jelinek J., Liang S., Estecio M.R.H., Barton M.C., Issa J.-P.J. TET1 is a maintenance DNA demethylase that prevents methylation spreading in differentiated cells. Nucleic Acids Res. 2014;42:6956–6971. doi: 10.1093/nar/gku372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Jusiak B., Jagtap K., Gaidukov L., Duportet X., Bandara K., Chu J., Zhang L., Weiss R., Lu T.K. Comparison of Integrases Identifies Bxb1-GA Mutant as the Most Efficient Site-Specific Integrase System in Mammalian Cells. ACS Synth. Biol. 2019;8:16–24. doi: 10.1021/acssynbio.8b00089. [DOI] [PubMed] [Google Scholar]
- 71.Rincón-Arano H., Furlan-Magaril M., Recillas-Targa F. Protection against telomeric position effects by the chicken cHS4 β-globin insulator. Proc. Natl. Acad. Sci. USA. 2007;104:14044–14049. doi: 10.1073/pnas.0704999104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Uchida N., Hanawa H., Yamamoto M., Shimada T. The Chicken Hypersensitivity Site 4 Core Insulator Blocks Promoter Interference in Lentiviral Vectors. Hum. Gene Ther. Methods. 2013;24:117–124. doi: 10.1089/hgtb.2012.152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A., et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Mirdita M., Schütze K., Moriwaki Y., Heo L., Ovchinnikov S., Steinegger M. ColabFold: making protein folding accessible to all. Nat. Methods. 2022;19:679–682. doi: 10.1038/s41592-022-01488-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Bruno S., Williams R.J., Del Vecchio D. Epigenetic cell memory: The gene’s inner chromatin modification circuit. PLoS Comput. Biol. 2022;18 doi: 10.1371/journal.pcbi.1009961. e1009961–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Del Vecchio D., Murray R.M. Princeton University Press; 2014. Biomolecular Feedback Systems. [Google Scholar]
- 77.Huisinga K.L., Brower-Toland B., Elgin S.C.R. The contradictory definitions of heterochromatin: transcription and silencing. Chromosoma. 2006;115:110–122. doi: 10.1007/s00412-006-0052-x. [DOI] [PubMed] [Google Scholar]
- 78.Bruno S., Williams R.J., Del Vecchio D. Model reduction and stochastic analysis of the histone modification circuit. 2022 European Control Conference (ECC) 2022;2022:264–271. doi: 10.23919/ECC55457.2022.9838047. [DOI] [Google Scholar]
- 79.Felsenfeld G., Groudine M. Controlling the double helix. Nature. 2003;421:448–453. doi: 10.1038/nature01411. [DOI] [PubMed] [Google Scholar]
- 80.Friedman N., Cai L., Xie X.S. Linking Stochastic Dynamics to Population Distribution: An Analytical Framework of Gene Expression. Phys. Rev. Lett. 2006;97:168302. doi: 10.1103/PhysRevLett.97.168302. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Code for mathematical modeling is available at https://zenodo.org/records/15345727 (Zenodo, DOI: 15345727) and GitHub at https://github.com/simonbruno100/AnalogEpigeneticMemoryPaper2025/tree/v1.0. Targeted bisulfite sequencing data are available online (NCBI SRA: PRJNA1271686). All original data are available from the lead contact upon reasonable request.






