Skip to main content
eLife logoLink to eLife
. 2022 May 23;11:e66909. doi: 10.7554/eLife.66909

Dnmt3a knockout in excitatory neurons impairs postnatal synapse maturation and increases the repressive histone modification H3K27me3

Junhao Li 1,, Antonio Pinto-Duarte 2,†,, Mark Zander 3,§, Michael S Cuoco 4, Chi-Yu Lai 2, Julia Osteen 2, Linjing Fang 5,#, Chongyuan Luo 3,6,, Jacinta D Lucero 2, Rosa Gomez-Castanon 3, Joseph R Nery 3,6, Isai Silva-Garcia 2, Yan Pang 2, Terrence J Sejnowski 2, Susan B Powell 7, Joseph R Ecker 3,, Eran A Mukamel 1,, M Margarita Behrens 2,7,
Editors: Anne E West8, Lu Chen9
PMCID: PMC9170249  PMID: 35604009

Abstract

Two epigenetic pathways of transcriptional repression, DNA methylation and polycomb repressive complex 2 (PRC2), are known to regulate neuronal development and function. However, their respective contributions to brain maturation are unknown. We found that conditional loss of the de novo DNA methyltransferase Dnmt3a in mouse excitatory neurons altered expression of synapse-related genes, stunted synapse maturation, and impaired working memory and social interest. At the genomic level, loss of Dnmt3a abolished postnatal accumulation of CG and non-CG DNA methylation, leaving adult neurons with an unmethylated, fetal-like epigenomic pattern at ~222,000 genomic regions. The PRC2-associated histone modification, H3K27me3, increased at many of these sites. Our data support a dynamic interaction between two fundamental modes of epigenetic repression during postnatal maturation of excitatory neurons, which together confer robustness on neuronal regulation.

Research organism: Mouse

Introduction

Epigenetic modifications of DNA and chromatin-associated histone proteins establish and maintain the unique patterns of gene expression in maturing and adult neurons (Kundakovic and Champagne, 2015). Neuron development requires the reconfiguration of epigenetic modifications, including methylation of genomic cytosine (DNA methylation, or mC) (Guo et al., 2014; Lister et al., 2013; Stroud et al., 2017) as well as covalent histone modifications associated with active or repressed gene transcription (Fagiolini et al., 2009; Putignano et al., 2007). While mC primarily occurs at CG dinucleotides (mCG) in mammalian tissues, neurons also accumulate a substantial amount of non-CG methylation (mCH) during postnatal brain development in the first 2–3 weeks of life in mice and the first two decades in humans (Lister et al., 2013). Accumulation of mCH, and the gain of mCG at specific sites, depend on the activity of the de novo DNA methyltransferase DNMT3A (Gabel et al., 2015). In mice, the abundance of Dnmt3a mRNA and protein peaks during the second postnatal week (Lister et al., 2013; Stroud et al., 2017), a time of intense synaptogenesis and neuronal maturation. Despite evidence for a unique role of Dnmt3a and mCH in epigenetic regulation of developing neurons, the long-term consequences of Dnmt3a-mediated methylation on brain function remain largely unknown (Stroud et al., 2017).

One challenge in investigating the developmental role of Dnmt3a has been the lack of adequate animal models. Deleting Dnmt3a around embryonic day 7.5 (E7.5) driven by the Nestin promoter dramatically impaired neuromuscular and cognitive development and led to early death (Nguyen et al., 2007). This early loss of Dnmt3a specifically affects the expression of long genes with high levels of gene body mCA (Boxer et al., 2020; Kinde et al., 2016). By contrast, deletion of Dnmt3a starting around postnatal day 14 driven by the Camk2a promoter caused few behavioral or electrophysiological phenotypes (Feng et al., 2010), with only subtle alterations in learning and memory depending on genetic background (Morris et al., 2014). These results suggest that Dnmt3a may play a critical role during a specific time window between late gestation and early postnatal life. During these developmental stages, regulated gains and losses of DNA methylation throughout the genome establish unique epigenomic signatures of neuronal cell types (Qian et al., 2017; Luo et al., 2017; Mo et al., 2015).

To address the role of Dnmt3a-dependent epigenetic regulation in the functional maturation of cortical excitatory neurons, we created a mouse line using the Neurod6 promoter (Schwab et al., 2000) (Nex-Cre) to delete exon 19 of Dnmt3a (Okano et al., 1999). In this conditional knockout (cKO), Dnmt3a is functionally ablated in excitatory neurons in the neocortex and hippocampus starting in mid-to-late gestation (embryonic day E13–15) (Goebbels et al., 2006). In Dnmt3a cKO animals, DNA methylation was substantially disrupted in excitatory neurons, leading to altered behavior and synaptic physiology without early life lethality or overt brain morphological alterations. We generated deep DNA methylome, transcriptome, and histone modification data in Dnmt3a cKO and control pyramidal cells of the frontal cortex (Supplementary file 1), enabling a detailed assessment of the molecular basis of neurophysiological and behavioral phenotypes. We found that the polycomb repressive complex 2 (PRC2)-associated chromatin modification, H3K27me3, increases during postnatal development following the loss of mCH DNA methylation in Dnmt3a cKO neurons primarily at sites where mCG is also depleted. Our data support a dynamic interaction between two fundamental modes of epigenetic repression in developing brain cells.

Results

Dnmt3a cKO in pyramidal neurons during mid-gestation specifically impairs working memory, social interest, and acoustic startle

Previous studies of Dnmt3a KO mice yielded results ranging from little cognitive or health effects (Feng et al., 2010; Morris et al., 2014) to profound impairment and lethality (Dura et al., 2022; Nguyen et al., 2007). These studies suggest that the developmental timing and cell type of Dnmt3a loss may determine the extent of subsequent phenotypes. Here, we took a targeted approach by functionally ablating Dnmt3a in cortical pyramidal cells starting during mid-gestation (Figure 1A). We took advantage of the developmental onset of Neurod6 expression between embryonic day E11 and E13, after the onset of Nestin expression (Thompson et al., 2014) but well before the major postnatal wave of reprogramming of the neuronal DNA methylome (Figure 1—figure supplement 1; Lister et al., 2013). We confirmed that Neurod6-dependent Cre recombination occurred only in excitatory neurons using the INTACT (isolation of nuclei tagged in specific cell types) (Mo et al., 2015) mouse (Figure 1—figure supplement 2). The deletion of Dnmt3a exon 19 was faithfully captured in cortical excitatory neurons of cKO animals (Figure 1B and Figure 1—figure supplement 3A), which was known to produce a deletion of 50 amino acids in the methyltransferase domain of the DNMT3A protein (Lyko, 2018). We also verified the reduction in DNMT3A protein in whole tissue extracts at early postnatal time points (P5 and P13) when Dnmt3a mediated accumulation of mCH normally begins in the frontal cortex (Lister et al., 2013; Figure 1—figure supplement 3B). Dnmt3a cKO animals survived and bred normally, without overt morphological alterations in the brain (Figure 1—figure supplement 3C). We found no impairments in gross motor function in an open field test: cKO mice traveled more during the first 5 min of testing (Figure 1—figure supplement 4A) while performing fewer rearings associated with exploratory interest (Figure 1—figure supplement 4B). Moreover, cKO mice had no signs of increased anxiety-like behavior on three separate behavioral tests (Figure 1—figure supplement 4C-E), in contrast with the reported anxiogenic effects of Dnmt3a knockdown in the mPFC of adult mice (Elliott et al., 2016). The absence of major impairments in overall health, motor function, or anxiety-like behavior established a baseline for investigating the role of Dnmt3a in specific cognitive and social behaviors.

Figure 1. Dnmt3a conditional knockout (cKO) in cortical pyramidal neurons during mid-gestation impaired working memory, social interest, and acoustic startle responses.

(A) An experimental model of the conditional loss of Dnmt3a in excitatory neurons. P0 and P39, postnatal day 0 and 39. FANS, fluorescence-activated nuclei sorting. (B) RNA-seq confirmation of the deletion of Dnmt3a exon 19 in P39 excitatory neurons. RPKM, reads per kilobase per million. R1/2, replicate 1/2. *, t-test p=0.014. (C) Dnmt3a cKO mice made fewer spontaneous alternations in the Y-maze test of working memory (Wilcoxon test, **, p=0.0079; *, p=0.011; n=15 male control, 15 male cKO, 11 female control, 10 female cKO). (D) Male Dnmt3a cKO mice spent less time interacting with an unfamiliar mouse, indicating reduced social interest (Wilcoxon test; *, p=0.01048; **p=0.006833; n=14 male control, 15 male cKO, 11 female control, 10 female cKO). (E) Male Dnmt3a cKO mice had decreased startle response to a 120 dB acoustic pulse (Wilcoxon test, **, p=0.0019; n.s., not significant; n=14 male control, 15 male cKO, 11 female control, 10 female cKO).

Figure 1.

Figure 1—figure supplement 1. Neurod6 starts to express between embryonic day E11 and E13.

Figure 1—figure supplement 1.

(A) Expression quantification of in situ hybridization data of gene Nestin and Neurod6 in telencephalic vesicle from the Allen Developing Brain Atlas (http://developingmouse.brain-map.org/). Left panel, heatmap of the gene expression across ages during development. Right panel, example images of the gene expression in E11.5. Image credit: Allen Institute. E11.5–18.5, embryonic days; P4–28, postnatal days. (B) Expression of Neurod6 at different developmental time points (embryonic days 11.5, 13.5, and 15.5, and postnatal days 4, 14, 28, and 56). Images were taken from the Allen Developing Brain Atlas.
Figure 1—figure supplement 2. Neurod6-dependent Cre recombination occurred only in excitatory neurons.

Figure 1—figure supplement 2.

(A–C) Cell-type-specific expression of nuclear membrane tag (Sun1-sfGFP-myc) in non-inhibitory NEUROD6+ neurons in mouse brain. (A) Overview (10× magnification) of a sagittal section stained with anti-GFP antibody. Note that the Sun1-sfGFP nuclear tag expression is region-specific, as described for the characterization of the NexCre mouse line (Goebbels et al., 2006). The numbered insets in (A) are enlarged and shown in (B), as example regions where the nuclear membrane tag is expressed (mPFC and CA1 hippocampal region) and where it is not (caudate putamen and thalamus). (C) Confocal images (60× magnification, maximum intensity projection) of mouse brain slices (40 µm thick) triple-labeled with antibodies anti-GFP (Sun1-sfGFP-tag, green channel), anti-GAD67 (inhibitory cells marker, red channel), and anti-NeuN (neuronal marker, gray). Note that Sun1-sfGFP-tag is only expressed in neurons that are not inhibitory. (D) Expression of pan-neuronal, excitatory, and inhibitory neuron marker genes in our RNA-seq data. TPM, transcripts per million.
Figure 1—figure supplement 3. Dnmt3a was disrupted on both the mRNA and protein levels in the Dnmt3a conditional knockout (cKO) excitatory neurons.

Figure 1—figure supplement 3.

(A) Genome browser tracks of mRNA-seq data show confirmation of the deletion of Dnmt3a exon 19 in P39 Dnmt3a cKO excitatory neurons. The targeted exon region is highlighted in the light blue shaded box with an asterisk. R1/2, replicate 1/2. (B) The protein product of the Dnmt3a gene is disrupted in the cKO sample. Top panel, Western blot; Bottom panel, quantification of the protein abundance. P5 and P13, postnatal days 5 and 13. **, t-test p=0.0017. (C) Nissl-stained slices show no morphological alterations in the brain of the Dnmt3a cKO animals. mPFC, medial prefrontal cortex; CPu, caudate putamen.
Figure 1—figure supplement 4. The conditional ablation of Dnmt3a in pyramidal neurons did not significantly impair motor activity nor increased anxiety levels.

Figure 1—figure supplement 4.

(A) and (D) Dnmt3a conditional knockout (cKO) mice displayed normal behavior in the open field test, traveling a similar distance (A) as control mice, and also showing a similar degree of center activity (D) (B) The exploratory activity was slightly decreased in Dnmt3a cKO animals, as suggested by an attenuated rearing behavior. (C) The time spent in light in the dark-light transfer test was not significantly affected by the lack of Dnmt3a. (E) The female, but not the male, cohort of Dnmt3a cKO mice spent significantly more time than control mice in the open arms of the elevated plus maze, consistent with lower anxiety levels (Wilcoxon test, **, p=0.0048; n.s., not significant.). In the line plots (A–B), data were presented as mean ± SEM In all boxplots (C–E), the middle horizontal bar represents the median; the lower and upper hinges correspond to the first and third quartiles, and the whisker extends from the hinge to the value no further than 1.5 * IQR from the hinge, where IQR is the interquartile range. The values of individual experiments are represented by dots superimposed on the boxplots. Wilcoxon test significance: *, p<0.05; **, p<0.01; n.s., not significant.
Figure 1—figure supplement 5. Dnmt3a conditional knockout (cKO)-induced impairment of startle response was accompanied by increased prepulse inhibition (PPI), and the cKO did not affect fear memory.

Figure 1—figure supplement 5.

(A) The increased PPI accompanied the impairment in startle responses to a 120 dB tone played at three time points during the recording session (HAB1 – beginning of the session; HAB2 – middle of the session; HAB3 – end of the session) (Wilcoxon test p=0.0027, 0.0019, and 0.0035 in male HAB1, HAB2, HAB3, respectively, and not significant in female). The habituation to the 120 dB auditory tone (i.e. the relative reduction in startle response throughout the experiment) was not significantly different between genotypes. (B) The percentage of PPI at prepulse intensity of 69, 73, and 81 dB (4, 8, and 16 dB above the 65 dB background, respectively) was increased in male, but not female mice (Wilcoxon test, ***, p=0.00076; *, p=0.016; n.s, not significant). (C–F) Fear learning and extinction were tested over 4 consecutive days. N=14–15 per group. (C) Fear acquisition to three tone-shock pairings occurred on day 1; (D) contextual fear in relation to the acquisition context (8 min, Block = 2 min) was measured on day 2; (E) cued fear recall and extinction training occurred on day 3 (Block = 4 tone trials); and (F) extinction recall (Block = 4 tone trials) occurred on day 4. Wilcoxon test reported no significant changes between the Dnmt3a cKO and control. In boxplots (A), the middle horizontal bar represents the median; the lower and upper hinges correspond to the first and third quartiles, and the whisker extends from the hinge to the value no further than 1.5 * IQR from the hinge, where IQR is the interquartile range. The values of individual experiments are represented by dots superimposed on the boxplots. In the line plots (B–F), data were presented as mean ± SEM.

We focused on cognitive domains associated with neurodevelopmental illness, including working memory and sensorimotor gating (Habib et al., 2019) and social interest (Dodell-Feder et al., 2015). Dnmt3a cKO mice did not alternate spontaneously between the arms of a Y-maze (p=0.0079 for males, p=0.011 for females, Figure 1C), indicating impaired spatial working memory. Moreover, when tested in a three-chamber box in which one of the sides contained a novel mouse and the opposite a novel object, male Dnmt3a cKO animals spent less time in the former, opting, instead, to remain significantly longer in the center (empty compartment), which is suggestive of reduced exploration and social interest (Figure 1D, left panel; p=0.01048). Male Dnmt3a cKO mice also had significantly attenuated acoustic startle reflex (p=0.0019, Figure 1E and Figure 1—figure supplement 5A). We observed increased prepulse inhibition (PPI) in male Dnmt3a cKO mice, but this may be driven by the reduced startle reflex (Figure 1—figure supplement 5B, p<0.05). It is noteworthy that the observed deficits in startle response were not due to impaired hearing, since Dnmt3a cKO mice displayed intact hearing in tests of PPI and fear conditioning.

To test whether these deficits in specific neurocognitive domains reflect generalized impairment in brain function, we assessed long-term memory using a fear conditioning paradigm. There were no significant differences between Dnmt3a cKO and control male mice in acquisition or recall of fear memory following re-exposure to the context or conditioned stimulus after 24–48 hr (Figure 1—figure supplement 5C-E), or in extinction (Figure 1—figure supplement 5F). Altogether, these behavioral results indicate that Dnmt3a cKO in excitatory neurons impairs working memory, social interest, and acoustic startle, without generalized cognitive disruption.

Loss of Dnmt3a impairs synapse maturation and attenuates neuronal excitability

To test the impact of Dnmt3a cKO on dendritic morphology, we quantified the number and structure of 1278 DiI-labeled dendritic spines (NexCre/C57: n=701 from 5 mice; cKO: n=577 from 4 mice) of layer 2 pyramidal neurons of the mouse prelimbic region (~2 mm anterior to Bregma) (Figure 2A; Materials and methods), a region critical for working memory (Yang et al., 2014) and social approach behavior (Lee et al., 2016). While the overall density of dendritic spines was equivalent in control and Dnmt3a cKO neurons (Figure 2B), the spines were significantly longer (mean length 2.219±0.052 µm in cKO, 1.852±0.034 µm in control) and narrower (mean width 0.453±0.008 µm in cKO, 0.519±0.008 µm in control) in Dnmt3a cKO neurons (Figure 2C, KS test p<0.001; Figure 2—figure supplement 1). Consistent with this, a larger proportion of spines in Dnmt3a cKO mice were classified as immature filopodia, and fewer were mushroom-shaped mature spines (Figure 2D) according to pre-established morphometric criteria (see Materials and methods). The proportion of spines with other morphologies, including branched spines with more than one neck (data not shown), was not significantly different between genotypes (Figure 2D). These data indicate a role for Dnmt3a in dendritic spine maturation.

Figure 2. Immature spine morphology and reduced excitability of layer 2 excitatory neurons following Dnmt3a conditional knockout (cKO).

(A) Example dendritic segments of layer 2 pyramidal neurons in the prelimbic region labeled with DiI and visualized using a 63× objective coupled to an Airyscan confocal microscope. Arrowheads show filopodia, which were more abundant in Dnmt3a cKO mice. (B) The density of membrane protrusions was unchanged in the Dnmt3a cKO (Wilcoxon test, n.s., not significant). (C) Membrane protrusions were significantly longer and narrower in the Dnmt3a cKO (KS test, p<0.001). (D) More spines were classified as immature filopodia, and fewer as mature mushroom-shaped spines with large postsynaptic densities in the Dnmt3a cKO (Wilcoxon test, **, p=0.0015; *, p=0.046 and 0.011 for thin and mushroom, respectively). (E) Example whole-cell patch-clamp recordings from prelimbic layer 2 pyramidal neurons following 60 pA current injections. (F) The median rheobase (i.e. the minimal current necessary to elicit an action potential) was significantly higher in the Dnmt3a cKO (t-test, **, p=0.0042). (G) Action potential frequency vs. injected current (mean ± SEM) showed reduced excitability in Dnmt3a cKO (Wilcoxon test, *, p<0.05). (H) and (I) Dnmt3a cKO neurons were slightly hyperpolarized at Vrest when compared to control (Wilcoxon test, *, p=0.049) and had lower membrane resistance (Wilcoxon test, *, p=0.023).

Figure 2.

Figure 2—figure supplement 1. The membrane protrusions in the Dnmt3a conditional knockout (cKO) showed longer dendritic spines and narrower heads.

Figure 2—figure supplement 1.

Related to Figure 2C. Here, each line/box represents data from one dendrite fragment. (A) and (C) The cumulative distribution of the length (A) and width (C) of membrane protrusions in the Dnmt3a cKO (orange), as compared to control mice (black). (B) and (D) Distribution of the length (B) and width (D) of the protrusions. The shaded colors in the box represent the individual animals from which the dendrite fragment originated (5 control mice and 4 cKO mice). The middle horizontal bar represents the median; the lower and upper hinges correspond to the first and third quartiles, and the whisker extends from the hinge to the value no further than 1.5 * IQR from the hinge, where IQR is the interquartile range. The Wilcoxon test was done with the medians of the boxes (16 control vs. 15 cKO).
Figure 2—figure supplement 2. The conditional ablation of Dnmt3a in pyramidal neurons did not significantly alter the membrane potential threshold for action potential generation, the mean amplitude, or frequency of miniature excitatory postsynaptic events.

Figure 2—figure supplement 2.

(A) Action potentials were initiated at a similar membrane potential in both genotypes (Wilcoxon test, n.s., not significant). (B) While miniature excitatory postsynaptic currents (mEPSCs) amplitude was not significantly different between genotypes (Wilcoxon test, n.s., not significant), it was slightly, yet significantly, more variable in the Dnmt3a conditional knockout (cKO) (F-test, p=0.0032). (C) mEPSCs frequency was not significantly changed. In all boxplots, the middle horizontal bar represents the median; the lower and upper hinges correspond to the first and third quartiles, and the whisker extends from the hinge to the value no further than 1.5 * IQR from the hinge, where IQR is the interquartile range. The values of individual experiments are represented by dots superimposed on the boxplots.

To test how the loss of Dnmt3a and subsequent stunting of spine maturation affect intrinsic neuronal excitability and synapse sensitivity, we next performed patch-clamp experiments in visually identified layer 2 pyramidal neurons from the prelimbic region (Figure 2E). Whole-cell current-clamp recordings showed that Dnmt3a cKO neurons (n=22 cells from 11 mice) required greater current injections than control (n=17 cells from 12 mice) to trigger an action potential (higher rheobase, t-test p=0.0042, Figure 2F), though there was no difference in membrane potential at the firing threshold (Figure 2—figure supplement 2A). Dnmt3a cKO neurons also produced fewer spikes in response to injected current (Figure 2G). These neurons were slightly hyperpolarized at rest (mean –70.90±0.8 mV in cKO, n=22 cells from 11 mice vs. –67.22±1.4 mV in control, n=17 cells from 12 mice, Wilcoxon test p=0.049, Figure 2H), which could reflect differential expression of ion channels at the plasma membrane. Consistent with this, Dnmt3a cKO neurons had a lower input resistance (Wilcoxon test p=0.023, Figure 2I), suggesting increased expression of functional transmembrane ion channels. Whole-cell voltage-clamp recordings of miniature excitatory postsynaptic currents (mEPSCs) showed slight, yet significant, increased amplitude variability in Dnmt3a cKO mice (Figure 2—figure supplement 2, 8.77±0.32 pA in cKO, n=8 cells from 4 mice vs. 8.67±0.089 pA in control, n=8 cells from 5 mice, F-test, p=0.0032), consistent with disruption at postsynaptic sites. However, we found no alteration in the mean amplitude (Figure 2—figure supplement 2B) or frequency (Figure 2—figure supplement 2C) of mEPSCs recorded at the soma.

Altered gene expression in Dnmt3a cKO excitatory neurons

To investigate the impact of epigenetic disruption on gene expression, we compared the transcriptomes of cKO and control excitatory neuron nuclei in mature mice (postnatal day 39) (Supplementary file 2). We isolated nuclei from excitatory neurons in frontal cortex by backcrossing the Dnmt3a cKO animals into the INTACT mouse (Mo et al., 2015) on a C57BL/6J background, followed by fluorescence-activated nuclei sorting (FANS) and RNA-seq. Although sorted nuclei contain only a subset of the cell’s total mRNA and are enriched in immature transcripts, nuclear RNA-seq is nevertheless a quantitatively accurate assay of gene expression that is robust with respect to neural activity-induced transcription (Bakken et al., 2018; Lacar et al., 2016). All of the molecular assays were applied to at least two independent biological samples, with each sample being derived from a pool of tissue from 1 or 2 mice (Supplementary file 1). Nuclear RNA abundance was highly consistent across independent replicates within the same group (Spearman correlation r=0.93–0.94, Figure 3—figure supplement 1A). Given the repressive role of DNA methylation (mCG and mCH) in regulating gene expression in neurons (Guo et al., 2014; Lister et al., 2013), we expected to find increased gene expression in the cKO. Consistent with this, we detected 46 differentially expressed (DE) genes with higher expression in the cKO (false discovery rate [FDR] < 0.05, Figure 3A–B and Figure 3—figure supplement 1B, Supplementary file 2). We also detected significantly lower expression of 24 genes in cKO neurons (Figure 3A). Several of the DE genes had annotated roles in dendrite morphogenesis (Elavl4, Hecw2, Ptprd), as well as in the regulation of Na+ (Hecw2, Scn3b) and Ca2+ levels (Cacnb3) (Figure 3B). Early life experiences were shown to alter the expression of Dnmt3a and hence the DNA methylation levels in some transposable elements (TEs) in the hippocampus (Bedrosian et al., 2018). We examined the expression of transposon families or classes in our data but found no significant changes between cKO and control in the cortical excitatory neurons (two-sample t-test, FDR > 0.05; Figure 3—figure supplement 2).

Figure 3. Loss of Dnmt3a leaves thousands of genomic regions in a fetal-like demethylated state.

(A) 70 genes were differentially expressed (DE) (false discovery rate [FDR] < 0.05) in P39 pyramidal neurons in Dnmt3a conditional knockout (cKO) vs. control. Top, fold-change (FC) between cKO and control; bottom, heatmap showing normalized expression of the DE genes in each sample. Z-scores were computed using mRNA counts per million (CPM) for each DE gene. (B) Differential gene expression in control vs. Dnmt3a cKO excitatory neurons at P39. Significant up-regulated and down-regulated DE genes are shown in red and blue, respectively. Differentially expressed (DE) genes associated with dendrite morphogenesis (Elavl4, Hecw2, Ptprd), and regulation of Na+ (Hecw2, Scn3b) and Ca2+ levels (Cacnb3) are labeled. (C) Non-CG DNA methylation (mCH) is eliminated, and mCG is reduced, in P39 Dnmt3a cKO pyramidal cells, while mCG and mCH levels are not changed in P0 (t-test: *, p<0.05; n.s., not significant). P0 and P39, postnatal days 0 and 39, respectively. Each bar represents the methylation level in one replicate. (D) Non-CG DNA methylation (mCH) in P39 pyramidal cells in control samples, in 1 kb bins in the flanking region around the transcription start (TSS) and end site (TES) of DE genes and non-DE genes with matched expression levels. The lines denote the means across genes in each gene set, and the shared areas represent the 95% confidence intervals of the means. (E) The difference in gene body methylation vs. fold-change of gene expression between P39 Dnmt3a cKO and control. The plots show mean ± SEM gene expression fold-change for genes in 10 non-overlapping bins (deciles of mC difference). (F) The Nedd4 promoter locus contains five differentially methylated regions (DMRs, yellow horizontal rectangles and shaded in blue boxes) with naive, fetal-like mCG in P39 Dnmt3a cKO. Ticks show mCG at CG sites. Four out of the five P39 Dnmt3a cKO DMRs overlapping developmental gain-of-methylation DMRs (red horizontal rectangles) are marked with arrows. CGI, CpG island. R1 and R2, replicates 1 and 2. (G) Overlap of P39 Dnmt3a cKO DMRs and developmental DMRs. (H) P39 Dnmt3a cKO hypo-DMRs are significantly enriched (depleted) in DMRs that normally gain (lose) methylation during development (Fisher’s test, p<0.05).

Figure 3.

Figure 3—figure supplement 1. RNA-seq data showed transcriptomic disruption in P39 Dnmt3a conditional knockout (cKO) pyramidal neurons.

Figure 3—figure supplement 1.

(A) Correlation matrix of gene expression (log10TPM) in the biological replicates of the control and Dnmt3a cKO mouse excitatory neurons. The color bar represents the Spearman correlation coefficients. (B) Volcano plot shows the gene expression fold-change of P39 Dnmt3a cKO vs. control samples and their significance. Significant up-regulated and down-regulated differentially expressed genes (DE genes, false discovery rate [FDR] < 0.05) are colored in red and blue, respectively. DE genes associated with dendrite morphogenesis (Elavl4, Hecw2, Ptprd), and regulation of Na+ (Hecw2, Scn3b) and Ca2+ levels (Cacnb3) are labeled.
Figure 3—figure supplement 2. The expression of transposable elements (TEs) was not affected by Dnmt3a conditional knockout (cKO).

Figure 3—figure supplement 2.

TE subfamily expression was estimated in fragments per kilobase per million (FPKM) from P39 control and Dnmt3a cKO samples grouped by TE class (A) and TE family (B).
Figure 3—figure supplement 3. Genome-wide reduction of DNA methylation was observed in Dnmt3a conditional knockout (cKO).

Figure 3—figure supplement 3.

(A) Heatmap to show the Spearman correlations across the control and Dnmt3a cKO samples from P39 and P0 animals. The correlations coefficients were computed using CG methylation levels in 10 kb genomic bins (left) and CH methylation levels in 10 kb genomic bins (right). (B) The reduction of genome-wide DNA methylation level in P39 is observed in all three non-CG contexts (CA, CC, and CT). t-test significance, **, CA, p=0.009745; ***, CC, p=0.0009546; *, CT, p=0.01323. (C) DNA methylation at CG sites is reduced across most functional genomic compartments in P39. UTR, untranslated region. t-test significance, *, p<0.05; n.s., not significant. (D) Reduced mCG is strongly correlated with the reduction in mCH in P39 in 10 kb tiling genomic bins (259,718 bins with at least 10 reads covered in each sample). r, Spearman correlation coefficient.
Figure 3—figure supplement 4. Reduction of DNA methylation cannot fully explain the disruption in the transcriptome after Dnmt3a conditional knockout (cKO).

Figure 3—figure supplement 4.

(A) Correlation of gene expression and gene body mCH level for up-regulated genes (red), down-regulated genes (blue), and non-differentially expressed (DE) genes (black, false discovery rate [FDR] ≥ 0.05 and fold-change <1.1) in the P39 control samples. For each gene group, genes are stratified by their expression in the control sample by 15 bins, and the mean gene body mCH levels are plotted. The shaded ribbon areas indicate the standard error of the mean. TPM, transcripts per million. (B) Violin plots to show the expression levels of non-DE genes (gray) selected to match baseline expression as those in significantly (FDR < 0.05) up-regulated genes (left, red) and down-regulated genes (right, blue). (C) CG DNA methylation (mCG) in P39 pyramidal cells in 1 kb bins in the region around the transcription start (TSS) and end site (TES) of DE genes and non-DE genes with matched expression levels. The lines denote the means across genes in each gene set, and the shared areas represent the 95% confidence intervals of the means. (D) Density scatter plots show the relationship between changes of gene body methylation (delta mCG or mCH) and the gene expression fold-changes for expressed genes (14,754 genes) between P39 Dnmt3a cKO and control samples. The linear regression fits, p-values, and variances explained by ∆%mC (R2) are shown. (E) Gene length distribution of P39 DE genes (FDR < 0.1). As a comparison, non-DE genes were selected with FDR ≥ 0.1 and fold-change <1.1 (see Supplementary file 2). The down-regulated genes are generally shorter than the up-regulated genes or the non-DE genes. kb, kilobases. Wilcoxon test, ****, p<10–4; **, p<0.01; n.s. not significant.
Figure 3—figure supplement 5. Differentially methylated regions (DMRs) in P39 Dnmt3a conditional knockout (cKO) were associated with regulatory regions in enhancers and repressed chromatin.

Figure 3—figure supplement 5.

(A) 2D distribution of mCG levels in P39 control vs. Dnmt3a cKO samples at DMRs. DMR density is estimated through a Gaussian smoothed kernel. (B) The density of P39 Dnmt3a cKO hypo-DMRs around significant differentially expressed (DE) genes (false discovery rate [FDR] < 0.05) and non-DE genes with matched expression levels. The lines denote the means across genes in each gene set, and the shared areas represent the 95% confidence intervals of the means. TSS, transcription start site; TES, transcription end site. (C) Enrichment (red) or depletion (blue) of P39 cKO DMRs in GENCODE annotated gene features (top), in CpG island-related features (middle), and in the chromatin states map in mouse embryonic stem cell (bottom). All enrichments and depletions shown are significant (Fisher’s test p<0.05). (D) Number of known transcription factor binding motifs within P39 Dnmt3a cKO hypo-DMRs and their fold enrichment. Significant motifs (FDR < 0.05) are colored in red.

Dnmt3a cKO abolishes postnatal DNA methylation

Deletion of Dnmt3a during mid-gestation should disrupt the subsequent gain of DNA methylation at specific genomic sites during development (Qian et al., 2017; Lister et al., 2013), without affecting sites that maintain or lose methylation after E14.5. Using single base resolution, whole-genome MethylC-seq (Lister et al., 2008) in biological replicates with strong consistency (Figure 3—figure supplement 3A), we confirmed that non-CG DNA methylation (mCH) in excitatory neurons is absent at birth (<0.1% of all CH sites at P0), and accumulates by postnatal day 39 (1.98% at P39) (Lister et al., 2013). The cKO all but eliminated mCH (<0.1% at P0 and P39) (Figure 3C and Figure 3—figure supplement 3B). While mCH increases in neurons during postnatal life, the genome-wide level of mCG in the brain remains high throughout the lifespan (Lister et al., 2013). We found that the genome-wide mCG level was 12.5% lower in mature (P39) cKO neurons (60.1% in cKO vs. 72.6% in control). There was no difference in mCG in newborn mice (P0, 73.1% in both cKO and control, Figure 3C). mCG at P39 was reduced in 92.2% of all genomic bins (10 kb resolution), and was significantly lower in introns, 3’ UTR, and intergenic regions (Figure 3—figure supplement 3C). The reduction in mCG was strongly correlated with reduced mCH (Spearman correlation r=0.805, p<10–3, Figure 3—figure supplement 3D). These data support a role for Dnmt3a in postnatal de novo CG and CH DNA methylation across genomic compartments (Lister et al., 2013; Stroud et al., 2017).

Reduced DNA methylation does not fully explain altered transcription in Dnmt3a cKO

We investigated whether the altered gene expression in Dnmt3a cKO neurons correlated with loss of DNA methylation at specific sites. We first analyzed DNA methylation around DE genes in mature neurons (P39). The simple model of DNA methylation as a repressive regulator of gene expression predicts that genes that lose the most mC should be most transcriptionally up-regulated in the cKO. Consistent with this, we found that mCH was strongly enriched in the gene body of up-regulated genes in control neurons (Figure 3D and Figure 3—figure supplement 4A). By contrast, genes with similar expression levels in the control neurons which were not transcriptionally up-regulated in the cKO (Figure 3—figure supplement 4B) had significantly lower gene body mCH (Figure 3D). Moreover, down-regulated genes had low gene body mCH. These data support a causal role for gene body mCH in repressing gene expression. The relatively lower mCH level in down-regulated genes could make them less sensitive to the loss of Dnmt3a. The dysregulation of their expression may be due to secondary effects subsequent to the direct loss of DNA methylation.

We also examined the pattern of mCG at the promoter and gene body of DE genes. In contrast with the pattern of mCH, mCG was not significantly different between DE and control genes (Figure 3—figure supplement 4C).

The difference in gene body methylation (cKO – control) was negatively correlated with gene expression changes, consistent with repressive regulation (Gabel et al., 2015; Lavery et al., 2020; Figure 3E). This correlation accounted for 0.46% of the variance of differential gene expression, whereas the total explainable variance (R2 between biological replicates) was 1.30% (Figure 3—figure supplement 4D). The strength of the association between mCH and mRNA changes may be limited by the use of only two biological replicates in our dataset.

We next sought to determine if up- and down-regulated genes differ in ways that could explain their different responses to the loss of Dnmt3a. Up-regulated genes were on average longer than down-regulated genes (Figure 3—figure supplement 4E, Wilcoxon rank-sum test p<10–4) and non-DE genes (p<0.01), consistent with the reported enrichment of mCA and MeCP2-dependent gene repression in long genes (Boxer et al., 2020; Gabel et al., 2015; Kinde et al., 2016). However, there was a broad distribution of gene lengths for both up- and down-regulated genes (Lavery et al., 2020).

In addition to promoters and gene bodies, distal regulatory elements such as enhancers are major sites of dynamic DNA methylation where epigenetic regulation can activate or repress the expression of genes over long genomic distances through 3D chromatin interactions (Malik et al., 2014). We investigated gene regulatory elements by identifying differentially methylated regions (DMRs) where mCG is altered in cKO compared to control neurons. We found a limited number of DMRs in newborn mice (P0: 1087 DMRs with lower, 164 with higher mCG in cKO; ≥30 difference in %mCG; FDR < 0.01). In mature neurons (P39), by contrast, we found 222,006 DMRs with substantially lower mCG in cKO compared with controls (Figure 3—figure supplement 5A; Supplementary file 3). Only 89 DMRs had ≥30 higher %mCG in cKO. To illustrate, we found five DMRs in an ~40 kb region around the promoter of the DE gene Nedd4 (Figure 3F). Four of these DMRs were also unmethylated in excitatory neurons in newborn mice, showing that the loss of Dnmt3a blocked the developmental gain of mCG at these sites. However, the density of P39 cKO DMRs around the DE genes was not significantly different in the up- and down-regulated genes and the non-DE genes (Figure 3—figure supplement 5B).

The majority of P39 cKO DMRs (68.0%) were distal (≥10 kb) from the annotated transcription start sites. These DMRs were significantly enriched in both active enhancers and repressed chromatin, suggesting they have a regulatory role (Figure 3—figure supplement 5C; see also below). We found 113,557 developmental DMRs that gain mCG between P0 and P39 in control neurons (≥30 difference in %mCG, FDR < 0.01). These DMRs strongly overlapped (83.2%) with the cKO DMRs in mature neurons (Figure 3G–H, Supplementary file 3). Moreover, the P39 cKO DMRs were enriched in DNA sequence motifs of multiple transcription factors (TFs) associated with neuronal differentiation, including Rest, Lhx2, Pou3f2(Brn2), and Pax6 (FDR < 0.05, Figure 3—figure supplement 5D, Supplementary file 4). Notably, the DMRs in Dnmt3a cKO neurons represent only a part of the global reduction in mCG that we observed throughout the genome. Indeed, we found that mCG is reduced by ~10% in all genomic compartments, even after excluding P39 DMRs (Figure 3—figure supplement 3C). These results suggest that Dnmt3a is essential for the methylation and subsequent repression of neuronal enhancers that are active during prenatal brain development.

Increased PRC2-associated repressive histone modification H3K27me3 in Dnmt3a cKO

Given that the cKO loses DNA methylation throughout the genome, we were surprised that the expression level of many genes was not disrupted. We, therefore, explored other potential epigenetic regulators which could contribute to maintaining gene expression. To identify TFs and chromatin regulators with experimental evidence of binding at cis-regulatory regions of the DE genes, we performed Binding Analysis for Regulation of Transcription (BART) (Wang et al., 2018). Chromatin regulators associated with PRC2, including Ezh2, Suz12, Eed, and Jarid2, were among the top DNA binding proteins enriched near the promoters of both up- and down-regulated DE genes (Figure 4A). Several TFs associated with chromatin organization, including the histone deacetylase (Hdac) and demethylase (Kdm) families, and Ctcf, were also enriched (Figure 4—figure supplement 1, Supplementary file 5). These results suggest that Dnmt3a cKO impacts the chromatin landscape in excitatory neurons, potentially via altered PRC2 activity.

Figure 4. Polycomb repressive complex 2 (PRC2) associated histone modification H3K27me3 is up-regulated following the loss of DNA methylation.

(A) Transcription factors (TFs) predicted to regulate P39 Dnmt3a conditional knockout (cKO) differentially expressed genes include many proteins associated with PRC2. The functional TF rank score was assigned by Binding Analysis of Regulation of Transcription (BART; Wang et al., 2018). PRC2-associated TFs are labeled and highlighted in gold circles. (B) Browser view of the Mab21l2 locus, where increased H3K27me3 (differentially modified regions, bottom red bars and highlighted in blue shaded box) coincides with the loss of DNA methylation (Dnmt3a cKO DMRs, orange bars under the ‘P39 cKO’ track and highlighted in blue shaded box) in P39 Dnmt3a cKO. This region loses H3K27me3 during normal development in control pyramidal neurons (blue bars, P39 < E14). DMR, differentially methylated region; E14, embryonic day 14; P0 and P39, postnatal days 0 and 39. (C) Quantification of the increase in H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) signal in each replicate at the H3K27me3 differentially modified region between P39 control and cKO at the Mab21l2 locus shown in (B). Each bar shows the DEseq2 normalized counts in each replicate, and the triple asterisks denote a significant increase (false discovery rate [FDR] = 1.33e-4, fold-change=2.41). (D) Histone modification ChIP-seq peaks for active marks (H3K4me3, H3K27ac) are largely preserved in the Dnmt3a cKO, while repressive H3K27me3 peaks expand. The Venn diagrams denote numbers of peaks that overlap between cKO (yellow) and control (black) (numbers in the center), and numbers of peaks that are unique to one of the conditions (numbers on the edges).

Figure 4.

Figure 4—figure supplement 1. Increased signal of the repressive histone mark H3K27me3 after Dnmt3a conditional knockout (cKO).

Figure 4—figure supplement 1.

(A) Transcription factors (TFs) associated with chromatin organization were predicted to regulate differentially expressed (DE) genes in the P39 Dnmt3a cKO. The functional TF rank score was given by Binding Analysis of Regulation of Transcription (BART; Wang et al., 2018). TFs predicted to regulate only up-regulated genes, to regulate only down-regulated genes and to regulate both the up-regulated and down-regulated genes are colored in red, blue, and black, respectively. TFs associated with chromatin organization and remodeling are labeled. (B) Correlation heatmap of epigenetic marks and RNA reads in 10 kb tiling genomic bins in P39 samples. For mCG and mCH, the mean methylation in each bin is used. For RNA-seq and histone modification chromatin immunoprecipitation sequencing (ChIP-seq), the RPKM (reads per kilobase per million) value in each bin is used. The dendrogram is calculated by hierarchical clustering with complete linkage using the Euclidean distances of the Spearman correlation across the samples. Rep1/2, replicate 1/2. (C) MA plots show no significant changes of H3K4me3 and H3K27ac signal in P39 Dnmt3a cKO compared to control. (D) MA plots show that significant differentially modified H3K27me3 regions are only found in P39 Dnmt3a cKO but not E14 or P0 when compared to control.
Figure 4—figure supplement 2. The increased H3K27me signal in P39 was generally not observed in E14 or P0.

Figure 4—figure supplement 2.

(A–B) Correlations of the differences of H3K27me3 signal between Dnmt3a conditional knockout (cKO) and control across the three development time points in 10 kb genomic bins (A) and the P39 H3K27me differentially modified (DM) regions (B). (C–D) Boxplots to show the distribution of the differences of H3K27me3 signal between Dnmt3a cKO and control in E14, P0, and P39, stratified by ∆H3K27me3 signal at P39. Signals are counted in 10 kb genomic bins (C), as in (A) and the P39 H3K27me DM regions (D), as in (B). Asterisks show the significances from Wilcoxon rank-sum tests against zero (red, greater than zero; blue less than zero; *, p<0.05; **, p<0.01; ***, p<0.001).
Figure 4—figure supplement 3. H3K27me3 differentially modified (DM) regions in P39 Dnmt3a conditional knockout (cKO) were highly overlapped with differentially methylated regions (DMRs).

Figure 4—figure supplement 3.

(A) H3K27me3 signals in DM regions (left) compared to random shuffles across the genome (middle; keeping peak sizes and the chromosome same as DM regions) and random shuffles within H3K27me3 peaks (right; H3K27me3 peaks are the union set of cKO and control peaks; restrict shuffles within the said peaks and keeping peak sizes and the chromosome same as DM regions). ****, Wilcoxon rank-sum test p<0.0001; n.s., not significant. (B) Top 10 Gene Ontology terms of the Biological Process ontology enriched in the genes associated with regions with up-regulated H3K27me3 signal in the P39 Dnmt3a cKO samples. Gene-region association was estimated using GREAT (McLean et al., 2010). FDR, false discovery rate; the vertical dashed line shows the threshold of FDR = 0.05. (C) Violin plots to show the H3K27me3 signals in non-differentially modified regions (non-DM, gray) selected to match baseline H3K27me3 as those in significantly DM H3K27me regions (DM, red). ****, Wilcoxon rank-sum test p<0.0001; n.s., not significant. (D) Overlaps of P39 H3K27me3 DM regions with P39 Dnmt3a cKO DMRs. As a comparison, non-DM regions selected in (C) with matched H3K27me3 signal in the P39 control sample were used. Left panel, Venn diagrams to show the numbers of H3K27me3 DM and non-DM regions that overlap P39 cKO DMRs. Right panel, significant enrichments of the overlaps (Fisher’s test, p<0.05). (E) The differences of CG (left) or CH (right) methylation levels between Dnmt3a cKO and control samples in DM (red) and non-DM (gray) regions. ****, Wilcoxon rank-sum test p<0.0001.

To experimentally address this, we performed chromatin immunoprecipitation sequencing (ChIP-seq) in excitatory neurons at embryonic day 14 (E14) and postnatal days 0 and 39 to measure trimethylation of histone H3 lysine 27 (H3K27me3), a repressive mark whose deposition is catalyzed by PRC2 and is important for transcriptional silencing of developmental genes. In P39 neurons, we also measured two histone modifications associated with active chromatin: H3K4me3 (trimethylation of histone H3 lysine 4, associated with promoters) and H3K27ac (acetylation of histone H3 lysine 27, associated with active promoters and enhancers) (Heinz et al., 2015). For each mark, we performed sequencing on two independent samples, each of which used pooled tissue from two mice. The active and repressive marks had positive and negative correlations with mRNA expression, respectively (Figure 4—figure supplement 1B). In addition, we noted regions where increased H3K27me3 concurred with the decreased DNA methylation. For example, at the Mab21l2 locus (Figure 4B), we observed a 2.41-fold increase (Figure 4C) in H3K27me3 in P39 Dnmt3a cKO neurons, coinciding with the loss of CG methylation at multiple DMRs spanning the gene body and surrounding region. The Mab21l2 gene was lowly expressed (TPM < 3) in both control and cKO neurons, consistent with a role for the gain of H3K27me3 in maintaining the repression of this gene.

Using a conservative strategy to call ChIP-seq peaks (Zang et al., 2009), we found that marks associated with transcriptional activity (H3K4me3 and H3K27ac) were largely conserved in the P39 cKO and control (Figure 4D). By contrast, we found 51.9% more H3K27me3 peaks in mature (P39) cKO than in control neurons (Figure 4D, Supplementary file 6). When we directly identified differentially modified (DM) regions, we found no DM for H3K4me3 and H3K27ac between the cKO and control in P39 neurons (Figure 4—figure supplement 1C). By contrast, we found 4040 regions with significantly increased H3K27me3 in the P39 cKO, covering ~31.05 MB of the genome (Figure 4—figure supplement 1D, FDR < 0.05, Supplementary file 7). Differential H3K27me3 appears late during brain maturation: only 3 DM regions were found in earlier development stages (E14 or P0; Figure 4—figure supplement 1D), and the signal differences of cKO and control did not show clear correlations between P39 and earlier stages (Figure 4—figure supplement 2). These DM regions have a medium but non-zero level of H3K27me3 in the P39 control (higher than random shuffles across the whole genome but lower than random shuffles within the peak regions, Figure 4—figure supplement 3A), and hence presumably fine-tunable after Dnmt3a cKO. Genes associated with these DM regions were enriched in development-related functions (Figure 4—figure supplement 3B, Supplementary file 8). These results suggest that the increase of H3K27me3 in Dnmt3a cKO excitatory neurons occurred postnatally, following the major impact of the loss of Dnmt3a on neuronal DNA methylation.

The increase in H3K27me3 was closely associated with regions that lost CG DNA methylation in the Dnmt3a cKO. The majority (66.2%) of the regions marked by H3K27me3 in the cKO overlapped with P39 cKO DMRs (Figure 5A, Fisher’s test p<1e-100). Likewise, P39 cKO DMRs were significantly enriched in peaks (22.4% overlapped H3K27me3 peaks in cKO, p<1e-100, Figure 5B) and DM regions of H3K27me3 (Figure 4—figure supplement 3D). The DM regions of H3K27me3 had significantly more overlaps with DMRs compared to the non-DM control regions (Fisher’s exact test p<2.2e-16, OR: 2.70, Figure 4—figure supplement 3C-D). The DM regions also have larger decreases in both mCG and mCH when compared to non-DM regions (Wilcoxon rank-sum test p<0.0001, Figure 4—figure supplement 3E). Conversely, the DMRs were depleted in regions marked by H3K27ac (12.7%, p<1e-100) or H3K4me3 (0.98%, p=1.73e-149) (Figure 5A–B). Moreover, H3K27me3 was more abundant at the center of DMRs in cKO compared to control neurons at P39 (∆=0.15 in the unit of fold enrichment vs. IgG, 5.04% increases, Figure 5C). There were much smaller changes of H3K27me3 at these DMRs in newborn (P0, ∆=0.0075, 0.27% changes) or fetal (E14, ∆=–0.028, –1.02% changes) neurons (Figure 5C), and the increases were not seen in randomly shuffled regions (Figure 5—figure supplement 1A). The signal of H3K4me3 and H3K27ac at the DMRs was also elevated in the cKO, to a lesser extent (H3K4me3: ∆=0.068, 2.92% changes; H3K27ac: ∆=0.13, 4.64% changes). Going beyond overlaps of regions, we found a quantitative association between the changes in DNA methylation and H3K27me3 in mature (P39) neurons (Figure 5D). At H3K27me3 peaks, the ChIP-seq signal intensity fold-change between cKO and control correlated with the loss of mCG in cKO (Spearman r=–0.26, p<2.2e-16). When accessing the H3K27me3 changes as a function of baseline H3K27me3 levels in the control across the genome tiled in 1 kb bins, we observed bigger H3K27me3 increases in bins with overlapping DMRs compared to bins without overlapping DMRs (Wilcoxon rank-sum test p<0.001 in each bin, Figure 5—figure supplement 1B). These results indicate that DMRs are particularly unique in showing increased H3K37me3 signal.

Figure 5. Increased H3K27me3 correlates with the loss of postnatal DNA methylation.

(A) Most of the P39 H3K27me3 peaks (57.1% and 66.2% of control and conditional knockout [cKO] peaks), but only some of the P39 H3K27ac peaks (28.9% and 33.1%), overlap with P39 Dnmt3a cKO differentially methylated regions (DMRs). (B) Significant enrichment (red) or depletion (blue) of P39 Dnmt3a cKO DMRs in the histone modification chromatin immunoprecipitation sequencing (ChIP-seq) peaks (Fisher’s test, p<0.05). E14, embryonic day 14; P0 and P39, postnatal days 0 and 39. (C) Histone modification ChIP-seq signal around the center of DMRs. RPKM, reads per kilobase per million. ***, Wilcoxon rank-sum test of the differences at the center, p<0.001. (D) Correlation of P39 H3K27me3 signal fold-changes and P39 CG methylation levels differences between Dnmt3a cKO and control in H3K27me ChIP-seq peaks. The smoothed line is fitted using a generalized additive model, and the shaded area shows the 95% confidence interval of the fit. r, Spearman correlation coefficient. (E) P39 Dnmt3a cKO down-regulated DE genes (false discovery rate [FDR] < 0.05) with overlapping P39 Dnmt3a cKO DMRs show small but significant increases of H3K27me3 in P39 cKO (upper panel). *, Wilcoxon rank-sum test against zero, p<0.05. No such differences were observed in differentially expressed (DE) genes that do not overlap with P39 cKO DMRs (bottom panel). H3K27me3 signal was calculated as the RPKM fold-change between H3K27me3 and IgG.

Figure 5.

Figure 5—figure supplement 1. Differentially methylated regions (DMRs) are particularly unique in showing increased H3K27me3 signal.

Figure 5—figure supplement 1.

(A) Histone modification chromatin immunoprecipitation sequencing (ChIP-seq) signal around the center of DMRs. Two sets of controlled regions were selected as comparisons: The first shuffled set (dotted lines) is regions generated by shuffling the DMRs randomly across the same chromosome (excluding blacklist regions) as the observed and keeping the same region size. The second shuffled set (dashed lines) is more stringent shuffles that meet the same criteria as the first set but with an additional restriction that the shuffles must reside within the P39 H3K27me3 peak regions (the union of P39 control and P39 cKO H3K27me3 peaks). RPKM, reads per kilobase per million. (B) The differences of H3K27me3 signal between Dnmt3a conditional knockout (cKO) and control as a function of the baseline level of H3K27me3 in control samples. 1 kb tiled genomic bins are grouped by whether or not they overlap any DMRs, and the dots represent the average signal of bins stratified by deciles of baseline H3K27me signal. Different replicates were used in computing values on the x-axis and y-axis to avoid double-dipping. Data are presented as mean ± SEM for each bin. ***, Wilcoxon rank-sum test, p<0.001.
Figure 5—figure supplement 2. Relationships between gene expression, H3K27me3 signal and gene body mCH.

Figure 5—figure supplement 2.

(A) Correlations of gene expression fold-change and gene body H3K27me3 signal difference in P39 Dnmt3a conditional knockout (cKO) differentially expressed (DE) genes (false discovery rate [FDR] < 0.2). Left panel, scatter plot to show the correlation. DE genes are grouped by whether or not they overlap with P39 Dnmt3a cKO differentially methylated regions (DMRs). r, Spearman correlation coefficient. Right panel, P39 Dnmt3a cKO down-regulated DE genes with overlapping P39 cKO DMRs, in general, show small but significant increases of H3K27me3 in P39 cKO. Asterisks in the middle of each box show the fold-change significance against zero using Wilcoxon rank-sum test for each gene set (*, p<0.05; ***, p<0.001). Comparing across gene sets, these down-regulated DE genes also show bigger increases of H3K27me3 signal after cKO, compared to non-DE genes of matched expression levels with overlapping DMRs (Wilcoxon rank-sum test, *, p<0.05) and up-regulated DE genes with overlapping DMRs (**, p<0.01). No such differences were observed in DE genes without overlapping P39 cKO DMRs (n.s, not significant). (B) Differences in gene body methylation vs. difference of gene H2K27me3 signal between P39 Dnmt3a cKO and control in all non-DE genes (14,591 genes with FDR ≥ 0.05). The plots show mean ± SEM differences of H3K27me signal for genes in 10 non-overlapping bins (deciles of mC difference). (C) Genes are grouped by the quantiles of loss of mCH in cKO, and the gene expression fold-changes (mean ± SEM) as a function of the changes of H3K27me3 in deciles in the gene body for each of these gene sets are shown.

We further examined whether the increased H3K27me3 could account for the reduced expression of some genes in the Dnmt3a cKO neurons. We found that down-regulated genes that overlap DMRs and the non-DE genes selected with matched expressions of up-regulated DE genes that overlap DMRs showed a small but significant median increase of H3K27me3 in the cKO (Wilcoxon rank-sum test p-value < 0.05; Figure 5E upper panel). When we considered a larger set of DE genes with a more relaxed threshold (FDR < 0.2), we observed that down-regulated DE genes containing DMRs accumulate significantly more H3K27me3 than non-DE genes containing DMRs and up-regulated genes containing DMRs (Wilcoxon rank-sum test p=0.0029, Figure 5—figure supplement 2A). By contrast, up-regulated genes had no significant accumulation of H3K27me3 and were not significantly different from the control genes (Figure 5—figure supplement 2A). No such differences were observed between DE and non-DE genes without overlapping DMRs (Figure 5E lower panel and Figure 5—figure supplement 2A). These results suggest that the effect of H3K27me3 is likely specific to genes containing DMRs and the effect is stronger in the down-regulated DE genes, which may partially explain the fact that 24 genes were significantly down-regulated after the loss of repressive DNA methylation in the Dnmt3a cKO (Figure 2A–B).

We next analyzed how changes in H3K27me3 related to the loss of mCH. In all non-DE genes (FDR ≥ 0.05), we observed a larger increase of H3K27me3 signal in the gene body of genes that lost most mCH in the cKO (Figure 5—figure supplement 2B). Indeed, when we grouped all genes by the extent of the loss of mCH, we found that genes that lost the most mCH had a negative correlation between the changes in gene body H3K27me3 signal and gene expression fold-change. Such correlation was not observed in genes that did not lose mCH (Figure 5—figure supplement 2C). These results suggest that H3K27me3 may have a role in repressing expression specifically in regions that lose repressive non-CG DNA methylation.

Developmental changes in H3K27me3 are not affected by Dnmt3a cKO

Our finding that Dnmt3a cKO disrupts the normal developmental gain of DNA methylation prompted us to ask whether the changes in H3K27me3 in the cKO are likewise associated with developmental regulation of H3K27me3. Indeed, our ChIP-seq data from E14, P0, and P39 excitatory neurons revealed striking developmental dynamics in H3K27me3. We identified 12,994 developmentally regulated H3K27me3 DM regions between E14 and P39, with a similar number of regions that gain (6774) and lose (6220) H3K27me3 (Figure 6A and Figure 6—figure supplement 1A, Supplementary file 9). We also examined DM regions at P0 vs. E14 and P0 vs. P39 (Figure 6—figure supplement 1A). However, due to greater biological variability at the perinatal time point, the two replicate ChIP-seq datasets at P0 were less consistent than the E14 and P39 samples (Figure 6—figure supplement 2). As a result, our data were not well powered to detect changes at P0 and we chose to focus on the E14 vs. P39 DM regions as sites of developmental chromatin remodeling. Genes associated with these DM regions were enriched in biological processes involved in development such as nervous system development and neurogenesis (Figure 6—figure supplement 1B). The developmental H3K27me3 DM regions overlapped developmental DMRs, with a notable overlap of regions gaining both mCG and H3K27me3 (Figure 6B). Both modifications may thus act together to repress thousands of genomic regions during development.

Figure 6. Developmental dynamics of H3K27me3.

(A) Heatmap of developmentally regulated H3K27me3 regions in E14 and P39 control samples. CPM – counts per million; R1/2 – replicates. (B) Bar plots show the numbers of developmental differentially modified H3K27me3 regions (E14 vs. P39) that overlap developmental differentially methylated regions (DMRs) (P0 vs. P39, left panel), and the numbers of developmental DMRs that overlap developmental differentially modified H3K27me3 regions (right panel). (C–E) Normalized H3K27me3 signal (fold-changes compared to P39 control), and mCG, mCH differences (compared to the average of the two replicates from P39 control) in peaks that overlap with E14 vs. P39 developmental loss-of-H3K27me3 regions (C), developmental gain-of-H3K27me3 regions (D), or increased H3K27me3 in P39 Dnmt3a cKO (E).

Figure 6.

Figure 6—figure supplement 1. Regions prone to alteration of H3K27me3 by Dnmt3a conditional knockout (cKO) were distinct from the regions affected by developmentally dynamic H3K27me3.

Figure 6—figure supplement 1.

(A) Differentially modified H3K27me3 regions during development in control samples. (B) Top 10 most enriched biological process terms from the Gene Ontology in the genes associated with differentially modified H3K27me3 regions during development in control samples. Gene-region association was predicted by GREAT (McLean et al., 2010). FDR, false discovery rate; the vertical dashed line shows the threshold of FDR = 0.05. (C) Correlations of P39 H3K27me3 signal fold-changes of Dnmt3a cKO vs. control and developmental H3K27me3 signal fold-changes of P39 vs. E14 in H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) peaks. Note that peaks with big absolute fold-changes in P39 cKO vs. P39 control are generally not changed much in P39 control vs. E14 control, and vice versa. Replicates 1 and 2 of P39 control are used in the y and x axis, respectively, to avoid double-dipping. The smoothed line is fitted using a generalized additive model, and the shaded area shows the 95% confidence interval of the fit. r, Spearman correlation coefficient. (D) The developmental H3K27me3 signal fold-changes are not affected by Dnmt3a cKO. r, Spearman correlation coefficient. (E–G) Boxplots to show the distribution of H3K27me3 signal, mCG, and mCH levels in peaks that overlaps with E14 vs. P39 developmental loss-of-H3K27me3 regions (E), peaks that overlap with developmental gain-of-H3K27me3 regions (F), and peaks that overlap with increased H3K27me3 regions in P39 Dnmt3a cKO (G). Asterisks show the significances from paired t-tests (***, p<0.001). ∆ values are the mean of the differences for the two groups in comparison (group on the right vs. group on the left).
Figure 6—figure supplement 2. Correlations of chromatin immunoprecipitation sequencing (ChIP-seq) signals across replicates.

Figure 6—figure supplement 2.

Correlations were computed with ChIP-seq counts in 1 kb genomic bins (A) and in peaks (B) across replicates. r, Spearman correlation coefficient. n, number of peaks. RPKM, reads per kilobase million.

The P39 Dnmt3a cKO changes of H3K27me3 signal were weakly correlated with the developmental changes of H3K27me3 signal (r = –0.12, p<2.2e-16, Figure 6—figure supplement 1C). Moreover, there was little appreciable difference between the H3K27me3 signal fold-changes between P39 and E14 cKO samples compared with those of control samples (Spearman r=0.67, p<2.2e-16, Figure 6—figure supplement 1D).

To further stratify the joint distribution of developmental and Dnmt3a cKO-dependent changes in both H3K27me3 and DNA methylation, we assigned peaks to three groups (Figure 6C–E and Figure 6—figure supplement 1E-G). ‘Group DevLoss’ and ‘Group DevGain’ peaks lose or gain H3K27me3 during development, respectively (Figure 6C–D and Figure 6—figure supplement 1E-F). ‘Group cKO’ peaks have higher H3K27me3 in the Dnmt3a cKO compared to control at P39 (Figure 6E and Figure 6—figure supplement 1G). We found that developmental peaks (DevLoss and DevGain) were relatively unaffected by the cKO (ΔH3K27me3=0.02, –0.03 respectively, in units of log10(CPM +1)), whereas Group cKO had 4.5-fold larger mean effect (ΔH3K27me3=0.09). Group cKO peaks also experienced greater loss of mCG (ΔmCG = –23.5%) than Group DevLoss (–13.0%) or DevGain (18.6%) (Figure 6C–E and Figure 6—figure supplement 1E-G, middle and right panels). These results suggest that regions prone to alteration of H3K27me3 by Dnmt3a cKO are distinct from the regions affected by developmentally dynamic H3K27me3. We observed a very similar pattern when we examined sites with differential H3K27me3 between P0 and P39 (data not shown).

Novel DNA methylation valleys with increased H3K27me3 signal in the Dnmt3a cKO

DNA methylation and H2K27me3 have complementary roles at DNA methylation valleys (DMVs), that is, large regions (≥5 kb) with low mCG (≤15%) that occur around key transcriptional regulators of development in human and mouse tissues (Mo et al., 2015; Xie et al., 2013). Previous studies comparing the epigenetic profile of DMVs across tissues identified multiple categories, including constitutive DMVs present in all tissues as well as tissue-specific DMVs (Li et al., 2018). We found more than twice as many DMVs in P39 cKO (1838) compared with control (881) neurons (Supplementary file 10), covering a greater genomic territory (16.02 Mbp in cKO, 7.91 Mbp in control). The majority of these P39 DMVs were either expanded or unique in cKO (Figure 7A), while DMVs identified in P0 samples were mostly not altered (Figure 7—figure supplement 1). Most P39 DMVs had active histone marks (H3K27ac+, H3K27me3-), while some had repressed or bivalent profiles consistent with PRC2-associated gene silencing (H3K27me3+) (Figure 7B).

Figure 7. Distinct clusters of DNA methylation valleys (DMVs) were associated with the increased H3K27me3 signal in the Dnmt3a conditional knockout (cKO).

(A) Number of DMVs identified in the P39 Dnmt3a cKO and the control samples, categorized by whether they appear in one or both groups or change size in the cKO. (B) Overlap of DMVs with the H3K4me4 and/or the H3K27me3 chromatin immunoprecipitation sequencing (ChIP-seq) peaks. (C) Heatmap of DMVs clustered by their methylation levels and histone modifications. The last two columns show the enrichments of differentially modified (DM) peaks of H3K27me3 and differentially expressed (DE) genes. R1/2, replicate 1/2; RPKM, reads per kilobase per million. (D) Browser tracks show examples of unique DMVs in the Dnmt3a cKO samples and the increased H3K27me3 signal in their flanking regions.

Figure 7.

Figure 7—figure supplement 1. Number of DNA methylation valleys (DMVs) identified in the P0 Dnmt3a conditional knockout (cKO) and the control samples.

Figure 7—figure supplement 1.

P0 DMVs were categorized by whether they appear in one or both groups or change size in the cKO, as in Figure 7A.

By clustering the DMVs using their pattern of DNA methylation, chromatin modifications, and gene expression, we found eight distinct categories (Figure 7C–D). Whereas most DMVs lack H3K27me3 (clusters C2,3,5,7,8), we found some groups of DMVs associated with moderate (C1, C4) or high (C6) levels of H3K27me3. Cluster C6 DMVs, such as the promoter of Tfap2c (Figure 7C–D), had high H3K27me3 and low mCG in both control and cKO neurons, and were not strongly affected by the loss of mC in the Dnmt3a cKO. By contrast, cluster C1 and C4 DMVs, including the promoters of Lhx2, Foxp1, Foxp2, Slc17a6 (encoding vesicular glutamate transporter, Vglut2) and Sema3f, gain mC during normal development (Figure 7C–D). Cluster C1 DMVs gained H3K27me3 in the P39 cKO compared to control animals. The loss of mC in these regions in the Dnmt3a cKO did not lead to strong activation of gene expression, potentially due to compensatory PRC2-mediated repression.

The remaining clusters lack H3K27me3 and are instead marked by low mCG and either high H3K27ac (cluster C8) or both H3K27ac and H3K4me3 (clusters C7). Cluster C5 DMVs (e.g. Nr1d1, Pde4d, Figure 7D) have high mCG at P0 and lose methylation in excitatory neurons during brain development. This demethylation is not affected in the Dnmt3a cKO, and we found little difference between the control and cKO neurons at these sites. Finally, clusters C2 and C3 were enriched for up- and down-regulated genes, respectively.

Discussion

To examine the role of DNA methylation in cortical excitatory neurons after their birth, we developed a mouse model where the loss of DNA methylation occurs in postmitotic neurons, prior to the postnatal increase in Dnmt3a expression and non-CG methylation (Lister et al., 2013). Neurod6 conditional deletion of Dnmt3a in cortical excitatory neurons abolished non-CG methylation and reduced CG methylation throughout the genome (Figure 3C). We found that Dnmt3a cKO neurons had altered expression of dozens of genes, including both up- and down-regulated genes (Figure 3A–B). Similar complex patterns in gene expression were reported when Dnmt3a was deleted in inhibitory neurons (Lavery et al., 2020) and following manipulation of the DNA methylation reader MeCP2 (Boxer et al., 2020; Johnson et al., 2017; Lavery et al., 2020). These gene expression changes may partly reflect the direct effect of lower gene body CH methylation and loss of CG and CH methylation at gene promoters and distal enhancers (Boxer et al., 2020; Clemens et al., 2020). In addition, some gene expression changes could result from the disruption of other regulatory processes, such as TF expression or chromatin modification. Indeed, we found that the Dnmt3a cKO DMRs overlapped with regions that gain methylation during normal postnatal development (Figure 3G–H). These DMRs had increased H3K27ac in the Dnmt3a cKO (Figure 5C), consistent with observations in adult animals lacking Dnmt3a specifically in GABAergic Sst- or Vip-expressing interneurons (Stroud et al., 2020). In those experiments, embryonic gene regulatory elements had lower cytosine methylation and increased H3K27ac and H3K4me1 (Stroud et al., 2020). This suggests an essential role for Dnmt3a and DNA methylation in shaping the transcriptome during development in part via the inactivation of embryonic enhancers, with potentially long-lasting effects on the gene expression pattern of mature neurons. Although we did not detect individual peaks with a statistically significant difference in H3K27ac between cKO and control (Figure 4—figure supplement 1C), this could be due to a small number of replicates and the subtle magnitude of changes in gene expression relative to the large number of tested peaks. Future experiments using more replicates or orthogonal techniques like CUT&RUN and CUT&Tag could help to further investigate the relationship between the Dnmt3a-dependent DNA methylation and the activation of enhancers.

There is a strong antagonistic relationship between DNA methylation and the PRC2-associated histone mark, H3K27me3 (Brinkman et al., 2012; Jermann et al., 2014; Lynch et al., 2012; Reddington et al., 2013; Wu et al., 2010). Switching between polycomb- and DNA methylation-mediated repression has been observed during development and in cancer (Mohn et al., 2008; Schlesinger et al., 2007; Widschwendter et al., 2007). Severe depletion of mCG can lead to redistribution of H3K27me3, causing derepression of developmental regulators such as the Hox gene clusters (Reddington et al., 2013). We did not observe ectopic expression of these genes, possibly due to the relatively modest reduction in mCG in our model compared with cells lacking Dnmt1. Instead, we found that in cortical excitatory neurons, thousands of sites gained H3K27me3 following the loss of mCG in the Dnmt3a cKO (Figure 4—figure supplement 1D). These sites, which normally gain DNA methylation during postnatal development, were left unmethylated in cKO neurons (Figures 4B and 6E). These regions were largely distinct from the sites that gain or lose H3K27me3 during normal development (Figure 6C–E), and had an intermediate level of H3K27me3 in the control samples (Figure 4—figure supplement 3A). A subset of these regions formed large-scale DMVs spanning key regulatory genes (Figure 7C–D). Overall, our results suggest that when DNA methylation is disrupted, H3K27me3 might partially compensate for the loss of mCG and/or mCH and act as an alternative mode of epigenetic repression. Nevertheless, we did not find differential expression in any of the four core components of PRC2 (Ezh2, Suz12, Eed, and Rbbp4) in adult Dnmt3a cKO animals. It is possible that the increased H3K27me3 was mediated by transient expression of PRC2 components during development in the cKO. Furthermore, the predictions from BART (Figure 4A) were derived from various cell lines and tissues from the ENCODE project (Davis et al., 2018; ENCODE Project Consortium, 2012), suggesting that the potential PRC2 binding at our DEGs may normally happen in systems other than the brain or pyramidal neurons, or at other time points during development. Additional experiments which directly manipulate components of the PRC2 system are required to further test the potential compensation mechanism.

Previous work showed that the early embryonic deletion of Dnmt3a in excitatory neurons caused gross motor deficits and a shortened lifespan (Nguyen et al., 2007). Mid-gestation Neurod6-driven Dnmt3a ablation, however, did not cause such alterations, allowing us to test how the epigenetic and transcriptional alterations affected the morphology and function of excitatory synapses, neurons’ passive and active properties, and behavior. We found that prelimbic layer 2 neurons of Dnmt3a cKO animals had more immature dendritic spines and were less sensitive to somatic injections of depolarizing current compared to control neurons (Figure 2). This is consistent with several of the observed DE genes having annotated roles in dendrite morphogenesis (Elavl4, Hecw2, Ptprd, Figure 3B) and Na+ influx/transport (Hecw2, Scn3b, Figure 3B). Indeed, the down-regulation of the latter set of genes could be expected to increase the action potential threshold (thus dampening neuronal excitability) and trigger neurodevelopmental delays (Berko et al., 2017). As well, Cacnb3, a gene that encodes a regulatory beta subunit of the voltage-dependent calcium channel, was found among the set of down-regulated genes after loss of Dnmt3a. This gene has been linked with schizophrenia (Maycox et al., 2009), ADHD, and bipolar disorder (van Hulzen et al., 2017) in humans. Our study thus suggests that the disruption of methylation patterns established by Dnmt3a during infancy might have far-reaching mechanistic relevance for multiple neurodevelopmental disorders, complementing previous genetic risk association studies linking Dnmt3a with autism (C Yuen et al., 2017; Sanders et al., 2015). Although our experiments were not designed to make quantitative comparisons between cohorts of different sexes, we observed a wider range of behavioral impairments in male Dnmt3a cKO mice, which included, for example, decreased social interest (Figure 1). Those data correlate well with several human neurodevelopmental disorders, in which males are disproportionately more affected than females, and thus provide important support for follow-up investigations into the underlying causes of those differences.

Our findings highlight the critical and interconnected roles in brain development and cognitive function of two major modes of epigenetic repression of gene expression: DNA methylation and PRC2-mediated repression. The loss of DNA methylation in excitatory neurons has effects on gene expression, synaptic function, and cognitive behavior. Moreover, loss of DNA methylation leads to a gain of the PRC2-associated repressive mark H3K27me3. Our cKO is a restricted manipulation of one neuron type, yet it directly impacts DNA methylation throughout the genome at millions of sites. PRC2-mediated repression may compensate for the loss of mCG and/or mCH, acting as an alternative repressive mechanism when DNA methylation is disrupted. Future work focusing on earlier developmental stages, and using targeted methods to manipulate epigenetic marks in local genomic regions (Liu et al., 2016), may help elucidate the causal interactions among epigenetic modifications that are critical for neuronal maturation and function.

Materials and methods

Generation of the Dnmt3a cKO mice line

All animal procedures were conducted in accordance with the guidelines of the American Association for the Accreditation of Laboratory Animal Care and were approved by the Salk Institute for Biological Studies Institutional Animal Care and Use Committee (protocol number 18-00006). For behavior, slice physiology, and spine analyses, Dnmt3a-floxed animals (Okano et al., 1999) (backcrossed to C57BL/6 for at least seven generations) were crossed to Neurod6-Cre (Nex-Cre) (Goebbels et al., 2006, backcrossed to C57BL/6 J for >10 generations) mice to generate Dnmt3a-KO animals carrying the deletion only in pyramidal cells. To be able to isolate pyramidal neuron nuclei for DNA methylation, transcription, and ChIP analyses, the mouse lines (Dnmt3a-KO and Neurod6-Cre) were crossed to a mouse line carrying the INTACT background (B6.129-Gt(ROSA)26Sortm5(CAG-Sun1/sfGFP)Nat/MmbeJ.Strain 030952, Jackson laboratories). The deletion of Dnmt3a from pyramidal cells was confirmed by RNA-seq (deletion of exon 19) and Western blot (Figure 1B and Figure 1—figure supplement 3A-B). For both backgrounds, Nex-Cre hemizygous mice were used as controls.

Frontal cortex dissection, nuclei isolation, and flow cytometry

Frontal cortex tissue was produced as described (Lister et al., 2013; Luo et al., 2017) from postnatal day 0 and 39 (P39) Dnmt3a cKO and control animals, in an INTACT background. The nuclei of GFP-expressing NeuN-positive excitatory neurons were isolated and collected using FANS as described (Lister et al., 2013; Luo et al., 2017) with the following modification: prior to FANS, nuclei were labeled with anti-NeuN-AlexaFluor647 and anti-GFP-AlexaFluor488. Nuclei were sorted as described (Lister et al., 2013). Double positive nuclei were retained for RNA-seq, ChIP-seq, and MethylC-seq library preparation and sequencing.

Western blot

Frontal cortex proteins were obtained by homogenization in RIPA buffer of the following composition: 150 mM NaCl, 10 mM Na2HPO4, 1% NaDOC, 1% NP-40, 0.5% SDS, 1 mM DTT, 1 mM PMSF in DMSO, supplemented with protease inhibitor (Sigma-Aldrich #11836153001) and phosphatase inhibitor (Pierce #A32957) cocktails. After centrifugation at 15,000× g, supernatants were preserved and protein concentration was determined by the BCA method (Pierce). Protein bands were separated in 8% PAGE gels and transferred to nitrocellulose membranes. After blocking in TBS-tween with 5% milk, DNMT3A was detected by the use of anti-DNMT3A antibody (Abcam) and chemiluminescence. DNMT3A bands were normalized to ACTIN content in each sample.

Patch-clamp electrophysiology

Male and female mice (6–9 weeks) were anesthetized with isoflurane and decapitated. The brains were quickly removed and coronal slices of the frontal cortex containing the prelimbic region (~2 mm anterior to Bregma) were cut in an ice-cold slicing medium of the following composition (in mM): 110 sucrose, 2.5 KCl, 0.5 CaCl2, 7 MgCl2, 25 NaHCO3, 1.25 NaH2PO4, and 10 glucose (bubbled with 95% O2 and 5% CO2). The slices were then transferred to artificial CSF (aCSF) containing (in mM): 130 NaCl, 2.5 KCl, 1.25 NaH2PO4, 23 NaHCO3, 1.3 MgCl2, 2 CaCl2, and 10 glucose, equilibrated with 95% O2 and 5% CO2 at 35°C for 30 min and afterward maintained at room temperature (22–24°C) for at least 1 hr (patch-clamp recording) before use. Brain slices were then transferred to a recording chamber and kept minimally submerged under continuous superfusion with aCSF at a flow rate of ~2 ml/min. Whole-cell recordings were obtained from putative prelimbic layer 2 (L2) pyramidal cells (identified by their pyramidal-shaped cell bodies and long apical dendrite using an upright microscope equipped with differential interference contrast optics). In acute mPFC slices, the prelimbic L2 is clearly distinguishable from L1 and L3 as a thin dark band that is densely packed with neuron somata. Pipettes had a tip resistance of 4–8 MΩ when filled with an internal solution of the following composition (in mM): 125 K-gluconate, 15 KCl, 8 NaCl, 10 HEPES, 2 EGTA, 10 Na2 phosphocreatine, 4 MgATP, 0.3 NaGTP (pH 7.25 adjusted with KOH, 290–300 mOsm). Access resistance (typically 15–35 MΩ) was monitored throughout the experiment to ensure stable recordings.

After obtaining the whole-cell configuration in voltage-clamp mode, cells were switched from a holding potential of –70 mV to current-clamp mode and the bridge-balance adjustment was performed. Passive electrical properties were quantified from recordings with hyperpolarizing current injections that evoked small ~5 mV deflections in membrane potential from resting. Responses to stepwise current injections (10–300 pA in increments of 10 pA; duration, 1 s) were recorded at 20 kHz in order to calculate input-output curves and rheobase – the minimal current necessary to trigger the first action potential. Miniature excitatory postsynaptic currents (mEPSCs) were recorded for 5 min in voltage-clamp mode (Vh=−70 mV) in the presence of the Na+ channel blocker, TTX (0.5 μM), to prevent the generation of action potentials, and picrotoxin (50 μM), an antagonist of GABAA receptors, to minimize inhibitory responses. In these conditions, mEPSCs could be blocked by the AMPA receptor antagonist, CNQX (25 μM). Single events larger than 6 pA were detected offline using the Minianalysis program (Synaptosoft Inc Decatur, GA). All data were acquired using a Multiclamp 700B amplifier and pCLAMP 9 software (Molecular Devices, LLC, San Jose, CA).

Fluorescent labeling of dendritic spines

Coronal brain slices containing the mPFC of 10–13 weeks’ female mice were prepared as for electrophysiological recordings and placed in a beaker for 3 hr at room temperature (24°C) to allow functional and morphological recovery. One slice was then transferred to a recording chamber and kept minimally submerged under continuous superfusion with aCSF bubbled with carbogen (95% O2/5% CO2) at a flow rate of ~2 ml/min. Previously sonicated crystals of the fluorescent marker DiI were placed next to the somata of layer 2 neurons in the prelimbic cortex, identified with the aid of an upright microscope equipped with differential interference contrast optics. The mice in these experiments had a Thy1-YFP background to help rule out non-specific labeling of deeper layer neurons. The neurons were exposed to the DiI crystals for 60 min. The slices were then gently removed from the incubation chamber with a transfer pipette and immersed in fixative (4% PFA) for 30 min. Then, the slices were rinsed three times with PBS for 5–10 min each, after which they were mounted on slides with prolonged gold antifade mounting medium (Life Technologies – Molecular Probes). The slides were kept in a dark box for 24 hr at room temperature to allow the liquid medium to form a semi-rigid gel. Imaging took place 24–48 hr from the time of the initial staining.

Confocal imaging

Dendritic spines were imaged by an investigator blind to the genotype using a Zeiss AiryScan confocal laser scanning microscope. All images were taken using the Zeiss Plan-APOCHROMAT 63× oil-immersion lens (N/A 1.4). A 543 nm laser was used to visualize the fluorescence emitted by DiI. Serial stack images with a 0.2 μm step size were collected, and then projected to reconstruct a three-dimensional image that was post-processed by the AiryScan software. Dendritic segments in layer 1, which were derived from layer 2 pyramidal neurons retrogradely labeled with DiI and that were well separated from neighboring neural processes, were randomly sampled and imaged. Each dendritic segment imaged for quantification belonged to a different neuron.

Dendritic spine quantification

The z-stack series were imported into the Reconstruct software (https://synapseweb.clm.utexas.edu/software-0/), with which a second investigator also blind to the genotype performed the identification of dendritic spines and their morphometric analysis. By scrolling through the stack of different optical sections, individual spine heads could be identified with greater certainty. All dendritic protrusions with a clearly recognizable stalk were counted as spines. Spine density was determined by summing the total number of spines per dendritic segment length (30–40 μm) and then calculating the average number of spines per μm. Individual dendritic spines were classified in the following order according to pre-established criteria: protrusion longer than 3 μm, filopodia; head wider than 0.6 μm, mushroom; protrusion longer than 2 μm and head narrower than 0.6 μm, long-thin; protrusion longer than 1 μm and head narrower than 0.6 μm, long-thin; the remaining spines were labeled stubby. Branched spines (with more than one neck) were counted separately.

Behavioral testing

Phenotypic characterization was initiated when the animals reached 9 weeks of age using cohorts of 10–15 male or female mice per genotype, according to the order described below.

Open field test

The open field test was performed using MED Associates hardware and the Activity Monitor software according to the manufacturer’s instructions (MED Associates Inc, St Albans, VT). Animals were individually placed into clear Plexiglas boxes (43.38 × 43.38 × 30.28 cm3) surrounded by multiple bands of photo beams and optical sensors that measure horizontal and vertical activity. Movement was detected as breaks within the beam matrices and automatically recorded for 60 min.

Light/dark transfer test

The light/dark transfer procedure was used to assess anxiety-like behavior in mice by capitalizing on the conflict between exploration of a novel environment and the avoidance of a brightly lit open field (150–200 lux in our experiments). The apparatus were Plexiglas boxes as for the open field test (43.38 × 43.38 × 30.28 cm3) containing dark box inserts (43.38 × 12.8 × 30.28 cm3). The compartments were connected by an opening (5.00 × 5.00 cm2) located at floor level in the center of the partition. The time spent in the light compartment was used as a predictor of anxiety-like behavior, that is, a greater amount of time in the light compartment was indicative of decreased anxiety-like behavior. Mice were placed in the dark compartment (4–7 lux) at the beginning of the 15 min test.

Elevated plus maze

The maze consisted of four arms (two open without walls and two with enclosed walls) 30 cm long and 5 cm wide in the shape of a plus sign. The apparatus was elevated approximately 33 cm over a table. At the beginning of each trial, one animal was placed inside a cylinder located at the center of the maze for 1 min. The mouse was then allowed to explore the maze for 5 min. The session was video-recorded by an overhead camera and subjected to automated analysis using ANY-maze software. The apparatus was wiped down with sani-wipes between trials to remove traces of odor cues. The percentage of time spent in open or closed arms was scored and used for analysis.

Y-maze test for spontaneous alternations

Spontaneous alternations between three 38 cm long arms of a Y-maze were taken as a measure of working memory. Single 6 min trials were initiated by placing each mouse in the center of the Y-maze. Arm entries were recorded with a video camera and the total number of arm entries, as well as the order of entries, was determined. The apparatus was wiped down with sani-wipes between trials to remove traces of odor cues. Spontaneous alternations were defined as consecutive triplets of different arm choices and % spontaneous alternation was defined as the number of spontaneous alternations divided by the total number of arm entries minus 2.

Social approach

The apparatus consisted of a Plexiglas box (60 × 38 × 23.5 cm3) divided into three compartments by Plexiglas partitions containing openings through which the mice could freely enter the three chambers. The test was conducted in two 10 min phases. In phase I, the test mouse is first allowed to explore the chambers for 10 min. Each of the two outer chambers contained an empty, inverted stainless steel wire cup. In phase II, the test mouse is briefly removed, and a sex-matched unfamiliar mouse was placed under one of the wire cups, and plastic blocks were placed under the other wire cup. The test mouse was then gently placed back in the arena and given an additional 10 min to explore. An overhead camera and video tracking software (ANY-maze, Wood Dale, IL) were used to record the amount of time spent in each chamber. The location (left or right) of the novel object and novel mouse alternates across subjects.

Acoustic startle responses and PPI of the acoustic startle response

Acoustic startle responses were tested inside SR-LAB startle apparatus (San Diego Instruments, San Diego, CA), consisting of an inner chamber with a speaker mounted to the wall and a cylinder mounted on a piezoelectric sensing platform on the floor. At the beginning of testing, mice were placed inside the cylinder and then were subjected to background 65 dB white noise during a 5 min acclimation period. The PPI session began with the presentation of six pulse-alone trials of 120 dB, 40 ms. Then, a series of pulse-alone trials and prepulse trials (69, 73, or 81 dB, 20 ms followed by 100 ms pulse trial, 120 dB) were each presented 12 times in a pseudorandom order. The session concluded with the presentation of six pulse-alone trials. The apparatus was wiped down with sani-wipes between trials to remove traces of odor cues. The startle amplitude was calculated using arbitrary units, and the acoustic startle response was the average startle amplitude of pulse-alone trials. The percent PPI was calculated as follows: [100 − (mean prepulse response/mean pulse alone response) × 100].

Cued and contextual fear conditioning

Fear conditioning experiments were performed using automated fear conditioning chambers (San Diego Instruments, San Diego, CA), similar to previous studies (Gresack et al., 2010; Risbrough et al., 2014). On day 1, after a 2 min acclimation period, mice were presented with a tone conditioned stimulus (75 dB, 4 kHz) for 20 s that co-terminated with a foot shock unconditioned stimulus (1 s, 0.5 mA). A total of three tone-shock pairings were presented with an inter-trial interval of 40 s. To assess acquisition, freezing was quantified during foot shock presentations. Mice were returned to their home cages 2 min after the final shock. These moderate shock parameters were previously found suitable to detect both increases and decreases in fear-conditioned behavior (Risbrough et al., 2014). Twenty-four hr later, on day 2, mice were re-exposed to the conditioning chamber to assess context-dependent fear retention. This test lasted 8 min during which time no shocks or tones were presented and freezing was scored for the duration of the session. Time freezing was quantified across four 2 min blocks. Day 3: 24 hr after the context fear-retention test, mice were tested for CS-induced fear retention and extinction. The context of the chambers was altered across several dimensions (tactile, odor, visual) for this test in order to minimize generalization from the conditioning context. After a 2 min acclimation period, during which time no tones were presented (‘pre-tone'), 32 tones were presented for 20 s with an inter-trial interval of 5 s. Freezing was scored during each tone presentation and quantifications were done in eight blocks of four tones. Mice were returned to their home cage immediately after termination of the last tone. On day 4, after a 2 min acclimation period, during which time no tones were presented (‘pre-tone’), a shorter session of 16 tones was used to assess extinction. Time freezing was quantified across four blocks of four tones.

RNA extraction, RNA-seq library construction, and sequencing

Nuclei (between 50,000 and 60,000) were used to isolate RNA using Single-Cell RNA Purification Kit (Norgen, catalog# 51800). In brief, aliquots of nuclei were resuspended in 350 µl of RL buffer (Norgen) and passed through an 18 G syringe five times. RNA extraction, including DNase digestion, followed manufacturers’ instructions. RNA was eluted in 20 µl of Elution Solution A (Norgen). The nuclear RNA concentration was determined using TapeStation (Agilent). RNA was diluted to 1 ng/µl and a total of 5 ng was processed for RNA-seq library preparation. RNA libraries were prepared using NuGen Ovation RNA-Seq System V2 (#7102–32) for cDNA preparation following the product manual. cDNA purification was done using Zymo Research DNA Clean & Concentrator-25 with modification from the Ovation protocol. cDNA, eluted in 30–40 µl of TE (1 µg per sample), was fragmented at 300 bp using Covaris S2 (Sonolab S-series V2), followed by library preparation according to KAPA LTP Library Preparation Kit (KK8232), using Illumina indexed adapters. Libraries were sequenced on NovaSeq 6000.

DNA extraction

DNA extraction was performed using the Qiagen DNeasy Blood and Tissue kit (catalog #69504) and eluted into 50–100 µl AE.

Genomic DNA library construction and sequencing

1.5 µg of genomic DNA was fragmented with a Covaris S2 (Covaris, Woburn, MA) to 400 bp, followed by end repair and addition of a 3’ A base. Cytosine-methylated adapters provided by Illumina (Illumina, San Diego, CA) were ligated to the sonicated DNA at 16°C for 16 hr with T4 DNA ligase (New England Biolabs). Adapter-ligated DNA was isolated by two rounds of purification with AMPure XP beads (Beckman Coulter Genomics, Danvers, MA). Half of the adapter-ligated DNA molecules were enriched by 6 cycles of PCR with the following reaction composition: 25 µl of Kapa HiFi Hotstart Readymix (Kapa Biosystems, Woburn, MA) and 5 µl TruSeq PCR Primer Mix (Illumina) (50 µl final). The thermocycling parameters were: 95°C 2 min, 98°C 30 s, then 6 cycles of 98°C 15 s, 60°C 30 s, and 72°C 4 min, ending with one 72°C 10 min step. The reaction products were purified using AMPure XP beads and size selection was done from 400 to 600 bp. Libraries were sequenced on NovaSeq 6000.

MethylC-seq library construction and sequencing

MethylC-seq libraries were prepared as previously described (Urich et al., 2015). All DNA obtained from the extraction was spiked with 0.5% unmethylated Lambda DNA. The DNA was fragmented with a Covaris S2 (Covaris, Woburn, MA) to 300 bp, followed by end repair and addition of a 3’ A base. Cytosine-methylated adapters provided by Illumina (San Diego, CA) were ligated to the sonicated DNA at 16°C for 16 hr with T4 DNA ligase (New England Biolabs). Adapter-ligated DNA was isolated by two rounds of purification with AMPure XP beads (Beckman Coulter Genomics, Danvers, MA). Adapter-ligated DNA (≤450 ng) was subjected to sodium bisulfite conversion using the EZ methylation Direct kit (Zymo, D5021) as per the manufacturer’s instructions. The bisulfite-converted, adapter-ligated DNA molecules were enriched by 8 cycles of PCR with the following reaction composition: 25 µl of Kapa HiFi Hotstart Uracil +Readymix (Kapa Biosystems, Woburn, MA) and 5 µl TruSeq PCR Primer Mix (Illumina) (50 µl final). The thermocycling parameters were: 95°C 2 min, 98°C 30 s, then 8 cycles of 98°C 15 s, 60°C 30 s and 72°C 4 min, ending with one 72°C 10 min step. The reaction products were purified using AMPure XP beads. Up to two separate PCR reactions were performed on subsets of the adapter-ligated, bisulfite-converted DNA, yielding up to two independent libraries from the same biological sample. MethylC-seq libraries were sequenced on NovaSeq 6000.

ChIP-seq library construction and sequencing

Sorted nuclei were crosslinked for 15 min in 1% formaldehyde solution and quenched afterward with glycine at a final concentration of 0.125 M. After crosslinking, nuclei were sonicated in Lysis buffer (50 mM Tris HCl pH 8, 20 mM EDTA, 1% SDS, 1× EDTAfree protease inhibitor cocktail). ChIP assays were conducted with antibodies against H3K27me3 (39156, Active Motif), H3K27ac (39133, Active Motif), and H3K4me3 (04-745, Millipore Sigma). Mouse IgG (015-000-003, Jackson ImmunoResearch) served as a negative control. H3K4me3 ChIP-seq assays were conducted with 100 K nuclei and 500 K nuclei were used for H3K27me3 and H3K27ac ChIP-seq assays. The respective antibodies and IgG were coupled for 4–6 hr to Protein G Dynabeads (50 µl, 10004D, Thermo Fisher Scientific). Equal amounts of sonicated chromatin were diluted with 9 volumes of Binding buffer (1% Triton X-100, 0.1% Sodium Deoxycholate, 2× EDTA free protease inhibitor cocktail) and subsequently incubated overnight with the respective antibody-coupled Protein G beads. Beads were washed successively with low salt buffer (50 mM Tris HCl pH 7.4, 150 mM NaCl, 2 mM EDTA, 0.5% Triton X-100), high salt buffer (50 mM Tris HCl pH 7.4, 500 mM NaCl, 2 mM EDTA, 0.5% Triton X-100) and wash buffer (50 mM Tris HCl pH 7.4, 50 mM NaCl, 2 mM EDTA) before de-crosslinking, proteinase K digestion, and DNA precipitation. Libraries were generated with the Accel-NGS 2S Plus DNA Library Kit (21024, Swift Biosciences) and sequenced on the Illumina HiSeq 4000 Sequencing system.

RNA-seq data processing

RNA-seq reads first went through quality control using FastQC (Andrews et al., 2012) (v0.11.8, https://www.bioinformatics.babraham.ac.uk/projects/fastqc/), and then were trimmed to remove sequencing adapters and low-quality sequences (minimum Phred score 20) using Trim Galore (v0.5.0, https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/, a wrapper tool powered by Cutadapt [Martin, 2011] v1.16) in the paired-end mode. Clean reads were then mapped to the mouse mm10 (GRCm38) genome and the GENCODE annotated transcriptome (release M10) with STAR (Dobin et al., 2013) (Spliced Transcripts Alignment to a Reference, v2.5.1b). Gene expression was estimated using RSEM (Li and Dewey, 2011) (RNA-Seq by Expectation Maximization, v1.2.30). Gene-level ‘expected count’ from the RSEM results were rounded and fed into edgeR (Robinson et al., 2010) (v3.24.1) to call DE genes. Only genes that were expressed (with counts per million [CPM] > 2) in at least two samples were kept. These counts were then normalized using the TMM method (Robinson and Oshlack, 2010), and DE genes were then called in the quasi-likelihood F-test mode, requiring FDR < 0.05 and FC > 20% (Supplementary file 2). To quantify TE expression, we used the Fetch, Clean, Map, and Count modules from SQuIRE (Yang et al., 2019) with the RepeatMasker annotation from UCSC. To evaluate the differences in TE family and class abundance between Dnmt3a knockout and control samples, we grouped SQuIRE’s TE subfamily expression estimates (FPKM) into their respective families and classes and performed two-sample t-tests.

Enrichment test of GO terms in DE genes

Gene Ontology (GO) enrichment analysis was performed using clusterProfiler (Yu et al., 2012) (v3.10.0). Only ‘Biological Process’ terms with no less than 10 genes and no more than 250 genes were considered. Terms with FDR < 0.05 were considered significantly enriched.

Genomic DNA sequencing data processing and SNP calling

To estimate the completeness of the inbreeding of the mouse strains, and to avoid incorrect cytosine context assignment in the following MethylC-seq data processing, we used the genomic DNA sequencing data of both the Dnmt3a cKO and the control animals to call SNPs against the mouse mm10 genome. We followed the GATK ‘best practices for germline SNPs and indels in whole genomes and exomes’ pipeline (DePristo et al., 2011; McKenna et al., 2010; Van der Auwera et al., 2013). Briefly, raw data were first trimmed to remove sequencing adapters and low-quality sequences (minimum Phred score 20) using Trim Galore in the paired-end mode. Clean data were then mapped to the mm10 genome using BWA (Li and Durbin, 2009) (v0.7.13-r1126). Duplicates reads were marked with Picard (Broad Institute, 2018). Then the analysis-ready reads were fed into GATK (v3.7) to perform two rounds of joint genotyping and base recalibration. Variants were then filtered using the following criteria: QD < 2.0 || FS > 60.0 || MQ < 40.0 || MQRankSum < –12.5 || ReadPosRankSum < –8.0 || SOR > 4.0. By that, we identified 548,530 and 507,669 SNPs (relative to mm10) in the Dnmt3a cKO and the control animals, respectively. At last, we created a substituted genome to mask out all these SNPs (replaced with Ns) with the ‘maskfasta’ tool in the BEDTools suite (Quinlan and Hall, 2010) (v2.27.1). This substituted genome was used in the following MethylC-seq data processing pipeline.

MethylC-seq data processing

MethylC-seq reads were processed using the methylpy pipeline (v1.3.2, https://github.com/yupenghe/methylpy; He, 2021) as previously described (Lister et al., 2013; Mo et al., 2015). Briefly, a computationally bisulfite-converted genome index was built using the aforementioned substituted genome file appended with the lambda phage genomic sequence. MethylC-seq raw reads were first trimmed to remove sequencing adapters and low-quality sequences (minimum Phred score 10) using Cutadapt in paired-end mode. To acquire higher mappability, we treated the two ends of the clean reads as they were sequenced in single-end mode, and mapped them to the converted genome index with bowtie2 (Langmead and Salzberg, 2012) (v2.3.0) as aligner in the single-end pipeline of methylpy. Only reads uniquely mapped were kept, and clonal reads were removed. The bisulfite non-conversion rate (NCR) was estimated using the spiked-in unmethylated lambda phage DNA. For each cytosine, a binomial test was performed to test whether the methylation levels are significantly greater than 0 with an FDR threshold of 0.01.

For a particular genomic region, the raw methylation level for a given cytosine context (CG or CH) was defined as:

%mC = 100×mh,

where m is the total number of methylated based calls within the region, and h is the total number of covered based calls within the region. Methylation levels were then corrected for NCR using the following maximum likelihood formula:

%mC_adj=%mc%NCR100%NCR,where %mC_adj[0, 100].

When profiling the methylation landscapes around DE genes, we selected control genes with comparable gene expression using the R package MatchIt (Ho et al., 2011) (v3.0.2). These control genes were defined with nearest neighbor matching of the expression (in the unit of TPM) using logistic link propensity score as a distance measure, requiring the standard deviation of the distance to be less than 0.01.

DMRs calling

CG DMRs were identified using a previously reported method (Ma et al., 2014; Schmitz et al., 2013; Schultz et al., 2015), which is implemented in the DMRfind function in methylpy. We required at least three differentially methylated sites within a particular DMR, and significant sites located within 500 bp of each other were merged into the same DMR. With an FDR cutoff of 0.01 and a post-filtering cutoff of methylation levels change greater than 30 (in the unit of %mCG), we found 222,006 Dnmt3a cKO hypo-DMRs in P39 (Supplementary file 3). Note that with these criteria we also found 89 Dnmt3a cKO hyper-DMRs, which we thought were noise and/or SNPs that failed to be detected by the masking pipeline described earlier. Therefore, we removed these hyper-DMRs from further consideration.

Enrichment test of DMRs and other genomic regions

To test whether DMRs were significantly enriched in certain genomic features, we use methods adapted from a recent report (Rizzardi et al., 2019). Briefly, for each genomic feature, we constructed a 2 by 2 contingency table of (n11, n12, n21, n22), where:

  • n11 is the number of CG sites in DMRs that were inside the feature;

  • n12 is the number of CG sites in DMRs that were outside of the feature;

  • n21 is the number of CG sites not in DMRs that were inside in feature;

  • n22 is the number of CG sites not in DMRs that were outside of the feature.

The total number of CG sites in consideration was the number of autosomal and chromosome X CG in the reference genome. Counting the number of CG rather than the number of DMRs or bases accounts for the non-uniform distribution of CG along the genome and avoids double-counting DMRs that are both inside and outside of the feature.

With this contingency table, we estimated the enrichment log odd ratio (OR) along with its standard error (se) and 95% confidence interval (ci) with the following formulas:

log2OR = log2n11 + log2n12 log2n21 log2n22
se(log2OR) = 1/n11 + 1/n12  + 1/n21+1/n22
ci(log2OR) = [log2OR  2×se(log2OR),  log2OR + 2×se(log2OR)]

p-Value from performing Fisher’s exact test for testing the null of independence of rows and columns in the contingency table (the null of no enrichment or depletion) was computed using the fisher.test() function in R.

The genomic regions/features used in these enrichment tests include: a list of developmental DMRs that gain or lose methylation during development (Lister et al., 2013); gene features (genic, exonic, intronic, promoter, 5’UTR, 3’UTR, intergenic) based on the GENCODE vM10 annotation (promoters were defined as the ±2 kb regions around transcription start sites); CpG island (CGI)-related features based on the ‘cpgIslandExt’ annotation from the UCSC genome browser (Karolchik et al., 2004; Kent et al., 2002) (http://genome.ucsc.edu/index.html), where CGI shores were defined as CGI ± 2 kb, CGI shelves were defined as ±2–4 kb of CGI and open seas were defined as regions that were at least 4 kb away from any CGI; the 12 states of the chromatin states map in mouse embryonic stem cell (Pintacuda et al., 2017) (https://github.com/guifengwei/ChromHMM_mESC_mm10) generated by ChromHMM (Ernst and Kellis, 2017; Ernst and Kellis, 2012) using ChIP-seq data from the ENCODE project (Davis et al., 2018; ENCODE Project Consortium, 2012); the H3K4me3, H3K27ac, and H3K27me3 peaks and the H3K27me3 differentially binding regions generated with our ChIP-seq data (see the ‘ChIP-seq data processing’ section).

Predicting functional TFs regulating the DE genes with BART

The lists of up-regulated and down-regulated genes were fed into BART (Wang et al., 2016; Wang et al., 2018) (BART) separately to predict functional TFs and chromatin regulators that bind at cis-regulatory regions of the DE genes. To make a better visualization, we transformed the relative rank (a metric generated by BART to represent the average rank of Wilcoxon p-value, z-score, and max AUC for each factor divided by the total number of factors) into the functional TF rank score (which is simply 1 minus the relative rank) so that the higher the rank score the more possible the TF regulates the DE genes. The integrative rank significance was estimated with the Irwin-Hall p-value. TFs with Irwin-Hall p-value < 0.05 were considered significant.

Enrichment test of known TF binding motifs in DMRs

We used the ‘findMotifsGenome.pl’ tool in HOMER (Heinz et al., 2010) (Hypergeometric Optimization of Motif EnRichment, v4.8.3) to find known TF binding motif in the Dnmt3a cKO DMRs. The parameters used are as follows: ‘-size 500 -len 8,10,12S 25 -fdr 100p 10 -mset vertebrates -bits -gc -nlen 3 -nomotif’. A set of non-neural DMRs (Hon et al., 2013) was used as the background.

ChIP-seq data processing

ChIP-seq reads were pre-processed with the ENCODE Transcription Factor and Histone ChIP-Seq processing pipeline (https://github.com/ENCODE-DCC/chip-seq-pipeline2, v1.1.6; Jin wook, 2022). Briefly, paired-end reads were mapped to the mm10 genome with BWA (Li and Durbin, 2009) (v0.7.13-r1126). Reads were then filtered using samtools (Li et al., 2009) (v1.2) to remove unmapped, mate unmapped, not primary alignment and duplicate reads (-F 1804). Properly paired reads were retained (-f 2). Multi-mapped reads (MAPQ < 30) were removed. PCR duplicates were removed using the MarkDuplicates tool in Picard (Broad Institute, 2018) (v2.10.6). Reads mapped to the blacklist regions (ENCODE Project Consortium, 2012) in the mouse mm10 genome (http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/mm10-mouse/mm10.blacklist.bed.gz) were also removed.

Peak calling was performed using epic2 (Stovner and Sætrom, 2019) (v0.0.16), a reimplementation of SICER (Zang et al., 2009). For H3K4me3 and H3K27ac, we used the following parameters: ‘--bin-size 200 --gaps-allowed 1’. For H3K27me3, we used the following parameters: ‘--bin-size 200 --gaps-allowed 3’. The IgG sample was used as a control.

DM regions of the histone modification ChIP-seq data were called using DiffBind (Ross-Innes et al., 2012) (v2.10.0) in DESeq2 (Love et al., 2014) (v1.22.1) mode. Regions with FDR < 0.05 were considered significant. Genes associated with these regions were identified using GREAT (McLean et al., 2010) (v3.0.0) with the ‘Basal plus extension’ association rule with default parameters. GO enrichment analysis of the associated genes was performed with GREAT, and we considered the GO biological process terms with hypergeometric test FDR < 0.05 as significant.

To select a set of non-DM control regions to match the base levels of H3K27me3 in DM regions, we started with the union peaks of the control and cKO samples and removed any peaks that overlap the DM regions. From these non-DM peaks, we used the R package MatchIt (Ho et al., 2011) (v3.0.2) to select regions with matched peak lengths and H3K27me3 levels (in the unit of RPKM) as those in the DM regions, with the greedy nearest neighbor matching using logistic link propensity score as a distance measure (requiring the standard deviation of the distance to be less than 0.01).

Definition of bivalent and active CGI promoters

CGI promoters were defined as CpG islands (downloaded from the UCSC genome browser) that overlapping with promoters (±2 kb regions around transcription start sites annotated in GENCODE vM10). These CGI promoters were further tested to see whether they overlapped with the ChIP-seq peaks of H3K4me3 and H3K27me3. Bivalent CGI promoters were defined as CGI promoters that overlapped with both the H3K4me3 and H3K27me3 peaks, whereas active CGI promoters were defined as CGI promoters that overlapped with only the H3K4me3 peaks but not the H3K27me3 peaks.

Identification of DMVs

To find DMVs, we first identified UMRs (undermethylated regions) and LMRs (low methylated regions) using MethylSeekR (Burger et al., 2013) (v1.22.0) with m=0.3 (for P39) or 0.5 (for P0), n=7 and FDR < 0.05. In P39 samples PMDs (partially methylated domains) were excluded from further consideration, and no PMDs were found in P0 samples. DMVs were then defined as UMRs with length ≥5 kb and mean methylation level ≤15%. To compare DMVs identified in the P39 Dnmt3a cKO and the control samples, we further grouped these DMVs into six categories, namely consistent (exact same DMV in the two conditions), overhang, cKO unique, control unique, expanded (wider in the cKO), and shrunken (wider in the control) (see illustrations in Figure 7A).

For the clustering visualization in Figure 7C, we sorted DMVs according to the following criteria: (1) whether they overlap an H3K27me3 DM region; whether they overlap an up- or down-regulated DE gene (with absolute log2(fold-change) > 0.2); mean mCG > 0.3 in P39 control, P39 cKO; P0 control, or P0 cKO; and finally by the average level of H3K27me3, H3K3me3, and H3K27ac. The ‘Fraction DM H3K27me3’ shows what fraction of the length of the DMV overlaps a DM H3K27me3 region (called using DiffBind). And we also plotted the mean mRNA logFC for all genes contained within each DMV.

DMR enrichment around CGI promoter

We used regioneR (Gel et al., 2016) (v1.14.0) to test whether two sets of genomic regions had significantly higher numbers of overlaps compared to expected by chance. We used permTest() to perform the permutation test, and used the randomizeRegions() function to generate the shuffled control for 5000 times, where the query regions were randomly placed along the genome independently while maintaining their size. The strength of the association of the two sets of regions was estimated using z-score, the distance (measured in standard deviation) between the expected overlaps in the shuffled control and the observed overlaps, and the p-value was reported. To check if the association was specifically linked to the exact position of the query regions, we used the localZscore() function with window = 5000 and step = 50, which shifted the query regions and estimated how the value of the z-score changed when moving the regions.

Other tools used in the data analysis

Browser representations were created using AnnoJ (Lister et al., 2009). All analyses were conducted in R (v3.5.0), MATLAB 2017a and Python 3. Genomic ranges manipulation was done either with bedtools (Quinlan and Hall, 2010) or GenomicRanges (Lawrence et al., 2013). To generate randomly shuffled regions used in Figure 4—figure supplement 3A and Figure 5—figure supplement 1A, we used the sub-command ‘shuffle’ in the bedtools suite, with the ‘-excl’ parameter to exclude the blacklist regions, the ‘-noOverlapping’ parameter to prevent overlapping shuffled regions, and the optional ‘-incl’ parameter to limit the shuffle regions to reside within the P39 H3K27me3 peak regions (the union of control and cKO peaks). Multiple comparison correction for p-values was performed with the Benjamini-Hochberg FDR method (Benjamini and Hochberg, 1995). Results with FDR < 0.05 were considered significant except where stated otherwise. The smoothed lines in Figure 5D and Figure 6—figure supplement 1C were fitted with a generalized additive model using the ‘gam’ function in the ‘mgcv’ R package, with formula = y ~ s(x, bs = ‘cs’).

Data access

All sequencing data are available in the Gene Expression Omnibus under accession GSE141587. A genome browser displaying the sequencing data is available at https://brainome.ucsd.edu/annoj/mm_dnmt3a_ko/.

Acknowledgements

This work was supported by R01MH112763 to MMB and JRE, and a Kavli Foundation award to MMB, APD, and SBP. JRE is an Investigator of the Howard Hughes Medical Institute. We acknowledge stimulating discussions with Drs. Huda Zoghbi and Laura Lavery. We thank the members of the Salk Biophotonics Core, Dr Uri Manor, Sammy Weiser Novak, and Dr Tong Zhang for their insightful suggestions. We also thank Joseph Chambers and Caitlin Chambers for technical assistance in animal handling, Colleen Heller for technical assistance in behavioral experiments, and Faith Zhang for her initial involvement in the morphometric analysis of dendritic spines. The Waitt Advanced Biophotonics Core Facility at the Salk Institute receives funding from NIH-NCI CCSG: P30 014195 and the Waitt Foundation. The Flow Cytometry Core Facility of the Salk Institute receives funding from NIH-NCI CCSG: P30 014195. The authors have no conflict of interest in relation to the work described here.

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Gene (Mus musculus) Dnmt3a GENCODE GENCODE:ENSMUSG00000020661 GENCODE vM10
Genetic reagent (Mus musculus) Dnmt3a-floxed PMID:10555141 Backcrossed to C57BL/6 for at least seven generations
Genetic reagent (Mus musculus) Nex-Cre; Neurod6-Cre PMID:17146780 Jackson Laboratories
Backcrossed to C57BL/6J for 10 generations
Genetic reagent (Mus musculus) INTACT PMID:26087164 (B6.129-
Gt(ROSA)26Sor
tm5(CAG-Sun1/sfGFP)Nat/MmbeJ)
Genetic reagent (Mus musculus) Dnmt3a-KO; Dnmt3a cKO This paper See ‘Materials and Methods’, section ‘Generation of the Dnmt3a cKO mice line’
Antibody Anti-NeuN (Mouse monoclonal, Clone A60) Millipore MAB377 (1:1000)
Antibody Anti-DNMT3A (Rabbit polyclonal) Abcam Ab2850 (1:250)
Antibody Anti-H3K27me3 (Rabbit polyclonal) Active Motif Cat:#39156; RRID:AB_2636821 (5 µl)
Antibody Anti-H3K27ac (Rabbit polyclonal) Active Motif Cat:#39133; RRID:AB_2561016 (5 µl)
Antibody Anti-H3K4me3 (Rabbit monoclonal) Millipore Sigma Cat:#04–745 (5 µl)
Antibody Anti-IgG (Mouse unknown clonality) Jackson ImmunoResearch Labs Cat#015-000-003; RRID:AB_2337188 (2 µl)
Sequence-based reagent Cytosine-methylated adapters Illumina AD001, AD005
Sequence-based reagent TruSeq PCR Primer Mix Illumina 20015960, 20015961
Peptide, recombinant protein T4 DNA ligase New England Biolabs M0202L
Commercial assay or kit Single-Cell RNA Purification Kit Norgen Cat:#51800
Commercial assay or kit Ovation RNA-Seq System V2 NuGEN Cat:#7102–32
Commercial assay or kit KAPA LTP Library Preparation Kit Roche Cat:#KK8232
Commercial assay or kit Qiagen DNeasy Blood and Tissue kit Qiagen Cat:#69504
Commercial assay or kit EZ methylation Direct kit Zymo Cat:#D5021
Commercial assay or kit Accel-NGS 2S Plus DNA Library Kit Swift Biosciences Cat:#21024
Chemical compound, drug Protease inhibitor Sigma-Aldrich Cat:#11836153001
Chemical compound, drug Phosphatase inhibitor Pierce Cat:#A32957
Chemical compound, drug Kapa HiFi Hotstart Readymix Kapa Biosystems 07958935001-KK2602
Software, algorithm Minianalysis Minianalysis (https://www.synaptosoft.com/MiniAnalysis/) RRID:SCR_002184
Software, algorithm pCLAMP Molecular Devices (https://www.moleculardevices.com/products/software/pclamp.html) RRID:SCR_011323
Software, algorithm Reconstruct Synapse Web (https://synapses.clm.utexas.edu/tools/reconstruct/reconstruct.stm) RRID:SCR_002716
Software, algorithm Activity Monitor MED Associates (https://www.med-associates.com/product/activity-monitor/) RRID:SCR_014296
Software, algorithm ANY-maze San Diego Instruments (https://sandiegoinstruments.com/product/any-maze/) RRID:SCR_014289
Software, algorithm FastQC Babraham Bioinformatics (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) RRID:SCR_014583 v0.11.8
Software, algorithm Trim Galore Babraham Bioinformatics (https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) RRID:SCR_011847 v0.5.0
Software, algorithm Cutadapt DOI:10.14806/ej.17.1.200 RRID:SCR_011841 v1.16
Software, algorithm STAR PMID:23104886 RRID:SCR_004463 v2.5.1b
Software, algorithm RSEM PMID:21816040 RRID:SCR_013027 v1.2.30
Software, algorithm edgeR PMID:19910308 RRID:SCR_012802 v3.24.1
Software, algorithm SQuIRE PMID:30624635
Software, algorithm clusterProfiler PMID:22455463 RRID:SCR_016884 v3.10.0
Software, algorithm GATK PMID:21478889 RRID:SCR_001876 v3.7
Software, algorithm BWA PMID:19451168 RRID:SCR_010910 v0.7.13-r1126
Software, algorithm Picard Broad (https://broadinstitute.github.io/picard/) RRID:SCR_006525 V2.10.6
Software, algorithm BEDTools PMID:20110278 RRID:SCR_006646 v2.27.1
Software, algorithm methylpy methylpy (https://github.com/yupenghe/methylpy) v1.3.2
Software, algorithm bowtie2 PMID:22388286 RRID:SCR_016368 v2.3.0
Software, algorithm MatchIt DOI:10.18637/jss.v042.i08 v3.0.2
Software, algorithm BART PMID:29608647
Software, algorithm HOMER PMID:20513432 RRID:SCR_010881 v4.8.3
Software, algorithm ENCODE Transcription Factor and Histone ChIP-Seq processing pipeline PMID:22955991 RRID:SCR_021323 v1.1.6
Software, algorithm samtools PMID:19505943 RRID:SCR_002105 v1.2
Software, algorithm SICER PMID:19505939 RRID:SCR_010843
Software, algorithm DiffBind PMID:22217937 RRID:SCR_012918 v2.10.0
Software, algorithm DESeq2 PMID:25516281 RRID:SCR_015687 v1.22.1
Software, algorithm GREAT PMID:20436461 RRID:SCR_005807 v3.0.0
Software, algorithm MethylSeekR PMID:23828043 RRID:SCR_006513 v1.22.0
Software, algorithm regioneR PMID:26424858 V1.14.0
Software, algorithm R R Project for Statistical Computing (https://www.r-project.org/) RRID:SCR_001905 v3.5.0
Software, algorithm MATLAB MathWorks (https://www.mathworks.com/products/matlab.html) RRID:SCR_001622 v2017a
Software, algorithm Python Python Programming Language (https://www.python.org/) RRID:SCR_008394 v3.x

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Joseph R Ecker, Email: ecker@salk.edu.

Eran A Mukamel, Email: emukamel@ucsd.edu.

M Margarita Behrens, Email: mbehrens@salk.edu.

Anne E West, Duke University, United States.

Lu Chen, Stanford University, United States.

Funding Information

This paper was supported by the following grants:

  • National Institute of Mental Health R01MH112763 to Joseph R Ecker, Eran A Mukamel, M Margarita Behrens.

  • Kavli Foundation to Antonio Pinto-Duarte, Susan B Powell, M Margarita Behrens.

  • Howard Hughes Medical Institute to Joseph R Ecker.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review and editing.

Data curation, Investigation, Methodology, Validation, Writing – review and editing.

Formal analysis.

Data curation, Investigation, Methodology, Validation.

Data curation, Investigation, Methodology, Validation.

Data curation, Investigation, Methodology, Resources, Validation.

Data curation, Formal analysis, Investigation, Methodology, Writing – review and editing.

Investigation, Methodology, Resources.

Investigation, Methodology.

Investigation, Methodology.

Investigation, Methodology, Validation.

Investigation, Methodology, Resources.

Resources, Supervision.

Funding acquisition, Investigation, Methodology, Resources, Supervision, Writing – review and editing.

Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review and editing.

Conceptualization, Formal analysis, Funding acquisition, Project administration, Resources, Supervision, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review and editing.

Ethics

All animal procedures were conducted in accordance with the guidelines of the American Association for the Accreditation of Laboratory Animal Care and were approved by the Salk Institute for Biological Studies Institutional Animal Care and Use Committee (Protocol number 18-00006).

Additional files

Supplementary file 1. Sequencing metrics for RNA-seq, MethylC-seq, chromatin immunoprecipitation sequencing (ChIP-seq), and genomic DNA for the Dnmt3a conditional knockout (cKO) and control samples.
elife-66909-supp1.xlsx (36.7KB, xlsx)
Supplementary file 2. Gene expression and list of differentially expressed genes in P39 Dnmt3a conditional knockout (cKO) and P39 control.
elife-66909-supp2.xlsx (4.5MB, xlsx)
Supplementary file 3. List of differentially methylated regions (DMRs).
elife-66909-supp3.xlsx (21.1MB, xlsx)
Supplementary file 4. Known transcription factor motif enrichment in P39 Dnmt3a conditional knockout (cKO) differentially methylated regions (DMRs).
elife-66909-supp4.xlsx (43.3KB, xlsx)
Supplementary file 5. List of transcription factors and chromatin regulators that bind at cis-regulatory regions of the differentially expressed genes in P39 Dnmt3a conditional knockout (cKO).
elife-66909-supp5.xlsx (93.1KB, xlsx)
Supplementary file 6. List of chromatin immunoprecipitation sequencing (ChIP-seq) peaks.
elife-66909-supp6.xlsx (13.3MB, xlsx)
Supplementary file 7. List of up-regulated H3K27me3 signal regions in P39 Dnmt3a conditional knockout (cKO).
elife-66909-supp7.xlsx (481.2KB, xlsx)
Supplementary file 8. List of enriched Gene Ontology terms for biological process in genes associated with up-regulated H3K27me3 regions in P39 Dnmt3a conditional knockout (cKO).
elife-66909-supp8.xlsx (112.7KB, xlsx)
Supplementary file 9. List of developmental regulated H3K27me3 signal regions in control pyramidal neurons.
elife-66909-supp9.xlsx (765.8KB, xlsx)
Supplementary file 10. List of DNA methylation valleys (DMVs).
elife-66909-supp10.xlsx (342.5KB, xlsx)
MDAR checklist

Data availability

All sequencing data are available in the Gene Expression Omnibus under accession GSE141587. A genome browser displaying the sequencing data is available at https://brainome.ucsd.edu/annoj/mm_dnmt3a_ko/.

The following dataset was generated:

Li J, Pinto-Duarte A, Zander M, Lai C, Osteen J, Fang L, Luo C, Lucero J, Gomez-Castanon R, Nery J, Silva-Garcia I, Pang Y, Powell SB, Ecker JR, Luo C, Mukamel EA, Behrens MM. 2022. Transcriptomic and epigenetic disruptions in excitatory neurons in Dnmt3a conditional knockout mouse. NCBI Gene Expression Omnibus. GSE141587

The following previously published dataset was used:

Lister R, Mukamel EA, Nery JR, Urich M, Puddifoot CA, Johnson ND, Lucero J, Huang Y, Dwork AJ, Schultz MD, Yu M, Tonti-Filippini J, Heyn H, Hu S, Rao A, Esteller M, He C, Haghighi FG, Sejnowski TJ, Behrens MM, Ecker JR, Wu JC. 2013. Global epigenomic reconfiguration during mammalian brain development. NCBI Gene Expression Omnibus. GSE47966

References

  1. Andrews S, Krueger F, Segonds-Pichon A, Biggins L, Krueger C, Wingett S. FastQC. FastQC. 2012 https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
  2. Bakken TE, Hodge RD, Miller JA, Yao Z, Nguyen TN, Aevermann B, Barkan E, Bertagnolli D, Casper T, Dee N, Garren E, Goldy J, Graybuck LT, Kroll M, Lasken RS, Lathia K, Parry S, Rimorin C, Scheuermann RH, Schork NJ, Shehata SI, Tieu M, Phillips JW, Bernard A, Smith KA, Zeng H, Lein ES, Tasic B. Single-nucleus and single-cell transcriptomes compared in matched cortical cell types. PLOS ONE. 2018;13:e0209648. doi: 10.1371/journal.pone.0209648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bedrosian TA, Quayle C, Novaresi N, Gage FH. Early life experience drives structural variation of neural genomes in mice. Science (New York, N.Y.) 2018;359:1395–1399. doi: 10.1126/science.aah3378. [DOI] [PubMed] [Google Scholar]
  4. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x. [DOI] [Google Scholar]
  5. Berko ER, Cho MT, Eng C, Shao Y, Sweetser DA, Waxler J, Robin NH, Brewer F, Donkervoort S, Mohassel P, Bönnemann CG, Bialer M, Moore C, Wolfe LA, Tifft CJ, Shen Y, Retterer K, Millan F, Chung WK. De novo missense variants in HECW2 are associated with neurodevelopmental delay and hypotonia. Journal of Medical Genetics. 2017;54:84–86. doi: 10.1136/jmedgenet-2016-103943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Boxer LD, Renthal W, Greben AW, Whitwam T, Silberfeld A, Stroud H, Li E, Yang MG, Kinde B, Griffith EC, Bonev B, Greenberg ME. MeCP2 Represses the Rate of Transcriptional Initiation of Highly Methylated Long Genes. Molecular Cell. 2020;77:294–309. doi: 10.1016/j.molcel.2019.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brinkman AB, Gu H, Bartels SJJ, Zhang Y, Matarese F, Simmer F, Marks H, Bock C, Gnirke A, Meissner A, Stunnenberg HG. Sequential ChIP-bisulfite sequencing enables direct genome-scale investigation of chromatin and DNA methylation cross-talk. Genome Research. 2012;22:1128–1138. doi: 10.1101/gr.133728.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Broad Institute picard-tools. 1.8.xGithub. 2018 http://broadinstitute.github.io/picard/
  9. Burger L, Gaidatzis D, Schübeler D, Stadler MB. Identification of active regulatory regions from DNA methylation data. Nucleic Acids Research. 2013;41:e155. doi: 10.1093/nar/gkt599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. C Yuen RK, Merico D, Bookman M, L Howe J, Thiruvahindrapuram B, Patel RV, Whitney J, Deflaux N, Bingham J, Wang Z, Pellecchia G, Buchanan JA, Walker S, Marshall CR, Uddin M, Zarrei M, Deneault E, D’Abate L, Chan AJS, Koyanagi S, Paton T, Pereira SL, Hoang N, Engchuan W, Higginbotham EJ, Ho K, Lamoureux S, Li W, MacDonald JR, Nalpathamkalam T, Sung WWL, Tsoi FJ, Wei J, Xu L, Tasse A-M, Kirby E, Van Etten W, Twigger S, Roberts W, Drmic I, Jilderda S, Modi BM, Kellam B, Szego M, Cytrynbaum C, Weksberg R, Zwaigenbaum L, Woodbury-Smith M, Brian J, Senman L, Iaboni A, Doyle-Thomas K, Thompson A, Chrysler C, Leef J, Savion-Lemieux T, Smith IM, Liu X, Nicolson R, Seifer V, Fedele A, Cook EH, Dager S, Estes A, Gallagher L, Malow BA, Parr JR, Spence SJ, Vorstman J, Frey BJ, Robinson JT, Strug LJ, Fernandez BA, Elsabbagh M, Carter MT, Hallmayer J, Knoppers BM, Anagnostou E, Szatmari P, Ring RH, Glazer D, Pletcher MT, Scherer SW. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nature Neuroscience. 2017;20:602–611. doi: 10.1038/nn.4524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Clemens AW, Wu DY, Moore JR, Christian DL, Zhao G, Gabel HW. MeCP2 Represses Enhancers through Chromosome Topology-Associated DNA Methylation. Molecular Cell. 2020;77:279–293. doi: 10.1016/j.molcel.2019.10.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Davis CA, Hitz BC, Sloan CA, Chan ET, Davidson JM, Gabdank I, Hilton JA, Jain K, Baymuradov UK, Narayanan AK, Onate KC, Graham K, Miyasato SR, Dreszer TR, Strattan JS, Jolanki O, Tanaka FY, Cherry JM. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Research. 2018;46:D794–D801. doi: 10.1093/nar/gkx1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics (Oxford, England) 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dodell-Feder D, Tully LM, Hooker CI. Social impairment in schizophrenia: new approaches for treating a persistent problem. Current Opinion in Psychiatry. 2015;28:236–242. doi: 10.1097/YCO.0000000000000154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dura M, Teissandier A, Armand M, Barau J, Lapoujade C, Fouchet P, Bonneville L, Schulz M, Weber M, Baudrin LG, Lameiras S, Bourc’his D. DNMT3A-dependent DNA methylation is required for spermatogonial stem cells to commit to spermatogenesis. Nature Genetics. 2022;54:469–480. doi: 10.1038/s41588-022-01040-z. [DOI] [PubMed] [Google Scholar]
  17. Elliott E, Manashirov S, Zwang R, Gil S, Tsoory M, Shemesh Y, Chen A. Dnmt3a in the Medial Prefrontal Cortex Regulates Anxiety-Like Behavior in Adult Mice. The Journal of Neuroscience. 2016;36:730–740. doi: 10.1523/JNEUROSCI.0971-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nature Methods. 2012;9:215–216. doi: 10.1038/nmeth.1906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ernst J, Kellis M. Chromatin-state discovery and genome annotation with ChromHMM. Nature Protocols. 2017;12:2478–2492. doi: 10.1038/nprot.2017.124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fagiolini M, Jensen CL, Champagne FA. Epigenetic influences on brain development and plasticity. Current Opinion in Neurobiology. 2009;19:207–212. doi: 10.1016/j.conb.2009.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Feng J, Zhou Y, Campbell SL, Le T, Li E, Sweatt JD, Silva AJ, Fan G. Dnmt1 and Dnmt3a maintain DNA methylation and regulate synaptic function in adult forebrain neurons. Nature Neuroscience. 2010;13:423–430. doi: 10.1038/nn.2514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gabel HW, Kinde B, Stroud H, Gilbert CS, Harmin DA, Kastan NR, Hemberg M, Ebert DH, Greenberg ME. Disruption of DNA-methylation-dependent long gene repression in Rett syndrome. Nature. 2015;522:89–93. doi: 10.1038/nature14319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gel B, Díez-Villanueva A, Serra E, Buschbeck M, Peinado MA, Malinverni R. regioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests. Bioinformatics (Oxford, England) 2016;32:289–291. doi: 10.1093/bioinformatics/btv562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Goebbels S, Bormuth I, Bode U, Hermanson O, Schwab MH, Nave KA. Genetic targeting of principal neurons in neocortex and hippocampus of NEX-Cre mice. Genesis (New York, N.Y. 2006;44:611–621. doi: 10.1002/dvg.20256. [DOI] [PubMed] [Google Scholar]
  26. Gresack JE, Risbrough VB, Scott CN, Coste S, Stenzel-Poore M, Geyer MA, Powell SB. Isolation rearing-induced deficits in contextual fear learning do not require CRF(2) receptors. Behavioural Brain Research. 2010;209:80–84. doi: 10.1016/j.bbr.2010.01.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Guo JU, Su Y, Shin JH, Shin J, Li H, Xie B, Zhong C, Hu S, Le T, Fan G, Zhu H, Chang Q, Gao Y, Ming G, Song H. Distribution, recognition and regulation of non-CpG methylation in the adult mammalian brain. Nature Neuroscience. 2014;17:215–222. doi: 10.1038/nn.3607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Habib A, Harris L, Pollick F, Melville C. A meta-analysis of working memory in individuals with autism spectrum disorders. PLOS ONE. 2019;14:e0216198. doi: 10.1371/journal.pone.0216198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. He Y. methylpy. swh:1:rev:0dbf4ef30d6c4d1b98f4a53c3b08b721114c9aaaGitHub. 2021 https://github.com/yupenghe/methylpy
  30. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Molecular Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Heinz S, Romanoski CE, Benner C, Glass CK. The selection and function of cell type-specific enhancers. Nature Reviews. Molecular Cell Biology. 2015;16:144–154. doi: 10.1038/nrm3949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ho DE, Imai K, King G, Stuart EA. MatchIt : Nonparametric Preprocessing for Parametric Causal Inference. Journal of Statistical Software. 2011;42:1–28. doi: 10.18637/jss.v042.i08. [DOI] [Google Scholar]
  33. Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nature Genetics. 2013;45:1198–1206. doi: 10.1038/ng.2746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Jermann P, Hoerner L, Burger L, Schübeler D. Short sequences can efficiently recruit histone H3 lysine 27 trimethylation in the absence of enhancer activity and DNA methylation. PNAS. 2014;111:E3415–E21. doi: 10.1073/pnas.1400672111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Jin wook L. ENCODE-DCC/chip-seq-pipeline2. swh:1:rev:1fa47346c9c74b1e2dc5e67dec353104ef9d3a43GitHub. 2022 https://github.com/ENCODE-DCC/chip-seq-pipeline2
  36. Johnson BS, Zhao YT, Fasolino M, Lamonica JM, Kim YJ, Georgakilas G, Wood KH, Bu D, Cui Y, Goffin D, Vahedi G, Kim TH, Zhou Z. Biotin tagging of MeCP2 in mice reveals contextual insights into the Rett syndrome transcriptome. Nature Medicine. 2017;23:1203–1214. doi: 10.1038/nm.4406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ. The UCSC Table Browser data retrieval tool. Nucleic Acids Research. 2004;32:D493–D6. doi: 10.1093/nar/gkh103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Research. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kinde B, Wu DY, Greenberg ME, Gabel HW. DNA methylation in the gene body influences MeCP2-mediated gene repression. PNAS. 2016;113:15114–15119. doi: 10.1073/pnas.1618737114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kundakovic M, Champagne FA. Early-life experience, epigenetics, and the developing brain. Neuropsychopharmacology. 2015;40:141–153. doi: 10.1038/npp.2014.140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lacar B, Linker SB, Jaeger BN, Krishnaswami SR, Barron JJ, Kelder MJE, Parylak SL, Paquola ACM, Venepally P, Novotny M, O’Connor C, Fitzpatrick C, Erwin JA, Hsu JY, Husband D, McConnell MJ, Lasken R, Gage FH. Nuclear RNA-seq of single neurons reveals molecular signatures of activation. Nature Communications. 2016;7:e11022. doi: 10.1038/ncomms11022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Lavery LA, Ure K, Wan Y-W, Luo C, Trostle AJ, Wang W, Jin H, Lopez J, Lucero J, Durham MA, Castanon R, Nery JR, Liu Z, Goodell M, Ecker JR, Behrens MM, Zoghbi HY. Losing Dnmt3a dependent methylation in inhibitory neurons impairs neural function by a mechanism impacting Rett syndrome. eLife. 2020;9:e52981. doi: 10.7554/eLife.52981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ, Prlic A. Software for Computing and Annotating Genomic Ranges. PLOS Computational Biology. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lee E, Rhim I, Lee JW, Ghim JW, Lee S, Kim E, Jung MW. Enhanced Neuronal Activity in the Medial Prefrontal Cortex during Social Approach Behavior. Journal of Neuroscience. 2016;36:6926–6936. doi: 10.1523/JNEUROSCI.0307-16.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (Oxford, England) 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 1000 Genome Project Data Processing Subgroup. Bioinformatics (Oxford, England) 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Li Y, Zheng H, Wang Q, Zhou C, Wei L, Liu X, Zhang W, Zhang Y, Du Z, Wang X, Xie W. Genome-wide analyses reveal a role of Polycomb in promoting hypomethylation of DNA methylation valleys. Genome Biology. 2018;19:18. doi: 10.1186/s13059-018-1390-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lister R, O’Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Millar AH, Ecker JR. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008;133:523–536. doi: 10.1016/j.cell.2008.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti-Filippini J, Nery JR, Lee L, Ye Z, Ngo Q-M, Edsall L, Antosiewicz-Bourget J, Stewart R, Ruotti V, Millar AH, Thomson JA, Ren B, Ecker JR. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009;462:315–322. doi: 10.1038/nature08514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lister R, Mukamel EA, Nery JR, Urich M, Puddifoot CA, Johnson ND, Lucero J, Huang Y, Dwork AJ, Schultz MD, Yu M, Tonti-Filippini J, Heyn H, Hu S, Wu JC, Rao A, Esteller M, He C, Haghighi FG, Sejnowski TJ, Behrens MM, Ecker JR. Global epigenomic reconfiguration during mammalian brain development. Science (New York, N.Y.) 2013;341:e1237905. doi: 10.1126/science.1237905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Liu XS, Wu H, Ji X, Stelzer Y, Wu X, Czauderna S, Shu J, Dadon D, Young RA, Jaenisch R. Editing DNA Methylation in the Mammalian Genome. Cell. 2016;167:233–247. doi: 10.1016/j.cell.2016.08.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Luo C, Keown CL, Kurihara L, Zhou J, He Y, Li J, Castanon R, Lucero J, Nery JR, Sandoval JP, Bui B, Sejnowski TJ, Harkins TT, Mukamel EA, Behrens MM, Ecker JR. Single-cell methylomes identify neuronal subtypes and regulatory elements in mammalian cortex. Science (New York, N.Y.) 2017;357:600–604. doi: 10.1126/science.aan3351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Lyko F. The DNA methyltransferase family: a versatile toolkit for epigenetic regulation. Nature Reviews. Genetics. 2018;19:81–92. doi: 10.1038/nrg.2017.80. [DOI] [PubMed] [Google Scholar]
  57. Lynch MD, Smith AJH, De Gobbi M, Flenley M, Hughes JR, Vernimmen D, Ayyub H, Sharpe JA, Sloane-Stanley JA, Sutherland L, Meek S, Burdon T, Gibbons RJ, Garrick D, Higgs DR. An interspecies analysis reveals a key role for unmethylated CpG dinucleotides in vertebrate Polycomb complex recruitment. The EMBO Journal. 2012;31:317–329. doi: 10.1038/emboj.2011.399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Ma H, Morey R, O’Neil RC, He Y, Daughtry B, Schultz MD, Hariharan M, Nery JR, Castanon R, Sabatini K, Thiagarajan RD, Tachibana M, Kang E, Tippner-Hedges R, Ahmed R, Gutierrez NM, Van Dyken C, Polat A, Sugawara A, Sparman M, Gokhale S, Amato P, Wolf DP, Ecker JR, Laurent LC, Mitalipov S. Abnormalities in human pluripotent cells due to reprogramming mechanisms. Nature. 2014;511:177–183. doi: 10.1038/nature13551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Malik AN, Vierbuchen T, Hemberg M, Rubin AA, Ling E, Couch CH, Stroud H, Spiegel I, Farh KKH, Harmin DA, Greenberg ME. Genome-wide identification and characterization of functional neuronal activity-dependent enhancers. Nature Neuroscience. 2014;17:1330–1339. doi: 10.1038/nn.3808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:e10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  61. Maycox PR, Kelly F, Taylor A, Bates S, Reid J, Logendra R, Barnes MR, Larminie C, Jones N, Lennon M, Davies C, Hagan JJ, Scorer CA, Angelinetta C, Akbar MT, Akbar T, Hirsch S, Mortimer AM, Barnes TRE, de Belleroche J. Analysis of gene expression in two large schizophrenia cohorts identifies multiple changes associated with nerve terminal function. Molecular Psychiatry. 2009;14:1083–1094. doi: 10.1038/mp.2009.18. [DOI] [PubMed] [Google Scholar]
  62. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. McLean CY, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, Wenger AM, Bejerano G. GREAT improves functional interpretation of cis-regulatory regions. Nature Biotechnology. 2010;28:495–501. doi: 10.1038/nbt.1630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Mo A, Mukamel EA, Davis FP, Luo C, Henry GL, Picard S, Urich MA, Nery JR, Sejnowski TJ, Lister R, Eddy SR, Ecker JR, Nathans J. Epigenomic Signatures of Neuronal Diversity in the Mammalian Brain. Neuron. 2015;86:1369–1384. doi: 10.1016/j.neuron.2015.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Mohn F, Weber M, Rebhan M, Roloff TC, Richter J, Stadler MB, Bibel M, Schübeler D. Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Molecular Cell. 2008;30:755–766. doi: 10.1016/j.molcel.2008.05.007. [DOI] [PubMed] [Google Scholar]
  66. Morris MJ, Adachi M, Na ES, Monteggia LM. Selective role for DNMT3a in learning and memory. Neurobiology of Learning and Memory. 2014;115:30–37. doi: 10.1016/j.nlm.2014.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Nguyen S, Meletis K, Fu D, Jhaveri S, Jaenisch R. Ablation of de novo DNA methyltransferase Dnmt3a in the nervous system leads to neuromuscular defects and shortened lifespan. Developmental Dynamics. 2007;236:1663–1676. doi: 10.1002/dvdy.21176. [DOI] [PubMed] [Google Scholar]
  68. Okano M, Bell DW, Haber DA, Li E. DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell. 1999;99:247–257. doi: 10.1016/s0092-8674(00)81656-6. [DOI] [PubMed] [Google Scholar]
  69. Pintacuda G, Wei G, Roustan C, Kirmizitas BA, Solcan N, Cerase A, Castello A, Mohammed S, Moindrot B, Nesterova TB, Brockdorff N. hnRNPK Recruits PCGF3/5-PRC1 to the Xist RNA B-Repeat to Establish Polycomb-Mediated Chromosomal Silencing. Molecular Cell. 2017;68:955–969. doi: 10.1016/j.molcel.2017.11.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Putignano E, Lonetti G, Cancedda L, Ratto G, Costa M, Maffei L, Pizzorusso T. Developmental downregulation of histone posttranslational modifications regulates visual cortical plasticity. Neuron. 2007;53:747–759. doi: 10.1016/j.neuron.2007.02.007. [DOI] [PubMed] [Google Scholar]
  71. Qian DC, Molfese DL, Jin JL, Titus AJ, He Y, Li Y, Vaissié M, Viswanath H, Baldwin PR, Krahe R, Salas R, Amos CI. Genome-wide imaging association study implicates functional activity and glial homeostasis of the caudate in smoking addiction. BMC Genomics. 2017;18:740. doi: 10.1186/s12864-017-4124-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics (Oxford, England) 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Reddington JP, Perricone SM, Nestor CE, Reichmann J, Youngson NA, Suzuki M, Reinhardt D, Dunican DS, Prendergast JG, Mjoseng H, Ramsahoye BH, Whitelaw E, Greally JM, Adams IR, Bickmore WA, Meehan RR. Redistribution of H3K27me3 upon DNA hypomethylation results in de-repression of Polycomb target genes. Genome Biology. 2013;14:R25. doi: 10.1186/gb-2013-14-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Risbrough V, Ji B, Hauger R, Zhou X. Generation and characterization of humanized mice carrying COMT158 Met/Val alleles. Neuropsychopharmacology. 2014;39:1823–1832. doi: 10.1038/npp.2014.29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Rizzardi LF, Hickey PF, Rodriguez DiBlasi V, Tryggvadóttir R, Callahan CM, Idrizi A, Hansen KD, Feinberg AP. Neuronal brain-region-specific DNA methylation and chromatin accessibility are associated with neuropsychiatric trait heritability. Nature Neuroscience. 2019;22:307–316. doi: 10.1038/s41593-018-0297-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics (Oxford, England) 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology. 2010;11:R25. doi: 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, Ali S, Chin SF, Palmieri C, Caldas C, Carroll JS. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature. 2012;481:389–393. doi: 10.1038/nature10730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Sanders SJ, He X, Willsey AJ, Ercan-Sencicek AG, Samocha KE, Cicek AE, Murtha MT, Bal VH, Bishop SL, Dong S, Goldberg AP, Jinlu C, Keaney JF, Klei L, Mandell JD, Moreno-De-Luca D, Poultney CS, Robinson EB, Smith L, Solli-Nowlan T, Su MY, Teran NA, Walker MF, Werling DM, Beaudet AL, Cantor RM, Fombonne E, Geschwind DH, Grice DE, Lord C, Lowe JK, Mane SM, Martin DM, Morrow EM, Talkowski ME, Sutcliffe JS, Walsh CA, Yu TW, Ledbetter DH, Martin CL, Cook EH, Buxbaum JD, Daly MJ, Devlin B, Roeder K, State MW, Autism Sequencing Consortium Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron. 2015;87:1215–1233. doi: 10.1016/j.neuron.2015.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Schlesinger Y, Straussman R, Keshet I, Farkash S, Hecht M, Zimmerman J, Eden E, Yakhini Z, Ben-Shushan E, Reubinoff BE, Bergman Y, Simon I, Cedar H. Polycomb-mediated methylation on Lys27 of histone H3 pre-marks genes for de novo methylation in cancer. Nature Genetics. 2007;39:232–236. doi: 10.1038/ng1950. [DOI] [PubMed] [Google Scholar]
  81. Schmitz RJ, Schultz MD, Urich MA, Nery JR, Pelizzola M, Libiger O, Alix A, McCosh RB, Chen H, Schork NJ, Ecker JR. Patterns of population epigenomic diversity. Nature. 2013;495:193–198. doi: 10.1038/nature11968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Schultz MD, He Y, Whitaker JW, Hariharan M, Mukamel EA, Leung D, Rajagopal N, Nery JR, Urich MA, Chen H, Lin S, Lin Y, Jung I, Schmitt AD, Selvaraj S, Ren B, Sejnowski TJ, Wang W, Ecker JR. Human body epigenome maps reveal noncanonical DNA methylation variation. Nature. 2015;523:212–216. doi: 10.1038/nature14465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Schwab MH, Bartholomae A, Heimrich B, Feldmeyer D, Druffel-Augustin S, Goebbels S, Naya FJ, Zhao S, Frotscher M, Tsai MJ, Nave KA. Neuronal basic helix-loop-helix proteins (NEX and BETA2/Neuro D) regulate terminal granule cell differentiation in the hippocampus. The Journal of Neuroscience. 2000;20:3714–3724. doi: 10.1523/JNEUROSCI.20-10-03714.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Stovner EB, Sætrom P. epic2 efficiently finds diffuse domains in ChIP-seq data. Bioinformatics (Oxford, England) 2019;35:4392–4393. doi: 10.1093/bioinformatics/btz232. [DOI] [PubMed] [Google Scholar]
  85. Stroud H, Su SC, Hrvatin S, Greben AW, Renthal W, Boxer LD, Nagy MA, Hochbaum DR, Kinde B, Gabel HW, Greenberg ME. Early-Life Gene Expression in Neurons Modulates Lasting Epigenetic States. Cell. 2017;171:1151–1164. doi: 10.1016/j.cell.2017.09.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Stroud H, Yang MG, Tsitohay YN, Davis CP, Sherman MA, Hrvatin S, Ling E, Greenberg ME. An Activity-Mediated Transition in Transcription in Early Postnatal Neurons. Neuron. 2020;107:874–890. doi: 10.1016/j.neuron.2020.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Thompson CL, Ng L, Menon V, Martinez S, Lee CK, Glattfelder K, Sunkin SM, Henry A, Lau C, Dang C, Garcia-Lopez R, Martinez-Ferre A, Pombero A, Rubenstein JLR, Wakeman WB, Hohmann J, Dee N, Sodt AJ, Young R, Smith K, Nguyen TN, Kidney J, Kuan L, Jeromin A, Kaykas A, Miller J, Page D, Orta G, Bernard A, Riley Z, Smith S, Wohnoutka P, Hawrylycz MJ, Puelles L, Jones AR. A high-resolution spatiotemporal atlas of gene expression of the developing mouse brain. Neuron. 2014;83:309–323. doi: 10.1016/j.neuron.2014.05.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Urich MA, Nery JR, Lister R, Schmitz RJ, Ecker JR. MethylC-seq library preparation for base-resolution whole-genome bisulfite sequencing. Nature Protocols. 2015;10:475–483. doi: 10.1038/nprot.2014.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, Jordan T, Shakir K, Roazen D, Thibault J, Banks E, Garimella KV, Altshuler D, Gabriel S, DePristo MA. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Current Protocols in Bioinformatics. 2013;43:11. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. van Hulzen KJE, Scholz CJ, Franke B, Ripke S, Klein M, McQuillin A, Sonuga-Barke EJ, PGC ADHD Working Group. Kelsoe JR, Landén M, Andreassen OA, PGC Bipolar Disorder Working Group. Lesch K-P, Weber H, Faraone SV, Arias-Vasquez A, Reif A. Genetic Overlap Between Attention-Deficit/Hyperactivity Disorder and Bipolar Disorder: Evidence From Genome-wide Association Study Meta-analysis. Biological Psychiatry. 2017;82:634–641. doi: 10.1016/j.biopsych.2016.08.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wang S, Zang C, Xiao T, Fan J, Mei S, Qin Q, Wu Q, Li X, Xu K, He HH, Brown M, Meyer CA, Liu XS. Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles. Genome Research. 2016;26:1417–1429. doi: 10.1101/gr.201574.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Wang Z, Civelek M, Miller CL, Sheffield NC, Guertin MJ, Zang C. BART: a transcription factor prediction tool with query gene sets or epigenomic profiles. Bioinformatics (Oxford, England) 2018;34:2867–2869. doi: 10.1093/bioinformatics/bty194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Widschwendter M, Fiegl H, Egle D, Mueller-Holzner E, Spizzo G, Marth C, Weisenberger DJ, Campan M, Young J, Jacobs I, Laird PW. Epigenetic stem cell signature in cancer. Nature Genetics. 2007;39:157–158. doi: 10.1038/ng1941. [DOI] [PubMed] [Google Scholar]
  94. Wu H, Coskun V, Tao J, Xie W, Ge W, Yoshikawa K, Li E, Zhang Y, Sun YE. Dnmt3a-dependent nonpromoter DNA methylation facilitates transcription of neurogenic genes. Science (New York, N.Y.) 2010;329:444–448. doi: 10.1126/science.1190485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Xie W, Schultz MD, Lister R, Hou Z, Rajagopal N, Ray P, Whitaker JW, Tian S, Hawkins RD, Leung D, Yang H, Wang T, Lee AY, Swanson SA, Zhang J, Zhu Y, Kim A, Nery JR, Urich MA, Kuan S, Yen C, Klugman S, Yu P, Suknuntha K, Propson NE, Chen H, Edsall LE, Wagner U, Li Y, Ye Z, Kulkarni A, Xuan Z, Chung W-Y, Chi NC, Antosiewicz-Bourget JE, Slukvin I, Stewart R, Zhang MQ, Wang W, Thomson JA, Ecker JR, Ren B. Epigenomic analysis of multilineage differentiation of human embryonic stem cells. Cell. 2013;153:1134–1148. doi: 10.1016/j.cell.2013.04.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Yang ST, Shi Y, Wang Q, Peng JY, Li BM. Neuronal representation of working memory in the medial prefrontal cortex of rats. Molecular Brain. 2014;7:61. doi: 10.1186/s13041-014-0061-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Yang WR, Ardeljan D, Pacyna CN, Payer LM, Burns KH. SQuIRE reveals locus-specific regulation of interspersed repeat expression. Nucleic Acids Research. 2019;47:e27. doi: 10.1093/nar/gky1301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Zang C, Schones DE, Zeng C, Cui K, Zhao K, Peng W. A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics (Oxford, England) 2009;25:1952–1958. doi: 10.1093/bioinformatics/btp340. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Anne E West 1

In this manuscript the authors conditionally knock out the DNA methyltransferase Dnmt3a in developing excitatory cortical neurons to determine the consequences for chromatin regulation, gene expression, and neuron function. As expected they find widespread loss of DNA methylation but also an increase in histone methylation (H3K27me3) at many similar regions of the genome, which they propose may be a mechanism of functional compensation. Overall this study offers new insights into the gene regulatory and neuronal cellular functions of an important chromatin regulatory process.

Decision letter

Editor: Anne E West1
Reviewed by: Alvaro Rada-Iglesias2

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "Dnmt3a knockout impairs synapse maturation and is partly compensated by repressive histone modification H3K27me3" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by a Reviewing Editor and Lu Chen as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Alvaro Rada-Iglesias (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

1) The reviewers all felt that claims regarding the functional compensation by H3K27me3 for the loss of Dnmt3a were too strongly stated. The reviewers discussed this point extensively and decided that both the title and the sections of the paper that describe and discuss these data should be changed to reflect that the replacement of the marks has been observed but that the functional relevance of this change has not been directly tested. The reviewers appreciate that functional testing of this hypothesis is well beyond the scope of the current manuscript – only text revisions are required.

2) The reviewers strongly suggest (but do not absolutely require) that the authors perform bioinformatic analysis of the H3K27me3 compensation against gene expression in the Dnmt3a cKO in order to provide experimental evidence for the claim of functional compensation. One of the reviewers also offered a very thorough list of other possible analyses (especially regarding studies of enhancer marks) that would strengthen the study, however these are entirely at the discretion of the authors to consider.

3) The reviewers have described below specific examples where addition of further technical details would strengthen the manuscript and we ask the authors to expand on these experimental points in the text to help the readers evaluate the study. In particular all of the reviewers expressed some concern about the use of only two replicates, and we had a long discussion of whether a minimum of three should be required. It was not entirely clear in every case whether each replicate was obtained from only a single mouse or whether the samples might be pooled (which would strengthen the argument for using only 2). In either case, in the end we felt the manuscript was strong enough that we did not feel it was necessary for the authors to add more replicates, but we would like the authors to address in the text any limitations of the study that readers may need to keep in mind with the use of duplicates as opposed to larger replicate numbers.

Reviewer #1 (Recommendations for the authors):

1) Because of the way the authors present the examples and the comparisons, it is unclear if the compensatory upregulation of H3K27me3 occurs de novo, or whether regions that are previously methylated with H3K27me3 are the regions where this compensation occurs. The latter could suggest this might be a consequence of having more substrate for histone methylation – and may not actually serve any function. This is particularly important to delineate as there are several other DMV clusters that are resistant to changes in gene expression that do not have this compensatory upregulation of H3K27me3 – like clusters 4, 5 and 6 in Figure 8.

2) It would be more narratively and biologically significant if the authors could connect their data analysis and observations about compensatory H3K27me3 upregulation and changes to gene expression in the context of the developing cortical neurons, or the behavioral phenotypes observed in the introductory figures.

3) The inferences made in Figure 8 are a little inconsistent with those described earlier. For example, in Figure 5A the authors describe the overrepresentation of predicted PRC2-TF binding motifs at upregulated and downregulated genes during Dnmt3a-cko, while later discussions move toward describing PRC2-mediated regulation as the reason for resistance to changes in gene expression due to the cko. Additionally, H3K27me3 peaks in Figure 5B do not appear to be all that different between the control and cko mouse at Mab21l2 – diluting the punchline of the figure.

4) Figure 8 is difficult to read.

Reviewer #2 (Recommendations for the authors):

As described in the public comments, this study provides interesting new insights into organismal and cellular consequences of DNMT3A conditional knockout in excitatory neurons and presents novel effects on H3K27me3 in these mice. I have several concerns I think would be important to address before publication in eLife however. Here I elaborate in specific detail:

1. The analysis and core conclusions about changes in histone modifications in the study are based on n=2 experiments. While it is potentially beyond the scope of a revision to repeat additional replicates for all analyses that are carried out, it would substantially strengthen the study to perform additional replicates of H3K27me3 ChIP in order to rigorously show that the core finding regarding H3K27me3 changes is robust to sample-to-sample technical variability. This is particularly important given the subtle magnitude of changes that are described.

2. Though the authors provide strong characterization of developmental changes of H3K27me3 and show changes in this mark in response to loss of DNMT3A, their analyses fall short of definitively showing that this is a true functionally compensatory effect. Ideally, an experimental analysis would be done to test this by blocking H3K27me3 build up in DNMT3A mutants and determining if de-compensated gene expression changes occur. Given that this is outside the scope of a revision however, additional computational approaches could be employed to better test the hypothesis:

a. Figures 6E and S10A present the analysis that is most relevant to whether accumulation of K27me3 in the cKO impacts DEG and therefore may play a compensatory role. However, additional analysis could more decisively test the hypothesis. For example, do non-dysregulated genes containing DMRs show less K27me3 accumulation than down-regulated genes, but more K27me3 accumulation than upregulated genes? The authors could include a boxplot of a well-matched set of non-DE genes with overlapping DMRs in panel 6E to test this. A more powerful analysis, however, might be to examine a large set of genes from the genome that are selected to contain similar numbers of DMRs (in order to control for relative potential effects of demethylation) and ask if changes in K27me3 correspond with up-regulation (low K27me3 accumulation), no dysregulation (intermediate accumulation) or downregulation (high K27me3 accumulation).

b. On p19 the authors state that "P39 cKO DMRs were significantly enriched in peaks and DM regions of H3K27me3 (28.1% overlapped H3K27me3 peaks in cKO, p<1e-100) (Figure 6B)." However, 6B only presents the overlap with peaks in each genotype as an odds ratio and doesn't make it clear how the enrichment of DMRs in DM regions was assessed. This appears to be assessed to some degree in S8F (note the S8 figure legend appears to be wrong for this panel), but no analysis is presented to control for the general tendency of DMRs and H3K27me3 peaks to overlap. It seems imperative, given that DMRs already overlap with H3K27me3 peaks, that a resampling approach is used comparing DM peaks to non-DM K27me3 peaks that are matched to have similar wild-type H3K27me3 signal and size as DM peaks. Likewise, what would the changes in H3K27me3 signal presented in 6C look like at non-DMR regions that contain similar baseline wild-type H3K27me3 signal? Without these analyses it is not clear that DMRs are particularly unique in showing increased K27me3 signal.

c. The increase in H3K27me3 in the cKO at the mab21l2 locus at P0 (figure 5B), before methylation has or has not been established, suggests that H3K27me3 may change before methylation differences, rather than in reaction to changes in methylation. Is this a common effect seen across other genes? How does this impact the idea that compensation occurs due to lack of methylation? Related to this issue, on p.19 no specific plot is referenced to support the statement that "There was no corresponding increase of H3K27me3 at these DMRs in newborn (P0) or fetal (E14) neurons." A quantitative analysis as shown in 6C but for the E14 and P0 time-points could assess this directly.

d. Why were developmental DMs called between E14 and P39 while DMRs were called between P0 and P39? Developmental comparisons would be more justified if both used P0 and P39.

e. In the absence of additional data supporting H3K27me3 accumulation as a functional compensation mechanism for loss of DNA methylation, the title of Figure 6 should not so strongly claim that H3K27me3 changes compensate for mC loss.

3. While initially assessed by the authors, mCH changes are not integrated into the analysis of H3K27me3 changes. Previous studies by this group and others have suggested the importance of mCH in gene regulation within specific neuron populations and, given the interest to the field, more thorough analysis is warranted regarding this neuronal methylation and H3K27me3.

a. Given the association, even if limited, between differential mCH loss and gene dysregulation in the cKO, it is relevant to examine how changes in K27me3 relate to, or compensate for, loss of mCH. For example, do genes that are not dysregulated in the cKO, and normally have high mCH (which is lost in the cKO) show higher increases in K27me3 compared to non-dysregulated genes that have low mCH (and therefore lose less mCH)? Reciprocally, one could ask if a population of genes selected to have similar, high mCH loss in the cKO, show differential gene dysregulation as a function of differential K27me3 accumulation.

b. What are the changes in mCH at DM H3K27me3 sites? Since mCG and mCH loss are correlated, one might expect that these are sites that normally also contain high mCH. It might not be feasible to call mCH DMRs given the technical limitations, but quantification of this signal can be done at DMs and DMRs, and the potential for it to contribute to these effects could be discussed.

4. The authors present evidence of PRC2 binding at DEGs (Figure 5A) as a motivation to assess H3K27me3 in their mice. However, they then observe changes in H3K27me3 that they contend do not occur under normal conditions and propose that these changes are an underlying mechanism that blocks gene dysregulation in these mice. How are we to reconcile this original rationale of a normal function for PRC2 at dysregulated genes with the model of de novo compensatory H3K27me3 accumulation? Some explanation by the authors of how these separate notions may relate to one another would be helpful.

5. Some effects on H3K27ac are observed at DMRs, but the authors ignore the prospect that these effects could drive changes in gene expression. Previous groups have shown significant changes in H3K27ac in DNMT3A models, and some of the data the authors present suggest that this could also explain some of the observations in their model.

a. While DM K27ac peaks are not identified in the cKO, in figure 6B there is a similar enrichment of DMRs to associate with K27ac peaks in the cKO vs control as there is for K27me3. In addition, there is a similar increase in odds ratio of enrichment of K27ac peaks to the of K27me3 at DMRs for the cKO in figure 6B. The analysis in S8F-G that is referenced when referring to K27ac is either missing (S8G) or is not showing data for K27ac (S8F). The authors should at least comment in the Results section that these signals exist and may be significant for gene regulation.

b. Are two biological replicates sensitive enough to call differences in H3K27ac? In the context of a small number of replicates and the subtle magnitude of changes in gene expression that are observed, negative results for differential peaks should be interpreted with caution. In contrast, the larger peaks observed in K27me3 may have led to increased ability to call differential peaks.

6. Though the characterization of DNMT3A cKO in excitatory neurons is novel, if the authors are claiming that these neurons specifically are driving behavioral phenotypes then further contextualization and characterization of the extent of conditional deletion is needed.

a. It would be useful to justify the rationale for investigating excitatory neurons, and how using this class of neurons can uncover novel findings. Existing studies of DNMT3A cKO have shown behavioral and epigenomic effects in pan-neuronal and inhibitory neuron conditional knockouts. Comparisons between these models and discussion of differences could offer strong insights into what unique circuits may be most disrupted by DNMT3A cKO in excitatory neurons.

b. If the authors are claiming that behaviors and phenotypes are the result of specifically knocking out DNMT3A in all excitatory neurons, further evidence should be provided that the Cre-line is being expressed sufficiently and specifically in excitatory neurons, and that their genomic data is faithfully capturing the majority of excitatory cells while excluding non-excitatory cells.

7. The manuscript would be further improved by clarification of some methods and statistics, and modifications in data presentation:

a. The authors should provide information on the statistics performed on the behavior data. Are there significant differences between sexes, or significant sex by genotype interactions for these behavioral tasks? A two-way ANOVA using genotype and sex would be useful in understanding the impact of these variables. There appears to be many effects that are sex-specific, yet this appears to be overlooked in the text. Furthermore, if there are robust sex-specific effects then the subsequent molecular and genomic analyses should be done using both sexes, and comparisons could be done to explain the molecular underpinnings of these differences.

b. The authors might also consider showing preference index when analyzing social approach data when comparing genotypes.

c. What is the background set of genes for 2B-C? Synaptic genes are enriched in both up- and down-regulated gene lists. How is gene expression normalized? Is there a discovery bias for more highly expressed genes?

d. What value was used to identify equally expressed unchanged genes in figure 4B, TPM? Identifying equally expressed genes can be complicated for nuclear RNA where unprocessed RNAs contribute differentially to total counts based on the length of the gene. In addition, previous analysis of mCH enrichment at dysregulated genes in inhibitory neuron-specific KO of DNMT3A also found DEGs to be differentially methylated based on direction of change but showed that all other genes were more lowly methylated than these two groups (Lavery et al. 2020). Here the authors see the same relationship between DEG groups but find other genes to be higher in mCH rather than lower. Do the authors have any perspective on why these differences are observed?

e. On p22 the authors state "We found that developmental peaks (DevLoss and DevGain) were relatively unaffected by the cKO(H3K27me3 = 0.11, -0.20 respectively, in units of log10(CPM+1)), whereas Group cKO had ten-fold larger mean effect (H3K27me3 = 1.12 respectively)." Given the y scales in figures 7E and S10H it is unclear how they derived a ∆ value of 1.12 on a log 10 scale for Group cKO (e.g. in S10H the median values look much less 1 unit different on a log10 scale). Please clarify the actual magnitude of effect sizes.

f. The authors describe the association between gene body mCH and gene expression changes as only accounting for 1% of the variance. It would be useful to have a point of reference to understand if this is a low value or a high value in comparison to other potential predictors of gene expression changes. For example, how much does the proposed overcompensation by H3K27me3 accumulation contribute to the genome-wide variance in gene expression? In addition, is it the case that variance in the measured fold-changes will be mitigated by addition of more replicates to more accurately estimate true fold-changes? If this is the case, it should be noted.

g. Using different color schemes would be helpful in 8A and 8B to avoid confusion.

h. How was clustering analysis done in 8D? What do the values for K27me3 Diff. Mod and DEG enrichment indicate (i.e. what does the scale from 0 to 0.02 represent)?

Reviewer #3 (Recommendations for the authors):

– The authors delete exon 19 of Dnmt3a. I think they should provide a more thorough description of what this deletion is expected to cause: loss of function due to premature stop codon, loss of enzymatic activity, etc. Moreover, it would be desirable to show by immunostaining whether DNMT3A protein is specifically lost in excitatory neurons.

– Regarding the DMRs identified in DNMT3A KO neurons, the authors should use the generated profiles for H3K27ac in P0 and P39 WT neurons to determine which DMRs overlap enhancers that are active in any of these two neuronal stages. This is more relevant than overlaps with enhancers that could be active in any given cell type. Most importantly, the authors should evaluate the impact the loss of DNA methylation can have on enhancer activity by measuring H3K27ac levels and eRNAs in DNMT3A KO neurons. Measuring eRNA levels can be particularly relevant, as recent evidences suggest that is a better marker of enhancer activity than H3K27ac.

– The authors should aim at performing experiments that address the proposed compensatory role for PRC2 and H3K27me3. I understand that generating a KO for PRC2 could be rather complicated, but perhaps the authors could consider using new drugs that act as very specific inhibitors against the enzymatic activity of PRC2.

eLife. 2022 May 23;11:e66909. doi: 10.7554/eLife.66909.sa2

Author response


Essential revisions:

1) The reviewers all felt that claims regarding the functional compensation by H3K27me3 for the loss of Dnmt3a were too strongly stated. The reviewers discussed this point extensively and decided that both the title and the sections of the paper that describe and discuss these data should be changed to reflect that the replacement of the marks has been observed but that the functional relevance of this change has not been directly tested. The reviewers appreciate that functional testing of this hypothesis is well beyond the scope of the current manuscript – only text revisions are required.

We agree with the Reviewers that it is important to clearly define the conclusions we can draw from our data, as well as their limitations. Indeed, our data demonstrate that loss of Dnmt3a-dependent gain of methylation during the perinatal period leads to specific changes in H3K27me3, but they do not fully clarify the functional significance of these processes or show that Polycomb associated marks fully compensate for reduced DNA methylation.

To clarify this, we have revised the title of the paper: Dnmt3a knockout in excitatory neurons impairs postnatal synapse maturation and increases the repressive histone modification, H3K27me3

We have further removed the following phrase from the Abstract: “partially compensating for the loss of DNA methylation.” We have limited to the Discussion section our comments about a potential compensatory role for the gain of H3K27me3.

2) The reviewers strongly suggest (but do not absolutely require) that the authors perform bioinformatic analysis of the H3K27me3 compensation against gene expression in the Dnmt3a cKO in order to provide experimental evidence for the claim of functional compensation. One of the reviewers also offered a very thorough list of other possible analyses (especially regarding studies of enhancer marks) that would strengthen the study, however these are entirely at the discretion of the authors to consider.

We appreciate the suggestions from the reviewers. We have added new analyses to further support our claim, including comparisons of the changes in H3K27me3 in different groups of genes (shown in Figure 5E and Figure 5 — figure supplement 2A). These are fully described below, in the response to Reviewer #2 (comment 2a).

3) The reviewers have described below specific examples where addition of further technical details would strengthen the manuscript and we ask the authors to expand on these experimental points in the text to help the readers evaluate the study. In particular all of the reviewers expressed some concern about the use of only two replicates, and we had a long discussion of whether a minimum of three should be required. It was not entirely clear in every case whether each replicate was obtained from only a single mouse or whether the samples might be pooled (which would strengthen the argument for using only 2). In either case, in the end we felt the manuscript was strong enough that we did not feel it was necessary for the authors to add more replicates, but we would like the authors to address in the text any limitations of the study that readers may need to keep in mind with the use of duplicates as opposed to larger replicate numbers.

We appreciate this concern, and we agree that it is important to show that findings are replicable across multiple independent samples.

When tracing back the animal IDs to create a table of the number of animals included in each sample, we discovered an error in the dataset. Although we intended to use independent pools of mice for each biological replicate, we found that due to an experimental tracking error some of the replicates contained tissue from overlapping pools of animals. In particular, the P39 Dnmt3a cKO group (pooled from 3 mice) was split into two technical replicates but mislabeled as two biological replicates and used as such in the RNA-seq and MethylC-seq analyses. For this reason, for this resubmission we decided to repeat our experiments and to generate new RNA-seq data and MethylC-seq data for both P39 Dnmt3a cKO and control mice. Each condition now has two independent, newly-generated biological replicates, each pooled from two mice. We have added the information about the number of animals contributing tissue to each sample (including P0 MethylC-seq data and all ChIP-seq data which are not re-sequenced) in the “num_pooled_animals” column in Supplementary File 1. Most of our sequencing data come from tissue samples pooled from two mice, with the exception of P0 MethylC-seq data. For that time point, we used 6 control and 2 cKO samples that each came from a single mouse.

To show that the new dataset largely agrees with the previously collected data, we plotted the correlation across replicates from the two batches computed using mCG and mCH in 10kb genomic bins for MethylC-seq data, and gene expression for RNA-seq data (see Author response image 1). The molecular profiles are largely consistent across the two batches (Spearman correlation > 0.80; note that mCH in Dnmt3a cKO samples are essentially absent, so those samples were excluded from the mCH analysis). Moreover, the consistency across biological replicates in the new batch is also high (r > 0.93). We have also added plots in the manuscript to demonstrate the consistency across the replicates for RNA-seq (Figure 3 — figure supplement 1A), MethylC-seq (Figure 3 — figure supplement 3A), and ChIP-seq (Figure 6 — figure supplement 2).

Author response image 1. Heatmap to show the Spearman correlations across the control and Dnmt3a cKO samples from the two batches of data (new vs old). The correlations coefficients were computed using CG methylation levels in 10Kb genomic bins (left), CH methylation levels in 10Kb genomic bins (middle), and gene expression (log10TPM, right).

Author response image 1.

As is often the case for RNA-seq data, we found differences between the new and old batches (i.e. batch effects) that cannot be ignored (see Author response image 2A). Due to the mislabeling of technical replicates as biological replicates in our previous analysis, our original submission likely had inflated statistical significance for detecting differentially expressed genes. Using the new batch of RNA-seq data that includes true biological replicates, we detected fewer significant DE genes under the same FDR < 0.05 criteria (70 genes compared to the previous 1720 genes). However, the fold-changes of these DE genes are in general concordant across the two batches (see Author response image 2B). This new dataset and analysis replicate and validate the disruption of transcriptome we observed earlier. The smaller number of differentially expressed genes we now report is consistent with the fact that statistical significance was inflated in our previous analysis by the non-independence of the replicate samples.

Author response image 2. (A) PCA plot using log10TPM gene expression in the two batches of RNA-seq samples. (B) Scatter to show the consistency of gene expression fold-changes (Dnmt3a cKO vs. control) across the two batches of RNA-seq samples using significant DE genes detected in the new batch (left) and significant DE genes detected in the old batch (right).

Author response image 2.

Confidence in our differential expression findings is also supported by the strong correspondence we now find between genes that lose mCH and those that increase mRNA expression (Figure 3D-E). This gives additional confidence that the differential expression analysis identifies reliable and consistent effects in the Dnmt3a cKO.

We apologize for the confusion caused by the replacement of these new sequencing datasets. We hope that we have demonstrated that the new data largely agrees with the old, and that most of our conclusions are still supported by the new data albeit with reduced statistical power. In the revised manuscript, all results generated using P39 RNA-seq and MethylC-seq data were updated with the new datasets. Nevertheless, some analysis is no longer feasible due to fewer detected DE genes, and we have modified the manuscript accordingly. In the following responses, we address each reviewers’ specific concerns.

Reviewer #1 (Recommendations for the authors):

1) Because of the way the authors present the examples and the comparisons, it is unclear if the compensatory upregulation of H3K27me3 occurs de novo, or whether regions that are previously methylated with H3K27me3 are the regions where this compensation occurs. The latter could suggest this might be a consequence of having more substrate for histone methylation – and may not actually serve any function. This is particularly important to delineate as there are several other DMV clusters that are resistant to changes in gene expression that do not have this compensatory upregulation of H3K27me3 – like clusters 4, 5 and 6 in Figure 8.

We appreciate the reviewer’s suggestion and apologize for the confusion. To address this question, we have added the following analysis (Figure 4 — figure supplement 3A) to compare the baseline H3K27me3 levels in the differentially modified regions with their shuffle controls:

Here two types of random shuffle controls were used. In the first shuffle control (the middle two violins), we randomly shuffled the H3K27me3 DM regions in the whole genome, keeping the peak sizes and the chromosomes where the peaks reside in same as the original peaks. In the second set (the rightmost two violins) we required the same constraints as the first set but further restricted the shuffles must be within the H3K27me3 peaks (the union of control and cKO peaks). We can see that the regions with up-regulation of H3K27me3 (the leftmost two violins), had an intermediate level of H3K27me3 (median ~8 RPKM) in the control samples to begin with, which is higher than the level in random shuffles in the genome but lower than the level in random shuffles in H3K27me3 peaks. Therefore, these regions with upregulation of H3K27me3 after Dnmt3a cKO include regions that were previously methylated with H3K27me3 in the control samples, and we presume such a medium but non-zero level of H3K27me3 is fine-tunable and hence can act as a “failsafe” triggered after the original repression mechanism, DNA methylation, was disrupted. Of course, it is important to note that our ChIP-seq data come from a heterogeneous pool of pyramidal neurons, and it is possible that the quantitative increase in the H3K27me3 level corresponds to the genuine gain of de novo histone methylation in a particular subpopulation of excitatory cells.

We can also reach a similar conclusion in our updated DMV analysis (now in Figure 7C). C1 clusters are DMVs that highly overlap with the H3K27me3 DM regions, and they have an intermediate level of H3K27me3 (~8 RPKM) that is not as high as those in cluster 6 (~20 RPKM) but still higher than those in clusters 5, 7 and 8 (~0 RPKM). As a comparison, regions in cluster 6 were under tight regulation with high levels of H3K27me3 and a low level of DNA methylation (~10% mCG); genes within these DMVs were mostly not expressed. Therefore, losing the small amount of DNA methylation in these regions did not alter gene expression. As for the regions in clusters 5, 7 and 8, they are active with both low DNA methylation and H3K27me3, and hence they are not affected by the cKO.

We have added these results to the Result section of the manuscript:

“These DM regions have a medium but non-zero level of H3K27me3 in the P39 control (higher than random shuffles across the whole genome but lower than random shuffles within the peak regions, Figure 4 — figure supplement 3A), and hence presumably fine-tunable after Dnmt3a cKO.”

2) It would be more narratively and biologically significant if the authors could connect their data analysis and observations about compensatory H3K27me3 upregulation and changes to gene expression in the context of the developing cortical neurons, or the behavioral phenotypes observed in the introductory figures.

We agree that linking the effects of the cKO on adult neurons with developmental patterns of epigenetic regulation would be valuable. Our study includes developmental ChIP-seq profiles of H3K27me3, including three time points (E14, P0 and P39; see Figure 5B-C). These data show that the majority of the changes in H3K27me3 emerge in the adult neurons (P39) and are not present at earlier developmental time points (Figure 4 — figure supplement 1D). Moreover, we related the patterns of regulations at DMVs with their developmental history of H3K27me3 (Figure 7).

We have now added a new analysis to address the relationship between upregulated H3K27me3 signal and gene expression changes at P39 (see Figure 5 — figure supplement 2C and also our replies to reviewer #2, comment 3a).

Unfortunately, we did not profile gene expression after cKO in development so we can’t examine these observations in the context of development. Although we agree that this would be of interest, we believe that this question would be better addressed using single-cell transcriptomics (scRNA-seq) to disentangle the contribution from diverse subtypes of excitatory pyramidal neurons. We, therefore, plan to apply single-cell methods in the future to address this question.

3) The inferences made in Figure 8 are a little inconsistent with those described earlier. For example, in Figure 5A the authors describe the overrepresentation of predicted PRC2-TF binding motifs at upregulated and downregulated genes during Dnmt3a-cko, while later discussions move toward describing PRC2-mediated regulation as the reason for resistance to changes in gene expression due to the cko.

The BART results for up-regulated and down-regulated DE genes (now in Figure 4A) are bioinformatics predictions that motivated us to look into Polycomb repression via H3K27me3 profiling. With this rationale, we then performed the ChIP-seq analysis of H3K27me3 in the control and cKO neurons. We agree with the reviewer that there is not a clear explanation for the enrichment of PRC2 related sequence motifs at both up- and down-regulated genes. In light of this, and in response to other reviewer comments, we have significantly softened our interpretation of the direct link (“compensation”) between loss of DNA methylation and increased H3K27me3. Instead, we now report that increased H3K27me3 occurs preferentially in regions that lose DNA methylation, but we do not draw a causal link or interpret these data in terms of direct functional compensation.

Additionally, H3K27me3 peaks in Figure 5B do not appear to be all that different between the control and cko mouse at Mab21l2 – diluting the punchline of the figure.

Regarding the size of the difference in H3K27me3 at the Mab21l2 locus, we appreciate that it can be hard to judge the magnitude of the differential enrichment of the chromatin modification (due to the normalized scale to make it comparable across E14, P0, and P39) in the browser view (now in Figure 4B). The quantitative analysis of differential binding at this locus showed that it was highly significant (tested with DEseq2, fold-change = 2.41, FDR=1.33e-4). We have added a shaded box in Figure 4B to highlight the H3K27me3 signal differences between the P39 control and cKO mouse at the Mab21l2 locus, and also a new plot in Figure 4C to show the quantification of the increase in H3K27me3 ChIP-seq signal in each replicate at this locus.

We have also added analysis to show the increases of H3K27me3 were also enriched in non-DE genes with the largest magnitude of mCH (see Figure 5 — figure supplement 2B and also our replies to reviewer #2, comment 3a).

4) Figure 8 is difficult to read.

We have re-organized the figure (now Figure 7) to make it easier to read.

Reviewer #2 (Recommendations for the authors):

As described in the public comments, this study provides interesting new insights into organismal and cellular consequences of DNMT3A conditional knockout in excitatory neurons and presents novel effects on H3K27me3 in these mice. I have several concerns I think would be important to address before publication in eLife however. Here I elaborate in specific detail:

1. The analysis and core conclusions about changes in histone modifications in the study are based on n=2 experiments. While it is potentially beyond the scope of a revision to repeat additional replicates for all analyses that are carried out, it would substantially strengthen the study to perform additional replicates of H3K27me3 ChIP in order to rigorously show that the core finding regarding H3K27me3 changes is robust to sample-to-sample technical variability. This is particularly important given the subtle magnitude of changes that are described.

We understand the reviewer’s concern regarding the limited number of biological replicates. To reduce variability due to individual differences, each of our ChIP-seq samples included pooled tissue from 2 mice. This information is now included in the

“num_pooled_animals” column in Supplementary File 1 and explained in the main text:

“To experimentally address this, we performed chromatin immunoprecipitation sequencing (ChIP-seq) in excitatory neurons at embryonic day 14 (E14) and postnatal days 0 and 39 to measure trimethylation of histone H3 lysine 27 (H3K27me3), a repressive mark whose deposition is catalyzed by PRC2 and is important for transcriptional silencing of developmental genes. In P39 neurons, we also measured two histone modifications associated with active chromatin: H3K4me3 (trimethylation of histone H3 lysine 4, associated with promoters) and H3K27ac (acetylation of histone H3 lysine 27, associated with active promoters and enhancers) (Heinz et al., 2015). For each mark, we performed sequencing on two independent samples, each of which used pooled tissue from two mice.”

We have added additional analyses showing the consistency of the ChIP-seq signal in genomic bins and peak regions across biological replicates. These results are now shown in Figure 6 — figure supplement 2 and we have added descriptions of these results in the manuscript as follows:

“We also examined differentially modified (DM) regions at P0 vs. E14 and P0 vs. P39 (Figure 6 — figure supplement 1A). However, due to greater biological variability at the perinatal time point, the two replicate ChIP-seq datasets at P0 were less consistent than the E14 and P39 samples (Figure 6 — figure supplement 2). As a result, our data were not well powered to detect changes at P0 and we chose to focus on the E14 vs. P39 DM regions as sites of developmental chromatin remodeling.”

2. Though the authors provide strong characterization of developmental changes of H3K27me3 and show changes in this mark in response to loss of DNMT3A, their analyses fall short of definitively showing that this is a true functionally compensatory effect. Ideally, an experimental analysis would be done to test this by blocking H3K27me3 build up in DNMT3A mutants and determining if de-compensated gene expression changes occur. Given that this is outside the scope of a revision however, additional computational approaches could be employed to better test the hypothesis:

We agree that additional experiments that block H3K27me3 in Dnmt3a cKO animals are required to establish a causal role for the compensating effect. However, as noted by the Reviewer, such experiments are beyond the scope of this study. Instead, we have performed additional bioinformatic analyses to better test the hypothesis according to the reviewer’s suggestion.

As suggested by the reviewer, we used box plots to compare the gain of H3K27me3 in different groups of genes. These analyses (now in Figure 5E and Figure 5 — figure supplement 2A) compare up- and down-regulated genes with the control gene set chosen to have matching expression levels in the control samples. Moreover, we separately analyzed genes that overlap DMRs and those which do not overlap DMRs; the corresponding control genes were likewise grouped by whether or not they overlap DMRs. (See Methods and Figure 3 — figure supplement 4B on how we selected the expression-matched non-DE genes).

This analysis shows that down-regulated DE genes with overlapping DMRs (the first box in Figure 5E, upper panel) and non-DE genes selected with matched expressions of up-regulated DE genes (the fourth box in Figure 5E, upper panel) show a small but significant median increase of H3K27me3 in the cKO (Wilcoxon rank-sum test; asterisks in the middle of each boxplot denotes p-value: *, p < 0.05). When we considered a larger set of DE genes with a more relaxed threshold (FDR<0.2, Figure 5 — figure supplement 2A), we observed that down-regulated DE genes containing DMRs accumulate significantly more H3K27me3 than non-DE genes containing DMRs. By contrast, up-regulated genes had no significant accumulation of H3K27me3 and were not significantly different from the control genes (Figure 5 — figure supplement 2A). No such differences were observed between DE and non-DE genes without overlapping DMRs (Figure 5E lower panel and Figure 5 — figure supplement 2A). These results suggest that the effect of H3K27me3 is likely specific to genes containing DMRs and the effect is stronger in the down-regulated DE genes.

We have added descriptions of these results in the manuscript as follows:

“We further examined whether the increased H3K27me3 could account for the reduced expression of some genes in the Dnmt3a cKO neurons. We found that down-regulated genes that overlap DMRs and the non-DE genes selected with matched expressions of up-regulated DE genes that overlap DMRs showed a small but significant median increase of H3K27me3 in the cKO (Wilcoxon rank-sum test p-value < 0.05; Figure 5E upper panel). When we considered a larger set of DE genes with a more relaxed threshold (FDR<0.2), we observed that down-regulated DE genes containing DMRs accumulate significantly more H3K27me3 than non-DE genes containing DMRs and up-regulated genes containing DMRs (Wilcoxon rank sum test p = 0.0029, Figure 5 — figure supplement 2A). By contrast, up-regulated genes had no significant accumulation of H3K27me3 and were not significantly different from the control genes (Figure 5 — figure supplement 2A). No such differences were observed between DE and non-DE genes without overlapping DMRs (Figure 5E lower panel and Figure 5 — figure supplement 2A). These results suggest that the effect of H3K27me3 is likely specific to genes containing DMRs and the effect is stronger in the downregulated DE genes, which may partially explain the fact that 24 genes were significantly down-regulated after the loss of repressive DNA methylation in the Dnmt3a cKO (Figure 2A-B).”

a. Figures 6E and S10A present the analysis that is most relevant to whether accumulation of K27me3 in the cKO impacts DEG and therefore may play a compensatory role. However, additional analysis could more decisively test the hypothesis. For example, do non-dysregulated genes containing DMRs show less K27me3 accumulation than down-regulated genes, but more K27me3 accumulation than upregulated genes? The authors could include a boxplot of a well-matched set of non-DE genes with overlapping DMRs in panel 6E to test this. A more powerful analysis, however, might be to examine a large set of genes from the genome that are selected to contain similar numbers of DMRs (in order to control for relative potential effects of demethylation) and ask if changes in K27me3 correspond with up-regulation (low K27me3 accumulation), no dysregulation (intermediate accumulation) or downregulation (high K27me3 accumulation).

We apologize for the confusion. We have added a panel using the same method used in Figure 5B (see “Enrichment test of DMRs and other genomic regions” in the Materials and methods section) to show the odds ratio of DMRs overlapping the H3K27me3 DM regions (now in Figure 4 — figure supplement 3D, right panel).

In addition, in the previous version of our manuscript, the enrichment of DMRs in DM regions was also assessed in the Venn diagram in Figure 4 — figure supplement 3D, and we have corrected the figure reference in the main text and the figure legend for Figure 4 — figure supplement 3. We tested this using the `fisher` sub-commands in the BEDTools Suite, where we checked if the amount of overlap between the 2 sets of regions is more than expected given their coverage and the size of the genome. This is in most cases analytically the same as shuffling the genome and checking the simulated (shuffled) versus the observed.

As the reviewer suggested, we have selected a more stringent set of regions as control. For each DM H3K27me3 peak, one non-DM H3K27me3 peak with a similar peak size and similar H3K27me3 signal in the control (wild-type) mice was selected as the matched control region using the greedy nearest neighbor matching in the R package `MatchIt`. Figure 4 — figure supplement 3C shows the distribution of H3K27me3 signal in the DM and non-DM peak sets:

We then counted the number of overlaps with DMRs in these matched non-DM peaks and compared those with the observed DM peaks (Figure 4 — figure supplement 3D, left panel). We found that the DM-peaks have significantly more overlaps with DMRs compared to the non-DM peaks (Fisher exact test p < 2.2e-16, odds ratio OR=2.70):

We have added these additional figures in Figure 4 — figure supplement 3 and described these results as follows in the main text:

“Likewise, P39 cKO DMRs were significantly enriched in peaks (22.4% overlapped H3K27me3 peaks in cKO, p<1e-100, Figure 5B) and DM regions of H3K27me3 (Figure 4 — figure supplement 3D). The DM regions of H3K27me3 had significantly more overlaps with DMRs compared to the non-DM control regions (Fisher exact test p < 2.2e-16, OR: 2.70, Figure 4 — figure supplement 3C-D).”

The methods used here are clarified in the Material and Methods section as follows:

“To select a set of non-DM control regions to match the base levels of H3K27me3 in DM regions, we started with the union peaks of the control and cKO samples and removed any peaks that overlap the DM regions. From these non-DM peaks, we used the R package MatchIt (Ho et al., 2011) (v3.0.2) to select regions with matched peak lengths and H3K27me3 levels (in the unit of RPKM) as those in the DM regions, with the greedy nearest neighbor matching using logistic link propensity score as a distance measure (requiring the standard deviation of the distance to be less than 0.01).”

b. On p19 the authors state that "P39 cKO DMRs were significantly enriched in peaks and DM regions of H3K27me3 (28.1% overlapped H3K27me3 peaks in cKO, p<1e-100) (Figure 6B)." However, 6B only presents the overlap with peaks in each genotype as an odds ratio and doesn't make it clear how the enrichment of DMRs in DM regions was assessed. This appears to be assessed to some degree in S8F (note the S8 figure legend appears to be wrong for this panel), but no analysis is presented to control for the general tendency of DMRs and H3K27me3 peaks to overlap. It seems imperative, given that DMRs already overlap with H3K27me3 peaks, that a resampling approach is used comparing DM peaks to non-DM K27me3 peaks that are matched to have similar wild-type H3K27me3 signal and size as DM peaks. Likewise, what would the changes in H3K27me3 signal presented in 6C look like at non-DMR regions that contain similar baseline wild-type H3K27me3 signal? Without these analyses it is not clear that DMRs are particularly unique in showing increased K27me3 signal.

As for the reviewer’s suggestion of looking at non-DMR regions that contain similar baseline wild-type H3K27me3, it will be computationally intense to search for such matching regions as non-DMR regions are essentially the complement of DMR regions from the whole genome. And it will take time and luck to shuffle the DMR regions, compute the baseline H3K27me3 signal and get a match to the observed DMR regions. Instead, we addressed this comment with the following two approaches.

First, in Figure 5 — figure supplement 1A we analyze the ChIP-seq signal enrichment for two types of shuffle control regions. The first group of shuffle regions (dotted lines) is generated by shuffling the DMRs randomly across the same chromosome (excluding blacklist regions). These regions represent random background noise. Indeed, we find that the ChIP-seq signal is flat for these control regions in every sample and every histone mark. The second group of shuffle regions (dashed lines) is selected to meet the same criteria as the first group with an additional restriction that the shuffles must reside within the P39 H3K27me3 peak regions (the union of P39 Control and P39 cKO H3K27me3 peaks). These regions should at least have some P39 H3K27me3 signal, and as seen in the figure when averaged they formed a line that peaks at the center in the P39 H3K27me3 samples. Even though on average they have a higher enrichment of H3K27me3 compared to the observed DMRs, the differences between the cKO and Control are much smaller than those in the observed DMRs.

Secondly, we tiled the genome with 1kb bins and grouped them into bins with overlapping DMRs and bins without overlapping DMRs. For these two groups of bins, we plotted the mean changes of H3K27me3 in cKO vs. Control as a function of the baseline H3K27me3 signal in the control animals, stratified by the deciles of the baseline H3K27me3 signal (Figure 5 — figure supplement 1B). To avoid double-dipping (using the control signal in both the x-axis and y-axis), we used the baseline signal from the Control replicate 1 on the x-axis and the baseline signal from the Control replicate 2 on the y axis, and vice versa. In both cases, we observed bigger increases in H3K27me3 in bins with overlapping DMRs (Wilcoxon rank-sum test p < 0.001 in each bin), even after controlling for the baseline H3K27me3 level:

These new analyses indicate that DMRs are especially enriched in regions of increased H3K27me3. We have added these figures in Figure 5 — figure supplement 1A-B and described these results in the main text as follows:

“Moreover, H3K27me3 was more abundant at the center of DMRs in cKO compared to control neurons at P39 (∆ = 0.15 in the unit of fold enrichment vs. IgG, 5.04% increases, Figure 5C). There were much smaller changes of H3K27me3 at these DMRs in newborn (P0, ∆ = 0.0075, 0.27% changes) or fetal (E14, ∆ = -0.028, -1.02% changes) neurons (Figure 5C), and the increases were not seen in randomly shuffled regions (Figure 5 — figure supplement 1A). The signal of H3K4me3 and H3K27ac at the DMRs was also elevated in the cKO, to a lesser extent (H3K4me3: ∆ = 0.068, 2.92% changes; H3K27ac: ∆ = 0.13, 4.64% changes). … When accessing the H3K27me3 changes as a function of baseline H3K27me3 levels in the control across the genome tiled in 1kb bins, we observed bigger H3K27me3 increases in bins with overlapping DMRs compared to bins without overlapping DMRs (Wilcoxon rank-sum test p < 0.001 in each bin, Figure 5 — figure supplement 1B). These results indicate that DMRs are particularly unique in showing increased H3K37me3 signal.”

c. The increase in H3K27me3 in the cKO at the mab21l2 locus at P0 (figure 5B), before methylation has or has not been established, suggests that H3K27me3 may change before methylation differences, rather than in reaction to changes in methylation. Is this a common effect seen across other genes? How does this impact the idea that compensation occurs due to lack of methylation? Related to this issue, on p.19 no specific plot is referenced to support the statement that "There was no corresponding increase of H3K27me3 at these DMRs in newborn (P0) or fetal (E14) neurons." A quantitative analysis as shown in 6C but for the E14 and P0 time-points could assess this directly.

We apologize for the confusion. At the Mab21l2 locus (now in Figure 4B) there is no significant increase of H3K27me3 at P0 or E14 (i.e. there is no horizontal red bar under the P0 and E14 tracks; and our DiffBind analysis used a threshold of FDR<0.05). We have added a shaded box to highlight the region with significant increases of H3K27me3 at P39 to better illustrate our finding here (Also see our replies to reviewer #1, comment 3 for the quantification of the H3K27me3 levels at this locus).

We agree with the reviewer that quantitative analysis is needed to support our statement regarding the lack of increased H3K27me3 at DMRs at P0 and E14. Indeed, this analysis was included (now in Figure 5C) and we have updated the text to directly reference this figure panel as follows:

“Moreover, H3K27me3 was more abundant at the center of DMRs in cKO compared to control neurons at P39 (∆ = 0.15 in the unit of fold enrichment vs. IgG, 5.04% increases, Figure 5C). There were much smaller changes of H3K27me3 at these DMRs in newborn (P0, ∆ = 0.0075, 0.27% changes) or fetal (E14, ∆ = -0.028, -1.02% changes) neurons (Figure 5C), and the increases were not seen in randomly shuffled regions (Figure 5 — figure supplement 1A).”

Finally, we have also shown the H3K27me3 changes did not happen in earlier timepoint as we found almost no significant differentially modified H3K27me3 regions in E14 or P0 (Figure 4 — figure supplement 1D). Also, if we focus on the 4,040 significant up-regulated H3K27me3 regions found in P39, the increased signal changes in cKO vs. Control in P39 are not correlated with those in E14 or P0 (Figure 4 — figure supplement 2B and D).

d. Why were developmental DMs called between E14 and P39 while DMRs were called between P0 and P39? Developmental comparisons would be more justified if both used P0 and P39.

Our choice to analyze the DMs between E14 and P39 was motivated by several characteristics of our dataset. Unfortunately, we do not have methylation data for E14 excitatory neurons, therefore developmental DMRs could only be called between P0 and P39. Moreover, we found much more significant developmental DM H3K27me3 regions in P39 vs. E14 compared to P39 vs. P0 (Figure 6 — figure supplement 1A). We believe that one of the reasons for this is that the E14 ChIPseq data are of better quality than the perinatal data from P0 (Figure 6 — figure supplement 2). For this reason, in our original submission, we chose to use E14 and P39 to call developmental H3K27me DMs. We now provide a new figure for the reviewers (Figure R2) to show corresponding results for Figure 6B-E and Figure 6 — figure supplement 1C-G using the set of developmental H3K27me3 DMs between P39 and P0. In these figures, we demonstrate that the conclusions drawn in the previous version are verified in the P39 vs. P0 comparison.

e. In the absence of additional data supporting H3K27me3 accumulation as a functional compensation mechanism for loss of DNA methylation, the title of Figure 6 should not so strongly claim that H3K27me3 changes compensate for mC loss.

We have revised the title of this figure (now Figure 5): Increased H3K27me3 correlates with the loss of postnatal DNA methylation.

3. While initially assessed by the authors, mCH changes are not integrated into the analysis of H3K27me3 changes. Previous studies by this group and others have suggested the importance of mCH in gene regulation within specific neuron populations and, given the interest to the field, more thorough analysis is warranted regarding this neuronal methylation and H3K27me3.

a. Given the association, even if limited, between differential mCH loss and gene dysregulation in the cKO, it is relevant to examine how changes in K27me3 relate to, or compensate for, loss of mCH. For example, do genes that are not dysregulated in the cKO, and normally have high mCH (which is lost in the cKO) show higher increases in K27me3 compared to non-dysregulated genes that have low mCH (and therefore lose less mCH)? Reciprocally, one could ask if a population of genes selected to have similar, high mCH loss in the cKO, show differential gene dysregulation as a function of differential K27me3 accumulation.

We appreciate the reviewer’s insightful suggestions for analysis of the relationship between the loss of mCH and the changes in the H3K27me3 signal. We agree that since the loss of mCH and the loss of mCG are highly correlated (Figure 3 — figure supplement 3D), we would expect to see to a certain extent a similar compensation of H3K27me3 in genes/regions with loss of mCH. We have added a figure to Figure 5 — figure supplement 2B to show the changes in the H3K27me3 signal as a function of the loss of gene body mCH in non-DE genes (stratified by the deciles of δ mCH):

We indeed observed a larger increase of H3K27me3 signal in the gene body of genes that lost most mCH in the cKO (genes that have high mCH in the control; the leftmost decile in the plot), compared to genes that lost less mCH (genes that normally have low mCH).

In addition, we grouped the genes by the quantiles of loss of mCH in cKO and plotted the gene expression fold-changes as a function of the changes of H3K27me3 in gene body for each of these gene sets (Figure 5 — figure supplement 2C):

This shows that genes that lost the most mCH in the cKO (genes that have high mCH in the control; red line) tend to have increased expression in the cKO, compared to genes that lost the least mCH (genes that have low mCH to begin with; blue line), which is consistent with our results in Figure 3E. Moreover, we can see a negative trend between the changes of gene body H3K27me3 signal and gene expression fold-change in the gene set with the highest baseline mCH, in which genes that showed the biggest increases of H3K27me3 were inclined to decrease in expression whereas genes that had small increases or decreases of H3K27me3 signal were generally up-regulated in gene expression. Such correlation was not observed in genes with the lowest baseline mCH.

We have added these plots in Figure 5 — figure supplement 2B-C and described the results in the main text as follows:

“We next analyzed how changes in H3K27me3 related to the loss of mCH. In all non-DE genes (FDR ≥ 0.05), we observed a larger increase of H3K27me3 signal in the gene body of genes that lost most mCH in the cKO (Figure 5 — figure supplement 2B). Indeed, when we grouped all genes by the extent of the loss of mCH, we found that genes that lost the most mCH had a negative correlation between the changes in gene body H3K27me3 signal and gene expression fold-change. Such correlation was not observed in genes that did not lose mCH (Figure 5 — figure supplement 2C). These results suggest that H3K27me3 may have a role in repressing expression specifically in regions that lose repressive non-CG DNA methylation.”

b. What are the changes in mCH at DM H3K27me3 sites? Since mCG and mCH loss are correlated, one might expect that these are sites that normally also contain high mCH. It might not be feasible to call mCH DMRs given the technical limitations, but quantification of this signal can be done at DMs and DMRs, and the potential for it to contribute to these effects could be discussed.

As suggested by the reviewer, we have estimated the changes of mCG and mCH in the DM H3K27me3 regions and also compared them with the changes in matched non-DM regions (see our response to comment 2b for the methods we used to select these matched non-DM regions). We found that the DM regions showed significantly larger decreases in both mCG and mCH when compared to non-DM regions (Wilcoxon rank-sum test p < 0.0001).

We have added these plots in Figure 4 — figure supplement 3E and described the results in the main text as follows:

The DM regions also have larger decreases in both mCG and mCH when compared to non-DM regions (Wilcoxon rank-sum test p < 0.0001, Figure 4 — figure supplement 3E).

4. The authors present evidence of PRC2 binding at DEGs (Figure 5A) as a motivation to assess H3K27me3 in their mice. However, they then observe changes in H3K27me3 that they contend do not occur under normal conditions and propose that these changes are an underlying mechanism that blocks gene dysregulation in these mice. How are we to reconcile this original rationale of a normal function for PRC2 at dysregulated genes with the model of de novo compensatory H3K27me3 accumulation? Some explanation by the authors of how these separate notions may relate to one another would be helpful.

The BART results for up-regulated and down-regulated DE genes (now in Figure 4A) were bioinformatics predictions that drove us to look into the Polycomb repression via H3K27me3 profiling. BART predictions are based on ChIP-seq data in various cell lines and tissues from the ENCODE project. This means that the potential PRC2 binding at our DEGs may normally happen in systems other than the brain or pyramidal neurons, or at other time points during development. Our results showed that these DEGs were normally regulated by DNA methylation in the pyramidal neurons but they can still be regulated by PRC2 in the current system if the DNA methylation was abolished.

We have added these explanations in the Discussion section of the manuscript as follows:

“Overall, our results suggest that when DNA methylation is disrupted, H3K27me3 might partially compensate for the loss of mCG and/or mCH and act as an alternative mode of epigenetic repression. Nevertheless, we did not find differential expression in any of the four core components of PRC2 (Ezh2, Suz12, Eed and Rbbp4) in adult Dnmt3a cKO animals. It is possible that the increased H3K27me3 was mediated by transient expression of PRC2 components during development in the cKO. Furthermore, the predictions from BART (Figure 4A) were derived from various cell lines and tissues from the ENCODE project (Davis et al., 2018; ENCODE Project Consortium, 2012), suggesting that the potential PRC2 binding at our DEGs may normally happen in systems other than the brain or pyramidal neurons, or at other time points during development. Additional experiments which directly manipulate components of the PRC2 system are required to further test the potential compensation mechanism.”

5. Some effects on H3K27ac are observed at DMRs, but the authors ignore the prospect that these effects could drive changes in gene expression. Previous groups have shown significant changes in H3K27ac in DNMT3A models, and some of the data the authors present suggest that this could also explain some of the observations in their model.

a. While DM K27ac peaks are not identified in the cKO, in figure 6B there is a similar enrichment of DMRs to associate with K27ac peaks in the cKO vs control as there is for K27me3. In addition, there is a similar increase in odds ratio of enrichment of K27ac peaks to the of K27me3 at DMRs for the cKO in figure 6B. The analysis in S8F-G that is referenced when referring to K27ac is either missing (S8G) or is not showing data for K27ac (S8F). The authors should at least comment in the Results section that these signals exist and may be significant for gene regulation.

In the new set of DMRs from the new batch of DNA methylation data (see responses to the editor on why we introduced a new batch of sequencing data), we didn’t see the enrichment of overlapping DMRs in H3K27ac (now in Figure 5B). But indeed, we still see the enrichment of the P39 H3K27ac signal in the center of DMRs (Figure 5C). We decided to focus on the changes of H3K27me3 since it’s more evident, but we agree with the reviewer that H3K27ac may also have an impact on regulating the expression of the DEGs. We apologize for the confusion in referring to Figure S8F-G, and we have corrected the text and added some comments regarding H3K27ac as follows:

“The signal of H3K4me3 and H3K27ac at the DMRs was also elevated in the cKO, but to a lesser extent (H3K4me3: ∆ = 0.068, 2.92% changes; H3K27ac: ∆ = 0.13, 4.64% changes).”

b. Are two biological replicates sensitive enough to call differences in H3K27ac? In the context of a small number of replicates and the subtle magnitude of changes in gene expression that are observed, negative results for differential peaks should be interpreted with caution. In contrast, the larger peaks observed in K27me3 may have led to increased ability to call differential peaks.

We agree with the reviewer that our power for discovering changes in H3K27ac is somewhat limited by the sample size of two replicates per condition. We have added some comments to remind the readers of this potential issue in the Discussion section as follows:

“Indeed, we found that the Dnmt3a cKO DMRs overlapped with regions that gain methylation during normal postnatal development (Figure 3G-H). These DMRs had increased H3K27ac in the Dnmt3a cKO (Figure 5C), consistent with observations in adult animals lacking Dnmt3a specifically in GABAergic Sst- or Vip-expressing interneurons (Stroud et al., 2020). In those experiments, embryonic gene-regulatory elements had lower cytosine methylation and increased H3K27ac and H3K4me1 (Stroud et al., 2020). This suggests an essential role for Dnmt3a and DNA methylation in shaping the transcriptome during development in part via the inactivation of embryonic enhancers, with potentially long-lasting effects on the gene expression pattern of mature neurons. Although we did not detect individual peaks with a statistically significant difference in H3K27ac between cKO and control (Figure 4 — figure supplement 1C), this could be due to a small number of replicates and the subtle magnitude of changes in gene expression relative to the large number of tested peaks. Future experiments using more replicates or orthogonal techniques like CUT&RUN and CUT&Tag could help to further investigate the relationship between the Dnmt3a-dependent DNA methylation and the activation of enhancers.”

6. Though the characterization of DNMT3A cKO in excitatory neurons is novel, if the authors are claiming that these neurons specifically are driving behavioral phenotypes then further contextualization and characterization of the extent of conditional deletion is needed.

a. It would be useful to justify the rationale for investigating excitatory neurons, and how using this class of neurons can uncover novel findings. Existing studies of DNMT3A cKO have shown behavioral and epigenomic effects in pan-neuronal and inhibitory neuron conditional knockouts. Comparisons between these models and discussion of differences could offer strong insights into what unique circuits may be most disrupted by DNMT3A cKO in excitatory neurons.

Our previous studies (e.g. Lister et al., 2013; Mo et al. 2015; Luo et al. 2017) have shown that CG and non-CG DNA methylation across all neuron types has a highly cell type specific distribution that is globally remapped during postnatal brain development. To understand the functional significance of this epigenetic regulation, we chose to focus on the major neuronal populations with established roles in regulating cognitive behavior. Our previous study (Lavery et al., 2020) analyzed the loss of Dnmt3a in GABAergic neurons. Here, we chose to focus on the other major neuronal population, the excitatory (glutamatergic) neurons in the frontal cortex.

Note that both pan-neuronal and pan-GABAergic cKOs (e.g. (Feng et al., 2010; Morris et al., 2014; Nguyen et al., 2007)) affect many more regions of the brain than what is affected in our study, thus leading to confounding compensatory/deleterious phenotypes. Previous studies using those cKOs had profound phenotypes and died before reaching early adulthood. Because we wanted to analyze the effect of blunting the increase in Dnmt3a during the perinatal period in a cell-specific manner, without killing the mice, we believe our approach has allowed for a much more detailed study of the regulation of neuronal DNA methylation. Although the phenotypes observed in the present cKO are mild, both at the functional and transcriptional level, we still found significant changes that are more reminiscent of those found in neurodevelopmental disorders. Dnmt3a de novo mutations have been linked to autism spectrum and other developmental disorders. Our present results may point to the period of gain of methylation in postmitotic excitatory neurons as a critical period in the development of these disorders.

b. If the authors are claiming that behaviors and phenotypes are the result of specifically knocking out DNMT3A in all excitatory neurons, further evidence should be provided that the Cre-line is being expressed sufficiently and specifically in excitatory neurons, and that their genomic data is faithfully capturing the majority of excitatory cells while excluding non-excitatory cells.

We thank the reviewer for raising this question. The Neurod6-Cre mouse line (also referred to as NEX-Cre) used in our study is a knock-in mouse line that expresses Cre recombinase under the control of Neurod6 regulatory sequences (Goebbels et al., 2006). Neurod6 is a transcription factor exclusively expressed in the central nervous system – most prominently in excitatory neurons of the neocortex and hippocampus (e.g. see Yao et al., Cell 2021), and its expression parallels excitatory neuronal differentiation and synaptogenesis during brain development (see Figure 1 — figure supplement 1B).

Consistent with the expression of NeuroD6, Goebbels et al. (2006) observed the most prominent Cre activity in the neocortex and hippocampus, starting from around embryonic day 11.5. Moreover, Cre-mediated recombination marked pyramidal neurons and dentate gyrus mossy and granule cells and was absent from proliferating neural precursors of the ventricular zone, interneurons, oligodendrocytes, and astrocytes (Goebbels et al., 2006). We confirmed that Neurod6-dependent Cre recombination occurred only in excitatory neurons using the INTACT mouse (Mo et al., 2005), in which Cre-mediated excision of the transcription stop signals activates the expression of the nuclear membrane tag (Sun1 -sfGFP-myc) in the cell type of interest. Since recombination only occurs in excitatory neurons, we expect excision of exon 19 from Dnmt3a will only occur in those neurons expressing Cre recombinase. Figure 1 — figure supplement 2A-C depicts the staining of Sun1-GFP across the brain in non-inhibitory Neurod6+ neurons, confirming previous results (Goebbles et al., 2006). Note that the sfGFP-tag decorates the nucleus of cells where recombination occurs.

We also added the following Figure 1 — figure supplement 2D to show the expression of pan-neuronal, excitatory neuron and inhibitory neuron marker genes in our RNA-seq data, in which we observed high expression of excitatory neuron markers and very low expression of inhibitory neuron markers, further demonstrating that the neurons carrying the GFP label belong to the excitatory neuron population.

7. The manuscript would be further improved by clarification of some methods and statistics, and modifications in data presentation:

a. The authors should provide information on the statistics performed on the behavior data. Are there significant differences between sexes, or significant sex by genotype interactions for these behavioral tasks? A two-way ANOVA using genotype and sex would be useful in understanding the impact of these variables. There appears to be many effects that are sex-specific, yet this appears to be overlooked in the text. Furthermore, if there are robust sex-specific effects then the subsequent molecular and genomic analyses should be done using both sexes, and comparisons could be done to explain the molecular underpinnings of these differences.

We thank the reviewer for raising this point, for which we would like to provide clarification. The experiments in this section (Figures 1, Figure 1 — figure supplement 4 and 5) were first and foremost designed to investigate whether the loss of Dnmt3a-dependent DNA methylation during the maturation of pyramidal cells might lead to endophenotypic markers of neurodevelopmental illnesses. Male mice were used in generating all genomic datasets. We performed behavioral experiments in cohorts of male mice, which led to the discovery of impairments in social behavior, PPI, and working memory. We subsequently investigated if those same behaviors were affected in a cohort of female mice, finding that the loss of Dnmt3a caused similar impairments in working memory, but not in PPI or social behavior. Because the male and female cohorts were tested in completely independent experiments, it would not be appropriate to perform a two-way ANOVA using sex as a variable, as this could result in an inaccurate depiction of results. Yet, the wider range of behavioral impairments identified in the male cohort is noteworthy, as it correlates well with human neurodevelopmental disorders, such as schizophrenia and autism, in which males are disproportionately more affected than females. We have edited the Discussion to clarify that even though it was outside the scope of the present work, our data provide support for designing a comprehensive follow up study aiming to dissect sex-specific effects of Dnmt3a and their underlying causes.

b. The authors might also consider showing preference index when analyzing social approach data when comparing genotypes.

We thank the reviewer for this suggestion, which we have carefully considered. We decided to keep our results as raw "time spent on side" to take into account the differing time spent in the empty space, which showed high significance and would be lost if the results were reported as preference index. We edited the results text in order to mention this finding clearly:

“Moreover, when tested in a three-chamber box in which one of the sides contained a novel mouse and the opposite a novel object, male Dnmt3a cKO animals spent less time in the former, opting, instead, to remain significantly longer in the center (empty compartment), which is suggestive of reduced exploration and social interest (Figure 1D, left panel; p = 0.01048).”

c. What is the background set of genes for 2B-C? Synaptic genes are enriched in both up- and down-regulated gene lists. How is gene expression normalized? Is there a discovery bias for more highly expressed genes?

In the previous version of our manuscript, we used expressed genes in P39 as the background set in the GO analysis for the DE genes. Expressed genes were defined as genes with CPM (counts per million) greater than 2 in at least two samples. In the new batch of RNA-seq data (see responses to the editor on why we introduced a new batch of sequencing data), we identified fewer significant DE genes and hence found no significant enrichments in any GO terms. Therefore, we have removed all the figures regarding the GO analysis for the DE genes.

d. What value was used to identify equally expressed unchanged genes in figure 4B, TPM? Identifying equally expressed genes can be complicated for nuclear RNA where unprocessed RNAs contribute differentially to total counts based on the length of the gene. In addition, previous analysis of mCH enrichment at dysregulated genes in inhibitory neuron-specific KO of DNMT3A also found DEGs to be differentially methylated based on direction of change but showed that all other genes were more lowly methylated than these two groups (Lavery et al. 2020). Here the authors see the same relationship between DEG groups but find other genes to be higher in mCH rather than lower. Do the authors have any perspective on why these differences are observed?

We used the R package `MatchIt` to select non-DE genes with matched expression levels in the unit of TPM. We have added Figure 3 — figure supplement 4B to show these non-DE genes have similar expression levels as the DE genes in the P39 Control samples, and their expressions were not changed in the P39 cKO samples.

With the new batch of RNA-seq and MethylC-seq data with bona fide biological replicates (see responses to the editor on why we introduced a new batch of sequencing data), we updated the mCH profile in the DEGs and corresponding non-DE control genes (now in Figure 3D):

We now see that the up-regulated genes have higher mCH than the downregulated genes, and non-DE genes have lower mCH than the up-regulated genes but also indistinguishable levels from the down-regulated genes. This in general agrees with the findings in Lavery et al. We presumed the discrepancy between the findings in Lavery et al. and our previous version of the manuscript might be due to the differences in the power of detecting DE genes, the choice of thresholding significant DEGs, the method of selecting non-DE control genes, and probably some unknown distinctions of biological mechanism in regulating gene expression after Dnmt3a cKO in inhibitory neurons vs. in excitatory neurons.

e. On p22 the authors state "We found that developmental peaks (DevLoss and DevGain) were relatively unaffected by the cKO(H3K27me3 = 0.11, -0.20 respectively, in units of log10(CPM+1)), whereas Group cKO had ten-fold larger mean effect (H3K27me3 = 1.12 respectively)." Given the y scales in figures 7E and S10H it is unclear how they derived a ∆ value of 1.12 on a log 10 scale for Group cKO (e.g. in S10H the median values look much less 1 unit different on a log10 scale). Please clarify the actual magnitude of effect sizes.

We are sorry for the confusion. In the main figure (now in Figure 6C-E) we normalized the H3K27me3 signal to the means of the two P39 Control samples and plotted the fold changes. In the said supplemental figures (now in Figure 6 — figure supplement 1E-G) we plotted the H3K27me3 signal in units of log10(CPM+1) for each sample. In the previous version of our manuscript, the δ values were achieved by running a paired t-test on the raw CPM scale. To avoid confusion, we have rerun the paired t-test on the log10(CPM+1) scale and updated the δ values in Figure 6 — figure supplement 1E-G. We have also edited the relevant statements in the main text as follows:

“To further stratify the joint distribution of developmental and Dnmt3a cKOdependent changes in both H3K27me3 and DNA methylation, we assigned peaks to three groups (Figure 6C-E and Figure 6 — figure supplement 1EG). “Group DevLoss” and “Group DevGain” peaks lose or gain H3K27me3 during development, respectively (Figure 6C-D and Figure 6 — figure supplement 1E-F). “Group cKO” peaks have higher H3K27me3 in the Dnmt3a cKO compared to control at P39 (Figure 6E and Figure 6 — figure supplement 1G). We found that developmental peaks (DevLoss and DevGain) were relatively unaffected by the cKO (H3K27me3 = 0.02, -0.03 respectively, in units of log10(CPM+1)), whereas Group cKO had 4.5-fold larger mean effect (H3K27me3 = 0.09 respectively). Group cKO peaks also experienced greater loss of mCG (mCG = -23.5%) than Group DevLoss (13.0%) or DevGain (18.6%) (Figure 6C-E and Figure 6 — figure supplement 1E-G, middle and right panels). These results suggest that regions prone to alteration of H3K27me3 by Dnmt3a cKO are distinct from the regions affected by developmentally dynamic H3K27me3.”

f. The authors describe the association between gene body mCH and gene expression changes as only accounting for 1% of the variance. It would be useful to have a point of reference to understand if this is a low value or a high value in comparison to other potential predictors of gene expression changes. For example, how much does the proposed overcompensation by H3K27me3 accumulation contribute to the genome-wide variance in gene expression? In addition, is it the case that variance in the measured fold-changes will be mitigated by addition of more replicates to more accurately estimate true fold-changes? If this is the case, it should be noted.

Thank you for the valuable suggestion to provide a point of reference for the magnitude of the correlation between mCH and gene expression. The variance explained (R-squared) between the gene body mCH and RNA log fold-change was 0.46%. To give a quantitative context for this, we estimated the total explainable variance as the R-squared between RNA log fold-change from independent biological replicates. That is, we selected one of the cKO and one control sample to calculate the first replicate logFC value; we then used the second cKO and second control sample to get a second, independent replicate logFC estimate. We found that the correlation between these was 1.30%. This shows that, although the mCH R-squared is small, it is nevertheless a significant fraction of the total explainable variance given the noise in our RNA-seq data. As the reviewer suggests, it is likely that adding more data (more biological replicates) would reduce the noise in both the RNA logFC and the mCH measurements, which might increase the explained variance. We have revised the Results as follows:

“The difference in gene body methylation (cKO – Control) was negatively correlated with gene expression changes, consistent with repressive regulation (Gabel et al., 2015; Lavery et al., 2020) (Figure 3E). This correlation accounted for 0.46% of the variance of differential gene expression, whereas the total explainable variance (R2 between biological replicates) was 1.30% (Figure 3 — figure supplement 4D). The strength of the association between mCH and mRNA changes may be limited by the use of only two biological replicates in our dataset.”

Furthermore, we used a linear model to test how much the δ H3K27me3 signal may contribute to the variance of RNA logFC between cKO and control in 14,755 expressed genes. We found that the R-squared value is 0.0599%, which is smaller than the R-squared for mCH (0.456%, see Figure 3 — figure supplement 4D).

g. Using different color schemes would be helpful in 8A and 8B to avoid confusion.

We have modified the said figure (now in Figure 7A-B) as suggested to avoid confusion.

h. How was clustering analysis done in 8D? What do the values for K27me3 Diff. Mod and DEG enrichment indicate (i.e. what does the scale from 0 to 0.02 represent)?

The clustering of DMVs (now in Figure 7C) was originally performed using k-means. In the resubmission, we have updated this analysis to make the results more evident, and as a result we modified the clustering procedure. The new procedure sorts DMVs according to the following criteria: (1) whether they overlap an H3K27me3 DM region; whether they overlap an up- or down-regulated DE gene (with absolute log2(Fold-change)>0.2); mean mCG>0.3 in P39 Control, P39 cKO; P0 Control, or P0 cKO; and finally by the average level of H3K27me3, H3K3me3 and H3K27ac.

The “Fraction DM H3K27me3” shows what fraction of the length of the DMV overlaps a DM H3K27me3 region (called using diffbind).

We now show directly the mean mRNA logFC for all genes contained within each DMV.

We have added the description of this analysis in the method section as follows:

“For the clustering visualization in Figure 7C, we sorted DMVs according to the following criteria: (1) whether they overlap an H3K27me3 DM region; whether they overlap an up- or down-regulated DE gene (with absolute log2(Fold-change)>0.2); mean mCG>0.3 in P39 Control, P39 cKO; P0 Control, or P0 cKO; and finally, by the average level of H3K27me3, H3K3me3 and H3K27ac. The “Fraction DM H3K27me3” shows what fraction of the length of the DMV overlaps a DM H3K27me3 region (called using DiffBind). And we also plotted the mean mRNA logFC for all genes contained within each DMV.”

Reviewer #3 (Recommendations for the authors):

– The authors delete exon 19 of Dnmt3a. I think they should provide a more thorough description of what this deletion is expected to cause: loss of function due to premature stop codon, loss of enzymatic activity, etc. Moreover, it would be desirable to show by immunostaining whether DNMT3A protein is specifically lost in excitatory neurons.

We thank the reviewer for raising this question and allowing us to clarify an important point. Deletion of exon 19 in the mouse Dnmt3a gene is expected to produce a deletion of 50 amino acids (aa 623-673) in the methyltransferase domain (encompassing the S-adenosyl-L-methionine binding site), thus leading to disruption of the catalytic activity of the enzyme (Lyko 2018). In addition, this approach resulted in a marked downregulation of Dnmt3a protein levels, as quantified by western blot experiments (Figure 1 — figure supplement 3B). Such a downregulation in Dnmt3a expression was matched by an almost complete loss of mCH and loss of mCG remethylation in excitatory neurons confirming the lack of activity of Dnmt3a in excitatory neurons from frontal cortex (Figure 3C). Please also see our response to Reviewer #2 comment 6b where we performed triple labeling of brain slices, showing that the Cre-dependent recombination, and thus the deletion of Dnmt3a, occurs in excitatory, but not inhibitory, neurons.

– Regarding the DMRs identified in DNMT3A KO neurons, the authors should use the generated profiles for H3K27ac in P0 and P39 WT neurons to determine which DMRs overlap enhancers that are active in any of these two neuronal stages. This is more relevant than overlaps with enhancers that could be active in any given cell type. Most importantly, the authors should evaluate the impact the loss of DNA methylation can have on enhancer activity by measuring H3K27ac levels and eRNAs in DNMT3A KO neurons. Measuring eRNA levels can be particularly relevant, as recent evidences suggest that is a better marker of enhancer activity than H3K27ac.

We agree that it is important to analyze the impact of the loss of DNA methylation on enhancer activity. For P39 samples, we analyzed the overlap of H3K27ac peaks in pyramidal neurons with DMRs (Figure 5A-B and Figure 5 — figure supplement 1A). Moreover, in Figure 5C we plotted the mean level of H3K27ac in the region around DMRs. These analyses show that enhancer activity is indeed increased in regions that lose DNA methylation in the cKO. Please also see our responses to Reviewer #1 comment 3 and Reviewer #2 comment 5 regarding the analyses on H3K27ac.

We would like to clarify that we did not perform H3K27ac ChIP-seq for P0, so we were unable to specifically examine the association of enhancer activity at the perinatal timepoint.

We agree that it would be interesting to further examine the enhancers using eRNAs, however, we found that the depth of coverage of our RNA-seq data was not sufficient to address this.

– The authors should aim at performing experiments that address the proposed compensatory role for PRC2 and H3K27me3. I understand that generating a KO for PRC2 could be rather complicated, but perhaps the authors could consider using new drugs that act as very specific inhibitors against the enzymatic activity of PRC2.

We appreciate the reviewer’s suggestion, and we agree that it will need further experiments to directly test the hypothesis of the compensatory role for PRC2 and H3K27me3. But such experiments are out of the scope of this manuscript. We have modified the text to weaken our claims regarding the compensation effect of H3K27me3.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Li J, Pinto-Duarte A, Zander M, Lai C, Osteen J, Fang L, Luo C, Lucero J, Gomez-Castanon R, Nery J, Silva-Garcia I, Pang Y, Powell SB, Ecker JR, Luo C, Mukamel EA, Behrens MM. 2022. Transcriptomic and epigenetic disruptions in excitatory neurons in Dnmt3a conditional knockout mouse. NCBI Gene Expression Omnibus. GSE141587
    2. Lister R, Mukamel EA, Nery JR, Urich M, Puddifoot CA, Johnson ND, Lucero J, Huang Y, Dwork AJ, Schultz MD, Yu M, Tonti-Filippini J, Heyn H, Hu S, Rao A, Esteller M, He C, Haghighi FG, Sejnowski TJ, Behrens MM, Ecker JR, Wu JC. 2013. Global epigenomic reconfiguration during mammalian brain development. NCBI Gene Expression Omnibus. GSE47966 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Supplementary file 1. Sequencing metrics for RNA-seq, MethylC-seq, chromatin immunoprecipitation sequencing (ChIP-seq), and genomic DNA for the Dnmt3a conditional knockout (cKO) and control samples.
    elife-66909-supp1.xlsx (36.7KB, xlsx)
    Supplementary file 2. Gene expression and list of differentially expressed genes in P39 Dnmt3a conditional knockout (cKO) and P39 control.
    elife-66909-supp2.xlsx (4.5MB, xlsx)
    Supplementary file 3. List of differentially methylated regions (DMRs).
    elife-66909-supp3.xlsx (21.1MB, xlsx)
    Supplementary file 4. Known transcription factor motif enrichment in P39 Dnmt3a conditional knockout (cKO) differentially methylated regions (DMRs).
    elife-66909-supp4.xlsx (43.3KB, xlsx)
    Supplementary file 5. List of transcription factors and chromatin regulators that bind at cis-regulatory regions of the differentially expressed genes in P39 Dnmt3a conditional knockout (cKO).
    elife-66909-supp5.xlsx (93.1KB, xlsx)
    Supplementary file 6. List of chromatin immunoprecipitation sequencing (ChIP-seq) peaks.
    elife-66909-supp6.xlsx (13.3MB, xlsx)
    Supplementary file 7. List of up-regulated H3K27me3 signal regions in P39 Dnmt3a conditional knockout (cKO).
    elife-66909-supp7.xlsx (481.2KB, xlsx)
    Supplementary file 8. List of enriched Gene Ontology terms for biological process in genes associated with up-regulated H3K27me3 regions in P39 Dnmt3a conditional knockout (cKO).
    elife-66909-supp8.xlsx (112.7KB, xlsx)
    Supplementary file 9. List of developmental regulated H3K27me3 signal regions in control pyramidal neurons.
    elife-66909-supp9.xlsx (765.8KB, xlsx)
    Supplementary file 10. List of DNA methylation valleys (DMVs).
    elife-66909-supp10.xlsx (342.5KB, xlsx)
    MDAR checklist

    Data Availability Statement

    All sequencing data are available in the Gene Expression Omnibus under accession GSE141587. A genome browser displaying the sequencing data is available at https://brainome.ucsd.edu/annoj/mm_dnmt3a_ko/.

    The following dataset was generated:

    Li J, Pinto-Duarte A, Zander M, Lai C, Osteen J, Fang L, Luo C, Lucero J, Gomez-Castanon R, Nery J, Silva-Garcia I, Pang Y, Powell SB, Ecker JR, Luo C, Mukamel EA, Behrens MM. 2022. Transcriptomic and epigenetic disruptions in excitatory neurons in Dnmt3a conditional knockout mouse. NCBI Gene Expression Omnibus. GSE141587

    The following previously published dataset was used:

    Lister R, Mukamel EA, Nery JR, Urich M, Puddifoot CA, Johnson ND, Lucero J, Huang Y, Dwork AJ, Schultz MD, Yu M, Tonti-Filippini J, Heyn H, Hu S, Rao A, Esteller M, He C, Haghighi FG, Sejnowski TJ, Behrens MM, Ecker JR, Wu JC. 2013. Global epigenomic reconfiguration during mammalian brain development. NCBI Gene Expression Omnibus. GSE47966


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES