Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2025 Aug 13;122(33):e2506846122. doi: 10.1073/pnas.2506846122

DNA polymerase β suppresses somatic indels at CpG dinucleotides in developing cortical neurons

Noriyuki Sugo a,1, Arikuni Uchimura b, Risa Matsumoto a, Hiro Nakayama a, Shota Fujimoto a, Saya Mizuno a, Mayumi Higuchi a, Masaaki Toshishige b, Yasunari Satoh b, Sayaka Wakayama c, Teruhiko Wakayama c, Takeshi Yagi a
PMCID: PMC12377747  PMID: 40802685

Significance

Proper brain development relies on the precise activation of neuronal genes. When this process fails, harmful changes can lead to mental illness. To activate key genes, developing brain cells must remove chemical tags called DNA methylation marks. However, this erasure can accidentally damage the DNA itself unless the DNA repair enzyme DNA polymerase β (Polβ) quickly fixes it. We found that such damage occurs most often at CpG sites—short DNA motifs that regulate many neuronal genes—and increases mutation frequency by ~ninefold, potentially disrupting neuronal function. These findings reveal a hidden cost of gene activation in the brain and help explain how DNA damage during development may contribute to the origins of psychiatric conditions.

Keywords: neurogenesis, mutagenesis, DNA repair, CpG demethylation

Abstract

Somatic mutations in cortical neurons have been implicated in psychiatric disorders. While endogenous DNA damage and repair errors are potential contributors to these mutations during development, the underlying mutagenic mechanism remains unclear. Here, we investigated somatic mutations in immature cortical neurons using mouse somatic cell nuclear transfer-derived embryonic stem cells and whole-genome sequencing. Insertions and deletions (indels) were commonly observed in both repeat and nonrepeat sequences in wild-type cells. The loss of DNA polymerase β (Polβ), an enzyme involved in gap-filling during base excision repair and Ten-Eleven Translocation (TET)-mediated active DNA demethylation, in neural progenitor cells increased indel frequency by ~ninefold at cytosine-phosphate-guanine (CpG) dinucleotides and raised the frequency of structural variants by ~fivefold. These mutations were enriched in neuronal genes, leading to frameshift mutations, amino acid insertions/deletions, and the gain and loss of CpG sites in regulatory regions. Our findings suggest that Polβ preferentially repairs DNA lesions generated at CpG sites by TET-mediated active demethylation, thereby suppressing the mutagenesis that accompanies neuronal gene activation during cortical development.


Single-cell whole-genome sequencing (WGS) of postmortem human brain samples has revealed somatic mosaicism among cortical neurons (14). Somatic mutations, including single-nucleotide variants (SNVs), small insertions and deletions (indels), structural variations (SVs) such as copy number variations, and mobile element transpositions, have been observed in cortical neurons from developmental stages and have been implicated in neurodevelopmental and psychiatric disorders (58). Understanding the mechanisms behind these mutations is crucial for elucidating their role in disease pathogenesis. Previous studies suggest that replication stress-induced DNA double-strand breaks (DSBs) and error-prone DNA repair via nonhomologous end-joining (NHEJ) contribute to SVs in both mouse and human neural progenitors (9, 10). However, the mechanisms of endogenous DNA damage and mutagenesis in the developing cortex remain poorly understood (11, 12).

X-family DNA polymerase β (Polβ) plays a pivotal role in single nucleotide gap-filling during short-patch base excision repair (BER) (13). In addition, thymine DNA glycosylase (TDG)–dependent BER is also involved in TET-mediated active DNA demethylation process that replaces 5-methyl-cytosine (5mC) with cytosine at CpG dinucleotides (14). Polβ-deficient mice exhibit increased apoptosis in immature neurons due to PARP1 and p53 activation in the developing nervous system, leading to lethality immediately after birth (1517). Our previous research demonstrated that Polβ loss increases single-strand breaks (SSBs) and DSBs in cortical neural progenitors and developing neurons during the period of neurogenesis and postnatal development. Furthermore, TET activity influences the extent of DSB formation (18, 19), raising the possibility that active DNA demethylation contributes to mutagenesis in neurons.

Although mutations in the Polβ gene have been reported in various cancers (2022), how the loss of Polβ affects mutagenesis remains controversial. Both loss-of-function and gain-of-function studies demonstrate that Polβ exhibits a mutator phenotype in response to monofunctional alkylating agents (23, 24). Under physiological conditions in vivo, the effects of Polβ appear to be cell type- and environment-dependent (2528). Despite extensive research on Polβ in cancer and DNA repair, its role in neuronal mutagenesis during cortical development remains unclear. Therefore, understanding the relationship between the Polβ-mediated process and neuronal mutagenesis in the developing cortex is critical.

This study aimed to elucidate the mutagenic mechanism following cortical neurogenesis. We investigated somatic mutations in wild-type and Emx1-Cre-driven Polβ-deficient immature excitatory cortical neurons during late embryonic stages. To detect rare somatic mutations in neurons, we used somatic cell nuclear transfer (SCNT) technique. Embryonic stem (ES) cell lines, specifically nuclear-transferred ES (ntES) cells, were established from neuronal nuclear-transferred embryos (29, 30). This approach allows precise replication of the original neuronal genome in proliferating ntES cells instead of in vitro single-cell whole-genome amplification. Their WGS and variant analysis enable the robust detection of de novo variations within individual neurons (31, 32). We found that indels at CpG dinucleotides are particularly prominent in Polβ-deficient clones, resulting in the gain and loss of CpG sites. The indel sites were enriched in regulatory and coding regions of neuronal genes. These results suggest that Polβ repairs DNA lesions generated at CpG sites by TET-mediated active demethylation and thereby suppresses somatic mutagenesis during neuronal gene activation in cortical development.

Results

WGS of Cortical Neuron Nuclei-Derived ntES Cells.

To investigate low-frequency mutagenesis occurring during neurogenesis and neuronal differentiation, we performed SCNT by transferring donor nuclei of cortical neurons, dissociated from E18.5 Emx1Cre/+Polβfl/fl (Emx1-Cre/Polβfl/fl) and littermate embryos, into wild-type enucleated oocytes. We then established seven ntES cell lines (clones A1-3 and B1-4) from two wild-type embryos and eight ntES clones (clones C1-4 and D1-4) from two Emx1-Cre/Polβfl/fl embryos (Fig. 1A and SI Appendix, Fig. S1A). In Emx1-Cre/Polβfl/fl embryos, Polβ is specifically disrupted in neural progenitors of the dorsal telencephalon before the onset of neurogenesis (19). We conducted whole genome sequencing for each clone (coverage range: 34× to 59×). The sequencing depth analysis revealed an aneuploidy event in clone C3, specifically the loss of the Y chromosome. In clone B3, a segment of chromosome 10 (from position 6,148,756 to the telomere) was deleted and subsequently duplicated.

Fig. 1.

Fig. 1.

Somatic SNV and indel analyses in cortical neurons reprogrammed via somatic cell nuclear transfer. (A) Experimental design to investigate mechanisms of somatic mutations during neurogenesis and neuronal differentiation in the cerebral cortex using Emx1-Cre/Polβfl/fl and Polβfl/fl mice. (B) Mean number of somatic SNVs in Emx1-Cre/Polβfl/fl, Polβfl/fl ntES cell lines, and adult HSCs (33). (C) Mean number of somatic insertions and deletions in Emx1-Cre/Polβfl/fl, Polβfl/fl ntES cells, and adult HSCs (33). (D) Number of indels in nonrepeat sequences in each Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cell. (E) Mean number of insertions and deletions in nonrepeat sequences of Emx1-Cre/Polβfl/fl, Polβfl/fl ntES cells, and adult HSCs (33). (F) Distribution of indel lengths in nonrepeat sequences of Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (G) Mean indel size in nonrepeat sequences of Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (H) Number of indels in tandem repeats in each Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells.

De novo SNV and indel mutations were called using the Genome Analysis Toolkit (GATK) HaplotypeCaller (34). De novo mutation candidates were identified based on variant allele frequency (VAF) (31, 32). If a heterozygous mutation occurs in a neuron, the expected value of VAF in ntES cells is 50%. To exclude mutations arising during ntES cell establishment and culturing, we applied a threshold. Unique mutations were defined as those with a VAF >30% in a clone and <1% in sister clones, taking sequence coverage heterogeneity into account (SI Appendix, Fig. S1B). In addition to unique mutations in each clone, multiple shared mutations were also observed among some sister clones (SI Appendix, Fig. S1 C and D). The observed SNV counts in the seven wild-type cells (E18.5: 57 to 146, 82 ± 31.2/cell) are largely consistent with those reported in seven ntES cells (3 to 18 wk old: 50 to 112, 68.5 ± 22.9/cell) derived from mouse juvenile and adult olfactory neurons (32). In contrast, the SNV count in mouse adult clonal hematopoietic stem cells (HSCs; 16 wk old: 55 ± 16.4/cell), analyzed in the present study, was slightly lower (33). Moreover, compared to mouse postzygotic cells during early embryogenesis (1.0 SNVs per cell division) (31), cortical neurons—which undergo approximately 11 cell divisions during neurogenesis—exhibited a higher mutation rate (5 to 13 SNVs per cell division) (35). This higher rate is consistent with that observed in olfactory neurons (4 to 10 SNVs per cell division) (32) and is similar to that reported for human clonal neural progenitor cells (8.6 SNVs per cell division) (36).

In the following comparative analysis of Polβ-deficient and wild-type cells, shared mutations were excluded since their origin could not be definitively attributed to neurogenesis rather than early embryogenesis (31). The number of unique SNVs ranged from 46 to 140 in the seven wild-type clones and from 34 to 85 in the eight Polβ-deficient clones, respectively (SI Appendix, Fig. S2A and Dataset S1). Although the average number of SNVs per cell was slightly lower in Polβ-deficient clones (62 ± 14) compared to wild-type clones (76 ± 29), the difference was not statistically significant (Fig. 1B). An alternative pathway, such as Polβ-independent long-patch BER (PCNA, FEN1, Polδ/ε, and Polλ), may be sufficient to suppress the accumulation of somatic SNVs, maintaining normal levels in response to stochastic DNA damage in developing neurons (13, 37).

Loss of Polβ Frequently Induces Indel Mutations in Neurons.

The number of indels, particularly insertions, was higher in wild-type (123 ± 14/cell; range: 96 to 142) and Emx1-Cre/Polβfl/fl (111 ± 24/cell; range: 93 to 149) neurons than in adult HSCs (58 ± 12/cell) (Fig. 1C, SI Appendix, Fig. S2 B and C, and Dataset S2) (33). To investigate the mechanism of indel formation, sites were classified as nonrepeat, tandem repeat, and homopolymer. The most pronounced differences between wild-type and Polβ-deficient cells were found in nonrepeat sequences (Fig. 1 D and E). Insertions and deletions in nonrepeat sequences increased fourfold and 2.5-fold, respectively, in Emx1-Cre/Polβfl/fl cells (insertions: 7.3 ± 3.2/cell; deletions: 6.8 ± 2.9/cell) compared to wild-type cells (insertions: 1.7 ± 1.1/cell; deletions: 2.7 ± 1.1/cell). Polβ-deficient neurons frequently exhibited insertions ≥4 bp (Fig. 1F). The average insertion size was significantly larger in Polβ-deficient neurons (7.3 ± 3.7 bp) than in wild-type neurons (4.2 ± 3.5 bp, P = 0.015), whereas deletion sizes were comparable (wild-type: 4.9 ± 5.4 bp, Polβ-deficient: 5.1 ± 5.1 bp) (Fig. 1G). In contrast, adult HSCs displayed very few indels in nonrepeat sequences (Fig. 1E) (33). X-ray-induced DSBs efficiently promote deletions but not insertions (Fig. 1E) (33). These findings suggest that, in addition to error-prone DSB repair (38), Polβ mediates an insertion-prone repair mechanism in developing neurons.

The majority of indels occurred within homonucleotide runs and tandem repeat sequences (SI Appendix, Fig. S2 D and E). Although HSCs typically exhibit a similar number of insertions and deletions (33), cortical neurons exhibited a higher number of insertions than deletions in repeat sequences (SI Appendix, Fig. S2E). Notably, the number of insertions, but not deletions, within tandem repeat sequences per clone significantly decreased in a subset of Polβ-deficient cells (Fig. 1H). Indels in homopolymers were also comparable between wild-type and Polβ-deficient cells (SI Appendix, Fig. S2F). Thus, Polβ may selectively promote the expansion of tandem repeat sequences, though no significant multiunit of repeat expansion was observed. BER has been implicated in trinucleotide repeat expansions associated with oxidative damage in somatic cells (39, 40). However, the specific DNA polymerases responsible for this process in vivo remain unknown. In neurons, Polβ may play a role in this process.

Absence of Polβ Induces Indel Mutations at CpG Sites.

Forebrain development promotes cytosine demethylation at CpG dinucleotide sites (41, 42). To determine whether the loss of Polβ affects genome stability at CpG sites during active DNA demethylation, we analyzed CpG site frequency within ±100 bp of insertion sites. CpG sites were enriched at the insertion sites in Polβ-deficient cells, whereas no such accumulation was observed in wild-type cells (SI Appendix, Fig. S3A). CpG sites were significantly more frequent at the insertion sites (±2 bp) in Polβ-deficient cells (40/60) than in wild-type cells (1/12) (Fig. 2A). The frequency of insertions with CpG sites within ±10 bp was also significantly higher in Polβ-deficient cells (mean: 6.9/cell), showing a threefold to ninefold increase across biological replicates compared with wild-type cells (0.4/cell) (Fig. 2B and SI Appendix, Fig. S3 B and C). In some cases, multiple CpG sites were observed near the insertion sites (Fig. 2C). Furthermore, all insertions in the Polβ-deficient cells (60/60) and a substantial proportion in wild-type cells (9/12) exhibited sequence identity with subsequent downstream sequences, indicating duplications (Fig. 2C). Duplicated sequences in Polβ-deficient cells tended to have one additional CpG site (Fig. 2D).

Fig. 2.

Fig. 2.

Loss of Polβ increases indel frequency in nonrepeat sequences rather than repeat sequences. (A) Total number of insertions with CpG sites within ±2 bp of the insertion sites in nonrepeat sequences of Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (B) Mean number of insertions near CpG sites (within ±10 bp) in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (C) Examples of insertions in the coding region or 5′ UTR of Emx1-Cre/Polβfl/fl ntES cells. Upper and Lower rows show the reference sequence and mutated sequence, respectively. Bold text indicates CpG sites, red characters indicate inserted sequences, and underlined text indicates duplicated sequences. (D) Number of CpG sites gained and lost in indels near CpG site (within ±10 bp) in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. The numbers above the bar indicate percentages. (E) Total number of deletions with CpG sites within ±2 bp of the deletion sites in nonrepeat sequences of Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (F) Mean number of deletions near CpG sites (within ±10 bp) in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (G) Examples of deletions in the coding region of Emx1-Cre/Polβfl/fl ntES cells. Upper and Lower rows show the reference sequence and mutated sequence, respectively. Bold text indicates CpG sites, red characters indicate deleted sequences, and underlined text represents microhomology observed in the junction sites. (H) Length distribution of microhomology observed at deletion junctions in nonrepeat sequences.

Similar to insertions, CpG sites were significantly enriched at the deletion sites (±2 bp) in Polβ-deficient cells (26/52), in contrast to wild-type cells (4/19) (Fig. 2E). Polβ-deficient cells also exhibited a significantly higher frequency of deletions with CpG sites within ±10 bp of the deletion sites (mean: 4.8/cell), representing a ~fivefold increase over wild-type cells (0.9/cell, P = 7.4 × 10−3) (Fig. 2F and SI Appendix, Fig. S3 D and E). Loss of CpG sites by the deletions was increased in Polβ-deficient cells (Fig. 2D). These deletions were frequently associated with microhomology at the junction sequences in both Polβ-deficient (36/52) and wild-type (12/19) cells (Fig. 2 G and H). These findings suggest that Polβ deficiency leads to increased DSB formation at CpG sites. The accumulation of DSBs may overwhelm error-free repair pathways, thereby promoting error-prone repair through NHEJ and microhomology-mediated end-joining (MMEJ) (Fig. 5F) (38). Polβ may prevent mutagenesis at CpG sites during active DNA demethylation.

Fig. 5.

Fig. 5.

Somatic indels in nonrepeat sequences affect neuronal genes. (A) Genomic distribution of indel sites in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (B) Top six Gene Ontology biological processes associated with genes containing intragenic mutations in nonrepeat sequences in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (C) Number of indels near CpG sites located in feDMRs and CG-DMRs in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (D) Top five transcription factor binding motifs enriched within ±200 bp of indel sites. (E) Indels detected within the coding regions in Emx1-Cre/Polβfl/fl ntES cells. (F) Schematic model illustrating Polβ-mediated indel and SV formation driven by active DNA demethylation during cortical neurogenesis.

To determine whether these characteristic indels arise during neurogenesis, a process involving active DNA demethylation (42), we analyzed mutations in ntES cells derived from E18.5 NexCre/+/Polβfl/fl (Nex-Cre/Polβfl/fl) neuron nuclei. In previous study, we reported that Nex-Cre/ Polβfl/fl neurons exhibit Polβ loss after the final mitosis but do not significantly accumulate DSBs in the developing cortex, unlike Emx1-Cre/ Polβfl/fl neurons (SI Appendix, Fig. S4A) (18, 19). Our results showed that the number of indels at CpG sites in these Polβ-deleted clones (clones F1-4) was comparable to that in wild-type clones (clone E1-4; SI Appendix, Fig. S4 BF and Dataset S3). However, clone F2 cells exhibited a slight increase in insertions at CpG sites, similar to clones derived from Emx1-Cre/Polβfl/fl neuron nuclei (SI Appendix, Fig. S4C). In this case, Cre expression may have been induced earlier than basal Nex promoter activity, disrupting the Polβ gene in neural progenitors. These findings suggest that Polβ loss contributes to indels at CpG sites during neurogenesis. Furthermore, this experiment also indicates that the observed results are not due to the effects of Polβ loss in the donor nuclei on DNA demethylation during SCNT reprogramming or the subsequent culture of ntES cells (14, 43). Whether Polβ is involved in the process of DNA demethylation during SCNT reprogramming and ntES cell establishment remains unclear. Presumably, other DNA polymerases may compensate for the loss of Polβ in this process.

Polβ Deficiency Alters Mutation Signatures.

Signature analysis using the COSMIC database (44) identified a distinctive feature of indels Polβ-deficient cells: insertions exceeding 5 bp in nonrepeat sequences, predominantly duplications (Fig. 3C). Control Polβfl/fl cells exhibited a signature with 35% ID5, 33% ID8, and 32% ID10 (Fig. 3D and SI Appendix, Fig. S5A). In Polβ-deficient cells, the signature is predominantly associated with ID10. ID10, characterized by duplications over 5 bp, has been identified in certain cancers with an unknown mechanism. This finding suggests that Polβ deficiency preferentially induces ID10, which may provide insight into a novel mechanism. ID5 and ID8, previously reported in human neurons, are known to accumulate in a clock-like manner (3, 4). Meanwhile, single base substitution (SBS) analysis indicated a higher prevalence of C > T mutations, likely resulting from methylated cytosine deamination, in both wild-type and Polβ-deficient cells (Fig. 3A). This pattern is consistent with previous observations in human neurons and mouse HSCs but diverges from the substitution signature identified in mouse adult tail fibroblasts using ntES cell lines (4, 31, 33). The mutational profile in Polβ-deficient cells was distinct from that of wild-type cells, primarily characterized by SBS40 and SBS18 (Fig. 3B). Importantly, SBS40 has been linked to SV formation (45) and increases in a dose-dependent manner upon X-ray irradiation in HSCs (33). SBS18, commonly observed in neuroblastoma (45, 46), is particularly relevant given that Polβ deficiency promotes medulloblastoma in p53 null mice (47). These findings suggest that Polβ loss alters both ID and SBS signatures in cortical neurons, likely due to the engagement of alternative repair pathways, such as NHEJ, MMEJ, and long-patch BER (13, 37, 38).

Fig. 3.

Fig. 3.

Somatic mutation signatures in cortical neurons. (A) Trinucleotide context of somatic substitutions in Emx1-Cre/Polβfl/fl (535 substitutions, n = 8 cells) and Polβfl/fl (494 substitutions, n = 7 cells) ntES cells. (B) Contribution of COSMIC SBS signatures in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (C) Mutation spectrum of somatic indels in nonrepeat sequences of Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells. (D) Contribution of COSMIC ID signatures in Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cells.

Loss of Polβ Induces SVs but Not MEIs.

Increased DSBs are known to induce SV formation (48). Additionally, DNA demethylation regulates the induction of mobile element insertions (MEIs), such as long interspersed elements (LINEs) and short interspersed elements (SINEs), which contribute to somatic mutations linked to psychiatric disorders (2, 6). To investigate this further, we analyzed SVs and MEIs in ntES cells using multiple detection tools (SvABA, Manta, and RUFUS) (4951). We identified 19 SVs, comprising 16 large deletions and three duplications, in Polβ-deficient cells (size 50 bp to 93 kbp, 2.3 ± 1.4 per cell). This represents an ~fivefold increase over wild-type cells, in which only three variants (size: 5 kbp to 40 kbp, two deletions and one duplication) were detected (0.43 ± 0.79 per cell) (Fig. 4A and SI Appendix, Fig. S6). This finding suggests a critical role of Polβ in the suppression of SV formation. The junction sequences of these SVs exhibited microhomology, with no significant enrichment of CpG sites (Fig. 4B). More than half of SVs were located in intragenic regions, potentially affecting regulatory elements and coding sequences of genes such as Tsn and Dnmt3a (Fig. 4 A and B). In contrast, the frequency of MEIs did not significantly differ between Polβ-deficient cells (1.5 ± 1.4 per cell; range: 0 to 4) and wild-type cells (1.4 ± 1.4 per cell; range: 0 to 5) (Fig. 4C and SI Appendix, Fig. S7). Thus, Polβ deficiency promotes SV formation but does not significantly impact MEI frequency, likely through DSB-induced repair via NHEJ or MMEJ in cortical neurons.

Fig. 4.

Fig. 4.

Loss of Polβ increases the number of SVs but not MEIs. (A) Number of somatic SVs in each Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cell. (B) Examples of SV structures detected in the intragenic regions of Emx1-Cre/Polβfl/fl ntES cells. The Top, Middle, and Bottom rows indicate the reference gene structure, the position of the deletion or duplication regions (gray), and the altered gene structure, respectively. Black and blue characters show the 5′ and 3′ junction sequences of SVs, respectively, flanked by primer sets (arrows). Microhomology, insertion, and deletion sequences are indicated in red, green, and gray, respectively. (C) Number of somatic LINE and SINE insertions in each Emx1-Cre/Polβfl/fl and Polβfl/fl ntES cell.

Polβ-Mediated Indels Affect Both Regulatory and Coding Regions in Neuronal Genes.

In human neurons, mutations are frequently observed in actively transcribed neuronal genes with open chromatin (3, 52, 53). We analyzed the functional impact of somatic indels in nonrepeat sequences. In Polβ-deficient cells, indels were enriched in putative regulatory regions within the gene bodies, particularly in the 5′ and 3′ untranslated regions (UTRs) and introns (Fig. 5A). Moreover, gene ontology analysis showed enrichment of genes involved in synaptic signaling and neuronal morphogenesis (Fig. 5B). We further analyzed the positional relationship between indels and differentially methylated CpG regions (CG-DMRs) in developing mouse fetus using previously reported data (54). Notably, indel sites were predominantly localized within CG-DMRs, with a subset found in fetal enhancer-linked CG-DMRs (feDMRs), which are associated with putative transcriptional regulatory regions during forebrain development (Fig. 5C) (54). To identify transcription factors that bind to these regions, we performed motif analysis using HOMER (55). This analysis revealed an enrichment of DNA binding sites for key cortical development factors, such as LHX2 and EMX2 (56) (Fig. 5D). This suggests that the indels may influence neuronal gene expression during cortical development. Furthermore, Polβ-deficient cells exhibited one insertion and five deletions in coding regions (5/8 cells), whereas no such events were detected in wild-type cells (Fig. 5E). These indels caused in-frame alterations, resulting in amino acid insertions or deletions, as well as frameshift mutations. Of particular interest, they affected neuronal genes, including Ahi1 (implicated in Joubert syndrome), Tox3 (a calcium-dependent transactivator), and Htr5a (a serotonin receptor), all of which are implicated in neuronal function and psychiatric disorders (5759). Thus, Polβ-mediated somatic indels occurring during cortical development impact multiple neuronal genes through both regulatory and coding alterations, which may affect individual neuronal function.

Discussion

In this study, we investigated somatic mutations in immature cortical neurons using SCNT and WGS. Our results reveal that Polβ deficiency in neural progenitor cells leads to a ~ninefold increase in indels at CpG dinucleotides and a ~fivefold rise in SVs. These indels were enriched in neuronal genes, with some leading to frameshift mutations or alterations of CpG sites in putative transcriptional regulatory regions. Our findings suggest that Polβ preferentially repairs DNA lesions at CpG sites generated by TET-mediated active DNA demethylation, thereby suppressing the mutagenesis that accompanies neuronal gene activation during cortical development (Fig. 5F).

While Polβ loss in neural progenitor cells has been linked to increased DSB formation and apoptosis during neurogenesis (19), the precise mutation spectrum in neurons remained unclear. We reveal that CpG-specific indels are a hallmark of Polβ-deficient immature neurons, suggesting that these mutations predominantly arise during neurogenesis. This finding supports the hypothesis that TET-dependent active CpG demethylation contributes to mutagenesis via its interaction with BER, a process previously uncharacterized in vivo (11, 14, 60). During differentiation, active DNA demethylation may occur simultaneously at multiple CpG sites within DMRs, requiring efficient repair. If the canonical Polβ-dependent BER pathway is impaired, an alternative pathway, long-patch BER, may be insufficient, leading to DSB formation (13, 23, 37, 61). In such cases, error-prone DSB repair pathways, such as NHEJ and MMEJ, may introduce indels (12, 38). Our data suggest that the predominant repair mechanism for active DNA demethylation-associated lesions in neural progenitor cells relies on Polβ.

A striking feature of the observed indels is the frequent occurrence of duplications in addition to deletions. This contrasts with random DSBs induced by X-ray irradiation, which primarily result in deletions (33). Interestingly, a similar phenomenon is observed in the AID-dependent immunoglobulin hypermutation process, where Polλ promotes duplications in Polβ-deficient B cells (62). This raises the possibility that Polλ-dependent duplications also occur during neurogenesis, which warrants further investigation. Given the parallels between AID-mediated mutagenesis in B cells and active DNA demethylation during neurogenesis in Polβ-deficient cells, it is plausible that concurrent SSBs at adjacent cytosines leading to DSBs, coupled with Polλ-mediated NHEJ, contribute to duplications. Meanwhile, the frequent occurrence of deletions and SVs suggests that DSBs might arise simultaneously at multiple CpG sites within DMRs. Indeed, SVs were frequently observed within genes, and the nature of mutations may depend on the status of DSB termini and the activity of end-processing factors (38).

The accumulation of indels at CpG sites within DMRs during development suggests that genomic instability in regulatory regions of cell type–specific genes is selectively and transiently elevated. Variations in CpG sites within transcriptional regulatory regions can directly impact gene expression, and such mutations may contribute to neurodevelopmental disorders (63). Supporting this notion, WGS studies of postmortem schizophrenia brains have reported increased CpG > GpG transversions at transcription factor binding sites, possibly linked to active DNA demethylation (64). Consistent with this, we observed a slight increase in CpG > GpG transversions in Polβ-deficient cells, although the trend was not statistically significant (P = 0.069, SI Appendix, Fig. S5B). Further analysis with a larger sample size is needed to confirm this trend; however, our findings provide insights into the molecular mechanisms underlying these mutations.

Our results further suggest that Polβ may be linked to an increased mutation burden during human cortical development, a phenomenon known as brain mosaicism. Active DNA demethylation at neuronal enhancer sites induces SSBs in cultured human neurons, suggesting a mechanism through which CpG-site instability may arise (6567). We analyzed previously published single-cell WGS data from postmortem human brain samples and found that somatic indels occurring near CpG sites within ±10 bp were most frequent in infant neurons (28.5%, 6/21 indels) compared to adolescence (11.2%, 25/222) and adulthood (15.1%, 348/2,538) (53). Moreover, most of these indels (5/6) in infant neurons were localized in intron regions, suggesting that they may have arisen via active DNA demethylation during neuronal differentiation. While no direct link between Polβ and brain disorders has been established, these findings suggest that active DNA demethylation-induced indel formation at CpG sites may be a conserved mutagenic mechanism in humans.

SCNT-based approaches are effective for analyzing de novo mutations in somatic cells (31, 32). However, when investigating Polβ-deficient cells, global DNA demethylation during SCNT reprogramming must be considered (14, 43). To determine whether Polβ is required for reprogramming, we used Polβ-deficient donor nuclei from Nex-Cre/Polβfl/fl neurons with wild-type enucleated recipient cells. If Polβ were essential, we would expect increased indels at CpG sites in Polβ-deficient ntES cells, independent of Cre-mediated deletion timing. However, indel levels were comparable to those in wild-type cells, suggesting that Polβ is not critical for SCNT reprogramming (SI Appendix, Fig. S4). Alternatively, maternally supplied Polβ may be sufficient. Thus, the detected CpG site indels in ntES cells derived from Emx1-Cre/Polβfl/fl neurons likely reflect mutagenesis occurring in neural progenitor cells during neurogenesis rather than during reprogramming.

Detecting rare indels in homopolymers and tandem repeats remains challenging (3, 68). In this study, we found that Polβ deficiency specifically reduced insertions in tandem repeats but did not affect deletions (Fig. 1H). Moreover, homopolymers remained unaffected (SI Appendix, Fig. S2F). Given that somatic tandem repeat expansion under oxidative conditions requires mismatch repair and BER, Polβ may play a role in this process (40, 69). Somatic expansion of tandem repeats is associated with age-related neurodegeneration and psychiatric disorders (39, 40, 70). The impact of Polβ loss on tandem repeat stability underscores the need for long-term studies to clarify its function in neurons throughout life.

Our findings reveal that Polβ-mediated repair of active DNA demethylation is essential for maintaining genome stability in developing neurons, highlighting its potential involvement in neurodevelopmental disorders. Loss of Polβ triggers a ~ninefold increase in indels at actively demethylated CpG sites, which alter coding or regulatory sequences in neuronal genes. By demonstrating that active DNA demethylation can become mutagenic unless swiftly repaired, our work provides a mechanism that links epigenetic gene activation to the somatic mutations detected in various brain disorders. A deeper understanding of the interplay between epigenome regulation and DNA repair across development and aging is critical for identifying risk factors and therapeutic targets (52, 53, 70, 71).

Materials and Methods

Animals.

All experiments were conducted in accordance with the guidelines for the care and use of laboratory animals of Osaka University and University of Yamanashi. Emx1-Cre/Polβfl/fl and Nex-Cre/Polβfl/fl mice were generated as previously described (19). B6D2F1 mice (Japan SLC) were used as oocyte donors. Both male and female mice were included in all experiments. Noon of the day on which the vaginal plug was detected was designated as embryonic day 0.5 (E0.5).

Preparation of Dissociated Cortical Neuronal Cells.

Pregnant mice were deeply anesthetized with isoflurane, and neocortices were dissected from E18.5 embryos in ice-cold Hanks’ balanced salt solution. The neocortices were minced using fine scissors in phosphate-buffered saline (PBS; pH 7.4), and the tissue fragments were incubated in PBS containing 0.125% trypsin and 0.02% ethylenediaminetetraacetic acid (EDTA) at 37 °C for 5 min. To stop trypsin activity, DMEM/F-12 culture medium supplemented with 10% fetal bovine serum (FBS) was added. The digested tissue was then gently triturated using a fire-polished Pasteur pipette. After centrifugation, the cells were resuspended in CellBanker 1Plus freezing medium (Takara Bio) and stored at −80 °C until nuclear transfer.

Nuclear Transfer and Establishment of ntES Cell Lines.

Nuclear transfer was performed as previously described (29). Neurons dissociated from E18.5 Emx1-Cre/Polβfl/fl, Nex-Cre/Polβfl/fl, and Polβfl/fl cortices were used as nuclear donors. Following nuclear transfer, reconstructed oocytes were activated using 5 mM SrCl2 in Ca-free CZB medium in the presence of 5 μM latrunculin A and 50 nM trichostatin A for 9 h. After three washes in CZB, cloned embryos were cultured for 4 d in the same medium, and morula- or blastocyst-stage embryos were used to establish ntES cell lines as previously described (30).

WGS Analysis.

High-molecular-weight genomic DNA was extracted from ntES cells using the DNeasy Blood & Tissue Kit (Qiagen). Extracted DNA was used to prepare paired-end libraries according to the Illumina Sample Preparation Kit user manual, without PCR amplification. The libraries were sequenced using the Illumina NovaSeq platform with 150 bp paired-end reads. The samples had an average total read depth of 30× per nucleotide. Sequence reads were mapped to a mouse reference genome (mm10) using the Burrows–Wheeler aligner with the maximal exact matches (BWA-MEM) algorithm.

Mapping and Variant Calling.

We used BWA-MEM v0.7.17 with the “–M” option for Picard compatibility to map sequence reads to the mouse reference genome (UCSC mm10). Subsequently, we performed deduplication and base-quality recalibration using Picard v2.18.26. For SNV and small indel calling, to minimize false variant calls, we exclusively used only highly reliable (HR) reads meeting the following conditions were extracted: 1) properly mapped according to the aligner, 2) mapping with a quality score of ≥60 using SAMtools-1.9 (samtools view -q 60 -f 0 × 2 -F 0 × 500) (72), and 3) mapping to the reference without clipping. To ensure accurate identification and comparison of somatic mutations across all samples (31, 73), we defined effective whole-genome coverage (EWC) regions that satisfied the following criteria: 1) MQ60 read depths within 50 to 300% of the peak coverage for each chromosome, 2) a depth ratio of MQ60 reads to all mapped reads of ≥80% at each site, and 3) a minimum base quality of 20 according to SAMtools. These EWC regions were initially defined for each sample. Subsequently, the regions shared among all the samples were used for mutation analyses, covering 82.9% and 80.7% of diploid autosomes in Emx1-Cre/Polβfl/fl and Nex-Cre/Polβfl/fl mice, respectively (SI Appendix, Figs. S8 and S9). Genomic variants, compared with the mouse reference genome, were called using GATK v4.1.0.0 HaplotypeCaller (34). De novo mutation candidates were identified by filtering variants with the following criteria: a variant allele frequency >30% in a cell clone and <1% in sister cell clones. Validation of candidates involved visual inspection using Integrative Genomics Viewer (IGV) and verified variants were defined as de novo somatic mutations. Indel variants were divided into two groups: those occurring in repeat sequences (where the same motif appeared ≥5 times) and those in other nonrepeat sequences. ToppGene Suite (https://toppgene.cchmc.org) was employed to characterize biological processes associated with identified somatic SNVs and indels.

Mutation calling for SVs and MEIs was performed using SvABA v1.0.1, Manta v1.6.0, and RUFUS (4951). In SvABA, variants were called based on the following inclusion criteria: 1) an allele depth (AD) > 5 in a clone and 2) an AD of 0 or 1 in all clones from the other clones. In Manta, the variants were called based on the following inclusion criteria: 1) a split reads for alternative allele (SRA) value ≥ 4 in a clone and 2) an SRA of 0 in all clones from the other mice. RUFUS, run with one clone as the subject and the others as controls, called variants based on the following criteria: 1) SVTYPE INFO in a clone, 2) other INFO with ALT_LEN ≥ 30 in a clone, or 3) other INFO with REF_LEN ≥ 30 in a clone. All identified candidate variants in neuron-derived clones were manually validated using IGV, with the analysis not limited to EWC regions.

Mutation Signatures.

Mutational signature analysis and visualization of single-base substitutions within the EWC regions were performed using SigProfilerMatrixGenerator v1.2.2 (74) and SigProfilerExtractor v1.1.4, with the parameters “cosmic_version = 3.2,” “reference_genome = mm10,” “opportunity_genome = mm10,” and “nmf_replicates = 100” (44). Mutational signatures for each sample were categorized through hierarchical clustering using Ward’s minimum variance method (Ward. D2) employing the heatmap.2 function from the “gplots” package in R. Motif analysis for indel sites was performed with HOMER with its database of known transcription factor motifs (55). The enrichment of feDMRs and CG-DMRs at indel sites was analyzed using DeepTools and ENCODE epigenome data: CG-DMRs (thumper-e1.ENCODE_mouse_tissues.DMR_CG_ENCODE_4DMS) and feDMR (thumper-e1.ENCODE_mouse_tissues.feDMR_FB). These datasets were obtained from the ENCODE Mouse Fetal Development project (http://neomorph.salk.edu/ENCODE_mouse_fetal_development.html) (54, 75).

Statistical Analysis.

For each experiment, the number of samples analyzed is provided. Welch’s t test was used to determine significant differences, and summary statistics are presented as means ± SD. All data analyses were performed using Microsoft Excel and GraphPad Prism.

Declaration of Generative AI and AI-Assisted Technologies in the Writing Process.

During the preparation of this manuscript, the authors used ChatGPT4 and DeepL exclusively for grammatical error correction and stylistic changes. The authors thoroughly reviewed and edited the content following the use of these tools and take full responsibility for the final content of the publication.

Supplementary Material

Appendix 01 (PDF)

Dataset S01 (XLSX)

pnas.2506846122.sd01.xlsx (127.3KB, xlsx)

Dataset S02 (XLSX)

pnas.2506846122.sd02.xlsx (413.4KB, xlsx)

Dataset S03 (XLSX)

pnas.2506846122.sd03.xlsx (19.3KB, xlsx)

Acknowledgments

This work was supported by JSPS KAKENHI Grant No. 20K06854, 24K09644 to N.S., 22K11775 to M.H. and N.S., 24K03096, 24H02061 to A.U., 22H05498, 23H00390, 24H01237, 25H02501 to T.Y., and Canon Foundation (M20-0008) to T.W. We thank Dr. Noritaka Adachi for useful suggestions. We also thank Dr. Nobuhiko Yamamoto for continuous encouragement.

Author contributions

N.S., A.U., T.W., and T.Y. designed research; N.S., R.M., H.N., S.F., S.M., M.H., S.W., and T.W. performed research; N.S., A.U., R.M., H.N., M.T., Y.S., T.W., and T.Y. analyzed data; and N.S., A.U., T.W., and T.Y. wrote the paper.

Competing interests

The authors declare no competing interest.

Footnotes

This article is a PNAS Direct Submission.

Data, Materials, and Software Availability

Raw sequencing data have been deposited in the DNA Data Bank of Japan Sequence Read Archive (DDBJ‐SRA) under accession numbers (BioProject: PRJDB20767) (76). Data are publicly available at https://ddbj.nig.ac.jp/sra.

Supporting Information

References

  • 1.McConnell M. J., et al. , Mosaic copy number variation in human neurons. Science 342, 632–637 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Evrony G. D., et al. , Single-neuron sequencing analysis of L1 retrotransposition and somatic mutation in the human brain. Cell 151, 483–496 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Luquette L. J., et al. , Single-cell genome sequencing of human neurons identifies somatic point mutation and indel enrichment in regulatory elements. Nat. Genet. 54, 1564–1571 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Abascal F., et al. , Somatic mutation landscapes at single-molecule resolution. Nature 593, 405–410 (2021). [DOI] [PubMed] [Google Scholar]
  • 5.Rodin R. E., et al. , Author correction: The landscape of somatic mutation in cerebral cortex of autistic and neurotypical individuals revealed by ultra-deep whole-genome sequencing. Nat. Neurosci. 24, 611 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Bundo M., et al. , Increased L1 retrotransposition in the neuronal genome in schizophrenia. Neuron 81, 306–313 (2014). [DOI] [PubMed] [Google Scholar]
  • 7.Chung C., et al. , Comprehensive multi-omic profiling of somatic mutations in malformations of cortical development. Nat. Genet. 55, 209–220 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Poduri A., et al. , Somatic activation of AKT3 causes hemispheric developmental brain malformations. Neuron 74, 41–48 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wei P. C., et al. , Long neural genes harbor recurrent DNA break clusters in neural stem/progenitor cells. Cell 164, 644–655 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wang M., et al. , Increased neural progenitor proliferation in a hiPSC model of autism induces replication stress-associated genome instability. Cell Stem Cell 26, 221–233.e6 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Caldecott K. W., Ward M. E., Nussenzweig A., The threat of programmed DNA damage to neuronal genome integrity and plasticity. Nat. Genet. 54, 115–120 (2022). [DOI] [PubMed] [Google Scholar]
  • 12.Mckinnon P. J., Maintaining genome stability in the nervous system. Nat. Neurosci. 16, 1523–1529 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wilson S. H., et al. , DNA polymerase β and mammalian base excision repair. Cold Spring Harb. Symp. Quant. Biol. 65, 143–156 (2000). [DOI] [PubMed] [Google Scholar]
  • 14.Wu H., Zhang Y., Reversing DNA methylation: Mechanisms, genomics, and biological functions. Cell 156, 45–68 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sugo N., Aratani Y., Nagashima Y., Kubota Y., Koyama H., Neonatal lethality with abnormal neurogenesis in mice deficient in DNA polymerase β. EMBO J. 19, 1397–1404 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sugo N., Niimi N., Aratani Y., Takiguchi-Hayashi K., Koyama H., P53 deficiency rescues neuronal apoptosis but not differentiation in DNA polymerase β-deficient mice. Mol. Cell Biol. 24, 9470–9477 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sugo N., et al. , Decreased PARP-1 levels accelerate embryonic lethality but attenuate neuronal apoptosis in DNA polymerase β-deficient mice. Biochem. Biophys. Res. Commun. 354, 656–661 (2007). [DOI] [PubMed] [Google Scholar]
  • 18.Uyeda A., et al. , Suppression of DNA double-strand break formation by DNA polymerase β in active DNA demethylation is required for development of hippocampal pyramidal neurons. J. Neurosci. 40, 9012–9027 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Onishi K., et al. , Genome stability by DNA polymerase β in neural progenitors contributes to neuronal differentiation in cortical development. J. Neurosci. 37, 8444–8458 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sobol R. W., et al. , Requirement of mammalian DNA polymerase-β in base-excision repair. Nature 379, 183–186 (1996). [DOI] [PubMed] [Google Scholar]
  • 21.Senejani A. G., et al. , Y265C DNA polymerase beta knockin mice survive past birth and accumulate base excision repair intermediate substrates. Proc. Natl. Acad. Sci. U.S.A. 109, 6632–6637 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sobol R. W., Genome instability caused by a germline mutation in the human DNA repair gene POLB. PLoS Genet. 8, e1003086 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sobol R. W., et al. , Mutations associated with base excision repair deficiency and methylation-induced genotoxic stress. Proc. Natl. Acad. Sci. U.S.A. 99, 6860–6865 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Canitrot Y., et al. , Overexpression of DNA polymerase β in cell results in a mutator phenotype and a decreased sensitivity to anticancer drugs. Proc. Natl. Acad. Sci. U.S.A. 95, 12586–12590 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Niimi N., et al. , Decreased mutant frequency in embryonic brain of DNA polymerase β null mice. Mutagenesis 21, 55–59 (2006). [DOI] [PubMed] [Google Scholar]
  • 26.Allen D., et al. , Mutagenesis is elevated in male germ cells obtained from DNA polymerase-beta heterozygous mice. Biol. Reprod. 79, 824–831 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Senejani A. G., et al. , Mutation of POLB causes lupus in mice. Cell Rep. 6, 1–8 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cabelof D. C., et al. , Base excision repair deficiency caused by polymerase beta haploinsufficiency: Accelerated DNA damage and increased mutational response to carcinogens. Cancer Res. 63, 5799–5807 (2003). [PubMed] [Google Scholar]
  • 29.Wakayama T., Perry A. C. F., Zuccotti M., Johnson K. R., Yanagimachi R., Full-term development of mice from enucleated oocytes injected with cumulus cell nuclei. Nature 394, 369–374 (1998). [DOI] [PubMed] [Google Scholar]
  • 30.Wakayama S., Wakayama T., Improvement of mouse cloning using nuclear transfer-derived embryonic stem cells and/or histone deacetylase inhibitor. Int. J. Dev. Biol. 54, 1641–1648 (2011). [DOI] [PubMed] [Google Scholar]
  • 31.Uchimura A., et al. , Early embryonic mutations reveal dynamics of somatic and germ cell lineages in mice. Genome Res. 32, 945–955 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hazen J. L., et al. , The complete genome sequences, unique mutational spectra, and developmental potency of adult neurons revealed by cloning. Neuron 89, 1223–1236 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Matsuda Y., et al. , Spectra and characteristics of somatic mutations induced by ionizing radiation in hematopoietic stem cells. Proc. Natl. Acad. Sci. U.S.A. 120, e2216550120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Van der Auwera G. A., et al. , From FastQ data to high-confidence variant calls: The genome analysis toolkit best practices pipeline. Curr. Protoc. Bio informatics 43, 11.10.1–11.10.33 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Caviness V. S., Takahashi T., Nowakowski R. S., Numbers, time and neocortical neuronogenesis: A general developmental and evolutionary model. Trends Neurosci. 18, 379–383 (1995). [DOI] [PubMed] [Google Scholar]
  • 36.Bae T., et al. , Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis. Science 359, 550–555 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Fortini P., et al. , Different DNA polymerases are involved in the short- and long-patch base excision repair in mammalian cells. Biochemistry 37, 3575–3580 (1998). [DOI] [PubMed] [Google Scholar]
  • 38.Chang H. H. Y., Pannunzio N. R., Adachi N., Lieber M. R., Non-homologous DNA end joining and alternative pathways to double-strand break repair. Nat. Rev. Mol. Cell Biol. 18, 495–506 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lai Y., Beaver J. M., Laverde E., Liu Y., Trinucleotide repeat instability via DNA base excision repair. DNA Repair. 93, 102912 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kovtun I. V., et al. , OGG1 initiates age-dependent CAG trinucleotide expansion in somatic cells. Nature 447, 447–452 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lister R., et al. , Global epigenomic reconfiguration during mammalian brain development. Science 341, 1237905 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Sanosaka T., et al. , DNA methylome analysis identifies transcription factor-based epigenomic signatures of multilineage competence in neural stem/progenitor cells. Cell Rep. 20, 2992–3003 (2017). [DOI] [PubMed] [Google Scholar]
  • 43.Zhang X., Gao S., Liu X., Advance in the role of epigenetic reprogramming in somatic cell nuclear transfer-mediated embryonic development. Stem Cells Int. 2021, 6681337 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Alexandrov L. B., et al. , The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Thatikonda V., et al. , Comprehensive analysis of mutational signatures reveals distinct patterns and molecular processes across 27 pediatric cancers. Nat. Cancer 4, 276–289 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Alexandrov L. B., et al. , Signatures of mutational processes in human cancer. Nature 2013, 415–421 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kim J., Kim J., Lee Y., DNA polymerase β deficiency in the p53 null cerebellum leads to medulloblastoma formation. Biochem. Biophys. Res. Commun. 505, 548–553 (2018). [DOI] [PubMed] [Google Scholar]
  • 48.Carvalho C. M. B., Lupski J. R., Mechanisms underlying structural variant formation in genomic disorders. Nat. Rev. Genet. 17, 224–238 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wala J. A., et al. , SvABA: Genome-wide detection of structural variants and indels by local assembly. Genome Res 28, 581–591 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Chen X., et al. , Manta: Rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32, 1220–1222 (2016). [DOI] [PubMed] [Google Scholar]
  • 51.Ostrander B. E. P., et al. , Whole-genome analysis for effective clinical diagnosis and gene discovery in early infantile epileptic encephalopathy. npj Genom. Med. 3, 1–10 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Lodato M. A., et al. , Aging and neurodegeneration are associated with increased mutations in single human neurons. Science 359, 555–559 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ganz J., et al. , Contrasting somatic mutation patterns in aging human neurons and oligodendrocytes. Cell 187, 1955–1970.e23 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.He Y., et al. , Spatiotemporal DNA methylome dynamics of the developing mouse fetus. Nature 583, 752–759 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Heinz S., et al. , Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Appan D., Hsu S. M., Hsu W. H., Chou S. J., Patterning the cerebral cortex into distinct functional domains during development. Curr. Opin. Neurobiol. 80, 102698 (2023). [DOI] [PubMed] [Google Scholar]
  • 57.Ferland R. J., et al. , Abnormal cerebellar development and axonal decussation due to mutations in AHI1 in Joubert syndrome. Nat. Genet. 36, 1008–1013 (2004). [DOI] [PubMed] [Google Scholar]
  • 58.Yuan S. H., Qiu Z., Ghosh A., TOX3 regulates calcium-dependent transcription in neurons. Proc. Natl. Acad. Sci. U.S.A. 106, 2909–2914 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Yosifova A., et al. , Case-control association study of 65 candidate genes revealed a possible association of a SNP of HTR5A to be a factor susceptible to bipolar disease in Bulgarian population. J. Affect. Disord. 117, 87–97 (2009). [DOI] [PubMed] [Google Scholar]
  • 60.Weber A. R., et al. , Biochemical reconstitution of TET1–TDG–BER-dependent active DNA demethylation reveals a highly coordinated mechanism. Nat. Commun. 7, 1–13 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Burak M. J., Guja K. E., Hambardjieva E., Derkunt B., Garcia-Diaz M., A fidelity mechanism in DNA polymerase lambda promotes error-free bypass of 8-oxo- dG. EMBO J. 35, 2045–2059 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Hao Q., et al. , DNA repair mechanisms that promote insertion-deletion events during immunoglobulin gene diversification. Sci. Immunol. 8, ade1167 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Wittkopp P. J., Kalay G., Cis-regulatory elements: Molecular mechanisms and evolutionary processes underlying divergence. Nat. Rev. Genet. 13, 59–69 (2011). [DOI] [PubMed] [Google Scholar]
  • 64.Maury E. A., et al. , Somatic mosaicism in schizophrenia brains reveals prenatal mutational processes. Science 386, 217–224 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Wang D., et al. , Active DNA demethylation promotes cell fate specification and the DNA damage response. Science 378, 983–989 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Wu W., et al. , Neuronal enhancers are hotspots for DNA single-strand break repair. Nature 593, 440–444 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Reid D. A., et al. , Incorporation of a nucleoside analog maps genome repair sites in postmitotic human neurons. Science 372, 91–94 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Tanudisastro H. A., Deveson I. W., Dashnow H., MacArthur D. G., Sequencing and characterizing short tandem repeats in the human genome. Nat. Rev. Genet. 25, 460–475 (2024). [DOI] [PubMed] [Google Scholar]
  • 69.Lokanga R. A., Senejani A. G., Sweasy J. B., Usdin K., Heterozygosity for a hypomorphic Polβ mutation reduces the expansion frequency in a mouse model of the fragile X-related disorders. PLoS Genet. 11, e1005181 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Handsaker R. E., et al. , Long somatic DNA-repeat expansion drives neurodegeneration in Huntington’s disease. Cell 188, 623–639.e19 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Dileep V., et al. , Neuronal DNA double-strand breaks lead to genome structural variations and 3D genome disruption in neurodegeneration. Cell 186, 4404–4421.e20 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Li H., et al. , The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Satoh Y., et al. , Characteristics of induced mutations in offspring derived from irradiated mouse spermatogonia and mature oocytes. Sci. Rep. 10, 1–13 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Bergstrom E. N., et al. , SigProfilerMatrixGenerator: A tool for visualizing and exploring patterns of small mutational events. BMC Genomics 20, 1–12 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ramírez F., et al. , Deeptools2: A next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Sugo N., Whole genome analysis of somatic mutations in ntES cells derived from E18.5 cortical neurons of Polb-deficient and wild-type mice. NCBI BioProject. https://ddbj.nig.ac.jp/search/entry/bioproject/PRJDB20767. Deposited 15 May 2025.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix 01 (PDF)

Dataset S01 (XLSX)

pnas.2506846122.sd01.xlsx (127.3KB, xlsx)

Dataset S02 (XLSX)

pnas.2506846122.sd02.xlsx (413.4KB, xlsx)

Dataset S03 (XLSX)

pnas.2506846122.sd03.xlsx (19.3KB, xlsx)

Data Availability Statement

Raw sequencing data have been deposited in the DNA Data Bank of Japan Sequence Read Archive (DDBJ‐SRA) under accession numbers (BioProject: PRJDB20767) (76). Data are publicly available at https://ddbj.nig.ac.jp/sra.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES