Abstract
8-oxo-7,8-dihydroguanine (OG) is one of the most abundant oxidative lesions in the genome and is associated with genome instability. Its mutagenic potential is counteracted by a concerted action of 8-oxoguanine DNA glycosylase (OGG1) and mutY homolog DNA glycosylase (MUTYH). It has been suggested that OG and its repair has epigenetic-like properties and mediates transcription, but genome-wide evidence of this interdependence is lacking. Here, we applied an improved OG-sequencing approach reducing artificial background oxidation and RNA-sequencing to correlate genome-wide distribution of OG with gene transcription in OGG1 and/or MUTYH-deficient cells. Our data identified moderate enrichment of OG in the genome that is mainly dependent on the genomic context and not affected by DNA glycosylase-initiated repair. Interestingly, no association was found between genomic OG deposition and gene expression changes upon loss of OGG1 and MUTYH. Regardless of DNA glycosylase activity, OG in promoter regions correlated with expression of genes related to metabolic processes and damage response pathways indicating that OG functions as a cellular stress sensor to regulate transcription. Our work provides novel insights into the mechanism underlying transcriptional regulation by OG and DNA glycosylases OGG1 and MUTYH and suggests that oxidative DNA damage accumulation and its repair utilize different pathways.
Graphical abstract
Created in BioRender. Scheffler, K. (2024) https://BioRender.com/v96f108
1. Introduction
Reactive oxygen species (ROS) are one of the major threats for DNA integrity [1]. ROS leads to accumulation of oxidative DNA damage which is a potential source of mutagenesis, which may lead to cancer, aging or neurodegenerative diseases [2,3]. Due to its low redox potential, the DNA base guanine (G) is most prone to oxidation, resulting in, among others 8-oxo-7,8-dihydroguanine (OG) which is estimated to occur on average about 100–500 times per cell each day [4]. OG is recognized and excised by 8-oxoguanine DNA glycosylase (OGG1) through a process called the base excision repair (BER) pathway [5]. OG is a pre-mutagenic lesion, as during replication adenine can be incorporated opposite of OG, giving rise to a stable and pro-mutagenic OG:A base pairing [6]. A second round of replication results in a T:A base pair manifesting the oxidative damage into a point mutation [7]. DNA glycosylase mutY homolog (MUTYH) repairs the misincorporated adenine opposite of OG thereby preventing mutations [6]. Previously we have shown that global levels of OG are not increased in the brain of mice lacking OGG1 and/or MUTYH but that their hippocampal transcriptome is altered [3]. Differentially expressed genes overlapped between the different knockout (KO) mice and were associated with anxiety, learning and memory pathways. However, the molecular mechanism behind this remains mainly unknown.
Several recent studies suggested a role for OG as an epigenetic-like mark. OG has been shown to be involved in crosstalk with cytosine methylation [8,9]. A passive inhibition of methylation of CpG islands was described, as OG reduces the binding affinity of DNA methyltransferases [[10], [11], [12]]. Furthermore, the binding affinity of methyl-CpG binding proteins in methylated CpG islands is affected by the presence of OG [13]. Stress-induced accumulation of OG and recognition by OGG1 in promoter regions was shown to regulate gene expression of specific genes [14,15]. In addition, apurinic (AP) sites generated by excision of OG activates gene transcription, which has been demonstrated for vascular endothelial growth factor (VEGF), endonuclease III-like protein 1 (NTHL1) [16] and sirtuin 1 (SIRT1) genes [17]. OG has also been shown to destabilize G-quadruplex (G4) formations in telomeric regions [18]. This secondary DNA structure consists largely of Gs and is involved in the regulation of transcription [19,20]. Upon excision of OG on the coding strand in potential G4 forming sequences, the resulting abasic site and subsequent AP endonuclease 1 (APE1) binding enabled G4 formation and upregulation of gene transcription of the VEGF gene [16,21]. Alternatively, within the Kirsten ras (KRAS) gene, OG was formed under oxidative stress within the folded G4 causing the recruitment of nuclear factors that unfolded the secondary G4 structure [22]. This enabled OGG1 to repair OG and subsequently transcriptional upregulation of KRAS was observed.
To better address the role of OG in genome regulation several OG-sequencing (OG-seq) methods have been established differing mainly in the mechanisms of OG recognition (anti-OG antibody [23], selective chemical reaction [24], OG-specific glycosylases [[25], [26], [27]] or chemical labeling and polymerase stalling [28]. Together, these studies demonstrated a non-random genomic distribution of OG with regulatory, CG-rich and G4 regions being mostly affected. However, conclusions are inconsistent and might be driven by artificial introduction of OG during sample preparation or inadequate data analysis approaches [1]. So far only one study directly addressed the consequence of deficient repair on genome-wide OG damage distribution and showed increased OG level particularly in promoter, 5′ untranslated regions (UTRs) and 3′ UTRs in OGG1-deficient mouse embryonic fibroblasts [24].
In the present study, we investigate the genome-wide distribution of OG in HAP1 cells deficient for OGG1 and/or MUTYH by using an improved OG-seq approach that reduces artificial background oxidation and compared it to gene expression levels using RNA-sequencing. This allows for the first time to address the suggested epigenetic-like properties of OG and its repair in transcriptional regulation on a genome-wide scale in vivo.
2. Material and methods
2.1. Cell culture
The near-haploid HAP1 cells are derived from the human leukemia KBM-7 cell line and were obtained from Horizon Discovery Ltd [29]. The cells were cultured in Iscove's Modified Dulbecco's Medium with 10 % Fetal Bovine Serum and 1 % Penicillin/Streptomycin in atmospheric oxygen conditions. The insertions/deletions generated by CRISPR-Cas9 technology in the OGG1 and MUTYH genes are provided in Supplementary Table 1. Two clones of each cell line were used for the experiments if not indicated otherwise. HAP1 cells can become diploid reaching 20 passages in culture [30] and to ensure haploidy, cells with passage number <15 were used for all experiments.
2.2. Proliferation and survival assay
HAP1 cells were seeded in triplicate onto a 96-well plate at a concentration of 5000 cells/well. For proliferation, cells were grown for 24 h–72 h. For cell survival, cells were treated 24 h after seeding with different concentrations of hydrogen peroxide (H2O2) for 24 h. Viability was measured using PrestoBlue cell viability reagent according to the manufacturers protocol. Fluorescence was measured using FLUOstar Omega Plate reader (BMG Labtech).
For RNA-sequencing, WT cells were treated 24 h after seeding with 100 μM H2O2 for 24 h.
2.3. Targeted mass spectrometry
To isolate nuclear proteins, HAP1 cells were incubated in hypotonic buffer (20 mM Tris-HCl (pH 7.4), 10 mM KCl, 2 mM MgCl2, 1 mM EGTA, 0.1 % NP40, and protease inhibitors) on ice for 15 min with mixing every second minute. The nuclei were pelleted by centrifugation at 450g for 10 min using a swing-out bucket rotor. The nuclei were resuspended in RIPA buffer (50 mM Tris-HCl (pH 7.4), 150 mM KCl, 10 mM MgCl2, 10 mM EGTA, 0.1 % SDS, 0.5 % sodium deoxycholate, 1 % Triton X-100, 10U DNAse1 10U) and complete protease inhibitor). Sonication for ten cycles 30 s ON/30 s OFF in a Bioruptor sonicator at 4 °C was performed to break the nuclei. Centrifugation was done for 10 min at 16000 g to remove debris. Protein concentration was determined using BioRad Bradford assay. 50 μg protein lysate was incubated with 5 mM TCEP for 30 min at room temperature with alkylated with iodoacetamide (1 μmol/mg protein) for 30 min in the dark, followed by precipitation via methanol-chloroform and overnight Trypsin (Thermo Scientific) digestion in 50 mM NH4HCO3 at 1:50 ratio (w/w, enzyme:protein) at 37 °C in a shaker. Samples were dried, resuspended in 40 μL formic acid and centrifuged for 10 min at max. speed (16000 g). Supernatants were transferred to HPLC vials for mass spectrometry analysis. Parallel reaction monitoring (PRM)-based targeted mass spectrometry methods were designed, analyzed, and processed using Skyline software version 23.1.0.455. Peptide standards (synthetic peptides containing heavy labeled Lysine (+8) or Arginine (+10) (PEPotec SRM Grade 2 Peptides, Thermo scientific) and peptides obtained via digestion of purified proteins) and samples were separated using an EASY-nLC 1200 UHPLC system (Thermo Scientific) and analyzed on a Thermo Scientific Q Exactive HF mass spectrometer operating in PRM mode as previously described [31]. Peptide areas for multiple peptides of the same protein were summed to assign relative protein abundance. Due to the low levels of these proteins, two unique peptides were detectable for MUTYH (DVAEVTAFR and TAETLQQLLPGVGR) and one for OGG1 (LDLVLPSGQSFR). The error bars represent standard deviation of two technical replicates for each clone.
2.4. Nucleic acid isolation
Genomic DNA was isolated using the DNeasy Blood & Tissue Kit (Qiagen) and total RNA was isolated using the RNeasy Kit (Qiagen) according to the manufactures protocol. To prevent DNA oxidation 0.1 mM Butylated hydroxytoluene (BHT) and 0.1 mM Deferoxamine (DFO) were added during the isolation procedure. DNA and RNA concentrations were measured using NanoDrop (Thermo Fisher).
2.5. Global OG measurement
Global OG measurements were performed on genomic DNA and fragmented DNA. Prior to hydrolysis, fragmented DNA samples were purified by isopropanol precipitation. 0.3 vol of 10 M ammonium acetate and 1 volume of isopropanol were added to the DNA samples. The samples were then centrifuged at 16 000 g for at least 30 min. Next, the supernatants were discarded by decanting and samples were washed twice with ice-cold 70 % ethanol. Finally, the pellets were dried at room temperature for 5 min.
To hydrolyze DNA, samples were incubated for 1 h at 37 °C in DNA hydrolysis mixture consisting of hydrolysis buffer (10 mM NH4Ac, 1 mM MgCl2, 0.1 mM ZnCl2) with 0.008 U/μl nuclease P1 (NP1), 0.8 U/μl benzonase, and 0.04 U/μl alkaline phosphatase, including the antioxidants BHT and DFO with a final concentration of 0.1 mM each. The reaction was stopped by placing the samples on ice. Next, proteins were precipitated from the solution by adding 150 μl ice-cold acetonitrile followed by 30 min centrifugation at 4 °C 16000 g. Subsequently, the supernatant was transferred into new tubes and lyophilized at - 80 °C overnight. Finally, the samples were dissolved in water. Single nucleotide 8- oxoguanine (O(dG)) was analyzed by liquid chromatography-mass spectrometry (LCMS) using an Agilent 6495 triple quadrupole LCMS system with an Agilent EclipsePlusC18 RRHD column (2.1 × 150 mm, 1.8 μm particle size) at 25 °C. The mass transition used was 284.1 → 168.0 m/z. The mobile phases were (A) UHPLC-grade water and (B) UHPLC-grade methanol, both containing 0.1 % UHPLC-grade formic acid. The HPLC method used a flow rate of 230 ml/min with 5 % B to 0.5 min, ramp to 20 % B at 2.5 min, ramp to 95 % B at 6 min, hold at 95 % B from 6 to 7 min, ramp to 5 % at 7.2 min, and equilibration with 5 % B from 7.2 to 10.5 min. For unmodified deoxyribonucleosides (dNs), we diluted samples 1:5000 and analyzed them on a API5000 triple quadrupole mass spectrometer (Applied Biosystems) with an Acentis® Express C18 column (0.5 × 150 mm, 2.7 μm particle size) at 40 °C. The HPLC method used a flow rate of 200 μl/min with an isocratic flow of 22 % B for 3 min. The mass transitions used were 252.1 → 136, 228.1 → 111.9, 268.1 → 152, and 243.1 → 127 m/z for dA, dC, dG, and dT, respectively.
2.6. RNA-sequencing
Total RNA from two clones of each genotype was sent to BGI Tech Solutions Co., Hong Kong for RNA-sequencing on DNBseq platform. FASTQ files were quality assessed using FastQC and Fastq-Screen (https://www.bioinformatics.babraham.ac.uk/projects/) and aligned to the GRCh38 genome using the splice-aware aligner Hisat2 (v.2.2.1; [32]). All samples had over 97 % alignment rate. Quantification of reads over genes was performed by the Bioconductor Rsubread (v2.2.6) package in R (v4.2.2) using annotations from the Ensembl GRCh38.p13 (release 103) reference. Normalization and differential expression analysis was then performed between genotypes using the quasi-likelihood method from the Biocondictor package edgeR (v.3.42.4; [33]). Genes were considered differentially expressed if they had a log2FC beyond ±1 and a P. value ≤ 0.05. Volcano plots for RNA-seq data were generated using VolcanosR2 (https://huygens.science.uva.nl/VolcaNoseR2/https://huygens.science.uva.nl/VolcaNoseR2/). Pathway analysis was performed on differentially expressed genes (DEGs) for gene ontology (GO) biological pathways using Pantherdb overrepresentation fisher test and p-values were corrected for false discovery rate. A normalized count matrix was exported from edgeR for further analysis and comparison.
2.7. DNA fragmentation
Antioxidants BHT and DFO were added with a final concentration of 0.1 mM to the samples during fragmentation to avoid artificial introduction of OG. For the sonication procedures, genomic DNA was prepared in 10 mM Tris-HCl (pH 8.0), 100 μM DFO, and 100 μM BHT. The sonicators were operated according to the manufacturers’ protocols to obtain a target fragment size of 150bp. Specifically, the Diagenode sonicator was operated in 30 cycles, wherein one cycle consists of 30 s on and 30 s off. The Covaris sonicator was operated with 50 W peak incident power, a duty factor of 20 %, 200 cycles per burst and with a total treatment time of 350 s. DNA fragmentation by Fragmentase was performed using 3 μg genomic DNA in the supplied 1x Fragmentase buffer (20 mM Tris-HCl, 15 mM MgCl2, 50 mM NaCl, 0.1 mg/ml bovine serum albumin, 0.15 % Triton X-100, pH 7.5) and additional 0.1 mM of BHT and DFO. After 30 min at 37 °C the reaction was stopped by placing the samples on ice and by adding 5 μl ice-cold EDTA. Fragmented DNA was cleaned up using DNA Clean & Concentrator-5 (Zymo Research). To visualize the fragmentation, DNA was loaded onto 2 % agarose gel including 1X GelRed (Biotium).
2.8. OG-sequencing
Affinity purification of OG containing DNA fragments was performed according to Ding et al. [24] with minor modifications. In brief, fragments were chemically labeled using 20 mM amine-terminated biotin with a polyethylene glycol linker (BTN) and 5 mM K2IrBr6 in 100 mM NaPi (pH 8.0) for 30 min at 45 °C. Subsequently, a PCR purification kit (Qiagen) was used to remove excess BTN. Labeled fragments were purified using Dynabeads MyOne Streptavidin C1 (Invitrogen) washed and eluted by incubation in 150 mM NaOH for 30 min. Prior to library preparation single stranded (ss) DNA was concentrated using the ssDNA/RNA clean & concentrator kit (ZYMO).
Libraries were constructed using the Accel-NGS® 1S Plus DNA Library Kit (Swift Biosciences™) according to the protocol with an initial input of 500 pg per sample. Electropherograms (High Sensitivity DNA Assay, Bioanalyzer, Agilent Technologies) were performed prior to pooling. Pooled libraries from two clones of each genotype (in total three independent experiments) were sent to BGI Tech Solutions Co., Hong Kong for sequencing on a MGISEQ-2000 platform. The samples were sequenced as paired end reads of 100bps.
Illumina adapters were trimmed from raw files using cutadapt (v.2.8) and quality of the reads was assessed using FastQC and Fastq-Screen. Datasets were mapped from FASTQ files to the human genome (GRCh38) with Bowtie2 (v.2.3.5.1; [34]) using default settings and duplicate reads were marked using samtools (v1.10). Samples were assessed by average read count over gene bodies from bam and bigwig files using deeptools [35] and no clonal differences were found to be present. Reads were counted over standard chromosomes (i.e. chr 1–22 & X) in non-overlapping 1000bp bins. Incomplete bins (i.e. smaller than 1000bp) or bins overlapping blacklisted regions [36] were removed. Reads were counted using bedtools (multicov; v2.27.1) and counts were normalized using a rank statistics-based signal extraction scaling (SES) method as previously published by Diaz et al. [37]. This method works by differentiating between genomic regions with enrichment in a targeted pull-down sample and background regions found in both the pull-down and the input to use only background regions for normalization. Briefly, binned counts are reordered by OG signal intensity and the difference of the cumulative percentage tag by ordered bin was calculated to differentiate signal from noise regions. An inter-library scaling factor was then calculated from only low-damage background regions, which was applied to input counts. Genotypes showed no difference in global accumulation of oxidative damage (Fig. 1C), but it was important to consider that differences in library size between genotypes at specific genomic elements could be dependent on genotype. Therefore, inter-genotype library scaling factors used in all regional analyses were calculated from total library size as the ratio of the individual sample to the mean of all samples over all bins.
Fig. 1.
Characterization of WT and DNA glycosylase deficient HAP1 cells. (A) Proliferation of WT, MUTYH-/, OGG1-/ and DKO over the course of 72 h. (B) Survival upon treatment with increasing concentrations of H2O2. (C) Global levels of single nucleotide 8-oxoguanine (O(dG)) per deoxyribonucleosides (dNs) of naïve cells. Statistical analyses were done in GraphPad Prism software 10.1 using 2-way ANOVA with Dunnett's multiple comparisons test. P-values <0.05 were considered significant. Error bars represent standard deviation from the mean based on two to five independent experiments and two clones per genotype.
2.9. Enrichment of OG across the genome
Areas of enrichment were identified by the log2 ratio of OG to Input signals over 1000bp bins. To ensure biological relevance, technical replicates were processed individually and only bins with the largest log2 fold-change were considered (≥95th percentile). Extracted bins were compared and retained only if they were present in both technical replicates and at least two of three biological replicates. Bookended bins were combined within genotype and annotated using HOMER (Hypergeometric Optimization of Motif EnRichment; v4.11.1) to closest gene and genomic feature. GC content of enriched regions was determined using bedtools.
2.10. Enrichment of OG at genomic features
To assess the difference in OG accumulation across different genomic features, reads were analyzed over the following regions: 1000bp upstream of gene start, CpG islands, exons, introns, 1000bp downstream of gene end and intergenic regions. Regional bed files were downloaded for the hg38 genome from the UCSC table browser (http://genome.ucsc.edu/cgi-bin/hgTables) for CpG regions, exons, introns, upstream, downstream, and whole genes. Intergenic regions were determined by taking the complement of whole genes over the genome using bedtools. Blacklisted regions were subtracted from all regional bed files. Samples were counted over regions individually and imported into R for processing. Input counts were normalized using signal extraction scaling (SES) factors and subtracted from OG reads to a minimum of 0. To find genotype-specific enrichment of oxidative damage, low-read filtering and differential enrichment was performed in edgeR using manually set predetermined library normalization factors. OG-differentially enriched regions (OG-DERs) were defined as all regions with a P. value ≤ 0.01 and a log2FC beyond ±0.5. Results were annotated using Homer and gene lists from differentially enriched CpG islands, exons, introns and 1000bp upstream of gene start were analyzed using Pantherdb for overrepresentation of genes in GO biological processes.
2.11. Enrichment of OG in G4 motifs
G4 regions were defined using DeepG4 [38]. DeepG4 uses a deep-learning approach to score DNA regions on their potential to form G4 complexes based on sequence motifs. Here we used DeepG4 to scan the hg38 genome to identify potential G4 regions based on the canonical G4 motif (G3+, N1−7, G3+, N1−7, G3+, N1−7, G3+). Bed files for individual chromosomes (1–22 & X) were created to have non-overlapping 100bp windows using bedtools and imported to R. Sequences were obtained for each bin using the Bioconductor package Biostrings (v2.62.2; BSgenome.Hsapiens.UCSC.hg38_1.4.4) and regions with missing base pairs (i.e. N) were removed. Bins were scored for G4 probability by DeepG4 and all bins with a probably score ≥0.99 were retained. Reads we counted over retained bins and compared across genotypes as described above in the regional analysis. Additionally, DeepG4 was used to scan enriched bins identified in the genome-wide relative enrichment analysis.
2.12. Comparison of OG enrichment and RNA expression
To consider the impact of OG on RNA expression, data was compared between RNA-sequencing counts and OG-sequencing reads in promoter regions (defined at 1000bp upstream of the transcription start site (TSS)). OG-seq reads were counted over promoter regions and SES normalized input counts were subtracted from the OG signal. Counts were collapsed to meta-features by mean and normalized. RNA counts were mapped to OG reads by gene. Counts were averaged over replicates and genes were filtered if counts were not present in at least three of four genotypes per sequencing approach. Counts were z-scored by row within each sequencing approach and hierarchical clustering was performed using Euclidean distance to identify clusters that reliably showed a positive or negative correlation between OG and RNA across all genotypes (i.e. genes that show a consistent relationship of differing from the mean between oxidative damage and gene expression regardless of DNA glycosylase activity). To identify enriched DNA sequence motifs, the findMotifGenome.pl command from HOMER with default parameters was applied using correlating promoter regions identified from the hierarchical clustering as input and all promoters considered in the clustering as background. To correlate changes in OG accumulation to changes in gene expression at specific genomic features, log2 fold changes for OG-DERs over the genomic feature and the corresponding log2 fold changes for the annotated genes from the RNA-seq analysis were extracted. Hierarchical clustering with a Pearson's correlation coefficient and average linkage method was applied.
2.13. Statistical analyses
Statistical analysis was performed with GraphPad Prism software 10.1 using either two-way ANOVA or ordinary one-way ANOVA with Dunnett's or Tukey's multiple comparison test, respectively. P-values <0.05 were considered significant. Error bars represent standard deviation from the mean based on three to five independent experiments.
3. Results
3.1. Loss of OGG1 and MUTYH impairs proliferation capacity and OG repair
HAP1 cells deficient for either MUTYH (MUTYH−/) or OGG1 (OGG1-/) or both DNA glycosylases (DKO) were generated by Horizon Discovery using CRISPR-Cas9 genome editing. The used guide RNA, targeted exon and insertion/deletion length is shown in Supplementary Table 1. Two independent clones of each knockout and wildtype (WT) cells were used for the experiments. Knockout of DNA glycosylases MUTYH and OGG1 was verified by targeted mass spectrometry in all clones (Supplementary Fig. 1). To characterize the impact of DNA glycosylase deficiency, proliferation capacity of the cells and response to oxidative stress was monitored (Fig. 1). Cell proliferation steadily increased over 72 h for all cell lines, but DKO showed a significantly reduced proliferation rate compared to WT from 48 h onwards (Fig. 1A). Treatment with increasing concentration of hydrogen peroxide (H2O2) decreased cell survival accordingly but was not further affected in the knockout cells (Fig. 1B). We measured global OG levels by LCMS and found no significant differences upon loss of DNA glycosylases compared to WT (Fig. 1C). No clonal differences were observed as exemplified by similar global OG levels across the independent clones (Supplementary Fig. 2).
3.2. Fragmentation of DNA by sonication induces artificial OG
One crucial step during OG-seq is the initial DNA fragmentation, which determines the sequencing resolution. Fragments should be as small as possible to ensure a high resolution while still long enough to enable sequencing, ideally 150bp. Previously published OG-seq approaches employed sonication [24,26]. However, sonication is a harsh treatment, with the potential of introducing artificial DNA oxidation. Here, we used a commercial nuclease mixture, Fragmentase®, to fragment genomic DNA and compared it to two sonication methods to investigate the effect on fragment shearing size and OG level increase. The Fragmentase® enzyme mixture digests DNA in a non-sequence specific manner and results in fragment sizes similar to sonication (Fig. 2A lower panel). To identify the amount of artificial DNA oxidation introduced during sample preparation, we measured global level of OG by LCMS and found a wide variation among the three tested fragmentation methods (Fig. 2A upper panel). Surprisingly, sonication increased global OG levels by 10-fold using the Covaris instrument and significantly by 39-fold for the Diagenode instruments compared to control. Enzymatically fragmented DNA did not significantly increase global OG levels (2.4-fold). Thus, DNA fragmentation using Fragmentase® minimizes artificial DNA oxidation providing a considerable improvement to the OG-seq pipeline.
Fig. 2.
Optimization of fragmentation and OG-seq in WT and DNA glycosylase deficient HAP1 cells. (A) Comparison of different fragmentation methods. Top: LCMS was used to measure global O(dG) levels from Diagnode and Bioruptor sonicators as well as an enzymatic approach using Fragmentase. Error bars represent standard deviation from the mean, based on independent experiments (Control n = 4, Diagnode n = 3, Bioruptor n = 3, Fragmentase n = 5). Statistical analyses were done in GraphPad Prism software 10.1 using ordinary one-way ANOVA with Tukey's multiple comparisons test. P-values <0.05 were considered significant, n.s. = not significant. Bottom: representative agarose gels from the respective fragmentation methods. (B) Number of OG enriched bins (OG-EBs) in WT, MUTYH-/, OGG1-/ and DKO HAP1 cells. Log2 fold ratio of OG/input was calculated and enriched bins were defined as ≥ 95th percentile within each genotype. (C) Heatmap of consensus peaks based on Homer annotation in studied genotypes, displayed as log2 ratio (observed/expected). (D) Percent of OG-differentially enriched regions (OG-DERs) in different genomic features in MUTYH-/, OGG1-/ and DKO compared to WT. Regions have been annotated to the nearest gene, among the different genomic features.
3.3. OG is moderately enriched in the genome and predominantly located at CpG-rich regions
OG-seq, similar to immunoprecipitation-based sequencing methods (i.e. ChIP), involves identifying regions of read/signal enrichment in a targeted sample compared to a background sample. However, due to the ubiquitousness of OG across the genome, low frequency of occurrence and single-nucleotide nature of OG, classical peak-calling methods were unable to be applied to OG-seq data, a problem that has been reported previously [26]. As such, a genome-wide relative enrichment analysis was used to identify OG-enriched bins (OG-EBs).
OG-EBs had an up to 15 % higher average GC content compared to the reported genomic GC content of 40.9 % indicating specific OG pulldown (Supplementary Table 2). However, only a small number of OG-EBs in the genome were identified (average of 424 bins), with similar levels across the analyzed genotypes (Fig. 2B). Mean coverage of these regions was analyzed across the independent clones showing no clonal differences in overall OG-signal intensity (Supplementary Fig. 3). OG-EBs were annotated to their closest gene using Homer and found to be largely compromised of introns and intergenic regions (Supplementary Fig. 4A). Using the ratio of observed to expected number of sequences, we found that our identified OG-EBs were overrepresented for regulatory genomic features including promoter regions, 5′ untranslated regions (UTR) and CpG-islands (Fig. 2C). On the other hand, OG-EBs were underrepresented in 3′ UTR and intergenic regions. This correlated well with the CpG content of the genomic features, with an increased enrichment of OG-EBs in CpG-richer regions (Supplementary Table 3). OG-EBs were similarly distributed across genomic features between genotypes (Fig. 2C), however, analysis of the annotated genes showed almost no overlap (Supplementary Fig. 4B). This indicates that though OG frequency is related to the genomic landscape, it appears to be stochastic in relation to specific genes. Despite this, GO pathway analysis of OG-EBs revealed similar pathways across genotypes which were predominately related to system and cellular development as well as cellular growth and differentiation (Supplementary excel file 1).
To identify DNA glycosylase specific differences, we compared OG accumulation between WT and knock out cells across specific genomic features. We found similar levels of gain or loss of OG within each genomic feature between all genotypes (Fig. 2D). CpG-Islands showed the highest percentage of OG-differentially enriched regions (OG-DERs) affecting on average 1.86 % of all CpG-islands analyzed. Introns were the least affected by loss of DNA glycosylases, with on average only 0.19 % of these regions being identified as OG-DERs. Pathway analysis of OG-DERs across all the analyzed genomic locations and genotypes revealed a few significant hits which mainly related to processes with a broader biological role in development (Supplementary excel file 2). We further focused on OG-DERs found within genomic features that demonstrated most differences in OG enrichment including CpG-islands, upstream regions and exons. GO analysis of either gain or loss of OG in the KOs across all three regions identified no significant (FDR≤0.05) pathways in MUTYH-/, but pathways for OGG1-/ and DKO associated with chemical stimuli, biogenesis, and development (Supplementary excel file 2). Despite similar levels of loss and gain across our KOs, the overlap of OG-DERs annotated genes between the genotypes were substantially higher in genes with OG gain (69–73 %) than OG loss (30–40 %) across all three regions.
3.4. OG accumulates in the context of G4-structures
Previous studies reported differences in the occurrence of OG within G4-structures [24,26,28], thus we investigated the distribution of predicted G4-forming sequences over OG-EBs in our cell lines. We found 0.05 % of all analyzed bins across the genome contained high probability G4-forming motifs (≥0.99). In contrast, we observed around 18 % of the OG-EBs in our genotypes contained G4-forming motifs with the highest level seen in OGG1-/ (Fig. 3A). To find OG-DERs within G4's (G4-DERs), we first identified high probability G4-motifs across the human reference genome (hg38). By using 100bp bins and a probability score ≥0.99, we identified 13 256 bins containing potential G4-forming sequences. These G4 regions demonstrated a high CpG content comparable to CpG islands. (Supplementary Table 3). We found that up to 2 % of the G4-regions had different OG accumulation in DNA glycosylase deficient cells and most of these showed gain of OG (Fig. 3B). However, G4-DERs showed similar levels of gain and loss across all genotypes when compared to WT. GO analysis of G4-DERs revealed pathways related to cellular differentiation, proliferation, homeostasis, and response to stimuli, however these did not maintain significance after FDR correction (Supplementary excel file 3).
Fig. 3.
Enrichment of OG at G4-forming sequences in WT and DNA glycosylase deficient HAP1 cells. (A) Percent of OG enriched bins (OG-EBs) containing G4-forming sequences of WT, MUTYH-/, OGG1-/ and DKO HAP1 cells. G4-forming sequences have been identified using DeepG4 with a probably score of ≥ 0.99. (B) Percent of OG-differentially enriched regions within G4 motif regions (G4-DERs) in MUTYH-/, OGG1-/ and DKO compared to WT.
3.5. DNA glycosylase-dependent changes in gene expression
Next, we performed RNA-seq to investigate the impact of DNA glycosylase-deficiency on gene expression. The lack of MUTYH caused 212 differentially expressed genes (DEGs), compared to WT, while the lack of OGG1 caused 299 DEGs. The absence of both DNA glycosylases resulted in 317 DEGs (Fig. 4A). Overall, only 28 genes were found to be differentially expressed in all three genotypes, but between 40 and 46 % of DEGs of each individual genotype were found in at least one other genotype (Supplementary Fig. 5). When looking across all DEGs independent of genotype using GO, we found a broad spectrum of metabolic and developmental processes along with cell differentiation, gene expression and RNA processing (Fig. 4B). When analyzing the DEGs found in at least two genotypes, specific developmental pathways (i.e. muscle, organ, circulatory development) as well as response to oxygen-containing compound were overrepresented (Supplementary excel file 4).
Fig. 4.
Correlation of RNA-seq with OG-seq in WT and DNA glycosylase deficient HAP1 cells. (A) Volcano plot of up and down regulated genes identified by RNA-sequencing from MUTYH-/, OGG1-/ and DKO HAP1 cells compared to WT. Dotted lines represents FC >±1 (log2) and p-value <0.5 (-log10) (B) Top ten enriched Gene Ontology pathways ranked by p-value and identified by protein analysis through evolutionary relationships (PANTHER). (C) Overlap of all differentially expressed genes (DEGs) from RNA-seq with OG-differentially enriched regions (OG-DERs) from MUTYH-/, OGG1-/ and DKO across exons, CpG islands and upstream region. (D) Overlap of all DEGs from RNA-seq and OG-DERs between exons, CpG islands and upstream regions across all genotypes. (E) HOMER-identified enriched motifs in positively or negatively correlated OG-seq and RNA-seq read counts at promoter regions (defined as 1000bp upstream of the transcription start site) (F) Hierarchical clustering of stress-response genes. Z-scored RPKMs from RNA-sequencing in H2O2 treated and non-treated WT cells, where rows represent genes, shows response to oxidative treatment across both clones in 21.2 % of identified stress response genes.
3.6. Stress response genes regulate their transcription dependent on OG
To address the epigenetic-like function of OG, genome-wide OG distribution was correlated with the gene expression profile of all genotypes. First, we compared the DEGs across all genotypes (n = 633) to OG-EBs that were found in the KOs but not in the WT and found little overlap (n = 38) (Supplementary Fig. 6A). Similarly, the overlap of G4-DERs (n = 471) and DEGs resulted in little overlap (n = 8) (Supplementary Fig. 6B). Next, we focused our analysis on the overlap of OG-DERs from CpG-islands, upstream regions, and exons with DEGs. Venn analysis by genotype showed that about 2 % of the OG-DERs from any one genotype could be found amongst DEGs, but there was no difference in overlap rates between genotypes (Fig. 4C). Venn analysis by genomic feature across genotypes showed that OG-DERs in exons and upstream regions had a small overlap with DEGs of 2.4 % and 1.7 % respectively (Fig. 4D). In contrast, OG-DERs from CpG islands showed an 8.2 % overlap with DEGs. Pathway analysis revealed that overlapping genes were related to developmental pathways (Supplementary excel file 5). The comparison of RNA-seq reads and OG-seq reads from overlapping genes in the analyzed regions revealed both a negative and positive effect of OG on gene expression (Supplementary Fig. 7).
To further investigate the correlation between OG accumulation and transcription, we compared all RNA-seq reads and OG-seq reads in gene promoters (defined as 1000bp upstream of the TSS). Using hierarchical clustering of scaled read counts we focused on genes that showed consistent positive or negative correlation between OG levels and RNA expression regardless of genotype. Using GO pathway analysis, we found a significant overrepresentation of genes related to metabolic processes amongst positively correlated genes, while genes that consistently showed a negative relationship between OG- and RNA-seq had an overrepresentation of genes related to cellular (re)organization, DNA damage response and cell cycle (Table 1). Motif-enrichment analysis identified the nuclear factor erythroid 2 (NF-E2)-related factor 1 (NFE2L1, also known as Nrf1) motif as the strongest enriched in positively correlating regions (Fig. 4E). This transcription factor is a regulator of stress responses by controlling redox balance and proteasome activity [39]. Promoter regions with a negative correlation between OG levels and RNA expression showed an enrichment of the aryl hydrocarbon receptor (AHR)-nuclear translocator ARNT which is linked to increased oxidative metabolism and the formation of ROS [40]. These results indicate a DNA glycosylase-independent effect on transcription by OG at promoters specifically of genes related to the stress-response systems. To verify that these genes can indeed respond to oxidative stress, we performed RNA-seq of H2O2 -treated WT cells and found that more than 20 % of the stress response genes that showed a consistent correlation between OG levels and RNA expression were differentially expressed compared to non-treated WT cells (Fig. 4F).
Table 1.
Top 5 highest ranked GO pathways from either positively or negatively correlated OG-seq and RNA-seq read counts at promoter regions (defined as 1000bp upstream of the transcription start site).
| GO biological process | Fold enrichment | FDR |
|---|---|---|
| positive correlation | ||
| cellular metabolic process | 1.16 | 0.00669 |
| primary metabolic process | 1.14 | 0.00302 |
| nitrogen compound metabolic process | 1.14 | 0.00850 |
| metabolic process | 1.13 | 0.00311 |
| organic substance metabolic process | 1.13 | 0.00423 |
| negative correlation | ||
| vacuole organization | 2.14 | 0.03530 |
| DNA damage response | 1.63 | 0.00188 |
| cell cycle | 1.44 | 0.00695 |
| organelle organization | 1.30 | 0.00060 |
| cellular localization | 1.25 | 0.00416 |
4. Discussion
Oxidative DNA damage and its repair has been shown to be involved in transcriptional regulation but whether these processes are directly linked remained largely understudied. Correlating genome-wide OG distribution and the gene expression profile of OGG1 and/or MUTYH-deficient cells, we present evidence that both OG and DNA glycosylases modulate gene transcription independently.
OGG1 and MUTYH prevent the mutagenic effect of OG and synergistically promote survival in mice [41]. Using HAP1 cells deficient for both DNA glycosylases, we observed decreased cell proliferation after longer culturing periods but no additional effect on cell survival after oxidant treatment. This is partially in contrast with a previous study, using primary fibroblast from DKO mice that also found slower proliferation but increased sensitivity to H2O2 treatment [42]. However, OGG1 and/or MUTYH-deficient HAP1 cancer cells may have developed strategies to adjust to increased oxidative stress and maintain resistance to apoptosis which has recently been demonstrated for NEIL DNA glycosylases [30].
Despite lacking DNA glycosylases, we found no significant global accumulation of OG in untreated cells compared to WT. Previous studies of OGG1 and/or MUTYH deficient models have reported, dependent on tissue, age, and oxidative stress exposure, both no change in OG accumulation as well as significantly increased OG. In the liver, Ogg1−/− and DKO showed higher OG accumulation that continued to increase with age, while no differences were found in kidney or spleen [43,44]. In the brain, no significant differences in OG accumulation have been found in Ogg1, Mutyh and DKO mice [3,44]. To our knowledge, no other study has quantified OG in DNA glycosylase deficient HAP1 cells, however, one study analyzed repair capacity and found that OGG1 deficient HAP1 cells showed a higher rate of BER than WT cells at baseline [45] suggesting efficient compensation of other available DNA glycosylases in the absence of OGG1. Potential candidates that were shown to repair OG are NTHL1, NEIL1 and to a lesser extend NEIL2 [[46], [47], [48]]. Another mechanism to prevent accumulation of OG in the absence of OGG1 may be provided by MutT homolog 1 (MTH1), also known as nudix hydrolase 1 (NUDT1) which sanitizes the oxidized nucleotide pool by hydrolyzing 8-oxo-dGTP [49].
Given the proposed epigenetic role of OG, sequencing its genomic distribution is of high interest. However, background oxidation of DNA during sample preparation can artificially increase OG and compromise data interpretations. Here, we identified sonication as a major source of artificial OG introduction which may explain previous results that reported several thousands of OG enriched regions as compared to our 448 OG-EBs in WT HAP1 cells using our improved approach [23,24]. To our knowledge, only one study analyzed genome-wide OG distribution in OGG1-deficient cells and found a 2-fold increase in peaks of OG enrichment [24]. However, these results were based on one sequencing sample prepared using sonication and should be taken with caution. We found no major difference in OG enrichment after loss of OGG1 and/or MUTYH which is in line with the similar global levels of OG and may be explained by compensatory repair mechanisms as discussed before. In summary, by minimizing the presence of artificial OG using enzymatic digestion, one can ensure that the level of OG throughout the genome reflects the in vivo situation better compared to earlier studies and the conclusions drawn here, have a higher biological relevance. However, considering the low abundance of OG and background oxidation during sample preparation that cannot completely be avoided it remains challenging to obtain the true basal OG distribution in the genome.
Most of our OG-EBs were found in introns and intergenic regions as described previously [26]. However, after accounting for the relative size of genomic features across the genome, we found an overrepresentation of enriched bins in gene regulatory regions like promoters, 5′ UTRs and CpG islands, in line with what has been reported by others [25]. Given the higher G content of CpG-islands the likelihood of OG accumulation is consequently higher. A similar bias has been described before [24]. However, we cannot exclude that a higher abundance of G's may lead to a stochastic increase of G oxidation caused by artificial oxidation during sample preparation. In contrast, sequencing of OG in single base resolution suggested a bimodal model for OG accumulation at high CpG promoters with a drop of OG in a narrow window around TSS [28] which will be missed with our OG-seq approach due to the lower resolution.
The overall level of OG accumulation in specific genomic features did not differ between KOs and WT, only the precise location of OG. Moreover, the number of OG-DERs were generally low suggesting that OG distribution is not notably impacted by its repair. However, some genomic features showed a greater difference in OG accumulation in KOs, and this correlated directly with the GC% in those regions, with CpG-islands showing the highest percentage of OG-DERs and highest GC content. Together, this indicates that OG accumulates randomly within a genomic feature and differences are mainly driven by region rather than genotype. This is further underlined by pathway analysis which revealed no significant overrepresentation of pathways in all regions and all genotypes with few exceptions. However, in a previous study [25], a catalytic inactive OGG1 was found bound to OG mainly at the same regions that we identified as differentially enriched in the KOs (e.g. upstream and CpG islands) suggesting that at least partially OG might be preferentially repaired at gene regulatory regions by OGG1.
Another genomic feature with higher G:C content is G4 structures, which are found at promoter regions [50]. In line with previous studies, we identified an enrichment of OG at G4 sites [25,26] which further increased upon loss of OGG1 [24]. More recently, however, it was shown that OG is depleted at G4 structures that have been detected in living cells [28]. Since G4 formation will depend on cell type, and to our knowledge no G4 profiling has been completed in HAP1 cells, we cannot exclude the possibility that OG-EBs within G4 motifs identified here fail to form G4s in vivo. The different knockout models showed minor differences in relative G4 enrichment suggesting that OG distribution at G4 sites is mainly independent of its repair. Pathway analysis of G4-DERs compared to all G4 regions did not find any significant overrepresentation of pathways neither within nor across genotypes, further highlighting that OG accumulation within G4 regions seems to occur randomly. One study mapped the binding of acetylated OGG1 in human cells and found a positive correlation with genome-wide occurrence of G4s and reduced G4 foci in OGG1-deficient MEF cells suggesting that OGG1-initiated repair is important for G4 formation [51]. However, direct evidence is missing, and more studies are needed to fully understand the link between OG and DNA glycosylases at G4 sites.
We have previously shown that loss of OGG1 and/or MUTYH in mice impact gene transcription independent of global accumulation of OG [3]. However, a contribution of gene region-specific OG accumulation driving gene expression changes could not be excluded. Here, we found no correlation between OG-DERs and gene expression changes, and a minor overlap of OG-DERs with DEGs in the knockout cells. This demonstrates that OGG1 and MUTYH can modulate gene transcription independent of OG repair. In support of our finding, a previous study demonstrated that catalytic inactive OGG1 could facilitate pro-inflammatory gene expression [52]. However, we cannot exclude that under oxidative stress conditions DNA repair-dependent mechanisms occur as proposed by others [[53], [54], [55]]. The underlying mechanism of DNA glycosylase repair activity-independent gene regulation is still unknown. Several studies suggest them as modifiers of the epigenetic landscape [[56], [57], [58], [59]] and demonstrate their interaction with epigenetic modifiers such as DNA methyltransferases [60], histone demethylases [61] and members of the polycomb repressor complex (PRC) [62]. Interestingly, DEGs of our knockout cells showed highest enrichment in target genes of PRC2 including SUZ12 (adjusted p-value: 4.690e-18), JARID2 (adjusted p-value: 8.007e-15) and EZH2 (adjusted p-value: 4.194e-11) as identified by Enrichr [63]. In line with this, our group recently found that DNA methylation at PRC2 target genes were altered in the hippocampus of Ogg1-and/or Mutyh-deficient mice [64]. Moreover, distribution of SUZ12 and histone modifications associated with PRC2 activity were disordered resulting in cell-type specific gene expression changes which were associated with pathways important for neuronal function and cognition. Together, this supports that OGG1 and MUTYH may cooperate with PRC2 to modulate the epigenome and regulate gene expression.
Nevertheless, several studies suggest that OG itself can regulate gene expression particularly when located in promoter regions [53,65]. Thus, we filtered our RNA-seq dataset on genes that showed a consistent relationship between OG accumulation in promoters and gene expression regardless of DNA glycosylase activity and show that OG can drive both transcriptional repression and activation, particularly of genes involved in general stress response pathways. In line with that, OG regions associated strongest with the transcription factor motif of NRF1, a key regulator of antioxidant defense systems [66,67]. Interestingly, NRF1 was previously identified among the transcription factor binding motifs in gene promoters harboring OG in adipose tissue [65]. This suggests that the accumulation of OG may serve as an oxidative stress sensor for those genes triggering cellular responses, whereby the cell works to meet the energy and biosynthetic demands caused by the stress condition [68]. This impact of OG on transcription, however, is independent of its repair and might instead be facilitated by physically altering the binding of transcription factors or the formation of secondary DNA structures. For instance, it was shown that when OG is located in promoter elements of transcription factors specificity protein 1 (SP1) [69], nuclear factor kappa-light-chain enhancer of activated B cells (NF-κB) [70] or hypoxia-inducible factor-1 (HIF-1) [71] protein binding affinity was reduced and gene transcription affected. In G4 forming sequences it has been shown that dependent on location, OG can decrease G4 stability by sterically clashing with the amino group of a neighboring guanine [22]. However, other studies report a stabilizing effect of OG on G4s achieved by its distinct hydrogen bonding properties shifting the structural equilibrium in favor of G4s [72] which in turn can enhance gene-specific transcription [16]. Regulation of transcription is important to mitigate the effects of acute or chronic oxidative stress. It will therefore be intriguing, in future studies, to explore the impact of OG and DNA glycosylases on gene regulation under different stress conditions or in oxidative stress-mediated diseases.
In conclusion, our findings reveal distinct pathways of transcriptional regulation by OG and its repair proteins OGG1 and MUTYH. Whereas OG accumulation is driven by the sequence context to signal cellular stress, OGG1 and MUTYH may have more specialized functions possibly as epigenetic modifiers.
CRediT authorship contribution statement
Tobias Obermann: Writing – review & editing, Writing – original draft, Visualization, Validation, Methodology, Investigation, Formal analysis, Data curation. Teri Sakshaug: Writing – review & editing, Writing – original draft, Visualization, Validation, Software, Investigation, Formal analysis, Data curation. Vishnu Vignesh Kanagaraj: Investigation, Formal analysis. Andreas Abentung: Writing – review & editing, Supervision. Mirta Mittelstedt Leal de Sousa: Formal analysis, Data curation. Lars Hagen: Formal analysis, Data curation. Antonio Sarno: Writing – review & editing, Supervision, Methodology. Magnar Bjørås: Writing – review & editing, Supervision, Resources, Funding acquisition, Conceptualization. Katja Scheffler: Writing – review & editing, Writing – original draft, Visualization, Validation, Supervision, Resources, Project administration, Funding acquisition, Data curation, Conceptualization.
Data availability
All genomic data produced in the present project (OG-seq and RNA-seq) have been deposited in the NCBI GEO database under accession numbers GSE256080 and GSE256081, respectively.
Funding
Norwegian University of Science and Technology; Research Council of Norway [275777 to K.S, 287911 to MB, 326101 to MB.]; the Central Norway Regional Health Authority of Norway [90172200 to K.S, 90369200 to K.S.]; Funding for open access charge: Norwegian University of Science and Technology.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
We thank Christine Gran Neurauter for verifying the genotype of the DNA glycosylase-deficient HAP1 clones and the Proteomics and Modomics Experimental Core Facility (PROMEC) at NTNU for the help with mass spectrometry analysis.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.redox.2024.103461.
Appendix A. Supplementary data
The following are the Supplementary data to this article.
References
- 1.Poetsch A.R. The genomics of oxidative DNA damage, repair, and resulting mutagenesis. Comput. Struct. Biotechnol. J. 2020;18:207–219. doi: 10.1016/j.csbj.2019.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hegde M.L., Mantha A.K., Hazra T.K., Bhakat K.K., Mitra S., Szczesny B. Oxidative genome damage and its repair: implications in aging and neurodegenerative diseases. Mech. Ageing Dev. 2012;133:157–168. doi: 10.1016/j.mad.2012.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bjorge M.D., Hildrestrand G.A., Scheffler K., Suganthan R., Rolseth V., Kusnierczyk A., Rowe A.D., Vagbo C.B., Vetlesen S., Eide L., et al. Synergistic actions of Ogg1 and Mutyh DNA glycosylases modulate anxiety-like behavior in mice. Cell Rep. 2015;13:2671–2678. doi: 10.1016/j.celrep.2015.12.001. [DOI] [PubMed] [Google Scholar]
- 4.Lindahl T. Instability and decay of the primary structure of DNA. Nature. 1993;362:709–715. doi: 10.1038/362709a0. [DOI] [PubMed] [Google Scholar]
- 5.Krokan H.E., Bjoras M. Base excision repair. Cold Spring Harbor Perspect. Biol. 2013;5 doi: 10.1101/cshperspect.a012583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shibutani S., Takeshita M., Grollman A.P. Insertion of specific bases during DNA synthesis past the oxidation-damaged base 8-oxodG. Nature. 1991;349:431–434. doi: 10.1038/349431a0. [DOI] [PubMed] [Google Scholar]
- 7.Markkanen E. Not breathing is not an option: how to deal with oxidative DNA damage. DNA Repair. 2017;59:82–105. doi: 10.1016/j.dnarep.2017.09.007. [DOI] [PubMed] [Google Scholar]
- 8.Radak Z., Pan L., Zhou L., Mozaffaritabar S., Gu Y., R A.P., Zheng X., Ba X., Boldogh I. Epigenetic and "redoxogenetic" adaptation to physical exercise. Free Radic. Biol. Med. 2024;210:65–74. doi: 10.1016/j.freeradbiomed.2023.11.005. [DOI] [PubMed] [Google Scholar]
- 9.Hahm J.Y., Park J., Jang E.S., Chi S.W. 8-Oxoguanine: from oxidative damage to epigenetic and epitranscriptional modification. Exp. Mol. Med. 2022;54:1626–1642. doi: 10.1038/s12276-022-00822-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Weitzman S.A., Turk P.W., Milkowski D.H., Kozlowski K. Free radical adducts induce alterations in DNA cytosine methylation. Proc Natl Acad Sci U S A. 1994;91:1261–1264. doi: 10.1073/pnas.91.4.1261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Maltseva D.V., Baykov A.A., Jeltsch A., Gromova E.S. Impact of 7,8-dihydro-8-oxoguanine on methylation of the CpG site by Dnmt3a. Biochemistry. 2009;48:1361–1368. doi: 10.1021/bi801947f. [DOI] [PubMed] [Google Scholar]
- 12.Turk P.W., Laayoun A., Smith S.S., Weitzman S.A. DNA adduct 8-hydroxyl-2'-deoxyguanosine (8-hydroxyguanine) affects function of human DNA methyltransferase. Carcinogenesis. 1995;16:1253–1255. doi: 10.1093/carcin/16.5.1253. [DOI] [PubMed] [Google Scholar]
- 13.Valinluck V., Tsai H.H., Rogstad D.K., Burdzy A., Bird A., Sowers L.C. Oxidative damage to methyl-CpG sequences inhibits the binding of the methyl-CpG binding domain (MBD) of methyl-CpG binding protein 2 (MeCP2) Nucleic Acids Res. 2004;32:4100–4108. doi: 10.1093/nar/gkh739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Pan L., Zhu B., Hao W., Zeng X., Vlahopoulos S.A., Hazra T.K., Hegde M.L., Radak Z., Bacsi A., Brasier A.R., et al. Oxidized guanine base lesions function in 8-oxoguanine DNA glycosylase-1-mediated epigenetic regulation of nuclear factor kappaB-driven gene expression. J. Biol. Chem. 2016;291:25553–25566. doi: 10.1074/jbc.M116.751453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Pan L., Hao W., Xue Y., Wang K., Zheng X., Luo J., Ba X., Xiang Y., Qin X., Bergwik J., et al. 8-Oxoguanine targeted by 8-oxoguanine DNA glycosylase 1 (OGG1) is central to fibrogenic gene activation upon lung injury. Nucleic Acids Res. 2023;51:1087–1102. doi: 10.1093/nar/gkac1241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fleming A.M., Ding Y., Burrows C.J. Oxidative DNA damage is epigenetic by regulating gene transcription via base excision repair. Proc Natl Acad Sci U S A. 2017;114:2604–2609. doi: 10.1073/pnas.1619809114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Antoniali G., Lirussi L., D'Ambrosio C., Dal Piaz F., Vascotto C., Casarano E., Marasco D., Scaloni A., Fogolari F., Tell G. SIRT1 gene expression upon genotoxic damage is regulated by APE1 through nCaRE-promoter elements. Mol. Biol. Cell. 2014;25:532–547. doi: 10.1091/mbc.E13-05-0286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Vorlickova M., Tomasko M., Sagi A.J., Bednarova K., Sagi J. 8-oxoguanine in a quadruplex of the human telomere DNA sequence. FEBS J. 2012;279:29–39. doi: 10.1111/j.1742-4658.2011.08396.x. [DOI] [PubMed] [Google Scholar]
- 19.Henderson E., Hardin C.C., Walk S.K., Tinoco I., Jr., Blackburn E.H. Telomeric DNA oligonucleotides form novel intramolecular structures containing guanine-guanine base pairs. Cell. 1987;51:899–908. doi: 10.1016/0092-8674(87)90577-0. [DOI] [PubMed] [Google Scholar]
- 20.Rhodes D., Lipps H.J. G-quadruplexes and their regulatory roles in biology. Nucleic Acids Res. 2015;43:8627–8637. doi: 10.1093/nar/gkv862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fleming A.M., Zhu J., Ding Y., Burrows C.J. 8-Oxo-7,8-dihydroguanine in the context of a gene promoter G-quadruplex is an on-off switch for transcription. ACS Chem. Biol. 2017;12:2417–2426. doi: 10.1021/acschembio.7b00636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cogoi S., Ferino A., Miglietta G., Pedersen E.B., Xodo L.E. The regulatory G4 motif of the Kirsten ras (KRAS) gene is sensitive to guanine oxidation: implications on transcription. Nucleic Acids Res. 2018;46:661–676. doi: 10.1093/nar/gkx1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Amente S., Di Palo G., Scala G., Castrignano T., Gorini F., Cocozza S., Moresano A., Pucci P., Ma B., Stepanov I., et al. Genome-wide mapping of 8-oxo-7,8-dihydro-2'-deoxyguanosine reveals accumulation of oxidatively-generated damage at DNA replication origins within transcribed long genes of mammalian cells. Nucleic Acids Res. 2019;47:221–236. doi: 10.1093/nar/gky1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ding Y., Fleming A.M., Burrows C.J. Sequencing the mouse genome for the oxidatively modified base 8-Oxo-7,8-dihydroguanine by OG-seq. J. Am. Chem. Soc. 2017;139:2569–2572. doi: 10.1021/jacs.6b12604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fang Y., Zou P. Genome-wide mapping of oxidative DNA damage via engineering of 8-oxoguanine DNA glycosylase. Biochemistry. 2020;59:85–89. doi: 10.1021/acs.biochem.9b00782. [DOI] [PubMed] [Google Scholar]
- 26.Poetsch A.R., Boulton S.J., Luscombe N.M. Genomic landscape of oxidative DNA damage and repair reveals regioselective protection from mutagenesis. Genome Biol. 2018;19:215. doi: 10.1186/s13059-018-1582-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wu J., McKeague M., Sturla S.J. Nucleotide-resolution genome-wide mapping of oxidative DNA damage by click-code-seq. J. Am. Chem. Soc. 2018;140:9783–9787. doi: 10.1021/jacs.8b03715. [DOI] [PubMed] [Google Scholar]
- 28.An J., Yin M., Yin J., Wu S., Selby C.P., Yang Y., Sancar A., Xu G.L., Qian M., Hu J. Genome-wide analysis of 8-oxo-7,8-dihydro-2'-deoxyguanosine at single-nucleotide resolution unveils reduced occurrence of oxidative damage at G-quadruplex sites. Nucleic Acids Res. 2021;49:12252–12267. doi: 10.1093/nar/gkab1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Carette J.E., Raaben M., Wong A.C., Herbert A.S., Obernosterer G., Mulherkar N., Kuehne A.I., Kranzusch P.J., Griffin A.M., Ruthel G., et al. Ebola virus entry requires the cholesterol transporter Niemann-Pick C1. Nature. 2011;477:340–343. doi: 10.1038/nature10348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Neurauter C.G., Pannone M., Sousa M.M.L., Wang W., Kusnierczyk A., Luna L., Saetrom P., Scheffler K., Bjoras M. Enhanced glutathione levels confer resistance to apoptotic and ferroptotic programmed cell death in NEIL DNA glycosylase deficient HAP1 cells. Free Radic. Biol. Med. 2024;213:470–487. doi: 10.1016/j.freeradbiomed.2024.01.037. [DOI] [PubMed] [Google Scholar]
- 31.Olsen M.B., Huse C., de Sousa M.M.L., Murphy S.L., Sarno A., Obermann T.S., Yang K., Holter J.C., Jorgensen M.J., Christensen E.E., et al. DNA repair mechanisms are activated in circulating lymphocytes of hospitalized covid-19 patients. J. Inflamm. Res. 2022;15:6629–6644. doi: 10.2147/JIR.S379331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kim D., Paggi J.M., Park C., Bennett C., Salzberg S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lun A.T., Chen Y., Smyth G.K. It's DE-licious: a recipe for differential expression analyses of RNA-seq experiments using quasi-likelihood methods in edgeR. Methods Mol. Biol. 2016;1418:391–416. doi: 10.1007/978-1-4939-3578-9_19. [DOI] [PubMed] [Google Scholar]
- 34.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ramirez F., Ryan D.P., Gruning B., Bhardwaj V., Kilpert F., Richter A.S., Heyne S., Dundar F., Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Amemiya H.M., Kundaje A., Boyle A.P. The ENCODE blacklist: identification of problematic regions of the genome. Sci. Rep. 2019;9:9354. doi: 10.1038/s41598-019-45839-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Diaz A., Park K., Lim D.A., Song J.S. Normalization, bias correction, and peak calling for ChIP-seq. Stat. Appl. Genet. Mol. Biol. 2012;11 doi: 10.1515/1544-6115.1750. Article 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Rocher V., Genais M., Nassereddine E., Mourad R. DeepG4: a deep learning approach to predict cell-type specific active G-quadruplex regions. PLoS Comput. Biol. 2021;17 doi: 10.1371/journal.pcbi.1009308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Liu X., Xu C., Xiao W., Yan N. Unravelling the role of NFE2L1 in stress responses and related diseases. Redox Biol. 2023;65 doi: 10.1016/j.redox.2023.102819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.McIntosh B.E., Hogenesch J.B., Bradfield C.A. Mammalian Per-Arnt-Sim proteins in environmental adaptation. Annu. Rev. Physiol. 2010;72:625–645. doi: 10.1146/annurev-physiol-021909-135922. [DOI] [PubMed] [Google Scholar]
- 41.Xie Y., Yang H., Cunanan C., Okamoto K., Shibata D., Pan J., Barnes D.E., Lindahl T., McIlhatton M., Fishel R., et al. Deficiencies in mouse Myh and Ogg1 result in tumor predisposition and G to T mutations in codon 12 of the K-ras oncogene in lung tumors. Cancer Res. 2004;64:3096–3102. doi: 10.1158/0008-5472.can-03-3834. [DOI] [PubMed] [Google Scholar]
- 42.Xie Y., Yang H., Miller J.H., Shih D.M., Hicks G.G., Xie J., Shiu R.P. Cells deficient in oxidative DNA damage repair genes Myh and Ogg1 are sensitive to oxidants with increased G2/M arrest and multinucleation. Carcinogenesis. 2008;29:722–728. doi: 10.1093/carcin/bgn033. [DOI] [PubMed] [Google Scholar]
- 43.Osterod M., Hollenbach S., Hengstler J.G., Barnes D.E., Lindahl T., Epe B. Age-related and tissue-specific accumulation of oxidative DNA base damage in 7,8-dihydro-8-oxoguanine-DNA glycosylase (Ogg1) deficient mice. Carcinogenesis. 2001;22:1459–1463. doi: 10.1093/carcin/22.9.1459. [DOI] [PubMed] [Google Scholar]
- 44.Russo M.T., De Luca G., Degan P., Parlanti E., Dogliotti E., Barnes D.E., Lindahl T., Yang H., Miller J.H., Bignami M. Accumulation of the oxidative base lesion 8-hydroxyguanine in DNA of tumor-prone mice defective in both the Myh and Ogg1 DNA glycosylases. Cancer Res. 2004;64:4411–4414. doi: 10.1158/0008-5472.CAN-04-0355. [DOI] [PubMed] [Google Scholar]
- 45.Jun Y.W., Albarran E., Wilson D.L., Ding J., Kool E.T. Fluorescence imaging of mitochondrial DNA base excision repair reveals dynamics of oxidative stress responses. Angew. Chem. 2021;134 doi: 10.1002/anie.202111829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Morland I., Rolseth V., Luna L., Rognes T., Bjoras M., Seeberg E. Human DNA glycosylases of the bacterial Fpg/MutM superfamily: an alternative pathway for the repair of 8-oxoguanine and other oxidation products in DNA. Nucleic Acids Res. 2002;30:4926–4936. doi: 10.1093/nar/gkf618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Muftuoglu M., de Souza-Pinto N.C., Dogan A., Aamann M., Stevnsner T., Rybanska I., Kirkali G., Dizdaroglu M., Bohr V.A. Cockayne syndrome group B protein stimulates repair of formamidopyrimidines by NEIL1 DNA glycosylase. J. Biol. Chem. 2009;284:9270–9279. doi: 10.1074/jbc.M807006200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hanna B.M.F., Michel M., Helleday T., Mortusewicz O. NEIL1 and NEIL2 are recruited as potential backup for OGG1 upon OGG1 depletion or inhibition by TH5487. Int. J. Mol. Sci. 2021;22 doi: 10.3390/ijms22094542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Nakabeppu Y., Ohta E., Abolhassani N. MTH1 as a nucleotide pool sanitizing enzyme: friend or foe? Free Radic. Biol. Med. 2017;107:151–158. doi: 10.1016/j.freeradbiomed.2016.11.002. [DOI] [PubMed] [Google Scholar]
- 50.Huppert J.L., Balasubramanian S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 2007;35:406–413. doi: 10.1093/nar/gkl1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Roychoudhury S., Pramanik S., Harris H.L., Tarpley M., Sarkar A., Spagnol G., Sorgen P.L., Chowdhury D., Band V., Klinkebiel D., et al. Endogenous oxidized DNA bases and APE1 regulate the formation of G-quadruplex structures in the genome. Proc Natl Acad Sci U S A. 2020;117:11409–11420. doi: 10.1073/pnas.1912355117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hao W., Wang J., Zhang Y., Wang C., Xia L., Zhang W., Zafar M., Kang J.Y., Wang R., Ali Bohio A., et al. Enzymatically inactive OGG1 binds to DNA and steers base excision repair toward gene transcription. FASEB J. 2020;34:7427–7441. doi: 10.1096/fj.201902243R. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Pan L., Hao W., Zheng X., Zeng X., Ahmed Abbasi A., Boldogh I., Ba X. OGG1-DNA interactions facilitate NF-kappaB binding to DNA targets. Sci. Rep. 2017;7 doi: 10.1038/srep43297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ba X., Bacsi A., Luo J., Aguilera-Aguirre L., Zeng X., Radak Z., Brasier A.R., Boldogh I. 8-oxoguanine DNA glycosylase-1 augments proinflammatory gene expression by facilitating the recruitment of site-specific transcription factors. J. Immunol. 2014;192:2384–2394. doi: 10.4049/jimmunol.1302472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wang R., Hao W., Pan L., Boldogh I., Ba X. The roles of base excision repair enzyme OGG1 in gene expression. Cell. Mol. Life Sci. 2018;75:3741–3750. doi: 10.1007/s00018-018-2887-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Spruijt C.G., Gnerlich F., Smits A.H., Pfaffeneder T., Jansen P.W., Bauer C., Munzel M., Wagner M., Muller M., Khan F., et al. Dynamic readers for 5-(hydroxy)methylcytosine and its oxidized derivatives. Cell. 2013;152:1146–1159. doi: 10.1016/j.cell.2013.02.004. [DOI] [PubMed] [Google Scholar]
- 57.Bogdanovic O., Smits A.H., de la Calle Mustienes E., Tena J.J., Ford E., Williams R., Senanayake U., Schultz M.D., Hontelez S., van Kruijsbergen I., et al. Active DNA demethylation at enhancers during the vertebrate phylotypic period. Nat. Genet. 2016;48:417–426. doi: 10.1038/ng.3522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Schomacher L., Han D., Musheev M.U., Arab K., Kienhofer S., von Seggern A., Niehrs C. Neil DNA glycosylases promote substrate turnover by Tdg during DNA demethylation. Nat. Struct. Mol. Biol. 2016;23:116–124. doi: 10.1038/nsmb.3151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Olsen M.B., Hildrestrand G.A., Scheffler K., Vinge L.E., Alfsnes K., Palibrk V., Wang J., Neurauter C.G., Luna L., Johansen J., et al. NEIL3-Dependent regulation of cardiac fibroblast proliferation prevents myocardial rupture. Cell Rep. 2017;18:82–92. doi: 10.1016/j.celrep.2016.12.009. [DOI] [PubMed] [Google Scholar]
- 60.Kruger L., Mantyh P.W., Sternini C., Brecha N.C., Mantyh C.R. Calcitonin gene-related peptide (CGRP) in the rat central nervous system: patterns of immunoreactivity and receptor binding sites. Brain Res. 1988;463:223–244. doi: 10.1016/0006-8993(88)90395-2. [DOI] [PubMed] [Google Scholar]
- 61.Perillo B., Ombra M.N., Bertoni A., Cuozzo C., Sacchetti S., Sasso A., Chiariotti L., Malorni A., Abbondanza C., Avvedimento E.V. DNA oxidation as triggered by H3K9me2 demethylation drives estrogen-induced gene expression. Science. 2008;319:202–206. doi: 10.1126/science.1147674. [DOI] [PubMed] [Google Scholar]
- 62.O'Hagan H.M., Wang W., Sen S., Destefano Shields C., Lee S.S., Zhang Y.W., Clements E.G., Cai Y., Van Neste L., Easwaran H., et al. Oxidative damage targets complexes containing DNA methyltransferases, SIRT1, and polycomb members to promoter CpG Islands. Cancer Cell. 2011;20:606–619. doi: 10.1016/j.ccr.2011.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Chen E.Y., Tan C.M., Kou Y., Duan Q., Wang Z., Meirelles G.V., Clark N.R., Ma'ayan A. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinf. 2013;14:128. doi: 10.1186/1471-2105-14-128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Abentung A., Sakshaug T., Dumaru R., Liabakk N.-B., Yang M., Wang J., Bjørås M., Scheffler K. 2024. DNA Glycosylases Ogg1 and Mutyh Mediate Gene Expression of PRC2 Target Genes Important for Neuronal Processes Underlying Memory Formation. [Google Scholar]
- 65.Park J.W., Han Y.I., Kim S.W., Kim T.M., Yeom S.C., Kang J., Park J. 8-OxoG in GC-rich Sp1 binding sites enhances gene transcription in adipose tissue of juvenile mice. Sci. Rep. 2019;9 doi: 10.1038/s41598-019-52139-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Hu S., Feng J., Wang M., Wufuer R., Liu K., Zhang Z., Zhang Y. Nrf1 is an indispensable redox-determining factor for mitochondrial homeostasis by integrating multi-hierarchical regulatory networks. Redox Biol. 2022;57 doi: 10.1016/j.redox.2022.102470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Liu X., Xu C., Xiao W., Yan N. Unravelling the role of NFE2L1 in stress responses and related diseases. Redox Biol. 2023;65 doi: 10.1016/j.redox.2023.102819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Fulda S., Gorman A.M., Hori O., Samali A. Cellular stress responses: cell survival and cell death. Int J Cell Biol. 2010;2010 doi: 10.1155/2010/214074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ramon O., Sauvaigo S., Gasparutto D., Faure P., Favier A., Cadet J. Effects of 8-oxo-7,8-dihydro-2'-deoxyguanosine on the binding of the transcription factor Sp1 to its cognate target DNA sequence (GC box) Free Radic. Res. 1999;31:217–229. doi: 10.1080/10715769900300781. [DOI] [PubMed] [Google Scholar]
- 70.Hailer-Morrison M.K., Kotler J.M., Martin B.D., Sugden K.D. Oxidized guanine lesions as modulators of gene transcription. Altered p50 binding affinity and repair shielding by 7,8-dihydro-8-oxo-2'-deoxyguanosine lesions in the NF-kappaB promoter element. Biochemistry. 2003;42:9761–9770. doi: 10.1021/bi034546k. [DOI] [PubMed] [Google Scholar]
- 71.Pastukh V., Ruchko M., Gorodnya O., Wilson G.L., Gillespie M.N. Sequence-specific oxidative base modifications in hypoxia-inducible genes. Free Radic. Biol. Med. 2007;43:1616–1626. doi: 10.1016/j.freeradbiomed.2007.08.027. [DOI] [PubMed] [Google Scholar]
- 72.Bielskute S., Plavec J., Podbevsek P. Oxidative lesions modulate G-quadruplex stability and structure in the human BCL2 promoter. Nucleic Acids Res. 2021;49:2346–2356. doi: 10.1093/nar/gkab057. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All genomic data produced in the present project (OG-seq and RNA-seq) have been deposited in the NCBI GEO database under accession numbers GSE256080 and GSE256081, respectively.





