Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2021 Apr 15;17(4):e1009523. doi: 10.1371/journal.pgen.1009523

Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription–replication conflict regions

Kiwon Park 1,2,#, Jeongmin Ryoo 3,#, Heena Jeong 1,2, Minsu Kim 1,2, Sungwon Lee 1,2, Sung-Yeon Hwang 1,2, Jiyoung Ahn 4, Doyeon Kim 2, Hyungseok C Moon 5, Daehyun Baek 2, Kwangsoo Kim 6, Hye Yoon Park 5, Kwangseog Ahn 1,2,*
Editor: Natalia Gromak7
PMCID: PMC8078737  PMID: 33857133

Abstract

The comorbid association of autoimmune diseases with cancers has been a major obstacle to successful anti-cancer treatment. Cancer survival rate decreases significantly in patients with preexisting autoimmunity. However, to date, the molecular and cellular profiles of such comorbidities are poorly understood. We used Aicardi-Goutières syndrome (AGS) as a model autoimmune disease and explored the underlying mechanisms of genome instability in AGS-associated-gene-deficient patient cells. We found that R-loops are highly enriched at transcription-replication conflict regions of the genome in fibroblast of patients bearing SAMHD1 mutation, which is the AGS-associated-gene mutation most frequently reported with tumor and malignancies. In SAMHD1-depleted cells, R-loops accumulated with the concomitant activation of DNA damage responses. Removal of R-loops in SAMHD1 deficiency reduced cellular responses to genome instability. Furthermore, downregulation of SAMHD1 expression is associated with various types of cancer and poor survival rate. Our findings suggest that SAMHD1 functions as a tumor suppressor by resolving R-loops, and thus, SAMHD1 and R-loop may be novel diagnostic markers and targets for patient stratification in anti-cancer therapy.

Author summary

Mutations in SAMHD1 cause Aicardi-Goutières syndrome (AGS), a monogenic lupus-like autoimmune disease. Among AGS-associated genes, SAMHD1 is most frequently mutates in various types of tumors and malignancies, suggesting that it is biologically relevant to cancer development. Here, we show that SAMHD1 resolves R-loops induced by transcription-replication conflicts, thereby contributing to the maintenance of genome stability. Our findings provide insight into the molecular and mechanical understanding of the autoimmunity and cancer comorbidity, and suggest that SAMHD1 and R-loops are potential and reliable biomarkers in anti-cancer therapeutics.

Introduction

Recent advances in immunotherapy have led to the development of more efficient cancer therapeutics. However, the autoimmune comorbidities with cancer are the major obstacles to successful anti-cancer treatment, particularly in patients with preexisting autoimmune disease (AD). Despite of the greater risk of developing cancer in patients with preexisting AD than in the general population [13], cancer patients with preexisting AD have been excluded from immunotherapy and clinical trials because of concerns about the exacerbation of their autoimmune responses [4]. Consequently, to date, cancer survival rate has been decreasing significantly in patients with preexisting AD [5]. Therefore, the conditions in which cancer arises in AD call for the development of novel therapeutic options based on a better understanding of the molecular and cellular profiling of such comorbidities.

Most of autoimmune diseases have complex genetic factors and do not show a Mendelian pattern of inheritance. Therefore, it is challenging to elucidate the mechanism underlying the observed comorbidity between AD and cancer. Aicardi-Goutières syndrome (AGS) is a rare monogenic AD whose clinical importance is magnified as its clinical and biochemical characteristics highly overlap with much common AD, systemic lupus erythematosus (SLE) [6]. AGS mimics the congenital infection and it exhibits type I interferonopathy, which is considered central to the pathogenesis of the SLE [7]. AGS is caused by mutations in genes including SAMHD1, TREX1, RNASEH2A, and RNASEH2B, most of which play roles in nucleic acid metabolism [8]. Deficiencies in AGS-associated genes often share the common features of abnormal activation of the immune system due to aberrant nucleic acid metabolism. However, among AGS-associated genes, sterile alpha motif and HD domain-containing protein 1 (SAMHD1) is most frequently reported in the context of multiple cancer types and malignancies [914]. Therefore, SAMHD1-associated AGS provides an excellent experimental model for studying the comorbidity of AD and cancer.

SAMHD1 exhibits dual enzymatic activities: deoxynucleoside triphosphohydrolase (dNTPase) and phosphorolytic 3′-5′ -exoribonuclease activities [1517]. SAMHD1 is frequently mutated not only in AGS but also in a variety of human cancers, such as chronic lymphocytic leukemia, colorectal cancer, and epidermotropic cutaneous T-cell lymphoma [10,13,18]. Additionally, downregulation of SAMHD1 expression has been reported in cutaneous T-cell lymphoma and lung adenocarcinoma [11]. Intriguingly, the recently reported role of SAMHD1 in DNA repair and replication fork processing upon genotoxic reagent treatment is independent of its known enzymatic activity, that is, dNTPase activity [19,20]. Accumulating evidence indicates that SAMHD1 plays a role in preserving genome integrity and preventing cancer. However, the mechanisms by which SAMHD1 preserves genomic integrity against intrinsic DNA damage and by which an endogenous source causes cancer development in SAMHD1-deficiency but not in other AGS-associated gene deficiencies remain unknown.

R-loops, also known as DNA:RNA hybrids, are transcription byproducts that play crucial roles in several biological processes, such as immunoglobulin class switching and epigenetics [21]. Dysregulation of R-loops and the resulting harmful R-loops cause replication fork stalling, which could serve as a source of genome instability [22]. R-loops can be formed during transcription-replication conflicts (TRCs) within gene body regions and that they induce distinct DNA damage responses depending on the orientation of collisions between the DNA replisome and RNA polymerase [23]. Head-on (HO) collisions occur when the DNA replisome and RNA polymerase II (RNAP II) process toward each other and collide in the “HO” direction and they exacerbate the formation of genome-destabilizing R-loops [23]. On the other hand, co-directional (CD) TRCs occur when the replication and transcription machineries proceed along the same DNA template at different rates as the DNA replisome and RNAP II move in the same direction [24], and resolves R-loops. Interestingly, R-loops accumulate in fibroblasts of patients with AGS [25]. AGS-specific R-loops are quite different from the conventional R-loops, which are observed at the transcription start site, transcription termination site, and GC skew. Fibroblasts with mutations in different AGS-associated genes exhibit unique patterns of R-loop accumulation along genomic regions. However, the characteristics and biological relevance of these R-loops remains elucidated.

Genome instability is a hallmark of cancer and R-loop enrichment around TRCs has been suggested as a potent threat to genome integrity. In this study, we analyzed genome-wide DNA:RNA hybrid accumulation at TRC regions in AGS patient fibroblasts via bioinformatics analysis based on multiple databases. R-loop formation at TRC regions was found only in the fibroblasts of AGS patients bearing SAMHD1 mutations but not in those of other AGS-associated gene mutations. Despite the absence of exogenous DNA damaging stimuli, we found that SAMHD1-deficient cells exhibit R-loop-mediated genome instability. Furthermore, analyses of The Cancer Genome Atlas (TCGA) database indicated that downregulation of SAMHD1 expression occurs in various cancer types and is associated with poor survival in colorectal cancer patients. Our study identifies an unexpected function of SAMHD1 in resolving R-loops that accumulate following TRCs, thereby contributing to the maintenance of genome stability and preventing cancer.

Results

R-loops adjacent to replication origins accumulate in fibroblasts of AGS patients bearing SAMHD1 mutations

To investigate the relationship between SAMHD1 and R-loops that give rise to genome instability, we analyzed deposited DNA:RNA hybrid immunoprecipitation followed by the sequencing (DRIP-seq) data obtained from AGS patient fibroblasts with deficiencies in SAMHD1 and other AGS-associated genes [25]. We focused on the accumulation of R-loops in gene body regions, particularly with replication origins, where TRCs potentially occur. To distinguish gene body regions with or without replication origins, we employed an existing dataset that maps replication origins in actively transcribed genes [23,2628]. This dataset was established by integrating published Okazaki fragment sequencing (OK-seq) data that defines replication origins in the human genome and global run-on sequencing (GRO-seq) data to identify transcribed genes [23]. Normalized DRIP-seq reads were mapped to each gene body region: origins in gene bodies (n = 727) or centers of gene bodies (n = 484), where replication origins were present or absent, respectively. This enabled us to predict R-loop accumulation at genomic regions with replication origins in 24 kb windows (±12 kb windows centered on replication origins of gene bodies) and distinguish TRCs as HO or CD based on their collision orientations. We therefore compared R-loop accumulation levels across each gene body region in different types of fibroblasts from patients with AGS. Remarkably, SAMHD1-deficient AGS (AGS5) patient fibroblasts exhibited a much-enriched DRIP-seq read-count in comparison with fibroblasts from healthy controls, in gene body regions with replication origins (Fig 1A, left), but not in the control region of the same gene bodies without replication origins (Fig 1A, right). However, patient fibroblasts with deficiencies in other AGS-associated genes, such as TREX1 (AGS1), RNASEH2A (AGS4), and RNASEH2B (AGS2), exhibited similar levels of R-loop enrichment along both replication origins and centers of gene bodies with that of the healthy control (Fig 1A, right and left). To eliminate any bias in our bioinformatics data processing, we mapped DRIP-seq reads on random genomic regions covering approximately 40% of the whole genome, and none of the AGS patient fibroblast samples exhibited DRIP-seq read-count enrichment in these negative control regions (S1A Fig). Since such AGS-associated genes are involved in nucleic acid metabolism [8], their mutation could have altered the global transcriptome and thereby genomic TRC profile. Therefore, we have analyzed RNA sequencing (RNA-seq) data obtained from the fibroblasts of AGS patients and compared them with those of healthy control [25]. Among the selected genes we used for DRIP-seq analysis, there were only a few of differentially expressed genes (DEGs) in AGS patient fibroblasts (S1B Fig). The average DRIP-seq read-count around the DNA replication regions of gene bodies in 12 kb windows (±6 kb windows centered on replication origins of gene bodies) was quantified. R-loops in TRC regions were significantly enriched only in SAMHD1-deficient fibroblasts when compared to that of the fibroblasts from healthy control (adjusted P = 3.69x10-25, Fisher’s exact test); however, the enrichment was not observed in other AGS-associated gene-deficient fibroblasts (Fig 1B). In addition, cumulative distribution frequency curves indicated that fibroblasts bearing the SAMHD1 mutation had the highest proportion of enriched DRIP-seq read mapping at each 12 kb window along the region around the origins (Fig 1C). Interestingly, the DRIP-seq read-count at HO collision sub-regions was highly and significantly enriched in SAMHD1-deficient fibroblasts (adjusted P = 6.5x10-33, Fisher’s exact test), while other AGS associated gene-deficient fibroblasts showed insignificant differences compared to that of healthy control (S1C Fig). On the other hand, SAMHD1-deficient fibroblasts showed insignificant DRIP-seq read-count enrichment at CD collision sub-regions (adjusted P = 0.0707, Fisher’s exact test), while other AGS associated gene-deficient fibroblasts showed even lower R-loop accumulation in comparison to that of healthy control (S1D Fig). HO collisions, which occur when the DNA replisome and RNAP II collide in a head-on orientation, exacerbate the formation of genome-destabilizing R-loops, which activate DNA damage responses [23]. Instead, CD-oriented TRCs resolve R-loops [23]. These results therefore suggest that SAMHD1 deficiency promotes TRC-dependent R-loop accumulation and that these R-loops could be a potent threat to normal replication fork progression and genome integrity.

Fig 1. SAMHD1 mutations induce R-loop formation by transcription–replication conflict in AGS patient fibroblasts.

Fig 1

A, Average mapped DRIP-seq read accumulation around replication origins across the gene bodies (n = 727) (left) or center of the same gene bodies (n = 484) (right) in fibroblasts from control (yellow) and AGS patients with the indicated gene mutations (blue, SAMHD1; pink, TREX1; green, RNASEH2B; purple, RNASEH2A). For individual graph, the line indicates the mean DRIP-seq read-count of each genomic region of 24 kb windows centered on origins or center of gene bodies. B, Box graph indicating quantified average DRIP-seq read-count across 12 kb window centered on the replication origins of the gene bodies (n = 727). For each sample, orange line in the center of boxes indicates the median, and the box indicates 1st and 3rd quartiles. Statistical significance was assessed using the Fisher’s exact test followed by Bonferroni correction. C, Cumulative distribution frequency of DRIP-seq mapped read-count around each replication origins of the gene bodies in 12 kb windows centered on replication origins of the gene bodies (n = 727). Individual line represents the cumulative distribution frequency of healthy control and AGS patients with the indicated gene mutations (yellow, healthy control; blue, AGS5; pink, AGS1; green, AGS2; purple, AGS4).

SAMHD1 deficiency induces replication stress and delays S phase progression

We depleted SAMHD1 protein expression in human U2OS cells with short hairpin RNA (shRNA) expression vectors targeting SAMHD1 untranslated regions (UTRs) (S2A Fig). We examined the resulting cellular phenotypes in terms of genome instability in SAMHD1-deficiency without any genotoxic agent treatment, which exogenously induced replication stress or double-strand DNA breaks. As TRCs and abnormal accumulation of R-loops can lead to replication fork stalling and slowing of replication, we examined the proliferation of SAMHD1-knockdown cells. SAMHD1-depleted U2OS cells grew much slower than the rate at which control cells transfected with a Luciferase-targeting shRNA vector grew (S2B Fig). However, this defect in cell growth was not a result of apoptosis or cell cycle arrest (S2C, S2D and S2E Fig). Given that there was no arrest of the overall cell cycle, we examined cell cycle progression in SAMHD1-depleted cells in detail using a BrdU pulse-chase assay to determine whether there was a cell cycle delay. Unsynchronized cells were pulse-labeled with BrdU and allowed to continue cycling. Compared with control cells, only approximately half of the cells entered the next G1 phase when SAMHD1 was depleted, at 9 h after the removal of BrdU (Fig 2A and 2B). Despite the temporal or spatial separation between the replication and transcription of some genes, the two cellular processes necessarily occur on the same DNA template at the same time under certain circumstances [29]. Accordingly, TRCs inevitably occur in the S phase and cause genome instability when or where there are deficiencies in the temporal or spatial safeguards [24,30,31]. Replication stress restrains cell cycle progression, particularly when it is perceived by cell cycle checkpoint proteins [27]. This suggests that cells experiencing a high level of replication stress from TRCs and R-loops would exhibit slow S phase progression. S phase progression was analyzed in order to confirm that SAMHD1-depleted cells undergo DNA replication stress and perturbation during cell cycle progression. Cells were synchronized to the G1/ early S phase using a double thymidine block and then released to continue cycling (S3A Fig). The population in each phase of the cell cycle was determined by propidium iodide (PI) staining followed by flow cytometry analysis. SAMHD1-knockdown cells exhibited much slower S phase progression than control cells, as indicated by a higher percentage of the population entering the S phase at later time points (8 h and 10 h post-release) (Fig 2C). Considered together, these data indicate the depletion of SAMHD1 protein results in intrinsic replication stress, which in turn leads to defects in S phase progression.

Fig 2. Depletion of SAMHD1 induces cell cycle delay and spontaneous DNA damage response.

Fig 2

A, Representative flow plot of SAMHD1-depleted or control U2OS cells in BrdU pulse-chase experiment showing G1 phase cell population after BrdU labelling (pink box). B, Quantification analysis of BrdU pulse-chase experiment in A. Data represent mean ± SEM percentages of BrdU positive G1 phase (2N) cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). C, Flow cytometry prolife of SAMHD1-depleted and control U2OS cells at different time points after release from double thymidine block. Cells were subjected to PI staining followed by FACS analysis. Data represent mean ± SEM, n = 3, P values were calculated using two-way ANOVA. D, Representative images of comet assay performed under alkaline electrophoresis condition in SAMHD1-depleted and control U2OS cells. E, Quantification analysis of comet tail moment lengths for each condition showed in D. Data represent mean ± SEM, the statistical significance was assessed using two-tailed Student’s t-test (n = 50). F, Immunoblots of CHK1 phosphorylated on S345 (p-CHK1), CHK2 phosphorylated on T68 (p-CHK2), and CHK1 and CHK2 in asynchronized (Asyn) or S phase-synchronized (S phase) SAMHD1-depleted and control U2OS cells (n = 2). G, Representative image of immunofluorescence assay of H2AX phosphorylated on S139 (γH2AX) in S phase synchronized SAMHD1-depleted and control U2OS cells. Cells were co-stained with DAPI (blue) and anti-γH2AX antibodies (green) (n = 2). H, Quantification of γH2AX positive signal per nucleus for the immunofluorescence assay described in G. Data represent mean ± SEM, the statistical significance was assessed using the two-tailed Student’s t-test (n >5). I, Representative image of immunofluorescence assay of ATM phosphorylated on S1981 (p-ATM) in S phase-synchronized SAMHD1-depleted and control U2OS cells. Cells were co-stained with DAPI (blue) and anti-p-ATM antibodies (red) (n = 2). J, Quantification of p-ATM positive signal per nucleus for the immunofluorescence assay described in I. The box plot indicates values from minimum to maximum, the statistical significance was assessed using the Mann-Whitney U test (n >100). K, Representative image of immunofluorescence assay of ATR phosphorylated on S428 (p-ATR) S phase synchronized in SAMHD1-depleted and control U2OS cells. Cells were co-stained with DAPI (blue) and anti-p-ATR antibodies (green) (n = 2). L, Quantification of p-ATR positive signal per nucleus for the immunofluorescence assay described in I. The box plot indicates values from minimum to maximum, the statistical significance was assessed using the Mann-Whitney U test (n >100).

SAMHD1 is required for preventing DNA damage in non-stressed cells

As the DNA damage response is accompanied by replication stress [27] and TRC-dependent R-loops [23], we examined whether the depletion of SAMHD1 induces DNA damage and genome instability. We tested the overall genome integrity in SAMHD1-depleted U2OS cells using an alkaline single cell gel electrophoresis (SCGE) assay, which is also known as the comet assay. The comet assay under alkaline condition enables a simple evaluation of cellular DNA damage, including single- and double-stranded DNA breaks, the majority of apurinic/apyrimidinic sites, and alkali-labile DNA adducts. SAMHD1-depleted U2OS cells harbored more DNA damage than control cells, as indicated by the formation of longer tail moments (Fig 2D and 2E). Moreover, the length of comet tail moment in SAMHD1-depleted cells decreased following the reconstitution of SAMHD1 protein using a SAMHD1-HA-expressing construct (S3B and S3C Fig). A particularly interesting aspect of this observation was that DNA damage occurred spontaneously without any exogenous stimuli following the loss of SAMHD1. This suggests that SAMHD1 depletion leads to an accumulation of endogenous sources of genome instability.

Further, we examined the phosphorylation levels of CHK1 and CHK2 in both asynchronized and S-phase-synchronized SAMHD1-knockdown and control cells. CHK1 and CHK2 are the replication checkpoint proteins and substrates of the ATR and ATM checkpoint kinases, respectively. Because TRC-dependent R-loops can appear during the S phase [2931], and S phase progression was slowed down in SAMHD1-depleted cells (Fig 2C), we tested the activation levels of CHK1 and CHK2 in the S phase-synchronized cell population. The SAMHD1-depleted and control U2OS cells were evaluated 4 h after release from the double-thymidine cell cycle block, which is a widely used DNA replication inhibitor with relatively low cytotoxicity [32] (S3D Fig). The phosphorylation levels of both CHK1 and CHK2 were increased in S phase-synchronized SAMHD1-depleted cells but not in the heterogeneous cell population (Fig 2F). This result is in line with a very recent report that demonstrated S-phase specific DNA damage response in cells experiencing TRC-mediated R-loop accumulation [33]. This could explain why the spontaneous activation of DNA damage response signaling proteins in SAMHD1-deficient cells could not be observed in previous studies [10,19,20,34]. Moreover, the activation statuses of CHK1 and CHK2 were restored by ectopic SAMHD1 protein expression (S3E and S3F Fig). Consistent with this finding, S phase-synchronized SAMHD1-knockdown cells exhibited increased activation of H2AX, a major component of the DNA damage response, the phosphorylation of which serves as a sensitive marker of DNA damage (Fig 2G and 2H). HO-oriented TRCs promote ATR activation, which responds to ssDNA, while CD-oriented TRCs promote ATM activation, which is primarily activated by DSBs [23]. R-loop accumulation in SAMHD1-deficient fibroblasts at HO and CD collision sub-regions were different (S1C and S1D Fig). Therefore, we examined the phosphorylation levels of ATM and ATR in S phase-synchronized SAMHD1-depleted and control U2OS cells. SAMHD1-knockdown cells exhibited increased activation of both ATM (p-ATM) (Fig 2I and 2J) and ATR (p-ATR) (Fig 2K and 2L). Since cellular R-loop processing may convert an R-loop to a DSB [35], this result is in agreement with a previously proposed molecular mechanism of R-loop-mediated DNA damage response. Additionally, we have observed longer tail moments in asynchronized SAMHD1-depleted cells than those in control cells under neutral condition, which enables an evaluation of cellular double-stranded DNA breaks (S3G and S3H Fig). Overall, these data suggest that the depletion of SAMHD1 protein gives rise to obstacles to the maintenance of genome stability, such as TRC-dependent R-loops.

SAMHD1 prevents cellular R-loop accumulation

R-loops are considered as endogenous threats to genomic integrity [21,22], and their accumulation has been observed in tissues of cancer patients [36]. Following the bioinformatics approach that revealed TRCs-dependent R-loop accumulation in SAMHD1-deficient AGS fibroblasts (Fig 1), we analyzed cellular TRCs and R-loop accumulation following SAMHD1 depletion in S phase-synchronized U2OS cells. First, we examined whether SAMHD1-depletion in U2OS cells increases TRCs by a proximity-ligation assay (PLA) between RNA polymerase II (RNAPII) and PCNA, which are essential proteins in the transcription and replication machineries, respectively. The RNAPII-PCNA PLA signal was increased upon SAMHD1 depletion (Fig 3A and 3B). SAMHD1-knockdown or control cells were probed with S9.6 antibody, which specifically binds to DNA:RNA hybrids and detects R-loops. We observed R-loop accumulation in SAMHD1-depleted cells by DNA:RNA hybrid immunofluorescence assay using the S9.6 antibody, and the S9.6 signal (orange) intensity per nucleus was quantified after removal of the nucleolar signal (green) (Fig 3C). The nuclear fluorescence signal associated with R-loops was enhanced in cells depleted of SAMHD1 (Fig 3D). We also measured R-loop levels using an HB-GFP retention assay [37]. HB-GFP is a fusion protein consisting of the DNA:RNA hybrid-binding (HB) domain of RNaseH1 tagged with EGFP, which enables detection of R-loops in cells using flow cytometry analysis of GFP-associated fluorescence. We performed HB-GFP retention assay upon small interfering RNA (siRNA)-mediated depletion of SAMHD1 and other well-known R-loop regulators [21,38], in asynchronized U2OS cells (S4A Fig). Cells were transiently transfected with an HB-GFP-expressing vector for the HB-GFP retention assay, which quantifies only R-loop-binding HB-GFP signals following cell semi-permeabilization and washing away of proteins that do not bind R-loops (S4B Fig). To observe the HB-GFP-positive S phase cell population, we co-stained cells with PI. Flow cytometry analysis showed that SAMHD1, SETX, THOC1 and MRE11 knockdown led to an increase in the HB-GFP-positive population (Fig 3E), whereas HB-GFP transfection efficiency and expression levels were not affected by target gene knockdown (S4C Fig). Moreover, the levels of R-loop accumulation in S phase cells were enriched in SAMHD1-, SETX-, THOC1- and MRE11-knockdown cells in comparison to those in control cells (Fig 3F), while the proportion of S phase cells was not affected by target gene knockdown (S4D Fig). The R-loop accumulation in SAMHD1-depleted cells was also observed by DNA:RNA hybrid dot blots using the anti-S9.6 antibodies, dot intensity declined when SAMHD1 protein was restored (Figs 3G and S4E). The enhanced R-loop signals on dot blots of SAMHD1-knockdown cells decreased significantly upon in vitro treatment with RNase H, which specifically degrades RNAs of DNA:RNA hybrids (S4F Fig). We further confirmed R-loop accumulation in SAMHD1-depleted cells and validated our bioinformatics analysis using DNA-RNA immunoprecipitation and qPCR (DRIP-qPCR) using anti-S9.6 antibodies. In this case, the S9.6 signal was determined for genes actively transcribed such as NEK7 and SEC24D, and we used RPL13A as a positive control, which has been previously validated for the detection of R-loops [37]. Additionally, we further specified R-loop accumulating regions in NEK7 and SEC24D gene bodies into the HO collision sub-region (NEK7 HO, SEC24D HO) and CD collision sub-region (NEK7 CD, SEC24D CD) based on the dataset that maps replication origins in actively transcribed genes [23]. First, we confirmed that the depletion of SAMHD1 does not affect the genomic TRC profile, as transcription levels of the subjected genes are not affected by SAMHD1 protein depletion (S4G Fig). We assumed that the replication origin usages are not altered by SAMHD1 protein depletion, as it is previously observed in AGS fibroblasts and HEK293T cell line [19,39]. We detected the accumulation of R-loops at the RPL13A, NEK7 HO and SEC24D HO regions in SAMHD1-depleted cells compared to those in the control U2OS cells, whereas at the NEK7 CD and SEC24D CD did not show significant R-loop accumulations (Fig 3H). Importantly, in vitro RNase H treatment induced a dramatic signal decrease at RPL13A, NEK7 HO, and SEC24D HO regions, confirming that enriched signals were R-loop specific (Fig 3H). Moreover, we have investigated more number of genes such as GBE1, TRIO, MAP4K4, CALD1 and PALM2-AKAP2. The transcription levels of those genes were not altered by SAMHD1 protein depletion (S4H Fig). As expected, the DRIP signals on HO regions of all those genes were enriched by more than 1.5-fold in SAMHD1-depleted cells compared to that of control cells (S4I Fig, top). However, DRIP signals on CD regions of the genes were comparable in SAMHD1-deplted cells and control cells (MAP4K4 and PALM2-AKAP2), or slightly increased in SAMHD1-depleted cells but with larger adjusted P values than their DRIP signals on HO regions (GBE1, TRIO and CALD1) (S4I Fig, bottom). Together, these results indicate that SAMHD1 prevents R-loop formation, which strongly agrees with our bioinformatics analysis.

Fig 3. Depletion of SAMHD1 causes cellular R-loop accumulation.

Fig 3

A, Representative images of proximity ligation assay (PLA) between PCNA and RNA polymerase II (RNAPII) in S phase-synchronized SAMHD1-depleted and control U2OS cells. Cells were subjected to PLA (red) and co-stained with DAPI (blue). B, Quantification analysis of number of PLA foci per nucleus in A. RNAP II_alone and PCNA_alone are single-antibody controls from S phase-synchronized SAMHD1-depleted U2OS cells. The mean value for each data is shown as a red line. Statistical significance was assessed using the two-tailed Mann-Whitney U test (n = 50). C, Representative images of immunofluorescence assay of S9.6 nuclear signals in SAMHD1-depleted and control U2OS cells. Cells were co-stained with anti-S9.6 (orange) and anti-nucleolin antibodies (green). The nucleus (defined by DAPI image) is outlined. Where indicated, cells were incubated with or without RNase H in vitro before staining with anti-S9.6 antibodies. D, Quantification of S9.6 signal intensity per nucleus after nucleolar signal subtraction for the immunofluorescence assay described in C. The mean value for each data is shown as a red line. Statistical significance was assessed using one-way ANOVA (n >28). E, Representative fluorescence-activated cell sorting (FACS) profile of HB-GFP retention assay, examining basal R-loop levels in U2OS cells treated with control (siControl) or indicated siRNAs.pcDNA3.1 transfected siControl-treated U2OS cells are negative control for HB-GFP retention assay. F, Quantification analysis of HB-GFP retention assay co-stained with PI. Data represent mean ± SEM percentages of GFP positive S phase cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). G, Dot blot analysis of R-loop in genomic DNA extracts from SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic SAMHD1-GFP expression (n = 3). The genomic DNA were probed with anti-S9.6 and anti-ssDNA antibodies to verify equivalent DNA loading. Fold-induction was normalized to the control U2OS cells in absence of ectopic SAMHD1-GFP expression by quantifying the dot intensity of the blots and calculating the ratios of S9.6 signal to total amount of genomic DNA. H, DRIP-qPCR using the anti S9.6 antibody at RPL13A, NEK7 HO, NEK7 CD, SEC24D HO and SEC24D CD are shown in SAMHD1-depleted and control U2OS cells. Pre-immunoprecipitated samples were untreated (-) or treated (+) with RNase H, as indicated. Values are relative to those of control U2OS cells. Data represent mean ± SEM the statistical significance was assessed using the one-way ANOVA (n = 2).

SAMHD1 regulates R-loop-mediated genome instability

A correlation between SAMHD1-dependent R-loop accumulation and the cellular DNA damage response has not been previously described. To further test our hypothesis that TRC-dependent R-loops are the endogenous source of genome instability in SAMHD1-deficiency, we rescued SAMHD1-depleted cells by RNaseH1, which specifically resolves R-loops. The slower S phase in SAMHD1-depleted cells was rescued when cells were transiently transfected with a construct encoding wild-type E. coli RNaseH1 (RNH1) (Fig 4A). The level of DNA damage in SAMHD1-depleted cells was evidenced by the longer tail moment in the SCGE assay. The degree of DNA damage was also reduced by ectopic expression of RNH1 (Fig 4B and 4C). Moreover, the DNA damage response in S phase-synchronized SAMHD1-depleted cells recovered upon WT RNase H1 expression, as indicated by reduced phosphorylation levels of both CHK1 and CHK2 (Fig 4D). Consistent with this, increased activation of H2AX in the absence of SAMHD1 was also rescued by RNH1 (Fig 4E and 4F). Further, we examined whether TRCs in SAMHD1-depleted cells could be recovered by RNH1. The PCNA-RNAPII PLA signal that was enriched in SAMHD1-knockdown cells was reduced by the ectopic expression of RNH1 (Fig 4G and 4H). Overall, these results reveal that the TRC-dependent R-loops serve as the primary source of replication stress leading to genome instability in SAMHD1-deficiency.

Fig 4. Depletion of SAMHD1 induces R-loop-mediated genome instability and cellular DNA damage response.

Fig 4

A, Flow cytometry prolife of control U2OS and SAMHD1-depleted cells, in the absence or presence of ectopic RNaseH1-GFP expression at different time points after release from double thymidine block. Cells were subjected to PI staining followed by FACS analysis. Data represent mean ± SEM, n = 3, P values were calculated using two-way ANOVA. B, Representative images of comet assays performed under alkaline electrophoresis conditions in SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic RNaseH1-mCherry expression. C, Quantitative analysis of comet tail moment lengths for each condition described in B. Statistical significance was assessed using two-tailed Student’s t-test (n = 50). D, Immunoblots of CHK1 and CHK2 in S phase-synchronized SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic RNaseH1-mCherry (RNH1-mCherry) expression. Fold-induction was normalized to the empty vector-transfected control U2OS cells by quantifying the band intensity of the blots and calculating the ratios of p-CHK1 and p-CHK2 to total expression levels of CHK1 and CHK2 expression, respectively. E, Representative image of immunofluorescence assay of H2AX phosphorylated on S139 (γH2AX) in S phase synchronized SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic RNaseH1-mCherry (RNH1-mCherry) expression. Cells were co-stained with DAPI (blue) and anti-γH2AX antibodies (green) (n = 2). F, Quantification of γH2AX positive signal per nucleus for the immunofluorescence assay described in E. Data represent mean ± SEM, the statistical significance was assessed using the two-tailed Student’s t-test (n > 5). G, Representative images of proximity ligation assay (PLA) between PCNA and RNA polymerase II (RNAPII) in S phase synchronized SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic RNaseH1-GFP (RNH1-GFP) expression (n = 2). Cells were subjected to PLA (red) and co-stained with DAPI (blue). H, Quantification analysis of number of PLA foci per nucleus in G. The mean value for each data is shown as a red line. Statistical significance was assessed using the two-tailed Mann-Whitney U test (n = 322).

Endogenous SAMHD1 expression is associated with cancer

Genome instability is a hallmark of cancer, and dysregulated R-loops are a great threat to genome integrity [22]. Because we identified R-loops as an endogenous source of genome instability in SAMHD1 deficiency, we focused on the association between SAMHD1 expression and cancer development. We analyzed The Cancer Genome Atlas (TCGA) database to evaluate the SAMHD1 expression levels in various cancer types. RNA sequencing by Expectation Maximization (RSEM) analysis of SAMHD1 expression levels in different cancer types revealed that SAMHD1 expression was downregulated in tumor tissues (Fig 5A, red dots) compared to that in normal control tissues (Fig 5A, green dots). This agrees with previous reports of downregulated SAMHD1 expression in chronic lymphocytic leukemia, cutaneous T-cell lymphoma, and lung adenocarcinoma [10,11,13,18]. Consistent with our TCGA database analysis, we have identified that some of colorectal cancer-relevant SAMHD1 mutants [40] have lower expression levels than that of the wild-type SAMHD1 (S5A and S5B Fig). SAMHD1-deficient AGS fibroblasts (AGS5) exhibited a distinct R-loop formation pattern compared to fibroblasts with other AGS-associated gene deficiencies (Fig 1). Even though AGS-associated gene deficiencies often share common physiologic features and disorders caused by type I interferonopathies, malignancy has most frequently been reported in the context of SAMHD1 [10,11,14]. Thus, we examined the correlation between the R-loop accumulation patterns in the deficiency of each AGS-associated gene with cancer. The overall survival rate was compared between colorectal adenocarcinoma (COADREAD) patients with high versus low AGS-associated gene expression levels. Interestingly, patients exhibiting low expressions of SAMHD1 and RNASEH2A (Fig 5B, blue) had a significantly lower survival rate in comparison with that in patients exhibiting high SAMHD1 and RNASEH2A expressions (Fig 5B, red). However, there was no significant difference in patient survival rate between patients exhibiting high versus low expressions of TREX1 and RNASEH2B, which did not show distinct R-loop signal accumulation in DRIP-seq analysis (Fig 1A). RNASEH2B has been reported as a colorectal tumor suppressor in its knockout mouse model, but in P53 dependent manner [41]. RNASEH2A silencing causes reduced proliferation and increased apoptosis of human glioblastoma cells in vitro [42]. This observation could explain why low RNASEH2A-expressiong COADREAD patients had a low overall survival rate despite insignificant R-loop signal enrichment in RNASEH2A-mutated AGS patient fibroblasts. To test the relevance of the TRC-dependent R-loop regulatory function of SAMHD1 in cancer, we have examined whether colorectal cancer-relevant SAMHD1 mutants can regulate cellular TRCs. We identified two SAMHD1 mutants (F59C and T232M), which could be involved in cancer development due to their inabilities to regulate normal TRC-dependent R-loop regulation (S5C and S5D Fig).

Fig 5. SAMHD1 is downregulated in many cancer types and associated with cancer patient survival rate.

Fig 5

A, Box and whisker plots showing RSEM (RNA-Seq by Expectation Maximization) analyses of SAMHD1 expression levels in several cancer types (red) versus those in the corresponding normal tissues (green) in TCGA datasets (LUSC: lung squamous cell carcinoma, LUAD: lung adenocarcinoma, BLCA: bladder urothelial carcinoma, READ: rectal adenocarcinoma, COAD: colon adenocarcinoma, BRCA: breast invasive carcinoma, PRAD: prostate adenocarcinoma). The median, quartiles and range of values are shown. Statistical significance was assessed using log-rank test. B, Kaplan-Meier plots for the overall survival of SAMHD1, TREX1, RNASEH2A, and RNASEH2B high-expressor (red) and low-expressor (blue) groups of COADREAD patients. COADREAD data were obtained from TCGA database, and the cut-off was estimated by the optimal cut-point. Statistical significance was assessed using the log-rank test. The P values for Kaplan-Meier analyses of each genes are indicated.

Discussion

In this study, we set to understand the molecular mechanism and pathologies underlying the comorbidities of AD and cancer using a well-known model of systemic autoimmune disorder, AGS. An elegantly designed bioinformatics study has demonstrated that R-loop enrichment around TRCs is a potent threat to genome integrity [23]. The approach of that study inspired us to investigate whether TRC-dependent R-loop formation serves as an endogenous source of genome instability in AGS, a model disease for systemic autoimmunity. We integrated various bioinformatics data, and analyzed and compared DRIP-seq data regarding AGS patient fibroblasts. We found that R-loops are highly enriched at regions near replication origins in gene bodies only in SAMHD1-mutated AGS patient fibroblasts and that SAMHD1-deficient fibroblasts exhibited a significantly higher genome-destabilizing R-loops derived from HO TRCs. Our results are consistent with a previous report that SAMHD1 localizes to replication foci and increased fork stalling was observed by DNA fibre assay in SAMHD1- knockdown cells [19]. In line with these results, we observed that S-phase progression was delayed, and DNA damage responses were spontaneously activated in SAMHD1-deficiency cells. Importantly, we confirmed that TRCs and R-loop accumulation are enhanced in SAMHD1-deficient cells, and SAMHD1-deficient cells exhibit R-loop-mediated genome instability. Notably, by analyzing the TCGA dataset, we demonstrated that SAMHD1 transcription is downregulated in many cancer types. Moreover, the overall survival rate of COADREAD patients was found to be significantly lower in patients with low SAMHD1 expression.

SAMHD1 is an AGS-associated gene, that is reported to be frequently mutated in multiple types of cancer [14]. In addition, our TCGA analysis results and previous reports demonstrate downregulation of endogenous SAMHD1 expression in different cancer types [10,11,13,18]. However, in the previous studies, the role of SAMHD1 in the genome maintenance was examined exclusively in a biological context using cells exposed to genotoxic reagents [10,19,20]. Considering these results in light of our present result, a novel role for SAMHD1 in R-loop regulation has now been demonstrated, which when dysregulated leads to endogenous genome-destabilizing stress (S6 Fig). Moreover, we suggest that SAMHD1 and R-loop are novel diagnostic markers and targets for patient stratification in anti-cancer therapy.

In recent decade, many tumor-suppressor genes such as BRCA1, BRCA2 and BLM and downstream nucleases including XPF and MRE11 have been suggested as proteins that are important for R-loop tolerance and removal [37,38,4345]. More of classical proteins known for their roles in replication fork protection and transcription-coupled DNA repair are revealed as R-loop regulatory factors [46]. However, to develop R-loops as molecular markers in cancer and therapeutics, the context-dependent-physiological role of the R-loop should be considered. Uncontrolled R-loop formation can process to DSBs or cause ssDNA exposure [35]. On the other hand, physiological R-loop formation followed by DSBs can lay the foundation for DNA repair mechanisms such as chromosomal rearrangement, homologous recombination (HR), non-homologous end joining (NHEJ), or RNA-mediated HR [35]. Therefore, further investigation on the physiological meaning of the R-loop regarding to genome instability is needed.

Although TRC-dependent R-loops have been defined as “harmful R-loops”, which promote cellular DNA damage responses [23], the understanding of other characteristics of R-loops in disease pathologies is still very limited. Even in fibroblasts of patients with AGS, AGS-specific DRIP peaks occupy in various regions other than replication origin adjacent gene bodies [18,25]. For example, mutations in RNASEH2A and RNASEH2B genes increased R-loops in the intergenic region and LINE. Additionally, AGS-associated gene mutations increase R-loops in LTRs. One could question that how these AGS unique R-loops can have significant impact on disease pathology as they only consist a small portion of total R-loops in the whole genome (S1A Fig). However, the AGS-unique R-loops have a higher number of DRIP peaks and cover much longer genomic regions than those of the healthy control-specific R-loops [25]. Therefore, the these gene mutations are certainly “burdens” to cells due to the high load of R-loop in the genome. It will be of great interest to investigate how these R-loops with diverse characteristics function in human diseases.

Since SAMHD1 has no binding affinity and cannot directly degrade DNA:RNA hybrids in vitro [16,47], its nucleic acid binding ability might be required for the recruitment of R-loop resolving factors to R-loops adjacent to replication origins. A possible mechanism of the R-loop regulatory function of SAMHD1 is that it might cooperate with R-loop resolving cofactors, such as helicases or nucleases. For example, SAMHD1 recruits and activates the MRE11 exonuclease at stalled replication forks [19]. Recently, a new function of the MRE11-RAD50-NBS1 complex in DNA replication-associated R-loop resolution has been reported [38]. In future investigations, it is possible to evaluate the collaborative activity of SAMHD1 together with known factors in R-loop regulation. Moreover, we identified two colorectal cancer-relevant SAMHD1 mutants, which cannot restore the increased TRCs in SAMHD1-depleted cells. In addition to SAMHD1 expression defects, SAMHD1 alternations may also be involved in R-loop-mediated genome instability, and cancer development may be due to of its inability to interact with R-loop regulatory cofactors. Future studies are expected to better characterize the physiological and pathological R-loops and SAMHD1-deficiency derived R-loops in cancer development.

Materials and methods

Experimental design

Sample size for immunofluorescence microscopy and comet assay in this study were chosen according to published guidelines and to our laboratory manual. All experiments were reproduced and the number of experimental replications for each experiment is stated in figure legend. There was no data exclusion. No method of randomization or blinding was applicable.

Cell culture

U2OS cells were cultured in Dulbecco’s modified Eagle’s medium (HyClone) supplemented with 10% (v/v) fetal bovine serum (FBS, HyClone), antibiotics mixture (100 units/ml penicillin-streptomycin, Gibco), and 1% (v/v) GlutaMAX-I (Gibco). Cells were incubated at 37°C under a 5% CO2 atmosphere.

Immunoblotting

Cells were lysed using RIPA buffer (50 mM Tris [pH 7.4], 150 mM sodium chloride, 0.5% sodium deoxycholate, 0.1% SDS, and 1.0% NP-40) supplemented with 10 μM leupeptin (Sigma-Aldrich) and 1 mM phenylmethanesulfonyl fluoride (Sigma-Aldrich) and boiled at 98°C for 10 min with SDS sample buffer prior to SDS-PAGE. Primary antibodies used were mouse monoclonal anti-SAMHD1 (ORIGENE, TA501956), monoclonal mouse anti-CHK1 (Cell Signaling, 2G1D5), monoclonal rabbit anti-phospho-CHK1 (Cell Signaling, 133D3), monoclonal rabbit anti-CHK2 (Cell Signaling, 6334T), monoclonal rabbit anti-phospho-CHK2 (Cell Signaling, C13C1), rabbit anti-GAPDH (AbFroontier), rabbit anti-mCherry (abcam, ab167453), and rabbit anti- HA-Tag (C29F4) (Cell Signaling, 3724). All primary antibodies were used at a dilution of 1:1000 for Western blotting. As secondary antibodies, peroxidase-conjugated anti-mouse IgG (115-035-062) and anti-rabbit IgG (111-035-003; Jackson Laboratories) were used at a 1:5000 dilution. Signals were detected using a SuperSignal West Pico chemiluminescence kit (Thermo Fisher Scientific).

Plasmids

When indicated, cells were transiently transfected with the following plasmids. pICE-NLS-mCherry, pICE-RNaseH1-WT-NLS-mCherry, and pICE-RNaseH1-D10R-E48R-NLS-mCherrry (hereafter, Mut-RNaseH1-mCherrry) were purchased from Addgene (plasmid #60364, #60365, #60367). pcDNA3.1(+)-HA was purchased from Invitrogen (catalog #V79020), and pcDNA3.1(+)-WT SAMHD1-HA was prepared using restriction enzyme cloning. HB-GFP was prepared by cloning RNaseH1 HB domain into N3-EGFP (Clontech, catalog #6080–1) as previously described [37]. EGFP-tagged SAMHD1 was prepared by cloning full-length SAMHD1 into N3-EGFP.

In vitro cell proliferation assay

SAMHD1-depleted (shSAMHD1) and control (shLuciferase) U2OS cells (4 × 103 cells) were seeded into the wells of 24-well culture plates on day 1 of the experiment. The cells were then counted on days 3, 5, 7, 8, 10, 12, 14, and 15.

Cell synchronization by double-thymidine block

For double-thymidine block of cell cycle synchronization, U2OS cells were seeded (1.5 × 105 cells) into the wells of a 6-well plate on day 1. On day 2, 2.5 mM thymidine (Sigma, T1895) was added to the culture medium for the first thymidine block. After 18 h, the thymidine-containing culture medium was replaced with fresh growth medium without thymidine, and the cells were left for 9 h. The second round of thymidine block was given by adding 2.5 mM thymidine to the culture medium and incubating for another 18 h. If indicated, G1/S-phase–synchronized cells were released for 4 h to obtain an S-phase–synchronized cell population. The cells were then washed with PBS twice and left in culture medium. Alternatively, the cells were harvested at indicated time points from 0 to 10 h post-release for further analysis.

RNA interference and transfection

shRNAs targeting two different regions of the SAMHD1 3’-UTR and luciferase as a control were cloned into the pSUPER-retro-puro retroviral vector (laboratory collection). For the generation of SAMHD1-knockdown U2OS cells, cells were transfected with each shRNA cloned pSUPER-retro-puro retroviral vector using Lipofectamine 3000 reagent according to the manufacturer’s instructions. The transfected cells were selected in 1 μg/mL puromycin for 1 week. The shRNA-targeting sequences of the SAMHD1 3′-UTR were as follows: 5′-GCATGCTGAAGCTAAGTAACT-3′ and 5′-GGTACAAATTGGAACTAGAAA-3′. Lipofectamine 3000 (Invitrogen) transfection reagent was used for all plasmids, according to the manufacturer’s protocol.

For target knockdown using siRNAs, the following synthetic duplex siRNAs were purchased from Dharmacon (siControl, ON-TARGET plus non-targeting siRNA (D001810-01, Dharmacon); siTHOC1, ON-TARGET plus Human THOC1 siRNA—SMART pool (L-019911-00, Dharmacon); siSETX, ON-TARGET plus Human SETX siRNA—SMART pool (L-021420-00, Dharmacon); siSAMHD1, siGENOME Human SAMHD1 siRNA–Set of 4 (MQ-013950-00-0020, Dharmacon); and siMRE11, ON-TARGETplus Human MRE11 siRNA-SMART pool (L-009271-00-0005). U2OS cells were transfected with siRNA (10 nM) using the Lipofectamine 3000 reagent according to the manufacturer’s instructions. Transfections with ON-TARGETplus Non-targeting pool (Dharmacon) were performed in parallel as a negative control.

Immunofluorescence microscopy

For the immunofluorescence assays of S9.6 nuclear signals, SAMHD1-depleted or control U2OS cells were fixed with 100% ice-cold methanol for 10 min on ice and then incubated with 100% ice-cold acetone for 1 min. The slides were washed three times with 1× PBS and then incubated with or without 60 U/mL of RNase H (M0297S, NEB) at 37°C for 36 h or left untreated. The slides were subsequently briefly rinsed three times with 2% BSA/0.05% Tween (in PBS) and incubated with mouse anti-DNA:RNA hybrid S9.6 (Kerafast, ENH001) used at a dilution of 1:100 and rabbit anti-nucleolin (Abcam, ab22758) at a dilution of 1:300 in 2% BSA/0.05% Tween (in PBS) for 4 h at 4°C. The slides were then washed three times with 2% BSA/0.05% Tween (in PBS) and incubated with goat anti-rabbit AlexaFluor-488–conjugated (Invitrogen, A-11008) and goat anti-mouse AlexaFluor-568–conjugated (Molecular Probes, A11004) secondary antibodies (1:200) for 2 h at RT. The slides were then washed three times with 2% BSA/0.05% Tween (in PBS) and mounted using ProLong Gold AntiFade reagent (Invitrogen). Images were obtained using an inverted microscope Nikon Eclipse Ti2, equipped with a 1.45 numerical aperture (NA) Plan apo λ 100x oil objective and an sCMOS camera (Photometrics prime 95 B 25mm). For each field of view, images were taken with DAPI395, GFP488, and Alexa594 channels using NIS-Elements software. For quantification analysis, binary masks of nuclei and nucleoli were generated using the ROI manager and auto local thresholding by the Phansalker method in ImageJ [48]. Centers of R-loop puncta were detected by TrackNTrace [49]. The intensity of R-loop puncta was quantified by adding the intensity of 9 pixels from 3 × 3 squares. R-loop puncta inside the nucleus and outside the nucleoli were analyzed using custom code written in MATLAB. For the immunofluorescence assays of H2AX phosphorylation, cells were fixed with 3.7% formaldehyde in PBS. The slides were then incubated with γH2AX rabbit (Cell Signaling, 20E3), phospho-ATR (Ser428) rabbit (Cell Signaling, 2853S), and phospho-ATM (S1981) rabbit (Abcam, ab81292) antibodies, as wll as goat anti-HA-Tag (Genescript, A00168), at 1:200 dilution with 2% BSA (in PBS) for 2 h at RT. The slides were washed three times with 2% BSA/0.05% Tween (in PBS) and incubated with goat anti-rabbit AlexaFluor-568–conjugated (Molecular Probes, A11011), goat anti-rabbit AlexaFluor-488–conjugated (Invitrogen, A-11008), and 1:200 rabbit anti-goat-FITC-conjugated (Jackson lab, 305-095-047) antibodies at 1:200 dilution for 1 h at RT. Subsequently, the slides were washed three times with 2% BSA (in PBS) and then mounted using ProLong Gold AntiFade reagent (Invitrogen). To obtain images, the mounted specimens were visually scanned, and representative images were acquired using a Zeiss LSM 710 laser scanning confocal microscope (Carl Zeiss).

Comet assay

SAMHD1-depleted (shSAMHD1) and control (shLuciferase) U2OS cells (1.5 × 105 cells) were seeded 1 day prior to plasmid transfection. Cells were transiently transfected with either full-length SAMHD1-HA expression vector or empty vector using Lipofectamine 2000 according to the manufacturer’s instructions. For RNase H1 and SAMHD1-HA reconstitution experiments, cells were transfected with either protein expression vector or empty vector. At 48 hours post-transfection, an alkaline assay was performed on U2OS cells using an OxiSelect Comet Assay kit (CELL BIOLABS, STA-351) according to the manufacturer’s instructions. Fifty comets per slide were scored using CometScore software (TriTek).

DNA-RNA hybrid immunoprecipitation (DRIP)

DRIP was performed as described previously in [50] with the following modifications. In brief, corresponding cells were harvested and genomic DNA was extracted with DNA purification kit (QIAGEN, 51106) by manufacturer’s instruction. 20ug of extracted nucleic acids were fragmented using a restriction enzyme cocktail (20U each of EcoRI, BamHI, HindIII, BsrBI and Xho I; New England Biolabs) overnight at 37°C. Half of the fragmented nucleic acids were digested with 40U RNase H (New England Biolabs) to serve as negative control, overnight at 37°C. The digested nucleic acids were cleaned with standard phenol-chloroform extraction and re-suspended in DNase/RNase-free water. DNA-RNA hybrids were immunoprecipitated from total nucleic acids using 10ug of the mouse anti-DNA-RNA hybrid S9.6 (Kerafast, ENH001) in binding buffer (10 mM NaPO4 pH 7.0, 140 mM NaCl, 0.05% Triton X-100) and incubated overnight at 4°C. Dynabeads Protein A (Invitrogen, 10001D) were used to pull-down the DNA-antibody complexes, by incubating 3hr at 4°C. Isolated complexes were washed twice with binding buffer and once with TE buffer for 5min at RT, before elution with elution buffer (50 mM Tris pH8.0, 10mM EDTA, 0.5% SDS, 2.5ug protease k for 30min at 55°C. Subsequently, DNA was purified following the standard phnol-chloroform extract method and used for qPCR.

Quantitative real-time PCR (qPCR)

Equivalent amounts of purified genomic DNA (50 ∼ 100 ng) from each sample were analyzed by qPCR. For RT (reverse transcription)-qPCR, 1 ∼ 2 μg of RNA was reverse-transcribed using the ReverTra Ace qPCR RT Kit (TOYOBO) following the manufacturer’s instructions. The resulting complementary DNA (cDNA) was diluted with sterile deionized H2O (1:6). RT-qPCR reactions were performed in TOPreal qPCR PreMIX (Enzynomics) with 2 μl of the diluted cDNA. The total reaction volume was 20 μl, and all reactions were performed in triplicate. The PCR reactions were performed using the iCycler iQ real-time PCR detection system (Bio-Rad). Data were normalized according to the expression levels of β-actin or MDM2, as indicated. qPCR analyses were performed using the specific primers listed in S1 Table.

Proximity ligation assay (PLA)

For the PCNA–RNAPII PLA, cells grown on coverslips were fixed with 3.7% formaldehyde for 20 min at room temperature and permeabilized with 0.2% Triton X-100 for 5 min. Cells were then blocked with 1ꓫ blocking solution (Merck, DUO92102) for 1 h at 37°C in a humidity chamber. After blocking, cells were incubated with primary antibody overnight at 4°C (for PCNA–RNAPII PLA; 1:500 rabbit anti-PCNA antibody (ab18197, Abcam), 1:100 mouse anti-RNAPII (Santa Cruz, sc-56767), and 1:200 goat anti-HA-Tag (Genescript, A00168). The next day, after washing with 1× Wash buffer A twice (Merck, DUO92102), cells were incubated with pre-mixed Duolink PLA plus, PLA minus probes, and 1:200 rabbit anti-goat-FITC-conjugated (Jackson lab, 305-095-047) antibodies for 1 h at 37°C. The subsequent steps in the proximal ligation assay were carried out using the Duolink PLA Fluorescence kit (Sigma) according to the manufacturer’s instructions. To obtain images, the mounted specimens were visually scanned, and representative images were acquired using a Zeiss LSM 710 laser scanning confocal microscope (Carl Zeiss). For quantification analysis, binary masks of nuclei were generated by consecutive steps of Gaussian blur, Bernsen auto local thresholding, watershed segmentation, and Analyze Particles function in ImageJ. PLA puncta were detected and analyzed by custom code written in ImageJ macro language utilizing Gaussian blur, manual thresholding, and Analyze Particles function.

Cell-cycle analysis

Cell cycle analysis was performed by both PI staining and pulse-chase BrdU (Sigma) incorporation, as previously described [10]. For BrdU pulse-chase experiments, a pulse of 10 mM BrdU was added to the cell culture 30 min before harvesting. Afterwards, BrdU-free medium was added to the cell culture, and analyses were performed every 3 h for an additional 12 h. Cells were fixed with 70% ethanol (EtOH), and the DNA was denatured using 1 ml of 1 N HCl/0.05% Triton X-100 and then neutralized with 1 ml of 0.1 M Na2B4O7. Cells were blocked using PBS/1%BSA/0.5% Tween-20 and incubated with mouse anti-BrdU primary antibody (Cell Signaling, 5292), followed by goat-donkey-mouse AlexaFluor-488–conjugated secondary antibody (Life Technologies, A21202). The cells were then subjected to PI staining followed by flow cytometry analysis. For PI staining, cells were harvested and fixed in pre-chilled 70% EtOH and then stored overnight at −20°C. Fixed cells were recovered and resuspended in PBS containing 100 μg/ml of RNase A (Thermo Fisher Scientific, R1253) and 10 μg/ml of PI and incubated for 12 h at 4°C and then analyzed using a Flow-Activated Cell Sorter Canto II (BD Bioscience) and Flowjo software (Flowjo).

RNA/DNA hybrid dot blotting

Total genomic DNA was extracted using a QIAmp DNA mini kit (QIAZEN, 51304) according to the manufacturer’s instructions. Genomic DNA (1.2 μg) was treated with 2 U of RNase H (NEB, M2097) per microgram of genomic DNA for 48 h at 37°C, with half of the sample left untreated but denatured. Half of the DNA sample was probed with S9.6 antibody (1:1000), and the other half was probed with anti-ssDNA antibody (MAB3034, Millipore, 1:10000).

Annexin V-FITC/ PI double-staining assay

Annexin V-FITC/PI double staining was performed using an Annexin V-FITC kit (BD Bioscience, 556547). The cells were labeled with FITC-conjugated Annexin V and 5 μl of PI according to the manufacturer’s instructions. After incubation in the dark for 15 min at RT, the cells were analyzed by flow cytometry. The AnnexinV-FITC-negative/PI-negative population was regarded as normal, whereas the Annexin V-FITC–positive/PI–negative and Annexin V-FITC–positive/PI–positive populations were taken as measurements of early and late apoptotic/necrotic cells, respectively.

HB-GFP retention fluorescence-activated cell sorting assay

SAMHD1-depleted (shSAMHD1) and control (shLuciferase) U2OS cells were transfected with HB-GFP. At 24 h post-transfection, cells were harvested and resuspended in ice-cold PBS containing 2% FBS and then permeabilized for 4 min with 0.05% Triton X-100 in the presence of protease inhibitor cocktail (Cell Signaling, 5872S). Three volumes of 100% ice-cold ethanol were added, and the sample was kept on ice for 1 h and then co-stained with PI and then analyzed via flow cytometry.

Data collection and TCGA dataset analysis

RNA sequencing (RNA-seq) data and clinical information for patients with lung squamous cell carcinoma, lung adenocarcinoma, bladder urothelial carcinoma, COADREAD, breast invasive carcinoma, and prostate adenocarcinoma were downloaded from TCGA dataset using the UCSC Xena browser, as previously described [51]. Gene expression profiles were generated experimentally using an Illumina HiSeq 2000 RNA sequencing platform by the University of North Carolina TCGA Genome Characterization Center. RNA-seq data were quantified using RSEM, as described previously [52]. Associations between the SAMHD1 expression and clinicopathologic features of patients with each type of cancer were analyzed and plotted. Overall survival of COADREAD patients was determined using clinical and demographic information downloaded from the Pan-cancer Project TCGA dataset. The patients were separated into two groups depending on the expression levels of SAMHD1, TREX1, RNASEH2A, or RNASEH2B, with the cut-off being the optimal cut-point. Survival curves for high-expressing and low-expressing groups were produced using the Kaplan-Meier method, and log-rank tests were used to assess differences in survival between groups.

Bioinformatics analysis

DRIP-seq data from control, AGS1, AGS2, AGS4 and AGS5 patients were obtained from the Gene Expression Omnibus database under the accession number GSE57353 [25]. Fastq files of each replicates were filtered to obtain files with 100nt read length and those identical lengths of reads were used for analysis. For each fastq files, adapter sequences were trimmed with cutAdapt, and artifacts were removed by fastx-toolkit. Trimmed reads were mapped to hg19 with bowtie2. Alignment was performed with ‘—very-sensitive’ option & 1nt mismatch was allowed on each reads. Duplicates reads were removed by samtools. DRIP-signal for each 100nt region was defined as mapped read-count, which was normalized to 10M total duplicate-removed read-count. To analyze DRIP-seq read-count of AGS patients for each region, we obtained dataset from under the accession number GSE93368 and custom python scripts from GitHub (https://github.com/cimprichlab/Hamperl_et_al) [23]. The custom python script was modified according to the modified datasets as described above. The DRIP-seq data for control, AGS1, AGS2, AGS4, and AGS5 patients were aggregated to these classified origins, and then DRIP-seq read-count profiles were calculated over a 24 kb window around origins or centers of gene bodies. As previously described [23], in order for HO collision to have a negative genome coordinate from origins, the windows were inverted in prior to aggregation in gene bodies those are transcribed on negative strands. The modified custom python scripts used to determine DRIP-seq signal across TRC regions were made available under GitHub (https://github.com/spiritmage72/samhd1_rloop). Metaplot (Fig 1A) and boxplot (Figs 1B, S1B and S1C) were obtained with means or medians, which were bootstrapped 10,000 times.

Raw fastq files were imported from published data under accession number GSE57353. From the 484 genes selected for DRIP-seq analysis, we investigated 467 genes whose ENSEMBL ID matched the expression level dataset. To identify DEGs of control versus those of AGS patient fibroblasts, we used the DESeq2 tool as previously described [53]. The detection criteria to identify DEGs were |log2(fold change)| >2 and adjusted P value <0.01.

Statistical analyses

Statistical parameters, including the number of biological replicates (n), standard deviation, and statistical significance, are indicated in each figure and figure legend. Statistical analysis was performed with GraphPad Prism 5. Statistical significance was determined by two-tailed Student’s t test, two-way ANOVA or two-tailed Mann-Whitney U test, as indicated.

For DRIP-seq data bioinformatics analyses, statistical analysis was performed with python package, SciPy. For DRIP-seq read-count enrichment comparison, a contingency table was computed for each AGS patient fibroblast DRIP-seq sample by normalized read-counts mapped to 12 kb or 6 kb window. The adjusted P values were calculated by using the Fisher’s exact test followed by Bonferroni correction for multiple test. Means or medians used for statistical analysis were not bootstrapped.

Supporting information

S1 Fig. R-loop accumulation at the origins of gene bodies in SAMHD1-deficient AGS patient fibroblasts.

A, Average mapped DRIP-seq read accumulation around random genomic position (n = 50,000) in fibroblasts from control (yellow) and AGS patients with the indicated gene mutations (blue, SAMHD1; pink, TREX1; green, RNASEH2B; purple, RNASEH2A). For individual graph, the line indicates the mean DRIP-seq read-count of each genomic region of 24 kb windows centered on random genomic position. B, Volcano plot of RNA-seq transcriptome data displaying the pattern of gene expression values for fibroblasts from control and AGS patients (AGS5, AGS1, AGS2, and AGS4). Significantly differentially expressed genes (DEGs) (|log2(fold change)| >2 and adjusted P value <0.01) are highlighted in red (upregulated) or blue (downregulated). None of the DEGs are highlighted in black. Selected genes used for DRIP-seq analysis (n = 467) are highlighted in dark green. The number of DEGs among selected genes used for DRIP-seq analysis for each AGS patient are indicated. C, Box graph indicating quantified average DRIP-seq read-count across the 6 kb window upstream from replication origins of the gene bodies (n = 727), where HO collision sub-regions exist. D, Box graph indicating quantified average DRIP-seq read-count across a 6-kb window downstream from replication origins of the gene bodies (n = 727), where CD collision sub-regions exist. In C and D, the orange line at the center of the boxes indicates the median, and the boxes indicate 1st and 3rd quartiles. Statistical significance was assessed using Fisher’s exact test followed by Bonferroni correction.

(TIF)

S2 Fig. Depletion of SAMHD1 does not affect cellular apoptosis or cell cycle arrest.

A, SAMHD1 in U2OS cells transfected with vectors expressing shRNA targeting luciferase control (shLuc) and two different SAMHD1 UTRs (shSAMHD1-#1 and shSAMHD1-#2). B, In vitro cell proliferation assay of SAMHD1-depleted or control U2OS cells. Data represent the mean ± SD, n = 3. The P value was calculated according to two-way ANOVA. C, Representative flow plot of Annexin V-FITC assay of SAMHD1-depleted and control U2OS cells. Apoptosis was determined by staining cells with FITC-conjugated Annexin V and PI, followed by flow cytometry analysis. Four populations are indicated as Q1, necrotic; Q2, late apoptosis; Q3, live; and Q4, early apoptotic. D, Quantification of Annexin V-FITC apoptosis assay results showing the percentage of cell death modes: live cells, early apoptosis, and late apoptosis/necrosis in SAMHD1-depleted and control U2OS cells. Statistical significance was assessed by two-way ANOVA. Data represent the mean ± SEM (n = 3, P < 0.001). E, Representative cell cycle profile analysis by flow cytometry following labeling of cells with BrdU. SAMHD1-depleted and control U2OS cells were pulsed with BrdU. The BrdU-positive population was followed over time as it transitioned through the cell cycle.

(TIF)

S3 Fig. Depletion of SAMHD1 leads to increased DNA damage.

A, Summary of experimental design for Fig 2C, representing experimental process of double thymidine block followed by time course sample harvest. B, Representative images of comet assays performed under alkaline electrophoresis conditions in SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic SAMHD1-HA expression. C, Quantitative analysis of comet tail moment lengths for each condition described in B. Statistical significance was assessed using two-tailed Student’s t-test (n >50). D, Summary of experimental design for Fig 2F, representing experimental process to obtain heterogeneous (Asyn) or S phase-synchronized cellular population by double thymidine block. E, Summary of experimental design for F, representing experimental process of SAMHD1-HA reconstitution followed by double thymidine block to obtain S phase-synchronized cellular population. F, Immunoblots of CHK1 phosphorylated on S345 (p-CHK1), CHK2 phosphorylated on T68 (p-CHK2), CHk1 and CHk2 in S phase-synchronized SAMHD1-depleted and control U2OS cells, in absence or presence of ectopic SAMHD1-HA expression in a does dependent manner (n = 3). G, Representative images of comet assays performed under neutral conditions in SAMHD1-depleted and control U2OS cells. H, Quantitative analysis of comet tail moment lengths for each condition described in G. Statistical significance was assessed using two-tailed Student’s t-test (n >50).

(TIF)

S4 Fig. Accumulation of R-loops in SAMHD1-depleted cells.

A, Indicated gene expression as measured by RT-qPCR in control (siControl) or indicated siRNAs treated U2OS cells. Data represent mean ± SEM, n = 3, P values were calculated according to two-tailed Student’s t-test. B, Schematic representation of HB-GFP retention assay. C, Quantification analysis of HB-GFP retention assay co-stained with PI. Data represent mean ± SEM percentages of GFP positive cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). P > 0.05; n.s, not significant. D, Quantification analysis of HB-GFP retention assay co-stained with PI. Data represent mean ± SEM percentages of S phase cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). P > 0.05; n.s, not significant. E, Immunoblots of SAMHD1 in cells described in Fig 3G, showing ectopic expression of GFP-tagged SAMHD1 in SAMHD1-depleted and control U2OS cells. F, Dot blot analysis of R-loops in genomic DNA extracted from SAMHD1-depleted and control U2OS cells (n = 3). The total cellular genomic DNA from each cell line was incubated with or without RNase H in prior to loading for blotting, as indicated. The genomic DNA loaded membrane were processed to immunoblotting with anti-S9.6 and anti-ssDNA antibodies to verify equivalent DNA loading. Fold-induction was normalized to the control U2OS cells by quantifying the dot intensity of the blots and calculating the ratios of S9.6 signal to total amount of genomic DNA.G and H, Indicated gene expression as measured by RT-qPCR in control or SAMHD1-depleted U2OS cells. Data represent mean ± SEM, n = 3, P values were calculated according to two-tailed Student’s t-test. P > 0.05; n.s, not significant. I, DRIP-qPCR using the anti S9.6 antibody at GBE1 HO, GBE1 CD, MAP4K4 HO, MAP4K4 CD, CALD1 HO, CALD1 CD, PALM2-AKAP2 HO and PALM2-AKAP2 CD are shown in SAMHD1-depleted and control U2OS cells. Pre-immunoprecipitated samples were untreated (-) or treated (+) with RNase H, as indicated. Values are relative to those of control U2OS cells. Data represent mean ± SEM the statistical significance was assessed using the one-way ANOVA (n = 2).

(TIF)

S5 Fig. Colorectal cancer-relevant SAMHD1 mutants are associated with TRCs in U2OS cells.

A, Immunoblots of SAMHD1 in U2OS cells transfected with vector expressing indicated SAMHD1 mutants tagged with HA. B, Genomic DNA extracted from U2OS cells described in A were analyzed by quantitative PCR. The relative DNA copy number for each vector was determined by Puromycin resistance sequence and normalized to MDM2. Data represent mean ± SEM, n = 2, P values were calculated according to two-tailed Student’s t-test. P > 0.05; n.s, not significant. C, Representative images of proximity ligation assay (PLA) between PCNA and RNA polymerase II (RNAPII) in S phase synchronized SAMHD1-depleted and control U2OS cells, in the absence or presence of indicated SAMHD1 mutant ectopic expression (SAMHD1-HA) (n = 2). Cells were subjected to PLA (red), and co-stained with HA (green) and DAPI (blue). D, Quantification analysis of number of PLA foci per nucleus in C. The mean value for each data is shown as a red line. Statistical significance was assessed using the two-tailed Mann-Whitney U test (n > 70).

(TIF)

S6 Fig. Model of the role of SAMHD1 at transcription-replication conflict-derived R-loops.

In normal cells (left), R-loops formed by transcription-replication collision, are resolved by SAMHD1, which allows successful DNA replication and RNA transcription. In SAMHD1-deficient cells (right), unresolved R-loops at transcription-replication conflict regions showed genomic instability.

(TIF)

S1 Table. Oligonucleotides used in this study.

(DOCX)

S1 Data. Source Data: Spreadsheet of source data shown in this study.

(XLSX)

Acknowledgments

We are grateful to Dr. Dong-Yeon Cho for help with bioinformatics analysis and discussion.

Data Availability

The bioinformatics data and figures reported in this study are available under GitHub (https://github.com/spiritmage72/samhd1_rloop). The modified custom python scripts used to determine DRIP-seq signal across HO and CD regions were made available under GitHub (https://github.com/spiritmage72/samhd1_rloop). All relevant underlying numerical data for all graphs and summary statistics are either in Spreadsheets of Supporting information or included in relevant data.

Funding Statement

This work was supported by the Institute for Basic Science of the Ministry of Science Grant (IBS-R008-D1) and the National Research Foundation of Korea grant funded by the Korea government (NRF-2020R1A5A1018081) (to K. A.), (NRF-2020R1A2C3011298) (to K.A. and K.P.), and the BK21 plus fellowship (A0423-20200100) (to K.P.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Yu KH, Kuo CF, Huang LH, Huang WK, See LC. Cancer Risk in Patients With Inflammatory Systemic Autoimmune Rheumatic Diseases: A Nationwide Population-Based Dynamic Cohort Study in Taiwan. Medicine (Baltimore). 2016;95(18):e3540. 10.1097/MD.0000000000003540 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Liu J, Blake SJ, Smyth MJ, Teng MW. Improved mouse models to assess tumour immunity and irAEs after combination cancer immunotherapies. Clin Transl Immunology. 2014;3(8):e22. 10.1038/cti.2014.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Franks AL, Slansky JE. Multiple associations between a broad spectrum of autoimmune diseases, chronic inflammatory diseases and cancer. Anticancer Res. 2012;32(4):1119–36. [PMC free article] [PubMed] [Google Scholar]
  • 4.Haanen J, Ernstoff MS, Wang Y, Menzies AM, Puzanov I, Grivas P, et al. Autoimmune diseases and immune-checkpoint inhibitors for cancer therapy: review of the literature and personalized risk-based prevention strategy. Ann Oncol. 2020. 10.1016/j.annonc.2020.03.285 [DOI] [PubMed] [Google Scholar]
  • 5.Young A, Quandt Z, Bluestone JA. The Balancing Act between Cancer Immunity and Autoimmunity in Response to Immunotherapy. Cancer Immunol Res. 2018;6(12):1445–52. 10.1158/2326-6066.CIR-18-0487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rice GI, Bond J, Asipu A, Brunette RL, Manfield IW, Carr IM, et al. Mutations involved in Aicardi-Goutieres syndrome implicate SAMHD1 as regulator of the innate immune response. Nat Genet. 2009;41(7):829–32. 10.1038/ng.373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Crow MK. Type I interferon in systemic lupus erythematosus. Curr Top Microbiol Immunol. 2007;316:359–86. 10.1007/978-3-540-71329-6_17 [DOI] [PubMed] [Google Scholar]
  • 8.Crow YJ, Rehwinkel J. Aicardi-Goutieres syndrome and related phenotypes: linking nucleic acid metabolism with autoimmunity. Hum Mol Genet. 2009;18(R2):R130–6. 10.1093/hmg/ddp293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Crow YJ, Manel N. Aicardi-Goutieres syndrome and the type I interferonopathies. Nat Rev Immunol. 2015;15(7):429–40. 10.1038/nri3850 [DOI] [PubMed] [Google Scholar]
  • 10.Clifford R, Louis T, Robbe P, Ackroyd S, Burns A, Timbs AT, et al. SAMHD1 is mutated recurrently in chronic lymphocytic leukemia and is involved in response to DNA damage. Blood. 2014;123(7):1021–31. 10.1182/blood-2013-04-490847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.de Silva S, Wang F, Hake TS, Porcu P, Wong HK, Wu L. Downregulation of SAMHD1 expression correlates with promoter DNA methylation in Sezary syndrome patients. J Invest Dermatol. 2014;134(2):562–5. 10.1038/jid.2013.311 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Li M, Zhang D, Zhu M, Shen Y, Wei W, Ying S, et al. Roles of SAMHD1 in antiviral defense, autoimmunity and cancer. Rev Med Virol. 2017;27(4). 10.1002/rmv.1931 [DOI] [PubMed] [Google Scholar]
  • 13.Rentoft M, Lindell K, Tran P, Chabes AL, Buckland RJ, Watt DL, et al. Heterozygous colon cancer-associated mutations of SAMHD1 have functional significance. Proc Natl Acad Sci U S A. 2016;113(17):4723–8. 10.1073/pnas.1519128113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rodero MP, Crow YJ. Type I interferon-mediated monogenic autoinflammation: The type I interferonopathies, a conceptual overview. J Exp Med. 2016;213(12):2527–38. 10.1084/jem.20161596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Beloglazova N, Flick R, Tchigvintsev A, Brown G, Popovic A, Nocek B, et al. Nuclease activity of the human SAMHD1 protein implicated in the Aicardi-Goutieres syndrome and HIV-1 restriction. J Biol Chem. 2013;288(12):8101–10. 10.1074/jbc.M112.431148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ryoo J, Choi J, Oh C, Kim S, Seo M, Kim SY, et al. The ribonuclease activity of SAMHD1 is required for HIV-1 restriction. Nat Med. 2014;20(8):936–41. 10.1038/nm.3626 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Goldstone DC, Ennis-Adeniran V, Hedden JJ, Groom HC, Rice GI, Christodoulou E, et al. HIV-1 restriction factor SAMHD1 is a deoxynucleoside triphosphate triphosphohydrolase. Nature. 2011;480(7377):379–82. 10.1038/nature10623 [DOI] [PubMed] [Google Scholar]
  • 18.Merati M, Buethe DJ, Cooper KD, Honda KS, Wang H, Gerstenblith MR. Aggressive CD8(+) epidermotropic cutaneous T-cell lymphoma associated with homozygous mutation in SAMHD1. JAAD Case Rep. 2015;1(4):227–9. 10.1016/j.jdcr.2015.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Coquel F, Silva MJ, Techer H, Zadorozhny K, Sharma S, Nieminuszczy J, et al. SAMHD1 acts at stalled replication forks to prevent interferon induction. Nature. 2018;557(7703):57–61. 10.1038/s41586-018-0050-1 [DOI] [PubMed] [Google Scholar]
  • 20.Daddacha W, Koyen AE, Bastien AJ, Head PE, Dhere VR, Nabeta GN, et al. SAMHD1 Promotes DNA End Resection to Facilitate DNA Repair by Homologous Recombination. Cell Rep. 2017;20(8):1921–35. 10.1016/j.celrep.2017.08.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Santos-Pereira JM, Aguilera A. R loops: new modulators of genome dynamics and function. Nat Rev Genet. 2015;16(10):583–97. 10.1038/nrg3961 [DOI] [PubMed] [Google Scholar]
  • 22.Aguilera A, Garcia-Muse T. R loops: from transcription byproducts to threats to genome stability. Mol Cell. 2012;46(2):115–24. 10.1016/j.molcel.2012.04.009 [DOI] [PubMed] [Google Scholar]
  • 23.Hamperl S, Bocek MJ, Saldivar JC, Swigut T, Cimprich KA. Transcription-Replication Conflict Orientation Modulates R-Loop Levels and Activates Distinct DNA Damage Responses. Cell. 2017;170(4):774–86 e19. 10.1016/j.cell.2017.07.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hamperl S, Cimprich KA. Conflict Resolution in the Genome: How Transcription and Replication Make It Work. Cell. 2016;167(6):1455–67. 10.1016/j.cell.2016.09.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lim YW, Sanz LA, Xu X, Hartono SR, Chedin F. Genome-wide DNA hypomethylation and RNA:DNA hybrid accumulation in Aicardi-Goutieres syndrome. Elife. 2015;4. 10.7554/eLife.08007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Petryk N, Kahli M, d’Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Silvain M, et al. Replication landscape of the human genome. Nat Commun. 2016;7:10208. 10.1038/ncomms10208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Zeman MK, Cimprich KA. Causes and consequences of replication stress. Nat Cell Biol. 2014;16(1):2–9. 10.1038/ncb2897 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Andersson R, Refsing Andersen P, Valen E, Core LJ, Bornholdt J, Boyd M, et al. Nuclear stability and transcriptional directionality separate functionally distinct RNA species. Nat Commun. 2014;5:5336. 10.1038/ncomms6336 [DOI] [PubMed] [Google Scholar]
  • 29.Gomez-Gonzalez B, Aguilera A. Transcription-mediated replication hindrance: a major driver of genome instability. Genes Dev. 2019;33(15–16):1008–26. 10.1101/gad.324517.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Garcia-Muse T, Aguilera A. Transcription-replication conflicts: how they occur and how they are resolved. Nat Rev Mol Cell Biol. 2016;17(9):553–63. 10.1038/nrm.2016.88 [DOI] [PubMed] [Google Scholar]
  • 31.Meryet-Figuiere M, Alaei-Mahabadi B, Ali MM, Mitra S, Subhash S, Pandey GK, et al. Temporal separation of replication and transcription during S-phase progression. Cell Cycle. 2014;13(20):3241–8. 10.4161/15384101.2014.953876 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Amin MA, Varma D. Combining Mitotic Cell Synchronization and High Resolution Confocal Microscopy to Study the Role of Multifunctional Cell Cycle Proteins During Mitosis. J Vis Exp. 2017(130). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lam FC, Kong YW, Huang Q, Vu Han TL, Maffa AD, Kasper EM, et al. BRD4 prevents the accumulation of R-loops and protects against transcription-replication collision events and DNA damage. Nat Commun. 2020;11(1):4083. 10.1038/s41467-020-17503-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kretschmer S, Wolf C, Konig N, Staroske W, Guck J, Hausler M, et al. SAMHD1 prevents autoimmunity by maintaining genome stability. Ann Rheum Dis. 2015;74(3):e17. 10.1136/annrheumdis-2013-204845 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Crossley MP, Bocek M, Cimprich KA. R-Loops as Cellular Regulators and Genomic Threats. Mol Cell. 2019;73(3):398–411. 10.1016/j.molcel.2019.01.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Zhang X, Chiang HC, Wang Y, Zhang C, Smith S, Zhao X, et al. Attenuation of RNA polymerase II pausing mitigates BRCA1-associated R-loop accumulation and tumorigenesis. Nat Commun. 2017;8:15908. 10.1038/ncomms15908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bhatia V, Barroso SI, Garcia-Rubio ML, Tumini E, Herrera-Moyano E, Aguilera A. BRCA2 prevents R-loop accumulation and associates with TREX-2 mRNA export factor PCID2. Nature. 2014;511(7509):362–5. 10.1038/nature13374 [DOI] [PubMed] [Google Scholar]
  • 38.Chang EY, Tsai S, Aristizabal MJ, Wells JP, Coulombe Y, Busatto FF, et al. MRE11-RAD50-NBS1 promotes Fanconi Anemia R-loop suppression at transcription-replication conflicts. Nat Commun. 2019;10(1):4265. 10.1038/s41467-019-12271-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Franzolin E, Coletta S, Ferraro P, Pontarin G, D’Aronco G, Stevanoni M, et al. SAMHD1-deficient fibroblasts from Aicardi-Goutieres Syndrome patients can escape senescence and accumulate mutations. FASEB J. 2020;34(1):631–47. 10.1096/fj.201902508R [DOI] [PubMed] [Google Scholar]
  • 40.Kohnken R, Kodigepalli KM, Wu L. Regulation of deoxynucleotide metabolism in cancer: novel mechanisms and therapeutic implications. Mol Cancer. 2015;14:176. 10.1186/s12943-015-0446-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Aden K, Bartsch K, Dahl J, Reijns MAM, Esser D, Sheibani-Tezerji R, et al. Epithelial RNase H2 Maintains Genome Integrity and Prevents Intestinal Tumorigenesis in Mice. Gastroenterology. 2019;156(1):145–59 e19. 10.1053/j.gastro.2018.09.047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Dai B, Zhang P, Zhang Y, Pan C, Meng G, Xiao X, et al. RNaseH2A is involved in human gliomagenesis through the regulation of cell proliferation and apoptosis. Oncol Rep. 2016;36(1):173–80. 10.3892/or.2016.4802 [DOI] [PubMed] [Google Scholar]
  • 43.Hatchi E, Skourti-Stathaki K, Ventz S, Pinello L, Yen A, Kamieniarz-Gdula K, et al. BRCA1 recruitment to transcriptional pause sites is required for R-loop-driven DNA damage repair. Mol Cell. 2015;57(4):636–47. 10.1016/j.molcel.2015.01.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chang EY, Novoa CA, Aristizabal MJ, Coulombe Y, Segovia R, Chaturvedi R, et al. RECQ-like helicases Sgs1 and BLM regulate R-loop-associated genome instability. J Cell Biol. 2017;216(12):3991–4005. 10.1083/jcb.201703168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sollier J, Stork CT, Garcia-Rubio ML, Paulsen RD, Aguilera A, Cimprich KA. Transcription-coupled nucleotide excision repair factors promote R-loop-induced genome instability. Mol Cell. 2014;56(6):777–85. 10.1016/j.molcel.2014.10.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chang EY, Stirling PC. Replication Fork Protection Factors Controlling R-Loop Bypass and Suppression. Genes (Basel). 2017;8(1). 10.3390/genes8010033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Tungler V, Staroske W, Kind B, Dobrick M, Kretschmer S, Schmidt F, et al. Single-stranded nucleic acids promote SAMHD1 complex formation. J Mol Med (Berl). 2013;91(6):759–70. 10.1007/s00109-013-0995-3 [DOI] [PubMed] [Google Scholar]
  • 48.Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. 2012;9(7):671–5. 10.1038/nmeth.2089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Stein SC, Thiart J. TrackNTrace: A simple and extendable open-source framework for developing single-molecule localization and tracking algorithms. Sci Rep. 2016;6:37947. 10.1038/srep37947 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sridhara SC, Carvalho S, Grosso AR, Gallego-Paez LM, Carmo-Fonseca M, de Almeida SF. Transcription dynamics prevent RNA-mediated genomic instability through SRPK2-dependent DDX23 phosphorylation. Cell Rep. 2017;18(2):334–343. 10.1016/j.celrep.2016.12.050 [DOI] [PubMed] [Google Scholar]
  • 51.Goldman MJ, Craft B, Hastie M, Repecka K, McDade F, Kamath A, et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol. 2020. 10.1038/s41587-020-0546-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Lilljebjorn H, Henningsson R, Hyrenius-Wittsten A, Olsson L, Orsmark-Pietras C, von Palffy S, et al. Identification of ETV6-RUNX1-like and DUX4-rearranged subtypes in paediatric B-cell precursor acute lymphoblastic leukaemia. Nat Commun. 2016;7:11790. 10.1038/ncomms11790 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Peter McKinnon, Natalia Gromak

10 Nov 2020

Dear Dr Ahn,

Thank you very much for submitting your Research Article entitled 'Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription-replication conflict regions' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time. In particular, we would like the see more molecular evidence underlying the links between SAMHD1 function and R-loop biology to support the model presented in this paper. Furthermore, all reviewers have raised concerns regarding quality of the data and lack of appropriate controls. This includes lack of gene specific validations and correlation between different cell lines used in the paper, all of which need to be addressed in the revised version.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Natalia Gromak, PhD

Guest Editor

PLOS Genetics

Peter McKinnon

Section Editor: Cancer Genetics

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The manuscript “Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription-replication conflict regions” by Park et al. reports on R-loop increases in fibroblasts of human AGS patients with mutations of distinct genes. Then, bioinformatic studies provide some evidence that SAMHD1 deficiency may delay replication by increasing R-loops near replication origins located in the body of genes and activation of checkpoint signalling pathways. Finally, bioinformatic analyses of TCGA datasets correlate SAMHD1 expression with cancer progression.

The data are novel, correctly presented and discussed. However, some critical issues are present in the paper that need to be addressed:

- DRIPseq data: it would be interesting to present and discuss the overall distribution of R-loops in the genome of fibroblasts bearing different mutated AGS genes. Is the overall distribution changed with specific gene mutations ? In other words: the changes reported in Fig 1 are unique or other changes are present elsewhere ?

- AGS gene mutations might have an effect on gene transcription and activated ORI may be different among cell types. Thus, some validations of the DRIPseq results should be provided at single-gene levels. At selected genes, authors should check if their expression levels, ORI activity and R-loop levels are altered (or not), and discuss the results with respect to DRIPseq findings.

- Cellular data of Fig 2 are not fully convincing. Panel A reports a difference from 3.5% to 1.7%. Can that be an experimental fluctuation ? Is such a difference biological relevant ? Moreover, double thymine block is known to be very stressful for cells, inducing DNA damage response (also shown in panel E in fig 2). At least, a better discussion of these points should be included in the text.

- Panel G, Figure 2: are images mis-labelled ?

- Activation of DNA damage response should be demonstrated by checking phosphorylation of H2AX, and both ATR and ATM kinases. This is particular interesting as the authors show that R-loops are increased in the half of ORI where replication/transcription head-on collision may occur.

- Immunofluorescence microscopy images are very poor. Looking the images, I am not convincing that R-loops increase in the nucleus of SAMHD1-deficient cells. I know that immunofluorescence microscopy with S9.6 may be tricky, however, I would suggest not to increase the fluorescence intensity of images to avoid to decrease signal/background ration. In addition, RNaseH1 treatment samples should be shown along with controls in which the same treatment was performed but without RNaseH1 addition in the buffer.

- With respect to the previous point, how may an overall R-loop increase (as seen in immunofluorescence images) compare with a rather selective and modest increase at ORIs in transcribed genes ?

- I suggest to edit the text for English syntax and fluency.

Reviewer #2: In this manuscript, Park, Ryoo and colleagues report the observation that SAMHD1, a factor mutated in the Aicardi Goutières syndrome, prevents the accumulation of R-loops and protect human cells from head-on transcription-replication conflicts and associated genomic instability. This view is largely based on bioinformatic analyses of published datasets showing that SAMHD1 deficiency, but not related AGS mutations, is associated with increased R-loop levels at a subset of genes containing replication origins. In addition, they provide evidence that SAMHD1-depleted U2OS cells have a delayed S phase accumulate spontaneous DNA damage and activate DNA damage response pathways. Finally, they show that colon cancer patients that have a low expression of either SAMHD1 or RNase H2A have a poor survival. On the basis of these data, the authors propose that SAMHD1 contributes to the resolution of R-loops and prevents genomic instability in cancer cells. Although this model is attractive, the data presented in the manuscript are not sufficient to support it and mechanistic insights in how SAMHD1 could execute this important function is missing. Further work is therefore required to clearly establish that SAMHD1 plays a role in R-loop metabolism.

Major issues:

1. In Figure 1, the authors reproduced the analysis performed by the Cimprich lab (Hamperl, Cell 2017) to compare the distribution of R-loops in genes containing a replication origin in the gene body versus genes that do not contain replication origins. To this end, they used DRIP-seq data generated by the Chedin lab (Lim, ELife 2015) to position R-loops in primary fibroblasts from AGS patients and OK-seq data generated by the Hyrien lab to determine fork direction (Petryk, Nat Commun 2016). This analysis revealed a striking difference in the distribution of R-loops at origin-containing genes in SAMHD1-deficient cells, but not in other AGS mutants. Although this result is intriguing, it is not sufficient to conclude that SAMHD1 resolves R-loops at genes experiencing transcription-replication conflicts. Indeed, it is well established that the position of replication origins varies between cell lines. The Cimprich lab mapped R-loops in HeLa cells and analyzed their distribution relative to forks using OK-seq data also generated in HeLa cells. Here, the authors compared R-loops mapped by the Chedin lab in primary fibroblasts relative to forks mapped in HeLa cells, which is like comparing apples and pears. Moreover, SAMHD1-deficient cells have altered dNTP pools and slower replication forks, which may also impact on the replication program and explain the difference with other AGS mutants. In conclusion, although the difference observed between SAMHD1-deficient fibroblasts and other AGS mutants is clear, this analysis needs to be performed in the same cell lines, or at least confirmed for a subset of loci by other methods.

2. The authors claim that they report for the first time a role for SAMHD1 in S-phase progression in unchallenged cells but this statement is incorrect. Other groups have already reported that cell cycle progression is delayed and forks are slower in the absence of SAMHD1 (see for instance Franzolin, PNAS 2013; Kretschmer, Annals of the Rheumatic Diseases 2015; Coquel, Nature 2018). This statement should therefore be toned down. Along the same line, the authors repeatedly state that SAMHD1 is the only AGS-associated gene involved in cancer. RNase H2B has been reported as a colorectal tumor suppressor (Aden et al. https://doi.org/10.1053/j.gastro.2018.09.047). RNase H2A is involved in human gliomagenesis through the regulation of cell proliferation and apoptosis (Dai et al. https://doi.org/10.3892/or.2016.4802).

3. The data showing that SAMHD1-depleted cells have increased R-loop levels (Fig. 3) are not really convincing. The reliability of immunofluorescence to quantify R-loops is questioned in the field and the flow cytometry approach would require positive controls to position the gate. The slot blot is not quantified, lacks positive controls and the number of biological replicates is not indicated. DRIP-qPCR experiments on a set of genes should be performed.

4. From the data shown in Fig.1, one would predict that the overall amount of R-loops in SAMHD1-deficient cells should not dramatically change if the effect is restricted to genes containing active origins. This issue needs to be clarified textually.

5. In Fig. 4, the authors convincingly show that the overexpression of RNase H1 reduces the formation of spontaneous DSBs in SAMHD1-depleted cells but they would have made their case stronger by showing that these breaks occur during S phase. Moreover, they need to show that the slower S phase and increased gamma-H2AX levels observed in the absence of SAMHD1 (Fig. 2) is also rescued by RNase H1. The authors could also assess fork progression in SAMHD1-depleted cells and test whether fork progression defects can be suppressed by transcription inhibition or RNase H1 overexpression.

6. The association of cancer patient survival to SAMHD1 level of expression is interesting, but can it be generalized to other types of cancers? Do tumors expressing low levels of SAMHD1 and RNase H2A constitute different groups? It has been reported that SAMHD1 levels impact on the response to chemotherapy, notably when nucleoside analogues (ara-C) are used (Herold, Nat Med 2017). Is that relevant to colon adenocarcinoma treatment?

Minor issues:

1. Fig. S1C: the p-value for DRIP-seq enrichment at CD collision is labelled as P=0.0707, however in the text, the authors indicate P=0.707 (line 158). Which value is correct?

2. Fig. 2G: Control cells show a very high gamma-H2AX signal. The labeling is incorrect.

3. Fig. 4C: The blot for p-CHK2 should be improved.

Reviewer #3: In the manuscript entitled “Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription-replication conflict regions”, the authors show first in a bioinformatic analysis that R-loop levels are increased at particular gene body regions prone to Head-ON transcription-replication conflicts in fibroblasts mutated in the dNTPase and phosphorolytic 3ʹ- 5ʹ -exoribonuclease SAMHD1 compared to control cells (Figure 1)

Depletion of SAMHD1 in U-2OS cells leads to replication stress and DNA damage in S-Phase (Figure 2) which correlates with cellular R-loop accumulation (Figure 3). Removal of R-loops can rescue these effects (Figure 4) suggesting a more direct link between R-loop accumulation and the genomic instability phenotypes of SAMHD1 depleted cells. Finally, TCGA database analysis reveals a correlation between SAMHD1 expression level and patient survival rate (Figure 5).

Overall, I think the manuscript addresses an interesting question relevant to a broad audience. The data are generally of high quality and convincing. However, I do have a few major concerns that need to be addressed and solidified prior to publication in Plos Genetics.

Major Points:

1) The bioinformatic analysis was adapted from a previous study showing that genomic regions prone to HO conflicts have higher R-loop levels than CD (Hamperl et al., Cell 2017). This study used OK-Seq data in HeLa cells to determine replication fork directionality. As I could not find specific information in the material and method section, did the authors use the same data set (derived from HeLa) to determine the locations of origins for the analysis in the AGS mutant cell lines? As usage of replication origins is stochastic and may vary to a significant extent between different cell types and genetic backgrounds, it is unclear to me whether one can confidently conclude that these origins mapped in HeLa cells also exist in the fibroblast cell lines. It would be best if OK-Seq data are available in the exact same cell lines. At the minimum, the authors could analyze additional OK-Seq data sets from other human cell lines and then analyze if and to what extent these origins are conserved between multiple cell lines

2) Another point regarding this analysis in Figure 1. Have the authors looked into gene expression levels of these 500 genes whether they show significant levels of transcription in the different mutants versus control cells?

3) Based on the phenotypes observed in U-2OS cells (Figures 2-4), the authors have clear indications that SAMHD1 knockdown leads to DNA damage-inducing accumulation of RNA:DNA hybrids on a global/cellular level but there is no effect in the overall level of R-loops in SAMHD1 mutant cells compared to control cells (Figure S1A). Is this an effect of knocking down the protein (in U-2OS cells) versus expressing a mutant version of the enzyme in the fibroblast that can induce a dominant negative effect? I think it would be important if the authors could confirm their findings in U-2OS cells by introducing the exact same mutation of SAMHD1 in U-2OS cells in order to make the two different experimental systems/cell lines more comparable. A major selling point of the paper is the relevance of SAMHD1 towards many types of cancer but I believe this point is only proven if the same effects of R-loops at sites of transcription-replication conflicts can be shown with the cancer relevant mutated version of the enzyme.

4) The DNA damage accumulation in SAMHD1-depleted cells leading to Chk1, Chk2 and yH2AX activation shows nicely an S-phase specific effect of SAMHD1 (Figure 2). The alkaline comet assay shows all kinds of DNA damage including ssDNA and DSBs. Could the authors also perform a neutral comet assay to get a better understanding what type of damage is inflicted upon SAMHD1 loss?

5) Figure 3: Can the authors show that the accumulation of R-loops is specific to S-phase cells (maybe co-labelling with an EdU pulse)? Alternatively, co-stain with PI to investigate the DNA content of cells that have elevated HB-GFP signal (Figure 3D)?

6) A more conceptual point is that SAMHD1 was recently shown to associate with active replication forks, thereby promoting degradation of nascent DNA at stalled replication forks by stimulating the exonuclease activity of MRE11, thereby mitigating an autoimmune response (Coquel et al., Nature 2018). I guess one model could be that these sites where SAMHD1 acts on are replication forks stalled due to transcription-replication conflicts. Can the authors confirm this by using a more direct readout that shows elevated levels of conflicts in SAMHD1 knockdown conditions (e.g. proximity ligation assays with RNAPII pS2/PCNA or RNAPII pS2/SAMHD1)?

Minor Points:

1) The yH2AX panels in Fig. 2G seem to be flipped between control and SAMHD1 knockdown cells as based on the images, the shLuc cells would have much higher yH2AX signal.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: No: I have not been provided with an accession code to a genomic site for DRIPseq data presented in the paper.

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Decision Letter 1

Peter McKinnon, Natalia Gromak

24 Feb 2021

Dear Dr Ahn,

Thank you very much for submitting your Research Article entitled 'Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription–replication conflict regions' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important topic and the level of revision experiments. However, Reviewer 2 has identified some outstanding concerns that we ask you address in a revised manuscript.

We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by Reviewer 2.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Natalia Gromak, PhD

Guest Editor

PLOS Genetics

Peter McKinnon

Section Editor: Cancer Genetics

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #2: The authors have made a great effort revising this manuscript, with the addition of numerous new experiments. Overall, these additional data have greatly improved the manuscript, which is now potentially suitable for publication. I still have two concerns and one minor request.

My first concern is that although the authors have provided new evidence that R-loops levels are higher in SAMHD1-depleted cells, it is a bit disappointed that they analyzed only two genes (NEK7 and SEC24D) by DRIP-qPCR.

The second (major) concern relates to the experiment shown in Fig. S4H to support the view that the same replication origins are used in the absence of SAMHD1. The fact that DNA copy number does not change at loci of interest does not mean that the same origins are used. Changes in DNA copy number in a population of cells indicates at best the time of replication (early or late), not origin usage. Moreover, the replication of the reference locus used to normalize DNA copy numbers could also change in SAMHD1-deficient cells, which would affect the interpretation of the data. In short, this experiment is inconclusive and should be removed. Removal of these data would also weaken the argument that origin usage does not change in the absence of SAMHD1 and this limitation needs to be discussed in the manuscript.

Finally, the novel PLA experiments are very nice, but the intensity of PLA foci in Figs 4G and 3A is too low and should be increased.

Reviewer #3: The authors have addressed all my previous comments/concerns and have also added significant new experimental data including DRIP-qPCR validation of the bioinformatic analysis, S9.6 IF and FACS quantifications with appropriate positive and negative controls as well as PLA data between RNAPII-PCNA as requested by the reviewers. Together, the results substantiate the author’s conclusions that R-loops are highly enriched at transcription-replication conflict regions of the genome in fibroblast of patients bearing SAMHD1 mutation. Therefore, I’m positive and support timely publication of the manuscript in Plos Genetics.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: Yes: Stephan Hamperl

Decision Letter 2

Peter McKinnon, Natalia Gromak

29 Mar 2021

Dear Dr Ahn,

We are pleased to inform you that your manuscript entitled "Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription–replication conflict regions" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Natalia Gromak, PhD

Guest Editor

PLOS Genetics

Peter McKinnon

Section Editor: Cancer Genetics

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01541R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

Peter McKinnon, Natalia Gromak

8 Apr 2021

PGENETICS-D-20-01541R2

Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription–replication conflict regions

Dear Dr Ahn,

We are pleased to inform you that your manuscript entitled "Aicardi-Goutières syndrome-associated gene SAMHD1 preserves genome integrity by preventing R-loop formation at transcription–replication conflict regions" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Katalin Szabo

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. R-loop accumulation at the origins of gene bodies in SAMHD1-deficient AGS patient fibroblasts.

    A, Average mapped DRIP-seq read accumulation around random genomic position (n = 50,000) in fibroblasts from control (yellow) and AGS patients with the indicated gene mutations (blue, SAMHD1; pink, TREX1; green, RNASEH2B; purple, RNASEH2A). For individual graph, the line indicates the mean DRIP-seq read-count of each genomic region of 24 kb windows centered on random genomic position. B, Volcano plot of RNA-seq transcriptome data displaying the pattern of gene expression values for fibroblasts from control and AGS patients (AGS5, AGS1, AGS2, and AGS4). Significantly differentially expressed genes (DEGs) (|log2(fold change)| >2 and adjusted P value <0.01) are highlighted in red (upregulated) or blue (downregulated). None of the DEGs are highlighted in black. Selected genes used for DRIP-seq analysis (n = 467) are highlighted in dark green. The number of DEGs among selected genes used for DRIP-seq analysis for each AGS patient are indicated. C, Box graph indicating quantified average DRIP-seq read-count across the 6 kb window upstream from replication origins of the gene bodies (n = 727), where HO collision sub-regions exist. D, Box graph indicating quantified average DRIP-seq read-count across a 6-kb window downstream from replication origins of the gene bodies (n = 727), where CD collision sub-regions exist. In C and D, the orange line at the center of the boxes indicates the median, and the boxes indicate 1st and 3rd quartiles. Statistical significance was assessed using Fisher’s exact test followed by Bonferroni correction.

    (TIF)

    S2 Fig. Depletion of SAMHD1 does not affect cellular apoptosis or cell cycle arrest.

    A, SAMHD1 in U2OS cells transfected with vectors expressing shRNA targeting luciferase control (shLuc) and two different SAMHD1 UTRs (shSAMHD1-#1 and shSAMHD1-#2). B, In vitro cell proliferation assay of SAMHD1-depleted or control U2OS cells. Data represent the mean ± SD, n = 3. The P value was calculated according to two-way ANOVA. C, Representative flow plot of Annexin V-FITC assay of SAMHD1-depleted and control U2OS cells. Apoptosis was determined by staining cells with FITC-conjugated Annexin V and PI, followed by flow cytometry analysis. Four populations are indicated as Q1, necrotic; Q2, late apoptosis; Q3, live; and Q4, early apoptotic. D, Quantification of Annexin V-FITC apoptosis assay results showing the percentage of cell death modes: live cells, early apoptosis, and late apoptosis/necrosis in SAMHD1-depleted and control U2OS cells. Statistical significance was assessed by two-way ANOVA. Data represent the mean ± SEM (n = 3, P < 0.001). E, Representative cell cycle profile analysis by flow cytometry following labeling of cells with BrdU. SAMHD1-depleted and control U2OS cells were pulsed with BrdU. The BrdU-positive population was followed over time as it transitioned through the cell cycle.

    (TIF)

    S3 Fig. Depletion of SAMHD1 leads to increased DNA damage.

    A, Summary of experimental design for Fig 2C, representing experimental process of double thymidine block followed by time course sample harvest. B, Representative images of comet assays performed under alkaline electrophoresis conditions in SAMHD1-depleted and control U2OS cells, in the absence or presence of ectopic SAMHD1-HA expression. C, Quantitative analysis of comet tail moment lengths for each condition described in B. Statistical significance was assessed using two-tailed Student’s t-test (n >50). D, Summary of experimental design for Fig 2F, representing experimental process to obtain heterogeneous (Asyn) or S phase-synchronized cellular population by double thymidine block. E, Summary of experimental design for F, representing experimental process of SAMHD1-HA reconstitution followed by double thymidine block to obtain S phase-synchronized cellular population. F, Immunoblots of CHK1 phosphorylated on S345 (p-CHK1), CHK2 phosphorylated on T68 (p-CHK2), CHk1 and CHk2 in S phase-synchronized SAMHD1-depleted and control U2OS cells, in absence or presence of ectopic SAMHD1-HA expression in a does dependent manner (n = 3). G, Representative images of comet assays performed under neutral conditions in SAMHD1-depleted and control U2OS cells. H, Quantitative analysis of comet tail moment lengths for each condition described in G. Statistical significance was assessed using two-tailed Student’s t-test (n >50).

    (TIF)

    S4 Fig. Accumulation of R-loops in SAMHD1-depleted cells.

    A, Indicated gene expression as measured by RT-qPCR in control (siControl) or indicated siRNAs treated U2OS cells. Data represent mean ± SEM, n = 3, P values were calculated according to two-tailed Student’s t-test. B, Schematic representation of HB-GFP retention assay. C, Quantification analysis of HB-GFP retention assay co-stained with PI. Data represent mean ± SEM percentages of GFP positive cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). P > 0.05; n.s, not significant. D, Quantification analysis of HB-GFP retention assay co-stained with PI. Data represent mean ± SEM percentages of S phase cell population. Statistical significance was assessed using two-tailed Student’s t-test (n = 3). P > 0.05; n.s, not significant. E, Immunoblots of SAMHD1 in cells described in Fig 3G, showing ectopic expression of GFP-tagged SAMHD1 in SAMHD1-depleted and control U2OS cells. F, Dot blot analysis of R-loops in genomic DNA extracted from SAMHD1-depleted and control U2OS cells (n = 3). The total cellular genomic DNA from each cell line was incubated with or without RNase H in prior to loading for blotting, as indicated. The genomic DNA loaded membrane were processed to immunoblotting with anti-S9.6 and anti-ssDNA antibodies to verify equivalent DNA loading. Fold-induction was normalized to the control U2OS cells by quantifying the dot intensity of the blots and calculating the ratios of S9.6 signal to total amount of genomic DNA.G and H, Indicated gene expression as measured by RT-qPCR in control or SAMHD1-depleted U2OS cells. Data represent mean ± SEM, n = 3, P values were calculated according to two-tailed Student’s t-test. P > 0.05; n.s, not significant. I, DRIP-qPCR using the anti S9.6 antibody at GBE1 HO, GBE1 CD, MAP4K4 HO, MAP4K4 CD, CALD1 HO, CALD1 CD, PALM2-AKAP2 HO and PALM2-AKAP2 CD are shown in SAMHD1-depleted and control U2OS cells. Pre-immunoprecipitated samples were untreated (-) or treated (+) with RNase H, as indicated. Values are relative to those of control U2OS cells. Data represent mean ± SEM the statistical significance was assessed using the one-way ANOVA (n = 2).

    (TIF)

    S5 Fig. Colorectal cancer-relevant SAMHD1 mutants are associated with TRCs in U2OS cells.

    A, Immunoblots of SAMHD1 in U2OS cells transfected with vector expressing indicated SAMHD1 mutants tagged with HA. B, Genomic DNA extracted from U2OS cells described in A were analyzed by quantitative PCR. The relative DNA copy number for each vector was determined by Puromycin resistance sequence and normalized to MDM2. Data represent mean ± SEM, n = 2, P values were calculated according to two-tailed Student’s t-test. P > 0.05; n.s, not significant. C, Representative images of proximity ligation assay (PLA) between PCNA and RNA polymerase II (RNAPII) in S phase synchronized SAMHD1-depleted and control U2OS cells, in the absence or presence of indicated SAMHD1 mutant ectopic expression (SAMHD1-HA) (n = 2). Cells were subjected to PLA (red), and co-stained with HA (green) and DAPI (blue). D, Quantification analysis of number of PLA foci per nucleus in C. The mean value for each data is shown as a red line. Statistical significance was assessed using the two-tailed Mann-Whitney U test (n > 70).

    (TIF)

    S6 Fig. Model of the role of SAMHD1 at transcription-replication conflict-derived R-loops.

    In normal cells (left), R-loops formed by transcription-replication collision, are resolved by SAMHD1, which allows successful DNA replication and RNA transcription. In SAMHD1-deficient cells (right), unresolved R-loops at transcription-replication conflict regions showed genomic instability.

    (TIF)

    S1 Table. Oligonucleotides used in this study.

    (DOCX)

    S1 Data. Source Data: Spreadsheet of source data shown in this study.

    (XLSX)

    Attachment

    Submitted filename: Park_Response to reviewers.docx

    Attachment

    Submitted filename: Park_revised_Response to reviewers.docx

    Data Availability Statement

    The bioinformatics data and figures reported in this study are available under GitHub (https://github.com/spiritmage72/samhd1_rloop). The modified custom python scripts used to determine DRIP-seq signal across HO and CD regions were made available under GitHub (https://github.com/spiritmage72/samhd1_rloop). All relevant underlying numerical data for all graphs and summary statistics are either in Spreadsheets of Supporting information or included in relevant data.


    Articles from PLoS Genetics are provided here courtesy of PLOS

    RESOURCES