Skip to main content
PLOS One logoLink to PLOS One
. 2021 Apr 27;16(4):e0250291. doi: 10.1371/journal.pone.0250291

Protection of the C. elegans germ cell genome depends on diverse DNA repair pathways during normal proliferation

Bettina Meier 1,#, Nadezda V Volkova 2,#, Ye Hong 1,¤a, Simone Bertolini 1, Víctor González-Huici 1,¤b, Tsvetana Petrova 1,¤c, Simon Boulton 3, Peter J Campbell 4,5,6, Moritz Gerstung 2,7,*, Anton Gartner 1,8,9,*
Editor: Arthur J Lustig10
PMCID: PMC8078821  PMID: 33905417

Abstract

Maintaining genome integrity is particularly important in germ cells to ensure faithful transmission of genetic information across generations. Here we systematically describe germ cell mutagenesis in wild-type and 61 DNA repair mutants cultivated over multiple generations. ~44% of the DNA repair mutants analysed showed a >2-fold increased mutagenesis with a broad spectrum of mutational outcomes. Nucleotide excision repair deficiency led to higher base substitution rates, whereas polh-1(Polη) and rev-3(Polζ) translesion synthesis polymerase mutants resulted in 50–400 bp deletions. Signatures associated with defective homologous recombination fall into two classes: 1) brc-1/BRCA1 and rad-51/RAD51 paralog mutants showed increased mutations across all mutation classes, 2) mus-81/MUS81 and slx-1/SLX1 nuclease, and him-6/BLM, helq-1/HELQ or rtel-1/RTEL1 helicase mutants primarily accumulated structural variants. Repetitive and G-quadruplex sequence-containing loci were more frequently mutated in specific DNA repair backgrounds. Tandem duplications embedded in inverted repeats were observed in helq-1 helicase mutants, and a unique pattern of ‘translocations’ involving homeologous sequences occurred in rip-1 recombination mutants. atm-1/ATM checkpoint mutants harboured structural variants specifically enriched in subtelomeric regions. Interestingly, locally clustered mutagenesis was only observed for combined brc-1 and cep-1/p53 deficiency. Our study provides a global view of how different DNA repair pathways contribute to prevent germ cell mutagenesis.

Introduction

Germ cells are required to pass genetic information from one generation to the next, rendering the maintenance of their genetic integrity particularly important. While germ cell mutations are the basis of evolution, mutational events tend to be detrimental and are associated with both reduced fitness and inherited disease.

Endogenous mutagenesis can be caused by nucleotide mis-incorporation during replication and by reactive cellular metabolites. Hydrolytic reactions trigger abundant depurinations, depyrimidinations, and the deamination of cytosine and 5-methylcytosine (for review [1]). Reactive oxygen species, byproducts of oxidative phosphorylation and oxygen-dependent enzymatic processes, induce 10,000–100,000 DNA lesions per cell per day, including base modifications such as 8-oxo-dG, thymine glycol and DNA single-strand breaks [2]. In addition, enzymatic and non-enzymatic mechanisms lead to base methylations. For instance, 3-methyl-adenine and 3-methyl-cytosine can lead to mutation by blocking replication, and O6-methyl-guanine leads to G>A changes (for review [1]). Metabolic byproducts such as reactive aldehydes form DNA adducts that can crosslink bases from complementary DNA strands generating obstacles to replication and transcription.

DNA double-strand breaks (DSBs) are among the most toxic DNA lesions and arise when the replication fork is stalled by base modifications, repetitive DNA, DNA sequences prone to form tertiary structures, or collision with the transcription machinery [3]. Nevertheless, some cellular events require DSBs to be induced naturally, for example during immunoglobulin gene rearrangement to ensure immunoglobulin diversification. Additionally, during germ cell meiosis multiple DSBs are introduced in each chromosome, resulting in at least one crossover recombination event between homologs, thereby facilitating the exchange of genetic information and orderly chromosome segregation (for review [4]). Recombination requires a free DNA end to search for and invade a homologous DNA strand, which acts as a template to facilitate the restoration of genetic information. When DSBs occur in repetitive DNA such as tandem repeats or interspersed repeat elements like Line and Alu sequences, ‘homology search’ provides a formidable challenge.

Nevertheless, only a vanishingly small fraction of primary lesions leads to mutations, highlighting how effective DNA damage response mechanisms are in detecting and mending multifarious forms of DNA damage. The analysis of the observed mutations has the potential to shed light on the primary mutagenic lesion. When the amount of DNA damage introduced by a mutagenic process exceeds the capacity of DNA repair, distinct patterns of mutations, referred to as mutational signatures or spectra, arise. Here, we characterize genome-wide mutational spectra by analysing the number and distribution of single and multi-nucleotide variants (SNVs and MNVs), small insertions and deletions (indels) and structural variants (SVs) composed of larger (over 400 bp) deletions, inversions, duplications, and chromosomal translocations.

The rates and types of germline mutations were previously studied in humans and model organisms such as mouse, fruit fly, C. elegans, and primates [58]. These studies discovered a relatively uniform mutation pattern, sometimes referred to as ‘signature 5’ [6, 9], which has also been observed in different human somatic tissues. The underlying aetiology of this signature–and whether it is the product of a single, or multiple mutational processes–remains unclear. We recently reported a high-level analysis comprising 2700 C. elegans genomes treated with 11 genotoxic agents. ~40% of analysed DNA repair mutants exhibited altered mutations, thus highlighting a prominent role of DNA repair pathways in shaping mutation rates and signatures [10].

Here, we focus our analysis on mutations accumulating when C. elegans is propagated over generations in the absence of exposure to exogenous mutagens to enhance our understanding of how mutagenesis is prevented in the germ cell lineage, largely using the same primary set of data. This study encompasses 528 genomes derived from wild-type and 54 single and 7 double DNA repair mutants. DNA repair mutants were chosen to encompass the majority of known DNA repair and damage response pathways. We provide a comparative, detailed and systematic genome-wide analysis of the contribution of these DNA repair pathways towards maintaining germ cell genome integrity.

Results

Mutation rates in C. elegans wild-type and DNA repair mutants

C. elegans offers a suitable system to study mutation accumulation (MA) over generations based on its short life-cycle of three to four days and its ability to self-fertilize. Self-fertilization enables the clonal propagation of lines from single animals randomly picked in each generation. In our mutation accumulation experiments (MA), we propagated (typically for 20 or 40 generations) 5–10 clonal lines for each genotype and randomly selected a minimum of 3 lines for sequencing (Fig 1A) (S1 Table for list and description of DNA repair mutant lines analysed). Out of 528 whole genome sequencing (WGS) datasets we analyse as part of this study (S1 Table), 472 were previously deposited (S1 Table) [10], and 56 (corresponding to 5 newly generated double and 6 single mutant strains) were newly deposited (S1 Table). DNA repair mutants were chosen to encompass the majority of known DNA repair and damage response pathways. Genomic DNA for sequencing was isolated from starved nematode populations, each a clonal expansion from a single L4 stage hermaphrodite from the first or last propagated generation (Materials and Methods, [1012]). Crucially, these lines pass through a single-cell bottleneck provided by the zygote, enabling us to analyse how mutations arise in the germline (Fig 1A).

Fig 1. Experimental outline and background mutagenesis in wild-type.

Fig 1

A. L4 larvae (F1 generation) from a parental founder strain (P0) were individually picked onto NGM plates and allowed to self-fertilize prior to picking individual L4 larvae of the next generation (F2) from each F1 plate. This process was repeated until clonal lines reached generation F20 or F40. Clonal lines were then allowed to expand, harvested, and prepared for whole genome sequencing (Materials and Methods). B. Mutation types and their location on the 6 C. elegans chromosomes (I-V and X) across all wild-type samples and mutation classes. The height of the white bars corresponds to the length of the respective C. elegans chromosome. Single nucleotide variants are indicated by a dot, dinucleotide variants (DNVs) by a square, indels divided in deletions (D) and insertions (I) by a triangle, and structural variants (SVs) by a line. C. Average number of heterozygous mutations in the N2 wild-type genome per generation across all mutation classes and types. Single nucleotide variants are shown in the context of their 5’ and 3’ base. Grey bars denote 95% credible intervals for the number of mutations in each type. “Complex indels” class denotes deletions with insertions. Data for N2 was previously shown in ([10] Fig 1C) . Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

Calculating mutation rates from more than 30 wild-type MA lines including 5 lines grown for 40 generations, comparing mutations from the first and last generation (Fig 1A), we refined our previous mutation rate estimations [1012] to ~0.9 mutations in the diploid C. elegans genome per generation (Materials and Methods). This corresponds to ~2.9 x 10−10 (95% CI: 2.5 x 10−10–3.2 x 10−10) mutations per base pair and germ cell division. Mutations were equally distributed across the wild-type genome with no evidence of clustering (Fig 1B). The most frequent mutations were a) single base insertions, with prevalent T>A changes in the context of a 5’A and a 3’ T, and b) deletions in homopolymeric sequences (Fig 1C), indicative of replication slippage as a source of mutations in wild-type. Together with our previous data on MMR deficient strains [11] and the estimates provided below, this suggests that replication polymerase slippage in homopolymeric sequences is the most frequent type of genetic error.

Across 61 C. elegans DNA repair deficient mutants ([1012], S1 Table), the median mutation rate was close to that observed in wild-type: 0.82 heterozygous base substitutions per generation compared to 0.57 (standard deviation SD = 0.04) in wild-type, 0.25 indels (0.26 (SD = 0.03) in wild-type) and 0.03 SVs (0.02 (SD = 0.01) in wild-type). However, mutation rates varied by mutation type, making the comparison of overall mutation rates misleading. We therefore stratified mutations into 1) single and multi-nucleotide variants, 2) indels smaller than 400 base pairs and 3) SVs (Fig 2, S1 Fig). Interestingly, 69% (42 out of 61) of DNA repair deficient strains displayed mutation rates significantly different from wild-type in at least one mutation class (false discovery rate FDR = 5%), with an over 2-fold change in 44% (27 out of 61) of mutants.

Fig 2. Mutation rates across 60 C. elegans genotypes stratified by mutation type: Base substitutions, indels, and structural variants.

Fig 2

Mutation rates are shown as number of heterozygous mutations per generation for N2 wild-type (WT), and mutants used in this study grouped by the major DNA repair pathway they contribute to; direct damage reversal (DR), base excision repair (BER), nucleotide excision repair (NER), DNA double-strand break repair (DSBR), translesion synthesis (TLS), crosslink repair (ICLR), spindle assembly checkpoint (SAC), apoptosis, and mismatch repair (MMR). Base substitutions are shown in red (top), indels in green (center) and structural variants in blue (bottom). Dotted lines denote the mutation rates for wild-type. Error bars show the 95% confidence intervals; large dots represent variants with 2-fold increased or decreased mutation rates over N2 wild-type which are statistically significant with a false discovery rate (FDR) below 5%. All CIs which extend below the lower edge of the plot have zero as their lower border. Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

Besides the previously reported high mutation rate in DNA mismatch repair (MMR) mutants (pms-2 and mlh-1), with 25–30 times more base substitutions and a ~100-fold increase in indels per generation [11, 13, 14], a ~2 fold increase of single SNV was observed in several mutants defective for NER, HR, direct damage reversal (DR), and helicases (Fig 2, red dots). Moreover, ~3–5 fold more SNVs occurred in smc-5 and smc-6 HR mutants, as well as in the cep-1; brc-1 brd-1 triple mutant defective for the C. elegans orthologs of BRCA1, its binding partner BARD1, and p53 (Fig 2, red dots). The DNA interstrand crosslink repair mutant dog-1/FANCJ accumulated ~6 times more indels compared to wild-type, and several HR and TLS deficient strains showed ~2 fold increased indel rates (Fig 2, green dots). Finally, the number of structural variants (SVs) tended to be elevated in lines compromised for HR and in various DNA helicase mutants (dog-1, helq-1, him-6/BLM, rtel-1) (Fig 2, blue dots).

Several DNA repair mutants, namely fcd-2/FANCD2 and fnci-1/FANCI DNA ICL-repair defective lines, and the microhomology mediated end-joining (MMEJ) defective polq-1/POLQ mutant exhibited reduced indel rates (Fig 2, green dots) Additionally, polq-1 mutants harboured reduced SNVs (Fig 2, red dot). POLQ-1 dependent MMEJ is an error prone pathway, in which resected 3’ single-stranded overhangs pair at their complementary terminal nucleotide(s) to prime DNA synthesis, often leading to small deletions [1518]. However, given that indel and SV mutation rates in our dataset are already low in wild-type and more wild-type than mutant samples were included in the analysis (wild-type n = 30, mutant n = 4–8) the sample variance in genotypes with mutation rates close or lower to wild-type may be underestimated. We therefore caution that the observed reductions in mutagenesis levels are likely to be false discoveries.

Direct damage reversal (DR), base excision repair (BER), nucleotide excision repair (NER), and single-strand break (SSB) repair

We next wished to systematically characterise the signatures and features of mutations accumulated over generations by DNA repair pathways. Mutants deficient in DR, BER, and NER did not show large changes in the overall mutation spectra, but several small differences in particular mutation types (Fig 3A, S2A Fig).

Fig 3. Mutation analysis of C. elegans BER, NER, and DR mutants with mutation rates or spectra different from wild-type.

Fig 3

A. Mutational signatures of BER, NER, and DR mutants that display statistically significantly different mutation spectra than wild-type shown as the number of mutations per generation across all mutation classes. Underscores (bold coloured bars) below each mutation profile indicate mutation types where the total mutation numbers are different from wild-type, three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to those in wild-type (FDR < 5%). Single nucleotide variants are shown in the context of their 5’ and 3’ base context. B. Number of mutations of all classes shown for each individual sequenced line of the indicated genotype and generation. The four sequenced wild-type P0 lines reflect the variance present in initial generations. Mutations are shown cumulatively with mutations present in generation F20 included in F40. C. Mutation types of all classes and their location on the 6 C. elegans chromosomes (I-V and X) observed across agt-2 mutant lines. The height of the white bars corresponds to the length of each individual chromosome. Single nucleotide variants (SNVs) are indicated by a dot, dinucleotide variants (DNVs) by a square, indels divided in deletions (D), insertions (I), and deletions with insertions (DI) by a triangle, and structural variants (SVs) by a line. Clustered mutations that are present within a single agt-2 line are depicted by enlarged bold symbols. An analysis of brca-1, him-6 and smc-6 swas previously shown in ([10] Fig 1C). Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

In addition to AGT-1, which facilitates direct damage reversal by removing methyl moieties from O6-methyl guanine, AGT-2 encodes for a further predicted C. elegans O6-alkylguanine DNA alkyltransferase [19]. We found a 2-fold increased mutation rate in agt-2 deficient lines (Fig 2), owing to an elevated frequency of C>T changes caused by the mispairing of O6-methyl guanine with T (Fig 3A and 3B). Interestingly, agt-2 mutants exhibited a moderate degree of mutation clustering, evidenced by 7 cases of 2–3 mutations located in closer proximity to each other than expected by chance, scattered across 10 agt-2 mutant lines (Fig 3C, S2B Fig). We also confirmed increased numbers of C>T changes in mutants defective for ung-1, an Uracil-DNA glycosylase that excises uracil during BER [20] (Fig 3A and 3B). Uracil is introduced via UTP mis-incorporation or cytosine deamination and pairs with adenine, which leads to C>T mutations. Other BER mutants, including mutants deficient in PARP-1 and PARP-2, the two C. elegans poly-ADP ribose polymerases needed for SSB repair, did not show altered mutation rates compared to wild-type (Fig 2, S2A Fig).

The NER pathway is involved in the repair of bulky DNA adducts and DNA crosslinks, both of which cause a structural distortion of the DNA double helix [21]. xpa-1, xpf-1, and xpg-1 lines compromised for all NER and xpc-1 lines solely defective for global genome NER (but not csb-1 lines uniquely defective for transcription coupled NER) showed increased mutation rates without overt changes in mutational signatures (Figs 2, 3A and 3B, S3A Fig). We speculate that this increased mutagenesis might be caused by cyclopurines induced by reactive oxygen species, and/or from exposure to ambient and fluorescent light. Comparison between the C. elegans NER signature adjusted for the human nucleotide composition (see [11]) and COSMIC signatures SBS8 (associated with NER using human organoids) and SBS5 (associated with NER defects in urothelial cancers) showed no conformity (cosine similarity scores 0.64 and 0.56, respectively) (S3B Fig). DSB repair by nonhomologous DNA end-joining (NHEJ), microhomology mediated end-joining (MMEJ) and homologous recombination (HR).

DSB repair is facilitated by several redundant pathways. HR is considered largely error-free, restoring genetic information using an intact homologous DNA strand as a repair template. End-joining pathways, classical NHEJ (c-NHEJ) and MMEJ, are typically error-prone and join free DNA ends. c-NHEJ largely acts on blunt DNA ends, while MMEJ requires short DNA resection to generate complementary 2–20 base single-stranded DNA termini to join broken DNA ends [16].

Inactivation of the core components of NHEJ, cku-80 and lig-4, did not produce changes in mutagenesis (Fig 2, S4A Fig). To study the effect of defective HR, we investigated mutation accumulation in brc-1 brd-1 double mutants. BRC-1/Brca1 and BRD-1/Bard1 proteins form a heterodimer and the corresponding deficiencies are considered epistatic [22]. We report on the double mutant as our genome sequencing analysis revealed that the brc-1 mutant strain we used also contained a brd-1 deletion (S1 Table). In contrast to end-joining mutants, the brc-1 brd-1 double mutant showed increased numbers of single nucleotide variants (Fig 2), small deletions between 5 and 50 bases (Fig 4A), and tandem duplications (TDs) between 1.6 and 500 kbps, with a median of ~12 kbps (Fig 4B and 4C, S5 Fig). Overall, the mutational signature of C. elegans BRC-1 BRD-1 deficiency agrees with the flat profile of increased base substitutions described in HR deficient human cancers [23, 24], BRCA1 negative human lymphoblastic MA lines [25] and also resembles the pattern of SVs associated with BRCA1 loss in breast and ovarian cancers [2628]. HR has been shown to be the predominant DNA repair pathway in C. elegans germ cells while NHEJ has a role in somatic cells [29, 30]. Our finding that mutation rates are not elevated in NHEJ mutants is consistent with these observations.

Fig 4. Mutational signatures and genomic features of mutations in DSBR deficient C. elegans.

Fig 4

A. Mutational signatures of DSBR mutants that exhibited statistically significant different mutation rates to wild-type displayed in numbers of mutations per generation. Bold coloured bars denote individual mutation classes where the number of mutations is different from wild-type, an underscore below each mutation profile indicates mutation types with total mutation numbers different from wild-type, and three stars indicate genotypes which have rates of substitutions, indels or SVs significantly different compared to wild-type (FDR < 5%). B. Estimated composition of structural variants per generation as estimated for wild-type and DNA repair mutants with elevated SV rates. C. Size distributions of tandem duplications (top, pink) and deletions (bottom, green) across wild-type and mutants with elevated SV rates. D. Clustering of mutations in DNA repair deficient mutants. Grey dots reflect the average proportions of clustered mutations. Error bars denote 95% confidence intervals. Mutants with a significantly different propensity for mutation clustering from wild-type (dotted black line) are shown and highlighted in red. ‘Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

Homologous recombination repair requires substantial end processing at the site of DSBs, which is performed by a series of nucleases creating single-stranded DNA overhangs. As mutants of C. elegans nucleases rad-51, mre-11, and com-1 are sterile due to defects in meiotic recombination thus precluding MA experiments, we analysed mutation rates in strains deficient for the rad-51 paralog rfs-1 and for rip-1, which encodes a RFS-1 interacting protein [31]. The RFS-1 RIP-1 complex is thought to stimulate the remodelling of presynaptic RAD-51-coated DNA filaments to facilitate strand invasion for recombinational repair [31]. We observed an overall 2-fold elevated mutagenesis in rfs-1 and rip-1 mutants (Fig 2), defined by increased numbers of base substitutions in both strains, increased numbers of small deletions in rfs-1, and increased numbers of SVs in rip-1 (Fig 4A and 4B, S6 Fig). Intriguingly, we observed three ‘translocation type’ events in rip-1 but not in any other MA line we analysed (Fig 4B, S6 and S7 Figs). We deduced that these events involved templated insertions of 200–4000 bp sequences, which showed strong homology to multiple genomic regions, including to one homeologous region located as far as 275 kb away from the donor sequence on the same chromosome, accompanied by a deletion of several hundred basepairs at the acceptor site (S7 Fig). These templated insertions may be explained by strand invasion into homeologous template DNA, in line with the pro-recombinogenic role of RIP-1 in mediating RAD-51 dissociation from invading strands [31].

SMC-5 and SMC-6 are components of a ring shaped cohesin complex, considered to tether broken DNA strands to the repair template on the sister chromatid to facilitate HR [32]. smc-5 and smc-6 mutants, which have been shown to exhibit defects in meiotic recombination between sister chromatids [32, 33] showed an increased rate of base substitutions and SVs, largely comprised of deletions, tandem duplications, and complex rearrangements (Figs 2, 4A and 4B, S6 Fig). In agreement with a high preponderance of SVs, these lines could only be propagated for up to 5 generations before succumbing to sterility. In contrast to brc-1 brd-1 mutants, smc-5 and smc-6 mutants exhibited a high proportion of large ~10 kb deletions (Fig 4C). Given the role of the ring-shaped SMC-5-6 complex in enforcing close proximity of damaged and template DNA, we hypothesize that double-strand break induced HR is initiated in the absence of the SMC-5-6 complex, but the displacement loop (D-loop) formed following strand invasion falls apart prematurely resulting in the loss of genetic material.

The structure-specific nucleases MUS81 and SLX1 act in conjunction to process Holliday Junctions, key four-way DNA intermediates of HR [3436]. slx-1 and mus-81 mutants displayed similar mutational signatures characterized by increased numbers of base substitutions and SVs, with large deletions and TDs being most prevalent (Figs 2, 4A and 4B, S8 Fig). In contrast, the absence of GEN-1, a canonical Holliday Junction resolvase [37], or LEM-3, the ortholog of mammalian Ankle1, recently implicated in the processing of recombination intermediates that persist beyond anaphase [38], did not yield overt changes in mutation rates or signatures (Fig 2, S4A Fig).

DNA helicases are enzymes that unwind double-stranded DNA. Among their multiple roles in HR, they contribute to the unwinding of D-loop structures, a function especially important when a broken DNA end invades a template strand with imperfect sequence homology, thus preventing recombination with homeologous sequences. To investigate mutation patterns induced by helicase deficiencies, we analysed mutants defective for the three C. elegans RecQ helicases: him-6—the ortholog of the mammalian Bloom syndrome gene which encodes for a helicase involved in HJ resolution and prevention of crossover recombination [39, 40]; wrn-1, the ortholog of Werner’s syndrome gene which encodes for a helicase possessing an N-terminal 3’-5’ exonuclease domain, and capable of resolving aberrant DNA structures with 3’ recessed ends [41, 42]; and rcq-5, the ortholog of human RECQ5, which displaces RAD-51 from single-stranded DNA and thus prevents excessive recombination [43]. In addition, we analysed lines deficient for rtel-1 which encodes a conserved helicase involved in genome stability and telomere maintenance [44]. While rcq-5 and wrn-1 mutants did not show increased mutagenesis, him-6 mutants demonstrated 8-fold elevated SV rates compared to wild-type, with 0.15 SVs per generation (SD = 0.03) (Figs 2, 4A and 4B, S4A and S9 Figs). Even more SVs were observed in rtel-1 mutants with an estimated rate of 0.8 TDs per generation (SD = 0.2) (Figs 2, 4A and 4B), which spanned on average 8 kbps and were generally smaller than TDs observed in brc-1 brd-1 mutants (Fig 4C). In addition, rtel-1 deficiency led to ~2.5 fold increase in base substitutions (Figs 2 and 4A). RTEL-1 has a unique role in preventing heterologous recombination during break-induced repair and in promoting non-crossover products [45, 46]. Interestingly, loss of mammalian RTEL1 yields a high number of large deletions and complex rearrangements as a result of excessive crossover and heterologous recombination [45], unlike our data which showed a more simple, tandem duplication signature (Fig 4, S8 Fig). In our experiments, C. elegans rtel-1 mutants did not grow beyond F15, and most lines became sterile within 5 generations (F5) (S1 Table), suggesting that the absence of RTEL-1 may lead to accumulation of SVs incompatible with organismal viability.

Investigating the genomic context of structural variants, we did not observe any overt changes in the presence of microhomology at the breakpoints compared to wild-type (S4B and S4C Fig). In addition, we confirmed that SVs across almost all HR deficient genetic backgrounds tended to be associated with repetitive DNA regions (S4D Fig), in line with previous reports [47].

Among the HR mutants, we note that brc-1 brd-1, rfs-1, rip-1, smc-5, and smc-6 display elevated levels of base substitutions. Increased base substitutions were also observed in BRCA1 defective human lymphoblastic MA lines [25]. Moreover, smg-1, rip-1, and rfs-1 exhibit evidence of mutational clustering (Fig 4D), with about 15% of base substitutions occurring within distances smaller than 1 kbps (Materials and Methods). Clusters of mutations may arise through error-prone polymerases reading across lesions [4850]. In addition, the NHEJ or MMEJ error-prone DSB repair pathways can also generate clustered mutations when DNA strands with incompatible ends are joined together [51]. The absence of clustered mutations in NHEJ or MMEJ mutants could be explained by the action of redundant error-free HR pathways.

Translesion synthesis (TLS)

TLS polymerases are specialised DNA polymerases that replicate across and insert nucleotides opposite damaged bases. Depending on the inserted nucleotide, this results in error-free or error-prone lesion bypass [52]. C. elegans rev-3/REV3L mutants, deficient in the catalytic subunit of polymerase ζ, accumulated increased numbers of 50–400 bp deletions (Figs 2 and 5A, S10 Fig). Similarly, polh-1/POLH (lf31) and polh-1/POLH (ok3317), DNA polymerase η mutants, displayed 50–400 bp deletions, with only polh-1(lf31) reaching clear statistical significance over the generations tested (Fig 2, S10 Fig). Our data suggest that REV-3, and likely POLH-1, prevent DNA breaks by reading across damaged bases that also occur in the absence of exogenous DNA damage. Our results on REV-3 and POLH-1 are in line with previous findings in C. elegans [18, 5355], yeast, and mammalian cells [56, 57].

Fig 5. Signatures and genomic features of mutations in TLS and ICLR deficient C. elegans.

Fig 5

A. Mutational signatures of TLS and ICLR mutants that exhibited statistically significant differences to wild-type mutation rates displayed in numbers of mutations per generation. Same layout as Fig 4A. B. Proportion of indels (brown) and SVs (black) in G-rich regions in wild-type and across genotypes with elevated rates of SVs. Dotted line represents the proportion of variants falling into these regions as expected by chance. C. Tandem duplications (TDs) in helq-1 mutants. An analysis of rev-3 was previously shown in ([10] Fig 1C). Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

In addition, we observed a slightly increased base substitutions rate, namely for C>T changes, in polk-1 (Fig 5A, S10 Fig), which may indicate a role in error-free bypass of endogenously arising guanine modifications [58]. Mutants defective in REV-1 TLS polymerase did not demonstrate a significant and reliable change in mutagenesis compared to wild-type (S10 Fig).

Mutation accumulation in mutants deficient for DNA crosslink repair

The repair of DNA interstrand crosslinks (ICL) provides a formidable task. It involves the Fanconi Anaemia (FA) proteins required for sensing ICLs and assembling various repair factors at the site of damage [5961]. Here we investigate mutant lines defective for FNCM-1/FANCM, a helicase involved in DNA damage recognition, FANCI-1/FANCI, and FCD-2/FANCD2, the key Fanconi repair proteins ubiquitinated by an E3 ubiquitin ligase complex and thought to assemble proteins required for ICL processing. We did not observe overt differences in mutagenesis between fcd-2, fncm-1, or fnci-1 mutants, and wild-type, suggesting that the C. elegans Fanconi Anemia ICL repair pathway does not significantly contribute to the repair of DNA damage that occurs under normal, unchallenged growth conditions (Fig 2, S11A Fig). FAN-1/FAN1 is a conserved structure-specific DNA nuclease that can resolve ICLs independently of the FA pathway [6264]. As for the core FA components, we did not observe elevated mutation rates in fan-1 mutant lines (Fig 2, S11A Fig).

DOG-1, the C. elegans ortholog of the mammalian FANCJ helicase, facilitates error-free replication through DNA tertiary structures formed by G-rich DNA sequences, referred to as G-quadruplexes [6567]. dog-1 mutants exhibited increased mutagenesis (Fig 2) as previously described [67], with 6-fold higher numbers of 50–400 base pair indels and 13 fold more SVs, predominantly deletions (Fig 5A, S9 Fig). Across all 11 dog-1 deficient samples, 81% of long deletions (17/21) and 78% (109/139) of shorter, 50–400 bp deletions overlapped with one of the 4291 regions in the C. elegans genome predicted to form G-quadruplex structures [68], in line with previous reports [67] (Fig 5B, S9 and S11B Figs). We found that G-quadruplex induced deletions occurred at a frequency of about 1 lesion per generation in dog-1 mutants. We rarely observed SVs associated with G-quadruplex forming sequences in other DNA repair mutants (Fig 5B, S5, S6, S8, S9 Figs), including him-6, encoding the C. elegans ortholog of the mammalian BLM helicase (S9 Fig), which has been shown to prevent replication fork stalling at G-quadruplex sites in human and murine cells in conjunction with FANCJ [69].

Another helicase mutant that displayed a distinct phenotype was helq-1/HELQ, encoding for a conserved helicase and thought to act in DNA crosslink repair in a pathway separate from FCD-2 [70]. In addition, helq-1 has been shown to be synthetically lethal with rfs-1, due to its role in resolving DSB repair intermediates during meiosis [71]. helq-1 deficient lines showed an increased proportion of tandem duplications (TDs) compared to other strains (Figs 4B and 5A, S5 Fig). TDs ranged in size between 457 and 8089 bp with a median of 1270 bp (Fig 4C). The mutational spectrum of helq-1 differed from that of brc-1 mutants, in which deletions of 6–50 bp and TD (with a median size of ~12 kbps) were observed with comparable frequency (Fig 4B and 4C, S5 Fig). Interestingly, 41% (7/17) of TDs in helq-1 mutants were associated with inverted repeat sequences (S2 Table). To investigate how TDs present at inverted repeats relate to DNA replication directionality, we used the closest origin of replication as a reference point and tested the correlation between the orientation of the TD breakpoints and the direction of leading strand synthesis. Out of the 7 inverted repeat-associated TDs, we could determine the directionality of replication in 5 cases (Materials and Methods, S2 Table). In 4 cases, TD oriented in line with leading strand replication. In 3 of these, inverted repeats were present downstream of the TD, and in one case upstream (Materials and Methods, S2 Table). We speculate how these tandem duplications may arise in the discussion. In summary, our data suggest that the DNA helicases HELQ-1 and DOG-1 are required to facilitate replication fork passage through distinct secondary structures (Fig 5B and 5C). While DOG-1 is needed for the passage through G-rich structures [65], HELQ-1 may help to overcome stem loop structures, possibly on the lagging strand.

Mutations and subtelomeric chromosome fusions in ATM-1 defective strains

ATM is a conserved PI3 kinase involved in DNA damage checkpoint regulation and telomere homeostasis. ATM deficiency has been reported to be associated with shorter telomeres, from yeast to mammalian cells. Moreover, in ATM deficient yeasts, C. elegans and Drosophila, the last depending on retrotransposon transposition rather than telomerase activity for telomere maintenance, chromosome fusions have been observed cytologically or through sequencing of PCR products across chromosomes [7277]. C. elegans atm-1/ATM mutants are hypersensitive to ionizing radiation (IR) [78, 79]. In addition, atm-1 lines propagated over multiple generations have been described to display a stochastic him (high incidence of males) phenotype, an indicator of meiotic chromosome mis-segregation associated with sex chromosome to autosome fusions [78, 80]. Investigating mutation rates in atm-1 lines grown for 20 generations, we observed 2-fold elevated numbers of SVs, predominantly inversions (Figs 2 and 6A). This increased incidence of SVs agrees with previous estimates of mutation rates in atm-1 mutants which were based on scoring the number of essential mutations in atm-1 backgrounds [78]. Interestingly, 4 of the 5 atm-1 lines grown for 20 generations carried SVs (with over 70% (9/11) of all observed SVs) localised in subtelomeric regions (Fig 6B, S5 Fig), unlike any other DNA repair deficient mutants examined. Most subtelomeric SVs (8/9) could be classified as complex rearrangements with at least 2 overlapping or adjacent events, often associated with copy number changes (Fig 6C, S5 and S13 Figs). Moreover, 4/5 complex rearrangements involving a single chromosome end displayed a loss of telomere sequences, suggesting a possible end-to-end fusion between homologous chromosomes. Furthermore, we observed 2 cases of interchromosomal rearrangements between different autosomes with breakpoints in subtelomeric regions associated with deletion of telomere sequences and copy number alterations (Fig 6B, S5 and S13 Figs). None of the atm-1 lines carried translocations or amplifications of the genomic regions termed TALT1 or TALT2, amplified in survivors of telomerase-deficient C. elegans strains, and considered to be utilized as templates for an alternative (telomerase independent) telomere lengthening (ALT) mechanism [81]. Similarly, we did not observe translocation events associated with atm-1 SVs (S5 and S13 Figs) making templated telomere maintenance from interstitial telomere sequences buried in the genome unlikely.

Fig 6. Signatures and genomic features of mutations in DNA damage checkpoint and apoptosis deficient C. elegans.

Fig 6

A. Mutational signatures of mutants that exhibited statistically significant differences to wild-type mutation rates. The chromosomes on which respective genes are located are indicated in superscript following each mutant name. Layout as Fig 4A. B. Proportion of SVs with breakpoints into subtelomeric regions across wild-type and mutants that exhibit elevated SV rates. Dotted lines represent the fraction of variants expected to occur in subtelomeric regions by chance. C. Examples of subtelomeric structural variants in atm-1 mutants. D. Quantification of mutation burden in indicated DNA repair mutants for initial generations and F20 and F40 generations as shown. Information related to the 528 whole genome sequencing WGS primary-source datasets (56 deposited in this study, 472 deposited in (Suppl Data 1 and Supple Note 1 of [10] can be found in S1 Table).

We speculate that loss of C. elegans atm-1 function could lead to reduced recruitment or access of telomerase to the shortest telomeres consistent with studies in tel1/ATM mutant yeast [8284], thereby leading to telomere loss and chromosome fusions. Access of telomerase to its telomeric substrate could be inhibited by atm-1 loss due to reduced end resection, a mechanism also discussed for atm-/- dependent telomere shortening and fusions in mammalian cells [85].

p53 and apoptosis defective strains

In the presence of excessive DNA damage, cells can trigger the p53 pathway to activate their apoptotic demise. CED-3 and CED-4, orthologs of a human caspase and the APAF1 protein, are essential for DNA damage induced and developmental apoptosis [86]. In contrast, CEP-1, the C. elegans p53 ortholog is specifically required for DNA damage induced apoptosis [87, 88]. We did not observe increased mutagenesis in ced-3, ced-4, or cep-1 defective mutants, indicating no major role of DNA damage induced apoptosis in preventing mutagenesis under unchallenged growth conditions (S14 Fig). Our dataset also includes deletions of spindle assembly checkpoint (SAC) genes. The spindle assembly checkpoint delays anaphase progression until all chromosomes are correctly attached to the mitotic spindle apparatus, thereby ensuring faithful chromosome segregation [89]. In addition, SAC has been implicated in DSB repair consistent with bub-3 and san-1 SAC mutants exhibiting IR sensitivity [90]. Lines deficient for bub-3 and san-1, corresponding to human BUB3 and BUB1B proteins, respectively [91], did not show increased mutagenesis (Fig 2, S14 Fig).

Having observed distinct mutational patterns in HR deficient mutants, we wanted to test whether combining HR mutants with apoptosis and/or DNA damage response deficiency would lead to increased or altered mutagenesis. Increased cep-1/p53 dependent germ cell apoptosis has been reported in a number of HR mutants such as him-6 and brc-1, suggesting that a higher number of nuclei carry increased or unrepaired DNA damage which might be eliminated by cep-1 dependent apoptosis [22, 40, 92, 93]. Double mutants of him-6 with ced-3 (6 lines), ced-4 (3 lines) or cep-1 (3 lines) did not display changes in mutation rates or spectra compared to him-6 single mutants apart from 2 out of 3 him-6; ced-4 lines exhibiting a variable amount of clustering (S14 Fig, Fig 4D). In contrast, 4 cep-1; brc-1 brd-1 lines (but not the 5 brc-1 brd-1; ced-3 lines) showed an increased rate of mutagenesis compared to brc-1 brd-1 alone (Fig 6A and 6D). Specifically, cep-1 inactivation exaggerated the mutational features of the brc-1 brd-1 mutant, leading to increased incidence of small deletions and structural variants (Fig 6A). We also observed prominent clustering of base substitutions in all triple mutant lines averaging to over 20% of mutations being clustered (Fig 4D). Thus, at least for HR deficiency conferred by brc-1 brd-1, additional cep-1 inactivation increases mutation clustering. Given that apoptosis deficiency in brc-1 brd-1; ced-3 lines does not result in increased mutagenesis, we speculate that increased mutagenesis in brc-1 brd-1; cep-1 lines might be associated with a role of CEP-1, independent of apoptosis regulation. CEP-1 could either trigger the cell-cycle checkpoint or facilitate more efficient DNA repair.

Discussion

Here, extending on our previous study primarily focused on the effects of genotoxic agents [10], we systematically catalogued the mutational characteristics of DNA repair deficiencies across all conserved C. elegans DNA repair and damage response pathways in inbred lines propagated for up to 40 generations. Our data provide a comprehensive picture of the contributions of various DNA repair and damage response pathways towards genome integrity in an experimental system under spontaneous, endogenous conditions not challenged by mutagen exposure. Except for mismatch repair deficiency [11, 47], defined repair deficiencies only lead to modest effects, with a 2–5 fold increase of mutations in 44% of all strains tested, with notable examples in almost every pathway. Alkylguanine alkyltransferase, Uracil glycosylase and NER deficiency are associated with increased acquisition of base changes. TLS mutants rev-3(pol ζ) and polh-1(pol η) show elevated numbers of 50–400 bps deletions. Interestingly, HR deficiency can manifest in different ways. brc-1/Brca1 and rad-51 paralog mutants show elevated mutagenesis across most types of mutations. Other HR mutants inactivating the MUS-81 and SLX-1 nucleases, and the HIM-6/BLM, HELQ-1 and RTEL-1 helicases are associated with increased numbers of SVs. DOG-1 has a unique role in preventing deletions next to G-rich sequences, while the HELQ helicase may contribute to faithful replication across secondary DNA structures such as inverted repeats. RIP-1 appears to avert templated insertion into homeologous sequences. The ATM-1 checkpoint kinase prevents chromosome end-to-end fusions. Finally, deficiency of the p53 like gene cep-1 exacerbates mutagenesis caused by HR defects.

Redundancy of DNA repair pathways

It is well established that thousands of DNA lesions occur spontaneously during each cell-cycle and that the vast majority of DNA lesions are repaired. Thus, the absence of significant mutagenic effects in the majority of genotypes tested underpins a high level of redundancy within and among different DNA repair pathways. Moreover, it may require the combined deficiency of multiple DNA repair pathways to trigger excessive mutagenesis in the germline. Such reasoning is in line with our recent whole genome analyses showing that multiple pathways act in concert to repair DNA lesions caused by the exposure to known genotoxins, with deficiencies of different pathways potentially leading to increased mutagenesis and/or altered mutagenic signatures [10]. Equally, latent defects such as those caused by the deletion of the non-essential polymerase subunit pole-4, only become apparent in conjunction with a DNA repair deficiency [11]. While pole-4 alone does not cause increased mutagenesis, combining this mutant with MMR causes mutagenesis beyond what is observed upon MMR alone [11]. Many cases of DNA repair pathway redundancies have been described, for instance, simultaneously defective TLS, NER, and MMEJ renders C. elegans sensitive to normal levels of daylight [94].

In contrast to a recent large-scale mutation accumulation screen in budding yeast [47], we did not observe widespread copy number changes in our analysis, likely because most such changes are incompatible with viability and fertility of a multicellular organism such as C. elegans. It is possible that nematodes suffering from gross chromosomal alterations are lost during propagation across generations. However, using our experimental set-up, we were previously able to detect instances of severe chromosomal rearrangements, including complex chromosome fusion events that contain scars indicative of breakage-fusion-bridge cycles and chromothripsis [12]. Nevertheless, it is likely that we underestimate mutation rates, especially in strains that could not be propagated for 40 generations, namely those defective for the RTEL-1 helicase and the SMC-5 and SMC-6 cohesion proteins.

Mechanistic insights

Our detailed characterisation of mutation rates, mutational signatures and localised mutation features, combined with the known enzymology of many repair enzymes, provides mechanistic insights: Our data confirm the specific role of the DOG-1/FANCJ helicase in unwinding G-quadruplex forming sequences [95], and we show that this feature is unique among the helicase mutants we analysed. We also reveal a specific role of the HELQ-1 helicase frequently in the context of repetitive sequences such as inverted repeats. At present, we can only speculate how these tandem duplications might arise. It is likely that the genesis of TDs involves microhomology-mediated break-induced replication. Inverted repeats may form secondary stem-loop structures that impede replication fork progression (Fig 5C, S12 Fig). In the case of the presence of a stem-loop structure in the DNA ahead of a replication fork, stalling could occur during the attempt to unwind the secondary DNA structure. The replication machinery may re-initiate at a downstream template, resulting in a duplicated region, before successfully replicating through the stem loop structure in a second attempt. In such a scenario, tandem duplications would always occur upstream of the inverted repeat (Fig 5C, S2 Table). Alternatively, an inverted repeat could more readily adopt a stem loop structure in the extended single-stranded region of the lagging strand, which is particularly prone to form secondary structures (Fig 5C, S12 Fig). Such stem loop structures would be prime substrates for HELQ, which has been shown to bind to single-stranded DNA and act as a 3’ to 5’ helicase, thus facilitating the dissolution of the stem loop [96]. In the absence of HELQ activity, these stem loops could be recognised by a nuclease (S12 Fig) and a resulting single-strand break might facilitate the invasion into the freshly replicated leading strand DNA and prime break-induced replication. Recapturing the original template (S12 Fig, step 3) after break-induced replication would restore the fork, resulting in a tandem duplication in only one of the two chromatids (S12 Fig). Crucially, the position of the nucleolytic cut, upstream, downstream or within the inverted repeat, and the position at which invasion into the template strand occurs, would determine the breakpoint of the tandem duplication.

In summary, our data suggest that the DNA helicases HELQ-1 and DOG-1 are required to facilitate replication fork passage through distinct secondary structures (Fig 5B and 5C). While DOG-1 is needed for the passage through G-rich structures [65], HELQ-1 may help to overcome stem loop structures, possibly on the lagging strand.

Furthermore, cases of unique gene conversion events into homeologous sequences in rip-1 mutants support a role of RAD-51 paralogs in preventing homeologous recombination. This is in line with biochemical activity of the RFS-1 RIP-1 paralog complex in remodelling presynaptic RAD-51-containing DNA filaments to facilitate strand invasion for recombinational repair [31]. Finally, we provide evidence that the ATM-1 checkpoint kinase has a specific role in protecting sub-telomeric repeats from DSBs and preventing deletions, inversions and chromosome fusions.

Based on their mutational signatures, strains defective for homologous recombination can be broadly grouped into two classes. First, BRC-1 and RAD-51 paralog mutants show elevated numbers of point mutations, as well as increased numbers of small deletions, and structural variants. A similar pattern was observed in a study of HR knockouts in chicken DT40 cell lines [25]. We suspect that increased point mutations might be a scar indicative of error prone translesion synthesis, necessary when damaged bases are neither repaired by BER and NER nor by replication fork reversal which is linked to recombinational repair [97]. Point mutations and small deletions also occur when HR is replaced by more error-prone NHEJ or MMEJ pathways, the latter being associated with the occurrence of small deletions in human BRCA1 mutants [28]. Conversely, deficiencies of other HR proteins, like SLX-1 and MUS-81, and helicases including HIM-6, RTEL-1 and HELQ-1, are associated with a specific increase of SVs. We speculate that these proteins may not have a role in HR pathways directly linked with DNA replication (see below).

Nature of germ cell divisions and mutagenesis

It is important to keep in mind that germ cell mutagenesis might occur at different stages of the C. elegans life cycle. The nature of germ cell divisions is fundamentally different across various developmental stages. During the invariant embryonic development of C. elegans, the germ cell lineage is defined by 3 asymmetric cell divisions, and from the first zygotic cell division onwards, a single posterior daughter cell always defines the germ line, which finally splits as a part of the 4th germ cell division into the two founder cells [98]. Starting from the L1 larval stage, each of these founders within a timeframe of three days expands to form one of the two gonads comprising ~1000 germ cells [98]. Embryonic germ cell divisions occur very rapidly within a timeframe of less than 20 minutes, and cells are largely refractory to DNA damage checkpoints. Evidence exists that translesion polymerases are particularly important during this stage [99, 100]. The first cell-cycle in developing germ lines occurs after an extended period of transcriptional quiescence, and synchronized transcriptional onset appears particularly challenging for genome integrity [101, 102], all the more so that global transcriptional induction in these cells required topoisomerase II induced DNA double strand breakage which has to be mended [101]. In contrast, germ cells residing in the adult germ line are subject to cell-cycle and apoptosis checkpoints, and take an excess of 10 hours to complete [86]. It appears possible that the increased number of mutations observed in cep-1 brc-1 double mutants reflects a role of the CEP-1 p53 like protein in preventing excessive mutagenesis. CEP-1 is expressed in mitotically dividing germ cells, as well as in late pachytene where late stages of meiotic recombination occur. Given that we did not observe comparable mutagenesis in brc-1 and apoptosis defective double mutants, apoptosis being restricted to pachytene cells, we speculate that mutagenesis might reflect a role of CEP-1 in mitotically dividing germ cells, possibly affecting the cell-cycle checkpoint or DNA damage response. Finally, it is reasonable to suggest that many lesions we observe to accumulate in our transgenerational set-up occur in germ cells, especially during meiosis. Indeed, it appears plausible that many SVs might be associated with meiotic recombination. A large excess of DSBs are generated by the SPO-11 nuclease during meiosis, and typically only one DSB per chromosome pair matures into a crossover to facilitate the exchange of genetic information between maternal and paternal chromosomes [4]. Many SVs we observed in HR mutants, particularly those that did not demonstrate an excessive accumulation of point mutations, likely result from faulty recombinational events during meiosis: For instance, the SLX-1 and MUS-81 nucleases, and the HIM-6 helicase contribute to the resolution of meiotic Holliday junction intermediates [34, 103, 104]. In contrast, BRC-1 and the SMC-5/6 complex are implicated in the repair of the excess meiotic DSBs not engaged in crossover recombination [33].All in all, we provide a comprehensive view of how the DNA repair and damage response machinery acts to preserve genome integrity over generations. Even in the absence of strong exogeneous genotoxins, multiple DNA repair pathways are required to mend endogenous DNA damage. In wild-type, background mutagenesis leads to a relatively uniform mutation pattern. A similar flat mutational signature has been observed in the human germline and in somatic tissues [6, 9]. It is tempting to assume that a diverse set of DNA repair processes is also constantly operating in human germ and somatic cells during normal proliferation to ensure a highest possible level of genomic integrity, resulting in a flat residual mutational signature. It will be interesting to extend our studies to human inherited conditions, where defective DNA repair is associated with progeria, developmental defects, microcephaly, spinocerebellar ataxia, and cancer. Studying both experimental human cell models and patient samples will allow testing if increased mutagenesis occurs and if certain mutational features correlate with disease phenotypes.

Materials and methods

C. elegans strains, propagation and maintenance

All C. elegans strains used in this study, newly and previously generated [1012] (S1 Table) were backcrossed 6 times against the wild-type N2 Bristol reference strain TG1813 [1012], (Fig 1A). The majority of strains were clonally propagated for 20 or 40 generations as described previously [10]. rtel-1, smc-5, smc-6 lines were grown for 5 generations as these lines tended to become sterile when grown for more generations (S1 Table for number of lines and generations per genotype). As described in detail [10] 5–10 single L4 stage hermaphrodites (F1s) were randomly chosen for each genotype and transferred to individual 1× NGM plates seeded with OP50 bacteria. Every 3–4 days, 1 single L4 hermaphrodite was randomly chosen among the progeny per plate and individually propagated further, a process repeated until the indicated generation (F5, F20 and F40). Once final generation hermaphrodites had produced clonal progeny 5 lines were transferred to 9 cm 3× NGM plates and allowed to reach starvation. Mixed stage worms (the majority of which two generations the final transferred L4) were washed off plates, washed 3× in M9 medium, pelleted and frozen in liquid nitrogen [10]. Genomic DNA was isolated from three samples using the Invitrogen ChargeSwitch® gDNA Mini Tissue Kit (Thermo Fisher Scientific, CS11204) [12].

Variant analysis

The relevant procedures are described in detail [10]. In summary, DNA sequencing was performed using Illumina HiSeq 2000 and 10X Genomics short reads sequencing platforms at 100 bp paired-end, with a mean coverage of 50x. The resulting reads were then put through Sanger Cancer IT Pipeline, including alignment with BWA [105108] against WBcel235.74.dna.toplevel.fa as the reference genome (http://ftp.ensembl.org/pub/release-74/fasta/caenorhabditis_elegans/dna/), CaVEMan for base substitution calling [105107], Pindel for indel calling [105, 106]. Structural variants were called manually using DELLY [105] for calling structural variants and deletions/duplications longer than 400 bps. All variant calling procedures for each sample used a dedicated P0 sample from the same genotype group, or one wild-type P0 sample as a control (S1 Table).

Resulting variants were filtered based on the site coverage, number and orientation of the reads, supporting the variant in the test and control samples, overlap with other variants (relevant for substitutions and indels in homopolymer tracks), and a panel of 6 wild-type samples (for more detailed filtering description, see [10, 11]). In addition, we filtered out the recurrent variants between unrelated samples to ensure the absence of technical artifacts. The variants in samples with generation higher than 1 were filtered against all the P0 and F1 line samples of the same genotype. Mutations were classified based on their size and context: base substitutions were classified into single base substitutions, further split into 96 types by mutation type (C>A, C>G, C>T, T>A, T>C, T>G), and trinucleotide context based on pyrimidine reference, di- and multi-nucleotide variants; indels were classified based on event type (deletion, insertion, or complex indel), local context (whether it falls into a repetitive region, only for events smaller than 5 bp), and size (1bp, 2–5 bp, 5–50 bp, 50–400 bp); and structural variants were classified into deletions, tandem duplications, inversions, intra- and interchromosomal translocations, foldbacks or complex events based on the breakpoint orientation, and proximity (for more details, see [10]). Sample information and their corresponding ENA accession codes are listed in S1 Table. Filtered variant sets for the samples already published [10] are available in the supplementary data of the respective publication. Filtered variants of a further 31 samples analysed in this study are provided in S1 File.

Mutation rates and mutational signatures calculations

Total mutation rates, as well as the rates of base substitutions, indels and structural variants, for each genotype were expressed in mutations acquired per generation and were estimated using additive non-negative Poisson regression using the samples with generation higher than 1. Every sample i, i = 1,…N out of N = 528 was assigned a number of mutations of the category of interest, YiN{0}. For a vector of mutation counts across all samples Y, Y={Yi}i=1N, we calculated the mutation rates per generation for each genotype using the following model:

YPoisson(λ),λ=gGμ,

where gR+N is the adjusted number of generations which takes into account the 25% chances of a heterozygous mutation to be lost or to become fixed [12], GMatN×K({0,1}) is a binary matrix indicating the genotype of each sample, and μR+K is a non-negative vector of mutation rates per genotype, with K = 62 being the number of genotypes analysed.

To calculate the mutational signatures, which consisted of mutation rates across all R = 119 mutation types (96 single base substitutions, 2 types of multi-nucleotide substitutions, 14 types of indels and 7 types of structural variants), we used a negative binomial model to account for a higher variance compared to the mean in individual counts and a high amount of zero values. For a matrix of counts YMatN×R(N∪{0}), the matrix of signatures SMatR×K(R+∪{0}) was calculated using the following model (where ST denotes a transposed matrix S):

Y_NegativeBinomial(χ,ϕ),χ=gGST,ϕ=100.

The parameter ϕ = 100 was chosen empirically based on the estimations from the data, and suggests a slight deviation from Poisson model towards higher variance. Signatures were estimated using the log-normal prior Sij~logN(0,σ2) with a fixed parameter σ2 estimated from the data.

The estimates of the posterior means for total mutation rates in each genotype, as well as the mutation rates per type/class, were obtained using Hamiltonian Monte Carlo sampling procedure with at least 1000 warm-up and 1000 post warm-up samples [109]. As these estimates were assumed to be log-normally distributed, mutation rates in DNA repair deficient genotypes were compared to the respective entity for wild-type by testing whether a difference between their logarithms followed a normal distribution, or, equivalently, if their squared log ratio followed a chi-squared distribution. Resulting two-sided p-values were corrected for multiple testing across all genotypes using the Benjamini-Hochberg procedure [110]. Mutation rates per base pair per cell division were calculated assuming 15 cell divisions per generation and 2 copies of the 100,272,607 bp long nuclear genome.

Analysis of repetitive regions and G4-prone sequences

Genome-wide G4-prone sequences for C. elegans [68], and repetitive regions as deposited in Repbase (www.girinst.org/downloads/repeatmaps/C.Elegans) were used to determine the association of SVs with specific genomic regions [111]. For each SV, 60 bp regions around the breakpoints were overlapped with the location of G4-prone and repetitive regions. Only unique variants were used to calculate the associated proportions for each genotype. Proportions expected by chance were estimated as the ratios between the sums of all regions of interest, and C. elegans genome size.

Relationship to replication directionality

Directionality of replication was determined using Okazaki fragment sequencing from [112] performed on different developmental stages of C. elegans. To identify the directions of replication across the genome, we split the genome in 100 bp bins and calculated the fractions of Okazaki fragment reads on the minus strand within each bin for each of the 6 samples analysed in the study:

tjl=tjltj+ltjl+tj+l,

where j = 1…,6 denotes the sample index, and l denotes the index of the bin. The bins where the average across samples was greater than its standard deviation, i.e. |meanj(tjl)|>2sdj(tjl), were assigned a “+” direction (or called right-replicating) if meanj(tjl)>0, and a “-” direction (or called left-replicating regions) if meanj(tjl)<0. The bins where the standard deviation of the fraction of minus strand Okazaki fragment reads was greater than its average (|meanj(tjl)|<2sdj(tjl)) were considered as having an ambiguous replication direction. In total, we inferred the direction of replication for 45% of the genome.

Analysis of clustered mutations

Clustering of mutations was assessed using the start points of all base substitutions and indels across samples of the same genotype and generation. Clustered status was assigned based on a sliding window of 1000 bp. Estimates of the proportion of clustered mutations were obtained from a linear model using samples with generation higher than 1 in genetic backgrounds with >3 such samples: Proportion_clustered ~ rates + ε, ε ~ N(0,σ2), and all DNA repair deficient backgrounds were compared to that in wild-type by the following Z test: Z=rgrwtSE(rg)2+SE(rwt)2 [113], where rg denotes the rate for a DNA repair deficient genotype g. False discovery rate among the resulting p-values was corrected for multiple testing using Benjamini-Hochberg procedure [110].

Microhomologies at SV and indel breakpoints

Microhomologies (MH) at the breakpoints of SVs and indels were assessed by measuring the lengths of precise alignments around each breakpoint, calculated as a sum of perfect alignment lengths between the two 30 bp regions upstream, and two 30 bp regions downstream from the breakpoint sites. Only unique SVs and indels were used to calculate the proportions of variants with MH for each genotype.

Supporting information

S1 Fig. Summary of mutations acquired in the screen.

Total number of mutations (substitutions in black, indels in green, and structural variants in purple) per sample across wild-type and DNA repair mutant lines. The black line denotes the median number of mutations for each mutation class across all experiments.

(EPS)

S2 Fig. Mutational signatures and distribution of mutations in DR and BER deficient mutants.

A. Mutational signatures of all tested DR and BER mutants, shown as numbers of mutations per generation. Bold coloured bars below each mutation profile indicate mutation types for which the sum across individual classes is different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type with FDR < 5%. B. Distribution of mutations across chromosomes for DR and BER deficient samples. Only agt-2 shows a significant degree of mutation clustering. The pink and orange shaded regions in parp-1 and parp-2 samples, respectively, indicate the location and extent of observed structural variants (pink—TD (tandem duplication); orange—Fold (foldback duplication)).

(EPS)

S3 Fig. Mutational signatures and distribution of mutations in NER deficient mutants.

A. Mutational signatures of all tested NER mutants displayed as numbers of mutations per generation. Bright bars denote individual mutation classes for which mutation numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Comparison between the humanised version of the combined mutational spectrum across all NER deficient C. elegans samples and COSMIC signatures SBS5 and SBS8 previously associated with NER deficiency (cosine similarity scores 0.64 and 0.56, respectively). No other COSMIC SBS signatures showed significant similarity to the C. elegans NER spectrum.

(EPS)

S4 Fig. Mutational signatures and characteristics of mutagenesis in DSBR mutants.

A. Mutational signatures of all tested DSBR mutants displayed as numbers of mutations per generation. Bright bars denote individual mutation classes for which mutation numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Proportion of SVs and indels with breakpoint microhomology across DSBR mutants. C. Distribution of microhomology sizes at indel and SV breakpoints across DSBR mutants. D. Proportion of SVs with breakpoints in repetitive regions across wild-type and genotypes with elevated SV rates. Dotted lines represent the fraction of variants expected to arise in repetitive regions by chance.

(EPS)

S5 Fig. Features of structural variants in wild-type, atm-1, brc-1 brd-1, and helq-1 mutants.

The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective European Nucleotide Archive (ENA) accession numbers are available in the sample description of S1 Table.

(EPS)

S6 Fig. Features of structural variants in wild-type, rip-1, rfs-1, smc-5, and smc-6 HR mutants.

The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

(EPS)

S7 Fig. Translocations in rip-1 mutants.

Visual representation of the 3 duplications/translocation type events observed in two rip-1 F40 lines. CD numbers reflect the European Nucleotide Archive designation of sequenced samples. Translocations (TRLS) are shown using the wild-type genomic region as reference, nucleotides demarking key features, such as the position of the translocation, the donor sequence and the deletion in the acceptor locus being indicated. In the respective lower panes the level of sequence homology between donor and acceptor loci is indicated. Green arrows indicate tandem duplication-like (TD-like) breakpoints, blue arrows deletion-like (DEL-like) breakpoints.

(EPS)

S8 Fig. Features of structural variants in wild-type, mus-81, slx-1, and rtel-1 HR mutants.

The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

(EPS)

S9 Fig. Features of structural variants in wild-type, him-6, wrn-1, and dog-1 HR mutants.

The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

(EPS)

S10 Fig. Mutational signatures and characteristics of mutagenesis in TLS mutants.

A. Mutational signatures of all tested TLS mutants displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%).

(EPS)

S11 Fig. Mutational signatures and characteristics of mutagenesis in DNA crosslink repair mutants.

A. Mutational signatures of tested DNA crosslink repair mutants displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which numbers differ significantly from wild-type. bold coloured bars below each mutation profile indicate a wider range of mutation types for which the total numbers of mutations are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Distribution of mutations across 11 samples of F20 dog-1 helicase deficient lines. Over 60% of variants occur in G4-forming regions (large symbols).

(EPS)

S12 Fig. Possible mechanism for inducing tandem duplications in loci containing inverted repeat sequences.

Step-wise representation of a possible mechanism leading to tandem duplications (TDs) in helq-1 mutants. Inverted sequences are depicted by red lines with arrows indicating directionality. Replication forks are shown as splayed structures with lagging strand discontinuity indicated by arrows with dotted lines. Scissors indicate a putative endonuclease cleavage site.

(EPS)

S13 Fig. Telomere proximal structural variants observed in F20 generation atm-1 mutants.

Structural variants in telomere proximal regions are shown for the indicated atm-1 F20 lines (S1 Table) with their chromosomal locations. Individual structural variants are coloured based on their classification as duplications (DUP), deletions (DEL), inversion (INV), and interchromosomal rearrangement (BND, breakends) with vertical lines denoting breakpoint locations and horizontal lines spanning the regions between related breakpoints. Black dots represent the coverage in bins of 50 bp or 100 bp (for regions above 20 kb) adjusted for the coverage in the corresponding control sample and indicate copy number.

(EPS)

S14 Fig. Mutational signatures and characteristics of mutagenesis in checkpoint and apoptosis mutants.

Mutational signatures of the indicated genotypes are displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which the numbers differ significantly from wild-type. Bold coloured bars below each mutation profile indicate a wider range of mutation types in which the total number of mutations is different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%).

(EPS)

S1 Table. Details of C. elegans strains used in and sequencing samples from this study.

(XLSX)

S2 Table. Analysis of tandem duplications in helq-1 mutants and their correlation with inverted repeat sequences.

(XLSX)

S1 File

(DOCX)

Acknowledgments

We thank the Mitani Lab and the Caenorhabditis Genetics Center for providing C. elegans strains. We are grateful to the COMSIG consortium for discussing unpublished data, and to Kei-ichi Takeda and Orlando Schaerer for feedback and proofreading.

Data Availability

Sequencing data and variant calling files are available under ENA Study Accession Numbers ERP000975 and ERP004086. ENA sample IDs are annotated in S1 Table. R code for the analysis of mutation rates and signatures as well as mutational clustering is available on GitHub under http://github.com/gerstung-lab/mutationaccumulation.

Funding Statement

This work was supported by: AG: Wellcome Trust [COMSIG consortium grant RG70175 and Senior Research Award 090944/Z/09/Z] https://wellcome.org/. AG: Worldwide Cancer Research [18-0644] https://www.worldwidecancerresearch.org/. AG: Korean Institute for Basic Science [IBS-R022-A2-2020] https://www.ibs.re.kr/eng.do. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Lindahl T, Barnes DE. Repair of endogenous DNA damage. Cold Spring Harb Symp Quant Biol. 2000;65: 127–133. 10.1101/sqb.2000.65.127 [DOI] [PubMed] [Google Scholar]
  • 2.Ames BN, Shigenaga MK, Gold LS. DNA lesions, inducible DNA repair, and cell division: three key factors in mutagenesis and carcinogenesis. Environ Health Perspect. 1993;101 Suppl 5: 35–44. 10.1289/ehp.93101s535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Mehta A, Haber JE. Sources of DNA double-strand breaks and models of recombinational DNA repair. Cold Spring Harb Perspect Biol. 2014;6: a016428. 10.1101/cshperspect.a016428 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hillers KJ, Jantsch V, Martinez-Perez E, Yanowitz JL. Meiosis (May 4, 2017), WormBook, ed. The C. elegans Research Community, WormBook, 10.1895/wormbook.1.178.1.2017 [DOI] [PMC free article] [PubMed]
  • 5.Keightley PD, Ness RW, Halligan DL, Haddrill PR. Estimation of the Spontaneous Mutation Rate per Nucleotide Site in aDrosophila melanogasterFull-Sib Family. Genetics. 2014. pp. 313–320. 10.1534/genetics.113.158758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rahbari R, Wuster A, Lindsay SJ, Hardwick RJ, Alexandrov LB, Turki SA, et al. Timing, rates and spectra of human germline mutation. Nat Genet. 2016;48: 126–133. 10.1038/ng.3469 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Venn O, Turner I, Mathieson I, de Groot N, Bontrop R, McVean G. Nonhuman genetics. Strong male bias drives germline mutation in chimpanzees. Science. 2014;344: 1272–1275. 10.1126/science.344.6189.1272 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Denver DR, Dolan PC, Wilhelm LJ, Sung W, Lucas-Lledó JI, Howe DK, et al. A genome-wide view of Caenorhabditis elegans base-substitution mutation processes. Proc Natl Acad Sci U S A. 2009;106: 16310–16314. 10.1073/pnas.0904895106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Alexandrov LB, Kim J, Haradhvala NJ, Huang MN, Tian Ng AW, Wu Y, et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578: 94–101. 10.1038/s41586-020-1943-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Volkova NV, Meier B, González-Huici V, Bertolini S, Gonzalez S, Vöhringer H, et al. Mutational signatures are jointly shaped by DNA damage and repair. Nat Commun. 2020;11: 2169. 10.1038/s41467-020-15912-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Meier B, Volkova NV, Hong Y, Schofield P, Campbell PJ, Gerstung M, et al. Mutational signatures of DNA mismatch repair deficiency in C. elegans and human cancers. Genome Res. 2018;28: 666–675. 10.1101/gr.226845.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Meier B, Cooke SL, Weiss J, Bailly AP, Alexandrov LB, Marshall J, et al. C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency. Genome Res. 2014;24: 1624–1636. 10.1101/gr.175547.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zou X, Owusu M, Harris R, Jackson SP, Loizou JI, Nik-Zainal S. Validating the concept of mutational signatures with isogenic cell models. Nat Commun. 2018;9: 1744. 10.1038/s41467-018-04052-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Drost J, van Boxtel R, Blokzijl F, Mizutani T, Sasaki N, Sasselli V, et al. Use of CRISPR-modified human stem cell organoids to study the origin of mutational signatures in cancer. Science. 2017;358: 234–238. 10.1126/science.aao3130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Brambati A, Barry RM, Sfeir A. DNA polymerase theta (Polθ)—an error-prone polymerase necessary for genome stability. Curr Opin Genet Dev. 2020;60: 119–126. 10.1016/j.gde.2020.02.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Seol J-H, Shim EY, Lee SE. Microhomology-mediated end joining: Good, bad and ugly. Mutat Res. 2018;809: 81–87. 10.1016/j.mrfmmm.2017.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wood RD, Doublié S. DNA polymerase θ (POLQ), double-strand break repair, and cancer. DNA Repair. 2016;44: 22–32. 10.1016/j.dnarep.2016.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Schimmel J, van Schendel R, den Dunnen JT, Tijsterman M. Templated Insertions: A Smoking Gun for Polymerase Theta-Mediated End Joining. Trends Genet. 2019. 10.1016/j.tig.2019.06.001 [DOI] [PubMed] [Google Scholar]
  • 19.Kanugula S, Pegg AE. Novel DNA repair alkyltransferase fromCaenorhabditis elegans. Environmental and Molecular Mutagenesis. 2001. pp. 235–243. 10.1002/em.1077 [DOI] [PubMed] [Google Scholar]
  • 20.Nakamura N, Morinaga H, Kikuchi M, Yonekura S-I, Ishii N, Yamamoto K, et al. Cloning and characterization of uracil-DNA glycosylase and the biological consequences of the loss of its function in the nematode Caenorhabditis elegans. Mutagenesis. 2008;23: 407–413. 10.1093/mutage/gen030 [DOI] [PubMed] [Google Scholar]
  • 21.Lans H, Vermeulen W. Nucleotide Excision Repair inCaenorhabditis elegans. Molecular Biology International. 2011. pp. 1–12. 10.4061/2011/542795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Boulton SJ, Martin JS, Polanowska J, Hill DE, Gartner A, Vidal M. BRCA1/BARD1 Orthologs Required for DNA Repair in Caenorhabditis elegans. Current Biology. 2004. pp. 33–39. 10.1016/j.cub.2003.11.029 [DOI] [PubMed] [Google Scholar]
  • 23.Polak P, Kim J, Braunstein LZ, Karlic R, Haradhavala NJ, Tiao G, et al. A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer. Nat Genet. 2017;49: 1476–1486. 10.1038/ng.3934 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Riaz N, Blecua P, Lim RS, Shen R, Higginson DS, Weinhold N, et al. Pan-cancer analysis of bi-allelic alterations in homologous recombination DNA repair genes. Nat Commun. 2017;8: 857. 10.1038/s41467-017-00921-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Póti Á, Gyergyák H, Németh E, Rusz O, Tóth S, Kovácsházi C, et al. Correlation of homologous recombination deficiency induced mutational signatures with sensitivity to PARP inhibitors and cytotoxic agents. Genome Biol. 2019;20: 240. 10.1186/s13059-019-1867-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Nik-Zainal S, Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature. 2016;534: 47–54. 10.1038/nature17676 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Macintyre G, Goranova TE, De Silva D, Ennis D, Piskorz AM, Eldridge M, et al. Copy number signatures and mutational processes in ovarian carcinoma. Nat Genet. 2018;50: 1262–1270. 10.1038/s41588-018-0179-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Viari A, Campbell PJ, Stratton MR, Nik-Zainal S. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nat Med. 2017. Available: https://www.nature.com/nm/journal/v23/n4/abs/nm.4292.html 10.1038/nm.4292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Harris J, Lowden M, Clejan I, Tzoneva M, Thomas JH, Hodgkin J, et al. Mutator phenotype of Caenorhabditis elegans DNA damage checkpoint mutants. Genetics. 2006;174: 601–616. 10.1534/genetics.106.058701 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Pontier DB, Tijsterman M. A robust network of double-strand break repair pathways governs genome integrity during C. elegans development. Curr Biol. 2009;19: 1384–1388. 10.1016/j.cub.2009.06.045 [DOI] [PubMed] [Google Scholar]
  • 31.Taylor MRG, Špírek M, Chaurasiya KR, Ward JD, Carzaniga R, Yu X, et al. Rad51 Paralogs Remodel Pre-synaptic Rad51 Filaments to Stimulate Homologous Recombination. Cell. 2015;162: 271–286. 10.1016/j.cell.2015.06.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bickel JS, Chen L, Hayward J, Yeap SL, Alkers AE, Chan RC. Structural Maintenance of Chromosomes (SMC) Proteins Promote Homolog-Independent Recombination Repair in Meiosis Crucial for Germ Cell Genomic Stability. PLoS Genetics. 2010. p. e1001028. 10.1371/journal.pgen.1001028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hong Y, Sonneville R, Agostinho A, Meier B, Wang B, Blow JJ, et al. The SMC-5/6 Complex and the HIM-6 (BLM) Helicase Synergistically Promote Meiotic Recombination Intermediate Processing and Chromosome Maturation during Caenorhabditis elegans Meiosis. PLoS Genet. 2016;12: e1005872. 10.1371/journal.pgen.1005872 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Agostinho A, Meier B, Sonneville R, Jagut M, Woglar A, Blow J, et al. Correction: Combinatorial Regulation of Meiotic Holliday Junction Resolution in C. elegans by HIM-6 (BLM) Helicase, SLX-4, and the SLX-1, MUS-81 and XPF-1 Nucleases. PLoS Genetics. 2013. 10.1371/annotation/d8c73205-151d-4e22-89c6-3aa574037d10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wyatt HDM, Laister RC, Martin SR, Arrowsmith CH, West SC. The SMX DNA Repair Tri-nuclease. Mol Cell. 2017;65: 848–860.e11. 10.1016/j.molcel.2017.01.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wyatt HDM, Sarbajna S, Matos J, West SC. Coordinated actions of SLX1-SLX4 and MUS81-EME1 for Holliday junction resolution in human cells. Mol Cell. 2013;52: 234–247. 10.1016/j.molcel.2013.08.035 [DOI] [PubMed] [Google Scholar]
  • 37.Bailly AP, Freeman A, Hall J, Déclais A-C, Alpi A, Lilley DMJ, et al. The Caenorhabditis elegans homolog of Gen1/Yen1 resolvases links DNA damage signaling to DNA double-strand break repair. PLoS Genet. 2010;6: e1001025. 10.1371/journal.pgen.1001025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hong Y, Velkova M, Silva N, Jagut M, Scheidt V, Labib K, et al. The conserved LEM-3/Ankle1 nuclease is involved in the combinatorial regulation of meiotic recombination repair and chromosome segregation in Caenorhabditis elegans. PLoS Genet. 2018;14: e1007453. 10.1371/journal.pgen.1007453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.De Muyt A, Jessop L, Kolar E, Sourirajan A, Chen J, Dayani Y, et al. BLM helicase ortholog Sgs1 is a central regulator of meiotic recombination intermediate metabolism. Mol Cell. 2012;46: 43–53. 10.1016/j.molcel.2012.02.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wicky C, Alpi A, Passannante M, Rose A, Gartner A, Müller F. Multiple genetic pathways involving the Caenorhabditis elegans Bloom’s syndrome genes him-6, rad-51, and top-3 are needed to maintain genome stability in the germ line. Molecular and Cellular Biology. 2004;24: 5016–5027. 10.1128/MCB.24.11.5016-5027.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ozgenc A, Loeb LA. Current advances in unraveling the function of the Werner syndrome protein. Mutat Res. 2005;577: 237–251. 10.1016/j.mrfmmm.2005.03.020 [DOI] [PubMed] [Google Scholar]
  • 42.Lee S-J, Yook J-S, Han SM, Koo H-S. A Werner syndrome protein homolog affects C. elegans development, growth rate, life span and sensitivity to DNA damage by acting at a DNA damage checkpoint. Development. 2004;131: 2565–2575. 10.1242/dev.01136 [DOI] [PubMed] [Google Scholar]
  • 43.Paliwal S, Kanagaraj R, Sturzenegger A, Burdova K, Janscak P. Human RECQ5 helicase promotes repair of DNA double-strand breaks by synthesis-dependent strand annealing. Nucleic Acids Res. 2014;42: 2380–2390. 10.1093/nar/gkt1263 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Uringa E-J, Youds JL, Lisaingo K, Lansdorp PM, Boulton SJ. RTEL1: an essential helicase for telomere maintenance and the regulation of homologous recombination. Nucleic Acids Res. 2011;39: 1647–1655. 10.1093/nar/gkq1045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.León-Ortiz AM, Panier S, Sarek G, Vannier J-B, Patel H, Campbell PJ, et al. A Distinct Class of Genome Rearrangements Driven by Heterologous Recombination. Mol Cell. 2018;69: 292–305.e6. 10.1016/j.molcel.2017.12.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Youds JL, Mets DG, McIlwraith MJ, Martin JS, Ward JD, ONeil NJ, et al. RTEL-1 enforces meiotic crossover interference and homeostasis. Science. 2010;327: 1254–1258. 10.1126/science.1183112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Puddu F, Herzog M, Selivanova A, Wang S, Zhu J, Klein-Lavi S, et al. Genome architecture and stability in the Saccharomyces cerevisiae knockout collection. Nature. 2019;573: 416–420. 10.1038/s41586-019-1549-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Supek F, Lehner B. Clustered Mutation Signatures Reveal that Error-Prone DNA Repair Targets Mutations to Active Genes. Cell. 2017;170: 534–547.e23. 10.1016/j.cell.2017.07.003 [DOI] [PubMed] [Google Scholar]
  • 49.Stone JE, Lujan SA, Kunkel TA, Kunkel TA. DNA polymerase zeta generates clustered mutations during bypass of endogenous DNA lesions in Saccharomyces cerevisiae. Environ Mol Mutagen. 2012;53: 777–786. 10.1002/em.21728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Roberts SA, Sterling J, Thompson C, Harris S, Mav D, Shah R, et al. Clustered mutations in yeast and in human cancers can arise from damaged long single-strand DNA regions. Mol Cell. 2012;46: 424–435. 10.1016/j.molcel.2012.03.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chen J-M, Férec C, Cooper DN. Transient hypermutability, chromothripsis and replication-based mechanisms in the generation of concurrent clustered mutations. Mutat Res. 2012;750: 52–59. 10.1016/j.mrrev.2011.10.002 [DOI] [PubMed] [Google Scholar]
  • 52.Sale JE. Translesion DNA synthesis and mutagenesis in eukaryotes. Cold Spring Harb Perspect Biol. 2013;5: a012708. 10.1101/cshperspect.a012708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Roerink SF, van Schendel R, Tijsterman M. Polymerase theta-mediated end joining of replication-associated DNA breaks in C. elegans. Genome Res. 2014;24: 954–962. 10.1101/gr.170431.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.van Schendel R, van Heteren J, Welten R, Tijsterman M. Genomic Scars Generated by Polymerase Theta Reveal the Versatile Mechanism of Alternative End-Joining. PLoS Genet. 2016;12: e1006368. 10.1371/journal.pgen.1006368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.van Bostelen I, van Schendel R, Romeijn R, Tijsterman M. Translesion synthesis polymerases are dispensable for C. elegans reproduction but suppress genome scarring by polymerase theta-mediated end joining. PLoS Genet. 2020;16: e1008759. 10.1371/journal.pgen.1008759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lawrence CW, Hinkle DC. DNA polymerase zeta and the control of DNA damage induced mutagenesis in eukaryotes. Cancer Surv. 1996;28: 21–31. [PubMed] [Google Scholar]
  • 57.Lange SS, Wittschieben JP, Wood RD. DNA polymerase zeta is required for proliferation of normal mammalian cells. Nucleic Acids Res. 2012;40: 4473–4482. 10.1093/nar/gks054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Choi J-Y, Angel KC, Guengerich FP. Translesion synthesis across bulky N2-alkyl guanine DNA adducts by human DNA polymerase kappa. J Biol Chem. 2006;281: 21062–21072. 10.1074/jbc.M602246200 [DOI] [PubMed] [Google Scholar]
  • 59.Kottemann MC, Smogorzewska A. Fanconi anaemia and the repair of Watson and Crick DNA crosslinks. Nature. 2013;493: 356–363. 10.1038/nature11863 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Hashimoto S, Anai H, Hanada K. Mechanisms of interstrand DNA crosslink repair and human disorders. Genes Environ. 2016;38: 9. 10.1186/s41021-016-0037-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Ceccaldi R, Sarangi P, D’Andrea AD. The Fanconi anaemia pathway: new players and new functions. Nat Rev Mol Cell Biol. 2016;17: 337–349. 10.1038/nrm.2016.48 [DOI] [PubMed] [Google Scholar]
  • 62.Jin H, Cho Y. Structural and functional relationships of FAN1. DNA Repair. 2017;56: 135–143. 10.1016/j.dnarep.2017.06.016 [DOI] [PubMed] [Google Scholar]
  • 63.MacKay C, Déclais A-C, Lundin C, Agostinho A, Deans AJ, MacArtney TJ, et al. Identification of KIAA1018/FAN1, a DNA repair nuclease recruited to DNA damage by monoubiquitinated FANCD2. Cell. 2010;142: 65–76. 10.1016/j.cell.2010.06.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Kratz K, Schöpf B, Kaden S, Sendoel A, Eberhard R, Lademann C, et al. Deficiency of FANCD2-associated nuclease KIAA1018/FAN1 sensitizes cells to interstrand crosslinking agents. Cell. 2010;142: 77–88. 10.1016/j.cell.2010.06.022 [DOI] [PubMed] [Google Scholar]
  • 65.Cheung I, Schertzer M, Rose A, Lansdorp PM. Disruption of dog-1 in Caenorhabditis elegans triggers deletions upstream of guanine-rich DNA. Nat Genet. 2002;31: 405–409. 10.1038/ng928 [DOI] [PubMed] [Google Scholar]
  • 66.Youds JL, Barber LJ, Ward JD, Collis SJ, O’Neil NJ, Boulton SJ, et al. DOG-1 is the Caenorhabditis elegans BRIP1/FANCJ homologue and functions in interstrand cross-link repair. Mol Cell Biol. 2008;28: 1470–1479. 10.1128/MCB.01641-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Tarailo-Graovac M, Wong T, Qin Z, Flibotte S, Taylor J, Moerman DG, et al. Spectrum of variations in dog-1/FANCJ and mdf-1/MAD1 defective Caenorhabditis elegans strains after long-term propagation. BMC Genomics. 2015;16: 210. 10.1186/s12864-015-1402-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Marsico G, Chambers VS, Sahakyan AB, McCauley P, Boutell JM, Antonio MD, et al. Whole genome experimental maps of DNA G-quadruplexes in multiple species. Nucleic Acids Res. 2019;47: 3862–3874. 10.1093/nar/gkz179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.van Wietmarschen N, Merzouk S, Halsema N, Spierings DCJ, Guryev V, Lansdorp PM. BLM helicase suppresses recombination at G-quadruplex motifs in transcribed genes. Nat Commun. 2018;9: 271. 10.1038/s41467-017-02760-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Muzzini DM, Plevani P, Boulton SJ, Cassata G, Marini F. Caenorhabditis elegans POLQ-1 and HEL-308 function in two distinct DNA interstrand cross-link repair pathways. DNA Repair. 2008;7: 941–950. 10.1016/j.dnarep.2008.03.021 [DOI] [PubMed] [Google Scholar]
  • 71.Ward JD, Muzzini DM, Petalcorin MIR, Martinez-Perez E, Martin JS, Plevani P, et al. Overlapping mechanisms promote postsynaptic RAD-51 filament disassembly during meiotic double-strand break repair. Mol Cell. 2010;37: 259–272. 10.1016/j.molcel.2009.12.026 [DOI] [PubMed] [Google Scholar]
  • 72.Chan SW-L, Blackburn EH. Telomerase and ATM/Tel1p protect telomeres from nonhomologous end joining. Mol Cell. 2003;11: 1379–1387. 10.1016/s1097-2765(03)00174-6 [DOI] [PubMed] [Google Scholar]
  • 73.Mieczkowski PA, Mieczkowska JO, Dominska M, Petes TD. Genetic regulation of telomere-telomere fusions in the yeast Saccharomyces cerevisae. Proceedings of the National Academy of Sciences. 2003. pp. 10854–10859. 10.1073/pnas.1934561100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Bi X, Wei S-CD, Rong YS. Telomere protection without a telomerase; the role of ATM and Mre11 in Drosophila telomere maintenance. Curr Biol. 2004;14: 1348–1353. 10.1016/j.cub.2004.06.063 [DOI] [PubMed] [Google Scholar]
  • 75.Silva E, Tiong S, Pedersen M, Homola E, Royou A, Fasulo B, et al. ATM Is Required for Telomere Maintenance and Chromosome Stability during Drosophila Development. Curr Biol. 2004;14: 1341–1347. 10.1016/j.cub.2004.06.056 [DOI] [PubMed] [Google Scholar]
  • 76.Oikemus SR, McGinnis N, Queiroz-Machado J, Tukachinsky H, Takada S, Sunkel CE, et al. Drosophila atm/telomere fusion is required for telomeric localization of HP1 and telomere position effect. Genes Dev. 2004;18: 1850–1861. 10.1101/gad.1202504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Lee SS, Bohrson C, Pike AM, Wheelan SJ, Greider CW. ATM Kinase Is Required for Telomere Elongation in Mouse and Human Cells. Cell Rep. 2015;13: 1623–1632. 10.1016/j.celrep.2015.10.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Jones MR, Huang JC, Chua SY, Baillie DL, Rose AM. The atm-1 gene is required for genome stability in Caenorhabditis elegans. Mol Genet Genomics. 2012;287: 325–335. 10.1007/s00438-012-0681-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Boulton SJ, Gartner A, Reboul J, Vaglio P, Dyson N, Hill DE, et al. Combined functional genomic maps of the C. elegans DNA damage response. Science. 2002;295: 127–131. 10.1126/science.1065986 [DOI] [PubMed] [Google Scholar]
  • 80.Ahmed S, Hodgkin J. MRT-2 checkpoint protein is required for germline immortality and telomere replication in C. elegans. Nature. 2000;403: 159–164. 10.1038/35003120 [DOI] [PubMed] [Google Scholar]
  • 81.Seo B, Kim C, Hills M, Sung S, Kim H, Kim E, et al. Telomere maintenance through recruitment of internal genomic regions. Nat Commun. 2015;6: 8189. 10.1038/ncomms9189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Hector RE, Shtofman RL, Ray A, Chen B-R, Nyun T, Berkner KL, et al. Tel1p preferentially associates with short telomeres to stimulate their elongation. Mol Cell. 2007;27: 851–858. 10.1016/j.molcel.2007.08.007 [DOI] [PubMed] [Google Scholar]
  • 83.Bianchi A, Shore D. Increased association of telomerase with short telomeres in yeast. Genes Dev. 2007;21: 1726–1730. 10.1101/gad.438907 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Ritchie KB, Petes TD. The Mre11p/Rad50p/Xrs2p complex and the Tel1p function in a single pathway for telomere maintenance in yeast. Genetics. 2000;155: 475–479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Wu Y, Xiao S, Zhu X-D. MRE11-RAD50-NBS1 and ATM function as co-mediators of TRF1 in telomere length control. Nat Struct Mol Biol. 2007;14: 832–840. 10.1038/nsmb1286 [DOI] [PubMed] [Google Scholar]
  • 86.Gartner A, Milstein S, Ahmed S, Hodgkin J, Hengartner MO. A conserved checkpoint pathway mediates DNA damage—induced apoptosis and cell cycle arrest in C. elegans. Mol Cell. 2000;5: 435–443. 10.1016/s1097-2765(00)80438-4 [DOI] [PubMed] [Google Scholar]
  • 87.Schumacher B, Hofmann K, Boulton S, Gartner A. The C. elegans homolog of the p53 tumor suppressor is required for DNA damage-induced apoptosis. Current Biology. 2001. pp. 1722–1727. 10.1016/s0960-9822(01)00534-6 [DOI] [PubMed] [Google Scholar]
  • 88.Derry WB, Putzke AP, Rothman JH. Caenorhabditis elegans p53: role in apoptosis, meiosis, and stress resistance. Science. 2001;294: 591–595. 10.1126/science.1065486 [DOI] [PubMed] [Google Scholar]
  • 89.Encalada SE, Willis J, Lyczak R, Bowerman B. A spindle checkpoint functions during mitosis in the early Caenorhabditis elegans embryo. Mol Biol Cell. 2005;16: 1056–1070. 10.1091/mbc.e04-08-0712 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Bertolini S, Wang B, Meier B, Hong Y, Gartner A. Caenorhabditis elegans BUB-3 and SAN-1/MAD3 Spindle Assembly Checkpoint Components Are Required for Genome Stability in Response to Treatment with Ionizing Radiation. G3. 2017;7: 3875–3885. 10.1534/g3.117.1122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Hajeri VA, Stewart AM, Moore LL, Padilla PA. Genetic analysis of the spindle checkpoint genes san-1, mdf-2, bub-3 and the CENP-F homologues hcp-1 and hcp-2 in Caenorhabditis elegans. Cell Div. 2008;3: 6. 10.1186/1747-1028-3-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Grabowski MM, Svrzikapa N, Tissenbaum HA. Bloom syndrome ortholog HIM-6 maintains genomic stability in C. elegans. Mech Ageing Dev. 2005;126: 1314–1321. 10.1016/j.mad.2005.08.005 [DOI] [PubMed] [Google Scholar]
  • 93.Adamo A, Montemauri P, Silva N, Ward JD, Boulton SJ, La Volpe A. BRC-1 acts in the inter-sister pathway of meiotic double-strand break repair. EMBO Rep. 2008;9: 287–292. 10.1038/sj.embor.7401167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.van Bostelen I, Tijsterman M. Combined loss of three DNA damage response pathways renders C. elegans intolerant to light. DNA Repair. 2017;54: 55–62. 10.1016/j.dnarep.2017.04.002 [DOI] [PubMed] [Google Scholar]
  • 95.Kruisselbrink E, Guryev V, Brouwer K, Pontier DB, Cuppen E, Tijsterman M. Mutagenic capacity of endogenous G4 DNA underlies genome instability in FANCJ-defective C. elegans. Curr Biol. 2008;18: 900–905. 10.1016/j.cub.2008.05.013 [DOI] [PubMed] [Google Scholar]
  • 96.Marini F, Wood RD. A human DNA helicase homologous to the DNA cross-link sensitivity protein Mus308. J Biol Chem. 2002;277: 8716–8723. 10.1074/jbc.M110271200 [DOI] [PubMed] [Google Scholar]
  • 97.Quinet A, Lemaçon D, Vindigni A. Replication Fork Reversal: Players and Guardians. Mol Cell. 2017;68: 830–833. 10.1016/j.molcel.2017.11.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Strome S. Specification of the germ line. WormBook. 2005; 1–10. 10.1895/wormbook.1.9.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Kim S-H, Michael WM. Regulated proteolysis of DNA polymerase eta during the DNA-damage response in C. elegans. Mol Cell. 2008;32: 757–766. 10.1016/j.molcel.2008.11.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Roerink SF, Koole W, Carine Stapel L, Romeijn RJ, Tijsterman M. A Broad Requirement for TLS Polymerases η and κ, and Interacting Sumoylation and Nuclear Pore Proteins, in Lesion Bypass during C. elegans Embryogenesis. PLoS Genetics. 2012. p. e1002800. 10.1371/journal.pgen.1002800 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Wong MM, Belew MD, Kwieraga A, Nhan JD, Michael WM. Programmed DNA Breaks Activate the Germline Genome in Caenorhabditis elegans. Dev Cell. 2018;46: 302–315.e5. 10.1016/j.devcel.2018.07.002 [DOI] [PubMed] [Google Scholar]
  • 102.Ou H-L, Kim CS, Uszkoreit S, Wickström SA, Schumacher B. Somatic Niche Cells Regulate the CEP-1/p53-Mediated DNA Damage Response in Primordial Germ Cells. Dev Cell. 2019;50: 167–183.e8. 10.1016/j.devcel.2019.06.012 [DOI] [PubMed] [Google Scholar]
  • 103.Saito TT, Lui DY, Kim H-M, Meyer K, Colaiácovo MP. Interplay between structure-specific endonucleases for crossover control during Caenorhabditis elegans meiosis. PLoS Genet. 2013;9: e1003586. 10.1371/journal.pgen.1003586 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.O’Neil NJ, Martin JS, Youds JL, Ward JD, Petalcorin MIR, Rose AM, et al. Joint molecule resolution requires the redundant activities of MUS-81 and XPF-1 during Caenorhabditis elegans meiosis. PLoS Genet. 2013;9: e1003582. 10.1371/journal.pgen.1003582 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28: i333–i339. 10.1093/bioinformatics/bts378 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Ye K, Schulz MH, Long Q, Apweiler R, Ning Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics. 2009;25: 2865–2871. 10.1093/bioinformatics/btp394 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Nik-Zainal S, Alexandrov LB, Wedge DC, Van Loo P, Greenman CD, Raine K, et al. Mutational processes molding the genomes of 21 breast cancers. Cell. 2012;149: 979–993. 10.1016/j.cell.2012.04.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25: 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Neal RM, Others. MCMC using Hamiltonian dynamics. Handbook of Markov Chain Monte Carlo. 2011;2: 2. [Google Scholar]
  • 110.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological). 1995. pp. 289–300. 10.1111/j.2517-6161.1995.tb02031.x [DOI] [Google Scholar]
  • 111.Bao W, Kojima KK, Kohany O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob DNA. 2015;6: 11. 10.1186/s13100-015-0041-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Pourkarimi E, Bellush JM, Whitehouse I. Spatiotemporal coupling and decoupling of gene transcription with DNA replication origins during embryogenesis in C. elegans. Elife. 2016;5. 10.7554/eLife.21728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Clogg CC, Petkova E, Haritou A. Statistical Methods for Comparing Regression Coefficients Between Models. American Journal of Sociology. 1995. pp. 1261–1293. 10.1086/230638 [DOI] [Google Scholar]

Decision Letter 0

Arthur J Lustig

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

27 Jan 2021

PONE-D-20-39672

Protection of the C. elegans germ cell genome depends on diverse DNA repair pathways during normal proliferation

PLOS ONE

Dear Dr. Gartner,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

  • The changes indicated below are required:
    • Statistical analysis: Please indicate how the limits of sequencing coverage influence the identification of homozygous vs heterozygous mutations. 
    • Data interpretation: 1.The major challenge is the authors' explanation of the lack of mutagenic effects in NHEJ relative to HR, a result that seems to differ with previous results. This needs to be more clearly described. 2. please discuss the possibility of redundancy within the HR pathways 3. The data consistent with  DNA polymerase slippage may also be explained by sister chromatid recombination.
    • Presentation: Ideally, a better separation of Results and Discussion would benefit the paper.  I understand that a full separation would lead to a lowered comprehension of the extensive data. However, please remove the redundancies present between these two sections. Also please remove any table from the Introduction.
    • Critiques of the four reviewers: Please address all of the issues that are raised by the Reviewers with the exception of those regarding the presentation which is summarized in the previous point. No major rewrite is required.
  • Conflicts between the reviewers: There are no conflicts between the reviewers; just differences in the emphasis of the critique.

  • AE Feedback: My independent reading of the manuscript led to a similar view as the other Reviewers. Of particular importance, I found the explanation for the lack of mutations generated in nhej mutations somewhat lacking. In addition, the results explained on pg. 7 as due to polymerase slippage may  also be explained by sister chromatid exchange.

Please submit your revised manuscript by Mar 11 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Arthur J. Lustig, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1.) Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.) We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere.

"The majority of the primary sequence data described in this manuscript was also used in our recent publication 'Mutational signatures are jointly shaped by DNA damage and repair. Nat Commun. 2020;11(1):2169. doi: 10.1038/s41467-020-15912-7.

Our Nat Commun. publication is largely focused on studying the effects of treatment with DNA damaging agents in wild-type and DNA repair deficient C. elegans (and only in Fig 1C a high level summary of mutagenesis for 6 mutants is shown). In this submitted manuscript (6 main Figures) we focus on describing mutation rates and patterns of wild-type and 61 repair defective lines when propagated over multiple generations without mutagen exposure. Including primary data in the previously published manuscript was necessary to determine a baseline of DNA damage-induced mutagenesis."

Please clarify whether this publication was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

3.) Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript from Meier and colleagues describes a systematic DNA sequencing analysis of 61 DNA repair mutants maintained in a laboratory setting. The paper follows up on earlier work describing the effects of various mutagens on DNA sequence in the wildtype genome. Here, the authors report the effects of loss of various DNA repair components on the accumulation of different types of DNA damage under mild laboratory conditions where individuals are not subject to mutagenic agents. The study is a very useful catalog of observed sequence changes. It provides information on the frequency and distribution of various mutation types. Interestingly, loss of some DNA repair factors did not impact the mutation rate in the laboratory setting and others correlated with relatively mild increases in mutation rate. The section of the Discussion dealing with the likelihood of mutations arising during meiosis is an important addition. Overall, the methods are thorough and the data are likely to be very useful for researchers studying DNA repair.

Minor comments

Page 3, line 4: substitute “difficult” for “testing” so the sentence reads “Measuring mutation rates in organisms not challenged with mutagens is difficult given the low numbers of mutations.”

Page 5, line 2 from the bottom: substitute a comma for the period after labour-intensive, so the sentence reads “As large-scale investigations of germline mutagenesis are highly time- and labour-intensive, C. elegans offers a suitable system to study mutation accumulation across multiple generations and genetic backgrounds based on its short life-cycle and its ability to self-fertilize.”

Page 7, paragraph 2, line 2: Remove 2nd “across” so the sentence reads “Across 61 C. elegans DNA repair deficient mutants ([10–12], Supplementary Table S1), the median mutation rate was close to that observed in wild-type:…”

Page 11, lines 17-21: The wording of this sentence is awkward so that the meaning is not clear. To clarify, maybe add “including” so the sentence reads: “We deduced that these events involved templated insertions of 200-4000 bp sequences, which showed strong homology to multiple genomic regions, including to one such homeologous region in cis located as far as 275 kb away from the donor sequence on the same chromosome, accompanied by a deletion of several hundred basepairs at the homeologous acceptor site (Supplementary Figure S7).”

Page 19, line 7: It’s confusing to refer to the “latter” in a list of more than two things. Substitute “last” for “latter” so the sentence reads: “Moreover, in ATM deficient yeasts, C.

elegans and Drosophila, the last depending on retrotransposon transposition rather

than telomerase activity for telomere maintenance, chromosome fusions have been

observed cytologically or through sequencing of PCR products across chromosomes

[71–76].”

Page 22, 1st paragraph of Discussion, and elsewhere in the text: “Data” is a plural word, e.g., “Our data provide a comprehensive picture…”

Reviewer #2: The manuscript by Meier et al describes in detailed the natural mutation signature of 60 mutants in various DNA repair pathways. This work is based on the study some of these authors published last year (Volkova et al, ref 10). The current study expanded the analysis from 53 to 61 genetic backgrounds and performed new analysis that was not present in Volkova et al. The detailed analysis provides interesting insights about mutagenesis in an organism under natural, non-mutagenic conditions. Despite the fact that some of the analysis for most of these mutants was done in other studies, there is an important value in doing a large-scale genomic comprehensive study with the same methodology in one publication. Therefore, I believe this analysis will have significant importance to the field. Below are some suggestions for improving the text and figures.

• Abstract. The transition from the first to the second sentence is very abrupt.

• Author Summary. I did not understand this sentence: “Measuring mutation rates in organisms not challenged with mutagens is testing, given low numbers of mutations”.

• Introduction: please remove the reference for data (Sup table 1) and figures form the introduction (it better placed in the results).

• Results: It will be helpful to have more information about how the assay is performed in the beginning of the results section. Although this is based on previously described methodology (ref 10), it will be better to explain this here as well (to the reader who is not familiar with these studies and/or C. elegans). The 2 sentences in the intro (page 6) discussing 1a starting with “In mutation accumulation experiments”, can be moved to the “results”. In addition, 1a referees to 20/40 generations, but some of the mutants were analyzed with less (1 or 5 generations). How mutation accumulation studies done for strains that show embryonic lethality (e.g., smc-5, smc-6) should be described earlier in the text, when the assays are first described. It will be helpful to describe how the 61 DNA repair deficient mutants were selected for the study- what were the criteria?

• Figure 1C- while point mutation are found at similar levels in wild type T>A (with A and T 5’ and 3’ bases) seem to be enriched (to similar levels as 1bp insertions), I don’t think that was reported in other studies- Is this not significant?

• SVs and SNV are defined twice (page 5 and 7)

• Page 9 “showed increased mutation rates without overt changes in mutational signatures…” based on mutation types underlined in figure 3A, it does look like there is mutation signature for base substitutions (with slight variation between mutants in the same pathway) that requires correction and explanation of why this specific pattern

• Page 9. The sentence stating with “comparison between…” is too long and requires more explaining (“COSMIC signature” SBS8…SBS5) for clarity.

• There are several places in the results where it seems would better be moved to the discussion. For example: Page 12 “given the role…” Page 14 the 2 sentences stating with “Clusters of mutations may arise through…” page 17-18 the whole paragraph starting with “At present, we can only speculate how these tandem...”.

• Page 14- “In addition, the NHEJ or MMEJ error-prone DSB repair pathways can also generate clustered mutations when DNA strands with incompatible ends are joined together” yet cku-80 and lig-4 did now show decrease in mutation (or insertions)- please discuss- how come?

• Figure 4C why the difference between smc-5 and smc-6 mutants in deletion size obtained when they are part of the same complex?

• In the text it is stated (page 10): “MMEJ component polq-1, did not produce significant and reliable changes in mutation rates compared to wild-type” and also in figure S4 polq-1 is not significant- while in figure 2 and it is S10 is it significantly different from wild type- is it or is it not significant?

• Page 22 “brc-1/Brca1 and rad-51 paralog mutants show elevated mutagenesis across all types of mutations” bases on 2A it’s not all but most types of mutations.

• It is not clear how HR mutants would lead to deletions, which may be done better in the discussion. With mus-81/slx-1 how D- loop falling apart would lead to loss of genetic material? It makes sense that when HR fails other repair pathways (SSA and MMEJ?) act leading to large deletions but I don’t think it’s explicitly said.

• Discussion and results page 11 and 14: it is not clear why when HR is compromised, base

substitutions are increased and why not all but only particular ones (why specific signature)? Is the signature related to specific TLS polymerases (which ones can be candidates)? Why do TLS polymerases act when HR is compromised or overactive (is this due to aberrant HR or the activation of an alternative repair pathway)?

• Discussion: Page 23- “Thus, the absence of significant mutagenic effects …underpins a high level of redundancy among different DNA repair pathways...” For HR, the mutants that were examined are in genes that only partially required for HR (understandably, since removing HR leads to embryonic lethality). Therefore, HR may be not redundant with other pathways, but the role of HR just cannot be examined. The data may reflect redundancy within HR, not between HR and other pathways.

• Discussion page 25 “It is tempting to speculate that these proteins may not have a

role in HR pathways directly linked with DNA replication” this sentence is not clear to me. Why “tempting to speculate” when a role for most of these proteins in meiotic HR is established?

• Page 26 “Finally, it is reasonable to suggest that many lesions we observe to accumulate in our transgenerational set-up occur in haploid germ cells, especially during meiosis. Indeed, it appears plausible that many SVs might be associated with meiotic recombination.” These 2 sentences don’t seem do lead from one to the other… Germ cells are haploid only after meiotic recombination, for a relatively short period (since recombination occur in meiotic prophase I when the divisions didn’t yet happen). Are these 2 different ideas or should the word “haploid” be removed?

Figures and tables

• Fig 1B- I don’t see a gray line on the diagram (SVs). If this not present in these samples should it be removed from the key to avoid confusion?

• Fig 1C- it’s very hard to see the histograms. I suggest adjusting the y axis to 0.2 max

• Fig 2- 95% CI intervals are truncated for SV plot, maybe extend Y axis further (<0.005?)

• Fig 2- since DSBR is used also to name HR sub pathway, it is confusing to include NHEJ proteins under “DSBR” on the labeling on the top and I suggest labeling cku-80 and lig-4 separately as cNHEJ and use “HR” instead “DSBR” for the others.

• In all the figures that represent mutation spectrum (like 2A) ¬¬- the authors used bars underlining what part of the comparison is significant. However, the sizes of the bars are not consistent and sometimes hard to distinguish from non-significant marking. Sometimes the gray bars are invisible.

• S2- what are the pink and orange boxes in partp-1 and parp-2 respectively?

• S7- italics for gene names

• S10 and S14- X axis is not clear, letter on top of each other.

• Table 1- Key for “DNA damage response pathway”, why “Helicase, TR” in green?

Reviewer #3: 6th Jan 2021

The authors systematically catalogued the mutational characteristics of DNA repair

deficiencies across the C. elegans DNA repair and damage response

pathways in animals that were cultivated to 40 generations.

They provide new information on the contributions of different DNA repair pathways towards mutagenesis that was not previously known. For example, they showed that

the DOG-1/FANCJ helicase has a unique function to unwind G-quadruplex forming sequences to prevent mutations. Likewise, they showed that the ATM-1 checkpoint kinase perform a specific role in avoiding mutations in the sub-telomeric repeats from

DSBs. The authors should be commended for a valuable and elaborate study.

Comment:

The article is excessively long (result section in particular) and the authors could shortened it to enhance readership by taking the gene deletion mutant that showed no effect and place these in a Table in the text. In addition, they could give a brief description of the gene function. Reading one section to the next gives the same pattern of information, except for focusing on a different pathway, which makes reading the results section somewhat redundant and less appealing.

Reviewer #4: The study by Meier et al. reports the mutational profile of 61 C. elegans DNA repair mutants propagated 40 generations in the absence of exogenous stress. The article is very dense but in general clearly presented.

The authors find that across DNA repair deficient mutants, the median mutation rate is close to wild-type. However, 42 out of the 61 DNA repair deficient strains analyzed displayed mutation rates significantly different from wild-type in at least one mutation class (base substitutions, indels or stuctural variants). Characterization of the mutational signatures over generations in DNA repair pathways deficient in DR, BER, and NER did not show large changes in the overall mutation spectra, but several small differences in particular mutation types. Inactivation of the core components of NHEJ did not produce significant changes in mutation rates compared to wild-type, while mutants in homologous recombination (HR) showed either increased mutations across all mutation classes, or primarily accumulated structural variants. The discussion speculated of possible mechanisms explaining the specific mutational signature observed, and relate these to the biology of C. elegans.

Collectively the data presented constitutes a useful resource for future studies on how conserved DNA repair pathways contribute to preserving germline genome integrity and is appropriate for publication in PloS One.

Specific comments:

1. Please specify from what stage DNA was isolated. Is is young adults?

2. The 10X coverage in these experients is on the low end on the scale, and while adequate for calling homozygous variants, may not be not be enough to accurately call structural variants or heterozygous mutations. The authors could mention this caveat somehere. Nonetheless this data provides may surve as a basis for future, more in depth studies.

3. Fig 1C: Single nucleotide variants are shown in the 'context of their 5’ and 3’ base'.

can authors please reformulate? 5' and 3' with respect to what?

General comments:

The study relies heavily on analysis of genome sequencing data. Methods are described in detail but this reviewer does not have the competences to judge whether the statistical methods and other analyses were adequately performed. All data is available in supplementary tables or from appropriate databases using provided accession numbers.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Dindial Ramotar

Reviewer #4: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 27;16(4):e0250291. doi: 10.1371/journal.pone.0250291.r002

Author response to Decision Letter 0


3 Mar 2021

Response to reviewers:

We think that is fair to say that all 4 reviewers strongly supported our manuscript and we are grateful for the comments that helped improve our manuscript (see our point by point response).

Following your and the reviewers guidance we proofread the manuscript again and think that it is now easier to read. We incorporated your and the reviewers comments as described in our point-by-point response appended below. While we still describe our results, pathway by pathway, we moved the more detailed discussion to the discussion section following your and the reviewers advice. We also formatted the manuscript according to the PLOS ONE format guidelines (9efad3c44db3a7f319fbf03644b95f8f986e8e6).

Two important points:

A) Reviewer 2 pointed out that our data indicates that ‘..fcd-2/FANCD2 and fnci-1/FANCI ICL repair defective lines, and polq-1 polymerase theta/POLQ (microhomology mediated end-joining) show reduced rates of indels and single nucleotide variants (Polq).

However we stand by our statement interpreting these data which are described in Figure 2 as follows

‘Several DNA repair mutants, namely fcd-2/FANCD2 and fnci-1/FANCI DNA ICL-repair defective lines, and the microhomology mediated end-joining (MMEJ) defective polq-1/POLQ mutant exhibited reduced indel rates (Fig 2, green dots) Additionally, polq-1 mutants harboured reduced SNVs (Fig 2, red dot). POLQ-1 dependent MMEJ is an error prone pathway, in which resected 3’ single-stranded overhangs pair at their complementary terminal nucleotide(s) to prime DNA synthesis, often leading to small deletions [15–18]. However, given that indel and SV mutation rates in our dataset are already low in wild-type and more wild-type than mutant samples were included in the analysis (wild-type n = 30, mutant n = 4-8) the sample variance in genotypes with mutation rates close or lower to wild-type may be underestimated. We therefore caution that the observed reductions in mutagenesis levels are likely to be false discoveries. ‘

We hope that the changes we have made to this paragraph clarifies our reasoning.

B) Reviewer 4 asked if 10 time sequence coverage allows for accurately calling structural variants and heterozygous mutations.

We apologise for mistakenly stating that the sequencing coverage was only 10 fold. Indeed, sequencing coverage across our analysis was around 50 fold. We have corrected our mistake and improved the methods sections. Additionally we now refer to the detailed methods collection which we published as part of our previous study Ref. 10, Volkova et al., Nat Commun. 2020;11: 2169, and which also provides a detailed description of the bioinformatics pipeline.

Finally responding to the note from PLOS ONE that ‘We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere and the request to ‘Please clarify whether this publication was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript’

As we indicated in our initial submission we largely use the same primary data as used in a previously published and reviewed paper Nat Commun. 2020;11(1):2169. doi: 10.1038/s41467-020-15912-7. It is common in the area of bioinformatics that primary source data are used in new publications as we did. Our previous Nat Commun. publication is largely focused on studying the effects of treatment with DNA damaging agents in wild-type and DNA repair deficient C. elegans. In the previous paper only Fig 1C provides a high level summary of background mutagenesis (6 mutants are shown). In our PLOS ONE manuscript we describe in detail the mutations arising in wild-type and 61 repair mutants when propagated over multiple generations without mutagen exposure. This description comprises 6 Figures and 14 Supplementary Figures describing in detail mutation rates and patterns of wild-type and 61 repair defective lines when propagated over multiple generations. In addition, we describe sequencing data from 5 double mutants. All 4 reviewers support our PLOS ONE publication supporting our case that the in-depth analysis of effects of DNA repair deficiency analysis over generations presented here exceeds our previous work published in Nat Commun.

Below please find a point-by point-response to the reviewers and the associate editor. We upload two text files, one where changes we made are indicated, the other one, where these changes are accepted.

Best regards,

Anton Gartner and Moritz Gerstung

5. Review Comments to the Author

Reviewer #1: The manuscript from Meier and colleagues describes a systematic DNA sequencing analysis of 61 DNA repair mutants maintained in a laboratory setting. The paper follows up on earlier work describing the effects of various mutagens on DNA sequence in the wildtype genome. Here, the authors report the effects of loss of various DNA repair components on the accumulation of different types of DNA damage under mild laboratory conditions where individuals are not subject to mutagenic agents. The study is a very useful catalog of observed sequence changes. It provides information on the frequency and distribution of various mutation types. Interestingly, loss of some DNA repair factors did not impact the mutation rate in the laboratory setting and others correlated with relatively mild increases in mutation rate. The section of the Discussion dealing with the likelihood of mutations arising during meiosis is an important addition. Overall, the methods are thorough and the data are likely to be very useful for researchers studying DNA repair.

Minor comments

Page 3, line 4: substitute “difficult” for “testing” so the sentence reads “Measuring mutation rates in organisms not challenged with mutagens is difficult given the low numbers of mutations.”

thanks, done

Page 5, line 2 from the bottom: substitute a comma for the period after labour-intensive, so the sentence reads “As large-scale investigations of germline mutagenesis are highly time- and labour-intensive, C. elegans offers a suitable system to study mutation accumulation across multiple generations and genetic backgrounds based on its short life-cycle and its ability to self-fertilize.

thanks, done

Page 7, paragraph 2, line 2: Remove 2nd “across” so the sentence reads “Across 61 C. elegans DNA repair deficient mutants ([10–12], Supplementary Table S1), the median mutation rate was close to that observed in wild-type:…”thanks, done

Page 11, lines 17-21: The wording of this sentence is awkward so that the meaning is not clear. To clarify, maybe add “including” so the sentence reads: “We deduced that these events involved templated insertions of 200-4000 bp sequences, which showed strong homology to multiple genomic regions, including to one such homeologous region in cis located as far as 275 kb away from the donor sequence on the same chromosome, accompanied by a deletion of several hundred basepairs at the homeologous acceptor site (Supplementary Figure S7).”

thanks, done

Page 19, line 7: It’s confusing to refer to the “latter” in a list of more than two things. Substitute “last” for “latter” so the sentence reads: “Moreover, in ATM deficient yeasts, C.

elegans and Drosophila, the last depending on retrotransposon transposition rather

than telomerase activity for telomere maintenance, chromosome fusions have been

observed cytologically or through sequencing of PCR products across chromosomes

[71–76].

thanks, done

Page 22, 1st paragraph of Discussion, and elsewhere in the text: “Data” is a plural word, e.g., “Our data provide a comprehensive picture…”

thanks, done throughout the manuscript.

Reviewer #2: The manuscript by Meier et al describes in detailed the natural mutation signature of 60 mutants in various DNA repair pathways. This work is based on the study some of these authors published last year (Volkova et al, ref 10). The current study expanded the analysis from 53 to 61 genetic backgrounds and performed new analysis that was not present in Volkova et al. The detailed analysis provides interesting insights about mutagenesis in an organism under natural, non-mutagenic conditions. Despite the fact that some of the analysis for most of these mutants was done in other studies, there is an important value in doing a large-scale genomic comprehensive study with the same methodology in one publication. Therefore, I believe this analysis will have significant importance to the field. Below are some suggestions for improving the text and figures.

• Abstract. The transition from the first to the second sentence is very abrupt.

Thanks, we took on this point and revised the beginning of the abstract which now reads as follows.

‘Maintaining genome integrity is particularly important in germ cells to ensure faithful transmission of genetic information across generations. Here we systematically describe germ cell mutagenesis in wild-type and 61 DNA repair mutants cultivated over multiple generations. ~44% of the DNA repair mutants analysed showed a >2-fold increased mutagenesis with a broad spectrum of mutational outcomes.’

• Author Summary. I did not understand this sentence: “Measuring mutation rates in organisms not challenged with mutagens is testing, given low numbers of mutations”.

Thanks, we changed this sentence to ‘Measuring mutation rates in organisms not challenged with mutagens is difficult, given low numbers of mutations’

• Introduction: please remove the reference for data (Sup table 1) and figures form the introduction (it better placed in the results).

done

• Results: It will be helpful to have more information about how the assay is performed in the beginning of the results section. Although this is based on previously described methodology (ref 10), it will be better to explain this here as well (to the reader who is not familiar with these studies and/or C. elegans). The 2 sentences in the intro (page 6) discussing 1a starting with “In mutation accumulation experiments”, can be moved to the “results”. In addition, 1a referees to 20/40 generations, but some of the mutants were analyzed with less (1 or 5 generations). How mutation accumulation studies done for strains that show embryonic lethality (e.g., smc-5, smc-6) should be described earlier in the text, when the assays are first described. It will be helpful to describe how the 61 DNA repair deficient mutants were selected for the study- what were the criteria?

Thanks: We have restructured the end of the introduction and the beginning of the result section to better describe our methodology upfront. This now better guides the non-expert reader and together with Figure 1 the methodology is made clear up front. Also, we improved the relevant methods section, and in addition better refer to our detailed methods section provided in Ref 10 (Volkova, Meier et al., 2020). We now also, both in the relevant results as well as in the methods section, refer to Suppl Table 1 where the number of lines sequenced for each strain is listed, as is the number of generations. In the methods section we now also state that rtel-1, smc-5 andsmc-6 mutants were only grown for 5 generations given that many of the lines grown in parallel had already become sterile at generation 5.

• Figure 1C- while point mutation are found at similar levels in wild type T>A (with A and T 5’ and 3’ bases) seem to be enriched (to similar levels as 1bp insertions), I don’t think that was reported in other studies- Is this not significant?

Thanks, we have added the information that T>A changes with A and T 5’ and 3’ bases represent the most abundant SNVs. The relevant section now reads ‘Mutations were equally distributed across the wild-type genome with no evidence of clustering (Figure 1B). The most frequent mutations were a) single base insertions, with T>A changes in the context of a 5’A and a 3’ T and b) deletions in homopolymeric sequences (Figure 1C).’

• SVs and SNV are defined twice (page 5 and 7)

Thanks, we have removed these duplications.

• Page 9 “showed increased mutation rates without overt changes in mutational signatures…” based on mutation types underlined in figure 3A, it does look like there is mutation signature for base substitutions (with slight variation between mutants in the same pathway) that requires correction and explanation of why this specific pattern

We want to be cautious in this case, focusing on where data is firm. Changes in signatures across NER mutants do not appear as uniform enough to make strong statements. This is why we say ‘without overt changes’. In other words, while our data clearly supports increased mutagenesis in NER defective lines, working our signatures changes associated with different NER repair mutants would need analyzing a larger number of analysed worm lines.

• Page 9. The sentence stating with “comparison between…” is too long and requires more explaining (“COSMIC signature” SBS8…SBS5) for clarity.

Thanks, we reworded this section which is now easier to read.

• There are several places in the results where it seems would better be moved to the discussion. For example: Page 12 “given the role…” Page 14 the 2 sentences stating with “Clusters of mutations may arise through…” page 17-18 the whole paragraph starting with “At present, we can only speculate how these tandem...”.

OK, we relooked at both sections, and would like to make the case for leaving the structure of the narrative as is in two cases Page 12 and 14. The tricky thing about our paper is us describing many genotypes, each requiring an introduction. This is why we ordered the results section pathway per pathway, in each case providing a short introduction and at the end a short discussion (eg introduction-experiment-conclusion). We think that this format is more readable. Concerning our discussion page 17-18 relating to how tandem duplications might arise in helq mutants we agree and shifted this entire section to the discussion.

• Page 14- “In addition, the NHEJ or MMEJ error-prone DSB repair pathways can also generate clustered mutations when DNA strands with incompatible ends are joined together” yet cku-80 and lig-4 did now show decrease in mutation (or insertions)- please discuss- how come?

Thanks. We added the following speculative sentence, “The absence of clustered mutations in NHEJ or MMEJ defective lines could be explained by the action of redundant, error-free HR pathways.”

• Figure 4C why the difference between smc-5 and smc-6 mutants in deletion size obtained when they are part of the same complex?

The plots in Figure 4C lower panel clearly show that average deletion size observed in smc-5 and smc-6 is similar. There is a difference in the size of tandem duplication (TD), however the number of TDs observed is small.

• In the text it is stated (page 10): “MMEJ component polq-1, did not produce significant and reliable changes in mutation rates compared to wild-type” and also in figure S4 polq-1 is not significant- while in figure 2 and it is S10 is it significantly different from wild type- is it or is it not significant?

Thanks, we now state that we observed a reduced number of SNV and indels in polq-1 mutants. We, however, did not observe a signature change (S10). However as explained to the editor our reasoning (incorporated into the manuscript) is as follows:

‘Several DNA repair mutants, namely fcd-2/FANCD2 and fnci-1/FANCI DNA ICL-repair defective lines, and the microhomology mediated end-joining (MMEJ) defective polq-1/POLQ mutant exhibited reduced indel rates (Figure 2, green dots) Additionally, polq-1 mutants harboured reduced SNVs (Figure 2, red dot). POLQ-1 dependent MMEJ is an error prone pathway, where resected 3’ single-stranded overhangs pair at their complementary terminal nucleotide(s) to prime DNA synthesis, often leading to small deletions. [15–18]. However, given that indel and SV mutation rates in our dataset are already low in wild-type and more wild-type samples were included in the analysis (wild-type n = 30, mutant n = 4-8) the sample variance in genotype showing mutation rates close or lower to wild-type, may be underestimated. We therefore caution that the observed reductions in mutagenesis levels are likely to be false discoveries. ‘

• Page 22 “brc-1/Brca1 and rad-51 paralog mutants show elevated mutagenesis across all types of mutations” bases on 2A it’s not all but most types of mutations.

Thanks, we changed to ‘most’

• It is not clear how HR mutants would lead to deletions, which may be done better in the discussion. With mus-81/slx-1 how D- loop falling apart would lead to loss of genetic material? It makes sense that when HR fails other repair pathways (SSA and MMEJ?) act leading to large deletions but I don’t think it’s explicitly said.

Thanks, we added the following sentence at the end of the HR section in the discussion.

‘An increased number of deletions in HR mutants might arise from the error prone activity of redundant SSA and MMEJ repair pathways

• Discussion and results page 11 and 14: it is not clear why when HR is compromised, base

substitutions are increased and why not all but only particular ones (why specific signature)? Is the signature related to specific TLS polymerases (which ones can be candidates)? Why do TLS polymerases act when HR is compromised or overactive (is this due to aberrant HR or the activation of an alternative repair pathway)?

Of course, we would like to know the exact answer to this question…

We speculate in the discussion as follows. ….We suspect that increased point mutations might be a scar indicative of error prone translesion synthesis, necessary when damaged bases are neither repaired by BER and NER nor by replication fork reversal which is linked to recombinational repair [95]. Point mutations and small deletions also occur when HR is replaced by more error-prone NHEJ or MMEJ pathways, the latter being associated with the occurrence of small deletions in human BRCA1 mutants [27]. Conversely, deficiencies of other HR proteins, like SLX-1 and MUS-81, and helicases including HIM-6, RTEL-1 and HELQ-1, are associated with a specific increase of SVs. It is tempting to speculate that these proteins may not have a role in HR pathways directly linked with DNA replication (see below). An increased number of deletions in HR mutants might arise from the error prone activity of redundant single strand annealing and MMEJ repair pathways.

• Discussion: Page 23- “Thus, the absence of significant mutagenic effects …underpins a high level of redundancy among different DNA repair pathways...” For HR, the mutants that were examined are in genes that, only partially required for HR (understandably, since removing HR leads to embryonic lethality). Therefore, HR may be not redundant with other pathways, but the role of HR just cannot be examined. The data may reflect redundancy within HR, not between HR and other pathways.

Thanks, we reflect this concern and now added ‘within and’ such the sentence reads ….high level of redundancy within and among different DNA repair pathways... ‘

• Discussion page 25 “It is tempting to speculate that these proteins may not have a

role in HR pathways directly linked with DNA replication” this sentence is not clear to me. Why “tempting to speculate” when a role for most of these proteins in meiotic HR is established?

Thanks, we changed to ..’we speculate…’. RT

• Page 26 “Finally, it is reasonable to suggest that many lesions we observe to accumulate in our transgenerational set-up occur in haploid germ cells, especially during meiosis. Indeed, it appears plausible that many SVs might be associated with meiotic recombination.” These 2 sentences don’t seem do lead from one to the other… Germ cells are haploid only after meiotic recombination, for a relatively short period (since recombination occur in meiotic prophase I when the divisions didn’t yet happen). Are these 2 different ideas or should the word “haploid” be removed?

Thanks, corrected, we removed the word ‘haploid’

Figures and tables

• Fig 1B- I don’t see a gray line on the diagram (SVs). If this not present in these samples should it be removed from the key to avoid confusion?

done

• Fig 1C- it’s very hard to see the histograms. I suggest adjusting the y axis to 0.2 max

thanks, done.

• Fig 2- 95% CI intervals are truncated for SV plot, maybe extend Y axis further (<0.005?)

thanks: CI are truncated because they extend to 0, which is impossible to show on a log scale blot. We added a sentence to mention this in the Figure legend reding. ‘All CIs which extend below the lower edge of the plot have zero as their lower border’

• Fig 2- since DSBR is used also to name HR sub pathway, it is confusing to include NHEJ proteins under “DSBR” on the labeling on the top and I suggest labeling cku-80 and lig-4 separately as cNHEJ and use “HR” instead “DSBR” for the others.

Here we would like to keep organisation as is. NHEJ and all HR subpathways fall into DSBR (double strand break repair). cku-80 and lig-4 are next to each other, at the left to the DSBR section.

• In all the figures that represent mutation spectrum (like 2A) ¬¬- the authors used bars underlining what part of the comparison is significant. However, the sizes of the bars are not consistent and sometimes hard to distinguish from non-significant marking. Sometimes the gray bars are invisible.

Thanks, We have double checked all main and Suppl Figures. The bars are clearly visible when PDF files are opened.

• S2- what are the pink and orange boxes in partp-1 and parp-2 respectively?

thanks, we have fixed this ‘The pink and orange shaded regions in parp-1 and parp-2 samples, respectively, indicate the location and extent of observed structural variants (pink - TD (tandem duplication); orange - Fold (foldback duplication)).’

• S7- italics for gene names

thanks, this is fixed.

• S10 and S14- X axis is not clear, letter on top of each other.

thanks, this is fixed.

• Table 1- Key for “DNA damage response pathway”, why “Helicase, TR” in green?

thanks, this is fixed.

Reviewer #3: 6th Jan 2021

The authors systematically catalogued the mutational characteristics of DNA repair

deficiencies across the C. elegans DNA repair and damage response

pathways in animals that were cultivated to 40 generations.

They provide new information on the contributions of different DNA repair pathways towards mutagenesis that was not previously known. For example, they showed that

the DOG-1/FANCJ helicase has a unique function to unwind G-quadruplex forming sequences to prevent mutations. Likewise, they showed that the ATM-1 checkpoint kinase perform a specific role in avoiding mutations in the sub-telomeric repeats from

DSBs. The authors should be commended for a valuable and elaborate study.

Comment:

The article is excessively long (result section in particular) and the authors could shortened it to enhance readership by taking the gene deletion mutant that showed no effect and place these in a Table in the text. In addition, they could give a brief description of the gene function. Reading one section to the next gives the same pattern of information, except for focusing on a different pathway, which makes reading the results section somewhat redundant and less appealing.

Thanks, as detailed in the response to the above reviewers we now provide a better introduction of the experimental system to non-expert readers, we improved the clarity of our wording in many instances, and also shifted the discussion of how TD may arise in Hel-Q mutants to the discussion section. Overall, we think that our paper now reads much better.

Reviewer #4: The study by Meier et al. reports the mutational profile of 61 C. elegans DNA repair mutants propagated 40 generations in the absence of exogenous stress. The article is very dense but in general clearly presented.

The authors find that across DNA repair deficient mutants, the median mutation rate is close to wild-type. However, 42 out of the 61 DNA repair deficient strains analyzed displayed mutation rates significantly different from wild-type in at least one mutation class (base substitutions, indels or stuctural variants). Characterization of the mutational signatures over generations in DNA repair pathways deficient in DR, BER, and NER did not show large changes in the overall mutation spectra, but several small differences in particular mutation types. Inactivation of the core components of NHEJ did not produce significant changes in mutation rates compared to wild-type, while mutants in homologous recombination (HR) showed either increased mutations across all mutation classes, or primarily accumulated structural variants. The discussion speculated of possible mechanisms explaining the specific mutational signature observed, and relate these to the biology of C. elegans.

Collectively the data presented constitutes a useful resource for future studies on how conserved DNA repair pathways contribute to preserving germline genome integrity and is appropriate for publication in PloS One.

Specific comments:

1. Please specify from what stage DNA was isolated. Is is young adults?

Genomic DNA was isolated from a mix stage, clonal population (first and second generation) derived from a single ‘final generation” worm. This is now more clearly explained in Materials and Methods.

2. The 10X coverage in these experiments is on the low end of the scale, and while adequate for calling homozygous variants, may not be not be enough to accurately call structural variants or heterozygous mutations. The authors could mention this caveat somewhere. Nonetheless this data provides may surve as a basis for future, more in depth studies.

Thanks for spotting this. Indeed, our mean coverage is 50X. Data acquisition, mutation calling, filtering and classification is described in great detail in our previous paper (ref 10). We rectified our error and better refer to ref 10. We can thus call structural variants or heterozygous mutations with great confidence.

3. Fig 1C: Single nucleotide variants are shown in the 'context of their 5’ and 3’ base'.

can authors please reformulate? 5' and 3' with respect to what?

These are the nucleotides 5’ and 3’ of the SNV.

General comments:

The study relies heavily on analysis of genome sequencing data. Methods are described in detail but this reviewer does not have the competences to judge whether the statistical methods and other analyses were adequately performed. All data is available in supplementary tables or from appropriate databases using provided accession numbers.

AE Feedback: My independent reading of the manuscript led to a similar view as the other Reviewers. Of particular importance, I found the explanation for the lack of mutations generated in nhej mutations somewhat lacking. In addition, the results explained on pg. 7 as due to polymerase slippage may also be explained by sister chromatid exchange.

We have included a sentence explaining not observing increased mutagenesis end joining mutants might be because of end joining in C. elegans predominantly acting in somatic cells and not in germ cells, a notion supported by previous reports we cite. Of course, in the absence of end-joining HR may take over, a possibility we now also discuss. For discussing the genesis of indels in homopolymeric repeat sequences we refer to our previous in-depth analysis of MMR defective stains which favours a slippage model [11].

Attachment

Submitted filename: Response to Reviewers (2021.03.04) (1).docx

Decision Letter 1

Arthur J Lustig

25 Mar 2021

PONE-D-20-39672R1

Protection of the C. elegans germ cell genome depends on diverse DNA repair pathways during normal proliferation

PLOS ONE

Dear Dr. Gartner,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during 

Only one trivial change. Please delete the worries indicated be Reviewet 4 and resubmittedments here and delete this placeholder text when finished. Be sure 

Please submit your revised manuscript by May 09 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Arthur J. Lustig, PhD

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: The authors appropriately addressed all my concerns and questions and I have no further reservations or concerns.

Reviewer #3: The authors have now carefully addressed all concerns and I am satisfy with the revisions. The authors have improved the manuscript.

Reviewer #4: The manuscript has been improved and this reviewers comments were taken into account

the phrase "Our study provides an analysis of 528 whole genome sequencing" appears at bottom of each page and should be deleted

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: Yes: Dindial Ramotar

Reviewer #4: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 27;16(4):e0250291. doi: 10.1371/journal.pone.0250291.r004

Author response to Decision Letter 1


31 Mar 2021

I now uploaded the final manuscript. I removed the footer which reviewer 4 asked us to remove. (NOTE, this change is not indicated in the file that asked me to highlight all changes. Reviewers and the Editors did not ask for any other change

Decision Letter 2

Arthur J Lustig

5 Apr 2021

Protection of the C. elegans germ cell genome depends on diverse DNA repair pathways during normal proliferation

PONE-D-20-39672R2

Dear Dr. Gartner,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Arthur J. Lustig, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Arthur J Lustig

14 Apr 2021

PONE-D-20-39672R2

Protection of the C. elegans germ cell genome depends on diverse DNA repair pathways during normal proliferation

Dear Dr. Gartner:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Arthur J. Lustig

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Summary of mutations acquired in the screen.

    Total number of mutations (substitutions in black, indels in green, and structural variants in purple) per sample across wild-type and DNA repair mutant lines. The black line denotes the median number of mutations for each mutation class across all experiments.

    (EPS)

    S2 Fig. Mutational signatures and distribution of mutations in DR and BER deficient mutants.

    A. Mutational signatures of all tested DR and BER mutants, shown as numbers of mutations per generation. Bold coloured bars below each mutation profile indicate mutation types for which the sum across individual classes is different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type with FDR < 5%. B. Distribution of mutations across chromosomes for DR and BER deficient samples. Only agt-2 shows a significant degree of mutation clustering. The pink and orange shaded regions in parp-1 and parp-2 samples, respectively, indicate the location and extent of observed structural variants (pink—TD (tandem duplication); orange—Fold (foldback duplication)).

    (EPS)

    S3 Fig. Mutational signatures and distribution of mutations in NER deficient mutants.

    A. Mutational signatures of all tested NER mutants displayed as numbers of mutations per generation. Bright bars denote individual mutation classes for which mutation numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Comparison between the humanised version of the combined mutational spectrum across all NER deficient C. elegans samples and COSMIC signatures SBS5 and SBS8 previously associated with NER deficiency (cosine similarity scores 0.64 and 0.56, respectively). No other COSMIC SBS signatures showed significant similarity to the C. elegans NER spectrum.

    (EPS)

    S4 Fig. Mutational signatures and characteristics of mutagenesis in DSBR mutants.

    A. Mutational signatures of all tested DSBR mutants displayed as numbers of mutations per generation. Bright bars denote individual mutation classes for which mutation numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Proportion of SVs and indels with breakpoint microhomology across DSBR mutants. C. Distribution of microhomology sizes at indel and SV breakpoints across DSBR mutants. D. Proportion of SVs with breakpoints in repetitive regions across wild-type and genotypes with elevated SV rates. Dotted lines represent the fraction of variants expected to arise in repetitive regions by chance.

    (EPS)

    S5 Fig. Features of structural variants in wild-type, atm-1, brc-1 brd-1, and helq-1 mutants.

    The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective European Nucleotide Archive (ENA) accession numbers are available in the sample description of S1 Table.

    (EPS)

    S6 Fig. Features of structural variants in wild-type, rip-1, rfs-1, smc-5, and smc-6 HR mutants.

    The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

    (EPS)

    S7 Fig. Translocations in rip-1 mutants.

    Visual representation of the 3 duplications/translocation type events observed in two rip-1 F40 lines. CD numbers reflect the European Nucleotide Archive designation of sequenced samples. Translocations (TRLS) are shown using the wild-type genomic region as reference, nucleotides demarking key features, such as the position of the translocation, the donor sequence and the deletion in the acceptor locus being indicated. In the respective lower panes the level of sequence homology between donor and acceptor loci is indicated. Green arrows indicate tandem duplication-like (TD-like) breakpoints, blue arrows deletion-like (DEL-like) breakpoints.

    (EPS)

    S8 Fig. Features of structural variants in wild-type, mus-81, slx-1, and rtel-1 HR mutants.

    The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

    (EPS)

    S9 Fig. Features of structural variants in wild-type, him-6, wrn-1, and dog-1 HR mutants.

    The number of lines grown for both F20 and/or F40 generations is indicated below each genotype. The presence or absence of repetitive or G4-forming sequences at breakpoints of SVs is indicated by a plus ‘+’ ‘or minus ‘-’, respectively. A single minus sign ‘-’ indicates that neither of the two breakpoints of the SV occurs in the indicated sequence context. ‘+/-’ indicates repetitive or G4-forming sequences at the left but not the right breakpoint (consistent with the direction on the reference genome chromosome), and ‘-/+’ indicates that such sequences only occur at the right breakpoint. The size and nature of SVs is shown on a log scale on the right. SV are classed into interchromosomal rearrangements (INTCHR), large deletions (DEL), tandem duplications (TD), and inversions (INV). Details on CD Sample IDs and their respective ENA accession numbers are available in the sample description of S1 Table.

    (EPS)

    S10 Fig. Mutational signatures and characteristics of mutagenesis in TLS mutants.

    A. Mutational signatures of all tested TLS mutants displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which numbers differ significantly from wild-type. Bold coloured lines below each mutation profile indicate a wider range of mutation types for which total mutation numbers are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%).

    (EPS)

    S11 Fig. Mutational signatures and characteristics of mutagenesis in DNA crosslink repair mutants.

    A. Mutational signatures of tested DNA crosslink repair mutants displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which numbers differ significantly from wild-type. bold coloured bars below each mutation profile indicate a wider range of mutation types for which the total numbers of mutations are different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%). B. Distribution of mutations across 11 samples of F20 dog-1 helicase deficient lines. Over 60% of variants occur in G4-forming regions (large symbols).

    (EPS)

    S12 Fig. Possible mechanism for inducing tandem duplications in loci containing inverted repeat sequences.

    Step-wise representation of a possible mechanism leading to tandem duplications (TDs) in helq-1 mutants. Inverted sequences are depicted by red lines with arrows indicating directionality. Replication forks are shown as splayed structures with lagging strand discontinuity indicated by arrows with dotted lines. Scissors indicate a putative endonuclease cleavage site.

    (EPS)

    S13 Fig. Telomere proximal structural variants observed in F20 generation atm-1 mutants.

    Structural variants in telomere proximal regions are shown for the indicated atm-1 F20 lines (S1 Table) with their chromosomal locations. Individual structural variants are coloured based on their classification as duplications (DUP), deletions (DEL), inversion (INV), and interchromosomal rearrangement (BND, breakends) with vertical lines denoting breakpoint locations and horizontal lines spanning the regions between related breakpoints. Black dots represent the coverage in bins of 50 bp or 100 bp (for regions above 20 kb) adjusted for the coverage in the corresponding control sample and indicate copy number.

    (EPS)

    S14 Fig. Mutational signatures and characteristics of mutagenesis in checkpoint and apoptosis mutants.

    Mutational signatures of the indicated genotypes are displayed in numbers of mutations per generation. Bright bars denote individual mutation classes for which the numbers differ significantly from wild-type. Bold coloured bars below each mutation profile indicate a wider range of mutation types in which the total number of mutations is different from wild-type. Three stars indicate genotypes with significantly different rates of substitutions, indels or SVs compared to wild-type (FDR < 5%).

    (EPS)

    S1 Table. Details of C. elegans strains used in and sequencing samples from this study.

    (XLSX)

    S2 Table. Analysis of tandem duplications in helq-1 mutants and their correlation with inverted repeat sequences.

    (XLSX)

    S1 File

    (DOCX)

    Attachment

    Submitted filename: Response to Reviewers (2021.03.04) (1).docx

    Data Availability Statement

    Sequencing data and variant calling files are available under ENA Study Accession Numbers ERP000975 and ERP004086. ENA sample IDs are annotated in S1 Table. R code for the analysis of mutation rates and signatures as well as mutational clustering is available on GitHub under http://github.com/gerstung-lab/mutationaccumulation.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES