Abstract
Error-corrected Duplex Sequencing (DuplexSeq) enables direct quantification of low-frequency mutations and offers tremendous potential for chemical mutagenicity assessment. We investigated the utility of DuplexSeq to quantify induced mutation frequency (MF) and spectrum in human lymphoblastoid TK6 cells exposed to a prototypical DNA alkylating agent, N-ethyl-N-nitrosourea (ENU). Furthermore, we explored appropriate experimental parameters for this application, and assessed inter-laboratory reproducibility. In two independent experiments in two laboratories, TK6 cells were exposed to ENU (25–200 μM) and DNA was sequenced 48, 72, and 96 h post-exposure. A DuplexSeq mutagenicity panel targeting twenty 2.4-kb regions distributed across the genome was used to sample diverse, genome-representative sequence contexts. A significant increase in MF that was unaffected by time was observed in both laboratories. Concentration-response in the MF from the two laboratories was strongly positively correlated (r=0.97). C:G>T:A, T:A>C:G, T:A>A:T, and T:A>G:C mutations increased in consistent, concentration-dependent manners in both laboratories, with high proportions of C:G>T:A at all time points; the consistent results across the three time points suggest that 48 h may be sufficient for mutation analysis post-exposure. The target sites responded similarly between the two laboratories and revealed a higher average MF in intergenic regions. These results, demonstrating remarkable reproducibility across time and laboratory for both MF and spectrum, support the high value of DuplexSeq for characterizing chemical mutagenicity in both research and regulatory evaluation.
Keywords: N-ethyl-N-nitrosourea, in vitro, error-corrected next-generation sequencing, mutagenesis, mutation spectrum
1. INTRODUCTION
Mutagenicity testing is an integral part of chemical and pharmaceutical toxicity assessment because mutations can cause cancer, age-related conditions, and genetic diseases. The current standard in vitro mutagenicity assays in mammalian cells (e.g., HPRT, XPRT, or Tk+/− gene mutation assays [1,2]) determine the frequency of mutations in specific reporter genes. Together with the Ames test in bacteria [3], these assays have been effective for mutagenic hazard identification and served as gold standards in genetic toxicology for decades. However, it is increasingly recognized that the current standard in vitro assays provide an incomplete picture of the genome-wide effects of mutagens.
The in vitro mammalian cell gene mutation assays are restricted to the analysis of simple mutant frequency in single, sometimes exogenous, reporter genes. They also provide limited information on the specific types of mutations needed to infer mechanisms of mutagenicity without extensive work on individual mutant clones. Importantly, these methodologies are limited to specific in vitro models and cannot readily be integrated with other model systems and assays. Consequently, there is a need for direct, quantitative mutation detection methods for mammalian genomes that readily provide information on mutational spectrum to improve mutagen identification and mechanistic understanding.
Next-generation sequencing (NGS) enables quantitative analysis of mutation spectra and its application in genetic toxicology is becoming more prevalent. NGS can provide information beyond simple mutant frequency by quantifying each type of base substitution and more complex forms of mutation, as well as identifying the local sequence context in which the mutations occur across the organism’s genome [4,5]. However, the high per base pair rate of technical errors (on the order of 1 in 103) of NGS leads to an overestimation of the true mutation frequency (MF) in cells that has hindered its application.
To circumvent this problem, error-corrected sequencing (ecNGS) techniques have been developed to improve accuracy and specificity for detecting true, low frequency mutations [6–8]. The highest resolution of ecNGS methods, such as Duplex Sequencing (DuplexSeq; TwinStrand Biosciences, Seattle, WA), involves labeling both strands of a DNA duplex with one or more forms of a unique molecular index (UMI) that both relates and distinguishes the sequences of the two strands from each other and from those of other molecules. Following sequencing, this information is used to create a consensus sequence that eliminates errors introduced during DNA handling, PCR amplification, and sequencing. DuplexSeq improves the error rate of conventional NGS by approximately 10,000-fold, enabling the detection of a mutation present among 107 bases or more [6,9–11].
The DuplexSeq approach enables the quantification of locus-specific mutation frequency and quantitative measurements of the extent of clonal expansion of mutations when applied in a targeted form using hybrid selection panels [12]. Genome-representative panels have been designed specifically for mutation analysis in human, mouse, or rat genomes for chemical mutagenicity assessment; the panels contain 20 endogenous genomic sites, each 2.4 kbp in length, randomly spread across 20 autosomal chromosomes (TwinStrand DuplexSeq Mutagenesis kits). The mutagenesis panels sample sequences that are collectively representative of the genome in terms of GC-content, genic and intergenic regions, and coding and non-coding sequences [13]. Selective sequencing at high depths makes DuplexSeq practical for higher throughput applications as a broadly accessible mutagenicity assay.
Within the field of genetic toxicology, DuplexSeq with genome-representative hybrid selection panels is an emerging methodology and only a handful of reports have focused on its application to chemical mutagenicity assessment. Valentine et al. [12] and LeBlanc et al. [13] demonstrated that DuplexSeq produced in vivo results that were comparable to, yet more data-rich than, TGR assays. Wang et al. [14] demonstrated its ability to detect a concentration-dependent increase in mutations in a specialized in vitro airway tissue model comprising donor-derived primary human cells exposed for 28 days to the potent alkylating agent ethyl methanesulfonate (EMS). The results thus far demonstrate the added value of DuplexSeq in investigating chemical mutagenicity.
However, the utility, inter-laboratory transferability and reproducibility, and optimal approach to incorporating DuplexSeq into a standardized, high-throughput in vitro mutagenicity assay for chemicals in regulatory testing are as-of-yet unaddressed. Advancing this application will require extensive validation studies benchmarked against traditional mutagenicity assays. Furthermore, it is necessary to investigate how the timeline and concentration selection for in vitro DuplexSeq mutagenicity assays can be integrated with standard genetic toxicology assays, such as the micronucleus (MN) test, to produce a modernized and efficient genotoxicity test paradigm.
The objectives of the work presented here were to: (1) identify the optimal sampling time for mutation analysis in human lymphoblastoid TK6 cells – a widely used in vitro model in regulatory genetic toxicology with a doubling time of approximately 12 – 14 h [15]; (2) evaluate inter-laboratory consistency of DuplexSeq results in this model; and (3) explore the additional information gained by analyzing NGS-based mutational endpoints in the 20 target sites. Toward this, we conducted two independent in vitro time-series and concentration-response analyses of a prototype mutagen using the same DuplexSeq methodologies at two different facilities: the Genomics Laboratory at Health Canada (HC; Ottawa, ON, CA) and Inotiv, Inc. (Inotiv-RTP; Research Triangle Park, North Carolina, US). Both laboratories independently exposed TK6 cells to N-ethyl-N-nitrosourea (ENU), a historical positive control for mutagenicity, and used viability measurements to select the top concentrations. Cells for DuplexSeq were sampled at the same time points over a 96 h period in both laboratories. DuplexSeq libraries were prepared at TwinStrand BioSciences; a replicate set of DNA for one time point was also used to independently prepare libraries at Health Canada. We examined the overall MF induced by ENU at different concentrations and time points, the absolute frequencies and relative proportions of individual base substitution types, and the concentration-response relationships to evaluate assay performance and inter-laboratory concordance in data generation. The DuplexSeq panel also enabled the comparison of MF across the target sites located in different chromosomes, as well as the trinucleotide mutation spectra. In aggregate, our findings indicate that a DuplexSeq-based approach for quantitative in vitro chemical mutagenesis assessment in human cells is feasible, highly reproducible across laboratories, and unlocks previously inaccessible types of genomic data relevant to genotoxicity assessment.
2. MATERIALS AND METHODS
2.1. Cell culture and exposure
Both the HC Genomics Laboratory and Inotiv-RTP independently obtained the human TK6 lymphoblastoid cells from American Type Culture Collection (ATCC) (ATCC# CRL-8015; Manassas, VA). At HC, cells were maintained in T75 flasks at a density between 1×105 and 1×106 cells/mL in RPMI 1640 media (Gibco) supplemented with 10% v/v heat-inactivated horse serum (Gibco; New Zealand origin), 1 mM sodium pyruvate (Gibco), and 2 mM L-glutamine (Gibco), at 37°C in 5% CO2. At Inotiv-RTP, TK6 cells were maintained in RPMI 1640 medium (Gibco) supplemented with 10% v/v heat-inactivated horse serum (Gibco; New Zealand origin), 1% v/v Pluronic F-68™ (Gibco), 0.5 % sodium pyruvate (Gibco), and antibiotics (penicillin at 20 Units/mL and streptomycin at 20 μg/mL) (Gibco), at 37 ± 1 °C in 6 ± 1 % CO2.
ENU (Sigma Aldrich) solutions were prepared in dimethyl sulfoxide (DMSO) (HC: Fisher Scientific; Inotiv-RTP: Sigma Aldrich) immediately prior to exposure at both facilities. At HC, cells were transferred to 6-well plates at a density of approximately 3 × 105 cells/mL with 5 mL of cell suspension in each well. Cells were exposed to a range of ENU concentrations (25, 50, 100, and 150 μM) or an equivalent volume of DMSO (1% v/v) in three replicates (n=3) at 37 °C and 5% CO2. After 24 h, cells were counted and 3 mL of each cell suspension were removed, while the remaining 2 mL of the suspension were transferred to a new 6-well plate. Three mL of RPMI 1640 media (supplement composition as described above) were added to each well to a final volume of 5 mL. Cell counting and sub-culturing procedures were repeated at the 48 h and 72 h time points following the initial exposure. At the 48 h and 72 h time points, 3 mL of cell suspension were collected and pelleted, followed by washing twice in 1x PBS (pH 7.4) before freezing the cell pellets at −80°C. At the 96 h time point, all cells were collected and frozen after counting and the two PBS washes. For the MN assay, a 96-well plate was prepared with 99 μL of TK6 cell suspension in each well at a density of 3 × 105 cells/mL. Cells were exposed to the same concentrations of ENU as described above with four replicates of each concentration (1% v/v). After 24 h, the MN assay was performed as described below.
At Inotiv-RTP, 50 ± 0.1 mL of TK6 cell suspension at a density of 4.0 ± 0.5 X 105 cells/mL were prepared in 600 mL suspension culture flasks. Cells were exposed to five concentrations of ENU (25, 50, 100, 150, and 200 μM) or an equivalent volume of DMSO (1 % v/v) in duplicate flasks (n=2) at 37 °C and 6% CO2. After 24 h, 5 × 105 cells were removed from the vehicle control flasks and equivalent volumes were collected from each ENU-treated flask to perform the MN assay. The remaining cells were pelleted and washed twice by resuspending in 1x PBS (pH 7.4) to remove the PBS. The cells were resuspended to a density of 3.5 ± 0.5 X 105 cells/mL in RPMI 1640 media and 50 mL were returned to the culture flasks in the incubator. The remaining cells were rinsed twice with 1x PBS containing 1 mM EDTA and the pellets were flash frozen in liquid nitrogen before storing at −80°C. The above procedures were repeated at the 48 h and 72 h time points. After 96 h, all cells were collected and frozen after counting and washing twice with 1x PBS containing 1 mM EDTA. Cells exposed to 150 and 200 μM were collected only at 24 h and 96 h.
2.2. In vitro micronucleus flow cytometry assay
The MN frequency was measured using the In Vitro MicroFlow kit (Litron Laboratories, Rochester, NY) on a MACSQuant Analyzer 10 flow cytometer (Miltenyi Biotec) at HC and a Becton-Dickinson FACSCalibur flow cytometer at Inotiv-RTP. The samples were prepared for the flow cytometric analysis according to Litron’s protocol for 96-well plates (HC) and for larger format vessels (Inotiv-RTP). At HC, for each ENU treatment concentration and vehicle control (n=4), 5000 cells were analyzed to determine the percentage of apoptotic cells, MN frequency, and the ratio of nuclei and the counting beads added to all samples during assay preparation as per the protocol. At Inotiv-RTP, 20,000 cells were analyzed from each culture replicate (n=2 for each ENU-exposure and the vehicle control) for the same parameters. Relative survival (RS) was determined by dividing the nuclei-to-bead ratio of all treated samples by that of the corresponding vehicle control. Apoptotic cells and MN were detected by the double staining method described in Litron’s protocol.
2.3. DNA isolation and quality assessment
DNA extraction from the HC TK6 cell pellets was performed in-house in Ottawa, Canada and frozen DNA samples (two replicate sets of samples collected at 48, 72, and 96 h) were shipped to TwinStrand Biosciences on dry ice. Two replicate sets of 48 h DNA samples were retained for in-house DuplexSeq library preparation at HC. The frozen cell pellets produced by Inotiv-RTP were shipped on dry ice to TwinStrand Biosciences for DNA extraction and DuplexSeq library preparation. In both laboratories, DNA samples were extracted and purified using the Dneasy Mini Kit (Qiagen) following the manufacturer’s protocol. DNA was eluted in low EDTA TE buffer (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA). The quantity of DNA in each sample was measured using a Qubit 4 fluorometer and a dsDNA High Sensitivity Assay kit (Invitrogen). The extent of fragmentation of DNA was assessed using an Agilent TapeStation system with Genomic DNA ScreenTape reagents (Agilent Technologies). All DNA samples had a DNA integrity number (DIN) above 8.0.
2.4. Duplex Sequencing library preparation and sequencing
DuplexSeq was performed using the TwinStrand v1.0 Human Mutagenesis panel consisting of 20 target sites that are distributed across 20 autosomal chromosomes (Supplementary Table S1). The target regions are each 2,400 base pairs in length and include genic and intergenic sequences that are representative of the complete genome with regard to % GC-content (Supplementary Figure S1) and relative proportions of coding and non-coding regions (TwinStrand Biosciences). The 48 kb target territory excludes highly repetitive elements and pseudogenes where mapping quality could be compromised. The panel excludes genes with any significant reported role in cancer based on the COSMIC database to reduce the possibility of positive or negative selection influencing induced mutations in cells [12]. Library preparation was performed at the TwinStrand facility and at HC as previously described [13,14]. Inotiv-RTP samples and HC samples were processed and sequenced in August 2020 and in March 2021, respectively, at the TwinStrand facility. Samples were blinded during processing at TwinStrand. At HC, libraries for two additional replicates of 48 h samples were prepared in September 2021.
Briefly, for each sample 1000 ng of DNA was ultrasonically sheared to a median size of ~300 bp, end-repaired, A-tailed, and ligated to DuplexSeq Adapters and underwent target enrichment using the reagents and standard protocol included with the TwinStrand DuplexSeq Human Mutagenesis™ kit. The libraries prepared by TwinStrand were sequenced using 150 bp paired-end reads on an Illumina NovaSeq 6000. The 48 h libraries prepared at HC were shipped to and sequenced at Psomagen (Rockville, MD, US) under identical conditions.
TwinStrand Biosciences processed all raw sequencing data through the DuplexSeq human mutagenesis pipeline as previously described [12]. The raw sequencing data from the 48 h samples that were processed at HC were analyzed separately from the sequencing data produced by TwinStrand Biosciences. Mutation frequency is determined by dividing the number of unique mutations (i.e., ignoring the effect of clonal expansions) by the total number of duplex bases sequenced. The TwinStrand Biosciences pipeline provides sequencing quality metrics, MF, mutation spectra, trinucleotide frequency and MF per target data.
2.5. Statistical Analysis
Statistical analysis, including the estimation of MF and pairwise comparisons to controls, were conducted in the R environment for Statistical Computing [16]. Generalized linear models were fit to the data using the glm() function assuming a binomial error distribution. The Analysis of Deviance Table was estimated using the Anova() function in the cars R package [17]. For generalized linear models, this function presents the likelihood-ratio chi-square statistic from the model as an ANOVA table. Pairwise comparisons were conducted using the doBy R package [18]. The p-values from the hypothesis tests comparing the MF at each concentration to controls were adjusted for multiple testing using the Holm-Sidak correction. This multiple testing correction was applied within each time point and laboratory. This analysis was conducted on the pooled target sites as well as at each target site independently. Generalized linear mixed models were used to analyse the differences between target sites and laboratories and adjusted for multiple testing using the Holm-Sidak correction. The Wald Test was used to compare intergenic and genic target sites.
2.6. Trinucleotide mutation spectra and Catalogue of Somatic Mutations in Cancer (COSMIC) signature analysis
Cosine similarity between the mutational spectra obtained in the two laboratories and COSMIC signatures was evaluated in the R environment for Statistical Computing [16] using the MutationalPatterns 3.4.0 package [19]. First, mutation counts in the two replicates of each concentration at each time point were pooled. A control signature was generated at each time point using the vehicle control spectra. Using the fit_to_signatures_strict() function, the optimal combination of mutational signatures (among the control signature and the COSMIC signatures) that had the highest cosine similarity to each ENU-treated mutation spectrum was identified and the number of mutations that contributed to each signature in the mutation spectrum was determined. In addition, the cos_sim_matrix () function was used to evaluate the cosine similarity between each COSMIC signature and each ENU-treated mutation spectrum. Using the same function, the cosine similarity between the trinucleotide spectra from the two laboratories were analyzed.
2.7. Benchmark concentration modeling
Using the PROAST web application (version 70.1; RIVM National Institute for Public Health and the Environment; https://proastweb.rivm.nl), the concentration-response of MF was modeled using the Hill model to determine the benchmark concentration (BMC) of ENU that induced a 50% increase in MF at 48 h, 72 h, and 96 h. The limit for the AIC (Akaike Information Criterion), which describes the goodness of fit of the model, was set at 2. The lower and upper 90% confidence limits (BMCL and BMCU, respectively) were estimated in addition to the BMC.
3. RESULTS
3.1. Comparison of cytotoxicity and micronucleus frequency
The flow cytometry MN assay is a widely used assay that would fit well within an integrated test with DuplexSeq to identify potentially clastogenic and mutagenic agents. To select the concentrations for mutation analysis that induce DNA damage without excessive cell death, the MicroFlow assay was used to measure the MN frequency, relative survival (RS), and proportions of apoptotic and necrotic cells in TK6 cells 24 h after exposure to a range of ENU concentrations at HC and Inotiv-RTP (Figure 1). These measurements also provided an additional point of comparison to assess the consistency in exposure in the two laboratories.
ENU caused a similar reduction in cell survival in the two independently performed experiments. For example, at 150 μM, the highest concentration used at HC and the second highest at Inotiv-RTP, the average cell survival relative to the vehicle control was 57% and 60%, respectively. At HC, the 150 μM exposure induced a 3.1-fold increase in the apoptotic and necrotic cell population (from 2.3% to 7.2%), while at Inotiv-RTP, 150 μM induced a 2.2-fold increase (from 0.8% to 2.3%).
In both laboratories, ENU induced concentration-dependent increases in % MN. Table 1 summarizes the average fold change in MN frequency at each ENU treatment concentration compared to the vehicle control. The average % MN in the vehicle control in the HC experiment was 0.064%. The 100 μM exposure caused a 5.4-fold increase compared to control, increasing to 0.35% MN. In the Inotiv-RTP experiment, the average % MN in the vehicle control was 0.46%, which increased by 3.2-fold to 1.5% MN at 100 μM. The 100 μM concentrations caused an approximate 32–34% decline in relative survival in both laboratories. Concentrations larger than 100 μM were excluded from the HC study because the relative increase in cell count at 100 μM was approximately 50% after 24 h and dropped below 0% at 48 h at HC (data not shown); thus, 100 μM was selected as the top concentration for analysis and comparison of the two laboratories. The concentration-response in % MN in the two experiments was strongly and positively correlated (r = 0.98) (Supplementary Figure S2). Collectively, the MicroFlow assay confirmed the comparable responses to ENU in TK6 cells in both laboratories.
Table 1.
Fold change in % MN relative to vehicle |
||
---|---|---|
Concentration (μM) | HC | Inotiv-RTP |
| ||
25 | 1.1 | 1.2 |
50 | 1.9 | 1.8 |
100 | 5.4 | 3.2 |
150 | 9.2 | 4.0 |
200 | N/A | 6.2 |
3.2. DuplexSeq mutation analysis
3.2.1. Analysis of overall mutation frequency
Unique mutations were identified by DuplexSeq in each sample and MF were calculated. Figure 2A and Supplementary Table S2 show the average MF of the two biological replicates for 0, 25, 50, and 100 μM ENU at 48 h, 72 h, and 96 h from both studies. Overall, there was a concentration-dependent increase in MF at all three time points tested in both laboratories (Figure 2A and Supplementary Figures S3 and S4) but no significant time-dependent trend in MF at any of these concentrations in either laboratory.
The overall MF from the HC and Inotiv-RTP studies were strongly positively correlated (r = 0.97) (Figure 2B). At HC, the median MF in the vehicle control was approximately 5.5 × 10−7 and increased by an average of 1.9-fold at 100 μM to a median of 10.6 × 10-7. At Inotiv-RTP, the MF increased from a median of 4.7 × 10−7 in the vehicle control to a median of 11.4 × 10−7 at 100 μM, resulting in a 2.4-fold change. In the replicate set of 48 h samples where the libraries were prepared at HC, the average MF was 6.8 × 10−7 in the vehicle control and 12.0 × 10−7 at 100 μM, resulting in a 1.8-fold increase in MF (Supplementary Figure S3). The MF and the fold change in MF were congruent across the three datasets.
To further confirm that concentration-response to ENU in the overall MF was consistent in both laboratories at all three time points, BMC modeling was applied. The BMC50 values were determined using the Hill model (Figure 3 and Table 2); a benchmark response (BMR) of 50% is recommended for various mutagenicity assays and was, thus, selected as a point of comparison across the three time points [20]. All HC BMC50 90% CIs overlapped with the Inotiv-RTP values, indicating that there were no significant differences in concentration-response across the time points and between the two laboratories.
Table 2.
Hill model BMC50 (BMCL, BMCU) Individual analysis |
||
---|---|---|
Inotiv-RTP | HC | |
48h | 38.3 (32.8, 44.2) | 44.6 (33.9, 56.2) |
72h | 33.3 (28.3, 37.5) | 42.0 (34.8, 51.3) |
96h | 34.8 (30.9, 38.9) | 41.0 (34.7, 49.1) |
At Inotiv-RTP, treatment concentrations of 150 and 200 μM were analyzed only after 24 h and 96 h due to issues retrieving a sufficient number of cells at 48 h and 72 h. There were visible discrepancies in the magnitude of response in MF between the earlier and the later time point starting at 100 μM that increased with ENU concentration (Supplementary Figure S4). This difference in concentration-response was also evident in the BMC50 at 24 h (52.2 μM; BMCL: 44.0 μM; BMCU: 61.3 μM).
3.2.2. Mutation frequency across target regions
Typically, in vitro mutation assays are limited to detecting mutations in a single reporter gene. In contrast, the DuplexSeq Human Mutagenesis panel sequences 20 loci on different autosomal chromosomes. The panel includes 10 target regions that contain only genic sequences, six that are intergenic, and four that overlap both intergenic and genic sequences. We examined the MF in the individual target sites to determine if there were variabilities in the effects of ENU exposure in different regions of the genome. In addition, we examined the concordance in response at the 20 targets sites between the two laboratories.
We first examined the background MF in each target region by averaging the MF in the vehicle control across 48 h, 72 h, and 96 h because there was no statistically significant difference in the three time points; we then rank ordered the target regions by MF (Figure 4 and Supplementary Figure S5). Four of the five regions with the lowest background MF were the same in both laboratories (chromosomes 1, 6, 13, and 14). The two laboratories also shared four targets in the top five of the ranking (chromosomes 2, 12, 16, and 19). In the HC study, the difference in MF between the target site with the highest baseline MF (chromosome 21) and the lowest (chromosome 14) was 3-fold. In the Inotiv-RTP study, the difference in MF between the target site with the highest baseline MF (chromosome 9) and the lowest (chromosome 14) was 2.6-fold.
Next, we explored target responses to ENU. There was an observable concentration-dependent increase in MF at all 20 target sites in both studies (Figure 4). After ranking the targets by MF at 100 μM, we compared the target sites in the top and bottom 5 of the ranking for HC and Inotiv-RTP. The target sites on chromosomes 2, 10, and 22 were in the top 5 for both laboratories. In the bottom 5, chromosome 14 and 1 target sites were shared by the two laboratories. In the HC study, the difference in MF between the highest (chromosome 16) and the lowest (chromosome 14) ranked target sites was 1.9-fold. In the Inotiv-RTP study, the chromosome 19 target ranked the highest and the chromosome 14 target ranked lowest, with a difference of 1.7-fold between the two. On average, there was a 30% increase in MF in the intergenic target sites compared to the genic sites.
3.3. Analysis of interactions between experimental parameters
To determine if there were any statistically significant effects of laboratory (HC vs. Inotiv-RTP) on MF across the target sites at each concentration, we applied a generalized linear mixed model (GLMM) followed by the Holm-Sidak multiple testing correction to identify interactions between these two variables. There were no significant interactions between laboratory and target site at any of the concentrations, indicating that all loci responded similarly between the two laboratories. The interaction plots of MF across the 20 target sites at each ENU concentration in the two laboratories showed approximately parallel lines for the two datasets (Supplementary Figure S6a–d).
Next, the effects of concentration, laboratory, and time point on the overall MF (pooled analysis) as well as on MF at each target site and the interaction between these experimental parameters were examined in an Analysis of Deviance Table (Supplementary Tables S3 and S4). The Likelihood Ratio (LR) test statistic indicated highly significant and prominent concentration-related effects on the overall MF at all 20 target sites. There was no evidence of a time point-related effects on the overall MF and only one target site was significantly affected by time point. In terms of the influence of laboratory on MF, the pooled analysis indicated significant effects. However, upon analyzing individual target sites, only three target sites showed significant laboratory-related effects on MF, while at only two sites, an interaction between laboratory and concentration occurred at the highest exposure concentration. All in all, there were strong consistencies between the two laboratories across the 20 target sites.
3.4. Mutation spectra analysis
To explore the mechanistic information produced by DuplexSeq, and to compare the consistency of the mutation spectra across the two laboratories, MF were analyzed by base substitution type (Figure 5). As described above, the replicates across the three time points were averaged for this analysis. In both the HC and Inotiv-RTP control samples, C:G > T:A was the predominant type of mutation in the vehicle control followed by C:G > A:T with similar frequencies and proportions in the two laboratories. Overall, both the proportions and frequencies of C:G > T:A, T:A > C:G, and T:A > A:T significantly increased with ENU concentration by highly similar magnitudes in the two laboratories.
The mutation spectra data were examined in more detail by analyzing base substitutions in each trinucleotide context as a trinucleotide spectrum (Figure 6 and Supplementary Figures S7 and S8). Cosine similarities of the trinucleotide spectra produced in the two laboratories at each concentration and time point were determined; across matched pairs, we observed cosine similarities between 0.93 and 0.98 (Supplementary Figure S7). Consistent with all prior analyses, there was no time-dependent change in the spectra in either of the laboratories. Concentration-dependent changes in the trinucleotide spectra were observed in the COSMIC signature analysis (Supplementary Figure S8), with increases in the proportions of SBS7a and b (UV exposure-related) and SBS32 (azathioprine exposure /immunosuppression) contributing to the overall ENU mutational spectra in TK6 cells. In addition, the cosine similarity of the ENU spectra to SBS7a and b, SBS11 (temozolomide exposure), SBS30 (defective BER due to NTHL1 deficiency), and SBS32 also increased with concentration at all three time points in both laboratories (Supplementary Table S5). Overall, the two laboratories produced highly similar trinucleotide spectra and concordant results in the COSMIC analysis of trinucleotide spectra, indicating a high level of consistency in the mutation spectrum data produced by DuplexSeq.
4. DISCUSSION
Our study demonstrates how error-corrected DuplexSeq of a 48-kb panel of genic and intergenic regions within the human genome can precisely and reproducibly quantify in vitro MF for genetic toxicology applications. The human mutagenicity panel, which contains 20 targets across different autosomes, was specifically developed for applications in chemical and pharmaceutical mutagenicity assessment. This panel-based approach provides a pragmatic way to generate sufficient sequencing data for determining locus-specific MF and to perform an in-depth investigation into the characteristics of mutagenicity by analyzing the mutation spectra. The methodology is remarkably robust, demonstrating highly concordant results across time points and between sample preparations in different laboratories. Moreover, the results acquired provide detailed mechanistic insight enabling the understanding of target nucleotides and region-specific effects of mutagens.
The first objective of this comparative study was to determine an appropriate time point for analyzing mutations by DuplexSeq when assessing potentially mutagenic exposures in TK6 cells. Two laboratories, HC and Inotiv-RTP, independently exposed TK6 cells to ENU and sampled the cells 48, 72, and 96 h post-exposure to allow fixation of DNA damage into mutations. In both laboratories, the overall MF and the induced frequencies and proportions of base substitution types were highly similar for cells sampled from 48 to 96 h following the initial exposure to ENU at all concentrations. Furthermore, the overlapping BMC50 CIs at all sampling times in both laboratories supported the lack of time-dependent effects on concentration-response. Since the increases in MF in ENU-treated samples were statistically significant at 48 h and did not increase with longer incubations, the two later sampling times may not be necessary to detect mutations by DuplexSeq. As a point of contrast, the human TK6 cell HPRT gene mutation assay, one of the standard in vitro mutagenicity tests, takes up to 4 weeks before mutant scoring [21]. With a 48 h sampling time, induced mutation analysis by DuplexSeq drastically reduces the time and labour required compared to conventional tests, providing both an efficient and sensitive mutagenicity assessment methodology; this provides a tremendous advantage for the use of DuplexSeq for mutagenicity testing.
At Inotiv-RTP, a 24 h sampling time was also included in the experiment alongside MN analysis, with additional ENU concentrations of 150 and 200 μM at both 24 h and 96 h. While a concentration-response in the overall MF was observable at 24 h, the magnitude of response was smaller than at 48, 72, and 96 h, and the discrepancy became larger with increasing ENU concentration. This difference in concentration-response was also evident in the BMC50 at 24 h (52.2 μM; BMCL: 44.0 μM; BMCU: 61.3 μM), which was higher than the BMC50 at the three later time points measured at Inotiv-RTP. It is possible that cytostatic effects accompanying the increase in ENU concentration are causing a proportion of cells that retain ethyl adducts to remain viable but undivided at 24 h. Indeed, we observed a concentration-dependent reduction in the relative increase in cell count at 24 h, but the percentage of viable cells measured by trypan blue exclusion was unaffected up to 200 μM in our preliminary range finding experiment at HC at 24 h (data not shown). Based on these results, measuring both MN and mutations 24 h following the initial exposure is not recommended and, thus, these assays will require distinct time points for genotoxicity assessment.
Objective two aimed to assess the inter-laboratory reproducibility of mutagenicity assessment by DuplexSeq. The concordance in concentration-response in two experiments run at distinct laboratories was evaluated by correlation analysis of MF at each concentration and time point, in addition to BMC modeling at all three time points. We found a high, positive correlation between the two laboratories, with an r value of 0.97 in MF. This concordance in concentration-response was supported by the BMC analysis at each time point, where the BMC50 CIs derived from the two studies overlapped with each other. These finding suggest that different laboratories will obtain similar mutagenicity points of departure in DuplexSeq applications that use TK6 cells, which is critical to international efforts to adopt mutagenicity as an endpoint of regulatory concern [20,22].
As an added support to the inter-laboratory validation, two replicate cell samples that were collected 48 h post-exposure in the same experiment at HC were retained for in-house DuplexSeq library preparation and sequenced at a third party NGS facility (Psomagen Inc., Maryland). These samples were prepared and examined independently of the HC and Inotiv-RTP samples that were shipped to and processed by TwinStrand. While the MF measured in the 48 h samples processed in-house was higher than in both the 48 h HC and Inotiv-RTP samples that were processed by TwinStrand at all concentrations, the magnitude of increase in MF was highly similar. For example, at 100 μM, MF increased by 1.8-fold compared to control in the HC-processed samples and by 1.9- and 2.5-fold in the TwinStrand-processed samples. These additional data not only demonstrate the inter-laboratory consistency of DuplexSeq between HC and Inotiv-RTP, but also the promising reproducibility of DuplexSeq even when libraries are constructed and sequenced at different facilities, months apart.
In addition to inter-laboratory concordance in MF, we evaluated whether the mutation spectra, both simple and trinucleotide, were consistent between the two laboratories. Both types of spectra remained constant over time at both HC and Inotiv-RTP and were consistent between the two laboratories across concentrations. The COSMIC signature analysis of the trinucleotide spectra revealed concentration-dependent increases in cosine similarity to the same COSMIC signatures in both laboratories at all time points. A more in-depth discussion on the observed COSMIC signatures is in the Supplementary Discussion. Overall, the analysis supports that DuplexSeq generates highly reproducible spectral data.
As our third objective, we examined the MF at each of the 20 target sites of the Mutagenesis panel and investigated whether the MF at each locus was affected differently by experimental parameters including ENU concentration, laboratory, and time point. Almost all loci showed significant concentration responses in MF in both laboratories. At five target sites, there appeared to be laboratory-driven effects on MF in the analysis of deviance. The main effects at two sites were inconclusive, while the small but significant laboratory-driven effects on chromosomes 17, 20, and 21 indicated differences between the two laboratories at these three sites. This is likely due the lower baseline MF in the Inotiv-RTP relative to HC samples leading to a greater response in these target sites at Inotiv-RTP. However, we caution over-interpretation of these results because of the small sample size (n=2) and the fact that MF at these sites were within one order of magnitude with each other. Ranking of the target sites by MF in the vehicle control and at 100 μM (averaged across time points) revealed some shared target sites in the top and bottom five between the two laboratories. Furthermore, in the GLMM analysis of individual concentration groups, we observed no significant interaction between target sites and laboratory at any of the concentrations and, thus, no laboratory-dependent effects on target sites in this model. All in all, the target sites responded to ENU in a consistent manner and there was a good degree of similarity in locus-specific MF in the two different laboratories.
We further investigated what additional biological and mechanistic understanding we could gain from the Mutagenesis panel that add value to mutagenicity assessment. First, we examined the differences in MF between coding and non-coding regions. We expected genic regions, which are more likely to be repaired by transcription-coupled nucleotide excision repair (TCR-NER), to contain fewer background mutations and to show less mutagenesis in ENU-exposed cells [12,13,23]. Indeed, on average, the MF in the intergenic regions was 30% higher than in the genic regions, indicating differences between the two types of target regions as previously observed in an in vivo DuplexSeq study in transgenic rodents [13]. To estimate the transcription levels of the genic regions, we obtained transcript levels for 6 of the genic target sites from in-house DNA microarray gene expression profiles of TK6 cells exposed to a range of ENU concentrations for 4 h. These genes were all expressed at levels below the median expression level of the whole transcriptome and were not responsive to ENU treatment (data not shown). Given this, it may be possible that the genes in the genic target sites are not significantly affected by TC-NER, compared to more actively transcribed genes. Nonetheless, the work demonstrates additional biological insight that can be acquired from the use of locus-specific panels in DuplexSeq assays.
Notably, we observed an ENU mutation spectrum different from the previously reported mutation spectra in a human induced pluripotent stem cell line (iPSC) [24] and in transgenic rodent tissues (Big Blue and TgrasH2 mice) [12]. ENU ethylates guanine at the O6 and N7 positions and at the O4, O2, and N3 positions of thymine; unrepaired O6-alkylG and 7-alkylG can result in C:G > T:A transitions and O4-alkylT and 3-alkylT can lead to T:A > C:G, T:A > A:T, and T:A > G:C [25–27]. In an iPSC model, T:A > C:G was predominant, while in TGR models, T:A > A:T and T:A > C:G were the predominant types of base substitution following ENU exposures. However, our observations are similar to the ENU-induced spectrum generated in TK6 cells using another ecNGS method, whole genome paired-end and complementary consensus sequencing (PECC-seq) [8]. The difference in mutation spectra from other models is likely due to the elevated baseline levels of C:G > T:A and C:G > A:T in TK6 cells, which accounted for 40% and 25–28% of the overall baseline mutations, respectively, at HC and Inotiv-RTP. Mutation spectra are influenced not only by exposure to mutagens, but also by endogenous factors such as tissue-specific DNA repair mechanisms and polymerases [23,28], physiologic processes such as autoimmunity or inflammation [29], inherited genetic factors, and aging [30,31]. Indeed, the baseline and induced spectra in TK6 cells aligned with the known characteristics of the cell line and accompanying culture conditions. The reduced repair capacity for endogenously and exogenously alkylated guanine in TK6 cells is due to its O6-alkylguanine DNA alkyltransferase (MGMT) deficiency paired with spontaneous deamination occurring over the course of culturing, which are the likely causes of the elevated levels of baseline C:G > T:A [32–34]. MGMT deficiency also likely resulted in the predominant increase in the frequency of C:G > T:A over mutations of the thymine bases following ENU exposure. In fact, in MGMT-proficient cells, ENU-induced increases in the frequencies and proportions of thymine mutations are more prominent than what is observed in TK6 cells [12]. A possible cause of the higher baseline frequency of C:G > A:T is oxidative damage to the guanine bases forming mutagenic lesions such as 8-oxo-2’-deoxyguanine (8-oxo-dG). Culture conditions (e.g., culture media, O2 level, atmospheric pressure) could increase cellular susceptibility to oxidative stress by stimulating reactive oxygen species production and/or interfering with antioxidant defences [35]. Unrepaired 8-oxodG can pair with adenine during replication and lead to C:G > A:T transversions [36]. This may lead to the accumulation of C:G > A:T with successive cell divisions. Overall, we speculate that the above factors lead to TK6 cell line-specific mutation spectra.
Despite that fact that ENU is one of the most potent mutagens known in mammalian cells [37], we observed smaller fold increases in MF (1.8 – 2.5-fold) than previously reported following exposures to ENU and other less potent mutagens [12,38–40]. Several factors may contribute to the smaller fold increases in MF in our study. Baseline MF in TK6 cells in our study was relatively higher than other studies using ecNGS. For example, Miranda et al. (2022) [38] used whole genome HiFi sequencing and reported baseline MF values between 0.68 × 10−7 and 0.76 × 10−7 in TK6 cells; we observed values between 4.6 × 10−7 and 5.6 × 10−7, almost 7 times higher. There were critical differences between the two studies. Whole genomes were examined using HiFi sequencing while the Mutagenesis panel was used for DuplexSeq to target select areas that are proportionally comparable to the genome in terms of sequence contexts. Thus, dynamic ranges could differ between these technologies. However, the discrepancies between our study and the HiFi sequencing study are more likely attributable to the experimental design. Miranda et al. (2022) first prepared a homogenous TK6 cell culture originating from a single cell, from which 10 cells were isolated, exposed for five days, and expanded for 4–7 days prior to whole genome mutation analysis, whereas we did not isolate a sub-population of cells prior to our experiment. Similarly, background MF obscuring induced mutations was noted during the development of the in vitro PIG-A gene mutation assay and was overcome by excluding mutant cells from the culture prior to performing the assay [41]. These experimental design and model limitations should be carefully considered in future studies. In addition, compared to gene mutation assays in mammalian cells, the mutation frequency quantification methods vastly differ for DuplexSeq. Gene mutation assays rely on cloning and phenotypic selection over many generations of cells [21]. In contrast, we identified only unique mutations and excluded clonally expanded mutations that were observed more than once; such mutations are not discernable in the results produced by conventional assays, which would contribute to the overall larger fold changes in MF observed in phenotype-based mutation studies. In addition, the top concentration of ENU that we tested is relatively lower than the top concentrations (1 – 3 mM) used in many previous studies using traditional gene mutation assays [26,39,42]. The current concentration range may not capture the complete dynamic range of the DuplexSeq assay. Therefore, it is difficult to perform a direct, side-by-side comparison of results across different types of mutation assays and experimental designs.
Targeted sequencing using the Mutagenesis panel increases the throughput of mutation analysis by DuplexSeq. However, the types of mutations that are quantifiable using the Mutagenesis panel may only include small changes in the sequence such as point mutations and small indels, while excluding larger structural mutations such as large deletions and chromosome rearrangements, resulting in mutation spectra different from the ones typically produced by mammalian gene mutation assays [1,2]. Nonetheless, the single nucleotide variants and small indels measured by DuplexSeq are critical mutational endpoints that occur in and drive cancer and heritable genetic disease. The major advantage of mutation detection by DuplexSeq and ecNGS in general is the precise information regarding the location, sequence contexts, and the quantity of each type of mutation, as well as the flexibility in the applicable model systems, in contrast to conventional mutagenicity tests. Such data enable quantitative analysis and modeling (e.g., in vitro-in vivo extrapolation, bioactivity exposure ratio derivation) and the identification of mechanism-specific mutational signatures, which may be valuable in establishing links between chemical exposures and carcinogenesis [24]. Overall, additional research is required to gain a better understanding of the capacity of the DuplexSeq mutagenicity panel as a genetic toxicity assay and to determine how to effectively integrate this assay in genetic toxicology assessments.
The experimental design and the baseline MF values presented in this study provide a reference point for future DuplexSeq studies to further explore its utility in genetic toxicology and as a regulatory test. As discussed above, direct comparisons of the panel against whole genomes have not been conducted and would be an important step in validating the DuplexSeq assay. Target panels can be refined or improved as more information regarding the mutability of different genomic regions becomes available. Future studies must include genotoxicants with different mechanisms of action and mutagenic potencies to gain a more comprehensive understanding of the assay’s capabilities and dynamic range. Furthermore, broader inter-laboratory studies are needed to investigate cross-laboratory consistency and to establish standardized operating procedures for sample preparation and for data processing and analysis [43].
In summary, we identified 48 h post-exposure to be an optimal sampling time for in vitro mutation analysis in TK6 cells for a potent alkylating agent. Compared to gene mutation assays that are based on mutant selection and quantification, DuplexSeq, especially when combined with a genome-representative hybrid selection panel, presents a significant advantage in terms of efficiency and the diversity of information that can be obtained. However, it must be noted that the current associated costs may limit its accessibility. We also demonstrated high inter-laboratory consistency of the DuplexSeq human mutagenicity assay in two different laboratories. The results for both MF and spectra were remarkably concordant between the two experiments in all comparisons, suggesting robust reproducibility of the assay and its strong potential for test guideline development. In addition, the targeted panel approach enabled analyzing and comparing MF in different genomic contexts. Additional validation and data are required to support this emerging approach to mutagenicity testing in gaining regulatory acceptance, including its capability to detect mutagens with varying potencies and more complex mechanisms of action. Nonetheless, studies such as ours provide a step forward in building confidence in DuplexSeq for applications in genetic toxicology and set the stage for applying this methodology to modernize chemical risk assessment.
Supplementary Material
HIGHLIGHTS.
‐ Duplex Sequencing directly quantifies mutations in DNA from cultured human cells
‐ Mutation frequency and spectra can be determined in vitro using Duplex Sequencing
‐ Duplex Sequencing detects increases in mutation frequency 48h post mutagen exposure
‐ Duplex Sequencing produced concordant results in two different laboratories
ACKNOWLEDGMENTS
This study was funded by Health Canada’s Genomics Research and Development Initiative (GRDI) and genetic toxicity testing conducted for the Division of Translational Toxicology (DTT), National Institutes of Environmental Health Sciences, National Institutes of Health, US Department of Health and Human Services under contract [HHSN273201300009C]. CLY acknowledges the Canada Research Chairs Program. We would like to thank Annette Dodge (University of Ottawa), Danielle LeBlanc, Dr. Matthew Meier (Health Canada), and Mark Consugar (TwinStrand Biosciences) for their support in conducting the DuplexSeq assay and analysis. The authors would also like to thank Dr. Scott Auerbach and Ms. Julie Foley at the DTT, and Drs. Marc Beal and Julie Cox at HC for their careful review of the manuscript.
ABBREVIATIONS
- BMC
Benchmark concentration
- BMR
Benchmark response
- COSMIC
Catalogue of Somatic Mutations in Cancer
- DuplexSeq
Duplex Sequencing
- ecNGS
error-corrected next-generation sequencing
- ENU
N-ethyl-N-nitrosourea
- GLMM
Generalized linear mixed models
- HC
Health Canada
- MF
Mutation frequency
- MN
Micronucleus
- RS
Relative survival
- UMI
Unique molecular index
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
COMPETING INTERESTS
EC, CDS, MR, AW, CLY, FM, LR, KLW, and SLSR declare that they have no conflict of interest. EKS, JY, THS, PV, FYL, CCV, and JJS are employees and equity holders at TwinStrand Biosciences, Inc. and are authors on one or more Duplex Sequencing-related patents.
Declaration of Interest Statement
EC, CDS, MR, AW, CLY, FM, LR, KLW, and SLSR declare that they have no conflict of interest. EKS, JY, THS, PV, FYL, CCV, and JJS are employees and equity holders at TwinStrand Biosciences, Inc. and are authors on one or more Duplex Sequencing-related patents.
DATA AVAILABILITY STATEMENT
The datasets produced in this study have been uploaded to the NCBI’s Sequencing Read Archives (SRA). Health Canada datasets (bam files) are available under BioProject number PRJNA909196 (Reviewer link: https://dataview.ncbi.nlm.nih.gov/object/PRJNA909196?reviewer=aj5t3tmmd0pk60li9pj868mebq). Inotiv-RTP datasets (fastq files) are available under BioProject number PRJNA913202 (Reviewer link: https://dataview.ncbi.nlm.nih.gov/object/PRJNA913202?reviewer=cnjuifkosvjsfqbhk8indv7g02)
REFERENCES
- [1].OECD, Test No. 476: In Vitro Mammalian Cell Gene Mutation Tests using the Hprt and xprt genes, in: Paris, France, 2016. [Google Scholar]
- [2].OECD, Test No. 490: In Vitro Mammalian Cell Gene Mutation Tests Using the Thymidine Kinase Gene, OECD Publishing, Paris, France, 2016. [Google Scholar]
- [3].OECD, Test No. 471: Bacterial Reverse Mutation Test, in: OECD Publishing, Paris, France, 2020. [Google Scholar]
- [4].Salk JJ, Kennedy SR, Next-generation genotoxicity: Using modern sequencing technologies to assess somatic mutagenesis and cancer risk, Environ Mol Mutagen 61 (2020) 135–151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Beal MA, Meier MJ, LeBlanc DP, Maurice C, O’Brien JM, Yauk CL, Marchetti F, Chemically induced mutations in a MutaMouse reporter gene inform mechanisms underlying human cancer mutational signatures, Commun Biol 3 (2020) 438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Kennedy SR, Schmitt MW, Fox EJ, Kohrn BF, Salk JJ, Ahn EH, Prindle MJ, Kuong K, Shen JC, Risques R, Loeb LA, Detecting ultralow-frequency mutations by Duplex Sequencing, Nat Protoc 9 (2014) 2586–2606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Minko I, Jacobs A, de Leon A, Gruppi F, Donley N, Harris T, Rizzo C, McCullough A, Lloyd RS, Catalysts of DNA Strand Cleavage at Apurinic/Apyrimidinic Sites, Sci Rep 6 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].You X, Thiruppathi S, Liu W, Cao Y, Naito M, Furihata C, Honma M, Luan Y, Suzuki T, Detection of genome-wide low-frequency mutations with Paired-End and Complementary Consensus Sequencing (PECC-Seq) revealed end-repair-derived artifacts as residual errors, Arch Toxicol 94 (2020) 3475–3485. [DOI] [PubMed] [Google Scholar]
- [9].Schmitt MW, Kennedy SR, Salk JJ, Fox EJ, Hiatt JB, Loeb LA, Detection of ultra-rare mutations by next-generation sequencing, Proceedings of the National Academy of Sciences, USA 109 (2012) 14508–14513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Abascal F, Harvey L, Mitchell E, Lawson A, Lensing SV, Ellis P, Russell A, Alcantara RE, Baez-Ortega A, Wang Y, Kwa EJ, Lee-Six H, Cagan A, Coorens T, Chapman MS, Olafsson S, Leonard S, Jones D, Machado HE, Davies M, Martincorena I, Somatic mutation landscapes at single-molecule resolution, Nature 593 (2021) 405–410. [DOI] [PubMed] [Google Scholar]
- [11].Hoang ML, Kinde I, Tomasetti C, McMahon KW, Rosenquist TA, Grollman AP, Kinzler KW, Vogelstein B, Papadopoulos N, Genome-wide quantification of rare somatic mutations in normal human tissues using massively parallel sequencing, Proceedings of the National Academy of Sciences, USA 113 (2016) 9846–9851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Valentine III CC, Young RR, Fielden MR, Kulkarni R, Williams LN, Li T, Minocherhomji S, Salk JJ, Direct quantification of in vivo mutagenesis and carcinogenesis using duplex sequencing, Proceedings of the National Academy of Sciences, USA 117 (2020) 33414–33425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].LeBlanc DP, Meier M, Lo FY, Schmidt E, C.C.V. III, Williams A, Salk JJ, Yauk CL, Marchetti F, Duplex sequencing identifies genomic features that determine susceptibility to benzo(a)pyrene-induced in vivo mutations, BMC Genomics [Computer File] 23 (2022) 542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Wang Y, Mittelstaedt RA, Wynne R, Chen Y, Cao X, Muskhelishvili L, Davis K, Robison TW, Sun W, Schmidt EK, Smith TH, Norgaard ZK, III CCV, Yaplee J, Williams LN, Salk JJ, Heflich RH, Genetic toxicity testing using human in vitro organotypic airway cultures: Assessing DNA damage with the CometChip and mutagenesis by Duplex Sequencing, Environ Mol Mutagen 62 (2021) 306–318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Lorge E, Moore MM, Clements J, O’Donovan M, Fellows MD, Honma M, Kohara A, Galloway S, Armstrong MJ, Thybaud V, Gollapudi BB, Aardema M, Tanir JY, Standardized cell sources and recommendations for good cell culture practices in genotoxicity testing, Mutat Res Fund Mol Mech Mutagen 809 (2016) 1–15. [DOI] [PubMed] [Google Scholar]
- [16].R.C. Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing. 4.1.2 (2021). [Google Scholar]
- [17].Fox J, Weisberg S, An R Companion to Applied Regression, 3rd ed., Sage, Thousand Oaks, CA, 2019. [Google Scholar]
- [18].Hojsgaard S, Halekoh U, doBy: Groupwise Statistics, LSmeans, Linear Contrasts, Utilities. R package version 4.6.11 URL: https://CRAN.R-project.org/package=doBy, (2021).
- [19].Manders F, Blokzijl F, Janssen R, Oka R, de Kanter J, van Roosmalen M, van Boxtel R, Cuppen E, Introduction to MutationalPatterns URL: https://bioconductor.org/packages/release/bioc/vignettes/MutationalPatterns/inst/doc/Introduction_to_MutationalPatterns.html, (2021).
- [20].White PA, Long AS, Johnson GE, Quantitative Interpretation of Genetic Toxicity Dose‐Response Data for Risk Assessment and Regulatory Decision‐Making: Current Status and Emerging Priorities, Environ Mol Mutatgen 61 (2020) 66–83. [DOI] [PubMed] [Google Scholar]
- [21].Johnson GE, Mammalian Cell HPRT Gene Mutation Assay: Test Methods, in: Parry J, Parry E (Eds.), Springer, New York, NY, 2012: pp. 55–67. [DOI] [PubMed] [Google Scholar]
- [22].Heflich RH, Johnson GE, Zeller A, Marchetti F, Douglas GR, Wiit KL, Gollapudi BB, White PA, Mutation as a Toxicological Endpoint for Regulatory Decision-Making, Environ Mol Mutagen 61 (2020) 34–41. [DOI] [PubMed] [Google Scholar]
- [23].Saini N, Roberts SA, Klimczak LJ, Chan K, Grimm SA, Dai S, Fargo DC, Boyer JC, Kaufmann WK, Taylor JA, Lee E, Cortes-Ciriano I, Park PJ, Schurman SH, Malc EP, Mieczkowski PA, Gordenin DA, The Impact of Environmental and Endogenous Damage on Somatic Mutation Load in Human Skin Fibroblasts, PLoS Genet 12 (2016) e1006385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Kucab JE, Zou X, Morganella S, Joel M, Nanda AS, Nagy E, Gomez C, Degasperi A, Harris R, Jackson SP, Arlt V, Phillips D, Nik-Zainal S, A Compendium of Mutational Signatures of Environmental Agents, Cell 177 (2019) 821–836.e16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Jansen JG, de Groot AJ, van Teijlingen CM, Lohman PH, Mohn GR, Vrieling H, van Zeeland AA, Formation and persistence of DNA adducts in pouch skin fibroblasts and liver tissue of rats exposed in vivo to the monofunctional alkylating agents N-methyl-N-nitrosourea or N-ethyl-N-nitrosourea, Mutat Res Fund Mol Mech Mutagen 307 (1994) 95–105. [DOI] [PubMed] [Google Scholar]
- [26].Yang J-L, Lee P-C, Lin S-R, Lin J-G, Comparison of mutation spectra induced by N-ethyl-N-nitrosourea in the hprt gene of Mer+ and Mer− diploid human fibroblasts, Carcinogenesis 15 (1994) 939–945. [DOI] [PubMed] [Google Scholar]
- [27].Shrivastav N, Li D, Essigmann JM, Chemical biology of mutagenesis and DNA repair: cellular responses to DNA alkylation, Carcinogenesis 31 (2010) 59–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].Saini N, Giacobone CK, Klimczak LJ, Papas BN, Burkholder AB, Li J-L, Fargo DC, Bai R, Gerrish K, Innes CL, Schurman SH, Gordenin DA, UV-exposure, endogenous DNA damage, and DNA replication errors shape the spectra of genome changes in human skin, PLoS Genet 17 (2021) e1009302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Pleguezuelos-Manzano C, Puschhof J, Huber AR, Mutational signature in colorectal cancer caused by genotoxic pks+ E. coli, Nature 580 (2020) 269–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Alexandrov L, Kim J, Haradhvala NJ, Huang MN, Ng A, Wu Y, Boot A, Covington KR, Gordenin D, Bergstrom EN, Islam SM, Lopez-Bigas N, Klimczak LJ, McPherson JR, Morganella S, Sabarinathan R, Wheeler DA, Mustonen V, G. PCAWG Mutational Signatures Working Group Getz, Rozen SG, Stratton M, The repertoire of mutational signatures in human cancer, Nature 578 (2020) 94–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].Alexandrov L, et al. , Signatures of mutational processes in human cancer, Nature 500 (2013) 415–421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Kaina B, Christmann M, Naumann S, Roos WP, MGMT: Key node in the battle against genotoxicity, carcinogenicity and apoptosis induced by alkylating agents, DNA Repair (Amst) 6 (2007) 1079–1099. [DOI] [PubMed] [Google Scholar]
- [33].Nagel ZD, Beharry AA, Mazzucato P, Kitange GJ, Sarkaria JN, Kool ET, Samson LD, Fluorescent reporter assays provide direct, accurate, quantitative measurements of MGMT status in human cells, PLoS One 14 (2019) e0208341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Ehrlich M, Zhang X-Y, Inamdar NM, Spontaneous deamination of cytosine and 5-methylcytosine residues in DNA and replacement of 5-methylcytosine residues with cytosine residues, Mutat Res 238 (1990) 277–286. [DOI] [PubMed] [Google Scholar]
- [35].Halliwell B, Oxidative stress in cell culture: an under-appreciated problem?, FEBS Lett 540 (2003) 3–6. [DOI] [PubMed] [Google Scholar]
- [36].Markkanen E, Not breathing is not an option: How to deal with oxidative DNA damage, DNA Repair (Amst) 59 (2017) 82–105. [DOI] [PubMed] [Google Scholar]
- [37].Russell WL, Kelly EM, Hunsicker PR, Bangham JW, Maddux SC, Phipps EL, Specific-locus test shows ethylnitrosourea to be the most potent mutagen in the mouse., Proceedings of the National Academy of Sciences 76 (1979) 5818–5819. 10.1073/pnas.76.11.5818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Miranda JA, McKinzie PB, Dobrovolsky VN, Revollo JR, Evaluation of the mutagenic effects of Molnupiravir and N4‐hydroxycytidine in bacterial and mammalian cells by HiFi sequencing, Environ Mol Mutagen 63 (2022) 320–328. 10.1002/em.22510. [DOI] [PubMed] [Google Scholar]
- [39].Bronstein SM, Cochrane JE, Craft TR, Swenberg JA, Skopek TR, Toxicity, mutagenicity, and mutational spectra of N-ethyl-N-nitrosourea in human cell lines with different DNA repair phenotypes., Cancer Res 51 (1991) 5188–97. [PubMed] [Google Scholar]
- [40].Jensen JC, Thilly WG, Spontaneous and induced chromosomal aberrations and gene mutations in human lymphoblasts: mitomycin C, methylnitrosourea, and ethylnitrosourea, Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis 160 (1986) 95–102. 10.1016/0027-5107(86)90033-3. [DOI] [PubMed] [Google Scholar]
- [41].Rees BJ, Tate M, Lynch AM, Thornton CA, Jenkins GJ, Walmsley RM, Johnson GE, Development of an in vitro PIG-A gene mutation assay in human cells, Mutagenesis (2017) gew059. 10.1093/mutage/gew059. [DOI] [PMC free article] [PubMed]
- [42].Medcalf ASC, Wade MH, Comparison of mutagenicity of N-methyl-N-nitrosourea and N-ethyl-N-nitrosourea in human diploid fibroblasts, Carcinogenesis 4 (1983) 115–118. 10.1093/carcin/4.1.115. [DOI] [PubMed] [Google Scholar]
- [43].Marchetti F, Cardoso R, Chen CL, Douglas GR, Elloway J, Escobar PA, Harper T Jr, Heflich RH, Kidd D, Lynch AM, Myers MB, Parsons BL, Salk JJ, Settivari RS, Smith-Roe SL, Witt KL, Yauk C, Young RR, Zhang S, Minocherhomji S, Error-corrected next-generation sequencing to advance nonclinical genotoxicity and carcinogenicity testing, Nat Rev Drug Discov 22 (2023) 165–166. 10.1038/d41573-023-00014-y. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets produced in this study have been uploaded to the NCBI’s Sequencing Read Archives (SRA). Health Canada datasets (bam files) are available under BioProject number PRJNA909196 (Reviewer link: https://dataview.ncbi.nlm.nih.gov/object/PRJNA909196?reviewer=aj5t3tmmd0pk60li9pj868mebq). Inotiv-RTP datasets (fastq files) are available under BioProject number PRJNA913202 (Reviewer link: https://dataview.ncbi.nlm.nih.gov/object/PRJNA913202?reviewer=cnjuifkosvjsfqbhk8indv7g02)