Abstract
Etheno (ε)-adducts, e.g., 1,N2-ε−guanine (1,N2-ε-G) and 1,N6-ε−adenine (1,N6-ε-A), are formed through the reaction of DNA with metabolites of vinyl compounds or with lipid peroxidation products. These lesions are known to be mutagenic, but it is unknown how they lead to errors in DNA replication that are bypassed by DNA polymerases. Here we report the structural basis of misincorporation frequencies across from 1,N2-ε-G by human DNA polymerase (hpol) η. In single-nucleotide insertions opposite the adduct 1,N2-ε-G, hpol η preferentially inserted dGTP, followed by dATP, dTTP, and dCTP. This preference for purines was also seen in the first extension step. Analysis of full-length extension products by LC-MS/MS revealed that G accounted for 85% of nucleotides inserted opposite 1,N2-ε-G in single base insertion, and 63% of bases inserted in the first extension step. Extension from the correct nucleotide pair (C) was not observed, but the primer with A paired opposite 1,N2-ε-G was readily extended. Crystal structures of ternary hpol η insertion-stage complexes with nonhydrolyzable nucleotides dAMPnPP or dCMPnPP showed a syn orientation of the adduct, with the incoming A staggered between adducted base and the 5’-adjacent T, while the incoming C and adducted base were roughly coplanar. The formation of a bifurcated H-bond between incoming dAMPnPP and 1,N2-ε-G and T, compared with the single H-bond formed between incoming dCMPnPP and 1,N2-ε-G, may account for the observed facilitated insertion of dGTP and dATP. Thus, preferential insertion of purines by hpol η across from etheno adducts contributes to distinct outcomes in error-prone DNA replication.
Keywords: DNA polymerase, X-ray crystallography, mass spectrometry, DNA damage, DNA replication, DNA–protein interaction, translesion synthesis, etheno DNA adducts
Abbreviations: 1,N2-ε-G, 1,N2-ethenodeoxyguanosine or 1,N2-ethenoguanine; CID, collision-induced dissociation; dAMPnPP, 2′-deoxyadenosine-5′-[(α,β)-imido]triphosphate; dCMPnPP, 2′-deoxycytosine-5′-[(α,β)-imido]triphosphate; dNMPnPP, 2′-deoxynucleoside-5′-[(α,β)-imido]triphosphate; EIC, extracted ion chromatogram; ESI, electrospray ionization; FAM, 6-carboxyfluorescein; h, human; pol, DNA polymerase; UDG, uracil DNA glycosylase; UPLC, ultraperformance liquid chromatography
Exocyclic etheno (ε) nucleobase adducts, e.g., 1,N6-ε-A, 3,N4-ε-C, N2,3-ε-G, and 1,N2-ε-G (1, 2), arise from the action of various bis-electrophiles with cellular DNA (3) (Fig. 1). Reactive species that generate such lesions and originate from exogenous sources include epoxides derived from the oxidation of vinyl chloride and other vinyl monomers (4, 5) (Fig. 2A), urethane, certain nitrosamines (6), and mucochloric acid (7). In addition, etheno lesions are formed endogenously through interaction of nucleobases with lipid peroxidation-derived aldehydes and hydroxyalkenals (8, 9, 10) (Fig. 2B). Thus, etheno-DNA lesions have been detected in the human livers with nonalcoholic fatty liver disease, a cause of end-stage liver disease and hepatocellular carcinoma (11). Reactive oxygen species constitute an important factor in hepatocellular carcinoma pathogenesis and can react with polyunsaturated fatty acids that are derived from membrane phospholipids, e.g., aldehydes such as 2,3-epoxy-4-hydroxynoneal and 4-hydroperoxynonenal react with DNA to form exocyclic etheno adducts (8, 10, 11). Other potential sources of ROS are inflammatory processes and cytochrome P450 (P450) 2E1. However, it has been established that hepatic etheno DNA adducts correlated significantly with 4-hydroxynonenal but not with levels of P450 2E1 (12).
1,N2-ε-G was first synthesized in 1977 (13), and the 1,N2-ε-G base was later found to be excreted in human urine, indicating its presence in cellular DNA and removal by glycosylases (14). 1,N2-ε-G caused misincorporation and mutations with model DNA polymerases (pols) in vitro (e.g., G→T and G→C base changes) and in bacterial (G→T, G→C and G→A base changes in Escherichia coli) and mammalian cells (G→A and G→T base changes) (15, 16, 17). However, the mutation rates following introduction of the 1,N2-ε-G lesion into bacterial or mammalian cell lines were relatively low (16, 17), a finding that was subsequently attributed to some degree to excision of the lesioned base by glycosylases such as E. coli uracil DNA glycosylase (UDG) and human alkyl-N-purine DNA glycosylase (18). These results leave open the question of the mechanism of miscoding and the potential involvement of a wider spectrum of pols beyond the replicative enzymes initially tested. The relevance of the etheno adducts in mutagenesis and carcinogenesis (1, 19, 20, 21) provided a motivation to more closely examine the consequences of these lesions for DNA damage-inducible pols. The 1,N2-ε-G adduct in which the two exocyclic carbon atoms prevent the formation of normal Watson–Crick H-bonds served as a useful model system for other exocyclic G lesions in such pol structure/function investigations (3, 22, 23).
In addition to the high-fidelity replicative pols, eubacteria, eukaryotes, and archaebacteria all have at their disposal the so-called translesion synthesis (TLS) or bypass polymerases (pols) (24, 25, 26, 27, 28, 29). These Y-Family pols play major roles in replication past DNA damage, whereby TLS can proceed in both an error-prone and an error-free fashion. Y-family pols share the right-handed palm, finger, and thumb domains with replicative ones but feature an additional little finger domain and unique makeups of their active sites. E. coli possesses five pols, with pols IV and V being members of the Y-Family (30). Humans have at least 19 pols, of which pols η, κ, and ι and Rev1 belong to the Y-Family. In the model archaeal organism Sulfolobus solfataricus, DNA pol IV (Dpo4) exhibits bypass behavior that is in some cases similar to that of human pol η (29). In initial work directed at the activity of a bypass pol past the 1,N2-ε-G adduct, we investigated the steady-state kinetics of Dpo4-catalyzed insertion and extension reactions (31). Experiments with individual dNTPs showed that A was the base preferentially inserted opposite 1,N2-ε-G. The products from full-length extension reactions were identified by an LC-MS/MS approach that revealed Dpo4 used multiple bypass mechanisms. Dpo4 extended primers containing 1,N2-ε-G to full length such that frameshifts predominated over A incorporation opposite the lesion. Thus, for a 3′-(1,N2-ε-G)CACT-5′ template, 5′-GTGA-3′ (frameshift, no insertion opposite 1,N2-ε-G) was the major extension product and 5′-AGTGA-3′ was a minor product. Generation of the former entails a frameshift such that primer G base pairs with template C. Crystal structures of a binary Dpo4-DNA and three ternary Dpo4-DNA-ddNTP complexes revealed an anti conformation of 1,N2-ε-G that was stacked between adjacent base pairs but unopposed by a residue from the primer strand (31). In these so-called Type 2 structures, Dpo4 is able to accommodate two template bases in the active site, thereby trapping a frameshift intermediate and shedding light on the mechanism of incorporation into the growing primer of either A or G opposite T or C, respectively, located on the 5′-side of the lesion.
A subsequent study investigated the abilities of four human pols to synthesize past the 1,N2-ε-G adduct and their tendencies for error-prone or error-free bypass (32). One of them, the replicative human pol δ was completely blocked by the etheno lesion. The human Y-family pols κ and ι exhibited similar rates of either dTTP or dCTP incorporation. In comparison, hpol η was more efficient at bypassing the lesion than hpols κ and ι, but it also showed the highest error frequency, incorporating dGTP > dATP > dCTP. This order was maintained irrespective of whether the 5′-adjacent residue to the lesion was C or T. Therefore, Dpo4 and hpol η use different mechanisms of bypassing the 1,N2-ε-G lesion, whereby the latter pol appears to consistently insert a base opposite 1,N2-ε-G and then extend the primer in an error-free fashion.
In the present work, we conducted further insertion and extension experiments with the most active of the human bypass pols, pol η, opposite the 1,N2-ε-G lesion. The bypass behavior was assessed both with the 18mer 5′-d(CAT [1,N2-ε-G]AT GAC GCT TCC CCC)-3′ (T(εG)A, 1) and 5′-d(TCA T[1,N2-ε-G]G AAT CCT TCC CCC)-3′ (T(εG)G, 2) template strands, positioning the lesion between A and T and G and T, respectively. hpol η favored insertion of dGTP opposite the adduct in both sequence contexts, followed by dATP ≅ dTTP > dCTP for the 5′-T(εG)A-3′ (1) template and dATP > dTTP > dCTP for the 5′-T(εG)G-3′ (2) template. The ability of hpol η to extend from the lesion was tested after the correct (1,N2-ε-G:C) base pair and a (1,N2-ε-G:A) mispair. Interestingly, the polymerase was unable to extend from the former but capable of carrying out postlesion extension from the latter.
We determined crystal structures of two insertion-stage hpol η complexes with the template/primer duplex 5′-d(CAT [1,N2-ε-G]AT GAC GCT)-3′/3′-d(TA CTG CGA)-5′ and either incoming dAMPnPP or dCMPnPP (nonhydrolyzable analogs of dATP and dCTP, respectively) in the presence of Mg2+. The incoming dAMPnPP adopts a staggered orientation opposite 1,N2-ε-G in the syn orientation, with H-bond formation between the N6 atom of A and both the O6 atom of 1,N2-ε-G and the O4 atom of the 5'-T. In the complex with incoming dCMPnPP, the lesion also assumes the syn orientation but pairs opposite C with formation of an H-bond between the N4 atom of C and the O6 atom of 1,N2-ε-G.
The combined hpol η bypass and structural data with DNA template–primer duplexes containing the 1,N2-ε-G adduct contribute to a better understanding of the diverse mechanisms underlying the preferred insertion of dATP and dGTP by this pol opposite distinct lesions. In previous work directed at hpol η bypass of abasic sites and the 1,N6-ε-A adduct, we established the preferential insertion of dATP and dGTP opposite these lesions (33, 34), as seen here with the bypass of 1,N2-ε-G. However, structures of insertion stage complexes reveal that the “purine rule” with an abasic site involves interactions between the incoming nucleotide base and the template strand backbone that can also result in frameshifts (33). Both incoming dGTP and dATP stack onto the 1,N6-ε-A adduct, which adopts the anti conformation at the hpol η active site, whereas incoming dTTP is coplanar with the lesion in the syn conformation. Conversely, the preferred insertion of dGTP and dATP by hpol η occurs with the 1,N2-ε-G lesion in the syn conformation without the involvement of stacking between the lesioned and incoming nucleotide base moieties. 1,N2-ε-G maintains the syn orientation even opposite incoming dCTP, and unlike the case of bypass of abasic sites, the preferred insertion of purine nucleotides opposite 1,N2-ε-G is devoid of frameshifts.
Results
Translesion synthesis across from the 1,N2-ε-G adduct by hpol η
hpol η-mediated bypass of 1,N2-ε-G was performed using two different sequence contexts, 5′-T(εG)A-3′ (1) and 5′-T(εG)G-3′ (2) (see Table S1 for oligonucleotide sequences), employing full-length extension and single-nucleotide insertion assays. The oligonucleotide sequence context 5′-T(εG)A-3′ (1) was designed on the basis of the sequence used for crystallography. The 5′-T(εG)G-3′ (2) oligonucleotide was studied previously (32). Full-length primer extension reactions were performed in the presence of all four dNTPs using the respective primers (see Table S1 for oligonucleotide sequences). Under our experimental conditions, the bypass of 1,N2-ε-G was slower compared with unmodified templates (Fig. 3A and Fig. S4A) for both sequence contexts.
Single-nucleotide insertion assays were done utilizing individual dNTPs to determine the insertions across the adduct. The incorporation preference for the control templates was dCTP > dTTP > dGTP > dATP (Fig. 3B, Fig. S4B lanes 1–12). hpol η preferentially added dGTP across from 1,N2-ε-G, at almost similar rates for both sequence contexts. The incorporation preference for the 5′-T(εG)A-3′ (1) template was dGTP > dATP ≅ dTTP > dCTP (Fig. 3C, lanes 1–12) while for the 5′-T(εG)G-3′ (2) template, the preference was dGTP > dATP > dTTP > dCTP (Fig. S4C, lanes 1–12). These results show that hpol η tolerates 1,N2-ε-G in both sequence contexts.
Postlesion extension past the 1,N2-ε-G adduct by hpol η
hpol η-mediated postlesion extension for 1,N2-ε-G was performed for both the sequence contexts 5′-T(εG)A-3′ (1) and 5′-T(εG)G-3′ (2) (see Table S1 for oligonucleotide sequences) employing full-length extension and single-nucleotide insertion assays. We utilized two different primers for postlesion extension after the correct base pair (1,N2-ε-G:C) and a mispair (1,N2-ε-G:A). For a misincorporated base, A was used opposite the lesion because it showed the greatest misincorporation frequency, following that for dGTP.
Full-length primer extension reactions were performed in the presence of all four dNTPs using the respective primers (see Table S1 for oligonucleotide sequences). As a result, for both the sequence contexts, under our experimental conditions, extension from the 1,N2-ε-G:C pair was slower (Fig. 4A and Fig. S5A, lane 7) compared with the unmodified templates (Fig. 4A and Fig. S5A, lane 2). For the 1,N2-ε-G:A pair, extension was faster (Fig. 5A and Fig. S6A, lane 7) compared with unmodified templates (Fig. 5A and Fig. S6A, lane 2).
Single-nucleotide insertion assays were done utilizing individual dNTPs to analyze postlesion insertions after the correct base pair (1,N2-ε-G:C) as well as mispairing (1,N2-ε-G:A).
For correct base pairing, the incorporation preference for control template 1 was dTTP > dGTP > dCTP > dATP (Fig. 4B, lanes 1–12), and the incorporation preference for the control template 2 was dATP > dTTP >dGTP > dCTP (Fig. S5B lanes 1–12). In the case of the 1,N2-ε-G:C template–primer pairs, postlesion insertion was drastically retarded, indicating resistance to extension past the correct base pair C. Overall, hpol η-mediated formation of very small amounts of inserted products for both sequence contexts was observed, at similar rates. The postlesion insertion preference for 5′-T(εG)A-3′ (1) template was dTTP > dGTP > dATP > dCTP (Fig. 4C, lanes 1–12) while for the 5′-T(εG)G-3′ (2) template, the preference was dGTP > dATP ≅ dTTP ≅ dCTP (Fig. S5C, lanes 1–12).
For a mispair (G:A), postlesion insertion was drastically affected for the control template 1 (Fig. 5B, lanes 1–12), while the incorporation preference for the control template 2 was dTTP > dATP ≅ dGTP > dCTP (Fig. S6B, lanes 1–12). In the case of the 1,N2-ε-G:A template–primer pairs, the postlesion insertions were unaffected (Fig. 5C, Fig. S6C). The order of postlesion insertion for both the sequence contexts was dTTP > dGTP > dATP > dCTP (Fig. 5C, Fig. S6C, lanes 1–12). The preference for T insertion during hpol η-mediated postlesion bypass was also observed previously for unmodified (2′-F-G:C pair) as well as modified (N7-CH3 2′-F-G:C pair) templates (35).
Overall, hpol η was unable to perform postlesion extension for the correct pair (1,N2-ε-G:C). hpol η was able to perform postlesion extension for a mispair (1,N2-ε-G:A), indicating that the mispair was preferentially extended past the lesion.
LC-ESI-MS/MS analysis for hpol η-mediated primer extension across from 1,N2-ε-G
To elucidate details about other possible miscoding events, hpol η-mediated full-length extension reactions were performed for both the sequence contexts, 5′-T(εG)A-3′ (1) and 5′- T(εG)G-3′ (2) for LC-MS/MS analysis (see Table S1 for oligonucleotide sequences). The 2′-deoxyuridine (dU)-containing primers (see Table S1 for oligonucleotide sequences) were used for the extension reaction. Next, the fully extended reaction mixtures were treated with UDG and piperidine as described earlier (31, 36). Reaction mixtures were also analyzed by gel electrophoresis to confirm the elongations (Fig. S7). The relative yields of fully extended products were calculated by LC-MS/MS analysis and are summarized in Table 1 and Table S6. CID spectra of the products are shown in Figs. S8–S19 and Tables S3–S10.
Table 1.
Primer_4: 5’-FAM-GGGGGAAGCGUCAT-3’ Template: 3’-CCCCCTTCGCAGTAXTAC-5’ | ||||
---|---|---|---|---|
X | Sequence | Yield | Observed m/z (charge) | Base added |
G | 5’-pCATCATGA-3’ | 70% | 823.64 (−3) | C, plus, blunt end addition of A, and G |
5’-pCATCATGG-3’ | 30% | 828.64 (−3) | ||
1,N2-ε-G | 5′-pCATAGTGA-3′ | 15% | 836.64 (−3) | A, followed by misinsertion of G, plus, blunt end addition of A |
5′-pCATGATGA-3′ | 38% | 836.64 (−3) | G, plus, blunt end addition of A | |
5′-pCATGGTGA-3′ | 47% | 842.00 (−3) | G, followed by misinsertion of G plus, blunt end addition of A |
Replication of the control templates gave only error-free products, with blunt end addition of A and G (Table 1 and Table S6).
In case of the 5′-T(εG)A-3′ (1) template, hpol η replicated past the adduct in an error-prone manner only, resulting in three main products (Table 1). The first two products showed the same m/z (i.e., 836.64, −3 ion) and coeluted together at same retention time (tR 4.34 min, Figs. S8 and S10). In these two products, the first product corresponded to misincorporation of A opposite the adduct, followed by misinsertion of G with blunt end addition of A (m/z 836.64, −3 ion: 5′-pCATAGTGA-3′; 15%; Fig. 6 and Table S3), and the second product corresponded to misincorporation of G with blunt end addition of A (m/z 836.64, −3 ion: 5′-pCATGATGA-3′; 38%; Fig. S9 and Table S4). The last product corresponded to misincorporation of G opposite the adduct, followed by misinsertion of G and with blunt end addition of A (m/z 842.00, −3 ion (tR 4.34 min): 5′-pCATGGTGA-3′; 47%; Fig. S11 and Table S5). The insertion of G (opposite 1,N6-ε-A) followed by misinsertion of another G has been observed in previous studies (37).
For the 5′-T(εG)G-3′ (2) template, hpol η-mediated error-prone bypass resulted in four main products (Table S6). The first two products showed the same m/z (836.64, −3 ion) and coeluted at the same retention time (tR 4.33 min, Figs. S12 and S15). Interestingly, the next two products also coeluted at same tR and m/z (842.00, −3 ion, tR 4.31 min, Figs. S16 and S19). Among all the products, the first corresponded to misincorporation of A opposite to the adduct with blunt end addition of G (m/z 836.64, −3 ion: 5′-pTCAATGAG-3′; 20%; Fig. S13 and Table S7). The second product corresponded to misincorporation of G with blunt end addition of A (m/z 836.64, −3 ion: 5′-pTCGATGAA-3′; 25%; Fig. S14 and Table S8). The third product corresponded to misincorporation of A followed by misinsertion of G, with misinsertion of G opposite the 5′ T and blunt end addition of A (m/z 842.00, −3 ion: 5′-pTCAGTGGA-3′; 26%; Fig. S17 and Table S9). The last product corresponded to misincorporation of G opposite the adduct, misinsertion of G opposite the 5′ T, with blunt end addition of A (m/z 842.00, −3 ion: 5′-pTCGATGGA-3′; 29%; Fig. S18 and Table S10).
No products were observed containing the incorporation of correct base C (or its stalled products), indicating that correct base C can be inserted opposite but that hpol η is not able to extend the primer to obtain full-length products, as reflected in postlesion extension assays.
Crystal structures of ternary hpol η insertion stage complexes with 1,N2-ε-G opposite dAMPnPP and dCMPnPP
We determined two crystal structures of hpol η⋅DNA (oligonucleotide)⋅dNTP ternary complexes trapped in the insertion state. Both complexes feature the same 8mer oligonucleotide primer and a 12mer oligonucleotide template with the 1,N2-ε-G adduct flanked by 5′-T and 3′-A (Table S2). The structure of the complex with incoming dAMPnPP was refined to 2.59 Å resolution, and the complex with incoming dCMPnPP was refined to 2.23 Å resolution. Selected crystal data, data collection, and refinement parameters are listed in Table 2. Examples of the quality of the electron density in the final models are depicted in Figs. S20 and S21. Along with the polymerase, oligonucleotide duplex, and nucleotide triphosphates, the dAMPnPP and dCMPnPP structures revealed 163 and 273 water molecules, respectively, and two active site Mg2+ ions. dAMPnPP and dCMPnPP are nonhydrolyzable nucleotide triphosphates that allow cocrystallization of pols with oligonucleotide template–primer constructs in the presence of Mg2+ without the use of a primer with a 3′-terminal 2′,3′-dideoxynucleoside and/or divalent metal ions that are not catalytically competent.
Table 2.
Complex | 1,N2-ε-G:dAMPnPP | 1,N2-ε-G:dCMPnPP |
---|---|---|
Data Collection | ||
Wavelength [Å] | 0.97856 | 0.97856 |
Space group | P61 | P61 |
Resolution [Å] | 50.0–2.59 (2.63–2.59)a | 42.9–2.23 (2.29–2.23)a |
Unit cell a, b, c [Å] | 99.11, 99.11, 81.86 | 99.08, 99.08, 81.89 |
Unique reflections | 14,272 (713) | 22,384 (1659) |
Completeness [%] | 99.7 (100) | 100 (100) |
I/σ(I) | 12.0 (1.8) | 16.6 (2.9) |
Wilson B-factor [Å2] | 22.1 | 26.9 |
R-merge | 0.152 (0.910) | 0.093 (0.625) |
Redundancy | 5.7 (5.7) | 5.7 (5.7) |
Refinement | ||
R-work | 0.191 (0.235) | 0.158 (0.193) |
R-free | 0.248 (0.301) | 0.210 (0.280) |
Number of atoms | ||
Protein/DNA | 3426/391 | 3394/391 |
dNTP/Water/Mg2+ | 30/163/2 | 28/273/2 |
Protein residues | 431 | 430 |
B-factor [Å2] | ||
Average | 38.6 | 33.5 |
Protein/DNA | 37.8/45.9 | 32.6/37.5 |
dNTP/M2+/Water | 41.3/33.4/38.7 | 34.4/26.5/38.5 |
R.m.s. deviations | ||
Bonds [Å] | 0.003 | 0.008 |
Angles [deg.] | 0.6 | 1.0 |
Ramachandran | ||
Favored (%) | 96.8 | 97.0 |
Allowed (%) | 3.0 | 2.8 |
Outliers (%) | 0.2 | 0.2 |
PDB ID Code | 5F9I | 5F9N |
Statistics for the highest-resolution shell are shown in parentheses.
At the active sites of both complexes, 1,N2-ε-G adapts a syn orientation with the lesioned Watson–Crick edge of G jutting into the major groove (Figs. 7 and 8). The nucleobase of the incoming dAMPnPP is staggered relative to the 1,N2-ε-G base plane and sits halfway between the adduct and the 5′-adjacent thymine (Fig. 7A). This structure positions the N6 atom of adenine 2.9 Å from O6 of the adduct and 2.5 Å from O4 of thymine, consistent with the formation of two H-bonds. The projection, approximately perpendicular to the 1,N2-ε-G and adenine planes, shows that there is no stacking between the incoming nucleotide and the adduct (Fig. 7B). The 3′-hydroxyl group of the primer 3′-terminal dT is poised for attack at the α-phosphate of dAMPnPP with an O3’…Pα distance of 3.7 Å. Among hpol η side chains that project into the active site, Arg-61 plays a key role in that it is often engaged in multiple interactions with nucleobase atoms and/or phosphate groups of the incoming nucleotide (38, 39). In the case of the ternary complex with dAMPnPP, the guanidino moiety of arginine does not stack on adenine or engage in contacts with the Hoogsteen edge of the base, as Arg-61 is shifted slightly into the major groove vis-à-vis the incoming nucleotide (Fig. 7B). However, Arg-61 forms a somewhat long (3.4 Å) salt bridge with the α-phosphate and a water-mediated interaction with the γ-phosphate. Another amino acid side chain, Gln-38, that is positioned near the minor groove and occasionally engages in direct or water-mediated contacts with template base and/or incoming nucleotide (38, 39), is too far removed from both in the active site of this complex and does not form any with either.
Unlike dAMPnPP, the base moiety of the incoming dCMPnPP is coplanar with the base plane of 1,N2-ε-G (Fig. 8A). A single H-bond is formed between the N4 atom of cytosine and the O6 atom of the adduct. The O4 atom of the T 5′-adjacent to 1,N2-ε-G is too far removed from N4 of the incoming cytosine to engage in an H-bond interaction (3.5 Å). The 3′-hydroxyl group of the primer 3′-terminal T sits at a distance of 3.5 Å from the α-phosphate of dCMPNPP. As in the case of the complex with dAMPnPP, Arg-61 is shifted slightly into the major groove vis-à-vis the incoming cytosine (Fig. 8B), which precludes a stacking interaction. The guanidino moiety of Arg-61 is too far removed from the α-phosphate (4.1 Å) to engage in an effective interaction but forms a water-mediated contact with the γ-phosphate. In the minor groove, a water molecule bridges the Gln-38 side chain and O2 atom of the incoming dCMPnPP.
Discussion
The Y-Family hpol η plays a key role in translesion synthesis in human cells and features an active site composition that can accommodate bulky adducts (24, 40). Among the best-known base lesions that hpol η can cope with are those that arise as a consequence of UV damage, e.g., cyclic pyrimidine dimers, as well as cisplatin intrastrand cross-links (41). However, hpol η can also synthesize past a variety of other adducts with various proficiencies in a more or less error-prone fashion. Thus, hpol η can bypass the major oxidative lesion 8-oxoG efficiently and virtually error-free (42). Additional lesions that this polymerase can overcome include abasic sites (33), exocyclic etheno A and G adducts (32, 34), O6-methyl G (39), and O4-alkyl-T (43) as well as, in a remarkably efficient and mostly error-free mode, the methyl FAPY-G lesion (44), and a bulky aminobenzanthrone DNA adduct (45).
The insertion stage assays with both sequences, 5′-T(εG)A-3′ (1) and 5′-T(εG)G-3′ (2), demonstrate that hpol η preferentially inserts dGTP and dATP relative to the pyrimidine nucleoside triphosphates. In the case of sequence (1), dATP and dTTP were incorporated to a similar degree, but the correct dCTP nucleotide was the least preferred opposite 1,N2-ε-G in both sequence contexts (Fig. 3 and Fig. S4). The two crystal structures of hpol η ternary complexes reveal clear differences between the orientation of incoming dAMPnPP and dCMPnPP. The nucleobase of the former assumes a staggered orientation relative to the 1,N2-ε-G base plane and is thus positioned halfway between the adduct and template dT in the spacious active site of hpol η (Fig. 7). This allows formation of a bifurcated H-bond between incoming nucleotide (N6H2) and template dT (O4) and 1,N2-ε-G (O6). The overall preferred dGTP is likely to adopt a similar orientation and may use its N1H atom to establish a bifurcated H-bond to the O4 and O6 keto oxygens of dT and the adduct, respectively. Unlike an incoming dATP, dGTP features the exocyclic amino function in the minor groove that may also allow a direct or water-bridged H-bond interaction with the side chain of Gln-38. In the structure of the complex with incoming dCMPnPP, cytosine is virtually coplanar with the 1,N2-ε-G base plane, resulting in a single H-bond between N4H2 and the O6 keto oxygen of the adduct, but precluding an H-bond interaction with the 5′-adjacent template dT (Fig. 8). dTTP is preferentially inserted relative to dCTP in both sequence contexts and—assuming similar orientations of thymine and cytosine vis-à-vis 1,N2-ε-G—will likely establish an H-bond to the N7 atom of the adduct in the syn conformation via its N3 hydrogen donor function. As with an incoming dCMPnPP in the crystal structure, this pose would position the thymine O2 atom in the minor groove in the vicinity of the Gln-38 side chain so that the two could be linked by a bridging water molecule. Alternatively, it is possible that thymine shifts slightly toward the major groove, thus placing the N3 hydrogen donor roughly between the N7 and O6 acceptors of the adduct and potentially resulting in the formation of a bifurcated H-bond. The preferred insertion of dGTP and dATP by hpol η opposite 1,N2-ε-G is most likely a consequence of their ability to reach two template nucleotides, as stacking differences between incoming purine and pyrimidine nucleoside triphosphates do not appear to play a significant role in the observed discrimination. However, the staggered orientation of dAMPnPP (and presumably dGTP) does not result in frameshifting, and once inserted, the purine may settle opposite the adduct in a more or less coplanar fashion, thereby allowing hpol η to bypass 1,N2-ε-G without skipping a base.
Extension reactions were tested with primers that featured either A or C opposite the adduct. Not only was insertion of dCTP opposite the adduct disfavored relative to dGTP and dATP, but extension from the correct base opposite 1,N2-ε-G was severely hampered as well (Fig. 4A and Fig. S5A, lane 7). hpol η was able to extend from a G:A pair with one of the control templates and then inserted dTTP > dATP ≅ dGTP > dCTP, i.e., the extension proceeded in an error-prone fashion. Conversely, extensions from 1,N2-ε-G:A proceeded more efficiently than the control reactions from G:A, and irrespective of the template sequence, hpol η inserted dTTP > dGTP > dATP > dCTP opposite T (Fig. 5 and Fig. S6). In a previous study from our laboratories (35), we observed that dTTP misincorporation was sixfold more efficient during hpol η-mediated postlesion bypass for the N7-CH3 2′-F-G lesion compared with 2′-F-G unmodified template, when paired with the correct base C. In a similar way, hpol η catalyzed TLS past the 1,N2-ε-G adduct in an efficient but highly error-prone manner. The results of the full-length extension reactions analyzed by LC-MS/MS attest to the absence of frameshift products but are supportive of untemplated addition of A and G at the end of the primer. The polymerase preferentially incorporated dGTP opposite the adduct and in the first extension step (85% and 63%, respectively, Table 1).
In a previous study from our laboratories (32), we also reported that hpol η preferred to insert G and A opposite 1,N2-ε-G, with some C insertion. LC-MS extension analysis, with what is referred to here as oligonucleotide 2, yielded only the product with G incorporated opposite 1,N2-ε-G (and no frameshifts), in contrast to the results shown in Table S6. The only major differences in the earlier paper (32) were that the (longer) hpol η was expressed in a baculovirus-based system (instead of E. coli) and that a less sensitive mass spectrometer was used (Thermo DecaXP). In the earlier paper, a slippage mechanism was included that could have accounted for G insertion in the product (32). In reviewing the results of our present study, it is clear that both G and A were present in the extended primer product (Table 1 and Table S6, Fig. 6, Figs. S9, S11, S13, S14, S17 and S18), as evidenced by the presence of both in the products generated by hpol η from distinct oligonucleotides. A slippage mechanism is not very likely to explain the incorporation of A.
A result observed in the present work with oligonucleotide 2 is the insertion of A followed by misinsertion of G, apparently opposite the T immediately 5′ to the 1,N2-ε-G adduct (Table S6). A slippage mechanism would require insertion of A followed by a 2-base loop-out to the 5′ penultimate C and then realignment to complete the polymerization. Another anomaly was the apparent misincorporation of G opposite the 5′ T in about one-half of the product (but not with the unmodified oligonucleotide) (Table S6).
Presently, there are no crystal structures of ternary complexes with hpol η trapped at the extension step after 1,N2-ε-G bypass. The insertion stage complexes with either incoming dAMPnPP (Fig. 7) or dCMPnPP (Fig. 8) both reveal the adduct in the syn orientation. The absence of frameshifting in the full-length extension reactions supports the notion that the adduct maintains the syn orientation following TLS by hpol η. A coplanar arrangement of either G or A paired to 1,N2-ε-G at the −1 position can be expected to result in a locally distorted duplex geometry in order to accommodate the adducted and inserted purine moieties. Although extensions can proceed, they do so in an error-prone way. What is clear, however, is that an arrangement with the adduct in the anti orientation and either incoming dGTP or dATP stacked on top of 1,N2-ε-G during bypass is neither supported by the structural data nor by the sequences identified from the extension assays.
Our observations of hpol η-catalyzed TLS past the 1,N2-ε-G lesion reveal similarities in terms of the preferential insertion of purine nucleoside triphosphates by this pol opposite an array of other lesions. However, they also demonstrate contrasting underlying bypass mechanisms as well as the distinct means of hpol η to accommodate similar adducts, e.g., the exocyclic 1,N2-ε-G and 1,N6-ε-A, at its active site. For example, with abasic sites the purine rule applies—hpol η rarely inserts T and C (33)—and A and G are also preferred for insertion over T opposite 1,N6-ε-A by hpol η (34). However, crystal structures of ternary complexes showed that 1,N6-ε-A is in the anti conformation when the preferred A and G are inserted. Also, distinct from the bypass of 1,N2-ε-G, the bypass of 1,N6-ε-A by hpol η involves frameshifts. Another distinct feature of the bypass reactions opposite the two exocyclic etheno adducts is that structural data showed that 1,N6-ε-A is in the syn conformation opposite dTTP (34). Conversely, we demonstrated here that 1,N2-ε-G is in the syn orientation both with incoming dAMPnPP and dCMPnPP. The origins of the different behaviors of the two adducts in terms of adopting either the syn or anti orientation at the active site of hpol η are currently not clear.
Along with shared and distinct features of the bypass of similar adducts by hpol η, it is also worth noting in the context of 1,N2-ε-G that S. solfataricus Dpo4 and hpol η, two Y-family pols that have much in common in terms of both activity and structure, behave very differently vis-à-vis this adduct. Thus, both pols have active sites with high ceilings that allow accommodation of two template bases and efficient bypass of CPDs and cisplatin intrastrand cross-links. However, bypass reactions opposite 1,N2-ε-G by Dpo4 involve the adduct in the anti orientation with a stacked incoming nucleotide that result in −1 frameshifts (31). These differences contrast sharply with our previous finding that both Dpo4 and hpol η are capable of mostly correct bypass of 8-oxoG (42, 46). In the case of Dpo4, the preference of 8-oxoG in the anti conformation opposite incoming dCTP versus in the syn conformation opposite incoming dATP is about 19:1. For hpol η the discrimination is even larger, with the percentage of insertion of dATP (and therefore 8-oxoG in syn) only between 1 and 3%. It is therefore remarkable that the bypass of 1,N2-ε-G is so different for the two pols, i.e., in Dpo4, the adduct is always in the anti conformation but in hpol η it seems to be always in the syn conformation as per the two new structures.
Experimental procedures
hPol η catalytic core protein expression and purification
The hpol η plasmid (pET28a) comprising residues 1 to 432 was a generous gift from Dr. Wei Yang, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health. The polymerase was expressed in E. coli and purified as described previously (47), and the protein solution was concentrated to ∼5 mg/ml.
Materials
The 1,N2-ε-G-modified phosphoramidite (Fig. S1) was synthesized using a previously reported procedure (48). Unlabeled dNTPs and uracil DNA glycosylase (UDG) were purchased from New England Biolabs. C18 Sep-Pak columns were purchased from Waters. Piperidine was from Sigma-Aldrich. Unmodified oligonucleotides and FAM-labeled oligonucleotide primers were purchased from Integrated DNA Technologies. dAMPnPP and dCMPnPP were obtained from Jena Bioscience.
1,N2-ε-G-oligonucleotide synthesis, purification, and characterization
Solid-phase synthesis of the 1,N2-ε-G-modified oligonucleotides was done on a PerSeptive Biosystems Model 8909 DNA synthesizer (see Table S1 for in-house synthesis of modified oligonucleotide sequences) (Figs. S2 and S3). The modified DNAs were synthesized on a 1-μmol scale using the appropriate controlled pore glass (CPG) as solid support. Manual coupling for the modified phosphoramidite was carried out for 30 min. The modified oligonucleotides were deprotected using a reported procedure (49). The modified oligonucleotides were purified by denaturing polyacrylamide gel electrophoresis (20% PAGE, 7 M urea) at 55 W for 2.5 h using 1X TBE buffer (89 mM Tris-HCl buffer (pH 8.0) containing 89 mM boric acid and 2 mM EDTA). The gel was visualized under a UV lamp at 260 nm, and the desired oligonucleotide bands were isolated from the gel and extracted using TEN buffer (10 mM Tris-HCl buffer (pH 8.0) containing 1 mM EDTA and 300 mM NaCl) overnight at room temperature. The modified oligonucleotides were desalted using C18 Sep-Pak columns, and their integrity was confirmed by MALDI-TOF mass spectrometry (positive linear mode, Figs. S2 and S3).
Full-length extension assays
A primer–template complex consisting of a FAM-labeled oligonucleotide primer and an unmodified or modified template was annealed (1:1 M ratio) at 95 °C for 5 min followed by slow cooling overnight (see Table S1 for oligonucleotide sequences) (Figure 3, Figure 4, Figure 5A–5A and Fig. S4A–S6A). Full-length extension reactions were carried out using 50 mM Tris-HCl buffer (pH 7.5) containing 50 mM NaCl, 5 mM MgCl2, 5% glycerol (v/v), 5 mM DTT, and 100 μg/ml BSA at 37 °C for 2 h. The final concentration of the primer–template complex was 150 nM, and 120 nM hpol η was used to obtain fully extended primers. Reactions were initiated by adding a 1.25 μl mixture of dNTPs to a total volume of 25 μl. Aliquots (3.5 μl) of reaction mixtures were removed at times of 0, 5, 30, 60, and 120 min and quenched with 6.5 μl of 10 mM EDTA (pH 8.0) in 95% deionized formamide (v/v). Products were separated using 20% PAGE (7 M urea). Results were visualized on a Typhoon scanner (GE Healthcare) and analyzed with ImageJ software.
Single-nucleotide incorporation assays
Each primer–template oligonucleotide complex containing a FAM-labeled oligonucleotide primer and an unmodified or modified template was annealed (1:1 M ratio) at 95 °C for 5 min, followed by slow cooling overnight (see Table S1 for oligonucleotide sequences) (Figs. 3, B and C, 4, B and C and 5, B and C, Figs. S4, B and C, S5, B and C, and S6, B and C). All single-nucleotide insertion reactions were performed using 50 mM Tris-HCl buffer (pH 7.5) containing 50 mM NaCl, 5 mM MgCl2, 5% glycerol (v/v), 5 mM DTT, and 100 μg/ml BSA, at 37 °C. The final concentration of primer–template complex was 150 nM and 10 nM hpol η was used, followed by the addition of 1.25 μl of an individual dNTP to a total volume of 25 μl. Aliquots (3.5 μl) of reaction mixtures were removed at 0, 5, 10, and 30 min and quenched with 6.5 μl of 10 mM EDTA (pH 8.0) in 95% deionized formamide (v/v). Products were separated using 20% PAGE (7 M urea). Results were visualized on a Typhoon scanner (GE Healthcare) and analyzed with ImageJ software.
Full-length extension reactions followed by UDG and piperidine treatment for LC-ESI-MS/MS analysis
A dU-containing FAM-labeled primer and an unmodified or modified template were annealed (1:1 M ratio) at 95 °C for 5 min followed by slow cooling overnight (see Table S1 for oligonucleotide sequences) (Fig. S7). The full-length extension reactions were carried out under similar reaction conditions as described in full-length extension assays, except that the final concentrations were as follows: the primer–template complex concentration was 2.5 μM and hpol η was 1.35 μM for the oligonucleotide 1,N2-ε-G template–primer complex and 0.75 μM for the unmodified template–primer complex, in a total reaction volume of 85 μl. Reactions were started using a mixture of dNTPs (1 mM) at 37 °C for 4 h. The resulting products were desalted using Biospin columns. The fully extended products were treated with 25 U of UDG at 37 °C for 4 h, followed by 0.25 M piperidine at 95 °C for 1 h (31, 35), and the reaction mixture was evaporated to dryness by lyophilization. The dried pellet was resuspended in 500 μl of nuclease-free water and concentrated to dryness by lyophilization. The samples were dissolved in 30 μl of nuclease-free water for LC-ESI-MS/MS analysis.
LC-ESI-MS/MS analysis
LC-MS/MS analyses were performed on a Waters Acquity UPLC system connected directly to a Thermo-Finnigan LTQ mass spectrometer (electrospray ionization) (Fig. 6, Figs. S8–S19 and Tables S3–S10). The analysis was carried out in the negative ion mode using an octadecylsilane (C18) column (Acquity UPLC BEH, 1.7 μm, 2.1 mm × 100 mm) as reported previously (37). The extended product sequences were identified (Fig. 6, Figs. S8–S19, Table 1, Tables S3–S10) by comparing the theoretical m/z values using a Mongo Oligo Mass Calculator v2.06 with the observed CID fragments. For the coeluted products, relative peak areas were calculated on the basis of the intensity of representative fragments (as shown in Fig. S10 for a5-B(−2) and Figs. S15 and S19 for a4-B(−1) fragments). The relative yields of extended product sequences were calculated on the basis of relative peak areas of extracted ion chromatograms.
Crystallization of complexes
Primer and template sequences used in the crystallization experiments are shown in Table S2 (Figs. 7 and 8, Figs. S20 and S21). The DNA template–primer duplex was mixed with the protein in a 1.2:1 M ratio in the presence of excess 50 mM Tris-HCl buffer (pH 7.5) containing 450 mM KCl and 3 mM DTT. After adding 5 microl of 100 mM MgCl2, the complex was concentrated to a final concentration of 2 to 3 mg/ml by ultrafiltration. Nonhydrolyzable nucleotide triphosphates were added last to form the ternary complexes. Crystallization experiments were performed by the hanging drop vapor diffusion technique at 18 °C using a sparse matrix screen (Hampton Research) (50). One microliter of the complex solution was mixed with 1 μl of reservoir solution and equilibrated against 500 μl reservoir wells. Crystals appeared in droplets containing 0.1 M MES buffer (pH 5.5) containing 5 mM MgCl2 and 21 to 23% (w/v) PEG 2000 MME within 1 day and were harvested after 1 week. Crystals were mounted in nylon loops, cryo-protected in reservoir solution containing 22% glycerol (v/v), and frozen in liquid nitrogen.
X-ray diffraction data collection, structure determination, and refinement
Diffraction data were collected on the 21-ID-G beamline of the Life Sciences Collaborative Access Team (LS-CAT) at the Advanced Photon Source, Argonne National Laboratory. All data were integrated and scaled with the program HKL2000 (51). The structures were determined by the molecular replacement technique with the program MOLREP (52, 53), using the protein portion of structure of the hpol η complex with PDB ID 4O3N (42) as the search model. Structure refinement and model building were carried out with the programs Refmac (54) and COOT (55), respectively. Illustrations were prepared with the program UCSF Chimera (56). Final refinement parameters are summarized in Table 2. The quality of the model of the dAMPnPP complex structure is somewhat inferior compared with that of the dCMPnPP complex structure, in terms of both resolution and refinement parameters. However, we are confident that this structure too provides valid insights into the mechanism of hpol η-catalyzed bypass of the 1,N2-ε-G lesion.
Data availability
With the exception of the Protein Data Bank coordinates (PDB 5F9I, 5F9N), all data is contained within the article and Supporting Information. The atomic coordinates and structure factors have been deposited in the Protein Data Bank (http://wwpdb.org/): PDB ID codes 5F9I (insertion 1,N2-ε-G opposite dAMPnPP) and 5F9N (insertion 1,N2-ε-G opposite dCMPnPP).
Supporting information
This article contains supporting information.
Conflict of interest
The authors declare that they have no conflicts of interest with the contents of this article.
Acknowledgments
Vanderbilt University is a member institution of the Life Sciences Collaborative Access Team at sector 21 of the Advanced Photon Source, Argonne, IL. Use of the Advanced Photon Source at Argonne National Laboratory was supported by the United States Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract DE-AC02-06CH11357. Thanks are also extended to K. Trisler for assistance in preparation of the article.
Author contributions
A. P. crystallized the protein complexes and solved the structures, J. R. M. carried out synthesis, purification, and characterization of 1,N2-ε-G modified phosphoramidite and the modified oligonucleotides and full-length and single-nucleotide insertion as well as extension assays. P. P. G. did the LC-MS/MS analysis, C. J. R., F. P. G., and M. E. conceived the studies and wrote the paper, along with J. R. M., P. P. G., and A. P.
Funding and additional information
This work was supported, in whole or in part, by National Institutes of Health Grants R01 ES026955 (to F. P. G. and M. E.), R01 ES010546 (to F. P. G.), P01 CA160032 (to M. E. and C. J. R.), and Vanderbilt-Ingram Cancer Center (P30 CA-068485). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Edited by Patrick Sung
Supporting information
References
- 1.Barbin A. Etheno-adduct-forming chemicals: From mutagenicity testing to tumor mutation spectra. Mutat. Res. 2000;462:55–69. doi: 10.1016/s1383-5742(00)00014-4. [DOI] [PubMed] [Google Scholar]
- 2.Jahnz-Wechmann Z., Framski G.R., Januszczyk P.A., Boryski J. Base-modified nucleosides: Etheno derivatives. Front. Chem. 2016;4:19. doi: 10.3389/fchem.2016.00019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Singer B., Bartsch H., editors. Exocyclic DNA Adducts in Mutagenesis and Carcinogenesis, No. 150. International Agency for Research on Cancer Scientific Publications; Lyon, France: 1999. [Google Scholar]
- 4.Guengerich F.P., Kim D.-H., Iwasaki M. Role of human cytochrome P-450 IIE1 in the oxidation of many low molecular weight cancer suspects. Chem. Res. Toxicol. 1991;4:168–179. doi: 10.1021/tx00020a008. [DOI] [PubMed] [Google Scholar]
- 5.Guengerich F.P., Persmark M., Humphreys W.G. Formation of 1,N2- and N2,3-ethenoguanine derivatives from 2-halooxiranes: Isotopic labeling studies and formation of a hemiaminal derivative of N2-(2-oxoethyl)guanine. Chem. Res. Toxicol. 1993;6:635–648. doi: 10.1021/tx00035a008. [DOI] [PubMed] [Google Scholar]
- 6.Hecht S.S., Young-Sciame R., Chung F.L. Reaction of α-acetoxy-N-nitrosopiperidine with deoxyguanosine: Oxygen-dependent formation of 4-oxo-2-pentenal and a 1,N2-ethenodeoxyguanosine adduct. Chem. Res. Toxicol. 1992;5:706–712. doi: 10.1021/tx00029a018. [DOI] [PubMed] [Google Scholar]
- 7.Kronberg L., Sjöholm R., Karlsson S. Formation of 3,N4-ethenocytidine, 1,N6-ethenoadenosine, and 1,N2-ethenoguanosine in reactions of mucochloric acid with nucleosides. Chem. Res. Toxicol. 1992;5:852–855. doi: 10.1021/tx00030a019. [DOI] [PubMed] [Google Scholar]
- 8.Sodum R.S., Chung F.L. 1,N2-Ethenodeoxyguanosine as a potential marker for DNA adduct formation by trans-4-hydroxy-2-nonenal. Cancer Res. 1988;48:320–323. [PubMed] [Google Scholar]
- 9.Morinello E.J., Ham A.J.L., Ranasinghe A., Sangaiah R., Swenberg J.A. Simultaneous quantitation of N2,3-ethenoguanine and 1,N2-ethenoguanine with an immunoaffinity/gas chromatography/high resolution mass spectrometry assay. Chem. Res. Toxicol. 2001;14:327–334. doi: 10.1021/tx0002076. [DOI] [PubMed] [Google Scholar]
- 10.Lee S.H., Arora J.A., Oe T., Blair I.A. 4-Hydroperoxy-2-nonenal-induced formation of 1,N2-etheno-2'-deoxyguanosine adducts. Chem. Res. Toxicol. 2005;18:780–786. doi: 10.1021/tx0497088. [DOI] [PubMed] [Google Scholar]
- 11.Petrova K.V., Jalluri R.S., Kozekov I.D., Rizzo C.J. Mechanism of 1,N2-etheno-2'-deoxyguanosine formation from epoxyaldehydes. Chem. Res. Toxicol. 2007;20:1685–1692. doi: 10.1021/tx7001433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Linhart K.-B., Glassen K., Peccerella T., Waldherr R., Linhart H., Bartsch H., Seitz H.K. The generation of carcinogenic etheno-DNA adducts in the liver of patients with nonalcoholic fatty liver disease. Hepatobiliary Surg. Nutr. 2015;4:117–123. doi: 10.3978/j.issn.2304-3881.2015.01.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sattsangi P.D., Leonard N.J., Frihart C.R. 1,N2-Ethenoguanine and N2,3-ethenoguanine. Synthesis and comparison of the electronic spectral properties of these linear and angular triheterocycles related to the Y bases. J. Org. Chem. 1977;42:3292–3296. doi: 10.1021/jo00440a020. [DOI] [PubMed] [Google Scholar]
- 14.Chen H.J., Chiu W.L. Association between cigarette smoking and urinary excretion of 1,N2-ethenoguanine measured by isotope dilution liquid chromatography-electrospray ionization/tandem mass spectrometry. Chem. Res. Toxicol. 2005;18:1593–1599. doi: 10.1021/tx050145p. [DOI] [PubMed] [Google Scholar]
- 15.Langouët S., Müller M., Guengerich F.P. Misincorporation of dNTPs opposite 1,N2-ethenoguanine and 5,6,7,9-tetrahydro-7-hydroxy-9-oxoimidazo[1,2-a]purine in oligonucleotides by Escherichia coli polymerases I exo- and II exo-, T7 polymerase exo-, human immunodeficiency virus-1 reverse transcriptase, and rat polymerase β. Biochemistry. 1997;36:6069–6079. doi: 10.1021/bi962526v. [DOI] [PubMed] [Google Scholar]
- 16.Langouët S., Mican A.N., Müller M., Fink S.P., Marnett L.J., Muhle S.A., Guengerich F.P. Misincorporation of nucleotides opposite five-membered exocyclic ring guanine derivatives by Escherichia coli polymerases in vitro and in vivo: 1,N2-ethenoguanine, 5,6,7,9-tetrahydro-9-oxoimidazo[1,2-a]purine, and 5,6,7,9-tetrahydro-7-hydroxy-9-oxoimidazo[1,2-a]purine. Biochemistry. 1998;37:5184–5193. doi: 10.1021/bi972327r. [DOI] [PubMed] [Google Scholar]
- 17.Akasaka S., Guengerich F.P. Mutagenicity of site-specifically located 1,N2-ethenoguanine in Chinese hamster ovary cell chromosomal DNA. Chem. Res. Toxicol. 1999;12:501–507. doi: 10.1021/tx980259j. [DOI] [PubMed] [Google Scholar]
- 18.Saparbaev M., Langouët S., Privezentev C.V., Guengerich F.P., Cai H., Elder R.H., Laval J. 1,N2-Ethenoguanine, a mutagenic DNA adduct, is a primary substrate of Escherichia coli mismatch-specific uracil-DNA glycosylase and human alkyl-N-purine DNA glycosylase. J. Biol. Chem. 2002;277:26987–26993. doi: 10.1074/jbc.M111100200. [DOI] [PubMed] [Google Scholar]
- 19.Bartsch H., Nair J., Velic I. Etheno-DNA base adducts as tools in human cancer aetiology and chemoprevention. Eur. J. Cancer Prev. 1997;6:529–534. doi: 10.1097/00008469-199712000-00007. [DOI] [PubMed] [Google Scholar]
- 20.Nair J., Barbin A., Velic I., Bartsch H. Etheno DNA-base adducts from endogenous reactive species. Mutat. Res. 1999;424:59–69. doi: 10.1016/s0027-5107(99)00008-1. [DOI] [PubMed] [Google Scholar]
- 21.Dechakhamphu S., Yongvanit P., Nair J., Pinlaor S., Sitthithaworn S., Bartsch H. High excretion of etheno adducts in liver fluke-infected patients: Protection by praziquantel against DNA damage. Cancer Epidemiol. Biomarkers Prev. 2008;17:1658–1664. doi: 10.1158/1055-9965.EPI-08-0191. [DOI] [PubMed] [Google Scholar]
- 22.VanderVeen L.A., Druckova A., Riggins J.N., Sorrells J.L., Guengerich F.P., Marnett L.J. Differential DNA recognition and cleavage by EcoRI dependent on the dynamic equilibrium between two forms of the malondialdehyde-deoxyguanosine adducts. Biochemistry. 2004;44:5024–5033. doi: 10.1021/bi0472898. [DOI] [PubMed] [Google Scholar]
- 23.Guengerich F.P. Exocyclic adducts. In: Wells R.D., Bond J.S., Klinman J., Masters B.S.S., Bell E., Kaguni L.S., editors. Molecular Life Sciences. Springer; New York, NY: 2018. pp. 379–383. [Google Scholar]
- 24.Vaisman A., Woodgate R. Translesion DNA polymerases in eukaryotes: What makes them tick? Crit. Rev. Biochem. Mol. Biol. 2017;52:274–303. doi: 10.1080/10409238.2017.1291576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yang W., Wodgate R. What a difference a decade makes: Insights into translesion DNA synthesis. Proc. Natl. Acad. Sci. U. S. A. 2007;104:15591–15598. doi: 10.1073/pnas.0704219104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Prakash S., Johnson R.E., Prakash L. Eukaryotic translesion synthesis DNA polymerases: Specificity of structure and function. Annu. Rev. Biochem. 2005;74:317–353. doi: 10.1146/annurev.biochem.74.082803.133250. [DOI] [PubMed] [Google Scholar]
- 27.Yang I.-Y., Miller H., Wang Z., Frank E.G., Ohmori H., Hanaoka F., Moriya M. Mammalian translesion DNA synthesis across an acrolein-derived deoxyguanosine adduct. Participation of DNA polymerase η in error-prone synthesis in human cells. J. Biol. Chem. 2003;278:13989–13994. doi: 10.1074/jbc.M212535200. [DOI] [PubMed] [Google Scholar]
- 28.Ohmori H., Friedberg E.C., Fuchs R.P.P., Goodman M.F., Hanaoka F., Hinkle D., Kunkel T.A., Lawrence C.W., Livneh Z., Nohmi T., Prakash L., Prakash S., Todo T., Walker G.C., Wang Z.G. The Y-family of DNA polymerases. Mol. Cell. 2001;8:7–8. doi: 10.1016/s1097-2765(01)00278-7. [DOI] [PubMed] [Google Scholar]
- 29.Boudsocq F., Iwai S., Hanaoka F., Woodgate R. Sulfolobus solfataricus P2 DNA polymerase IV (Dpo4): An archaeal DinB-like DNA polymerase with lesion-bypass properties akin to eukaryotic pol eta. Nucleic Acids Res. 2001;29:4607–4616. doi: 10.1093/nar/29.22.4607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Eoff R.L., Egli M., Guengerich F.P. Impact of chemical adducts on translesion synthesis in replicative and bypass polymerases: From structure to kinetics. In: Geacintov N.E., Broyde S., editors. The Chemical Biology of DNA Damage. Wiley-VCH; Weinheim, Germany: 2010. pp. 299–330. [Google Scholar]
- 31.Zang H., Goodenough A.K., Choi J.-Y., Irminia A., Loukachevitch L.V., Kozekov I.D., Angel K.C., Rizzo C.J., Egli M., Guengerich F.P. DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4. Analysis and crystal structures of multiple base-pair substitution and frameshift product with the adduct 1,N2-ethenoguanine. J. Biol. Chem. 2005;280:29750–29764. doi: 10.1074/jbc.M504756200. [DOI] [PubMed] [Google Scholar]
- 32.Choi J.-Y., Zang H., Angel K.C., Kozekov I.D., Goodenough A.K., Rizzo C.J., Guengerich F.P. Translesion synthesis across 1,N2-ethenoguanine by human DNA polymerases. Chem. Res. Toxicol. 2006;19:879–886. doi: 10.1021/tx060051v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Patra A., Zhang Q., Lei L., Su Y., Egli M., Guengerich F.P. Structural and kinetic analysis of nucleoside triphosphate incorporation opposite an abasic site by human translesion DNA polymerase η. J. Biol. Chem. 2015;290:8028–8038. doi: 10.1074/jbc.M115.637561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Patra A., Su Y., Zhang Q., Johnson K.M., Guengerich F.P., Egli M. Structural and kinetic analysis of miscoding opposite the DNA adduct 1,N6-ethenodeoxyadenosine by human translesion DNA polymerase η. J. Biol. Chem. 2016;291:14134–14145. doi: 10.1074/jbc.M116.732487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Njuma O.J., Su Y., Guengerich F.P. The abundant DNA adduct N7-methyl deoxyguanosine contributes to miscoding during replication by human DNA polymerase η. J. Biol. Chem. 2019;294:10253–10265. doi: 10.1074/jbc.RA119.008986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Chowdhury G., Guengerich F.P. Liquid chromatography-mass spectrometry analysis of DNA polymerase reaction products. Curr. Protoc. Nucleic Acid Chem. 2011;47:7.16.1–7.16.11. doi: 10.1002/0471142700.nc0716s47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ghodke P.P., Guengerich F.P. Impact of 1,N6-ethenoadenosine, a damaged ribonucleotide in DNA, on translesion synthesis and repair. J. Biol. Chem. 2020;295:6092–6107. doi: 10.1074/jbc.RA120.012829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Su Y., Patra A., Harp J., Egli M., Guengerich F.P. Roles of residues Arg-61 and Gln-38 of human DNA polymerase η in bypass of deoxyguanosine and 7,8-dihydro-8-oxo-2'-deoxyguanosine. J. Biol. Chem. 2015;290:15921–15933. doi: 10.1074/jbc.M115.653691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Patra A., Zhang Q., Guengerich F.P., Egli M. Mechanisms of insertion of dCTP and dTTP opposite the DNA lesion O6-methyl-2'-deoxyguanosine by human DNA polymerase η. J. Biol. Chem. 2016;291:24304–24313. doi: 10.1074/jbc.M116.755462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yang W. An overview of Y-family DNA polymerases and a case study of human DNA polymerase η. Biochemistry. 2014;53:2793–2803. doi: 10.1021/bi500019s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Cruet-Hennequart S., Gallagher K., Sokòl A.M., Villalan S., Prendergast A.M., Carty M.P. DNA polymerase η, a key protein in translesion synthesis in human cells. Subcell. Biochem. 2009;50:189–209. doi: 10.1007/978-90-481-3471-7_10. [DOI] [PubMed] [Google Scholar]
- 42.Patra A., Nagy L.D., Zhang Q., Su Y., Müller L., Guengerich F.P., Egli M. Kinetics, structure and mechanism of 8-oxo-7,8-dihydro-2'-deoxyguanosine bypass by human DNA polymerase η. J. Biol. Chem. 2014;289:16867–16882. doi: 10.1074/jbc.M114.551820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.O'Flaherty D.K., Patra A., Su Y., Guengerich F.P., Egli M., Wilds C.J. Lesion orientation of O4-alkylthymidine influences replication by human polymerase η. Chem. Sci. 2016;7:4896–4904. doi: 10.1039/c6sc00666c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Patra A., Banerjee S., Johnson Salyard T., Malik C., Christov P., Rizzo C.J., Stone M.P., Egli M. Structural basis for error-free bypass of the 5-N-methylformamidopyrimidine-dG lesion by human DNA polymerase η and Sulfolobus solfataricus P2 polymerase IV (Dpo4) J. Am. Chem. Soc. 2015;137:7011–7014. doi: 10.1021/jacs.5b02701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Patra A., Politica D.A., Chatterjee A., Tokarsky E.J., Suo Z., Basu A., Stone M.P., Egli M. Mechanism of error-free bypass of the environmental carcinogen N-(2'-deoxyguanosin-8-yl)-3-aminobenzanthrone adduct by human DNA polymerase η. ChemBioChem. 2016;17:2033–2037. doi: 10.1002/cbic.201600420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zang H., Irimia A., Choi J.-Y., Angel K.C., Loukachevitch L.V., Egli M., Guengerich F.P. Efficient and high fidelity incorporation of dCTP opposite 7,8-dihydro-8-oxodeoxyguanosine by Sulfolobus solfataricus DNA polymerase Dpo4. J. Biol. Chem. 2006;281:2358–2372. doi: 10.1074/jbc.M510889200. [DOI] [PubMed] [Google Scholar]
- 47.Biertümpfel C., Zhao Y., Kondo Y., Ramon-Maiques S., Gregory M., Lee J.Y., Masutani C., Lehmann A.R., Hanaoka F., Yang W. Structure and mechanism of human DNA polymerase eta. Nature. 2010;465:1044–1048. doi: 10.1038/nature09196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Goodenough A.K., Kozekov I.D., Zang H., Choi J.-Y., Guengerich F.P., Harris T.M., Rizzo C.J. Site specific synthesis and polymerase bypass of oligonucleotides containing a 6-hydroxy-3,5,6,7-tetrahydro-9H-imidazo[1,2-a]purin-9-one base, an intermediate in the formation of 1,N2-etheno-2'-deoxyguanosine. Chem. Res. Toxicol. 2005;18:1701–1714. doi: 10.1021/tx050141k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Shanmugam G., Kozekov I.D., Guengerich F.P., Rizzo C.J., Stone M.P. Structure of the 1,N2-etheno-2′-deoxyguanosine adduct in duplex DNA at pH 8.6. Chem. Res. Toxicol. 2007;20:1601–1611. doi: 10.1021/tx7001788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Jancarik J., Kim S.-H. Sparse matrix sampling: A screening method for crystallization of proteins. J. Appl. Cryst. 1991;24:409–411. [Google Scholar]
- 51.Otwinowski Z., Minor W. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
- 52.Vagin A., Teplyakov A. MOLREP: An automated program for molecular replacement. J. Appl. Cryst. 1997;30:1022–1025. [Google Scholar]
- 53.Collaborative Computational Project, Number 4 The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. D Biol. Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
- 54.Vagin A.A., Steiner R.S., Lebedev A.A., Potterton L., McNicholas S., Long F., Murshudov G.N. REFMAC5 dictionary: Organization of prior chemical knowledge and guidelines for its use. Acta Crystallogr. D Biol. Crystallogr. 2004;60:2284–2295. doi: 10.1107/S0907444904023510. [DOI] [PubMed] [Google Scholar]
- 55.Emsley P., Cowtan K. Coot: Model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
- 56.Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E. UCSF Chimera--a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
With the exception of the Protein Data Bank coordinates (PDB 5F9I, 5F9N), all data is contained within the article and Supporting Information. The atomic coordinates and structure factors have been deposited in the Protein Data Bank (http://wwpdb.org/): PDB ID codes 5F9I (insertion 1,N2-ε-G opposite dAMPnPP) and 5F9N (insertion 1,N2-ε-G opposite dCMPnPP).