Abstract
Background
Uracil-DNA glycosylases are evolutionarily conserved DNA repair enzymes. However, vaccinia virus uracil-DNA glycosylase (known as D4), also serves as an intrinsic and essential component of the processive DNA polymerase complex during DNA replication. In this complex D4 binds to a unique poxvirus specific protein A20 which tethers it to the DNA polymerase. At the replication fork the DNA scanning and repair function of D4 is coupled with DNA replication. So far, DNA-binding to D4 has not been structurally characterized.
Results
This manuscript describes the first structure of a DNA-complex of a uracil-DNA glycosylase from the poxvirus family. This also represents the first structure of a uracil DNA glycosylase in complex with an undamaged DNA. In the asymmetric unit two D4 subunits bind simultaneously to complementary strands of the DNA double helix. Each D4 subunit interacts mainly with the central region of one strand. DNA binds to the opposite side of the A20-binding surface on D4. Comparison of the present structure with the structure of uracil-containing DNA-bound human uracil-DNA glycosylase suggests that for DNA binding and uracil removal D4 employs a unique set of residues and motifs that are highly conserved within the poxvirus family but different in other organisms.
Conclusion
The first structure of D4 bound to a truly non-specific undamaged double-stranded DNA suggests that initial binding of DNA may involve multiple non-specific interactions between the protein and the phosphate backbone.
Electronic supplementary material
The online version of this article (doi:10.1186/s12900-015-0037-1) contains supplementary material, which is available to authorized users.
Keywords: Protein-DNA structure, Non-specific DNA, Early DNA recognition complex, Uracil-DNA glycosylase, Poxvirus
Background
Repair of damages in DNA is an essential cellular process regulated by specialized molecular machinery. Base excision repair (BER) pathway [1] for repair of small lesions is initiated by monofunctional DNA glycosylases. These enzymes use a water molecule as a nucleophile to cleave the N-glycosidic bond between the target base and deoxyribose, releasing the damaged base and leaving an apurinic/apyrimidinic (AP) site [2]. The BER pathway for the removal of uracil (Ura), which arises in DNA from deamination of cytosine (Cyt) or incorporation of dUTP during DNA synthesis, is initiated by uracil-DNA glycosylases (UDGs) [3]. UDGs are divided in different families based on their substrate specificity [4]. Members of family I-V share similar structure and motifs. Family I UDGs, which are also known as UNGs, specifically excise uracil from single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) with the preference ssU > dsU:G > dsU:A [5]. UNGs are ubiquitous enzymes that use highly conserved motifs for DNA binding, uracil recognition and excision.
The reaction mechanism for UNGs has been elucidated from a series of elegant structural studies in which wild-type and catalytically inactive mutant UNGs were captured in the DNA-bound form at various stages of action [6–12]. For detection of uracil in DNA, UNGs use a ‘pinch-push-pull’ mechanism, which involves a multi-step base flipping process. In the first step, serine and proline residues in three different loops (the ‘Pro-rich loop’, the ‘Gly-Ser loop’ and the ‘Leu-intercalation loop’) lead to a slight bending of the DNA through compression (‘pinch’) of the backbone. In the second step, a conserved leucine residue of the ‘Leu-intercalation loop’ penetrates into the minor groove and the DNA becomes fully bent and kinked leading to flipping of the uracil base (‘push’). The uracil nucleotide interacts with the ‘uracil recognition pocket’ where cleavage of the glycosidic bond takes place, and in the final step, the leucine residue is retracted (‘pull’). Two catalytic residues, an aspartic acid and a histidine, are found invariant in all UNGs (Table 1).
Table 1.
1FLZ (EcUNG) | 1SSP (hUNG) | 4QCB (vUNG) | |
---|---|---|---|
Catalytic water-activating loop | 62-GQDPYH-67 | 143-GQDPYH-148 | 66-GIDPYP-71 |
Pro-rich loop | 84-AIPPS-88 | 165-PPPPS-169 | 84-FTKKS-88 |
Uracil specificity β-strand | 120-LLLN-123 | 201-LLLN-204 | 117-IPWN-120 |
Gly-Ser loop | 165-GS-166 | 246-GS-247 | 160-KT-161 |
Leu intercalation loop | 187-HPSPLSVYR-195 | 268-HPSPLSAHR-276 | 181-HPAARDR-187 |
Active site residues | D64, Y66, F77, N123, H187, L191 | D145, Y147, F158, N204, H268, L272 | D68, Y70, F79, N120, H181, R185 |
Motifs as previously listed [13] are updated based on structural superimposition using the new D4-DNA complex structure
UNGs of poxviruses are, however, exceptional and most diverse members of this family. The motifs used by these enzymes are fully conserved in all orthopoxviruses but differ significantly from their counterparts in other organisms (Additional file 1: Table S1) [4, 13–15]. More importantly, UNGs of poxviruses assume a novel and essential role in viral replication and serve as an intrinsic component of the replication machinery. In the prototypic poxvirus Vaccinia, UNG (known as D4) binds to another viral protein A20 to form the heterodimeric processivity factor. A20 also binds to the DNA polymerase E9 and thereby tethers D4 to E9 [15–20] and assembles the heterotrimeric (D4:A20:E9) core of the processive polymerase complex. In the absence of D4, Vaccinia E9 synthesizes only short (<10 nucleotides) stretches of DNA [18, 21]. This essential role of D4 in supporting processive DNA synthesis is not dependent on its catalytic (glycosylase) activity [15].
Crystal structures of D4 have been reported in the free form [13,22; 2owr1,12owq, 3 nt7, 4dof, 4dog] and as a non-productive22 uracil complex [14; 4lzb]. Three dimensional structure of D4 retains the overall fold of UNG proteins- a central 4-stranded parallel β-sheet with α-helices on both sides (Fig. 1). There are two additional β-sheets in D4 (residues 2-6/12-16 and 107-109/215-217) not seen in any other UNG (Figs. 1a, b). Although the UNG-specific motifs are very different, the architecture of the uracil-binding pocket in D4 is remarkably similar to other UNGs and five of the six residues forming the pocket are identical. These residues include the catalytic Asp68 and His181, which are appropriately positioned for binding uracil and cleavage of the glycosidic bond [14]. Considering the unique properties of poxvirus UNGs, it should be highly interesting to understand how these proteins adapt to utilizing the altered motifs for binding DNA, and recognizing and excising uracil.
All crystal structures of free-D4 and the structure of the uracil complex feature a characteristic dimeric packing. Although specific protein-protein interactions between D4 subunits vary to some extent the dimer interface is very similar in these structures [13, 14, 22]. This homodimer interface is formed by residues located in the C-terminal β-strand of the central β-sheet in each subunit, a flexible loop/short helix (residues 161–173) and a long helix (187–206) located near the C-terminus. The first site, which is located in a loop connecting the two C-terminal β-strands, represents one of the most flexible areas in the D4 structures (Fig. 1a). At high protein concentration D4 utilizes the same interface for dimerization in solution. However, when A20 is present, very specific interaction at this surface leads to the formation of a D4:A20 heterodimer (Fig. 1c) [19, 20]. Cumulative structural and biochemical evidences indicate that in the processive DNA polymerase complex, D4 remains catalytically active and possibly continuously scans and removes uracil from newly synthesized DNA [18–20]. Details of D4’s ability to bind and scan DNA, its enzymatic function and substrate specificity have been reported [22, 23]. However, structural characterization of the DNA-binding site(s) on D4 is missing.
Here, we report the first detailed view of D4-DNA interactions from a crystal structure of D4 in complex with a non-specific undamaged dsDNA. The structure of this complex for the first time reveals the DNA-binding interactions by an UNG from the poxvirus family. It also represents the first three-dimensional structure of a UNG bound to a truly undamaged dsDNA and provides a representative snapshot of the initial interaction between UNG and DNA. We also present a structural comparison of D4 and human UNG (hUNG) for which crystal structures are available in the DNA-free state, in complex with uracil-containing DNA and in complex with a DNA containing an abasic site [6–8, 10].
Results and discussion
Overall quality of the D4-DNA complex structure
The D4-DNA structure was refined to 2.89 Å resolution with Rcryst and Rfree values of 21.6 and 26.5 %, respectively. The overall quality of the structural model is good as indicated by MolProbity scores [24] and map correlation coefficients (Table 2). The final model consists of two D4 subunits (chains A and B), a DNA double helix with 12 nucleotides in chain C and 10 in chain D, 4 glycerol (GOL) molecules of which one is in each active site and 118 solvent atoms. Glycerol molecules have been previously observed in the active site of D4 (2owq, 2owr) and E. coli UNG (3eug). Quality of electron density for residues 184–188 of the ‘Leu-intercalation’ loop was poor. Therefore, these five residues were not modeled. As a result, the discussion of the ‘Leu-intercalation’ loop for this structure is limited to interactions of loop residues 180–183. Also, side chain atoms of residues 189 and 190 (except for CB) are not included in the final model due to lack of electron density. These residues are located in a flexible area in crystal structures of D4 [13, 14].
Table 2.
Wavelength [Å] | 0.97918 |
Space group | P212121 |
Unit cell parameters [Å] | a = 39.40, b = 92.32, c = 142.88 |
Data collection statistics | |
Resolution limit [Å] | 56.50-2.89 (3.05-2.89)a |
Rmerge b,c | 0.158 (0.505) |
Rmeas b,c | 0.190 (0.609) |
Rpim b,c | 0.105 (0.335) |
Total number of observations | 35850 (5327) |
Total number unique | 11991 (1750) |
Mean I/σ (I) | 6.1 (2.3) |
CC1/2 | 0.986 (0.855) |
CC* | 0.996 (0.960) |
Completeness [%] | 97.9 (99.3) |
Multiplicity | 3.0 (3.0) |
Refinement statistics | |
Resolution range (Å) | 56.50 - 2.89 (2.97 - 2.89)a |
Number of unique reflections | 11966 (886) |
Completeness (%) | 97.3 (99.1) |
Rcryst (%)d | 21.6 (29.7) |
Rfree (%)d | 26.5 (32.6) |
No. of protein residues | 426 |
No. of DNA nucleotides | 22 |
No. of GOL molecules | 4 |
No. of water molecules | 118 |
Wilson B-factor (Å2) | 29.4 |
Average B-factors (Å2) | |
Overall | 31.4 |
Protein | 28.5 |
DNA | 56.5 |
GOL | 36.6 |
Water | 18.6 |
Coordinate error (maximum likelihood) | 0.39 |
Correlation coefficient Fo-Fc | 0.93 |
Correlation coefficient Fo-Fc free | 0.88 |
Overall map CC (Fc, 2mFo-DFc) | 0.82e,f |
Ramachandran allowed (%) | 99.5 |
Ramachandran disallowed (%) | 0.5 [2 outliers, 1 in each subunit] |
MolProbity clash score | 8.7 [97th percentile] |
MolProbity score | 1.9 [99th percentile] |
aValues in parentheses represent highest resolution shell
bRmeasand Rpimwere calculated with SCALA [38] in the CCP4 program suite [39] using unmerged and not scaled data preprocessed by XDS [36, 37]. Rmeasis a merging R-factor independent of data redundancy while Rpim provides the precision of the averaged measurement, which improves with higher multiplicity [48]
cR values for the low resolution shell of 56.50-9.14 Å are: Rmerge 0.053; Rmeas 0.065; Rpim 0.037
dThe data included in the Rfree set (5 %) were excluded from refinement
eFinal R and Rfree values based on map calculation in PHENIX [45, 46] are 20.8 % and 24.7 %, respectively
fFinal R and Rfree values based on comprehensive validation in PHENIX [45, 46] are 21.5 % and 26.8 %, respectively
In chain C all 12 nucleotides were modeled but only 10 could be modeled in chain D due to insufficient electron density. Of the 10 base pairs forming the DNA double helix, 9 show regular Watson-Crick (WC) base pairing interactions and one non-WC interaction (Fig. 2; Additional file 1: Table S2). In general electron density for the DNA is good for the central regions (C: nucleotides 5–8; D: nucleotides 23–26) that interact with protein residues as indicated by map correlation coefficients (0.92-0.96; average 0.94) and temperature factors (30.5-46.3 Å2; average 37.6 Å2). The overall quality of the electron density for the DNA double helix is displayed in Additional file 1: Figure S1.
A comparison of map correlation coefficients of the DNA nucleotides and the interface residues in D4 is presented in Additional file 1: Table S3. An electron density map (2mFo-DFc map contoured at 1.5σ level) for both D4-DNA interfaces (chains A and D; chains B and C) is displayed in Additional file 1: Figure S2.
As expected the DNA represents a B-form right-handed double helix. Puckering of the sugar ring for several nucleotides differs from the predominant C2’-endo conformation seen in B-DNA (Additional file 1: Table S4). Similar deviations in sugar puckering conformation have been observed in various protein-DNA complexes.
Assembly of D4-DNA complex
DNA parameters derived from the analysis using the w3DNA web server (http://w3dna.rutgers.edu/) [25] are presented in Additional file 1: Tables S2 and S4. Analyses of DNA-DNA, protein-DNA and protein-protein interfaces using the PDBePISA web server (http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html) [26] are shown in Table 3 and Additional file 1: Table S5.
Table 3.
1. D4-DNA (chains A, D) | 2. D4-DNA (chains B, C) |
#1 D:DA24 [OP2] 2.69 A:LYS131 [NZ] | #1 C:DA5 [OP1] 3.76 B:THR130 [N] |
#2 D:DA25 [OP1] 3.10 A:LYS160 [N] | #2 C:DC6 [OP2] 2.59 B:LYS131 [NZ] |
#3 D:DA25 [OP2] 3.20 A:LYS160 [N] | #3 C:DG7 [OP1] 3.03 B:LYS160 [N] |
#4 D:DA25 [OP2] 2.97 A:THR161 [N] | #4 C:DG7 [OP2] 3.22 B:LYS160 [N] |
#5 D:DC26 [OP1] 3.13 A:TYR180 [OH] | #5 C:DG7 [OP2] 2.95 B:THR161 [N] |
#6 D:DA25 [OP1] 2.88 A:HIS181 [N] | #6 C:DT8 [OP1] 2.76 B:TYR180 [OH] |
#7 C:DG7 [OP1] 2.91 B:HIS181 [N] | |
3. D4-DNA (chains A, C) | |
#1 C:DC12 [OP1] 3.00 A:LYS87 [NZ] |
Conformations of the two D4 chains in the complex are also very similar with an rmsd value of 0.15 Å for superposition of all 213 residues in A and B. Each D4 chain binds only to one DNA strand. D4 chain A binds to DNA strand D and D4 chain B binds to DNA strand C (Figs. 2a, b). The electrostatic potential distribution shown in Fig. 2c illustrates the charge complementarity of the DNA binding surface of D4 and the negatively charged phosphate backbone of the DNA strands.
D4 chain B interacts with nucleotides 5–8 (ACGT) in the central part of chain C while D4 chain A interacts with nucleotides 23–26 (AAAC) of chain D. D4 residues at the protein-DNA interface are Ile67, Pro71, Gly128, Glu129, Thr130, Lys131, Gly159, Lys160, Thr161, Asp162, Tyr180, His181 and Ala183 (Table 3 and Additional file 1: Table S5). Polar protein-DNA interactions with the phosphate backbone involve hydrogen-bonds (2.6-3.2 Å) with Lys131, Lys160, Thr161, Tyr180 and His181 from each D4 chain (Table 3; Fig. 2). DNA binding residues in the two D4 chains superimpose very well (rmsd 0.13 Å).
Distortion in the DNA chains C and D such as bending and kinking are small compared to those observed in complexes of hUNG with specific uracil-containing DNA [6–8, 10]. The length of the DNA (~37-38 Å) is comparable to the value of 39–41 Å expected for an ideal extended B-form DNA; the widths of the major and minor groove are also similar to those for ideal B-form DNA.
Incidentally, the DNA double helix extends in the unit cell through non-covalent interactions between the ends of the DNA helices generated by the space group symmetry in a head-to-tail fashion (Additional file 1: Figure S3). Similar extended packing through DNA-DNA contacts of dsDNA with sticky or blunt ends (head-to-head and head-to-tail fashion) has been observed in other protein-DNA structures [27, 28]. On the other hand this arrangement of DNA chains may be a result of crystal packing.
Both protein-DNA interfaces in the complex have similar interface areas and interface residues (Additional file 1: Table S5). Interactions between the two D4 chains in the complex are minimal and confined to residues Glu32, Val33, Ser35, Trp36, Arg39, Ser132, Ile135, Try136 and Lys139. There is a salt bridge between residues Glu32 and Arg39 and a stacking interaction between Trp36 of two subunits.
Conformational changes induced by DNA binding
Upon formation of productive DNA complexes, UNGs undergo an ‘open to close’ conformational transition [6–10]. Such a conformational transition is not expected in the present complex since the DNA in the complex does not contain uracil. D4 residues involved in binding DNA are localized in three structural areas: the extended DNA-binding loop (residues 126–132), the Gly-Ser loop (residues 159–162) and the Leu-intercalation loop (residues 180–187). These three regions are labeled in Fig. 3. Involvement of these sites in DNA-binding and the resulting structural changes in the protein are discussed below. Largest deviations in the DNA complex are noticed in the extended helix/loop segment (residues 164–174,s1 site) joining the two C-terminal strands of the central β-sheet, and in the Leu-intercalation loop (s2 site). These two regions show varying degrees of disorder in D4 structures and portions of these areas are missing from several final models. Therefore, we examined the electron density maps for D4 structures especially focusing on these areas. In the following discussion we used reference free-D4 structures in which completeness of the model in relevant regions of the protein was good and supported by electron density. We also used the structure of the D4:uracil complex (4lzb) for comparison [14]. This structure contains 12 molecules in the asymmetric unit and the s1 and s2 sites could be modeled in 11.
Extended DNA binding loop (residues 126–132)
In the DNA complex the side chain amino group of Lys131 of each D4 subunit forms a strong hydrogen bond (2.6 and 2.7 Å) with a phosphate oxygen atom of the partner DNA strand. However, there is no movement in this region of D4 upon DNA binding. Main chain and side chain atoms of Lys131 and neighboring residues superimpose very well with corresponding atoms in free-D4 structures 4dof, 4dog and 4lzb. In all structures the Lys131 side chain extends towards the spatially close Gly-Ser loop and engages in hydrogen bonding with the carboxyl oxygen atoms of Asp162. As a result, the extended DNA-binding loop tilts slightly towards the Gly-Ser loop (Fig. 4). Quality of electron density for Lys131, Asp162 and areas adjacent to these residues is usually very good in these crystal structures (Additional file 1: Figure S4). In a previously published study, the Lys131Val mutant was found to be defective in processivity function [22]. Since this mutant did not show glycosylase activity it was assumed to have a major conformational defect [22]. Both Lys131 and Asp162 are strictly conserved in UNGs of the orthopoxviruses [15] and thus may play an important role in specialized functions of these viral UNGs, such as a component of the processivity factor. Since neither of these residues is conserved in other UNGs they are less likely to play any direct role in catalysis. On the other hand interactions of Lys131 with Asp162 on the spatially adjacent helix/loop may be important for the stability of the local structure. Therefore the loss of glycosylase activity in the valine mutant may be due to structural impairment. At the same time, D4 surface near Asp162 is critical for its interactions with A20 [22] and therefore, structural changes due to the loss of interactions between Lys131 and Asp162 may affect the processivity function.
In a study combining H/D exchange mass spectroscopy and computational docking Roberts et al. [29] identified two extended non-sequence-specific DNA-binding surfaces in hUNG (residues 210–220 and 251–264) that did not show contacts with DNA in the crystal structures. In D4 the corresponding regions are residues 126–136 and 165–178. The latter segment in D4 is engaged in homodimer formation in the absence of A20 and in heterodimer formation in the presence of A20 [20]. Thus this (residue 165–178) may not represent a potential DNA-binding site in D4.
Gly-Ser loop (residues 159–162)
The s2 site encompasses the Gly-Ser loop. This region shows varying degrees of flexibility in different D4 structures but could be modeled in both subunits in the DNA complex. Quality of electron density in this area of D4 was sufficient for unambiguously placing main chain and side chain atoms of all residues. For comparison of this site we selected the free-D4 structure 4dof (chains A, B) and the D4:uracil complex (4lzb, chain D, F). Comparison of the s2 site in the DNA-complex with free-D4 structures reveals a slight reorganization in this area of the molecule (Fig. 5). Two amino acid residues, Lys160 and Thr161, are involved in hydrophilic interactions with the DNA. Main chain nitrogen atom of Lys160 interacts with two oxygen atoms on one phosphate group (Table 3). Thr161 hydroxyl oxygen atom is hydrogen bonded to one of the oxygen atoms of the same phosphate also through the peptide nitrogen atom. Notably, Lys160 and Thr161 are unique residues in poxvirus UNGs and are replaced by glycine and serine respectively in both E. coli and human enzymes. We showed that D4 mutant Lys160Val was deficient in processivity function but retained catalytic activity and DNA binding ability [22]. In all D4 structures, side chain of Lys160 extends toward the neighboring β-strand and its NZ atom forms a hydrogen bond with the peptide oxygen atom of Val178 on this strand. Loss of this contact in the Lys160Val mutant may impact the structure of this area of D4. Notably, a hydrophobic pocket at this site has been shown to be important for D4’s interaction with A20 [20].
Leu-intercalation loop (residues 180–187)
The s3 area, which includes the ‘Leu-intercalation loop’, is important for the catalytic mechanism. Electron density for this segment of D4 structure in 4dog and 3 nt7 was good [22]. In the DNA-complex quality of electron density for this loop was inadequate for modeling the entire loop. Therefore, residues 184–188 were omitted from the final model. Two amino acid residues immediately preceding the disordered region of D4, namely Tyr180 and His181, directly interact with DNA. Side chain hydroxyl group of Tyr180 and the main chain nitrogen atom of His181 form contact with oxygen atoms of two phosphate groups from the DNA (Fig. 6). Tyr180 is conserved in UNGs of poxviruses but not in other UNGs. His181 is a critical catalytic residue and is conserved in UNGs across the species. In 4zlb we noticed some deviations in the position of His181 in different subunits. Generally, in a productive uracil complex, the distance between the NE2 atom of histidine and the O2 atom of uracil is significantly shorter than in non-productive complexes [6–8]. In 4lzb, the distance between O2 atom of uracil and His181 NE2 atom varied from 2.8-4.8 Å in different subunits [14]. In the DNA complex His181 side chain moved farther away from the active site area. The missing segment includes Arg185, which is equivalent to the conserved leucine residue (272 in hUNG) of the Leu-intercalation loop. In the mutational analysis mentioned above D4 Arg187Val mutant was also shown to be unable to support processive DNA synthesis but retained binding to A20 and DNA [22]. Unfortunately, due to disorder in this loop detailed comparison of this functionally significant region is not possible from the present structure.
Movement of Arg167 and implications for A20 binding
Most significant changes in the DNA complex were noticed in residues Arg167 and Pro173 although neither of these residues is involved in direct interaction with DNA. In 4dof and 4lzb, orientation of the Arg167 side chain is stabilized by a strong hydrogen bond between its NH2 atom and the main oxygen atom of Thr176. In both D4 subunits in the DNA complex, Arg167 side chain is directed away from Thr176 (Fig. 7a). The Arg167 side chain is oriented such that nitrogen atoms of the terminal amino groups form hydrogen bonds with the peptide oxygen atom of Val174. Position of Arg167 side chain is further stabilized by polar interactions of its amino groups with the hydroxyl oxygen atom of Thr176 side chain on one side and the main chain oxygen atom of Ser172 on the other (Fig. 7b).
In the crystal structure of the D4:A20 complex, the aromatic side chain of Trp43 of A20 occupies a hydrophobic pocket referred to as groove by Contesto-Richefeu et al. [22] and inserts into the space between Pro173 and Arg167 side chain. Movement of Arg167 side chain and Pro173 in the A20 complex (as compared to free D4 structures) favors this interaction. On the other hand in the DNA-complex, orientation of Arg167 side chain would interfere with binding of A20 at this site. Interestingly, movement of Pro173 in the DNA-complex also seems to create an unfavorable steric environment for binding of A20. While in the D4:A20 complex, Pro173 of D4 moves in a direction that allowed packing of Lys44 side chain of A20, in the DNA complex the carbonyl oxygen Pro173 would be in close contact (~1.5 Å) with the CB atom of Lys44 of A20 (Fig. 7c). Thus the structure of the present DNA complex suggests that DNA binding induces conformational changes in D4 that may interfere with its binding to A20. While D4 can independently bind both DNA and A20, perhaps in the formation of the heterotrimeric complex, D4-A20-interaction precedes the DNA-binding event.
Comparison of hUNG and D4
Crystal structures of hUNG have been reported in the free-enzyme form [1akz; 6] and in complex with damaged DNA containing uracil [1ssp; 6]. Structure of mutant hUNG in which the leucine residue (272) of the Leu-intercalation loop was altered to alanine was described in complex with DNA containing an abasic site [2ssp; 6]. Superimposition and comparison of the structure of hUNG with D4 in their DNA complexes demonstrate the divergences and general overlap in the areas important for DNA binding and UNG catalytic mechanism. The three DNA binding sites described above are shown in Fig. 8. Structural comparison of hUNG in the free-state and in complex with different DNA constructs demonstrated that major conformational changes in UNGs are promoted by the steps involved in generating and stabilizing the flipped out nucleotide upon recognition of uracil [10]. Upon binding an undamaged DNA, these conformational changes are not expected in D4. Superimposition of the structure of the D4:DNA complex and the DNA complex of hUNG (1ssp) reveals that the Gly-Ser loop and the Leu-intercalation loop provide the main interactions with DNA. Part of the Leu-intercalation loop is disordered in the D4:DNA complex. In hUNG, the catalytic histidine (His268 in 1ssp) clearly moves slightly closer towards the uracil binding pocket as compared to the D4:DNA complex. It should be noted that in DNA-free state this histidine residue superimposes well in D4 and hUNG (see Fig. 8). The interactions between the DNA and D4 at the extended DNA-binding site may be facilitated because of the larger size of the bound DNA as compared to hUNG:DNA complexes.
Conclusions
The D4-DNA complex described here provides the structural framework for recognizing the DNA binding residues and motifs in poxvirus UNGs. The process of sequence-specific recognition and binding of DNA includes non-specific interaction as an early or intermediate step [30–33]. The structure with undamaged dsDNA may represent a snapshot of an early DNA-protein interaction in preparation for the recognition complex for UNG [34]. It remains to be seen if the Leu-intercalation loop transitions from its disordered state to an ordered state upon recognition of a damage in the DNA. Some of the unique structural features of D4 have evolved for undertaking its novel role in viral replication. Structure presented here provides potential understanding of DNA-binding by D4 in the poxvirus replication machinery.
Methods
Preparation and crystallization of D4-DNA complex
Custom DNA oligos purified by standard desalting method were ordered from Integrated DNA Technologies (IDT). The 12mer DNA oligo was designed to be self-complementary: 5’-GCA AAC GTT TGC-3’. The DNA oligo was dissolved in annealing buffer (10 mM Tris, pH 7.5, 50 mM NaCl, 1 mM EDTA) at a concentration of 5 mM. Annealing using a Perkin Elmer GeneAmp PCR System 2400 resulted in a blunt-end 12mer double-stranded DNA construct with 12 base pairs (final concentration 2.5 mM):
5’-GCA AAC GTT TGC-3’ (forward strand)
3’-CGT TTG CAA ACG-5’ (reverse complementary strand)
The following annealing protocol was used:
Heating to 368 K in 5 min;
Cooling to 338 K in 5 min;
Cooling to 328 K in 5 min;
Cooling to 318 K in 5 min;
Cooling to 298 K in 5 min;
Cooling to 277 K in 40 min.
DNA oligos and annealed dsDNA were kept at 253 K until use.
Tag-free D4 (tfD4) was purified as previously described [14, 35]. The purified protein contained three vector encoded residues (−GSH) after the hexa-histidine tag was cleaved off.
Protein solution at 3 mg/mL (~120 μM in 25 mM HEPES buffer, pH 7.3, 60 mM KCl, 1 mM TCEP) was incubated with previously annealed self-complementary non-specific dsDNA in a 1:1.2 molar ratio at 277 K for 1 h. High-throughput crystallization screening using the PEGs and Protein Complex Suites from Qiagen was conducted on a Crystal Gryphon (Art Robbins Instruments). Thin plate-like crystals were obtained in hanging drop vapor diffusion at 293 K. The drops contained 1 μL protein plus 0.5 μL reservoir solution containing 16 % PEG6K, 0.08 M TRIS buffer, pH 8.5, 20 % glycerol. For cryo-freezing, crystals were directly plunged into liquid nitrogen.
Data collection and refinement
Intensity data were collected on a Pilatus 6 M detector at the Advanced Photon Source on NE-CAT beamline 24-ID-C. Data were processed with XDS [36, 37] and SCALA [38] of the CCP4 program suite [39] as part of the RAPD data-collection strategy at NE-CAT (https://chem.cornell.edu/rapd/). A total of 120 images (1° width) were collected. While the first 75 frames showed overall good statistics frames 76–120 showed increased mosaicity and Rmerge values. The processed data from frames 1–75 were indexed in the orthorhombic crystal system. Probabilities for Laue group P222 and space group P212121 based on systematic absences were 0.856 and 0.967, respectively. Diffraction data extending to 2.89 Å resolution were used for molecular replacement and refinement. CC1/2 and CC* values at 2.89 Å resolution were 0.86 and 0.96, respectively [40, 41]. Data-collection statistics are listed in Table 1.
The unit-cell parameters (a = 39.4 Å, b = 92.3 Å, c = 142.9 Å; space group P212121) suggested two protein subunits in the asymmetric unit (52 % solvent; Matthews coefficient VM = 2.6). The crystal structure was solved by molecular replacement with Phaser (version 2.5.6) [42] using the coordinates of one subunit from the native D4 structure (4dof) as search model. The observed pseudo-translation vector was used to position the second subunit in the asymmetric unit. Of the alternative space groups in Laue group P222 only space group P212121 provided solutions for the translation function. The R value of the refined solution was 45.2 % (RFZ = 5.8, TFZ = 9.3, LLG = 1803). After 1 round of refinement (10 cycles) of the protein coordinates (Rcryst and Rfree dropped to 29.2 % and 32.9 %, respectively) in REFMAC (version 5.8.0073) [43] electron density for the DNA double helix became visible in sigmaA weighted 2mFo-DFc and mFo-DFc difference electron density maps (m is the figure of merit and D is the sigmaA weighting factor). Using modeling tools in Coot (version 0.8-pre) [44] B-form DNA nucleotides were fitted into the electron density maps. The forward strand (chain C) of the dsDNA is numbered from 1–12 (5’ to 3’) while the reverse complementary strand (chain D) is numbered 21–32 (5’ to 3’). Base pairing occurs between the base of nucleotide 1 and the base of nucleotide 32 and similarly between the nucleotide 12 and 21 (and so forth). For structure refinement we used automatically generated local non-crystallographic symmetry (NCS) restraints in REFMAC. Structure validation was accomplished using PHENIX (version 1.9-1692) [45, 46], MolProbity [24], QualityCheck (http://smb.slac.stanford.edu/jcsg/QC/) [47], and the new wwPDB X-ray validation server (http://wwpdb-validation.wwpdb.org/validservice/). PHENIX and REFMAC were used for final map calculations. Refinement statistics are listed in Table 1. The structure has been deposited in PDB under 4qcb. The difference between R and Rfree, although within reasonable limit for structures in similar resolution, is somewhat higher and may result from incompleteness of the models (for DNA and protein).
Acknowledgements
This work was supported by the National Institutes of Health grant 5U01-A1-082211.
The work is based upon research conducted at the Advanced Photon Source on the Northeastern Collaborative Access Team beamlines, which are supported by a grant from the National Institute of General Medical Sciences (P41 GM103403) from the National Institutes of Health. Use of the Advanced Photon Source, an Office of Science User Facility operated for the U.S. Department of Energy (DOE) Office of Science by Argonne National Laboratory, was supported by the U.S. DOE under Contract No. DE-AC02-06CH11357.
Abbreviations
- D4
Vaccinia uracil-DNA glycosylase
- BER
Base excision repair
- AP
Apurinic/apyrimidinic
- Cyt
Cytosine
- Ura
Uracil
- UDG
Uracil-DNA glycosylase
- UNG
Family I uracil-DNA glycosylase
- ssDNA
Single-stranded DNA
- dsDNA
Double-stranded DNA
- hUNG
Human UNG
- GOL
glycerol
- EDTA
Ethylenediaminetetraacetic acid
- PEG
Polyethylene glycol
Additional files
Footnotes
PDBID’s are shown in italics
Uracil in this complex is not a product of glycosylase activity
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
NS was involved in crystallization, structure determination, structure refinement and manuscript preparation. SB performed data collection and data processing. RR and DC were involved in manuscript preparation. DC designed and participated in the experiments, and provided technical and scientific guidance. All authors read and approved the final manuscript.
Contributor Information
Norbert Schormann, Email: nschorm@uab.edu.
Surajit Banerjee, Email: sbanerjee@anl.gov.
Robert Ricciardi, Email: ricciard@upenn.edu.
Debasish Chattopadhyay, Email: debasish@uab.edu.
References
- 1.Nilsen H, Krokan HE. Base excision repair in a network of defense and tolerance. Carcinogenesis. 2001;22:987–98. doi: 10.1093/carcin/22.7.987. [DOI] [PubMed] [Google Scholar]
- 2.Jacobs AL, Schȁr P. DNA glycosylases in DNA repair and beyond. Chromosoma. 2012;121:1–20. doi: 10.1007/s00412-011-0347-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Visnes T, Doseth B, Sahlin Pettersen H, Hagen L, Sousa MML, Akbari M, et al. Uracil in DNA and its processing by different DNA glycosylases. Phil Trans R Soc Lond B. 2009;364:563–8. doi: 10.1098/rstb.2008.0186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Schormann N, Ricciardi R, Chattopadhyay D. Uracil-DNA glycosylases – Structural and functional perspectives on an essential family of DNA repair enzymes. Protein Sci. 2014;23:1667–85. doi: 10.1002/pro.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pearl LH. Structure and function in the uracil-DNA glycosylase superfamily. Mutat Res. 2000;460:165–81. doi: 10.1016/S0921-8777(00)00025-2. [DOI] [PubMed] [Google Scholar]
- 6.Parikh SS, Mol CD, Slupphaug G, Bharati S, Krokan HE, Tainer JA. Base excision repair initiation revealed by crystal structures and binding kinetics of human uracil-DNA glycosylase with DNA. EMBO J. 1998;17:5214–26. doi: 10.1093/emboj/17.17.5214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Parikh SS, Walcher G, Jones CD, Slupphaug G, Krokan HE, Blackburn GM, et al. Uracil-DNA glycosylase-DNA substrate and product structures: conformational strain promotes catalytic efficiency by coupled stereoelectronic effects. Proc Natl Acad Sci U S A. 2000;97:5083–8. doi: 10.1073/pnas.97.10.5083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Werner RM, Jiang YL, Gordley RG, Jagadeesh GJ, Ladner JE, Xiao G, et al. Stressing-out DNA? The contribution of serine-phosphodiester interactions in catalysis by uracil DNA glycosylase. Biochemistry. 2000;39:12585–94. doi: 10.1021/bi001532v. [DOI] [PubMed] [Google Scholar]
- 9.Jiang YL, Stivers JT. Mutational analysis of the base-flipping mechanism of uracil DNA glycosylase. Biochemistry. 2002;41:11236–47. doi: 10.1021/bi026226r. [DOI] [PubMed] [Google Scholar]
- 10.Slupphaug G, Mol CD, Kavli B, Arvai AS, Krokan HE, Tainer JA. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]
- 11.Savva R, McAuley-Hecht K, Brown T, Pearl L. The structural basis of specific base-excision repair by uracil-DNA glycosylase. Nature. 1995;373:487–93. doi: 10.1038/373487a0. [DOI] [PubMed] [Google Scholar]
- 12.Parker JP, Bianchet MA, Krosky DJ, Friedman JI, Amzel LM, Stivers JT. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature. 2007;449:433–8. doi: 10.1038/nature06131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Schormann N, Grigorian A, Samal A, Krishnan R, DeLucas L, Chattopadhyay D. Crystal structure of vaccinia virus uracil-DNA glycosylase reveals dimeric assembly. BMC Struct Biol. 2007;7:45. doi: 10.1186/1472-6807-7-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schormann N, Banerjee S, Ricciardi R, Chattopadhyay D. Structure of the uracil complex of vaccinia virus uracil DNA glycosylase. Acta Cryst F. 2013;69:1328–34. doi: 10.1107/S1744309113030613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.De Silva FS, Moss B. Vaccinia virus uracil DNA glycosylase has an essential role in DNA synthesis that is independent of its glycosylase activity: catalytic site mutations reduce virulence but not virus replication in cultured cells. J Virol. 2003;77:159–66. doi: 10.1128/JVI.77.1.159-166.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ishii K, Moss B. Mapping interaction sites of the A20R protein component of the vaccinia virus DNA replication complex. Virology. 2002;303:232–9. doi: 10.1006/viro.2002.1721. [DOI] [PubMed] [Google Scholar]
- 17.Stanitsa ES, Arps L, Traktman P. Vaccinia virus uracil DNA glycosylase interacts with the A20 protein to form a heterodimeric processivity factor for the viral DNA polymerase. J Biol Chem. 2006;281:3439–51. doi: 10.1074/jbc.M511239200. [DOI] [PubMed] [Google Scholar]
- 18.Boyle KA, Stanitsa ES, Greseth MD, Lindgren JK, Traktman P. Evaluation of the role of the vaccinia virus uracil DNA glycosylase and A20 proteins as intrinsic components of the DNA polymerase holoenzyme. J Biol Chem. 2011;286:24702–13. doi: 10.1074/jbc.M111.222216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sèle C, Gabel F, Gutsche I, Ivanov I, Burmeister WP, Iseni F, et al. Low- resolution structure of vaccinia virus DNA replication machinery. J Virol. 2013;87:1679–89. doi: 10.1128/JVI.01533-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Contesto-Richefeu C, Tarbouriech N, Brazzolotto X, Betzi S, Morelli X, Burmeister WP, et al. Crystal structure of the vaccinia virus DNA polymerase holoenzyme subunit D4 in complex with the A20 N-terminal domain. PLoS Pathog. 2014;10:e1003978. doi: 10.1371/journal.ppat.1003978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.McDonald WF, Traktman P. Vaccinia virus DNA polymerase. In vitro analysis of parameters affecting processivity. J Biol Chem. 1994;269:31190–7. [PubMed] [Google Scholar]
- 22.Druck Shudofsky AM, Silverman JE, Chattopadhyay D, Ricciardi RP. Vaccinia virus D4 mutants defective in processive DNA synthesis retain binding to A20 and DNA. J Virol. 2010;84:12325–35. doi: 10.1128/JVI.01435-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Scaramozzino N, Sanz G, Crance JM, Saparbaev M, Drillen R, Laval J, et al. Characterisation of the substrate specificity of homogenous vaccinia virus uracil-DNA glycosylase. Nucleic Acids Res. 2003;31:4950–7. doi: 10.1093/nar/gkg672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, et al. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007;35:W375–83. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zheng G, Lu X-J, Olson WK. Web 3DNA – A web server for the analysis, reconstruction, and visualization of three-dimensional nucleic-acid structures. Nucleic Acids Res. 2009;37:W240–6. doi: 10.1093/nar/gkp358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372:774–97. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- 27.Rice PA, Yang S, Mizuuchi K, Nash HA. Crystal structure of an IHF-DNA complex: A protein-induced DNA U-turn. Cell. 1996;87:1295–306. doi: 10.1016/S0092-8674(00)81824-3. [DOI] [PubMed] [Google Scholar]
- 28.Dürr H, Körner C, Müller M, Hickmann V, Hopfner KP. X-ray structures of the Sulfolobus solfataricus SWI2/SNF2 ATPase core and its complex with DNA. Cell. 2005;121:363–73. doi: 10.1016/j.cell.2005.03.026. [DOI] [PubMed] [Google Scholar]
- 29.Roberts VA, Pique ME, Hsu S, Li S, Slupphaug G, Rambo RP, et al. Combining H/D exchange mass spectroscopy and computational docking reveals extended DNA-binding surface on uracil-DNA glycosylase. Nucleic Acids Res. 2012;40:6070–81. doi: 10.1093/nar/gks291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cao C, Jiang YL, Stivers JT, Song F. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat Struct Mol Biol. 2004;11:1230–6. doi: 10.1038/nsmb864. [DOI] [PubMed] [Google Scholar]
- 31.Kalodimos CG, Biris N, Bonvin AMJJ, Levandoski MM, Guennuegues M, Boelens R, et al. Structure and flexibility adaptation in nonspecific and specific protein-DNA complexes. Science. 2004;305:386–9. doi: 10.1126/science.1097064. [DOI] [PubMed] [Google Scholar]
- 32.Viadiu H, Aggarwal AK. Structure of BamHI bound to nonspecific DNA: a model for DNA sliding. Mol Cell. 2000;5:889–95. doi: 10.1016/S1097-2765(00)80329-9. [DOI] [PubMed] [Google Scholar]
- 33.Zharkov DO, Mechetin GV, Nevinsky GA. Uracil-DNA glycosylase: Structural, thermodynamic and kinetic aspects of lesion search and recognition. Mutat Res. 2010;685:11–20. doi: 10.1016/j.mrfmmm.2009.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Schonhoft JD, Kosowicz J, Stivers JT. DNA translocation by human uracil DNA glycosylase: Role of DNA phosphate charge. Biochemistry. 2013;52:2526–35. doi: 10.1021/bi301561d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sartmatova D, Nash T, Schormann N, Nuth M, Ricciardi R, Banerjee S, et al. Crystallization and preliminary X-ray diffraction analysis of three recombinant mutants of Vaccinia virus uracil DNA glycosylase. Acta Cryst F. 2013;69:295–301. doi: 10.1107/S1744309113002716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kabsch W. XDS. Acta Cryst D Biol Cryst. 2010;66:125–32. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kabsch W. Integration, scaling, space-group assignment and post-refinement. Acta Cryst D Biol Cryst. 2010;66:133–44. doi: 10.1107/S0907444909047374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Evans P. Scaling and assessment of data quality. Acta Cryst D Biol Cryst. 2006;62:72–82. doi: 10.1107/S0907444905036693. [DOI] [PubMed] [Google Scholar]
- 39.Winn MD, Ballard CC, Cowtan KD, Dodson EJ, Emsley P, Evans PR, et al. Overview of the CCP4 suite and current developments. Acta Cryst D Biol Cryst. 2011;67:235–42. doi: 10.1107/S0907444910045749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Karplus PA, Diederichs K. Linking crystallographic model and data quality. Science. 2012;336:1030–3. doi: 10.1126/science.1218231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Diederichs K, Karplus PA. Better models by discarding data? Acta Cryst D Biol Cryst. 2013;69:1215–22. doi: 10.1107/S0907444913001121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. J Appl Cryst. 2007;40:658–74. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Murshudov GN, Skubák P, Lebedev AA, Pannu NS, Steiner RA, Nicholls RA, et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Cryst D Biol Cryst. 2011;67:355–67. doi: 10.1107/S0907444911001314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Emsley P, Cowtan K. Coot: Model-building tools for molecular graphics. Acta Cryst D Biol Cryst. 2004;60:2126–32. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
- 45.Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Cryst D Biol Cryst. 2010;66:213–21. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Afonine PV, Grosse-Kunstleve RW, Echols N, Headd JJ, Moriarty NW, Mustyakimov M, et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Cryst D Biol Cryst. 2012;68:352–367. doi: 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yang H, Guranovic V, Dutta S, Feng Z, Berman HM, Westbrook JD. Automated and accurate deposition of structures solved by X-ray diffraction to the Protein Data Bank. Acta Cryst D Biol Cryst. 2004;60:1833–9. doi: 10.1107/S0907444904019419. [DOI] [PubMed] [Google Scholar]
- 48.Weiss MS. Global indicators of X-ray data quality. J Appl Cryst. 2001;34:130–5. doi: 10.1107/S0021889800018227. [DOI] [Google Scholar]
- 49.Luscombe NM, Laskowski RA, Thornton JM. NUCPLOT: a program to generate schematic diagrams of protein-nucleic acid interactions. Nucleic Acids Res. 1997;25:4940–5. doi: 10.1093/nar/25.24.4940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, et al. UCSF Chimera – A visualization system for exploratory research and analysis. J Comput Chem. 2004;25:1605–12. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 51.DeLano WL. PyMOL. Schrödinger, LLC: Open-Source PyMOLTM; 2002. [Google Scholar]