Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2018 Apr 30;46(11):5618–5633. doi: 10.1093/nar/gky293

Regional conformational flexibility couples substrate specificity and scissile phosphate diester selectivity in human flap endonuclease 1

Ian A Bennet 1,2, L David Finger 1,✉,2, Nicola J Baxter 2,3, Benjamin Ambrose 1, Andrea M Hounslow 2, Mark J Thompson 1, Jack C Exell 1,3, Nur Nazihah B Md Shahari 1, Timothy D Craggs 1,, Jonathan P Waltho 2,3, Jane A Grasby 1,
PMCID: PMC6009646  PMID: 29718417

Abstract

Human flap endonuclease-1 (hFEN1) catalyzes the divalent metal ion-dependent removal of single-stranded DNA protrusions known as flaps during DNA replication and repair. Substrate selectivity involves passage of the 5′-terminus/flap through the arch and recognition of a single nucleotide 3′-flap by the α2–α3 loop. Using NMR spectroscopy, we show that the solution conformation of free and DNA-bound hFEN1 are consistent with crystal structures; however, parts of the arch region and α2–α3 loop are disordered without substrate. Disorder within the arch explains how 5′-flaps can pass under it. NMR and single-molecule FRET data show a shift in the conformational ensemble in the arch and loop region upon addition of DNA. Furthermore, the addition of divalent metal ions to the active site of the hFEN1–DNA substrate complex demonstrates that active site changes are propagated via DNA-mediated allostery to regions key to substrate differentiation. The hFEN1–DNA complex also shows evidence of millisecond timescale motions in the arch region that may be required for DNA to enter the active site. Thus, hFEN1 regional conformational flexibility spanning a range of dynamic timescales is crucial to reach the catalytically relevant ensemble.

INTRODUCTION

Flap endonuclease 1 (FEN1) is a member of the 5′-nuclease superfamily that is predominantly involved in Okazaki fragment maturation, but also has roles in long-patch base excision repair and ribonucleotide excision repair (1,2). All three of these pathways create bifurcated nucleic acid structures known as 5′-flaps that must be removed precisely to create single-stranded (ss) and nicked double-stranded (ds) DNA products (Figure 1A). In line with a critical role in DNA replication, FEN is present in all organisms, from bacteriophage to mammals (3). In humans, cancer cells overexpress FEN1, and tumor aggressiveness correlates with FEN1 protein levels (4). As such, human FEN1 (hFEN1) has been postulated to be a potential cancer therapeutic target (5,6), and evidence suggests that combinatorial targeting of hFEN1 has therapeutic relevance (7). Moreover, FEN1 is the prototypical member of the 5′-nuclease superfamily whose activities span all major DNA metabolic pathways. As nucleases, specificity is paramount as unwanted hydrolysis of DNA or RNA can be deleterious; thus, how hFEN1 and paralogues achieve substrate and scissile phosphate diester specificity has been an area of considerable debate.

Figure 1.

Figure 1.

Regions of hFEN1 assume various conformations in crystal structures. (A) Representation of hFEN1-catalyzed reaction of double-flap DF#,1 creating ssDNA and nicked dsDNA products (2). # denotes a 5′-flap of any length, 1 denotes a single nucleotide 3′-flap. (B) The hFEN1D86N-(DF2,1) complex (5UM9) (21) shows that the arch (α4-α5, red), α2–α3 loop (blue) and β-pin (β6-β7 loop) (pink) are ordered in the presence of DNA substrate (colored as in panel A). The α10-α11 loop (H2tH motif) that binds the reacting duplex approximately 20 Å from the scissile phosphate and the α14-loop-α15 that is unique to FEN proteins are shown in orange and black, respectively. The potassium and active site samarium cations are shown as orange and yellow spheres, respectively. (C) Rear view of 5UM9 (cartoon with transparent surface representation) with regions colored as in panels A and B and the DNA (DF2,1) shown as spheres. (DF) The three crystal structures of apo-hFEN1 (1UL1) show the arch, α2–α3 loop and β-pin regions in various states of order and disorder, dashed lines indicate missing electron density (22). (D) The tan shape illustrates the saddle-region of hFEN1 that binds the dsDNA and active site metals.

Earlier studies revealed that hFEN1 specificity arises from the recognition of three structural features of its substrate (Figure 1A) (2,8). Firstly, only junction dsDNAs can be bound stably by the protein as two juxtaposed dsDNA binding sites require the substrate to bend roughly 100° (Figure 1B) (9–11). Secondly, despite removing 5′-flaps, hFEN1 recognizes substrates bearing a one nucleotide (1-nt) 3′-flap (12–14). In structures of hFEN1–DNA complexes, the 3′-flap is cradled in a pocket created from the α14–α15 loop, the first turn of α15 and the α2–α3 loop (11). The α2–α3 loop folds into a non-regular secondary structure known as an Ω-loop (15), often associated with allosterically-regulated regions of proteins (Supplementary Figure S1) (16). Thirdly, the 5′-flap portion of the substrate must pass under the arch region (α4-α5) for biologically-relevant rates of reaction to occur (Figure 1B and C) (17,18). This latter substrate selection feature is also shared by the 5′-nuclease superfamily member human exonuclease-1 (19,20), but not by other family members that act on continuous DNAs like bubble-cutting XPG and the Holliday junction resolvase GEN1 (3). When these three substrate recognition tests are met, the DNA can untwist and enter the active site for catalysis. Of the three hFEN1 substrate feature selection processes, the threading of the 5′-flap through the arch remains the most enigmatic.

Crystal structures of hFEN1–DNA complexes (3Q8K, 3Q8L, 3Q8M, 5UM9) show the arch is composed of two α-helices above the active site (11,21). Together these create a hole in the protein that is large enough to accommodate ssDNA but not dsDNA (11,21), which could explain hFEN1 specificity for 5′-ss flaps (Figure 1C). However, this is inconsistent with the ability of hFEN1 to process 5′-flaps with some duplex secondary structure (12,18). A further puzzle relates to how threading could occur through a small hole in the absence of an energy source. In contrast, apo-hFEN1 structures show that the regions corresponding to the arch and α2–α3 loop sometimes lack discernible electron density; what has been observed is either random coil or a limited degree of secondary structure (Figure 1DF) with high B-factors (Supplementary Figure S2) (22). Furthermore, structures of FEN proteins from a wide array of organisms show the arch region in various conformations and positions (Supplementary Figure S3) (2). It is possible that structural heterogeneity of the arch and α2–α3 loop has some functional relevance. However, direct confirmation of the structural status of the arch in free and substrate bound hFEN1 is necessary to define its mechanism.

To extend our understanding, multidimensional NMR spectroscopy was used to assess the properties and dynamics of hFEN1 alone and in complex with DNA in solution. We show that the solution conformation of hFEN1 is consistent with the crystal structure in the dsDNA binding region known as the saddle (Figure 1D), but a large discrepancy is observed in the arch region (traditionally referred to as the helical arch). Using a combination of kinetic, spin relaxation, chemical shift and single molecule FRET data, the contribution of hFEN1 conformationally dynamic regions to substrate specificity is revealed. Chemical shift perturbation mapping provides evidence that divalent cations induce conformational changes in the hFEN1–DNA complex that are not observed in hFEN1 alone. These data demonstrate that the active site communicates with regions involved in dsDNA binding and substrate feature recognition. Combined data raise the possibility that both conformational selection and induced fit mechanisms occur in the arch region in response to structural features of the substrate and reveal that the hFEN1–DNA complex exhibits millisecond timescale motions in the arch. Taken together, our analysis shows that regional conformational flexibility spanning a range of dynamic timescales is channeled into the catalytically-competent conformational ensemble in response to substrate features, providing the required link between specificity and catalysis in a structure-sensing nuclease. Because the structures of other 5′-nuclease superfamily show disorder or indicate flexibility in the area that corresponds to the hFEN1 arch region (19,20,23–25), a similar conformational ensemble shift mechanism in response to other paralogue-specific triggers is likely important in their mechanisms as well.

MATERIALS AND METHODS

Protein expression

Human hFEN1-336 was expressed from Escherichia coli BL21(DE3)-RILP cells transformed with pET29b-hFEN1-336 plasmid, encoding for hFEN1-336 (herein referred to as hFEN1) protein containing a human rhinovirus (HRV) type 14 3C protease-cleavable (His)6-tag located at the C-terminus. Natural abundance protein for kinetic analyses was expressed and purified as described previously (7). 15N-labeled protein was expressed using 15N autoinduction media (26). The final concentrations of 15N autoinduction media components were: 50 mM Na2HPO4 and 50 mM KH2PO4 pH 7.5, 50 mM 15NH4Cl, 5 mM Na2SO4, 2 mM MgSO4, 0.5% glycerol, 0.05% d-glucose, 0.2% α-lactose, 12× BME vitamins (USBiological B0110), 1× Trace Metals (Teknova 1000x Trace Metals T1001), 400 μg ml−1 kanamycin and 34 μg ml−1 chloramphenicol. Cultures for 15N-labeled protein expression were inoculated with a 100-fold dilution of a saturated 2× YT starter culture, and were then allowed to grow at 37°C for 4 h before the temperature was reduced to 18°C to allow for overnight expression by autoinduction.

2H,13C,15N-labeled protein was prepared using a high cell-density procedure in combination with isopropyl-β-d-thiogalatopyranoside (IPTG) induction (27). The final concentrations of all media components were: 50 mM Na2HPO4 and 25 mM KH2PO4 pH 7.5, 18 mM 15NH4Cl, 1% 13C6,2H7-d-glucose (U-13C6, 99%; 1,2,3,4,5,6,6-d7 97-98%), 0.2 mM CaCl2, 5 mM MgSO4, 10 mM NaCl, 0.25 × BME vitamins (USBiological B0110), 0.25 × Trace Metals (Teknova 1000x Trace Metals T1001), 400 μg.ml−1 kanamycin and 34 μg ml−1 chloramphenicol. Initially, a 100-fold dilution of a saturated culture of BL21(DE3)-RILP cells transformed with pET29b-hFEN1-336 was used to inoculate 2× YT media prepared with 50% D2O and was grown until saturation. A 100-fold dilution of this saturated 50% D2O 2× YT culture was then used to inoculate 2× YT media in 100% D2O to allow the cells to acclimatize for growth. Aliquots (50 ml) of these D2O acclimatized BL21(DE3)-RILP cells were added to 1 L of 100% D2O 2× YT media and were grown at 37°C until the A600 = 5–6. The culture was pelleted by ultracentrifugation (6000 × g at 4°C for 30 min) and the supernatant was discarded. The pellet was then resuspended in 1 L of 15N,2H,13C minimal media (as described above) and then allowed to grow at 37°C for a further 2 h to clear unlabeled metabolites. Afterward, IPTG was added to a final concentration of 0.1 mM IPTG, the temperature was reduced to 18°C, and the culture was grown for a further 72 h until A600 = 10–13. 2H,15N-labeled protein was prepared analogously except that the 1% 13C6,2H7-d-glucose (U-13C6, 99%; 1,2,3,4,5,6,6-d7 97–98%) was replaced with 1% 2H7-d-glucose (1,2,3,4,5,6,6-d7 97–98%).

In all cases, cells were harvested by centrifugation (6000 × g at 4°C for 15 min) and the supernatant was removed. The cell pellet was resuspended in 50 ml of ice-cold phosphate buffered saline (10 mM Na2HPO4 and 2 mM KH2PO4 pH 7.4, 137 mM NaCl, 2.7 mM KCl), the cells were pelleted again at (6000 × g at 4°C for 15 min) and the supernatant was removed. The cells were then resuspended in 50 ml of Buffer A (20 mM Tris–HCl pH 7.4, 1 M NaCl, 5 mM imidazole) containing 5 mM β-mercaptoethanol (βME), 0.1 mg ml−1 lysozyme and 1 EDTA-free protease inhibitor cocktail tablet (Sigma S8830). Cells were incubated on ice for at least 1 h and were frozen at −20°C until required.

Protein purification

To purify hFEN1 protein, cells were thawed and then sonicated on ice for 60 cycles of pulsation for 5 s with 25 s cooling intervals. To each lysate, 5 ml of Buffer A containing 1% Tween-20 was added, and the lysate was clarified by centrifugation (40 000 × g at 4°C for 30 min). The supernatant was applied to a column (ID = 1.6 cm, length = 12 cm) containing Chelating Sepharose 6 Fast Flow agarose beads (GE Healthcare Life Sciences) that had been previously charged with Ni2+ ions according to the manufacturer's protocol and pre-equilibrated with 5 column volumes of Buffer A. The column was washed with 5 column volumes of Buffer A and 5 column volumes of Buffer B (25 mM Tris–HCl pH 7.4, 0.5 M NaCl, 40 mM imidazole, 0.01% Tween-20, 5 mM βME). The bound hFEN1 protein was eluted with 8 column volumes of Buffer C (250 mM imidazole pH 7.2, 0.5 M NaCl, 5 mM βME). Fractions containing hFEN1 protein were identified by SDS-PAGE, pooled and immediately diluted with an equal volume of Buffer D (20 mM Tris–HCl pH 7.4, 1 mM EDTA, 20 mM βME). The solution was then applied to a 5 mL HiTrap Q HP column (GE Healthcare Life Sciences) pre-equilibrated with the buffer D. hFEN1 protein was found in the flow through, whereas nucleic acid contamination was retained by the column and eluted using 10 column volumes of Buffer D containing a linear gradient of 0–1 M NaCl. The amount of hFEN1 protein in the flow through was estimated by measuring the absorbance at 280 nm (ϵ280 = 22 920 M−1 cm−1) (28) using a UV-Vis NanoDrop spectrophotometer (ThermoFisher Scientific). Two units of Turbo3C (HRV3C protease) (BioVision) were added for every mg of protein, and the mixture was allowed to stand overnight at 4°C to catalyze cleavage of the (His)6-tag from the hFEN1 protein. The extent of affinity tag removal was assessed by SDS-PAGE, and the solution was diluted further with an equal volume of buffer E (25 mM Tris–HCl pH 7.5, 1 mM CaCl2, 20 mM βME). The hFEN1 protein solution was then applied to a 20 ml HiPrep Heparin FF 16/10 column (GE Healthcare Life Sciences) pre-equilibrated with 2 column volumes of buffer E and eluted using 15 column volumes of Buffer E containing a linear gradient of 0–1 M NaCl (hFEN1 protein eluted at 350 mM NaCl). Fractions containing hFEN1 protein were pooled and concentrated to less than 10 ml using an Amicon Ultrafiltration cell with a 10 kDa MWCO PES membrane (MerckMillipore) pressurised using nitrogen gas. The protein was exchanged into the appropriate buffer (see below) using a HiPrep 26/10 desalting column (GE Healthcare Life Sciences). The hFEN1 protein concentration was determined using the absorbance at 280 nm as described above. Unlabeled samples for kinetic measurements were exchanged into 100 mM HEPES–KOH pH 7.5, 200 mM KCl, 1 mM EDTA, 10 mM DTT and 0.04% NaN3, and adjusted to a concentration of 200 μM. An equal volume of glycerol was added to each sample to allow optimal storage conditions (final concentration: 100 μM hFEN1, 50 mM HEPES–KOH pH 7.5, 100 mM KCl, 0.5 mM EDTA, 5 mM DTT, 0.02% NaN3 and 50% glycerol). Samples were stored at −20°C. Isotopically labeled hFEN1 protein samples were exchanged into the appropriate NMR buffer and concentrated to 0.5 mM using a Vivaspin 20 (10 kDa MWCO) concentrator (4000 × g at 4°C).

All purification buffers were filtered and degassed prior to use, and an ÄKTA pure chromatography system was used for all column purification steps. Unless otherwise stated, all reagents were purchased from Sigma-Aldrich, Fisher Scientific, or VWR International and used as received.

NMR assignment

All NMR experiments were performed at 298 K (relative to d4-methanol signal) using standard pulse sequences on either a Bruker 600 MHz Avance DRX spectrometer equipped with a 5-mm TXI cryoprobe with z-axis gradients or a Bruker 800 MHz Avance I spectrometer equipped with a 5-mm TXI probe with z-axis gradients. Both spectrometers were operated with TopSpin 2. Comparison of 1H–15N TROSY spectra of 2H,15N,13C-labeled hFEN1 with 15N-labeled hFEN1 showed that all backbone amide groups had back exchanged from deuterium to protium atoms. For the backbone resonance assignment of hFEN1, 0.5 ml samples contained 0.5 mM 2H,15N,13C-labeled hFEN1 in 10 mM HEPES–KOH pH 7.5, 76 mM KCl, 0.1 mM EDTA, 4 mM NaN3, 100 mM βME. D2O (10% v/v) and 0.05 mM trimethylsilyl propanoic acid (TSP) were added for the deuterium lock and chemical shift reference standard, respectively. After transferring the sample into a 5 mm NMR tube, the sample was sealed with a Precision Seal® rubber septa cap (Sigma-Aldrich Z554014) and argon was passed over the sample to help minimize oxidation.

hFEN1 backbone 1HN, 15N, 13Cα, 13Cβ and 13C' were assigned using the standard suite of 1H–15N TROSY and 3D TROSY-based constant time experiments (HNCA, HN(CO)CA, HNCACB, HN(CO)CACB, HN(CA)CO and HNCO) (29). 1H chemical shifts were referenced relative to the internal TSP signal, whereas 15N and 13C chemical shifts were indirectly referenced using nuclei-specific gyromagnetic ratios. Peak picking and frequency matching was performed within CCPNMR Analysis version 2.4 (30), and the backbone assignments were confirmed independently using a simulated annealing algorithm employed by the asstools assignment program (31). The secondary structure content and Random Coil Index-S2 (RCI-S2) prediction of hFEN1 was conducted by uploading the backbone 1HN, 15N, 13Cα, 13Cβ and 13C' chemical shifts to the TALOS-N webserver (32).

NMR relaxation measurements

hFEN1 samples for 15N backbone fast timescale relaxation measurements were performed using 2H,15N-labeled hFEN1 in a 5 mm Shigemi NMR microtube assembly matched with D2O. Sample conditions were 10 mM HEPES–KOH pH 7.5, 76 mM KCl, 4 mM NaN3 100 mM βME, 10% D2O, 0.05 mM TSP and 0.1 mM EDTA. Spin-lattice 15N relaxation rates (R1), rotating frame 15N relaxation rates (R) and heteronuclear steady-state 15N-{1H} NOE (hNOE) values were obtained using interleaved TROSY-readout pulse sequences (33). Temperature compensation was applied in the R1 experiment by incorporating a spin-lock pulse placed off-resonance in the inter-scan delay, equal to the longest spin-lock time and RF power of the R experiment. Relaxation delays of 0, 80, 240, 400, 400, 640, 800, 1200, 1760 and 2400 ms were used to calculate R1, and delays of 1, 20, 20, 30, 40, 60, 90, 110, 150 and 200 ms were used to calculate R. The inter-scan delay was 3.5 s and the strength of the RF spin-lock field during R measurement was 1400 Hz at 600 MHz and 1866.7 Hz at 800 MHz. For the hNOE measurement, two interleaved experiments were acquired with relaxation delays of 10 s. For the determination of R1 and R rates, the decay of backbone amide peak intensities were fitted using an exponential function in CCPNMR (30). Relaxation parameters were obtained for 192 residues because 59 residues were omitted from further analysis due to peak overlap or poor signal to noise ratios. For hNOE experiments, the ratio of the saturated to unsaturated peak height was measured. R2 values were calculated from R1 and R rates according to Equation ((1)).

graphic file with name M5.gif (1)

where tan θ = ω1 / Ω, ω1 is the spin lock RF field (1400 Hz at 600 MHz and 1866.7 Hz at 800 MHz) and Ω is the offset of the 15N resonance of interest with respect to the 15N carrier frequency (33).

Model-free analysis

Model-free analysis was performed using relax (34–38) on the Sheffield-WRGRID ICEBERG high performance computing cluster. Using R1, R2 and hNOE values at 600 MHz and 800 MHz and the coordinate geometry of backbone amide N-H bonds as provided by the crystal structure 3Q8K (11), model-free analysis was executed. However, residues from the arch region, α2–α3 loop and the C-terminus were excluded due to lack of coordinates in the crystal structures (1UL1) (22) or low R2/R1 values. Instead, these 16 residues were modeled with a spherical diffusion tensor. Both analyses were conducted for the 192 residues for which relaxation data at both field strengths was available; of these, only 179 were processed fully, as 13 were removed due to large errors or computational eliminations.

Titration of Ca2+ and Mg2+ cations into hFEN1K93A

A 15N-labeled hFEN1 K93A mutant (hFEN1K93A), which hindered catalysis, was prepared in the pET29b-hFEN1-336 vector as described previously (11). The hFEN1K93A protein was prepared as described for the wild type above using the 15N autoinduction method. 1H–15N TROSY spectra were separately recorded at 10 mM HEPES–KOH pH 7.5, 76 mM KCl, 0.1 mM EDTA, 4 mM NaN3, 100 mM βME, 10% D2O and 0.05 mM TSP with either 0 mM, 8 mM MgCl2 and 8 mM CaCl2 added. Absolute changes in weighted chemical shifts (ω) were determined using Equation ((2)) where the correction factor for 15N was α = 0.14.

graphic file with name M6.gif (2)

DNA preparation

DNA oligonucleotides were purchased with purification from LGC Biosearch Technologies. The sequence of the oligonucleotides are shown in Table 1. Oligonucleotide concentrations were quantified using absorbance at 260 nm and molar extinction coefficients calculated according to the ‘nearest-neighbor’ method (39,40).

Table 1. Oligonucleotide sequences used herein.

Name Sequence
DHPS1 5′dTGAAAGGCAGAGCGCTAGCTCTGCCTTTCGAGCGAAGCTCC3′
F1* 5′FAM*-dTTTTTACAAGGACTGCTCGACAC3′
F1 5′dTTTTTACAAGGACTGCTCGACAC3′
T1 5′dGTGTCGAGCAGTCCTTGTGACGACGAAGTCGTCC3′

*5′FAM is a 5′-terminal fluorescein modification produced using the single isomer 5-carboxyfluorescein-aminohexyl amidite.

hFEN1K93A-DNA complex formation

DHPS1 (DNA substrate) was heated and annealed in 10 mM HEPES–KOH pH 7.5, 6 mM KCl, 4 mM NaN3 and 0.1 mM EDTA. Initially, 0.5 mM 15N-labeled hFEN1K93A, 2H,13C,15N-labeled hFEN1K93A or 2H,15N-labeled hFEN1K93A, prepared as described above, in 10 mM HEPES–KOH pH 7.5, 76 mM KCl, 4 mM NaN3 and 0.1 mM EDTA was titrated with sub-stoichiometric aliquots of lyophilized DHPS1 until the DNA was in excess of the protein (>500 μM). At 50–100 μM of DNA, the sample precipitated and the quality of the spectra decreased rapidly. To overcome protein precipitating in complex with DNA, an hFEN1K93A-DNA complex was prepared in 10 mM HEPES–KOH pH 7.5, 6 mM KCl, 4 mM NaN3 and 0.1 mM EDTA in the presence of excess DNA substrate (1:1.1) under dilute conditions of protein and DNA (roughly 5 μM). Low monovalent salt concentrations were used to slow dissociation of the complex. The sample was concentrated to ∼500 μM using Vivaspin 20 (10 kDa MWCO) spin columns and 100 mM βME, 10% D2O (10% v/v) and 0.05 mM TSP were added. The same resonance assignment strategy using 3D NMR spectra was conducted with a 2H,15N,13C-labeled hFEN1K93A–DNA complex prepared in this manner. 1H-15N TROSY spectra were acquired at 0, 0.5, 1, 2, 4, 6 and 8 mM CaCl2, and chemical shift differences were recorded as described in Equation (2).

Preparation of hFEN1 cysteine mutants labeled with fluorophores

Surface residues C235 and C311 were successively mutated to alanine in the pET28b-hFEN1 plasmid using Agilent site-directed mutagenesis protocols (41) and the appropriate oligonucleotides (Supplementary Table S1). The resulting double mutant hFEN1 protein (C235A/C311A) was expressed and purified as described above. This mutant did not react with maleimide dyes on a timescale of 1 h. Subsequent mutagenesis as above created hFEN1Q, E120C/C235A/S293C/C311A. hFEN1Q protein was expressed and purified as described above. ESI-MS calculated: 43210.6 Da experimental: 43210.9 Da.

To stochastically label hFEN1Q with the appropriate fluorophores, 250 μl of 100 μM (25 nmol) purified hFEN1Q in 50% glycerol storage buffer was diluted 1:1 with buffer F (50 mM Tris–HCl pH25°C 7, 150 mM NaCl, 1 mM EDTA and 0.1 mM TCEP) to dilute the glycerol. DTT was added to the diluted hFEN1Q to a final concentration of 20 mM, and the solution allowed to incubate on ice for 15 min. The hFEN1Q solution was then subjected to size exclusion chromatography (SEC) using a Superdex 75 GL 10/300 column (GE Healthcare Life Sciences) and buffer F. Protein containing fractions were pooled and concentrated using a 10 kDa MWCO Vivaspin-20. The hFEN1Q protein concentration was quantified using the absorbance at A280 nm and calculated extinction coefficient (28). Stocks of 10 mM Atto 647N maleimide (Sigma-Aldrich) and 10 mM Cy3b maleimide (GE Healthcare Life Sciences) were prepared in DMSO. To label hFEN1Q, the protein was added to a tube containing the appropriate volume of each stock of fluorophore maleimide conjugate to give a molar ratio of 1:2:2 of hFEN1:Atto 647N:Cy3b. Aliquots (usually 50 μl) of the reaction were removed at 2, 4, 8, 16 and 26 min and quenched by addition of excess DTT. To remove excess free fluorophore, the quenched aliquots were exchanged into buffer F using micro Bio-Spin 6 columns (BioRad) according to the manufacturer's instructions. The labeled protein was again subjected to SEC as described above. The protein containing fractions were pooled and concentrated as before. The concentrations of the protein and the covalently attached Cy3b and Atto 647N were quantified using a UV/Vis Nanodrop spectrohphotometer (ThermoFisher) at A280, A559 and A646, respectively, and the associated extinction coefficients (hFEN1 protein ϵ280 = 22 920 M−1 cm−1, Cy3b ϵ559 = 130 000 M−1 cm−1 and Atto 647N ϵ646 = 150 000 M−1 cm−1). A280 values were corrected for contributions from both dyes using A280/A556 and A280/A646 ratios.

Kinetic measurements of hFEN1, mutants and fluorescently labeled proteins

The activities of hFEN1, hFEN1Q and hFEN1Q with attached fluorophores (hFEN1QF) were assessed using a 5′-fluorescein (FAM) labeled bimolecular DNA oligo construct DF5,1*, which was prepared by heating and annealing F1* and T1 in a 1:1.1 ratio (Table 1), respectively, in 50 mM HEPES–KOH pH 7.5, 100 mM KCl and 0.02% NaN3. Reactions mixtures (180 μl) containing 50 nM DF5,1* in reaction buffer (55 mM HEPES–KOH pH 7.5, 110 mM KCl, 8 mM MgCl2, 0.1 mg ml-1 BSA. 1 mM DTT) were initiated by adding appropriate amounts of hFEN1 proteins (3–6 pM final concentration) that afforded less than ∼10% product formation in 10 min at 37°C. Aliquots (20 μl) of reaction were quenched in excess EDTA (50 μl of 500 mM EDTA) at 2, 4, 6, 8, 10 and 20 min. The amount of product formation was assessed by denaturing high performance liquid chromatography (dHPLC) as described previously (18,42). Plots of concentration of product versus time yielded the initial rate of reaction (ν0 nM min−1), which could be normalized for enzyme concentration (ν0/[E]0 min−1).

For second order rate constant determination as a function of viscosity, reaction mixtures (180 μl) were assayed at 2.5, 5, 7.5 and 10 nM DF5,1* substrate in reaction buffer at 37°C and the indicated relative viscosity. Buffer viscosity was adjusted using either glycerol (0−36% v/v) or sucrose (0−35%). Glycerol relative viscosities and their corresponding %v/v were calculated using temperature corrected density calculations at 37°C (43). Relative viscosity was adjusted with sucrose as described previously (42). Product formation was monitored by dHPLC and normalized initial rates of reaction were generated as described above. Estimates of second order rate constants were derived from the slope of normalized initial rate (v/[E]) versus the concentration of substrate [S]. A calculated second order rate constant (kcat/KM)0 value at relative viscosity of 1 was derived from the Y-intercept from a plot of kcat/KM versus relative viscosity (44).

Single molecule FRET measurements and analysis

Labeled hFEN1 (hFEN1QF) was diluted to ∼50 pM in binding buffer (55 mM HEPES–KOH pH 7.5, 110 mM KCl, 8 mM CaCl2, 0.1 mg ml-1 BSA, 1 mM DTT)) and incubated at room temperature for 5 min with or without 20 nM substrate DNA (i.e. DF5,1, prepared by heating and annealing F1 and T1 in a 1:1.1 ratio (Table 1)). Glass slides were passivated with 1 mg/ml BSA for 5 min prior to each measurement. Three triplicate 10-min data sets were acquired for each sample giving a total of 90 min of data for each condition, yielding ∼4000 bursts.

smFRET data were acquired using a custom built confocal microscope and alternating laser excitation (45). Two dioide lasers (515 nm and 635 nm – LuxX plus) were directly modulated (100 us, duty cycle 45%) and combined into an optical fibre. The output beam was collimated and then cropped to 2.5 mm diameter by an iris. The beam was directed into the back of the objective (Olympus UPLSAPO 60× NA = 1.35 oil immersion) using a dichroic mirror (Chroma ZT532/640 rpc 3 mm) with the fluorescence emission collected by the same objective, focussed onto a 20 um pinhole and then split (dichroic mirror: Chroma NC395323 – T640lpxr) for detection by two avalanche photodiodes (SPCM-AQRH-14 and SPCM-NIR-14, Excilitas). Photon arrival times were recorded by a national instruments card (PCIe-6353), with the acquisition controlled using custom software (LabView 7.1).

smFRET data analysis was performed using custom Jupyter notebooks, based on the open source python software, FRETBusts, described previously (46). Briefly, background rates in each channel were obtain for each 60 seconds of the acquisition by fitting a poisson distribution to inter-photon delays. This background rate was recalculated every 60 s to take account of any change in background throughout the measurement time. Bursts were identified using a dual channel burst search as previously described (47).

Apparent FRET efficiencies (E*) and apparent stoichiometries (S*) were calculated for each burst according to Equations (3) and (4), respectively:

graphic file with name M7.gif (3)
graphic file with name M8.gif (4)

where I represents the background corrected intensity in the (i) acceptor emission channel after donor excitation (IA|D), (ii) donor emission channel after donor excitation (ID|D) and (iii) acceptor emission channel after acceptor excitation (IA|A). Apparent FRET efficiencies were corrected for donor leakage, acceptor direct excitation, and the detection efficiencies/quantum yields (gamma correction) according to published protocols (48) (arXiv:1710.03807), using labeled DNA standards for the gamma correction.

After correction factors were applied, bursts were selected with at least 40 photons under green excitation, and 10 photons under red excitation, with a maximum of 300 photons to filter out an acceptor heavy aggregate observed in the data set. Bursts with an observed stoichiometry between 0.5 and 0.85 were plotted to generate a histogram of relative frequency versus FRET efficiency and fitted with an unrestrained double Gaussian function.

RESULTS

NMR assignments are consistent with known secondary structure and peak intensities suggest regions with unusual dynamic properties

Human FEN1 (hFEN1) is a 380 amino acid protein that has a nuclease core domain (amino acid residues 1–336) that is sufficient for catalysis in vitro. As we were most interested in the dynamics associated with catalysis, the hFEN1–336 construct from X-ray crystallographic studies was used for NMR studies (11,21). Thus, the final 38 kDa hFEN1-336 construct (herein referred to as hFEN1) contains 336 native amino acids with an additional six residues present at the C-terminus from the rhinovirus 3C protease recognition site (49). Optimization of both buffer and temperature yielded stable hFEN1 samples that afforded good 1H–15N TROSY spectra (Figure 2A and Supplementary Figure S4). Using standard 1H–15N TROSY and TROSY-based 3D experiments (29), 251 amino acid residues out of the 324 theoretically-detectable backbone amides (excluding prolines and initiator methionine) were assigned in the absence of divalent metal ions. Many of the unassigned residues were localized to the base of the arch region, the active site amino acids and some of the α2–α3 loop (Figure 2B and C). Furthermore, the peaks in the 1H–15N TROSY spectrum displayed variable intensities, suggesting that not all residues relaxed uniformly. Some of the interfaces between the assignable and unassignable regions had peaks with decreased intensity, which suggested that the missing regions were undergoing exchange on the millisecond timescale (Figure 2D).

Figure 2.

Figure 2.

Relative peak heights suggest differences in dynamics in regions of hFEN1. (A) 1H–15N-TROSY spectrum of hFEN1 under experimental conditions. Peaks from amide side chains and the two tryptophan side-chain indole groups (Hϵ–Nϵ) are also observed. Expanded views of the shaded region can be found in Supplementary Figure S4. (B) Front and (C) rear views of the hFEN1 structure (3Q8K) (11) with labeled secondary structure elements and colored backbone to denote assigned (red) or unassigned (gray) residues. (D) Relative peak height (based on the lowest intensity peak E257) obtained from the 1H–15N-TROSY spectrum of hFEN1 plotted versus residue number show that loops are generally more intense and are flanked by decreasing peak intensities and sometimes missing residues. A secondary structure schematic of hFEN1 (3Q8K) is included (11). Blue rectangles, green arrows and black lines indicate α-helices, β-strands and loops, respectively. Loops known to have structural heterogeneity (1UL1 versus 3Q8K) are indicated by red lines (11,22). Red and white stripped rectangles highlight regions of structural heterogeneity. The H2tH α10–α11 loop is highlighted in orange. The location of the active site carboxylate (D34, D86, E158, E160, D179, D181 and D233) and basic residues (K93, R100, R104, K125, K128, R129 and R238) with respect to secondary structure elements are indicated by magenta and cyan ovals, respectively.

Chemical shift analysis of protein backbone nuclei (15N, 1HN, 13Cα, 13Cβ and 13C') was conducted using TALOS-N (32). The predicted secondary structure elements agreed with the structure of the saddle region of the protein in 3Q8K and 1UL1x, thereby supporting the accuracy of our assignments (Supplementary Figure S5). Prediction of β-strand ϕ and ψ dihedral angles for residues E295−S299 were confirmed with further analysis of this region in 3Q8K and 1UL1z (11,22), which showed that the dihedral angles were consistent with the β-strand prediction despite being a loop. Furthermore, a single α-helical turn in the β1–β2 loop (residues D22−Y26) is present consistent with the TALOS-N assignment (11,22). In the arch region, the analysis for A106−A111 suggested that these residues were in an α-helical conformation. For Q112−Q115, α-helix was also predicted, but with decreasing probability. Finally, an extended loop conformation was predicted for residues A116−E124. Therefore, only the N-terminal half of the arch region was consistent with the secondary structure present in 3Q8K, whereas the C-terminal portion was not.

The observable residues of the arch region and α2–α3 loop are disordered

To characterize the dynamics of the free protein, 15N spin-lattice (R1) and spin-spin (R2) relaxation rates and 15N-{1H} heteroNOE (hNOE) values were measured using 2H,15N-labeled hFEN1 (33). Of the 251 residues that were assigned, only 203 and 197 were analyzed at 600 and 800 MHz (Supplementary Figure S6A-F), respectively, due to peaks either being too weak to analyze or overlapped. The data generally showed that residues in secondary structure have similar relaxation rates, indicative of the overall rotational correlation time of the molecule (τc). Using R2/R1 ratios (Supplementary Figure S6G and H), an overall τc was estimated to be 25 ns (50), consistent for a protein of this size (38 kDa) (51). Loops including β6–β7 (i.e. β-pin), α10–α11, α12–α13 and α13–α14 as well as the start of α15 displayed intermediate R2/R1 ratios. Regions that showed particularly low R2/R1 ratios and low hNOE values were the α2–α3 loop, the arch region and the non-native C-terminal residues. These regions were associated with higher than average B-factors or lacked observable electron density in crystal structures (Figure 1DF and Supplementary Figure S2) (11,22). Furthermore, the higher peak intensities of the α2–α3 loop, the arch region and the non-native C-terminal residues (Figure 2D) are consistent with the lower R2/R1 ratios (Supplementary Figure S6G and H).

We used relax in combination with the 3Q8K protein structure to calculate model-free order parameters (52). Residues with low R2/R1 ratios and a lack of density in the free protein structures were treated separately as an isotropic spherical diffusion tensor (16 residues), while the other backbone amide residues were derived using an oblate rotational diffusion tensor (163 residues) (Supplementary Figure S6I and J). Although a large range of S2 values were generated with various models (Supplementary Table S2), most residues in the protein in secondary structure elements afforded an average S2 of ∼0.8, showing that relaxation of most residues in the saddle region of hFEN1 was influenced predominantly by the tumbling of the molecule in solution (Figure 3). Therefore, these residues were relatively rigid. The α10–α11, α12–α13 and α13–α14 loops, the C-terminal end of α10 and α14 as well as most of α15 displayed intermediate S2 (0.5 < S2 < 0.8) values indicating increased flexibility in these regions. The lowest S2 values (<0.5) were generally observed in the C-terminal tail, arch and α2–α3 loop residues, thereby indicating that these residues were extremely flexible in solution. Rex terms were reported for approximately 70% of the residues in hFEN1 that were amenable to model-free analysis (Supplementary Figure S7), indicating that millisecond timescale motions contributed to spin-spin relaxation (R2). Most of the Rex terms were associated with residues in the saddle region.

Figure 3.

Figure 3.

Model-free analysis of hFEN1 relaxation data identifies regions with increased flexibility. (A) Generalized order parameters (S2) were derived from relax (52) using backbone 15N relaxation data acquired at two field strengths and plotted versus residue number. Black circles represent data fitted to the oblate spheroid diffusion tensor, whereas pink triangles were fitted to a spherical diffusion tensor. The purple line in (A) represents the TALOS-N (32) random coil index S2 prediction. Secondary structure map as described in Figure 2D. (B) S2 values plotted on a cartoon depiction of the hFEN1 protein structure (3Q8K) (11). The spheres represent the nitrogen nuclei for which data were derived. The S2 and Rex spectrum bars illustrate the magnitude of S2 and Rex values with respect to sphere color and size.

S 2 values were compared with Random Coil Index-S2 (RCI-S2) values predicted by TALOS-N (32) and were congruent apart from an underestimation of the degree of regional flexibility (Figure 3A). Considering the flexibility of the arch, α2–α3 loop and C-terminal tail, we decided to confirm the nature of these regions using the neighbor-corrected random coil chemical shift library for intrinsically disordered proteins (ncIDP) (53). Deviations greater than 1 ppm from ncIDP 15N and 13C chemical shift values were shown to be a powerful indicator of secondary structure. The observable regions of the arch, α2–α3 loop and C-terminal residues showed only small differences (≤1) (Figure 4A and B), suggesting that these residues were disordered in the free protein. The small deviations from the random coil value suggested that residues A107−Q113 were transiently sampling α-helical ϕ,ψ-space. The postulated preference for occupying α-helical-ϕ,ψ space showed a gradual decrease occurring from Q110−A114. For residues A114−A120, a progressive switch to extended ϕ,ψ-space was implied (53). These data agreed with AGADIR predictions (54) and TALOS-N analysis (32) (Figure 4C and Supplementary Figure S5A). Overall, these data suggested that the middle of α4 was poised to form α-helical structure, whereas the top of α5 was not.

Figure 4.

Figure 4.

15NH and 13Cα chemical shifts for the disordered arch region indicate that only some of the residues are sampling helical conformations. Bar graphs indicate the magnitude of the difference of experimental (δe) and sequence-corrected random coil chemical shifts (δr) for (A) 15NH and (B) 13Cα for the indicated regions. Differences less than one (dotted line) suggest disorder. The anti-correlated nature of 15NH and 13Cα is used to suggest which segments of the disordered region populate either helical or extended peptide backbone angles. Positive and negative 15NH and 13Cα chemical shift differences, respectively, imply extended backbone angles, whereas negative and positive 15NH and 13Cα chemical shift differences, respectively, suggest sampling α-helical backbone angles (53). (C) Agadir helical percentage predictions for the indicated residues assessed at 114 mM ionic strength, pH 7.5 and 25°C (NMR conditions) (54). Black bars indicate the residues for which NMR data is available to validate the prediction, whereas open bars indicate an absence of NMR assignments. A secondary structure map is illustrated below as described in Figure 2D.

The analysis of the sequence of the arch residues (P90−L130) using CIDER (55,56) (Classification of Intrinsically Disordered Ensemble Regions) categorized this IDR as a polyampholytic coil or hairpin. Whether a polyampholytic IDR prefers to be a hairpin collapsed on itself or an extended coil (Supplementary Figure S3C,D,F) was shown to depend on the linear sequence distribution of charged residues, which can be assessed by the value κ (57). A κ value of 0 indicates no self-assembly, whereas a κ value of 1 indicates full collapse. The arch region has a very low κ value of 0.12, which suggested that it prefers to adopt an extended coil rather than collapsed conformation in solution.

Divalent cations induce chemical shift changes in the vicinity of the hFEN1 active site

Because hFEN1 binds two Mg2+ metal ions in the active site to catalyze the hydrolysis of the scissile phosphate diester bond of the optimal substrate (1,2), we wanted to assess the effects of Mg2+ and Ca2+ metal ion coordination by hFEN1 using chemical shift perturbation mapping. Ca2+ was included in the analysis for studies of an hFEN1–DNA complex, because it facilitates the hFEN1-induced DNA conformational change required for scissile phosphate diester placement on the active site metal ions without substrate hydrolysis (9,58). Furthermore, the analysis was performed using hFEN1K93A because addition of Ca2+ alone was unable to completely inhibit phosphate diester hydrolysis, presumably due to small amounts of Mg2+ contamination (59). Removal of K93 by mutation to alanine reduces scissile phosphate diester hydrolysis by 2000-fold, because it acts as an electrophilic catalyst during scissile phosphate diester hydrolysis (42). Nonetheless, hFEN1K93A still facilitates substrate conformational change similarly to hFEN1 (9,58). The K93A mutation is in a region of hFEN1 that was unable to be assigned. Comparison of the 1H–15N TROSY spectra of hFEN1 and hFEN1K93A showed only small and localized changes in chemical shifts, thereby indicating that it had no widespread effect on the protein (Figure 5A).

Figure 5.

Figure 5.

Addition of Ca2+ ions to hFEN1K93A perturbs chemical shifts nearest the active site. (A) Superposed 1H–15N TROSY spectra of hFEN1 (gray) and hFEN1K93A (red) shows only minimal changes in localized regions close to the mutation site indicating no widespread effects on global structure. (B) Superposed 1H-15N TROSY spectra of hFEN1K93A in the absence (red) and presence (blue) of Ca2+. Well-dispersed residues are labeled accordingly. (C) Chemical shift changes observed on the addition of Ca2+ ions to hFEN1K93A are mapped onto a cartoon representation of the hFEN1 structure (3Q8K) (11). The black-dotted spheres indicate the locations of active site metal ions that are coordinated by the side chains of carboxylate residues shown as magenta sticks. Secondary structure elements, the H2tH motif and the N-terminus (Nt) are labeled. The locations of the G231 and R239 spheres are highlighted. The magnitudes of the nitrogen nuclei chemical shift perturbations (ω) are represented by sphere color according to the associated spectrum bar. The absence of a sphere either indicates the lack of assignment in the protein or the inability to follow chemical shift changes due to peak overlap. (D) Magnified view of the area indicated by the dashed box in panel C to highlight the location of residues most affected by the addition of Ca2+ ions to hFEN1K93A. Labels are as described in panel C except for the omission of R239 and addition of G87.

Titration of either MgCl2 or CaCl2 (0–10 mM) into samples of hFEN1 was monitored using 1H–15N TROSY spectra and showed saturation between 8–10 mM for both divalent salts, consistent with the optimal concentrations of MgCl2 used in routine kinetic assays. Furthermore, 1H–15N TROSY spectra of hFEN1 with 8 mM Ca2+ or 8 mM Mg2+ are nearly identical, but show small differences in residues near to the active site including the N-terminal residues (Supplementary Figure S8A) This indicated that Ca2+ binds in the active site, consistent with Ca2+ being competitive with respect to Mg2+ (60). The small deviations in chemical shifts of residues closest to the active site could be due to the larger ionic radius and looser coordination geometry of Ca2+ compared to Mg2+. Comparison of 1H-15N TROSY spectra of hFEN1K93A in the absence and presence of 8 mM Ca2+ showed similar chemical shift perturbations as observed in spectra of hFEN1 when Mg2+ or Ca2+ were added. The largest changes between apo-hFEN1K93A and Ca2+-hFEN1K93A occurred in regions close to the metal chelating carboxylates (D34, D86, E158, E160, D179, E181, D233) and the N-terminal residues (Figure 5B−D). Other regions affected included β5, α3, α7, β4–α7 loop, the H2tH motif, the β3–α4 loop and the β6-β7 β-pin, which are all regions that are near active site residues. Some of the β-strands also showed smaller perturbations further from the active site that suggested that conformational rearrangement in the active site is propagated throughout the β-sheet. These data indicated that divalent ions predominantly affected the structure immediately surrounding the active site in agreement with their locations in X-ray crystal structures (11). Notably, the addition of Ca2+ did not change the higher relative intensities of the peaks associated with the arch region and α2–α3 loop, suggesting that Ca2+ did not significantly alter the dynamics of these regions.

Substrate binding changes the exchange regime for the arch region and α2–α3 loop

To prepare an hFEN1K93A–DNA complex and to analyze NMR spectral changes upon binding, consecutive sub-stoichiometric amounts of DNA substrate (DHPS1; Table 1 and Supplementary Figure S9A) were added to an hFEN1K93A sample containing with 76 mM KCl until the DNA was in excess. However, addition of substrate induced protein precipitation in the sample suggesting non-specific DNA binding at concentrations used for NMR spectroscopy. Despite this, chemical shift changes upon addition of DNA were observed for a number of residues, consistent with locations known to interact with duplex DNA. Residues that were affected by DNA binding, but remained in the fast exchange regime (e.g. Q4, A175, L190, R192, R239 and G324) could be followed with the addition of DNA to saturation. Interestingly, the assignable α2–α3 loop and arch residues disappeared from the spectrum at sub-stoichiometric amounts of DNA, suggesting that they entered an exchange regime that prevented their detection (29). The magnitude of the exchange rate at which this happens at these field strengths should be faster than 10 s−1 but slower than 1000 s−1 as the peaks do not re-appear upon further addition of DNA to saturation. The switch from sub-nanosecond dynamics to millisecond exchange suggested a conformational change of the α2–α3 loop and the arch upon DNA binding.

Divalent metal ions induce large chemical shift changes far from the active site

To overcome issues associated with the precipitation of hFEN1K93A when preparing an hFEN1K93A–DNA complex (Supplementary Figure S9A), a different strategy was adopted where the complex was prepared under dilute conditions and at lower KCl concentrations in the presence of EDTA, followed by concentration. Preparing hFEN1K93A-DNA complexes in this manner resulted in stable samples that afforded acceptable 1H–15N TROSY spectra (Supplementary Figure S8B). Comparison of 1H–15N TROSY spectra of hFEN1K93A and the hFEN1K93A-DNA complex showed chemical shift changes for residues in or near areas known to interact with the substrate, such as the H2tH motif (e.g. G231 and R239), active site (e.g. Q4, G5, L6, G87, K88, A175 and R192), the hydrophobic wedge (e.g. A45 and G66,) and the 3′-flap binding pocket (e.g. M310, E318 and G324). This indicated that the solution complex was consistent with the hFEN1–DNA complex crystal structures (3Q8K and 5UM9) (11,21). Before assigning the hFEN1K93A–DNA complex, Ca2+ was titrated into the sample (0–8 mM) and chemical shift perturbations upon each successive addition were monitored by 1H-15N TROSY spectra. Assignment of the hFEN1K93A-DNA complex in 8 mM Ca2+ was conducted using the same methodology as for hFEN1; however, only 57% of the visible peaks (132 peaks out 230 peaks in the spectrum) could be assigned presumably due to signal attenuation in the 3D spectra. Some assignments were corroborated by following the DNA titration data (i.e. R19, G149, A175, S255, R261).

Addition of Ca2+ to hFEN1K93A and the hFEN1K93A-DNA complex showed that residues immediately surrounding the active site were perturbed similarly (Figure 5BDcf. Figure 6A and B), indicating that Ca2+ is coordinated in the active site in both free and bound forms. In the hFEN1K93A–DNA complex, however, the presence of Ca2+ additionally resulted in the disappearance of the N-terminal residues Q4, G5 and L6 (Supplementary Figure S8C). For example, the first addition of Ca2+ to the complex resulted in an upfield chemical shift with peak broadening for Q4 (Figure 6C). These N-terminal residues remained observable in spectra when Ca2+ was added to hFEN1K93A (Figure 5B); thus, this unique effect was due to the presence of the DNA. Together this suggested that the addition of Ca2+ to the complex resulted in the N-terminal residues experiencing changes in their chemical environment due to movement of the residues themselves or movement of the DNA into the active site, thereby resulting in intermediate exchange broadening.

Figure 6.

Figure 6.

Addition of Ca2+ ions to the hFEN1K93A–DNA complex show chemical shift perturbations (ω) throughout the protein. (A) Front and (B) rear views of a cartoon depiction of the hFEN1–DNA structure (3Q8K) (11) with the magnitudes of nitrogen nuclei chemical shift perturbations (ω) on addition of Ca2+ represented by sphere color according to the associated spectrum bar. Note, because the chemical shift change for G231 was larger than for most other residues, the spectrum bar is discontinuous. (C−E) Highlighted regions from Supplementary Figure S8C showing several residues with significant chemical shift perturbations that occur upon titration with Ca2+ at 0 mM (black), 0.5 mM (cyan), 1 mM (olive), 2 mM (magenta), 4 mM (yellow), 6 mM (green) and 8 mM (orange). The panels show a large ω for (C) L190, R239 and E318, (D) G149 and G231 and (E) the appearance of S317 at higher Ca2+ concentrations. Intermediate exchange behavior is highlighted for (C) Q4 and (E) D86.

Unlike hFEN1K93A, the addition of Ca2+ to the hFEN1K93A–DNA complex resulted in larger perturbations in regions significantly removed from the active site (Figure 5C and Dcf. Figure 6A and B). The most pronounced weighted 1H–15N chemical shift change (ω) was observed for G231, which lies at the beginning of the H2tH motif. In the absence of DNA, this residue showed a slight change in chemical shift upon addition of Ca2+ ions (ω = 0.022 ppm, Figure 5C), whereas the change increased greatly (ω = 0.39 ppm, Figure 6D) when DNA was present. Another H2tH motif residue whose Ca2+ induced chemical shift change was significantly larger for the hFEN1K93A–DNA complex was R239 (ω = 0.013 ppm, Figure 5Bcf. ω = 0.17 ppm, Figure 6C), which lines the minor groove of the dsDNA in the reacting duplex. This showed that the H2tH DNA binding motif and residues in that region, which are approximately 20 Å from the active site metal ions, responded to the metal occupancy status of the active site in the presence of DNA. Residue G87, which is next to the divalent metal binding residue D86, also showed a larger chemical shift perturbation in response to Ca2+ in the hFEN1K93A-DNA complex (ω = 0.22 ppm, Figure 6B and Supplementary Figure S8C) compared to hFEN1K93A (ω = 0.07 ppm, Figure 5B and Supplementary Figure S8B). In the threaded hFEN1–DNA substrate crystal structure (5UM9), residue G87, which is in the β3–α4 loop, is near the 5′-flap exit, and this larger perturbation was likely due to its proximity to the exit of the 5′-flap (Figure 1C) (21). Moreover, chemical shift changes in this region are consistent with the reported changes in the 5′-flap threading equilibrium when Ca2+ was added to hFEN1-DNA complexes (18). Furthermore, addition of Ca2+ to the hFEN1K93A–DNA complex also gave rise to larger chemical shift perturbations than in hFEN1K93A in the β3–α4, α12–α13 and α13–α14 loops and helices α6, α14 and α15 (Figure 6A and B). These regions were proximal to the 5′-flap exit site and close to the 3′-flap binding pocket.

In contrast to the N-terminal residues, S317 of the hFEN1K93A–DNA complex reappeared near the end of the Ca2+ titration (Figure 6E). In the hFEN1–DNA product and substrate complexes (3Q8K and 5UM9) (11,21), the S317 backbone amide group hydrogen bonds with one of the non-bridging phosphate oxygen atoms of the 3′-flap phosphate diester moiety (Supplementary Figure S1A). In 1H–15N TROSY spectra of hFEN1K93A, S317 was observable in the presence and absence of Ca2+; however, the peak was missing from spectra of the hFEN1K93A-DNA complex in the absence of Ca2+ (Supplementary Figure S8B). The weighted chemical shift perturbation (ω) for S317 in hFEN1K93A and the hFEN1K93A–DNA complex in the presence of Ca2+ was 0.68 ppm downfield, consistent with it forming a hydrogen bond with a non-bridging phosphate oxygen atom in the DNA. Regardless of the presence or absence of Ca2+, residues associated with the arch region and α2–α3 loop were neither observable nor assignable in spectra of the hFEN1K93A–DNA complex.

Single molecule FRET measurements confirm conformational changes in the arch region

To investigate further the range of conformations adopted by FEN1 in the presence and absence of substrate DNA (Supplementary Figure S9B without the 5′-FAM label), we used single-molecule Forster Resonance Energy transfer (smFRET). Modeling of dye attachment sites for donor (Cy3B) and acceptor (Atto647N) dyes identified sites in the arch (E120) and the saddle region (S293) opposite the DNA binding site that would produce a FRET pair sensitive to the conformation and position of the arch (Supplementary Figure S10A). However, hFEN1 has two partially solvent accessible cysteines (C235 and C311) that easily label with maleimide dyes (data not shown). Thus, a quadruple mutant was generated by successive site-directed mutagenesis (E120C/C235A/S293C/C311A; hFEN1Q) (41), and stochastic incorporation of the maleimide dyes was achieved at the desired postions to generate hFEN1QF (Supplementary Table S3) (61). Measurement of hFEN1Q and hFEN1QF activities showed that the mutations and attachment of the dyes had only a small effect on enzymatic activity (Supplementary Figure S10B). In the absence of DNA but with Ca2+, a broad range of FRET efficiencies were observed (Figure 7A), indicative of multiple conformations and positions of the arch. In these experiments, single donor–acceptor labeled FEN1 molecules are observed as they diffuse through the confocal volume, a process which takes ∼ 1 ms. Conformations that interconvert much faster than the observation time yield a single, tight FRET efficiency signal, whereas a broad profile is indicative of multiple conformations converting on slower (> 1 ms) time scales, as was observed here (Figure 7A).

Figure 7.

Figure 7.

smFRET shows a change in conformational ensemble upon addition of DNA and reveals millisecond conformational dynamics occurring in the arch region. (A) Relative frequencies of FRET efficiencies for hFEN1QF alone (gray) and in complex with 20 nM DNA (white). (B) The difference in relative frequencies of FRET efficiencies upon addition of 20nM DNA. (C) hFEN1QF alone and (D) hFEN1QF with 20 nM DNA showing the unrestrained fit to the sum of two Gaussian functions (see Supplementary Table S4 for fitting parameters) to show that upon addition of DNA the lower FRET population decreases and the higher FRET population increases.

On addition of 20 nM substrate DNA, subtle changes in the FRET efficiency distribution were observed (Figure 7A); a decrease in the relative frequency of FRET efficiencies below 0.7 with a concomitant increase in FRET efficiencies above 0.7 (Figure 7B). This is indicative of a change in the conformational ensemble for FEN1 upon binding the DNA substrate. This change in the conformational ensemble could explain the unusual viscogen dependence of hFEN1 reaction (Supplementary Figure S11). The FRET efficiency distributions were well described by the sum of two gaussians, with free fits yielding the same means for the high and low FRET populations in both the presence and absence of the DNA substrate (Supplementary Table S4). On addition of DNA, a shift in the relative amplitude from the low-FRET to the high-FRET ensemble was observed (Figure 7C and D, Supplementary Table S4), showing a change in the conformational ensemble of the arch with respect to the saddle upon binding DNA. These data indicated that a change in the conformational ensemble in the arch occurred when substrate was added.

DISCUSSION

Earlier studies have suggested that hFEN1-specificity for DNA structure rather than sequence arises from the recognition of three substrate structural features: (i) the bending of the two-way dsDNA junction, (ii) 5′-flap threading and (iii) 3′-flap binding (8–12,21). The latter two substrate-feature selection steps involve regions of the protein that we show are conformationally dynamic (i.e., the arch region and the α2–α3 loop). NMR spin relaxation data for the free protein shows that residues in the top of the arch (A107, L111, A117–E120 and E122) and the middle of the α2–α3 loop (V52–Q54) are very flexible, exhibiting motions on the timescale of ∼109 s−1; hence, they are disordered. However, residues either side of these flexible regions cannot be detected in NMR experiments presumably due to intermediate exchange broadening caused by millisecond timescale motions. Consistent with this, smFRET data for the free protein suggests that the arch region is moving relative to the saddle region of hFEN1 at a rate slower than a 1000 s−1 (Figure 8). The highly mobile residues in the arch and α2–α3 loop that are observable by NMR may experience this millisecond motion as well, but because these regions likely experience negligible chemical shift changes, the sub-nanosecond timescale motions dominate the relaxation.

Figure 8.

Figure 8.

The flexible regions of hFEN1 respond to the appropriate structural features of the substrate, resulting in a shift to the catalytically viable ensemble. (A) In the absence of substrate, hFEN1 contains flexible regions in the ‘arch’ (red) and the α2–α3 loop (blue). The tops of these regions are disordered and display very fast motions (∼109 s−1), whereas the flanking regions are absent presumably due to intermediate exchange broadening. The observable regions of the arch and loop are likely in fast exchange because they experience little to no change in chemical environment from the millisecond timescale motion. (B) Upon binding DNA, the disordered regions experience exchange broadening, likely due to a change in chemical environment and/or change in rate of motions. However, the slow millisecond timescale movements persist in both the arch and the α2–α3 loop despite the change in the conformational ensemble. (C) Eventually, the catalytically viable ensemble forms possibly coupling motions within the arch, the α2–α3 loop and the DNA.

The observable residues in arch and α2–α3 loop undergo a change of motional timescale from ∼109 s−1 in the free protein to an intermediate exchange regime (10–1000 s−1) in the hFEN1–DNA complex, suggesting their chemical environment is perturbed more significantly when bound to DNA. Such perturbation can arise from changes in rates of exchange, secondary structure formation, and changes in chemical environment due to proximity to the DNA. In line with this, the smFRET data indicate a change in the conformational ensemble upon addition of DNA. Interestingly, both the smFRET and NMR data suggest that there are still millisecond time motions involving the arch and α2–α3 loop. Thus, the earlier disorder to order model upon correct substrate binding that was based on crystallographic observations (2,11,21,62,63) is an oversimplification as it suggests there are no motions in the hFEN1–DNA complex. Rather, the complex is likely better described as an ensemble of conformations. The smFRET data shows that addition of DNA shifts the equilibrium bias of the arch to higher FRET efficiency states, implying the net position of the arch moves closer to the saddle. Interestingly, the arch has been observed in different positions relative to the saddle in hFEN1 (11,21) and hEXO1 X-ray structures (20,64). Notably, in structures where DNA has untwisted and moved into the active site (21), the arch has moved forward toward the saddle. It is possible that arch motions play an important role in the catalytic cycle, by facilitating untwisting of the DNA to transfer the scissile phosphate to the active site. Indeed, the single turnover rates of the hFEN1-catalyzed reaction are 15–30 s−1 (12); thus, it is possible that the slower conformational changes in the arch and α2–α3 loop regions could limit the rate of reaction.

In the hFEN1–DNA complex, residues near the active site are perturbed upon addition of Ca2+ as is also seen with substrate-free hFEN1. Interestingly, the N-terminal residues disappear from the spectra upon addition of Ca2+ to hFEN1–DNA, but not with hFEN1 alone. The recent crystal structures of the hFEN1–DNA complex (5UM9) suggests that the N-terminal amine moiety of G2 in conjunction with a metal ion and D233 activates the attacking water to generate the hydroxide ion nucleophile (21). Hence, the dynamics resulting in the disappearance of the N-terminal residues upon the addition of Ca2+ may be associated with this role. We also observed large chemical shift changes in response to addition of Ca2+ to the hFEN1–DNA complex in regions of the protein distal to the active site that were not observed at the same magnitude upon addition of Ca2+ to hFEN1. Several studies have shown that divalent metal ions and arch region ordering are necessary to move the scissile phosphate into the active site by untwisting (65) the reacting duplex (9,18,58,59). Thus, these large Ca2+ induced chemical shift changes in the H2tH motif, which directly contacts the reacting duplex 20 Å from the active site metal ions, are likely due the untwisting of the reacting duplex to move the scissile phosphate of into the active site. Remarkably, chemical shift perturbations are also propagated to the region around the 3′-flap binding pocket. The reappearance of S317 and other chemical shift changes in the 3′-flap binding region strongly suggests that the positioning of the 3′-flap duplex is altered upon the addition of Ca2+. Therefore, our study provides direct evidence of DNA-mediated allostery (66) between the active site and both reacting duplex and 3′-flap recognition, thereby linking regions involved in DNA structural recognition and catalysis.

When the 5′-flap is threaded through the helical arch, the top of α4 and α5 form a mini-hydrophobic core (21). The importance of this ordered state has previously been demonstrated; it is required for untwisting of the reacting duplex DNA to place the scissile phosphate on the active site while delivering two basic residues to the active site that are essential for biologically-relevant rates of reaction (11,18,59). Chemical shift analysis (53) of the NMR-observable region corresponding to the top of α4 (A107−A116) suggests that it is transiently sampling an α-helical conformation in its disordered state, whereas the corresponding region of α5 is not. This suggests that the disorder-to-order transition for α4 arises due to conformational selection (i.e., population shift), whereas the transition in α5 results from induced fit (67–69). Having a completely induced fit model for the arch would allow for enhanced specificity but would involve large entropic penalties and slow ordering. Using a conformational ensemble shift for the larger portion of the arch (α4) reduces these penalties. A potential trigger for the ensemble population shift for the α4 region could be the catalytically-crucial contact to the +1 phosphate of the 5′-flap as well as the formation of the mini-hydrophobic core (21). On the other side of the arch, several factors are likely to induce the disorder to α-helical transition of α5. These include interaction with the α2–α3 loop when it forms an Ω-loop (15) in response to the 3′-flap and electrostatic interaction of K125, K128 and R129 with template DNA phosphate diesters (11,21). Interestingly, the distance these lysine residues are from the template phosphate diesters is shorter when the arch moves forward towards the saddle (21).

Because hFEN1 requires a disordered region that can fold in response to the appropriate ligand, the evolutionary conservation of disorder and order promoting residues to maintain the delicate balance is necessary (55,70). CIDER (56) analyses of the arch suggest that substrate-free hFEN1 has evolved to populate an extended rather than a collapsed conformation. The extended conformation is necessary to allow DNA to thread. Seemingly innocuous mutations or even interaction with small molecules may alter the conformation of the disordered state (extended coil vs. collapsed) or change the balance between disorder and order, thereby inhibiting catalysis (55). Thus, the arch region and α2–α3 loop, which are not conserved with other superfamily members, could be potential drug targets for hFEN1 (71).

Why hFEN1 has evolved to have an intrinsically disordered region and loop to achieve catalysis may seem counterintuitive initially, because enzymology has traditionally focused on structure-function paradigms (70,71). However, there is a rationale for hFEN1 evolution of intrinsically disordered regions. Firstly, because hFEN1 needs to thread a 5′-flap through the helical arch, doing so when the arch is disordered would be easier due a larger capture radius and would provide the ability to accommodate 5′-flaps with some secondary structure. Secondly, the combination of induced fit and conformational selection with subsequent ensemble shifts for the arch allows fast ordering with low energy barriers in response to the appropriate substrate features necessary for specificity. However, there is also fast unfolding and release of incorrect binding partners (72). Thirdly, it allows for low affinity for the catalytically-relevant ensemble, which arises when the enthalpic gains in ordering are balanced by the entropic cost (70,71). Hence, specificity can be achieved while allowing the enzyme to disassociate once catalysis has occurred (i.e. turnover). Together, this work illustrates how conformational dynamic regions respond to appropriate substrate features, resulting in a shift in the ensemble towards the catalytically viable state (Figure 8).

Although all 5′-nuclease superfamily members recognize DNA junctions of some kind, each protein has its own specific DNA substrate (3). However, the use of flexible regions spanning a range of dynamic timescales for substrate recognition and control of active site assembly is likely to be a common theme in these structure-sensing nucleases. As hEXO1 also uses the threading mechanism to enforce substrate free 5′-ends (19,20) and requires a helical arch for active site assembly, it too is likely to use changes in protein dynamics for substrate accommodation and catalysis. In contrast, the Holiday junction resolvase GEN1 lacks the arch feature. A much shorter peptide linker that is the equivalent in GEN1 is not observable in current structures (23,24), despite containing superfamily conserved active site residues. This is the predicted dimerization interface, as two GEN1 monomers come together to sense the four-way-junction. Similarly, the very large equivalent of the arch in XPG, which also contains active site residues, presumably assembles as a result of the binding of other nucleotide excision repair protein partners and DNA (25). Thus, somewhat surprisingly, protein reversible plasticity appears to be the answer to coupling recognition with reaction for a range of nucleic acid hydrolases.

DATA AVAILABILITY

The backbone 1H, 15N and 13C chemical shifts of the free protein in EDTA and the hFEN1K93A–DNA complex in Ca2+ have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu/) under the BMRB accession codes 27160 and 27404, respectively.

Supplementary Material

Supplementary Data

ACKNOWLEDGEMENTS

The author wish to acknowledge Dr Susan Tsutakawa for assistance with the graphical abstract.

Author Contributions: I.A.B. and L.D.F. prepared and purified isotopically-enriched protein samples and their DNA complexes. I.A.B., L.D.F., A.M.H. and N.J.B. conducted NMR spectroscopic measurements for assignment purposes, relaxation measurements and divalent metal titrations. I.A.B., L.D.F. and A.M.H. assigned the protein. I.A.B., J.C.E. and N.N.B.Md.S. performed enzyme kinetic measurements. I.A.B., L.D.F. and M.J.T. performed site-directed mutagenesis, protein purification and fluorophore maleimide attachment to hFEN1 for single molecule FRET measurements. I.A.B., B.A. and T.D.C. performed single molecule measurements. I.A.B., L.D.F., N.J.B., B.A., T.D.C., J.P.W. and J.A.G. contributed to the interpretation of the results and preparation of the manuscript. L.D.F., J.P.W. and J.A.G. conceived and coordinated the project. I.A.B., L.D.F., B.A., T.D.C. and J.A.G. prepared the figures and wrote the manuscript.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Biotechnology and Biological Sciences Research Council [BB/K009079/1 and BB/M00404X/1 to J.A.G.]; Biotechnology and Biological Sciences Research Council White Rose Doctoral Training Program in Mechanistic Biology Studentship [BB/J014443/1 to I.A.B.]. B.A. thanks the Engineering and Physical Sciences Research Council for his PhD stipend. N.N.B.Md.S. thanks the Public Service Department of Malaysia, Human Capitol Development Division for her PhD stipend. Funding for open access charge: The University Publication Fund of The University of Sheffield from RCUK.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Balakrishnan L., Bambara R.A.. Flap endonuclease 1. Annu. Rev. Biochem. 2013; 82:119–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Finger L.D., Atack J.M., Tsutakawa S., Classen S., Tainer J., Grasby J., Shen B.. MacNeill S. The Eukaryotic Replisome: A Guide to Protein Structure and Function. 2012; Dordrecht: Springer; 301–326. [Google Scholar]
  • 3. Grasby J.A., Finger L.D., Tsutakawa S.E., Atack J.M., Tainer J.A.. Unpairing and gating: sequence-independent substrate recognition by FEN superfamily nucleases. Trends Biochem. Sci. 2012; 37:74–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Zheng L., Jia J., Finger L.D., Guo Z., Zer C., Shen B.. Functional regulation of FEN1 nuclease and its link to cancer. Nucleic Acids Res. 2011; 39:781–794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. van Pel D.M., Barrett I.J., Shimizu Y., Sajesh B.V., Guppy B.J., Pfeifer T., McManus K.J., Hieter P.. An evolutionarily conserved synthetic lethal interaction network identifies FEN1 as a broad-spectrum target for anticancer therapeutic development. PLoS Genet. 2013; 9:e1003254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Ward T.A., McHugh P.J., Durant S.T.. Small molecule inhibitors uncover synthetic genetic interactions of human flap endonuclease 1 (FEN1) with DNA damage response genes. PLoS ONE. 2017; 12:e0179278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Exell J.C., Thompson M.J., Finger L.D., Shaw S.J., Debreczeni J., Ward T.A., McWhirter C., Sioberg C.L., Martinez Molina D., Abbott W.M. et al. . Cellularly active N-hydroxyurea FEN1 inhibitors block substrate entry to the active site. Nat. Chem. Biol. 2016; 12:815–821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Rashid F., Harris P.D., Zaher M.S., Sobhy M.A., Joudeh L.I., Yan C., Piwonski H., Tsutakawa S.E., Ivanov I., Tainer J.A. et al. . Single-molecule FRET unveils induced-fit mechanism for substrate selectivity in flap endonuclease 1. eLife. 2017; 6:e21884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Algasaier S.I., Exell J.C., Bennet I.A., Thompson M.J., Gotham V.J.B., Shaw S.J., Craggs T.D., Finger L.D., Grasby J.A.. DNA and protein requirements for substrate conformational changes necessary for human flap endonuclease-1-catalyzed reaction. J. Biol. Chem. 2016; 291:8258–8268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Chapados B.R., Hosfield D.J., Han S., Qiu J., Yelent B., Shen B., Tainer J.A.. Structural basis for FEN-1 substrate specificity and PCNA-mediated activation in DNA replication and repair. Cell. 2004; 116:39–50. [DOI] [PubMed] [Google Scholar]
  • 11. Tsutakawa S.E., Classen S., Chapados B.R., Arvai A.S., Finger L.D., Guenther G., Tomlinson C.G., Thompson P., Sarker A.H., Shen B.I. et al. . Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily. Cell. 2011; 145:198–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Finger L.D., Blanchard M.S., Theimer C.A., Sengerová B., Singh P., Chavez V., Liu F., Grasby J.A., Shen B.. The 3′-flap pocket of human flap endonuclease 1 is critical for substrate binding and catalysis. J. Biol. Chem. 2009; 284:22184–22194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Kao H.I., Henricksen L.A., Liu Y., Bambara R.A.. Cleavage specificity of Saccharomyces cerevisiae flap endonuclease 1 suggests a double-flap structure as the cellular substrate. J. Biol. Chem. 2002; 277:14379–14389. [DOI] [PubMed] [Google Scholar]
  • 14. Lyamichev V., Brow M.A., Varvel V.E., Dahlberg J.E.. Comparison of the 5′ nuclease activities of taq DNA polymerase and its isolated nuclease domain. Proc. Natl. Acad. Sci. U.S.A. 1999; 96:6143–6148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Fetrow J.S. Omega loops: nonregular secondary structures significant in protein function and stability. FASEB J. 1995; 9:708–717. [PubMed] [Google Scholar]
  • 16. Papaleo E., Saladino G., Lambrughi M., Lindorff-Larsen K., Gervasio F.L., Nussinov R.. The Role of Protein Loops and Linkers in Conformational Dynamics and Allostery. Chem. Rev. 2016; 116:6391–6423. [DOI] [PubMed] [Google Scholar]
  • 17. AlMalki F.A., Flemming C.S., Zhang J., Feng M., Sedelnikova S.E., Ceska T., Rafferty J.B., Sayers J.R., Artymiuk P.J.. Direct observation of DNA threading in flap endonuclease complexes. Nat. Struct. Mol. Biol. 2016; 23:640–646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Patel N., Atack J.M., Finger L.D., Exell J.C., Thompson P., Tsutakawa S., Tainer J.A., Williams D.M., Grasby J.A.. Flap endonucleases pass 5′-flaps through a flexible arch using a disorder-thread-order mechanism to confer specificity for free 5′-ends. Nucleic Acids Res. 2012; 40:4507–4519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Shaw S.J., Finger L.D., Grasby J.A.. Human Exonuclease 1 Threads 5′-Flap Substrates through Its Helical Arch. Biochemistry. 2017; 56:3704–3707. [DOI] [PubMed] [Google Scholar]
  • 20. Shi Y., Hellinga H.W., Beese L.S.. Interplay of catalysis, fidelity, threading, and processivity in the exo- and endonucleolytic reactions of human exonuclease I. Proc. Natl. Acad. Sci. U.S.A. 2017; 114:6010–6015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Tsutakawa S.E., Thompson M.J., Arvai A.S., Neil A.J., Shaw S.J., Algasaier S.I., Kim J.C., Finger L.D., Jardine E., Gotham V.J.B. et al. . Phosphate steering by Flap Endonuclease 1 promotes 5′-flap specificity and incision to prevent genome instability. Nat. Commun. 2017; 8:15855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Sakurai S., Kitano K., Yamaguchi H., Hamada K., Okada K., Fukuda K., Uchida M., Ohtsuka E., Morioka H., Hakoshima T.. Structural basis for recruitment of human flap endonuclease 1 to PCNA. EMBO J. 2005; 24:683–693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Lee S.H., Princz L.N., Klugel M.F., Habermann B., Pfander B., Biertumpfel C.. Human Holliday junction resolvase GEN1 uses a chromodomain for efficient DNA recognition and cleavage. Elife. 2015; 4:e12256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Liu Y., Freeman A.D., Declais A.C., Wilson T.J., Gartner A., Lilley D.M.. Crystal Structure of a Eukaryotic GEN1 Resolving Enzyme Bound to DNA. Cell Rep. 2015; 13:2565–2575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Miętus M., Nowak E., Jaciuk M., Kustosz P., Studnicka J., Nowotny M.. Crystal structure of the catalytic core of Rad2: insights into the mechanism of substrate binding. Nucleic Acids Res. 2014; 42:10762–10775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Studier F.W. Protein production by auto-induction in high density shaking cultures. Protein Expr. Purif. 2005; 41:207–234. [DOI] [PubMed] [Google Scholar]
  • 27. Sivashanmugam A., Murray V., Cui C., Zhang Y., Wang J., Li Q.. Practical protocols for production of very high yields of recombinant proteins using Escherichia coli. Protein Sci. 2009; 18:936–948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Gasteiger E., Gattiker A., Hoogland C., Ivanyi I., Appel R.D., Bairoch A.. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003; 31:3784–3788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Cavanagh J., Fairbrother W.J., Palmer A.G. III, Rance M., Skelton N.J.. Protein NMR Spectroscopy Principles and Practice. 2007; 2nd ednLondon: Elsevier Academic Press. [Google Scholar]
  • 30. Vranken W.F., Boucher W., Stevens T.J., Fogh R.H., Pajon A., Llinas M., Ulrich E.L., Markley J.L., Ionides J., Laue E.D.. The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins. 2005; 59:687–696. [DOI] [PubMed] [Google Scholar]
  • 31. Reed M.A.C., Hounslow A.M., Sze K.H., Barsukov I.G., Hosszu L.L.P., Clarke A.R., Craven C.J., Waltho J.P.. Effects of domain dissection on the folding and stability of the 43 kDa protein PGK probed by NMR. J. Mol. Biol. 2003; 330:1189–1201. [DOI] [PubMed] [Google Scholar]
  • 32. Shen Y., Bax A.. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks. J. Biomol. NMR. 2013; 56:227–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Lakomek N.A., Ying J., Bax A.. Measurement of 15N relaxation rates in perdeuterated proteins by TROSY-based methods. J. Biomol. NMR. 2012; 53:209–221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. d’Auvergne E.J., Gooley P.R.. The use of model selection in the model-free analysis of protein dynamics. J. Biomol. NMR. 2003; 25:25–39. [DOI] [PubMed] [Google Scholar]
  • 35. d’Auvergne E.J., Gooley P.R.. Model-free model elimination: a new step in the model-free dynamic analysis of NMR relaxation data. J. Biomol. NMR. 2006; 35:117–135. [DOI] [PubMed] [Google Scholar]
  • 36. d’Auvergne E.J., Gooley P.R.. Set theory formulation of the model-free problem and the diffusion seeded model-free paradigm. Mol. Biosyst. 2007; 3:483–494. [DOI] [PubMed] [Google Scholar]
  • 37. d’Auvergne E.J., Gooley P.R.. Optimisation of NMR dynamic models II. A new methodology for the dual optimisation of the model-free parameters and the Brownian rotational diffusion tensor. J. Biomol. NMR. 2008; 40:121–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. d’Auvergne E.J., Gooley P.R.. Optimisation of NMR dynamic models I. Minimisation algorithms and their performance within the model-free and Brownian rotational diffusion spaces. J. Biomol. NMR. 2008; 40:107–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Cantor C.R., Warshaw M.M., Shapiro H.. Oligonucleotide interactions. 3. Circular dichroism studies of the conformation of deoxyoligonucleotides. Biopolymers. 1970; 9:1059–1077. [DOI] [PubMed] [Google Scholar]
  • 40. Cavaluzzi M.J., Borer P.N.. Revised UV extinction coefficients for nucleoside-5′-monophosphates and unpaired DNA and RNA. Nucleic Acids Res. 2004; 32:e13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Braman J., Papworth C., Greener A.. Site-directed mutagenesis using double-stranded plasmid DNA templates. Methods Mol. Biol. 1996; 57:31–44. [DOI] [PubMed] [Google Scholar]
  • 42. Sengerova B., Tomlinson C., Atack J.M., Williams R., Sayers J.R., Williams N.H., Grasby J.A.. Bronsted analysis and rate-limiting steps for the T5 flap endonuclease catalyzed hydrolysis of exonucleolytic substrates. Biochemistry. 2010; 49:8085–8093. [DOI] [PubMed] [Google Scholar]
  • 43. Cheng N.-S. Formula for the viscosity of a Glycerol−Water mixture. Ind. Eng. Chem. Res. 2008; 47:3285–3288. [Google Scholar]
  • 44. Brouwer A.C., Kirsch J.F.. Investigation of diffusion-limited rates of chymotrypsin reactions by viscosity variation. Biochemistry. 1982; 21:1302–1307. [DOI] [PubMed] [Google Scholar]
  • 45. Kapanidis A.N., Lee N.K., Laurence T.A., Doose S., Margeat E., Weiss S.. Fluorescence-aided molecule sorting: analysis of structure and interactions by alternating-laser excitation of single molecules. Proc. Natl. Acad. Sci. U.S.A. 2004; 101:8936–8941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Ingargiola A., Lerner E., Chung S., Weiss S., Michalet X.. FRETBursts: an open source toolkit for analysis of freely-diffusing single-molecule FRET. PLoS One. 2016; 11:e0160716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Nir E., Michalet X., Hamadani K.M., Laurence T.A., Neuhauser D., Kovchegov Y., Weiss S.. Shot-noise limited single-molecule FRET histograms: comparison between theory and experiments. J. Phys. Chem. B. 2006; 110:22103–22124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Hohlbein J., Craggs T.D., Cordes T.. Alternating-laser excitation: single-molecule FRET and beyond. Chem. Soc. Rev. 2014; 43:1156–1171. [DOI] [PubMed] [Google Scholar]
  • 49. Cordingley M.G., Callahan P.L., Sardana V.V., Garsky V.M., Colonno R.J.. Substrate requirements of human rhinovirus 3C protease for peptide cleavage in vitro. J. Biol. Chem. 1990; 265:9062–9065. [PubMed] [Google Scholar]
  • 50. Kay L.E., Torchia D.A., Bax A.. Backbone dynamics of proteins as studied by 15N inverse detected heteronuclear NMR spectroscopy: application to staphylococcal nuclease. Biochemistry. 1989; 28:8972–8979. [DOI] [PubMed] [Google Scholar]
  • 51. Rossi P., Swapna G.V., Huang Y.J., Aramini J.M., Anklin C., Conover K., Hamilton K., Xiao R., Acton T.B., Ertekin A. et al. . A microscale protein NMR sample screening pipeline. J. Biomol. NMR. 2010; 46:11–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Bieri M., d’Auvergne E.J., Gooley P.R.. relaxGUI: a new software for fast and simple NMR relaxation data analysis and calculation of ps-ns and mus motion of proteins. J. Biomol. NMR. 2011; 50:147–155. [DOI] [PubMed] [Google Scholar]
  • 53. Tamiola K., Acar B., Mulder F.A.. Sequence-specific random coil chemical shifts of intrinsically disordered proteins. J. Am. Chem. Soc. 2010; 132:18000–18003. [DOI] [PubMed] [Google Scholar]
  • 54. Lacroix E., Viguera A.R., Serrano L.. Elucidating the folding problem of alpha-helices: local motifs, long-range electrostatics, ionic-strength dependence and prediction of NMR parameters. J. Mol. Biol. 1998; 284:173–191. [DOI] [PubMed] [Google Scholar]
  • 55. Das R.K., Ruff K.M., Pappu R.V.. Relating sequence encoded information to form and function of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2015; 32:102–112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Holehouse A.S., Das R.K., Ahad J.N., Richardson M.O., Pappu R.V.. CIDER: Resources to analyze Sequence-Ensemble relationships of intrinsically disordered proteins. Biophys. J. 2017; 112:16–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Das R.K., Pappu R.V.. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:13392–13397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Finger L.D., Patel N., Beddows A., Ma L., Exell J.C., Jardine E., Jones A.C., Grasby J.A.. Observation of unpaired substrate DNA in the flap endonuclease-1 active site. Nucleic Acids Res. 2013; 41:9839–9847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Patel N., Exell J.C., Jardine E., Ombler B., Finger L.D., Ciani B., Grasby J.A.. Proline scanning mutagenesis reveals a role for the flap endonuclease-1 helical cap in substrate unpairing. J. Biol. Chem. 2013; 288:34239–34248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Syson K., Tomlinson C., Chapados B.R., Sayers J.R., Tainer J.A., Williams N.H., Grasby J.A.. Three metal ions participate in the reaction catalyzed by T5 flap endonuclease. J. Biol. Chem. 2008; 283:28741–28746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Kim Y., Ho S.O., Gassman N.R., Korlann Y., Landorf E.V., Collart F.R., Weiss S.. Efficient site-specific labeling of proteins via cysteines. Bioconjug. Chem. 2008; 19:786–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Tsutakawa S.E., Lafrance-Vanasse J., Tainer J.A.. The cutting edges in DNA repair, licensing, and fidelity: DNA and RNA repair nucleases sculpt DNA to measure twice, cut once. DNA Repair (Amst.). 2014; 19:95–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Tsutakawa S.E., Tainer J.A.. Double strand binding-single strand incision mechanism for human flap endonuclease: implications for the superfamily. Mech. Ageing. Dev. 2012; 133:195–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Orans J., McSweeney Elizabeth A., Iyer Ravi R., Hast Michael A., Hellinga Homme W., Modrich P., Beese Lorena S.. Structures of human exonuclease 1 DNA complexes suggest a unified mechanism for nuclease family. Cell. 2011; 145:212–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Kannan S., Kohlhoff K., Zacharias M.. B-DNA under stress: over- and untwisting of DNA during molecular dynamics simulations. Biophys. J. 2006; 91:2956–2965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Chaires J.B. Allostery: DNA does it, too. ACS Chem. Biol. 2008; 3:207–209. [DOI] [PubMed] [Google Scholar]
  • 67. Boehr D.D., Nussinov R., Wright P.E.. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 2009; 5:789–796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Csermely P., Palotai R., Nussinov R.. Induced fit, conformational selection and independent dynamic segments: an extended view of binding events. Trends Biochem. Sci. 2010; 35:539–546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Ma B., Nussinov R.. Enzyme dynamics point to stepwise conformational selection in catalysis. Curr. Opin. Chem. Biol. 2010; 14:652–659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Oldfield C.J., Dunker A.K.. Intrinsically disordered proteins and intrinsically disordered protein regions. Annu. Rev. Biochem. 2014; 83:553–584. [DOI] [PubMed] [Google Scholar]
  • 71. Habchi J., Tompa P., Longhi S., Uversky V.N.. Introducing protein intrinsic disorder. Chem. Rev. 2014; 114:6561–6588. [DOI] [PubMed] [Google Scholar]
  • 72. Nussinov R., Ma B., Tsai C.J.. Multiple conformational selection and induced fit events take place in allosteric propagation. Biophys. Chem. 2014; 186:22–30. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Data Availability Statement

The backbone 1H, 15N and 13C chemical shifts of the free protein in EDTA and the hFEN1K93A–DNA complex in Ca2+ have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu/) under the BMRB accession codes 27160 and 27404, respectively.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES