Skip to main content
Nature Communications logoLink to Nature Communications
. 2020 Feb 28;11:1120. doi: 10.1038/s41467-020-14856-2

Cryo-EM structures of the XPF-ERCC1 endonuclease reveal how DNA-junction engagement disrupts an auto-inhibited conformation

Morgan Jones 1, Fabienne Beuron 2, Aaron Borg 3, Andrea Nans 4, Christopher P Earl 1, David C Briggs 1, Ambrosius P Snijders 3, Maureen Bowles 1, Edward P Morris 2, Mark Linch 5, Neil Q McDonald 1,6,
PMCID: PMC7048804  PMID: 32111838

Abstract

The structure-specific endonuclease XPF-ERCC1 participates in multiple DNA damage repair pathways including nucleotide excision repair (NER) and inter-strand crosslink repair (ICLR). How XPF-ERCC1 is catalytically activated by DNA junction substrates is not currently understood. Here we report cryo-electron microscopy structures of both DNA-free and DNA-bound human XPF-ERCC1. DNA-free XPF-ERCC1 adopts an auto-inhibited conformation in which the XPF helical domain masks the ERCC1 (HhH)2 domain and restricts access to the XPF catalytic site. DNA junction engagement releases the ERCC1 (HhH)2 domain to couple with the XPF-ERCC1 nuclease/nuclease-like domains. Structure-function data indicate xeroderma pigmentosum patient mutations frequently compromise the structural integrity of XPF-ERCC1. Fanconi anaemia patient mutations in XPF often display substantial in-vitro activity but are resistant to activation by ICLR recruitment factor SLX4. Our data provide insights into XPF-ERCC1 architecture and catalytic activation.

Subject terms: Biochemistry, Cryoelectron microscopy, Diseases


The endonuclease XPF-ERCC1 is a key component of the repair machinery to process both intra-strand and inter-strand DNA crosslinks. Here the authors present the cryo-EM structures of DNA-free and DNA-bound human XPF-ERCC1 and find that DNA-free XPF-ERCC1 adopts an auto-inhibited conformation, which is opened up upon DNA binding and they also characterise the biochemical properties of patient-derived XPF-ERCC1 mutations.

Introduction

Structure-specific endonucleases (SSEs) are found in all branches of life and play crucial roles in genome repair, replication and recombination1. These endonucleases act on similar DNA structures with defined polarity but use different catalytic mechanisms. The structurally related XPF/MUS81 family are an important group of human 3′-nucleases that associate to form two active endonuclease heterodimers (XPF–ERCC1 and MUS81–EME1) and a DNA translocase (FANCM–FAAP24) with a pseudo-nuclease architecture2. XPF–ERCC1 recognises double-stranded/single-stranded (ds/ss) DNA junctions which have a 3′-ssDNA overhang, nicking the dsDNA backbone to produce a substrate for subsequent steps in DNA repair pathways. XPF–ERCC1 activity is essential for removing helical DNA distortions arising from ultraviolet-induced damage and bulky adducts as part of the nucleotide excision repair (NER) pathway3. In this context XPF–ERCC1 nicks the damaged DNA strand 5′ of the lesion at the ds/ss junction of an NER repair bubble. It is also required for interstrand cross-link repair (ICLR), some double‐stranded break repair processes, base excision repair, Holliday junction resolution, gene-conversion and telomere maintenance410. Mutations in XPF and ERCC1 genes are associated with genetic disorders exhibiting diverse phenotypes. These pathologies are caused by defects in the genome maintenance pathways that involve XPF–ERCC1, including xeroderma pigmentosum (XP), Cockayne’s syndrome, Fanconi anaemia (FA), XPFE progeria and cerebro-oculo-facio-skeletal syndrome1115. The genotype–phenotype correlations of XPF–ERCC1 driven diseases are still poorly understood.

XPF is the enzymatically active subunit of the heterodimeric XPF–ERCC1 endonuclease and is comprised of a helicase-like module (HLM) and a catalytic module (CM) (Fig. 1a). The XPF HLM is related to the superfamily 2 helicases, with two divergent RecA-like domains that flank an all α-helical domain16 (Fig. 1a). Both XPF RecA-like domains, termed RecA-like domain 1 (RecA1) and RecA-like domain 2 (RecA2) lack the residues necessary to bind and hydrolyse ATP17,18. Despite this, the HLM is required for full XPF activity and binds both the ICLR recruitment factor SLX4 and ds/ssDNA structures19,20. The XPF CM consists of a nuclease domain containing a metal-dependent GDXnERKX3D active site motif and a tandem helix–hairpin–helix, termed an (HhH)2 domain21. The smaller ERCC1 subunit has no catalytic activity but is structurally related to the XPF CM, consisting of a nuclease-like domain (NLD) and a dsDNA-binding (HhH)2 domain. Both ERCC1 domains heterodimerise with their equivalent domains in the XPF CM, forming discrete nuclease–NLD and 2×(HhH)2 functional units. As well as contributing to XPF stability, ERCC1 can recognise ds/ssDNA substrates and engages the XPA repair protein that is required for XPF–ERCC1 recruitment to sites of NER22. Currently, there are no available structures of the XPF HLM or of any full-length XPF–Mus81 family members. By solving the structure of a near full-length human XPF–ERCC1 we have defined its overall architecture and uncovered a previously unreported autoregulatory mechanism. We show XPF–ERCC1 adopts an auto-inhibited conformer in the absence of DNA in order to prevent promiscuous cleavage and provide structural evidence for the initial steps of XPF–ERCC1 activation upon binding a DNA junction.

Fig. 1. Structure of auto-inhibited human XPF–ERCC1 endonuclease.

Fig. 1

a Domain architecture of XPF–ERCC1 colour coded by domain. XPF: RecA1 (blue), helical (green), RecA2 (pink), nuclease (gold) and (HhH)2 (dark grey). ERCC1: NLD (orange) and (HhH)2 (light grey). Residue numbering indicates domain boundaries and dotted arrows indicate dimerisation interfaces. Two insert sequences within RecA2 are shown in white embellishing the RecA domain fold. Grey lines define the helicase-like module (HLM) and catalytic module (CM). b SDS-PAGE gel of purified recombinant XPF–ERCC1 used for cryo-EM studies. c Two orthogonal views of the composite XPF–ERCC1 cryo-EM map ranging from a global resolution of 3.6–4 Å, coloured by domain according to panel a. d Final XPF–ERCC1 atomic model coloured by domain according to panel a, displayed within a transparent cryo-EM potential map. The XPF nuclease–(HhH)2 and ERCC1 NLD–(HhH)2 domain linker is visible at lower map thresholds. e Representative region of the cryo-EM map close to strand  ß-11 and sidechains from the final model in pink. f Representative part of the cryo-EM map close to α-helix 20 with sidechains shown from the final model in pink.

Results

Structure determination of human XPF–ERCC1 endonuclease

A single particle cryo-electron microscopy (cryo-EM) density map of purified recombinant XPF–ERCC1 complex (128 kDa) (Fig. 1b) was determined at a global resolution of 4.0 Å (Supplementary Fig. 3a, c, f and Supplementary Movie 1) enabling the assignment of XPF–ERCC1 domain organisation (Fig. 1c, d). The map represents the single dominant conformer observed following 3D classification protocols (Supplementary Fig. 2) and exhibits clear secondary structure features throughout (Fig. 1c and Supplementary Movie 2). Local resolution analysis (Supplementary Fig. 3a) indicated that the heterodimeric 2×(HhH)2 domain exhibited some mobility, so signal subtraction of this domain was carried out followed by local refinement. This process improved the global resolution of the resulting sub-volume to 3.6 Å (Supplementary Fig. 3a, d, g) which enabled building, refinement and validation of an atomic model (Fig. 1d). The locally refined map shows clear sidechain density throughout with the local resolution ranging from 3.4 Å in the RecA1 and RecA2 domain cores (Fig. 1e, f) to 7 Å at the periphery of the ERCC1 NLD. Regions modelled as polyalanine or omitted from the final structure are shown in Supplementary Table 1. There is no density recovered for the ERCC1 N-terminus, consistent with it being proteolytically cleaved (Supplementary Fig. 1b). The N-terminus of ERCC1 is not required for wild-type activity in vitro (Supplementary Fig. 1d). Inspection of the angular distribution of assigned particle images during refinement, the 3DFSC curves and 3D flexibility analysis indicate that resolution differences were due to intrinsic flexibility rather than a lack of contributing particle images (Supplementary Fig. 3b, e–g).

Overall architecture of human XPF–ERCC1 endonuclease

The cryo-EM structure of near full-length XPF–ERCC1 reveals a compact conformation with extensive interactions between the XPF HLM and CM modules (Fig. 1d). Overall, the HLM adopts a “C”-shape that has dimensions of approximately 70 × 40 × 60 Å. The two RecA-like domains form a rigid platform and lack a nucleotide cleft characteristic of many ATP-driven helicases. Instead the two XPF RecA-like domains are linked through the intimate intertwining of secondary structural elements that extend beyond their globular portion (Supplementary Fig. 4d). While RecA1 caps one edge of the HLM and engages the XPF nuclease domain in the CM, the helical domain caps the other HLM extremity and engages the CM and the dsDNA-binding ERCC1 (HhH)2 domain (Figs. 1d and 2a). This arrangement serves to separate and uncouple both functional domains of ERCC1 through its connecting linker. These interactions confirm the key regulatory role for the HLM by engaging crucial elements within the XPF CM and ERCC1. Interfaces observed in the XPF–ERCC1 structure were largely validated using cross-linking mass spectrometry (XL-MS) (Fig. 2f, g) (Supplementary Table 2). Cross-links are found predominately between both the XPF (HhH)2 domain and the ERCC1 NLD, and between the XPF RecA2 and ERCC1 NLD. In addition, several cross-links exceeding the distance cut-off are consistent with two principal vectors of dynamic movement in solution.

Fig. 2. Architecture of the XPF helicase-like module and coupling with the catalytic module.

Fig. 2

a View of the XPF–ERCC1 structure showing the helicase-like module (HLM as surface rendering) contacts the XPF nuclease domain (gold ribbon cartoon) at two interfaces (dashed red boxes). Domains are coloured according to the scheme used in Fig. 1. b, c Close-up view of interaction interfaces overlaid with the composite cryo-EM map. Selected residues are displayed as sticks and coloured by heteroatom, blue—N, Red—O. b The hydrophobic interaction interface between XPF RecA2 (blue) and XPF nuclease domain (gold). c Interaction of XPF helical domain residues 273–275 (green) with the XPF nuclease domain (gold). d the XPF–ERCC1 HLM (surface rendered) contacts with the ERCC1 (HhH)2 domain at a single interface (dashed red box). e Interaction of XPF helical domain helix α13 (green) and the ERCC1 (HhH)2 domain close to its dsDNA-binding residues (pink). f Two orthogonal views of the XPF–ERCC1 structure with XL-MS distance constraints overlaid. Distances within the allowed Cα–Cα cut-off distance of 30 Å are displayed in blue, distances greater than this cut-off displayed in red. Blue dotted line indicates a cluster of allowed distances between the XPF helical and ERCC1 (HhH)2 domains. g Cartoon schematic representing inter-domain cross-links detected by mass spectrometry. Each black line indicates a single unique cross-link between residues in different domains. Domains within the pink ellipsoid form the XPF HLM, whereas domains within the XPF CM and ERCC1 are within the pale blue ellipsoid.

Structure of the XPF HLM

The XPF HLM is typical of other helicase superfamily 2 (SF2) members with a RecA1–helical domain–RecA2 organisation, but with substantial inserts within RecA2 (Fig. 1a). In the absence of ATP binding and hydrolysis motifs or a nucleotide binding cleft, RecA1–RecA2 are linked together through a predominantly polar interface (2007 Å2). Major interface contributions are made by secondary structural elements ß8 and α20 that form a C-terminal extension to RecA1 and RecA2, respectively, as well as the XPF amino-terminus (Supplementary Fig. 4d). ß8 extends the smaller RecA2 four parallel ß-stranded sheet while α20 packs against the larger RecA1 seven-stranded parallel beta sheet (ß1–ß7). Additional RecA1–RecA2 contacts centre on a π-ring stacking interaction between RecA1 domain Y71XPF and RecA2 domain Y564XPF at one interface edge (Supplementary Fig. 4c) and L39XPF and I592XPF on the other edge. Polar residues make up the remaining contacts with a small cavity. No protein expression was observed for a Y71AXPF mutant (Table 1). The observed structural rigidity of the RecA1–RecA2 unit is structurally homologous to equivalent domains in nucleosome-bound chromatin remodellers ISW1 and INO8023,24.

Table 1.

XPF–ERCC1 mutants that disrupt protein folding.

Protein Mutant Mutation rationale Effect on protein expression
XPF L608P XP Soluble aggregates
XPF T567A XP Soluble aggregates
XPF R799W XP No expression
XPF Y71A Structural integrity No expression
ERCC1 T248A Autoinhibition No expression
ERCC1 T252A Autoinhibition No expression

XPF RecA2 has two large inserts with unknown functions. Insert one (residues 345–377) separates the helical and RecA2 domains and insert two (residues 441–550) interrupts the RecA2 fold. There is sufficient density in our map to trace the backbone of residues 345–362 and 366–377 from insert one projecting away from the body of the structure. However, no density was recovered for insert two, in agreement with predictions that this region is intrinsically disordered in the absence of DNA. Futhermore, XL-MS data identified a large number of intra-insert cross-links within inserts one and two, consistent with these highly basic regions being flexible (Supplementary Table 2).

The XPF helical domain is an integral part of the HLM and folds as a five anti-parallel helical bundle. This domain packs tightly against RecA2 and is anchored through an interface centred close to residues Q300XPF/D302XPF and S412XPF/Q419XPF (Supplementary Fig. 4b). The Q300AXPF mutant significantly reduces XPF–ERCC1 expression and increases aggregation (Table 1). Helix α17 (residues 426–440) also contributes to tethering the helical domain to RecA2. The observed position of the helical domain determines the orientation and angle of the extended RecA2 C-terminal α20 helix (Supplementary Fig. 4d), stabilising the HLM conformation through interaction between Q226XPF and T614XPF.

The XPF helical domain regulates XPF–ERCC1 activity

The XPF HLM is coupled to the CM through contacts from RecA1 and the helical domain (Fig. 2a). RecA1 forms a substantial interface (1684 Å2) with the XPF nuclease domain involving aromatic and hydrophobic residues from RecA1 α5 and α6 helices and XPF nuclease domain η4 and α21 helices and ß14 strand (Fig. 2b). The hydrophobic nature of the contact suggests that anchoring of the HLM to the XPF nuclease domain through RecA1 forms a permanent part of the XPF–ERCC1 architecture.

The XPF helical domain forms a contact with the XPF nuclease domain that sterically prevents the ds/ssDNA substrate from reaching the XPF active site (Fig. 2c and Supplementary Movie 3). A key contact within this auto-inhibited conformation is between sidechains of H275XPF and S730XPF. A H275AXPF, W274AXPF double mutant, likely to disrupt this contact, displays a 1.5-fold increase in catalytic efficiency relative to the wild type (Table 2).

Table 2.

Kinetic data for purified XPF–ERCC1 mutants.

Protein Mutant Mutation rationale Vmax (fmol min−1) Km (nM) kcat (s−1) kcat/Km (nM−1 min−1)
XPF WT 104.7 ± 2.8 12.1 ± 1.4 20.9 ± 0.66 1.73
XPF L230R FA 65 ± 2.1 8.1 ± 1.1 13.0 ± 3.3 1.60
XPF L236R FA/CS 91.2 ± 2.5 10.8 ± 1.2 18.2 ± 0.5 1.69
XPF E239K FA 88.9 ± 4.7 12.4 ± 2.1 16.9 ± 0.7 1.36
XPF S786F ICLR deficient 80.9 ± 3.8 32.5 ± 1.5 7.9 ± 0.8 0.24
XPF 323–326 Δ ICLR deficient 120.2 ± 1.4 20.5 ± 0.8 24.2 ± 0.3 1.18
XPF S312A Autoinhibition 126.1 ± 28.2 5.1 13.5 ± 1.8 2.65
XPF W274A, H275A Autoinhibition 193.4 ± 8.6 15.2 ± 2.4 38.7 ± 1.3 2.55
XPF R112A DNA binding 24.5 ± 1.1 4.32 ± 0.63 5.6 ± 0.2 1.30
XPF 829–833 Δ Autoinhibition linker disruption 20.2 ± 0.5 1.9 ± 1.2 3.8 ± 0.2 2.00
XPF R589W XP 8.22 ± 2.6 25.9 ± 6.1 1.4 ± 0.9 0.05
ERCC1 L253A Autoinhibition 33.1 ± 2.7 28.7 ± 22.3 3.4 ± 0.3 0.12

Each kinetic value was obtained from 3 technical replicates (n = 3) ± standard deviation (SD).

A second autoinhibitory interface exists between the XPF helical domain and the ERCC1 (HhH)2 domain (Fig. 2d, e and Supplementary Movie 3). This interface is formed through predominantly polar contacts involving the highly conserved T248ERCC1, T252ERCC1 residues and both S312XPF and T316XPF. Previous structural and biochemical data suggest that the ERCC1 (HhH)2 domain binds dsDNA through hairpin residues S244ERCC1–N246ERCC1 and G276ERCC1–G278ERCC1 mainchain atoms25,26. These motifs are proximal to T248ERCC1 and T252ERCC1, and are not accessible in the DNA-free conformation of XPF25. The S312AXPF mutant displays a 1.5-fold higher catalytic efficiency than the wild type likely due to the disruption of this autoinhibitory interaction (Table 2). Equally, shortening the connecting linker between the XPF nuclease and (HhH)2 domain would be predicted to shift the 2×(HhH)2 unit towards the nuclease domain releasing the DNA-binding residues. Indeed, a 829–833ΔXPF mutant displayed a modest 1.2-fold increase in catalytic efficiency and a 7.5-fold tighter Km relative to wild type (Table 2).

Heterodimerisation of XPF and ERCC1 through two interfaces

ERCC1 is intimately coupled to the XPF CM through two obligate dimerisation surfaces at the equivalent domains of each molecule. The XPF nuclease domain uses a helix–strand–helix motif (α25–ß19–α26) to heterodimerise with the equivalent surface of the ERCC1 NLD (α3–ß8–α4) forming a kidney-shaped dimer with an extensive interaction interface (1684 Å2) (Supplementary Fig. 4a). The contact is predominantly hydrophobic and is flanked by three salt bridges (Supplementary Fig. 4a). This interface uses equivalent elements to those mediating heterodimerisation of homologous domains from Mus81–Eme1 and FANCM–FAAP24 complexes27,28. We note that the XPF (HhH)2 domain hetero-dimerises with the ERCC1 (HhH)2 domain through predominantly hydrophobic contacts close to F851XPF and F900XPF as previously observed26,29. The (HhH)2 domain from XPF and ERCC1 are connected to their XPF nuclease domain/ERCC1 NLD domain through ordered linker sequences. There is sufficient density in our cryo-EM map to trace the mainchain atoms for both linkers (Fig. 1d). The ERCC1 linker makes unexpected interactions with the XPF nuclease domain via Y215ERCC1 and D221ERCC1 (Fig. 3b). We note that Y215ERCC1 lies adjacent to S786XPF suggesting the FA mutation S786FXPF would disrupt this contact with ERCC1. Despite the close association of XPF CM and ERCC1 through heterodimerization, their respective functional domains remain uncoupled and held apart through the extended conformation of their connecting linkers. This is important to consider when comparing with the DNA-bound conformations (see later).

Fig. 3. Mapping XPF–ERCC1 disease mutations and DNA repair pathway recruitment sites.

Fig. 3

a Top, a ribbon model of XPF–ERCC1 highlighting the spatially distinct binding sites of XPA and SLX4. XPA binds to the ERCC1 NLD (orange) and SLX4 binds to the XPF helical domain (green). The XPA peptide (residues 66–77) atoms are displayed as red spheres (PDB: 2JNW). The key SLX4 binding residue L230 sidechain atoms are also displayed as red spheres. Bottom, representative DNA structures targeted by XPF–ERCC1 through SLX4 (interstrand cross-link) or XPA (intrastrand cross-link) recruitment. bd The molecular environment of patient-derived disease mutations are indicated on the structure, superposed with the cryo-EM map displayed close to the mutation site. Selected residues are displayed using stick rendering coloured by heteroatom. Residues associated with Fanconi anaemia (FA) patient mutations are coloured red whilst those associated with XP are coloured black. Black or orange dashed ellipses indicate the environment close to XP or FA mutations respectively. b Mutations in the XPF nuclease domain and ERCC1 NLD lie close to their interface and give rise to both FA and XP. c XP-associated mutations disrupt key structural contacts in the XPF RecA2 domain are shown overlaid with the composite cryo-EM map. d FA-associated mutations cluster within the XPF helical domain. The helical domain also contains the XP-associated mutant, I225 (black).

Structural context of XP and FA patient mutations in XPF

Recruitment of XPF–ERCC1 into either NER or ICLR pathway complexes is dependent on interaction with partner proteins XPA or SLX4 at their respective damaged DNA structures (Fig. 3a). A previous study mapped the XPA-binding site to a cleft within the ERCC1 NLD (Fig. 3a)30. This interaction is spatially distinct from the proposed SLX4 site centred within the helical domain at L230XPF19. Insights from disease mutations have shown that repair pathway recruitment can be disrupted by separation-of-function (FA) or partial loss-of-function (XP) mutations, however the structural basis for this is unclear31.

With the availability of a three-dimensional XPF–ERCC1 structure, it was possible to explore the location and structural environment of disease-causing mutations and correlate this with their impact on enzyme stability and catalytic activity. Patient-derived XP or FA-associated mutations were characterised in vitro using a previously reported fluorescence incision assay20. Mutations associated with XP mapped primarily to the XPF RecA2 domain and its inserts15,32,33. L608XPF, R589XPF and T567XPF are located in the folded region of the RecA2 domain, with the latter two forming structurally important intra-domain contacts32 (Fig. 3c). Indeed, L608PXPF and T567AXPF mutant proteins formed soluble aggregates when expressed recombinantly, as measured by analytical size exclusion chromatography (SEC) and an R589WXPF mutant exhibited 35-fold reduction in catalytic efficiency (Table 2). The R799WXPF XP mutation failed to express recombinantly and lies on the periphery of the heterodimeric nuclease–NLD interface with ERCC1 (Fig. 3b). These data, taken in the context of our structure, suggest the L608PXPF, T567AXPF, R589WXPF and R799WXPF XP disease mutants compromise XPF–ERCC1 structural stability (Table 1). I225XPF is also associated with XP32 and maps onto the hydrophobic core of the helical domain (Fig. 3d) suggesting it is also likely to contribute to XPF–ERCC1 structural integrity.

FA patients are proficient in NER but deficient in ICLR, indicating a likely separation of function19,34. Our structure indicates the FA point mutations within XPF such as L230RXPF, C236RXPF and G325EXPF cluster within the XPF helical domain (Fig. 3d)11. These mutants, when expressed recombinantly, were found to have a similar level of endonuclease activity to wild-type XPF–ERCC1 against a stem–loop substrate (Table 2). Previous studies indicated these FA mutations are unable to engage SLX419. This would impact both the ability of SLX4 to stimulate XPF–ERCC1 activity35 as well as recruit XPF–ERCC1 to ICLR sites in vivo19. We found that XPF–ERCC1 co-expressed with a truncated form of human SLX4 (XPF–ERCC1–SLX4NTD) indeed showed a six-fold increase in catalytic efficiency (Table 3 and Supplementary Fig. 9a–e). To confirm whether FA XPF–ERCC1 mutant 323–326ΔXPF had a reduced SLX4 association and/or a negative impact on activity, we measured the amount of XPF–ERCC1 endonuclease activity recovered after affinity purification followed by gel filtration. The 323–326ΔXPF FA mutant showed substantially less endonuclease activity (Supplementary Fig. 9d). The FA mutant L230RXPF lies close to XPF residues 323–326 and was previously shown to be unable to bind full-length SLX4, indicating that it forms a key determinant of the SLX4 binding site19. Our data are consistent with a differential impact of XPF mutants (loss-of-function) affecting NER from those XPF mutations (separation-of-function) that impact SLX4-driven activation and interaction in ICLR36.

Table 3.

SLX41–758–XPF–ERCC1 mutation data summary.

Protein Mutant Vmax (fmol min−1) Km (nM) kcat (min−1) kcat/Km (nM−1 min−1)
XPF(–ERCC1–SLX4NTD) WT 219.2 ± 9.3 4.2 ± 0.7 43.8 ± 1.1 10.4
XPF(–ERCC1–SLX4NTD) 323–326 Δ 49 ± 6.5 32.2 ± 9.6 9.8 ± 1.4 0.30
XPF(–ERCC1) WT 104.7 ± 2.8 12.1 ± 1.4 20.9 ± 0.66 1.73

Each kinetic value was obtained from 3 technical replicates (n = 3) ± the standard deviation (SD).

XPF–ERCC1 conformational activation on DNA-junction binding

We hypothesised that the autoinhibitory interactions formed by the XPF helical domain need to be released following XPF–ERCC1 DNA-junction engagement, prior to the incision reaction. To probe the nature of such potential conformational changes, we assembled a complex of XPF–ERCC1 bound to a DNA stem–loop model substrate (10-duplex 20-T single-strand stem–loop) that we previously showed presents a single incision site to XPF–ERCC120. Using an electrophoretic mobility shift assay (EMSA) we observed 1:1:1 stoichiometric binding of the stem–loop DNA to XPF–ERCC1 (Supplementary Fig. 5b, c).

This sample was used for cryo-EM data collection leading to a single-particle cryo-EM density map at a global resolution of 7.7 Å (Supplementary Fig. 6b). Signal subtraction of the dimeric 2×(HhH)2 domain and DNA density, followed by local refinement, improved the resolution of the resulting sub-volume to 5.9 Å (Supplementary Fig. 6c). The locally refined map shows evidence of helical features, with the local resolution highest in the core of the RecA domains (Supplementary Fig. 6a). 3DFSC (Supplementary Fig. 6e, f and Supplementary Movie 4) analysis indicates that the map does not suffer heavily from anisotropy and the lower resolution of the DNA-bound map relative to the DNA-free is as a result of increased flexibility. Indeed, XPF–ERCC1 does not engage DNA in vivo unless recruited by XPA in complex with TFIIH37. It is likely that the DNA-bound XPF–ERCC1 complex only becomes fully stabilised in the presence of these additional factors.

The DNA-bound reconstruction enabled the placement of all XPF–ERCC1 domains using the DNA-free structure as an initial template (Fig. 4a and Supplementary Movie 5). Aligning the DNA-bound and DNA-free maps identified key changes in the architecture of XPF–ERCC1, the most dramatic being the disengagement of the 2×(HhH)2 domain from the XPF helical domain and it’s repositioning adjacent to the XPF nuclease—ERCC1 NLD dimer, as seen for other XPF/Mus81 family endonucleases27,28 (Fig. 4e). An additional region of density was identified adjacent to the 2×(HhH)2 domain but segmented into a distinct volume (Fig. 4b). This density was assigned as the duplex portion of the stem–loop substrate due to the unambiguous presence of a 19 Å concave major groove and its length measuring the distance of 10 base pairs (Fig. 4c). In order to correctly position the 2×(HhH)2 domain with respect to the dsDNA, the structure of the Aeropyrum pernix XPF homodimer in complex with dsDNA was fit into the map and used to align the human 2×(HhH)2—dsDNA functional unit (Fig. 4b, c). The fit to density was then optimised for the human structure using Flex-EM38. This positions the 2×(HhH)2 domain–dsDNA-binding residues S244ERCC1–N246ERCC1 and G276ERCC1–G278ERCC1 in close proximity to the dsDNA minor groove in a homologous fashion to other family members (Fig. 4e). Furthermore, comparison of the DNA-free and DNA-bound 2D class averages clearly indicates a repositioning of the 2×(HhH)2 domain upon substrate engagement (Fig. 4a).

Fig. 4. Conformational reorganisation of XPF–ERCC1 engaged by a DNA-junction substrate.

Fig. 4

a Two orthogonal views of DNA-bound XPF–ERCC1 ribbon structure coloured by domain according to Fig. 1a. The dsDNA duplex is shown in purple. The ribbon model is shown with the cryo-EM composite map, ranging from 5.9–7.7 Å global resolution. b Top and middle: two orthogonal views comparing segmented DNA-free and DNA-bound maps. The DNA-bound map displayed is the globally refined and unsharpened 7.7 Å map. The DNA-free map displayed is the globally refined and unsharpened 4.0 Å map low-pass filtered to 9 Å resolution to display comparable levels of detail to the DNA-bound map. Both maps were segmented in UCSF Chimera revealing sub-volumes for the XPF HLM (pink), XPF nuclease–ERCC1 NLD dimer (blue), 2×(HhH)2 domain (yellow) and dsDNA (white). Bottom: two orthogonal views of the HLM and the CM/ERCC1 dsDNA-binding module. Each sub-volume contains the ribbon model of DNA-bound XPF–ERCC1, orange dotted line indicates unmodeled density corresponding to the position of the first RecA2 domain flexible insert. c Fitted model for the dimeric 2×(HhH)2 domain engaging dsDNA via the minor groove, placed within map density. Major groove distance and dsDNA-binding residues are indicated. The dsDNA-binding hairpin residues of ERCC1 are highlighted in yellow. d Model for the XPF RecA1–nuclease/ERCC1 NLD interface following substrate engagement. Labels and red circles indicate the XPF active site location. e Human XPF nuclease–ERCC1 NLD dimer, 2×(HhH)2 domain and dsDNA positioned within the cryo-EM map together with a structurally superposed A. pernix XPF structure (PDB code 2BGW) bound to dsDNA. A similar dsDNA trajectory is evident (purple box). f 2D class averages of DNA-bound XPF–ERCC1 with coloured dotted lines indicating the position of domains according to the key. g Comparison of 2D classes of the same molecular views from DNA-free (left column) and DNA-bound (right column) XPF–ERCC1 with coloured circles indicating the position of key domains, coloured according to panel (f).

The remaining domains of XPF–ERCC1 can be fit unambiguously into the density. The RecA1–RecA2 unit remains structurally rigid, with high-resolution features present in 2D class averages (Fig. 4f, g), reaffirming its role as an inactive helicase. Whilst the remainder of the complex increases in flexibility upon substrate engagement (Fig. 4g), the interface between the XPF RecA1 and nuclease domains remains intact (Fig. 4d). Comparison with the DNA-free structure reveals that the XPF helical domain pivots by approximately 15°, rotating ~11 Å away from the nuclease domain (Supplementary Fig. 7a). The increased flexibility of the XPF helical domain following its disengagement with the XPF nuclease domain can be visualised by the loss of high-resolution features in 2D class averages following substrate engagement (Fig. 4g). This conformational change breaks the autoinhibitory contact formed between H275XPF and S730XPF as predicted from the DNA-free structure. The remaining unmodeled map density likely corresponds to the flexible first RecA2 domain insert (Fig. 4b).

A model for DNA junction-based activation

Tight regulation of endonuclease catalytic activity is needed to prevent inappropriate DNA cleavage. Indeed XPF–ERCC1 displays no activity towards DNA duplexes, ssDNA or an equimolar mixture of ds and ssDNA substrate (Fig. 5b). This implies that it is the proximity of the ssDNA and dsDNA elements in a junction context that is uniquely required to stimulate XPF–ERCC1 activation and overcome complex autoinhibition. Analysis of our DNA-bound structure reveals that the presence of a junction shifts the dimeric 2×(HhH)2 domain by 47 Å to contact the XPF nuclease–ERCC1 NLD dimer, disrupting contacts with the XPF helical domain (Fig. 5a, b, Supplementary Movies 9 and 10). In this configuration the dimeric 2×(HhH)2 domain lies proximal to the ERCC1 NLD domain, coupling both known ssDNA-binding elements of the endonuclease25,27,28,39 within the ERCC1 NLD and XPF (HhH)2 domain (Fig. 5a, b). Others have proposed that XPF–ERCC1 2×(HhH)2 domain is sufficient to recognise ds/ssDNA junctions40, however, the precise arrangement of multiple ssDNA and dsDNA domains required for DNA-junction recognition remains to be determined. The final DNA-bound model lacks the single-stranded portion of the stem–loop and places the scissile phosphodiester bond approximately 15 Å from the XPF active site motif (residues 725–727) (Fig. 5b). We interpret the DNA-bound structure as showing important features of an initial step towards full DNA-junction recognition prior to the incision reaction. The low resolution of the DNA component within the cryo-EM map (approximately 9 Å) suggests that the dimeric 2×(HhH)2–DNA complex can adopt multiple conformers. Equally, the accessibility of the dsDNA major groove opposite to the 2×(HhH)2 minor groove interaction could be re-oriented towards the positively charged concave surface within the XPF HLM (Fig. 5c, d).

Fig. 5. Comparison of DNA-free and DNA-bound XPF–ERCC1 gives insights into DNA-junction activation.

Fig. 5

a Two orthogonal views of DNA-free XPF–ERCC1 highlighting known ssDNA binding sites. An NMR structure of XPF (HhH)2 domain bound to a ssDNA (PDB: 2KN7) mapped on to the full-length DNA-free structure indicates a distance of ~50Å between the ssDNA-binding site and residues known to bind ssDNA in the ERCC1 NLD. b Equivalent views as in panel a for DNA-bound XPF–ERCC1 indicating the 2×(HhH)2 domain undergoes a substantial movement to engage the ERCC1 NLD. This positions the two ssDNA-binding sites close enough to simultaneously engage ssDNA. Inset shows only a stem loop is a substrate for XPF-ERCC1. c The XPF HLM from the DNA-bound structure contains a concave surface lines with basic residues (yellow) that could potentially bind to dsDNA. d Equivalent view to panel c of the helicase module from MDA5 bound to A-form dsRNA (PDB: 4GL2) through an equivalent positively charged concave surface. e Proposed model for XPF–ERCC1 domain rearrangements upon binding stem–loop DNA. Red arrows indicate direction of domain rearrangements from DNA-free to DNA-bound. Dotted yellow lines indicate domains that undergo significant conformational changes. Scissors indicate the approximate location of the active site.

The closest structural homologue of both DNA-bound and DNA-free structures, as identified by the DALI protein structural comparison server41, is the helicase/translocase MDA5 that binds dsRNA4042 (rmsd of 4.1 Å over 283 C-alphas) (Fig. 5d and Supplementary Fig. 7b, c). MDA5 binds to the major groove of A-form dsRNA using a concave surface lined with basic residues and sequences equivalent to the XPF RecA2 insert two spanning residues 441–550 (Fig. 5d). A similar positively charged concave surface is evident for XPF HLM. Additional density is apparent adjacent to the RecA2 ß-sheet and could represent part of the missing insert two (disordered in the DNA-free structure), and is analogous to a dsRNA-binding region of MDA5. In the absence of DNA, the concave surface of the auto-inhibited conformation of the XPF HLM is too narrow to accommodate dsDNA, however. Upon release of the autoinhibitory contact between the XPF helical and nuclease domains following substrate engagement the HLM opens up into a conformation more conducive to dsDNA major groove binding. Further experiments using substrates with longer dsDNA regions or with A-/B-form DNA duplexes are required in order to validate this proposed mode of binding (Supplementary Fig. 8c).

Superposition of an XPF (HhH)2 domain bound to ssDNA (PDB: 2KN7) with our DNA-free structure reveals that the distance between the ssDNA-binding sites on the XPF (HhH)2 domain and the ERCC1 NLD is too far (>50 Å) to be engaged simultaneously by the 20-thymine residue stem–loop (Fig. 5a). Movement of the 2×(HhH)2 domain in the presence of a stem–loop shortens this distance to approximately 30 Å (Fig. 5b) This is consistent with changes in (HhH)2 domain position and linkers observed in published structures for A. pernix XPF and Mus81–Eme1 in the presence and absence DNA. It is also supported by both our 3D variability analysis (Supplementary Movies 68) and by XLMS data. We therefore speculate that longer junction substrates may reveal even further dynamic rearrangements sufficient to place a junction at the nuclease active site (Fig. 5e).

Discussion

The structural and functional studies described in this report provide insights into XPF–ERCC1 architecture, regulation and activation. The XPF–ERCC1 endonuclease catalyses the first irreversible step in NER repair by nicking the 5′-edge of the repair bubble structure on the damaged strand. The structure of DNA-free XPF–ERCC1 reveals how the heterodimer is auto-inhibited by blocking both DNA binding and active site access through contacts with the XPF helical domain. This structure reveals inter-domain interfaces not previously described and rationalises our previous report that the HLM impacts on endonuclease activity and substrate interaction20. Whilst the ssDNA-binding surfaces of XPF (HhH)2 and ERCC1–NLD are fully solvent accessible in the auto-inhibited structure, they are uncoupled from their respective dsDNA-binding surfaces (ERCC1 (HhH)2 and XPF–HLM), which are sterically blocked. The structure also confirms the presence of a heterodimeric interface between the XPF nuclease and ERCC1 NLD as described for other family members25,27,28.

This study provides evidence linking conformational activation of XPF–ERCC1 through DNA-junction recognition, with a likely contribution from recruitment partner proteins at DNA-junction sites prepared for either NER or ICLR pathways. Mapping the XPA interaction site within ERCC130 and the SLX4 site within XPF helical domain reveals spatial separation of each recruitment partner site in the auto-inhibited state. It suggests the critical binding determinants are non-overlapping, but full structures of XPF–ERCC1 with SLX4 or XPA combined with competition binding studies are required to prove this. XPF–ERCC1 activation by SLX4 is disrupted by some FA mutations that map to the helical domain, in agreement with previous in vivo work19,34. Given its proposed regulatory role, the helical domain may be repositioned on binding SLX4 to stimulate activity35,43. In contrast, XP-associated mutations were found to generally reduce endonuclease activity in vitro towards an NER substrate by destabilising the complex whereas FA mutants exhibited activity similar to wild type. Interestingly, our XPF–ERCC1 preparations were found to contain a significant amount of active XPF–ERCC1 heterotetramer (Supplementary Fig. 1a, c, d). Cryo-EM data was collected for this sample, although it was not possible to obtain a reconstruction below 14 Å resolution due to intrinsic flexibility (Supplementary Fig. 1e, f). Despite this, future work will seek to address whether the XPF–ERCC1 heterodimer and heterotetramer play distinct roles in DNA repair pathways.

XPF–ERCC1 cryo-EM structures described here reveal how binding a DNA-junction substrate is able to disengage the XPF helical domain from the XPF CM and release the heterodimeric 2×(HhH)2 domain. A role for the linker regions in enabling this release is likely. The released 2×(HhH)2 domain is then able to engage a minor groove in a dsDNA duplex adjacent to the DNA ds/ss junction and packs against the XPF nuclease–ERCC1 NLD dimer, as observed for structures of Mus81–Eme1 and A. pernix XPF. The repositioning of the dimeric 2×(HhH)2 domain has three consequences. First, it destabilises the autoinhibition interface with the XPF helical domain. Second, it exposes the dsDNA-binding surface of ERCC1 (HhH)2. Third, it enables the proper coupling of the ERCC1 ssDNA and dsDNA-binding functions by shortening the linker regions and forming a compact conformation with ERCC1–NLD–(HhH)2 domain contacts. The structures described here do not reveal the full basis for DNA-junction recognition or the extent of conformational flexing required to place the scissile bond proximal to the XPF catalytic centre. We speculate that the similarities between XPF HLM and the MDA5 helicase point to a concave surface that could engage the major groove of a DNA duplex within a DNA junction to promote movement of the ds-ssDNA discontinuity into the XPF catalytic site. Evidently further high-resolution structures are required with longer DNA substrates and recruitment partner complexes in order to fully understand how the scissile phosphodiester bond is presented to the XPF catalytic site and the extent of the conformational alterations required.

Whilst this paper was in preparation, the structure of a ds/ssDNA-bound TFIIH–XPA (PDB code: 6RO4) was published representing a 5′-NER pre-incision complex that can recruit XPF–ERCC137. Superposition of the ERCC1 (HhH)2 domain–dsDNA complex onto the exposed DNA minor groove at the TFIIH–XPA–ds-ssDNA junction (Supplementary Fig. 8b) revealed a non-overlapping complementarity in DNA binding with XPA. ERCC1 engaged precisely the available DNA elements that were not engaged by XPA (Supplementary Fig. 8a). The resulting model predicts extensive interfaces between the XPF–ERCC1 and TFIIH–XPA–DNA with few steric clashes, many of which were within the flexible XPA loop region (residues 104–131). In this model, the dimeric 2×(HhH)2 domain lies adjacent to the TFIIH subunit XPB and DNA whilst the XPF nuclease–ERCC1 NLD dimer is positioned close to XPD, XPA and DNA. The highly basic and flexible RecA2 insert one (residues 345–377) is oriented to interact with either the extended XPA helix or dsDNA. Further structural studies are required to validate such a model.

Finally, there is a pressing need to explore chemical inhibition of XPF–ERCC1 to sensitise cancer cells to platinum-based therapeutics and reduce drug resistance mediated by XPF-ERCC1. Equally, XPF-ERCC1 inhibitors could target cancer cell vulnerabilities including XPF-FANCM synthetic lethality relevant to FANCM-deficient tumours44 and potentially other platinum-sensitive contexts45. The availability of an atomic structure for human XPF–ERCC1 described here will encourage efforts to develop new precision medicines as well as to overcome cancer chemoresistance46.

Methods

XPF–ERCC1 expression, purification and complex assembly

All reagents purchased from Sigma-Aldrich unless otherwise stated. A pFastBac Dual vector containing full length, wild type, human XPF (NCBI reference sequence: NM_005236.2) and ERCC1 (NCBI reference sequence: NM_001166049.2) cDNA was modified to include a C-terminal ERCC1 Twin-Strep-tag using restriction enzyme cloning. All primer sequences used in this study are shown in Supplementary Table 3. This plasmid was transformed into competent DH10BAC Escherichia coli cells (Thermo-Fisher) and recombinant bacmid DNA purified. Recombinant baculoviruses expressing XPF and ERCC1 were generated using standard protocols47 (Oxford Expression Technologies). In short, 1 × 106 SF21 cells (Thermo-Fisher) grown in SFIII media (Thermo-Fisher) and 10 μg/ml gentamycin (Life Technologies) were infected at a multiplicity of infection (MOI) of 2 and harvested after 72 h. Cell pellets were resuspended in extract buffer (20 mM HEPES pH 7.8, 150 mM NaCl, 1 mM tris(2-carboxyethyl)phosphine (TCEP), 10% glycerol, 2 mM MgCl2, 0.01% 3-((3-cholamidopropyl) dimethylammonio)-1-propanesulfonate (CHAPS), 0.25 tablet of EDTA-free protease-inhibitor cocktail per litre of culture, and 1 μl per 250 mL lysate BaseMuncher (Expedeon)) and lysed by sonication. The lysate was cleared of insoluble cell debris by centrifugation at 35,000g for 45 min and incubated with Strep-tactin resin (GE Healthcare) for 1 h at 4 °C. The resin was extensively washed with extract buffer minus protease inhibitors and BaseMuncher and incubated for 12 hours with Tobacco Etch Virus protease (supplier NEB). The eluate, containing XPF–ERCC1 was concentrated and loaded onto an anion-exchange column (HiTrap-Q, GE Healthcare) and XPF–ERCC1 containing fractions eluted using a gradient across 20 ml of extract buffer + 1 M NaCl before a final SEC step using a Superdex-200i column (GE Healthcare) in cryo buffer (20 mM HEPES pH 7.8, 150 mM NaCl, 1 mM TCEP, 0.01% CHAPS). Mutants were cloned using the Q5 site-directed mutagenesis kit (New-England Biotech) and were then expressed using the same protocol as described above for wild-type XPF–ERCC1.

XPF–ERCC1 DNA complex assembly

DNA with a modified phosphorothioate backbone (SLp DNA) was resuspended in DNA resuspension buffer (10 mM Tris, pH 7.8, 1 mM EDTA and 75 mM NaCl) and annealed to form a stem–loop structure. Purified XPF–ERCC1 was buffer exchanged into XPF–ERCC1 DNA cryo buffer (20 mM HEPES pH 7.8, 150 mM NaCl, 1 mM TCEP, 0.01% CHAPS, 5 mM CaCl2, 0.5 mM EDTA) and then incubated with SLp DNA at a 1:2 protein:DNA molar ratio for 10 min at 4 °C followed by cross-linking with 0.05% (v/v) glutaraldehyde for 10 min at 4 °C. The cross-linking reaction was quenched by the addition of 1 mM Tris-HCl, pH 7.8 and the complex further purified via SEC using a Superdex 200i column.

Stem–loop sequence: CAGCG*C*T*U*G*G*TTTTTTTTTTTTTTTTTTTT*C*C*A*A*G*CGCTG, where the asterisk * represents a phosphorothioate backbone.

XPF–ERCC1 cryo-EM grid preparation and data collection

For cryo-EM analysis, 4 μl of the purified XPF–ERCC1 heterodimer at 1.5 mg/ml was applied to both R1.2/1.3 400 mesh UltraFoil® and QuantiFoil® grids that had been previously glow discharged for 45 s at 42 mA. The grids were blotted for 4 s at 100% humidity and 4 °C and plunged into liquid ethane cooled by liquid nitrogen using a FEI Vitrobot MK IV. The grids were then loaded onto a Titan Krios transmission electron microscope operated at 300 kV (Thermo-Fisher). Images were collected in counting mode using a Gatan K2 Summit direct electron detector camera mounted behind a GIF Quantum energy filter operating in zero-loss mode. Exposures were 15 s, with a total dose of 63 e2 dose-fractionated into 40 frames with a calibrated pixel size of 1.38 Å. Images were recorded with a defocus of 1.5 µm to 4 µm. A total of 15,315 micrographs were collected from three separate data collection sessions.

XPF–ERCC1 cryo-EM image processing

Movie frames were corrected for motion using MotionCor248, and contrast transfer function was estimated using CTFfind4.149 within Scipion1.250. The total number of movies used for processing was 14,453. Two-hundred micrographs were selected from the first collection from which 82,412 particles were picked using Xmipp351 semi-automated picking and extracted using RELION-352. The particles were sorted using Xmipp351 screen particles followed by three rounds of reference-free 2D classification in CryoSPARC-253. A subset of six 2D classes were selected that represented different views of the molecule and used as templates for reference-based particle picking using Gautomatch54 on the full dataset. This approach yielded 396,106, 1,201,881 and 2,391,900 particles for data collection runs one, two and three, respectively. The particles were extracted and binned twofold using RELION-352, sorted using Xmipp351 to screen particles and then submitted for three rounds of reference-free 2D classification in CryoSPARC-253. This reduced the particle numbers to 151,412, 390,007 and 1,074,111 particles for data collection runs one, two and three, respectively. Four initial models were generated using the ab initio reconstruction programme in CryoSPARC-253 and were used as references for 3D classification using heterogeneous refinement in CryoSPARC-253. Multiple rounds of heterogeneous refinement yielded 44,312, 126,492 and 390,712 particles in well-defined classes for data collection runs one, two and three respectively. All 561,516 particles from the three collections were re-extracted in an un-binned 200 ×200 pixel box using RELION-352 and csparc2star and then merged. The data then underwent 3D classification without alignment in RELION-352 to identify the most stable, high-resolution class. The two classes that displayed the highest-resolution features, comprising 405,339 particles, were refined to 4.1 Å resolution in CryoSPARC-253 using non-uniform refinement. Per-particle motion correction was carried out using Bayesian polishing in RELION-352. The shiny, polished particles were then refined to 4.0 Å resolution in CryoSPARC-253 using non-uniform refinement.

Inspection of the 4.0 Å resolution map rendered by local resolution in Chimera55 identified the dimeric XPF–ERCC1 2×(HhH)2 domain as the lowest resolution region of the map, suggesting some degree of mobility. A mask which excluded the low-resolution XPF–ERCC1 2×(HhH)2 hairpins was generated in Chimera55 and using the particle subtraction tool in CryoSPARC-253 the portion of the particle images aligning to the hairpin density in the map was removed. Non-uniform local refinement in CryoSPARC-253 was performed on the subtracted particles, re-aligning them to the masked reference volume, leading to a reconstruction at 3.6 Å resolution which excluded the hairpin portion of the 4.0 Å map.

All resolutions reported here were determined by Fourier shell correlation (at FSC = 0.143) based on the “gold-standard” protocol using a soft mask around the complex density56. To avoid over-masking, the masked maps were visually inspected to exclude the possibility of clipping. In addition, the occurrence of over-masking was monitored by inspecting the shapes of FSC curves. The two-half maps had their phases randomised beyond the resolution at which the no-mask FSC drops below the FSC = 0.143 criterion. The tight mask is applied to both half maps, and an FSC is calculated. This FSC is used along with the original FSC before phase randomisation to compute the corrected FSC. Local resolution was calculated using Blocres within CryoSPARC-253. For visualisation, maps were sharpened by applying an automated local resolution weighted negative B factor using the local filtering function of CryoSPARC-253.

XPF–ERCC1 model building

Initially the crystal structures of the ERCC1 NLD (PDB code: 2A1I) and the tandem helix–hairpin–helix domains comprising XPF and ERCC1 chains (PDB code: 2A1J) were rigid body fitted into the locally filtered and sharpened map obtained at 4.0 Å resolution. Homology models were generated for the XPF RecA1 domain and rigid body fit into the map using the same procedure. Subsequently, the fitted domains were rebuilt manually using COOT57 optimising the fit where sidechain densities were evident prior to using FlexEM38 and real-space refinement as implemented in PHENIX58 whilst imposing secondary structural and geometric restraints to prevent overfitting (Table 4). The RecA2 and helical domains were built de novo and subjected to PHENIX58 real-space refinement. A further 6 cycles of rebuilding and refinement in COOT57 and PHENIX58 lead to a model containing 743 residues from XPF and 195 from ERCC1. Linkers regions connecting the XPF nuclease and ERCC1 NLD domains to their respective (HhH)2 domains were built manually into the map and the N-terminal portion of the XPF nuclease domain homology model was rebuilt in COOT57 to fit the map. The final atomic model was evaluated using MolProbity59 (Table 4). The location of patient mutations and sidechains referred to in the text are mapped onto the primary sequence, together with sequence conservation within XPF and ERCC1 homologues respectively (Supplementary Figs. 10 and 11).

Table 4.

Cryo-EM statistics for XPF–ERCC1 structures and associated maps.

Data Collection and Processing XPF–ERCC1 XPF–ERCC1DNA
EMDB ID EMD-10337 EMD-10338
PDB ID 6SXA 6SXB
Magnification 36,232× 36,232×
Voltage (kV) 300 300
Electron exposure (e/Å2) 63 63
Defocus range (μm) 1–4 1–3
Pixel size (Å) 1.38 1.38
Symmetry imposed C1 C1
Initial particle images 3,989,887 3,432,565
Final particle images 405,339 199,022
Map resolution (Å) 3.6 (local refinement) 7.9 (globally refined)
4.0 (globally refined) 7.1 (local refinement)
Map resolution range (Å) 3.4–8 5.5–14
Refinement
Initial model Cryo-SPARC ab initio Cryo-SPARC ab initio
Model resolution (Å) 3.6–4 5.9–7.7
FSC threshold 0.5 0.5
Map sharpening B factor (Å2) −168 −560
Model composition
Non-hydrogen atoms 7218 7345
Protein residues 945 892
B factor (Å2) 163 530
R.m.s. deviations
Bond lengths (Å) 0.004 0.23
Bond angles (°) 0.682 0.37
Validation
MolProbity score 1.88 2.9
Clashscore 7.01 4
Poor rotamers (%) 0 0
Ramachandran plot
Favoured (%) 91.7 99
Allowed (%) 8.4 1
Disallowed (%) 0 0

XPF–ERCC1–DNA complex cryo-EM grids and data collection

XPF–ERCC1–DNA complex was concentrated to 1.3 mg/ml and applied to Quantifoil R1.2/1.3 300 mesh copper grids. The freezing and imaging conditions used were the same as for the DNA-free XPF–ERCC1 complex described above. A total of 8965 movies were collected from a single data collection using the same electron microscope and detector as described above.

XPF–ERCC1–DNA complex cryo-EM image processing

Motion correction and CTF estimation was performed as previously described for the XPF–ERCC1 data collections. Totally, 7982 micrographs were manually selected for processing. Particle picking was carried out as described for the XPF–ERCC1 data collections. 3,432,565 particles were extracted and sorted using Xmipp351 screen particles and then submitted for six rounds of reference-free 2D classification in CryoSPARC-253. A total of 688,821 particles were used to generate 4 ab initio reconstructions which were then used as references for 3D classification using heterogeneous refinement in CryoSPARC-253. Multiple rounds of heterogeneous refinement were carried out yielding one well-ordered reconstruction comprising 199,022 particle images (Table 4). This class was refined to 7.7 Å resolution using non-uniform refinement in CryoSPARC-253. A mask was generated using UCSF Chimera55 that excluded both the DNA and hairpin domain density which was used to carry out masked refinement improving the resolution of the sub-volume to 5.9 Å (Table 4).

XPF–ERCC1–DNA complex model building

Individual domains of XPF–ERCC1 were taken from the DNA-free structure and fitted into the DNA-bound cryo-EM map density as rigid bodies using the UCSF Chimera55 fit-in-map tool. The homodimeric A. pernix XPF (PDB:2BGW) bound to dsDNA through its (HhH)2 hairpins was fitted into the DNA-bound map density and the subsequent position of the DNA-bound A. pernix hairpins used as a reference to align the human hairpin domain using MatchMaker in UCSF Chimera55. The DNA from the A. pernix structure was reduced to a 10 base-pair duplex and modelled into the map whilst preserving the hairpin domain–DNA contacts. The sequence conservation of the functional human ERCC1 and A. pernix (HhH)2 domains is high: 25.5% identical and 69.1% similar residues. The ds-RNA bound structure of MDA5 (PDB: 4GL2) was placed into the DNA-bound map density as a guide to place the helical domain of XPF by inspecting the position of the homologous domain in MDA5.

XPF–ERCC1–DNA–TFIIH–XPA complex modelling

The XPF–ERCC1–DNA structure was aligned to the TFIIH–DNA–XPA structure (PDB code: 6RO4) through structural super-imposition in UCSF Chimera55 and alignment with the two DNA strands of a single duplex from each structure. The ds/ss DNA junction was defined by the high-resolution DNA structure in the TFIIH–XPA complex and demarcated by the position of the XPA β-hairpin.

XPF–ERCC1 cross-linking mass spectrometry

All chemicals were purchased from Sigma-Aldrich unless otherwise stated. A total of 100 µg XPF–ERCC1 heterodimer at a concentration of 1 mg/ml in 20 mM HEPES, pH 7.8, 10% Glycerol, 0.01% CHAPS, 150 mM NaCl, 1 mM TCEP, 0.5 mM EDTA was cross-linked using 1 mM disuccinimidyl sulfoxide (DSSO) (Thermo-Fisher) with mild shaking for 30 min at 37 °C. The reaction was quenched using a final concentration of 50 mM ammonium bicarbonate for a further 20 min at 37 °C. To remove potential aggregates, gradient ultracentrifugation was employed using a 5–30% glycerol gradient in 20 mM Hepes, 150 mM NaCl, mixed using a Gradient Master (BioComp), and centrifuged for 16 h at 4 °C at 200,000×g using a SW 55 Ti Rotor (Beckman Coulter)60. Totally, 100 µL fractions were collected and silver stained to identify fractions containing cross-linked non-aggregated XPF–ERCC1. Fractions containing cross-linked proteins were then pooled and buffer exchanged into 8 M urea using a Vivaspin 500, 30,000 molecular weight cut off (MWCO) PES filter (Sartorius, VS0122). Cysteine reduction was carried out using 2.5 mM TCEP for 30 min at 37 °C and alkylated in the dark using 5 mM iodoacetamide at room temperature. The urea was then buffer exchanged for 50 mM ammonium bicarbonate and proteins were proteolysed using trypsin (Promega) at 1:50 w/w trypsin:protein overnight at 37 °C. The solution was acidified using 2% formic acid and peptides were the spun through the MWCO filter and desalted using in-house built STAGE tips made using Empore SPE C18 discs (3 M, 66883-U). The eluent was then dried to completion. Peptides were reconstituted in 0.1% trifluoroacetic acid (TFA) and chromatographically resolved using an Ultimate 3000 RSLCnano (Dionex) HPLC. Peptides were first loaded onto an Acclaim PepMap 100 C18, 3 µm particle size, 100 Å pore size, 20 mm × 75 µm ID (Thermo Scientific, 164535) trap column using a loading buffer (2% acetonitrile (MeCN) and 0.05% TFA in 97.05 % H2O) with a flow rate of 7 µL/min. Chromatographic separation was achieved using an EASY-Spray column, PepMap C18, 2 µm particles, 100 Å pore size, 500 mm × 75 µm ID (Thermo Scientific, ES803). The gradient utilised a flow of 0.3 µl/min, starting at 98% mobile A (0.1% formic acid, 5% dimethyl sulfoxide (DMSO) in H2O) and 2% mobile B (0.1% formic acid, 75% MeCN, 5% DMSO and 19.9% H2O). After 6 min, mobile B was increased to 30% over 69 min, to 45% over 30 min, further increased to 90% in 16 min and held for 4 min. Finally, Mobile B was reduced back to 5% over 1 min for the rest of the acquisition. Data were acquired in real time over 140 min using an Orbitrap Fusion Lumos Tribrid mass spectrometer in positive, top speed mode with a cycle time of 5 s. The chromatogram (MS1) was captured using 60,000 resolution, a scan range of 375–1500 with a 50 ms maximum injection time, and 4e5 AGC target. Dynamic exclusion with repeat count 2, exclusion duration of 30 s, 20 ppm tolerance window was used, along with isotope exclusion, a minimum intensity exclusion of 2e4, charge state inclusion of 3–8 ions and peptide mono isotopic precursor selection. Precursors within a 1.6 m/z isolation window were then fragmented using 25% normalised CID, 100 ms maximum injection time and 5e4 AGC target. Scans were recorded using 30,000 resolution in centroid mode, with a scan range of 120–2000 m/z. Spectra containing peaks with a mass difference of 31.9721 Da were further fragmented with a 30% normalised higher collision induced dissociation, using a 2 m/z isolation window, 150 ms maximum injection time and 2e4 AGC target. Four scans were recorded using an ion trap detection in rapid mode starting at 120 m/z.

XL-MS data analysis

Data processing were carried out using Proteome Discoverer Version 2.4 (Thermo Scientific) with the XlinkX61 node where the minimum XlinkX score was set to 63. The acquisition strategy was set to MS2_MS3 mode. The database comprised solely of the specific XPF and ERCC1 sequences. Trypsin was selected as the proteolytic enzyme allowing up to two missed cleavages with a minimal peptide length of five residues. Masses considered were in the range of 300–10000 Da. The precursor mass tolerance, FTMS fragment mass tolerance, and ITMS Fragment Mass Tolerance were set to 10 ppm, 20 ppm and 0.6 Da, respectively. A static carbamidomethyl (+57.021 Da) modification was utilised for cysteine residues, with additional dynamic modifications considered including; amidated and hydrolysed DSSO (+142.050 and +176.014 Da, respectively) on lysine serine and threonine residues, oxidation (+15.995 Da) on methionine residues, and protein N-terminal acetylation (+42.011 Da). The FDR threshold was set to one with the strategy set to simple. The list of reported cross-linked spectral matches were manually examined and cross-links with spectra that did not contain acceptable b and y ion coverage were excluded. We note that this method requires accessible lysine sidechains therefore predominantly hydrophobic interfaces, such as the RecA1–nuclease, did not return any cross-links62. A number of cross-links were observed that exceed the permitted the Cα–Cα cut-off distance of 30 Å.

XPF–ERCC1–SLX4NTD complex assembly

cDNA encoding the SLX4NTD (residues 1–758) (NCBI reference sequence: NM_032444) was shuttled into a pGEX-1 vector (Sigma). Recombinant baculoviruses expressing the SLX4NTD were generated as previously described and used to infect 1 × 106 SF21 cells (Thermo-Fisher) grown in SFIII media (Thermo-Fisher) and 10 μg/ml gentamycin (Life Technologies) at an MOI of 0.5. These cells were co-infected with XPF–ERCC1 expressing baculovirus at an MOI of 2. Cells were pelleted after 72 h and protein extracted as previously described for XPF–ERCC1. Following Strep-tactin affinity purification, the complex was purified using anion-exchange (HiTrap-Q, GE Healthcare) using a gradient of 150 mM NaCl to 500 mM NaCl over 20 ml of extract buffer minus protease inhibitors and BaseMuncher. This separated the SLX4NTD–XPF–ERCC1 complex from unbound XPF–ERCC1. Fractions containing the SLX4NTD–XPF–ERCC1 complex were pooled and concentrated prior to a final SEC step using a Superose-6 increase column equilibrated in extract buffer minus protease inhibitors and BaseMuncher (GE Healthcare). Fractions containing both XPF and SLX4NTD were identified via Western blot.

Real-time fluorescence incision assay

Fluorescently labelled stem–loop (SLF) DNA substrates, containing a 5′ 6-FAM fluorophore and 3′-BHQ1 quench, were purified by SEC (Superdex-200i, GE Healthcare) in assay buffer (5 mM HEPES, 10% glycerol, 0.5 mM DTT, 1 mM MnCl2 and 40 mM NaCl. The purified substrates were then annealed by heating to 95 °C for 1 min followed by cooling to 4 °C and dispensed into the assay plate. Reactions were carried out in 384-well black, flat-bottomed microtitre plates (Corning 3854). Purified XPF–ERCC1 was buffer exchanged into assay buffer and 5 nM added to each in a total volume of 20 µl to initiate the endonuclease reaction. Fluorescence measurements were carried out using the CLARIOstar plate reader (BMG Labtech) using an excitation wavelength of 483 nm and an emission wavelength of 525 nm. Sixty readings were collected at 30-s intervals and the linear response range for each substrate was used to determine the change in fluorescence per unit time. Kinetic parameters were calculated using the Michaelis–Menten equation. Experimental product release was quantified by plotting the relative fluorescence units produced by known amounts of the cleavage products against their concentration to generate a standard curve.

SLF sequence: 6-FAM-5′-CAGCGCTUGGTTTTTTTTTTTTTTTTTTTTCCAAGCGCTG-3′-BHQ1.

Cleavage product #1: 6-FAM-5′-CAGCGCTC 3′.

Cleavage product #2: 5′-GGTTTTTTTTTTTTTTTTTTTTCCGAGCGCTG-3′-BHQ1.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

41467_2020_14856_MOESM2_ESM.pdf (97.1KB, pdf)

Description of Additional Supplementary Files

Supplementary Movie 1 (14.5MB, mp4)
Supplementary Movie 2 (227.8MB, mp4)
Supplementary Movie 3 (68.7MB, mp4)
Supplementary Movie 4 (4.5MB, mp4)
Supplementary Movie 5 (73.1MB, mp4)
Supplementary Movie 6 (3.1MB, mp4)
Supplementary Movie 7 (2.3MB, mp4)
Supplementary Movie 8 (3.5MB, mp4)
Supplementary Movie 9 (3.9MB, mp4)
Supplementary Movie 10 (3.9MB, mp4)
Reporting Summary (203.6KB, pdf)

Acknowledgements

We thank members of the McDonald laboratory for helpful discussions and comments on the paper, in particular Amy Whittaker who also assisted in running the activity assays. We thank Andrew Purkiss of the structural biology science technology platform for assistance in refining the structures and Raffaella Carzaniga for EM training and support. We also thank Steve West for an anti-SLX4 antibody and Rick D. Wood (MD Anderson Centre, Texas) for stimulating and encouraging the project in its early phases. We acknowledge the helpful advice and support of Peter Rosenthal on all aspects of cryo-electron microscopy. M.J. was funded by a Crick/UCL joint PhD studentship. N.Q.M. acknowledges that this work was supported by the Francis Crick Institute, which receives its core funding from Cancer Research UK (FC001115), the UK Medical Research Council (FC001115) and the Wellcome Trust (FC001115). M.L. is supported by the National Institute for Health Research University College London Hospitals Biomedical Research Centre. E.P.M. and F.B. acknowledge support from Cancer Research UK (C12209/A16749).

Source Data

Source Data (270.9KB, pdf)

Author contributions

M.J. cloned and purified the XPF–ERCC1 complexes, processed all cryo-EM data and built the atomic models. M.J. and F.B. carried out the EM grid preparations. F.B., M.J. and C.E. screened the EM grids, A.N. ran the data collections. F.B. and E.P.M. assisted the XPF–ERCC1 single particle reconstruction and interpretation. M.B. assisted with XPF–ERCC1 biochemistry, D.B. assisted in model refinement and validation. A.B. and A.S. carried out the XLMS and data analysis. M.L. contributed to the study design. M.J. and N.Q.M. designed the study, interpreted the results and wrote the paper.

Data availability

The coordinates for the DNA-free and DNA-bound XPF–ERCC1 complex are available in the PDB with codes 6SXA [https://www.ebi.ac.uk/pdbe/entry/pdb/6sxb] and 6SXB [https://www.ebi.ac.uk/pdbe/entry/pdb/6sxb] and the cryo-EM maps are available in EMDB with codes EMD-10337 [https://www.ebi.ac.uk/pdbe/entry/emdb/EMD-10337] and EMD-10338 [https://www.ebi.ac.uk/pdbe/entry/emdb/EMD-10337]. The source data underlying Figs. 1b, 5b Supplementary Figs. 1b, 9d, e are provided as a Source Data file. Other data that support the findings of this study are available from the corresponding author upon request.

Competing interests

The authors declare no competing interests.

Footnotes

Peer review information Nature Communications thanks Gang Cai and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information is available for this paper at 10.1038/s41467-020-14856-2.

References

  • 1.Dehe PM, Gaillard PH. Control of structure-specific endonucleases to maintain genome stability. Nat. Rev. Mol. Cell Biol. 2017;18:315–330. doi: 10.1038/nrm.2016.177. [DOI] [PubMed] [Google Scholar]
  • 2.Ciccia A, McDonald N, West SC. Structural and functional relationships of the XPF/MUS81 family of proteins. Annu Rev. Biochem. 2008;77:259–287. doi: 10.1146/annurev.biochem.77.070306.102408. [DOI] [PubMed] [Google Scholar]
  • 3.Faridounnia M, Folkers GE, Boelens R. Function and interactions of ERCC1-XPF in DNA damage response. Molecules. 2018;23:E3205. doi: 10.3390/molecules23123205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Marteijn JA, Lans H, Vermeulen W, Hoeijmakers JHJ. Understanding nucleotide excision repair and its roles in cancer and ageing. Nat. Rev. Mol. Cell Biol. 2014;15:465. doi: 10.1038/nrm3822. [DOI] [PubMed] [Google Scholar]
  • 5.Klein Douwel D, Hoogenboom WS, Boonen RACM, Knipscheer P. Recruitment and positioning determine the specific role of the XPF‐ERCC1 endonuclease in interstrand crosslink repair. EMBO J. 2017;36:2034–2046. doi: 10.15252/embj.201695223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wyatt HDM, Laister RC, Martin SR, Arrowsmith CH, West SC. The SMX DNA repair tri-nuclease. Mol. Cell. 2017;65:848–860.e11. doi: 10.1016/j.molcel.2017.01.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wu Y, Mitchell TR, Zhu XD. Human XPF controls TRF2 and telomere length maintenance through distinctive mechanisms. Mech. Ageing Dev. 2008;129:602–610. doi: 10.1016/j.mad.2008.08.004. [DOI] [PubMed] [Google Scholar]
  • 8.Woodrick J, et al. A new sub‐pathway of long‐patch base excision repair involving 5′ gap formation. EMBO J. 2017;36:1605–1622. doi: 10.15252/embj.201694920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Al-Minawi AZ, Saleh-Gohari N, Helleday T. The ERCC1/XPF endonuclease is required for efficient single-strand annealing and gene conversion in mammalian cells. Nucleic Acids Res. 2008;36:1–9. doi: 10.1093/nar/gkm888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ahmad A, et al. ERCC1-XPF endonuclease facilitates DNA double-strand break repair. Mol. Cell Biol. 2008;28:5082–5092. doi: 10.1128/MCB.00293-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bogliolo M, et al. Mutations in ERCC4, encoding the DNA-repair endonuclease XPF, cause Fanconi anemia. Am. J. Hum. Genet. 2013;92:800–806. doi: 10.1016/j.ajhg.2013.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Jaspers NGJ, et al. First reported patient with human ERCC1 deficiency has cerebro-oculo-facio-skeletal syndrome with a mild defect in nucleotide excision repair and severe developmental failure. Am. J. Hum. Genet. 2007;80:457–466. doi: 10.1086/512486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kashiyama K, et al. Malfunction of nuclease ERCC1-XPF results in diverse clinical manifestations and causes Cockayne syndrome, xeroderma pigmentosum, and Fanconi anemia. Am. J. Hum. Genet. 2013;92:807–819. doi: 10.1016/j.ajhg.2013.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Niedernhofer LJ, et al. A new progeroid syndrome reveals that genotoxic stress suppresses the somatotroph axis. Nature. 2006;444:1038. doi: 10.1038/nature05456. [DOI] [PubMed] [Google Scholar]
  • 15.Sijbers AM, et al. Xeroderma pigmentosum group F caused by a defect in a structure-specific DNA repair endonuclease. Cell. 1996;86:811–822. doi: 10.1016/S0092-8674(00)80155-5. [DOI] [PubMed] [Google Scholar]
  • 16.Fairman-Williams ME, Guenther UP, Jankowsky E. SF1 and SF2 helicases: family matters. Curr. Opin. Struct. Biol. 2010;20:313–324. doi: 10.1016/j.sbi.2010.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sgouros J, Gaillard PH, Wood RD. A relationship betweena DNA-repair/recombination nuclease family and archaeal helicases. Trends Biochem. Sci. 1999;24:95–97. doi: 10.1016/S0968-0004(99)01355-9. [DOI] [PubMed] [Google Scholar]
  • 18.Gaillard PH, Wood RD. Activity of individual ERCC1 and XPF subunits in DNA nucleotide excision repair. Nucleic Acids Res. 2001;29:872–879. doi: 10.1093/nar/29.4.872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Klein Douwel, D., Hoogenboom, W. S., Boonen, R. A. & Knipscheer, P. Recruitment and positioning determine the specific role of the XPF-ERCC1 endonuclease in interstrand crosslink repair. Embo J. 37, 2034–2046 (2017). [DOI] [PMC free article] [PubMed]
  • 20.Bowles M, et al. Fluorescence-based incision assay for human XPF-ERCC1 activity identifies important elements of DNA junction recognition. Nucleic Acids Res. 2012;40:e101. doi: 10.1093/nar/gks284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Enzlin JH, Scharer OD. The active site of the DNA repair endonuclease XPF-ERCC1 forms a highly conserved nuclease motif. EMBO J. 2002;21:2045–2053. doi: 10.1093/emboj/21.8.2045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Orelli B, et al. The XPA-binding domain of ERCC1 is required for nucleotide excision repair but not other DNA repair pathways. J. Biol. Chem. 2010;285:3705–3712. doi: 10.1074/jbc.M109.067538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Yan L, Wu H, Li X, Gao N, Chen Z. Structures of the ISWI-nucleosome complex reveal a conserved mechanism of chromatin remodeling. Nat. Struct. Mol. Biol. 2019;26:258–266. doi: 10.1038/s41594-019-0199-9. [DOI] [PubMed] [Google Scholar]
  • 24.Eustermann S, et al. Structural basis for ATP-dependent chromatin remodelling by the INO80 complex. Nature. 2018;556:386–390. doi: 10.1038/s41586-018-0029-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Newman M, et al. Structure of an XPF endonuclease with and without DNA suggests a model for substrate recognition. EMBO J. 2005;24:895–905. doi: 10.1038/sj.emboj.7600581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tsodikov OV, Enzlin JH, Scharer OD, Ellenberger T. Crystal structure and DNA binding functions of ERCC1, a subunit of the DNA structure-specific endonuclease XPF-ERCC1. Proc. Natl Acad. Sci. USA. 2005;102:11236–11241. doi: 10.1073/pnas.0504341102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Gwon GH, et al. Crystal structures of the structure-selective nuclease Mus81-Eme1 bound to flap DNA substrates. EMBO J. 2014;33:1061–1072. doi: 10.1002/embj.201487820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Coulthard R, et al. Architecture and DNA recognition elements of the Fanconi anemia FANCM-FAAP24 complex. Structure. 2013;21:1648–1658. doi: 10.1016/j.str.2013.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Tripsianes K, et al. The structure of the human ERCC1/XPF interaction domains reveals a complementary role for the two proteins in nucleotide excision repair. Structure. 2005;13:1849–1858. doi: 10.1016/j.str.2005.08.014. [DOI] [PubMed] [Google Scholar]
  • 30.Tsodikov OV, et al. Structural basis for the recruitment of ERCC1-XPF to nucleotide excision repair complexes by XPA. EMBO J. 2007;26:4768–4776. doi: 10.1038/sj.emboj.7601894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Marín M, et al. Functional comparison of XPF missense mutations associated to multiple DNA repair disorders. Genes. 2019;10:E60. doi: 10.3390/genes10010060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Matsumura Y, Nishigori C, Yagi T, Imamura S, Takebe H. Characterization of molecular defects in xeroderma pigmentosum group F in relation to its clinically mild symptoms. Hum. Mol. Genet. 1998;7:969–974. doi: 10.1093/hmg/7.6.969. [DOI] [PubMed] [Google Scholar]
  • 33.Sijbers AM, et al. Homozygous R788W point mutation in the XPF gene of a patient with xeroderma pigmentosum and late-onset neurologic disease. J. Invest. Dermatol. 1998;110:832–836. doi: 10.1046/j.1523-1747.1998.00171.x. [DOI] [PubMed] [Google Scholar]
  • 34.Klein Douwel D, et al. XPF-ERCC1 acts in Unhooking DNA interstrand crosslinks in cooperation with FANCD2 and FANCP/SLX4. Mol. Cell. 2014;54:460–471. doi: 10.1016/j.molcel.2014.03.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hodskinson MR, et al. Mouse SLX4 is a tumor suppressor that stimulates the activity of the nuclease XPF-ERCC1 in DNA crosslink repair. Mol. Cell. 2014;54:472–484. doi: 10.1016/j.molcel.2014.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Hoogenboom WS, Boonen R, Knipscheer P. The role of SLX4 and its associated nucleases in DNA interstrand crosslink repair. Nucleic Acids Res. 2019;47:2377–2388. doi: 10.1093/nar/gky1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kokic G, et al. Structural basis of TFIIH activation for nucleotide excision repair. Nat. Commun. 2019;10:2885. doi: 10.1038/s41467-019-10745-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Topf M, et al. Protein structure fitting and refinement guided by cryo-EM density. Structure. 2008;16:295–307. doi: 10.1016/j.str.2007.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Das D, et al. Single-stranded DNA binding by the helix-hairpin-helix domain of XPF protein contributes to the substrate specificity of the ERCC1-XPF protein complex. J. Biol. Chem. 2017;292:2842–2853. doi: 10.1074/jbc.M116.747857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wu B, et al. Structural basis for dsRNA recognition, filament formation, and antiviral signal activation by MDA5. Cell. 2013;152:276–289. doi: 10.1016/j.cell.2012.11.048. [DOI] [PubMed] [Google Scholar]
  • 41.Holm Liisa. Benchmarking fold detection by DaliLite v.5. Bioinformatics. 2019;35(24):5326–5327. doi: 10.1093/bioinformatics/btz536. [DOI] [PubMed] [Google Scholar]
  • 42.Yu Q, Qu K, Modis Y. Cryo-EM structures of MDA5-dsRNA filaments at different stages of ATP hydrolysis. Mol. Cell. 2018;72:999–1012.e6. doi: 10.1016/j.molcel.2018.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Abdullah Ummi B, McGouran Joanna F, Brolih Sanja, Ptchelkine Denis, El‐Sagheer Afaf H, Brown Tom, McHugh Peter J. RPA activates the XPF ‐ ERCC 1 endonuclease to initiate processing of DNA interstrand crosslinks. The EMBO Journal. 2017;36(14):2047–2060. doi: 10.15252/embj.201796664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Li S, et al. ERCC1/XPF is important for repair of DNA double-strand breaks containing secondary structures. iScience. 2019;16:63–78. doi: 10.1016/j.isci.2019.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Mesquita KA, et al. ERCC1-XPF deficiency is a predictor of olaparib induced synthetic lethality and platinum sensitivity in epithelial ovarian cancers. Gynecol. Oncol. 2019;153:416–424. doi: 10.1016/j.ygyno.2019.02.014. [DOI] [PubMed] [Google Scholar]
  • 46.McNeil EM, Melton DW. DNA repair endonuclease ERCC1-XPF as a novel therapeutic target to overcome chemoresistance in cancer therapy. Nucleic Acids Res. 2012;40:9990–10004. doi: 10.1093/nar/gks818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kost TA, Condreay JP. Recombinant baculoviruses as expression vectors for insect and mammalian cells. Curr. Opin. Biotechnol. 1999;10:428–433. doi: 10.1016/S0958-1669(99)00005-1. [DOI] [PubMed] [Google Scholar]
  • 48.Zheng SQ, et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods. 2017;14:331–332. doi: 10.1038/nmeth.4193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Rohou A, Grigorieff N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 2015;192:216–221. doi: 10.1016/j.jsb.2015.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.de la Rosa-Trevin JM, et al. Scipion: a software framework toward integration, reproducibility and validation in 3D electron microscopy. J. Struct. Biol. 2016;195:93–99. doi: 10.1016/j.jsb.2016.04.010. [DOI] [PubMed] [Google Scholar]
  • 51.de la Rosa-Trevin JM, et al. Xmipp 3.0: an improved software suite for image processing in electron microscopy. J. Struct. Biol. 2013;184:321–328. doi: 10.1016/j.jsb.2013.09.015. [DOI] [PubMed] [Google Scholar]
  • 52.Zivanov J, et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife. 2018;7:e42166. doi: 10.7554/eLife.42166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods. 2017;14:290–296. doi: 10.1038/nmeth.4169. [DOI] [PubMed] [Google Scholar]
  • 54.Zhang, K. MRC, LMB. www.mrc-lmb.cam.ac.uk/kzhang/.
  • 55.Pettersen EF, et al. UCSF chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  • 56.Rosenthal PB, Henderson R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. J. Mol. Biol. 2003;333:721–745. doi: 10.1016/j.jmb.2003.07.013. [DOI] [PubMed] [Google Scholar]
  • 57.Brown A, et al. Tools for macromolecular model building and refinement into electron cryo-microscopy reconstructions. Acta Crystallogr D Biol. Crystallogr. 2015;71:136–153. doi: 10.1107/S1399004714021683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Afonine PV, et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D Struct. Biol. 2018;74:531–544. doi: 10.1107/S2059798318006551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Williams CJ, et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 2018;27:293–315. doi: 10.1002/pro.3330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kao A, et al. Development of a novel cross-linking strategy for fast and accurate identification of cross-linked peptides of protein complexes. Mol. Cell Proteom. 2011;10:M110.002212. doi: 10.1074/mcp.M110.002212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.de Graaf SC, Klykov O, van den Toorn H, Scheltema RA. Cross-ID: analysis and visualization of complex XL-MS-driven protein interaction networks. J. Proteome Res. 2019;18:642–651. doi: 10.1021/acs.jproteome.8b00725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.O’Reilly FJ, Rappsilber J. Cross-linking mass spectrometry: methods and applications in structural, molecular and systems biology. Nat. Struct. Mol. Biol. 2018;25:1000–1008. doi: 10.1038/s41594-018-0147-0. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

41467_2020_14856_MOESM2_ESM.pdf (97.1KB, pdf)

Description of Additional Supplementary Files

Supplementary Movie 1 (14.5MB, mp4)
Supplementary Movie 2 (227.8MB, mp4)
Supplementary Movie 3 (68.7MB, mp4)
Supplementary Movie 4 (4.5MB, mp4)
Supplementary Movie 5 (73.1MB, mp4)
Supplementary Movie 6 (3.1MB, mp4)
Supplementary Movie 7 (2.3MB, mp4)
Supplementary Movie 8 (3.5MB, mp4)
Supplementary Movie 9 (3.9MB, mp4)
Supplementary Movie 10 (3.9MB, mp4)
Reporting Summary (203.6KB, pdf)

Data Availability Statement

The coordinates for the DNA-free and DNA-bound XPF–ERCC1 complex are available in the PDB with codes 6SXA [https://www.ebi.ac.uk/pdbe/entry/pdb/6sxb] and 6SXB [https://www.ebi.ac.uk/pdbe/entry/pdb/6sxb] and the cryo-EM maps are available in EMDB with codes EMD-10337 [https://www.ebi.ac.uk/pdbe/entry/emdb/EMD-10337] and EMD-10338 [https://www.ebi.ac.uk/pdbe/entry/emdb/EMD-10337]. The source data underlying Figs. 1b, 5b Supplementary Figs. 1b, 9d, e are provided as a Source Data file. Other data that support the findings of this study are available from the corresponding author upon request.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES