Abstract
Transcription-coupled repair is essential for the removal of DNA lesions from the transcribed genome. The pathway is initiated by CSB protein binding to stalled RNA polymerase II. Mutations impairing CSB function cause severe genetic disease. Yet, the ATP-dependent mechanism by which CSB powers RNA polymerase to bypass certain lesions while triggering excision of others is incompletely understood. Here we build structural models of RNA polymerase II bound to the yeast CSB ortholog Rad26 in nucleotide-free and bound states. This enables simulations and graph-theoretical analyses to define partitioning of this complex into dynamic communities and delineate how its structural elements function together to remodel DNA. We identify an allosteric pathway coupling motions of the Rad26 ATPase modules to changes in RNA polymerase and DNA to unveil a structural mechanism for CSB-assisted progression past less bulky lesions. Our models allow functional interpretation of the effects of Cockayne syndrome disease mutations.
Subject terms: Molecular modelling, Computational biophysics, Nucleotide excision repair
Here the authors provide models of RNA polymerase II bound to the yeast CSB ortholog Rad26 in different nucleotide states; explain how Rad26 domain motions help the polymerase progress past DNA lesions; and interpret the effects of CSB-associated disease mutations.
Introduction
Genomic DNA is under constant assault from a host of endogenous and environmental factors causing DNA damage. Nucleotide excision repair (NER) is arguably the most versatile pathway to repair damaged DNA. NER has evolved to remove a wide array of DNA lesions from the genome, including cyclobutane–pyrimidine dimers (CPD) and 6–4 pyrimidine–pyrimidone (6–4 PP) photoproducts, which are the major lesions induced by ultraviolet radiation; intrastrand crosslinks caused by cancer therapeutic drugs such as cisplatin; cyclopurines generated by reactive oxygen species; and bulky chemical adducts caused by carcinogen exposure1–3. NER is also exceptional for the variety of clinical manifestations associated with its genetic impairment1,4–6. Deficiencies in NER’s two sub-pathways, transcription-coupled repair and global genome repair, cause severe human diseases7–10—ultraviolet radiation‑sensitive syndrome (UVSS), the genetic disorder xeroderma pigmentosum (XP) linked to extreme cancer predisposition, cerebro-oculo-facio-skeletal syndrome (COFS), trichothiodystrophy (TTD), and Cockayne syndrome (CS) spectrum of disorders associated with premature ageing and accelerated neurodegeneration. This striking clinical heterogeneity is still incompletely understood at the level of structure and biological mechanisms.
In particular, transcription-coupled repair11–13 is essential for lesion removal from the template strand during transcription and is activated by the recruitment of Cockayne Syndrome B protein (CSB/ERCC6) to lesion-arrested RNA polymerase II (Pol II)14–19. In turn, CSB association triggers recruitment of downstream NER factors, such as Cockayne syndrome protein A (CSA/ERCC8), transcription factor IIH (TFIIH), and UV-sensitive syndrome protein (UVSSA)20,21. Mutations in CSB are associated with Cockayne syndrome, an autosomal-recessive genetic disorder, characterized by postnatal growth failure, progressive neurological dysfunction, premature aging, and photosensitivity22,23. Among Cockayne syndrome patients there is a high incidence of mutations in the CSB gene (~70%) with over 80 disease mutations characterized to date24,25.
Despite decades of genetic and biochemical studies, the role of CSB-Pol II association in the early stages of transcription-coupled repair remains incompletely understood11,26. Insights have come from analysis of yeast CSB orthologs, Rad26 in Saccharomyces cerevisiae and Rhp26 in Schizosaccharomyces pombe2,27–30. Notably, the recent cryo-EM structure of a Pol II–Rad26 complex was a breakthrough, revealing the binding mode of Rad26 and the prominent change in the DNA path induced by its presence28. Rad26 belongs to the SWI2/SNF2 family of chromatin remodelers. SWI2/SNF2 proteins share a common catalytic core, comprised of two RecA-like domains (RecA1, RecA2), which by themselves are sufficient to power DNA remodeling. In Rad26, RecA1 and RecA2 are separated by a flexible hinge region (residues 571-578) to facilitate the relative motion of the two domains during each cycle of ATP hydrolysis and DNA translocation (Fig. 1)28,30. The domains feature a set of seven conserved ATPase sequence motifs (Fig. 1a, c; Supplementary Fig. 1). The RecA1 motifs (I, Ia, II, and III) are primarily responsible for binding ATP, while the RecA2 motifs (IV, V and VI) mediate upstream dsDNA binding and, thus, may have a role in energy transduction. Three additional motifs (Supplementary Fig. 1, 2a) have putative roles in dsDNA stimulation of Rad26 ATPase activity (IIa) or ATP-driven translocation (IVa and Va)31. Importantly, the Rad26 ATPase modules have pronounced structural similarity to DExx box helicases, suggesting a shared ATP hydrolysis mechanism and similar nucleotide-induced conformational changes32. Moreover, there is evidence from biochemistry that Rad26 can function as a molecular motor to promote the forward movement of Pol II along the DNA template28,30. However, no CSB/Rad26 structure has been captured with bound nucleotides, leaving an important gap in our understanding of the ATP hydrolysis cycle of Rad26 and its role in remodeling the transcription bubble of a stalled RNA polymerase.
To address this gap, we extended the structural model of the nucleotide-free Pol II–Rad26 complex to include previously unmodeled regions. We also created structural models of Pol II–Rad26 in the nucleotide-bound ATP and ADP states. We then employed extensive molecular dynamics simulations and applied novel graph-theoretical approaches33 to define dynamic networks, communities and allosteric communication mechanisms in all three functional states of the Pol II–Rad26 assembly. Our analyses uncovered the structural elements, key protein residues and allosteric paths coupling ATP-hydrolysis to DNA binding and unveiled a possible structural basis for Rad26-mediated DNA remodeling to assist Pol II progression past damaged sites on DNA. Our computational analyses were complemented with mutational experiments, which validated the significance of key residues in the Pol II–Rad26 dynamic network.
Results
Reconstruction of a complete Pol II–Rad26 complex including the N- and C-terminal domains of Rad26
The previous Pol II–Rad26 cryo-EM structure was significant for offering an unprecedented molecular view of this complex protein machinery and explaining the functional role of Rad26 in early transcription-coupled repair28. The structure revealed how Rad26 engages the Pol II elongation complex, binding the upstream DNA duplex and dramatically altering its path. Importantly, it supported a mechanism wherein Rad26 recognizes stalled Pol II and acts as a molecular motor to reduce Pol II backtracking, promote forward translocation on the DNA template and facilitate transcriptional bypass of less bulky DNA lesions28. While Pol II and the Rad26 ATPase core could be modeled with confidence into the cryo-EM density at 5.8 Å resolution, the N- and C-terminal domains (NTD and CTD) remained unmodeled. Previous studies revealed that the NTD and CTD domains of S. cerevisiae Rad26, its S. pombe ortholog Rhp26 and human CSB play critical roles in regulating the ATPase and chromatin remodeling/DNA translocase activities of the central core domain and are involved in the recruitment of downstream repair factors and in UV damage dependent chromatin association20,34–38. Therefore, inclusion of these regions is essential for the success of molecular simulations aimed at elucidating the functional dynamics of the Pol II–Rad26 assembly.
To shed light on the functional roles of the CTD and NTD, we built a complete structural model of the Pol II–Rad26 complex from the available cryo-EM densities (EMD-8735 and EMD-8736)28. We used structures of SWI2/SNF2 proteins (PDB ID: 5HZR39, 5X0Y40, and 5JXR41) as templates for homology modeling to reconstruct the missing residues from the C-terminal end of Rad26 (798–861), which were subsequently docked into the cryo-EM density (EMD-8735). The missing residues from the NTD (94–228) and CTD (862–1085) were traced in the original EM density and built de novo. Rad26 and Pol II were then separately flexibly fitted into the cryo-EM density (EMD-8735) and combined to assemble the final model. The newly modeled regions of Rad26 are shown in Fig. 1 and Supplementary Fig. 3.
The Rad26 N-terminal domain is comprised of five extended helices connected by long linkers. As a result, the NTD is very extended and traverses the Pol II surface, making extensive contacts with Rpb1, Rpb2 and Rpb4/7 while also wrapping around the RecA1 domain (Fig. 1b, Supplementary Fig. 3a, and Supplementary Movie 1). Notably, a segment of NTD residues (171–264) lodges between the Pol II stalk (Rpb4/Rpb7) and the Rad26 RecA1 domain with linker residues (188–235) tracking along the Pol II Rpb1 and Rpb2 subunits. Importantly, along the path of the NTD are functionally significant elements of Pol II (Fig. 1e, g, Supplementary Fig. 3c, e, and Supplementary Movie 1), e.g. the Rpb1 coiled-coil domain and the Pol II stalk, which are common docking sites for transcription factors. Additionally, two helices from the Rad26 NTD domain make direct contacts with the upstream DNA duplex and the upstream fork of the DNA bubble (Fig. 1e, g, Supplementary Fig. 3c, e, and Supplementary Movie 1). Collectively, these contacts provide a handle for Rad26 to 1) exert regulatory influence on Pol II by affecting the binding of transcription factors; and 2) affect the stability of the transcription bubble and Pol II translocation along DNA. Tethering of Rad26 to Pol II via the NTD and the positioning of NTD between the Rad26 ATPase core and the RNA polymerase are key for maintaining the structural integrity of the entire Pol II–Rad26 assembly. Consistent with these observations, Rad26 becomes entirely or partially dispensable for transcription-coupled repair in yeast cells in the absence of Rpb429,42.
The C-terminal domain of Rad26 is also predominantly helical and wraps around the back side of the ATPase core, serving as a latch between the RecA1 and RecA2 domains (Fig. 1b, Supplementary Fig. 3a, and Supplementary Movie 1). Notably, residues from the CTD (883–919) and the HD2-1 linker (593–632) intercalate between the two DNA strands at the fork of the upstream transcription bubble (Fig. 1f, Supplementary Fig. 2b and 3d; Supplementary Movie 1). Indeed, we previously identified a CSB-specific coupling motif in the CTD (residues 900–910 in Rhp26 and 904–914 in Rad26) that couples the ATPase and chromatin remodeling/translocation activities37. The interactions between the newly modeled CTD and HD2-1 linker may explain how this region couples the ATPase and chromatin remodeling/translocation activities. The NTD and CTD of Rad26 pack against the HD1 and HD2 insertions of the ATPase domain to form key interactions with the upstream DNA fork, which is crucial for stabilizing the DNA bubble (Supplementary Fig. 2b). Moreover, in order to function as a molecular motor on dsDNA, Rad26 requires stable attachment to Pol II. In this respect, the newly modeled CTD and the NTD regions are critical. Together, the CTD and NTD contribute 1649 Å2 (71%) of buried surface area to the overall Rad26–Pol II interface, making them indispensable for the structural integrity of the complex (Supplementary Table 1).
Structural basis for Rad26 conformational switching during the ATPase cycle
To shed light on the nucleotide-induced conformational changes during the Rad26 ATP hydrolysis cycle, we modeled the ATP- and ADP-bound states of the Pol II–Rad26 complex based on a yeast SNF2 structure obtained with ADP-BeFx as a ligand43. We then simulated the Pol II–Rad26 assembly for ~ 11 μs in each of three functional states—apo, ATP and ADP. Clustering analysis of the MD trajectories past the point of convergence allowed us to identify the dominant conformers (Fig. 2) and uncover the intricate conformational rearrangements of the Rad26 ATPase core during the ATP-hydrolysis cycle. The observed binding modes of the ATP and ADP nucleotides are shown in Fig. 2a, b, respectively. Nucleotide binding occurs at the interdomain cleft formed by RecA1 and RecA2. Residues from the conserved ATPase motifs form the primary nucleotide-binding pocket: L326, K328, I330 from motif I, D469 and E470 from motif II, and R763 and R766 from motif VI. Specifically, the adenine group is wedged between the hydrophobic L326, I330, and W368 residues. In addition, five residues associated with motifs I, II, and VI (K328, R763, R766, D469, and E470) contact the phosphate moiety either directly or indirectly through Mg2+ ion coordination. Based on the Sth1 and Snf2 structures43,44, the residues functionally equivalent to R763 and R766 are coordinated and positioned to serve as sensors of the nucleotide state of the active site. Additionally, there is mutational evidence of the functional significance of the sensor residues31,45. Notably, even conservative mutation of K328 to arginine completely abolishes Rad26 ATPase activity28. Similarly, in CSB mutation of E646, which is the functional equivalent of E470 in Rad26, abolishes ATPase activity whereas mutations in motifs V and VI diminish the ATP-hydrolysis activity46.
Nucleotides induce mutual rotation of the RecA1 and RecA2 domains accompanied by closure of the interdomain cleft (Supplementary Movie 2). Correspondingly, the RecA1–RecA2 interface becomes progressively tighter from the apo state to the ADP-bound and eventually to the ATP-bound conformer (Fig. 2c–h). These changes are reflected in the dramatic increase in buried surface area (BSA) at the domain interface. The loosely assembled apo interface features BSA of only 866 Å2 while the ADP-bound (1293 Å2) and ATP-bound (1322 Å2) interfaces are considerably more extended. The same progression is evident in the computed B-factors for the Rad26 core (Supplementary Fig. 4a–l), showing decreased mobility and increased interface stability going from the apo to the ADP and the ATP functional states.
To gain insight into the residue interactions responsible for the observed changes, we relied on persistent contact analysis of the MD trajectories. Specifically, we classified interface contacts as hydrogen-bond, salt–bridge or hydrophobic and defined suitable geometric criteria for each class of interactions to identify them in the simulation trajectories. Contacts occurring in more than 50% of the MD trajectory frames were considered persistent. Predictably, the RecA1–RecA2 interface is dominated by persistent contacts between conserved polar and charged residues (Fig. 2c–h and Supplementary Fig. 4g–i). Electrostatics plays a critical role in accommodating the negative charge of ATP by positioning the ligand between two overwhelmingly positive Rad26 surfaces (Supplementary Fig. 4g–i). This also ensures a marked decrease in interface compactness upon release of the negative γ-phosphate after ATP hydrolysis. Correspondingly, in the apo state RecA1 is rotated away from RecA2 leaving a significant gap between the ATPase modules. The domain interface is held together by a cluster of hydrogen bonding residues (G498, T499, Y539 from RecA1 and S755, Q759, H790 from RecA2) and a few additional contacts e.g. the hydrophobic V569-I767 pair (Fig. 2c, d). By contrast, the presence of ATP reorients the ATPase modules resulting in a wide, compact, electrostatically compatible interface between RecA1 and RecA2. The interface features numerous persistent contacts between conserved Rad26 functional motifs—I, II, IIa and HD1 in RecA1 and Va, VI and HD2 in RecA2 (Fig. 2 and Supplementary Fig. 1). Hydrogen bonding (Q360-R766, K570-P578, L326-Q726), salt–bridge (E507-K795, R558-D805, K328-ATP, R766-ATP, R763-ATP) and hydrophobic interactions (W368-ATP, L326-ATP, I330-ATP, Y539-L798) are all present, strengthening RecA1-RecA2 domain association.
Motions of the Rad26 ATPase domains upon change of nucleotide state result in direct DNA pulling
Our Pol II–Rad26 models also shed light on the key question of how Rad26 harnesses the energy from ATP hydrolysis to effect conformational changes in DNA. Rad26 features a DNA binding groove formed by the RecA1 and RecA2 domains (Fig. 3a, d, g). DNA recognition involves charged contacts along the minor groove of the DNA duplex immediately preceding the transcription bubble. These interactions involve structural elements from the conserved Rad26 motifs (Ia and IIa from RecA1; IV, IVa, V, and Va from RecA2; Fig. 3g–i). Important contacts are also made by the RecA2 domain with single-stranded DNA at the fork of the transcription bubble. Among the RecA2 contacts, a tryptophan residue, W752 from motif Va, stands out. W752 is conserved not only in the CSB protein subfamily, but also in the Chd1, Snf2, ISWI, and INO80 chromatin remodelers (Supplementary Fig. 5). The bulky tryptophan side chain inserts into the upstream DNA fork and is accommodated through base-stacking interactions (Fig. 3 and Supplementary Fig. 5), highlighting the importance of W752 for stabilizing the transcription bubble and for DNA translocation. W752 is also adjacent to the N-terminus of motif VI, which harbors the conserved arginine residues (R763 and R766) essential for ATP hydrolysis. Therefore, W752 may provide a dual connection, on one side to the Rad26 ATP sensing elements, and on the other to the DNA fork. Strikingly, mutation of the functionally equivalent W936 residue in CSB has been reported to cause type I Cockayne syndrome47. Another key observation is that the number of persistent DNA contacts is much greater for the RecA2 domain as compared to RecA1. Weaker DNA binding allows RecA1 to slide along the minor groove during the translocation step while the RecA2 maintains a tighter grip at the fork of the transcription bubble (Fig. 3 and Supplementary Movie 2).
Importantly, our Pol II–Rad26 models display prominent differences in the DNA binding modes of the apo and nucleotide-bound conformers. The conformational transition from the compact ATP-bound state to the more open ADP-bound and apo states is accompanied by concerted swing motions of the RecA1 and RecA2 modules on the opposing sides of the upstream DNA duplex. In response, the duplex shifts back and rotates by 14° in the Rad26 DNA-binding groove (Supplementary Movie 2). Thus, DNA translocation induced by the Rad26 molecular motor occurs by stepwise winding and pushing along the axis of the upstream DNA duplex. The basic features of this translocation mechanism are shared with the bacterial Mfd protein, which was recently visualized by cryo-EM48. Strikingly, Rad26 and Mfd appear to have achieved functional convergence despite having no sequence conservation and only limited structural similarity.
In the apo state, structural elements from the HD1, HD2-1 and HD2-2 motifs and the CTD contact the ssDNA of the template strand via hydrogen-bond, salt–bridge and hydrophobic interactions. This tight grip on the ssDNA in the apo state allows RecA2 to act as a ratchet to prevent DNA slippage during the ATP-hydrolysis cycle. In contrast, weakened RecA2–ssDNA interactions in the nucleotide-bound states could allow easy DNA translocation. Switching of nucleotide state thus results in alternating weak and strong DNA binding at the edge of the transcription bubble and in Rad26-mediated pulling on the template strand. Collectively, these motions may facilitate lesion scanning in early transcription-coupled repair.
Network analysis unveils Pol II–Rad26 dynamic communities and key allosteric communication mechanisms
To discover Pol II–Rad26 allosteric residue networks and communication mechanisms, we applied graph-theoretical approaches33,49,50 that map dynamic information from our extensive MD simulations onto graphs representing the protein topology (i.e. residues are represented by graph nodes; edges connect contacting residues). Graph edges are weighted by persistent contact probabilities derived from our MD ensembles, allowing allosteric communication to be quantified by the graph edge betweenness measure. We then applied the Girvan-Newman algorithm to partition the complexes into dynamic communities—tightly connected clusters of residues that move together as modules.
To accomplish systematic comparisons of allosteric responses in all three simulated complexes, we used the difference contacts network analysis method (dCNA)51. Briefly, dCNA involves the following steps: (1) individual residue contact networks are computed for each simulated complex separately; (2) a consensus network is calculated, in which edges represent stable contacts across all simulations; (3) communities are detected and mapped onto the consensus network; 4) subtracting contact probabilities of the individual networks leads to difference contact networks represented as graphs, indicating which communities/interfaces are gaining or losing stable contacts. In this way, dCNA maps multiple MD ensembles onto a single consensus network graph in order to monitor contact probability changes across community interfaces. In turn, this mapping reveals intricate contact differences and subunit interface rearrangements, indicative of allosteric communication.
We constructed a consensus network from ensemble simulations of the three Pol II–Rad26 functional states and identified 22 distinct communities (Fig. 4a, Supplementary Movie 3). Changes in contact probabilities between communities during the ATP hydrolysis cycle (apo → ATP-bound → ADP-bound → apo) are shown in Fig. 4b–d. Residues responsible for the largest changes in contact probability are mapped back onto the Pol II–Rad26 structure and shown in Fig. 5a–c. Rad26 encompasses five distinct communities. RecA1 and RecA2 correspond to communities F and C (Fig. 4a–d), which separate precisely along the ATP-binding cleft reflective of their function as independent ATPase modules. Community R covers the CTD latch at the back of the Rad26 ATPase core while communities S and T represent segments of the extended N-terminal domain. Notably, communities subdivide the complex into dynamic modules independently from the Pol II–Rad26 domain structure. For instance, community C includes not only RecA2 but also tightly associated parts of Rpb2 and DNA. Predictably, upon ATP binding we observe a large contact probability increase between communities C and F. This gain of contacts at the RecA1–RecA2 interface does not occur in isolation but, instead, triggers concomitant net contact losses between several adjacent communities (e.g. between the F–R, R–I, C–I, I–E community pairs). Strikingly, this change in interfacial contacts is not confined to Rad26 but extends from the Rad26 docking site on Rpb2 (community I) through the Rpb2 lobe (community E) all the way to the Rpb1 jaw (community P). Conversely, on the opposite side of Pol II the Rpb1 clamp (community B) and cleft (community H) gain stable contacts with one another and with the downstream dsDNA (community U). Together, these changes indicate a cascading allosteric response along the Pol II central cavity, which surrounds the transcription bubble. As expected for a cyclic process, the observed changes are partially reversed in the ATP to ADP transition and then fully reversed in the ADP to apo transition. Correspondingly, two-stage loss of contact probability is seen between communities C and F as the RecA1–RecA2 interface opens up on the way back to the apo state. On the other hand, interactions between RecA2 and the Rpb2 are enhanced in the ATPADP transition suggesting the ATP-hydrolysis affects the RecA2 binding to Rpb2 and the DNA fork (community I). In the ADPapo transition we observe net contact gain across the interfaces of communities R, I and E via the same allosteric network.
Notably, dCNA detects a global allosteric network encompassing not only CSB but extending along the circumference of the Pol II central cavity. Our results support a mechanism whereby allosteric changes initiated within the Rad26/CSB molecular motor propagate to Pol II and, in turn, affect the stability of the transcription bubble and the opening/closing dynamics of the Pol II cleft. Such changes may allow Pol II to bypass lesions in the transcribed strand.
Models shed light on the impact of CS disease mutations on CSB structure and dynamics
Our Pol II–Rad26 model and dynamics simulations also aid in interpreting the effects of disease mutations associated with Cockayne syndrome. Based on the severity of symptoms and age of onset, CS patients are classified into three types: moderate type I CS, with the first symptoms appearing from the end of the first year of life and mortality occurring prior to the age of 20; early-onset and/or severe type II CS; and late-onset type III CS. Cerebro-oculo-facio-skeletal (COFS) syndrome has also been used to describe a very severe form of the disorder with disease onset at the prenatal stage25,52. First, we mapped the positions of 19 missense CSB mutations to the equivalent residues of our Pol II–Rad26 model after aligning the human CSB and yeast Rad26 sequences (Fig. 6a, Supplementary Fig. 6, Supplementary Table 2, and Supplementary Movie 1). Disease mutations impact primarily the CSB ATPase core (9 in the RecA1 and 10 in the RecA2 domain).
Sequence comparison of the C-terminal ends between human CSB (residues 1010–1493) and yeast Rad26 (residues 861–1085) shows < 30% sequence identity, which complicates mapping of disease mutations for this region. Three mutations found in the CTD sequences (P1042S, P1095R and R1213G) fall outside the extent of our structural model and could not be mapped. Mutations P1042S, P1095R and R1213G (and all mutations thereafter) are numbered for the human CSB protein.
All but two mutants are positioned in or near the seven conserved Rad26 motifs. Most are found at the ends of helices or in loop regions, indicating their impact on protein dynamics or on the stability of RecA1/RecA2 secondary-structure elements. Importantly, mutations preferentially localize at the interfaces of three Rad26 dynamic communities—F (RecA1), C (RecA2), R (CTD latch). Five are at the F–C community interface (L536W, D646N, N680D, Q953K and L987P); two at the F–R interface (W686C and S687L) and one at the juncture of all three communities (L987P). Additionally, two mutants map onto the functionally important HD2-2 region. Mutations at community interfaces may impact Rad26 dynamics or disrupt key interactions of the ATPase modules, affecting nucleotide binding during ATP-hydrolysis. Mutations may also act allosterically. To address allosteric communication through Rad26, we generated correlation-weighted dynamic networks for the complex in apo, ATP and ADP states and used these to compute suboptimal paths connecting the nucleotide-binding site and the DNA-insertion helix (Supplementary Fig. 7). Suboptimal paths encode dynamic residue correlation from MD and are, thus, indicative of the allosteric pathways traversing the Pol II–Rad26 complex. In the nucleotide-bound states, ATP or ADP serves as a linchpin holding the ATPase core together. Consequently, most suboptimal paths pass directly through the RecA1–RecA2 interface (via helices α23 and α24), across the RecA2 central beta sheet (strands β7, β8, β10, β11, and β12), along the α20 helix, and eventually converge on the DNA-insertion helix (α19). By contrast, in the apo state suboptimal paths are diverted away from the RecA1–RecA2 interface towards dynamic community R, following a series of alpha helices—α16, α17, α31, α25, and α19. Both sets of paths are important for Rad26 dynamics during the ATPase cycle. Strikingly, 70% (14 out of 19) of CS mutations line along these dominant allosteric pathways. Out of the 14 mutants seven (D532G, S687L, N680D, W686C, L987P, V773G and Q953K) are distal from both the ATP-binding site and DNA and, therefore, exert their effect by disrupting allosteric communication through the ATPase core (Fig. 6 and Supplementary Fig. 7). This finding highlights the crucial role of dynamic communication for energy transduction through the Rad26 ATPase core.
Another distinct set of mutations localize to the domain cores (Fig. 6a), likely impacting protein stability or compromising the integrity of key secondary-structure elements (G528E, R670W, I543F, W851R, L871P, L875P, A926P). Mutations to proline are particularly disruptive as they redirect the protein backbone. Finally, three mutants (W936C, P934T and R975W) directly impact DNA binding. W936C in motif Va prevents stabilization of the DNA upstream fork, P934T redirects the protein backbone near DNA, and R975W creates a clash with DNA.
We targeted six CSB missense mutations for experimental validation using restriction enzyme accessibility assays in the yeast CSB ortholog Rhp26. Four mutants were classified as important for allosteric communication: N680D positioned at the F–C community interface; W686C and S687L at the F–R interface; and L987P at the junction of the F, C and R communities. Two mutations, W851R and V957G, were positioned in the domain cores likely affecting protein stability or packing of secondary-structure elements. Three mutants (W686C, S687L, W851R) failed to express as soluble proteins, indicating that these residues are important for proper folding and stability Rhp26, which is consistent with our functional annotation of mutants in Supplementary Table 2. We then measured the capabilities of remaining three yeast CSB mutants in remodeling nucleosome arrays, which requires their DNA translocation activities. All three mutants showed reduced activities in comparison with WT (Fig. 6b) in the order WT > V957G > N680D > L803P. Intriguingly, the observed ordering correlates well with clinical phenotype: e.g. L803P (Type II, COFS) has more severe phenotype than N680D and V957G (Type I). The V957G is the least severe among the mutants we tested, which is consistent with the structural observation that this residue localizes at the juncture of a helix and loop of RecA2 that are important for allosteric communication. N680D and L987P greatly impede CSB translocation activities, highlighting the functional importance of the F-R community interface.
We also classified several deletion/insertion mutations causing severe CS (Supplementary Fig. 6c and Supplementary Table 2). Motif I and III are responsible for ATP hydrolysis and deletions R467-R562del, Y510-R562del, F665-Q723del, and W589del directly impact ATPase activity. Deletions V724-Q762del and V763-Q794del occur in HD2-1 region and disrupt DNA binding. The M752-Q762del deletion localizes to the hinge region between RecA1 and RecA2, affecting dynamics and the ability of the ATPase modules to close in response to nucleotide binding. One insertion, K538-T539delinsKNVF, disrupts motif I impairing ATPase activity53. E608-Q723del is a very extensive deletion affecting the entire ATPase core. Predictably these deletion/insertion mutations are known to cause the most severe type II CS phenotype.
Discussion
The wide variety of lesions processed by the transcription-coupled repair pathway has led to the evolution of a remarkably complex protein machinery. To unravel the precise functional roles of CSB (or Rad26) in this machinery, we built suitably complete structural models of Pol II–Rad26 complexes in apo, ATP- and ADP-bound states. We then employed extensive molecular dynamics simulations and novel graph-theoretical methods to analyze the functional dynamics of these assemblies.
We support a mechanism wherein Rad26 binds to the upstream DNA duplex of a stalled Pol II, redirects the path of DNA and recognizes the edge of the transcription bubble through specific interactions, involving conserved structural elements—HD1, HD2-1, HD2-2 and the CTD. Nucleotide-driven closure and opening of the ATPase modules allow the Rad26 molecular motor to shift and rotate the upstream DNA duplex while alternating tight and loose DNA binding of the RecA2 domain at the transcription bubble fork causes pulling on the template strand (Fig. 7). Thus, our models reveal a possible structural basis for Rad26-mediated DNA remodeling to assist Pol II progression on a damage-containing DNA template. While such forward motion along the DNA template may aid transcriptional bypass of less bulky DNA lesions, it may also result in extreme Pol II stalling upon encounter of a bulky lesion. In turn, this may trigger recruitment of downstream NER factors—CSA, UVSSA and TFIIH. In this way, Rad26 provides a mechanism for lesion discrimination in the early steps of transcription-coupled repair. For this model to be operational, the Rad26 molecular motor must be firmly attached to the rest of the Pol II machinery. Correspondingly, we find that the previously unmodeled CTD and NTD domains play a key role in anchoring Rad26 to Pol II and establishing a productive orientation with respect to the transcription bubble.
Furthermore, dynamic network analysis of the vast conformational ensembles from MD led to the discovery of a global allosteric network extending from Rad26 along the circumference of the Pol II central cavity. Thus, we support an allosteric mechanism for CSB/Rad26 to power distal conformational changes, affecting the opening/closing dynamics of Pol II and the DNA transcription bubble. Such conformational changes may allow Pol II to bypass lesions in the transcribed strand. We also mapped Cockayne syndrome disease mutations onto our model and examined their positioning in the context of dynamic communities and allosteric pathways identified from network analysis. This allowed us to predict their effects in disrupting local structure and dynamics, providing insights into disease etiology.
Collectively, our results impact understanding of CSB (Rad26) as a central constituent of the transcription-coupled repair machinery and shed light on fundamental mechanisms by which harmful DNA lesions are detected and removed from the transcribed genome.
Methods
Model building
To model Rad26, we used the available Pol II–Rad26 cryo-EM density (EMDB accession code: EMD-8735)28. The Rad26 NTD (residues 94–228) had no known structural homologs and was, therefore, built de novo. The structures (229–297 and 559–577) of 5VVR were rebuilt based on the EM density and secondary-structure prediction (Supplementary Fig. 2c and Supplementary Table 3). We used the GeneSilico metaserver for consensus secondary-structure prediction54 and applied the results to establish the sequence register in the EM density. Individual secondary-structure fragments were constructed using COOT55 to generate a backbone only model by tracing the EM density. The resulting polypeptide chain segments were connected by extending the main-chain trace. Side chain orientations were built and manually adjusted based on the electron density. Bulky residues such as phenylalanine, tyrosine, tryptophan, and arginine were instrumental in validating model construction and sequence registration. To model the Rad26 CTD (residues 798-860), the structures of the yeast T. thermophilus ISWI (PDB ID: 5JXR41), S. cerevisiae SWI2/SNF2 (PDB ID: 5X0Y40), and M. thermophila SNF2 (PDB ID: 5HZR39) were used as templates to construct the Rad26 CTD homology structure. The missing residues from the CTD (861–1085) were traced in the original EM density and built de novo. We then built the entire Rad26 by docking the newly constructed NTD and CTD into the density. The docked model was then manually adjusted to the cryo-EM density using COOT55. The final Rad26 structure was added to Pol II from the 5VVR structure to produce the complete Pol II–Rad26 complex, which was subsequently refined in a 4-ns molecular dynamics flexible fitting (MDFF) simulation56–58. The MDFF biasing potential was applied with a scaling factor ξ of 0.2. At the end of our protocol, we performed real space refinement to the EMD-8735 cryo-EM density using the PHENIX package59,60, Finally, we constructed Pol II–Rad26 complexes in the ATP- and ADP-bound state by placing the ATP or ADP ligands into the Rad26 nucleotide-binding pocket based on an yeast SNF2 structure crystallized with ADP-BeFx (PDB ID: 5Z3V)43. Mg2+ ion coordination was also based on the yeast SNF2 structure. The evaluate the map-to-model fit we computed cross-correlation coefficients (CCC) with UCSF Chimera for Rad26 only and for the entire Pol II–Rad26–DNA assembly. The CCC values were 0.66 for Rad26 and 0.77 for the Pol II–Rad26 complex, respectively.
Molecular dynamics
To address the Pol II–Rad26 functional dynamics we performed extensive molecular dynamics simulations on the Summit machine at the Oak Ridge Leadership Computing Facility. All simulation systems were set up with the TLeap module of AMBER61 and solvated with TIP3P water molecules62 in a box with a minimum distance of 12.0 Å from the protein to the edge of the simulation box. The simulation boxes for the apo, ATP- and ADP-bound systems had dimensions of 178.3 × 189.1 × 186.2 Å3, 182.4 × 192.2 × 190.3 Å3 and 182.2 × 192.3 × 190.3 Å3 and contained 630,217, 579,191 and 579,192 atoms, respectively. ADP and ATP were obtained from the AMBER parameter database63. Counterions were added to neutralize the total charge of the protein complex and reach 150-mM NaCl concentration to mimic physiological conditions.
Energy minimization was conducted for 3000 steps with fixed protein backbone atoms and for an additional 1500 steps with harmonic restraints on the backbone atoms (k = 10 kcal mol−1 Å−2). The temperature of the simulated systems was then gradually increased to 300 K over 500 ps of dynamics in the NVT ensemble. Positional restraints were imposed on all heavy atoms (k = 10 kcal mol−1 Å−2). Equilibration was continued for another 5 ns in the NPT ensemble, and positional restraints were gradually released to fully equilibrate the systems. Production runs were conducted with a 2-fs timestep in the NPT ensemble (1 atm and 300 K) for ~11 μs for each of the Pol II–Rad26 complexes (apo, ATP- and ADP-bound). The smooth particle mesh Ewald (SPME)64 electrostatics was employed with 10 Å cutoff for short-range non-bonded interactions. The simulations were performed with the CUDA version of the Amber18 PMEMD code using the Parm14SB force field65 and bsc1 modifications66 to the nucleic acid parameters. In total, 11 independent production runs of ~1 μs length were completed for each system (apo, ATP and ADP). We selected the last ~280 ns of each independent trajectory based on root-mean-square deviation (RMSD) convergence. RMSD values were computed over the protein Cα atoms and nucleic acid P atoms. More than 100,000 conformations per functional state were selected for clustering analysis to identify the dominant conformations and for dynamic network analysis with the dCNA code44. All figures were generated using UCSF Chimera60.
Difference contacts network analysis
In the difference contacts network analysis method (dCNA), a consensus network was first constructed for all functional states (apo, ATP- and ADP-bound). The Girvan-Newman algorithm was then applied on the consensus network to subdivide the assembly into dynamic communities. Once the community structure was identified, a second step involving subtraction of contact probability maps was carried out to detect changes in between the functional states. Contact maps were generated with the MDTraj package61 using the last 280 ns from the MD trajectories of each Pol II–Rad26 functional state (apo, ATP- and ADP-bound). To obtain the consensus network, contact maps for the ADP and ATP states were subtracted from the apo map. Edges were drawn between nodes and assigned a weight of 1.0 if the change in contact probability was > = 90%, indicating persistence across all functional states. In order to derive the community structure from the consensus network, we relied on a non-weighted version of the Girvan-Newman algorithm, available in the Python package NetworkX. The consensus network was continually subdivided using Girvan-Newman62 until the difference in modularity between subsequent partitions was <0.001. The final partition yielded 22 distinct communities with a modularity of 0.91. Finally, the change in contact probability between the consensus communities was decomposed into individual transitions representing the ATP hydrolysis cycle (APOATP, ATPADP, ADPAPO). This was achieved by subtracting the relevant contact maps from each other (i.e. ATP–APO, ADP–ATP, APO–ADP) to obtain three distinct difference networks. We then focused on the overall change in contact probability between the consensus communities in each transition. The overall change (ΔP) was determined by summing the individual differences between two distinct communities,
1 |
Note that we only consider the difference in contact probability between residue i and j (pij) if they reside in two different communities A or B. Since pij can take on both positive and negative values, the value of the overall change is either negative (indicating loss in interactions), positive (indicating gain in interactions) or zero (indicating no change).
Suboptimal paths
To compute suboptimal paths, we relied on dynamic network analysis. In dynamic networks, protein and DNA residues are represented as nodes with edges that connect nodes in persistent contact. Persistent contacts, in this case, were defined as having one or more heavy atoms within a distance of 4.5 Å of each other for more than 75% of the MD trajectory. Edge weights were then calculated using cross-correlation,
2 |
where r is the positional vector of residues i, j and is the time-evolved average of the positional vector computed from the MD trajectory. The final edge weights are then transformed through,
3 |
We determined networks for all three functional states (apo, ADP, ATP) using the last 280-ns of each trajectory. From these networks, the first 5000 suboptimal paths were computed using the SOAN method67. The source and target residues were R612 and D469, respectively, and were selected based on their positioning within the Rad26 structure. This choice was aimed at identifying allosteric communication paths originating in RecA1 (proximal to the Rad26 active site) and leading to the insertion helix at the edge of the DNA transcription bubble. In the SOAN method, all nodes two neighbors away from the optimal path were considered when reducing the original graph to improve the efficiency of the suboptimal path determination.
Restriction enzyme accessibility assays
To evaluate the significance of individual residues, we used Rhp26 as a model due to the technical challenge of expressing and purifying pure Rad26 proteins (wild-type and mutants). Expression and purification of Rhp26 wild type and mutants were performed essentially as previously described30. Restriction enzyme accessibility assays were used to characterize the chromatin remodeling activity of Rhp26 (DNA translocase activity on chromatin template) and performed as described previously with minor modifications. Briefly, chromatins were reconstituted by the gradient salt dialysis method by using Xenopus laevis core histones and an ∼3-kb plasmid DNA. Then, 200 ng of chromatin were gently mixed with Rhp26 in 1X NEB cut smart buffer containing 3 mM ATP and 5 mM MgCl2. The remodeling reaction was performed at 30 °C for 1 h followed by adding 15 U of HaeIII restriction enzyme (NEB) to digest the remodeled chromatin. After digestion for 1 h at 27 °C, samples were deproteinized, and DNA was purified, resolved by 1% agarose gel, and visualized by Gel Red (Biotium) DNA staining.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was supported by National Institutes of Health grants R35GM139382 (I.I.) and GM102362 (D.W.) and National Science Foundation grant MCB-2027902 (I.I.). An award of computer time to I.I. was provided by the INCITE program. This research also used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725.
Source data
Author contributions
I.I. and D.W. directed the study. C.Y., J.X., D.W. and I.I. contributed to the design of the study. C.Y. performed model building and molecular simulations of the models. J. X. B.L. and J.O. purified protein mutants and performed biochemical assays. C.Y., T.D., J.X., J.Y., D.W. and I.I. analyzed the data. C.Y., T.D., J.X., J.Y., D.W. and I.I. wrote the manuscript.
Data availability
The data that support the findings of this study are available from the corresponding authors upon reasonable request. The models of Pol II–Rad26 in apo, ATP-bound and ADP-bound states have been deposited in the ModelArchive database with DOI accession codes: 10.5452/ma-hxt70, 10.5452/ma-bd9wm and 10.5452/ma-5liiv, respectively. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Jung-Hyun Min and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Chunli Yan, Thomas Dodd.
Contributor Information
Dong Wang, Email: dongwang@ucsd.edu.
Ivaylo Ivanov, Email: iivanov@gsu.edu.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-27295-4.
References
- 1.Scharer OD. Nucleotide excision repair in eukaryotes. Cold Spring Harb. Perspect. Biol. 2013;5:a012609. doi: 10.1101/cshperspect.a012609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wang W, Xu J, Chong J, Wang D. Structural basis of DNA lesion recognition for eukaryotic transcription-coupled nucleotide excision repair. DNA Repair (Amst.) 2018;71:43–55. doi: 10.1016/j.dnarep.2018.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Brueckner F, Hennecke U, Carell T, Cramer P. CPD damage recognition by transcribing RNA polymerase II. Science. 2007;315:859–862. doi: 10.1126/science.1135400. [DOI] [PubMed] [Google Scholar]
- 4.Marteijn JA, Lans H, Vermeulen W, Hoeijmakers JH. Understanding nucleotide excision repair and its roles in cancer and ageing. Nat. Rev. Mol. Cell Biol. 2014;15:465–481. doi: 10.1038/nrm3822. [DOI] [PubMed] [Google Scholar]
- 5.Kamileri I, Karakasilioti I, Garinis GA. Nucleotide excision repair: new tricks with old bricks. Trends Genet. 2012;28:566–573. doi: 10.1016/j.tig.2012.06.004. [DOI] [PubMed] [Google Scholar]
- 6.Araujo SJ, Kuraoka I. Nucleotide excision repair genes shaping embryonic development. Open Biol. 2019;9:190166. doi: 10.1098/rsob.190166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Berneburg M, Lehmann AR. Xeroderma pigmentosum and related disorders: defects in DNA repair and transcription. Adv. Genet. 2001;43:71–102. doi: 10.1016/s0065-2660(01)43004-5. [DOI] [PubMed] [Google Scholar]
- 8.Lehmann AR. The xeroderma pigmentosum group D (XPD) gene: one gene, two functions, three diseases. Genes Dev. 2001;15:15–23. doi: 10.1101/gad.859501. [DOI] [PubMed] [Google Scholar]
- 9.Fassihi H, et al. Deep phenotyping of 89 xeroderma pigmentosum patients reveals unexpected heterogeneity dependent on the precise molecular defect. Proc. Natl Acad. Sci. USA. 2016;113:E1236–E1245. doi: 10.1073/pnas.1519444113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pugh J, et al. Use of big data to estimate prevalence of defective DNA repair variants in the US Population. JAMA Dermatol. 2019;155:72–78. doi: 10.1001/jamadermatol.2018.4473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hanawalt PC, Spivak G. Transcription-coupled DNA repair: two decades of progress and surprises. Nat. Rev. Mol. Cell Biol. 2008;9:958–970. doi: 10.1038/nrm2549. [DOI] [PubMed] [Google Scholar]
- 12.Saxowsky TT, Doetsch PW. RNA polymerase encounters with DNA damage: transcription-coupled repair or transcriptional mutagenesis? Chem. Rev. 2006;106:474–488. doi: 10.1021/cr040466q. [DOI] [PubMed] [Google Scholar]
- 13.Lindsey-Boltz LA, Sancar A. RNA polymerase: the most specific damage recognition protein in cellular responses to DNA damage? Proc. Natl Acad. Sci. USA. 2007;104:13213–13214. doi: 10.1073/pnas.0706316104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lahiri I, et al. 3.1A structure of yeast RNA polymerase II elongation complex stalled at a cyclobutane pyrimidine dimer lesion solved using streptavidin affinity grids. J. Struct. Biol. 2019;207:270–278. doi: 10.1016/j.jsb.2019.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Troelstra C, et al. ERCC6, a member of a subfamily of putative helicases, is involved in Cockayne’s syndrome and preferential repair of active genes. Cell. 1992;71:939–953. doi: 10.1016/0092-8674(92)90390-x. [DOI] [PubMed] [Google Scholar]
- 16.Sarker AH, et al. Recognition of RNA polymerase II and transcription bubbles by XPG, CSB, and TFIIH: insights for transcription-coupled repair and Cockayne Syndrome. Mol. Cell. 2005;20:187–198. doi: 10.1016/j.molcel.2005.09.022. [DOI] [PubMed] [Google Scholar]
- 17.Laine JP, Egly JM. Initiation of DNA repair mediated by a stalled RNA polymerase II. EMBO J. 2006;25:387–397. doi: 10.1038/sj.emboj.7600933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Fousteri M, Vermeulen W, van Zeeland AA, Mullenders LH. Cockayne syndrome A and B proteins differentially regulate recruitment of chromatin remodeling and repair factors to stalled RNA polymerase II in vivo. Mol. Cell. 2006;23:471–482. doi: 10.1016/j.molcel.2006.06.029. [DOI] [PubMed] [Google Scholar]
- 19.Selby CP, Sancar A. Cockayne syndrome group B protein enhances elongation by RNA polymerase II. Proc. Natl Acad. Sci. USA. 1997;94:11205–11209. doi: 10.1073/pnas.94.21.11205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.van der Weegen Y, et al. The cooperative action of CSB, CSA, and UVSSA target TFIIH to DNA damage-stalled RNA polymerase II. Nat. Commun. 2020;11:2104. doi: 10.1038/s41467-020-15903-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tsutakawa SE, et al. Envisioning how the prototypic molecular machine TFIIH functions in transcription initiation and DNA repair. DNA Repair (Amst.) 2020;96:102972. doi: 10.1016/j.dnarep.2020.102972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Colella S, Nardo T, Botta E, Lehmann AR, Stefanini M. Identical mutations in the CSB gene associated with either Cockayne syndrome or the DeSanctis-Cacchione variant of xeroderma pigmentosum. Hum. Mol. Genet. 2000;9:1171–1175. doi: 10.1093/hmg/9.8.1171. [DOI] [PubMed] [Google Scholar]
- 23.Mallery. Molecular analysis of mutations in the CSB (ERCC6) gene in patients with Cockayne syndrome. Am. J. Hum. Genet. 1999;64:1491–1491. doi: 10.1086/301686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Licht CL, Stevnsner T, Bohr VA. Cockayne syndrome group B cellular and biochemical functions. Am. J. Hum. Genet. 2003;73:1217–1239. doi: 10.1086/380399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Vessoni, A. T., Guerra, C. C. C., Kajitani, G. S., Nascimento, L. L. S. & Garcia, C. C. M. Cockayne Syndrome: the many challenges and approaches to understand a multifaceted disease. Genet. Mol. Biol.43, e20190085 (2020). [DOI] [PMC free article] [PubMed]
- 26.Lans H, Hoeijmakers JHJ, Vermeulen W, Marteijn JA. The DNA damage response to transcription stress. Nat. Rev. Mol. Cell Biol. 2019;20:766–784. doi: 10.1038/s41580-019-0169-4. [DOI] [PubMed] [Google Scholar]
- 27.Vangool AJ, et al. Rad26, the functional Saccharomyces-cerevisiae homolog of the Cockayne-syndrome-B gene ERCC6. Embo J. 1994;13:5361–5369. doi: 10.1002/j.1460-2075.1994.tb06871.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Xu J, et al. Structural basis for the initiation of eukaryotic transcription-coupled DNA repair. Nature. 2017;551:653–657. doi: 10.1038/nature24658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li SS. Transcription coupled nucleotide excision repair in the yeast Saccharomyces cerevisiae: The ambiguous role of Rad26. DNA Repair. 2015;36:43–48. doi: 10.1016/j.dnarep.2015.09.006. [DOI] [PubMed] [Google Scholar]
- 30.Wang L, et al. Regulation of the Rhp26ERCC6/CSB chromatin remodeler by a novel conserved leucine latch motif. Proc. Natl Acad. Sci. USA. 2014;111:18566–18571. doi: 10.1073/pnas.1420227112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Durr H, Korner C, Muller M, Hickmann V, Hopfner KP. X-ray structures of the Sulfolobus solfataricus SWI2/SNF2 ATPase core and its complex with DNA. Cell. 2005;121:363–373. doi: 10.1016/j.cell.2005.03.026. [DOI] [PubMed] [Google Scholar]
- 32.Hilbert M, Karow AR, Klostermeier D. The mechanism of ATP-dependent RNA unwinding by DEAD box proteins. Biol. Chem. 2009;390:1237–1250. doi: 10.1515/BC.2009.135. [DOI] [PubMed] [Google Scholar]
- 33.Verkhivker GM, Agajanian S, Hu G, Tao P. Allosteric regulation at the crossroads of new technologies: Multiscale modeling, networks, and machine learning. Front Mol. Biosci. 2020;7:136. doi: 10.3389/fmolb.2020.00136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Anindya R, et al. A Ubiquitin-binding domain in Cockayne syndrome B required for transcription-coupled nucleotide excision repair. Mol. Cell. 2010;38:637–648. doi: 10.1016/j.molcel.2010.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lake RJ, Geyko A, Hemashettar G, Zhao Y, Fan HY. UV-induced association of the CSB remodeling protein with chromatin requires ATP-dependent relief of N-terminal autorepression. Mol. Cell. 2010;37:235–246. doi: 10.1016/j.molcel.2009.10.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wang LF, et al. Regulation of the Rhp26(ERCC6/CSB) chromatin remodeler by a novel conserved leucine latch motif. Proc. Nat. Acad. Sci. USA. 2014;111:18566–18571. doi: 10.1073/pnas.1420227112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wang W, et al. Molecular basis of chromatin remodeling by Rhp26, a yeast CSB ortholog. Proc. Nat. Acad. Sci. USA. 2019;116:6120–6129. doi: 10.1073/pnas.1818163116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Xu J, et al. Cockayne syndrome B protein acts as an ATP-dependent processivity factor that helps RNA polymerase II overcome nucleosome barriers. Proc. Nat. Acad. Sci. USA. 2020;117:25486–25493. doi: 10.1073/pnas.2013379117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Xia X, Liu XY, Li T, Fang XY, Chen ZC. Structure of chromatin remodeler Swi2/Snf2 in the resting state. Nat. Struct. Mol. Biol. 2016;23:722–729. doi: 10.1038/nsmb.3259. [DOI] [PubMed] [Google Scholar]
- 40.Liu XY, Li MJ, Xia X, Li XM, Chen ZC. Mechanism of chromatin remodelling revealed by the Snf2-nucleosome structure. Nature. 2017;544:440–445. doi: 10.1038/nature22036. [DOI] [PubMed] [Google Scholar]
- 41.Yan LJ, Wang L, Tian YY, Xia X, Chen ZC. Structure and regulation of the chromatin remodeller ISWI. Nature. 2016;540:466–469. doi: 10.1038/nature20590. [DOI] [PubMed] [Google Scholar]
- 42.Li SS, Smerdon MJ. Rpb4 and Rpb9 mediate subpathways of transcription-coupled DNA repair in Saccharomyces cerevisiae. EMBO J. 2002;21:5921–5929. doi: 10.1093/emboj/cdf589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li MJ, et al. Mechanism of DNA translocation underlying chromatin remodelling by Snf2. Nature. 2019;567:409–413. doi: 10.1038/s41586-019-1029-2. [DOI] [PubMed] [Google Scholar]
- 44.Baker RW, et al. Structural insights into assembly and function of the RSC chromatin remodeling complex. Nat. Struct. Mol. Biol. 2021;28:71–80. doi: 10.1038/s41594-020-00528-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Smith CL, Peterson CL. A conserved Swi2/Snf2 ATPase motif couples ATP hydrolysis to chromatin remodeling. Mol. Cell Biol. 2005;25:5880–5892. doi: 10.1128/MCB.25.14.5880-5892.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Christiansen M, et al. Functional consequences of mutations in the conserved SF2 motifs and post-translational phosphorylation of the CSB protein. Nucleic Acids Res. 2003;31:963–973. doi: 10.1093/nar/gkg164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wilson BT, Lochan A, Stark Z, Sutton RE. Novel missense mutations in a conserved loop between ERCC6 (CSB) helicase motifs V and VI: Insights into Cockayne Syndrome. Am. J. Med. Genet. A. 2016;170:773–776. doi: 10.1002/ajmg.a.37501. [DOI] [PubMed] [Google Scholar]
- 48.Kang JY, et al. Structural basis for transcription complex disruption by the Mfd translocase. Elife. 2021;10:e62117. doi: 10.7554/eLife.62117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Yan C, et al. Transcription preinitiation complex structure and dynamics provide insight into genetic diseases. Nat. Struct. Mol. Biol. 2019;26:397–406. doi: 10.1038/s41594-019-0220-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Dodd T, et al. Polymerization and editing modes of a high-fidelity DNA polymerase are linked by a well-defined path. Nat. Commun. 2020;11:5379. doi: 10.1038/s41467-020-19165-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Yao XQ, Momin M, Hamelberg D. Elucidating allosteric communications in proteins with difference contact network analysis. J. Chem. Inf. Model. 2018;58:1325–1330. doi: 10.1021/acs.jcim.8b00250. [DOI] [PubMed] [Google Scholar]
- 52.Laugel V, et al. Mutation update for the CSB/ERCC6 and CSA/ERCC8 genes involved in Cockayne syndrome. Hum. Mutat. 2010;31:113–126. doi: 10.1002/humu.21154. [DOI] [PubMed] [Google Scholar]
- 53.Citterio E, et al. Biochemical and biological characterization of wild-type and ATPase-deficient Cockayne syndrome B repair protein. J. Biol. Chem. 1998;273:11844–11851. doi: 10.1074/jbc.273.19.11844. [DOI] [PubMed] [Google Scholar]
- 54.Kurowski MA, Bujnicki JM. GeneSilico protein structure prediction meta-server. Nucleic Acids Res. 2003;31:3305–3307. doi: 10.1093/nar/gkg557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Casanal A, Lohkamp B, Emsley P. Current developments in Coot for macromolecular model building of electron cryo-microscopy and crystallographic data. Protein Sci. 2020;29:1069–1078. doi: 10.1002/pro.3791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.McGreevy R, Teo I, Singharoy A, Schulten K. Advances in the molecular dynamics flexible fitting method for cryo-EM modeling. Methods. 2016;100:50–60. doi: 10.1016/j.ymeth.2016.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Trabuco LG, Villa E, Schreiner E, Harrison CB, Schulten K. Molecular dynamics flexible fitting: A practical guide to combine cryo-electron microscopy and X-ray crystallography. Methods. 2009;49:174–180. doi: 10.1016/j.ymeth.2009.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Dodd T, Yan C, Ivanov I. Simulation-based methods for model building and refinement in cryoelectron microscopy. J. Chem. Inf. Model. 2020;60:2470–2483. doi: 10.1021/acs.jcim.0c00087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Liebschner D, et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr. D. 2019;75:861–877. doi: 10.1107/S2059798319011471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Afonine PV, et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr. D. 2018;74:814–840. doi: 10.1107/S2059798318009324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Case DA, et al. The Amber biomolecular simulation programs. J. Comput. Chem. 2005;26:1668–1688. doi: 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
- 63.Meagher KL, Redman LT, Carlson HA. Development of polyphosphate parameters for use with the AMBER force field. J. Comput. Chem. 2003;24:1016–1025. doi: 10.1002/jcc.10262. [DOI] [PubMed] [Google Scholar]
- 64.Essmann U, et al. A smooth particle mesh Ewald method. J. Chem. Phys. 1995;103:8577–8593. [Google Scholar]
- 65.Maier JA, et al. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 2015;11:3696–3713. doi: 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Galindo-Murillo R, et al. Assessing the current state of Amber force field modifications for DNA. J. Chem. Theory Comput. 2016;12:4114–4127. doi: 10.1021/acs.jctc.6b00186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Dodd, T., Yao, X.-Q., Hamelberg, D. & Ivanov, I. Subsets of adjacent nodes (SOAN): a fast method for computing suboptimal paths in protein dynamic networks. Mol. Phys. e1893847, 10.1080/00268976.2021.1893847 (2021).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are available from the corresponding authors upon reasonable request. The models of Pol II–Rad26 in apo, ATP-bound and ADP-bound states have been deposited in the ModelArchive database with DOI accession codes: 10.5452/ma-hxt70, 10.5452/ma-bd9wm and 10.5452/ma-5liiv, respectively. Source data are provided with this paper.