Abstract
In bacteria, transcription complexes stalled on DNA represent a major source of roadblocks for the DNA replication machinery that must be removed in order to prevent damaging collisions. Gram-positive bacteria contain a transcription factor HelD that is able to remove and recycle stalled complexes, but it was not known how it performed this function. Here, using single particle cryo-electron microscopy, we have determined the structures of Bacillus subtilis RNA polymerase (RNAP) elongation and HelD complexes, enabling analysis of the conformational changes that occur in RNAP driven by HelD interaction. HelD has a 2-armed structure which penetrates deep into the primary and secondary channels of RNA polymerase. One arm removes nucleic acids from the active site, and the other induces a large conformational change in the primary channel leading to removal and recycling of the stalled polymerase, representing a novel mechanism for recycling transcription complexes in bacteria.
Subject terms: Enzyme mechanisms, Transcription, Cryoelectron microscopy
Gram-positive bacteria contain a transcription factor HelD that is able to remove and recycle stalled transcription complexes. Here the authors provide mechanistic insights into this process by determining the cryo-EM structures of the Bacillus subtilis RNA polymerase (RNAP) elongation complex and the RNAP-HelD transcription recycling complex and propose a model of HelD catalysed transcription recycling.
Introduction
In bacteria, transcription and DNA replication occur concomitantly, making potentially damaging collisions of DNA replication forks with transcription complexes inevitable1–5. Transcription is highly sensitive to DNA damage, which causes the elongation complex (EC) to pause, and multiple redundant systems have evolved to ensure rapid removal of RNAP and/or the repair of damaged DNA6–10. This reduces the chance of replication forks colliding with stalled transcription complexes whilst also serving as an efficient system for maintaining genome integrity, especially within coding regions. However, independent of DNA damage, ~15% of paused transcription complexes are inactive for a significant period of time11 and require removal through the action of factors such as the transcription recycling factor HelD12.
HelD is widely distributed in Gram-positive bacteria and has superficial similarity to superfamily 1 (SF1) DNA helicases such as UvrD/PcrA, and its catalytic activity is ATP-dependent12. Low-resolution small-angle X-ray scattering data indicate that HelD undergoes an ATP-dependent conformational change and is capable of binding to DNA13, suggesting that these are important properties of HelD in transcription complex recycling. SF1 helicases UvrD/PcrA bind on the upstream side of RNAP, and are able to reverse-translocate it away from a site of DNA damage6,14. Similarly, Rad26 (eukaryotic) and RapA (prokaryotic), Swi2/Snf2 family helicases also bind on the upstream side of RNAP and reverse-translocate stalled complexes during their reactivation15,16. Although HelD has been shown to bind on the downstream side of RNAP12, it seemed reasonable to assume that it may facilitate transcription complex recycling in a similar manner to UvrD/PcrA, RapA and Rad26, utilising ATP-dependent translocation of stalled complexes along a DNA template.
In this work, we show this is not the case and that the structure of HelD enables an unusual mode of transcription complex recycling involving a large conformational change in RNA polymerase (RNAP). Using single particle cryo-electron microscopy (cryo-EM), we determined the structures of Bacillus subtilis RNAP elongation and HelD complexes, enabling analysis of the conformational changes that occur in RNAP driven by HelD interaction. HelD represents a class of motor protein distantly related to the SF1 helicases, containing two arms that flank the helicase-like domains. One arm anchors HelD to RNAP, binding deep within the secondary channel of RNAP where it sterically clashes with nucleic acids in the active site and in doing so distorts the highly conserved bridge helix of RNAP. The other arm pushes open the primary DNA-binding channel of RNAP, causing a conformational change that releases bound DNA. Thus, HelD is a prototypical member of a widely dispersed and divergent branch of the SF1 helicase family that maintain genome integrity by removing non-productive transcription roadblocks.
Results
Structure determination of the transcription elongation complex
Bacillus subtilis is the model representative organism of the medically and industrially important Firmicutes phylum that have genomes with a low G + C content. Despite considerable effort, no structure of RNAP from the low G + C Gram positives has been determined to date. RNAP core (α2ββω) was purified (Supplementary Fig. 1), and it’s activity established using a HelD-dependent stimulation of multi-round transcription assay which gave identical results to those observed in previous studies12 (Fig. 1a). We then used cryo-EM to determine the structure of the B. subtilis RNAP transcription elongation complex (EC) at 3.36 Å, to enable an understanding of the conformational changes caused by HelD during transcription complex recycling (Fig. 1b, c, Table 1; Supplementary Fig. 2, Movie 1). RNAP in the Firmicutes is the smallest multi-subunit polymerase17,18, and given the industrial and clinical importance of this group of bacteria this complex will serve as an invaluable reference structure.
Table 1.
RNAP elongation complex (EMD-21920, PDB 6WVJ) | RNAP-HelD complex (EMD-21921, PDB 6WVK) | |
---|---|---|
Data collection and processing | ||
Molecular mass (kDa) | 344.200 | 442.060 |
Magnification | 59,524 | 59,524 |
Voltage (kV) | 300 | 300 |
Electron exposure (e−Å−2) | 52.2 | 63.6 |
Defocus range (μm) | 0.6–2.8 | 0.6–2.8 |
Pixel size (Å) | 0.84 | 0.84 |
Symmetry imposed | C1 | C1 |
Initial particle images (no.) | 1069336 | 580,468 |
Final particle images (no.) | 58,854 | 65,356 |
Relative abundance (%) | 5.5% | 11.2% |
Map resolution (Å) | 3.36 | 3.36 |
FSC threshold | 0.143 | 0.143 |
Dimensions (Length × width × hight in Å) | 150 × 112 × 123 | 156 × 154 × 139 |
Refinement | ||
Initial model used | 6WVK | 4NJC (ε) |
Model resolution (Å) | 3.38 | 3.27 |
FSC threshold | 0.5 | 0.5 |
Model composition | ||
Non-hydrogen atoms | 22367 | 28225 |
Protein residues | 2802 | 3631 |
Nucleic acid residues | 37 | — |
Ligands | ZN: 2, MG: 1 | ZN: 2, MG: 1 |
B Factors (Å2) | ||
Protein | 63.03 | 46.39 |
Nucleic | 111.00 | — |
Ligand | 86.05 | 68.02 |
r.m.s deviations | ||
Bond lengths (Å) | 0.005 | 0.005 |
Bond angles (°) | 0.692 | 0.774 |
Validation | ||
MolProbity score | 2.61 | 2.69 |
Clashscore | 12.44 | 10.95 |
Poor rotamers (%) | 5.49 | 6.70 |
Ramachandran plot | ||
Favoured (%) | 93.14 | 91.42 |
Allowed (%) | 6.75 | 8.39 |
Disallowed (%) | 0.11 | 0.19 |
Despite nuclease treatment of the cell lysate, upon 3D reconstruction of the core structure, nucleic acid was clearly visible indicating that throughout the purification process the core enzyme remained tightly bound to nucleic acids which protected them from nuclease treatment (Supplementary Table 1). Therefore, the structure presented represents an elongation complex (EC) with non-specified nucleic acid sequence (i.e., the reconstructed density shows well-defined ribose-phosphate groups with an average of random base sequences). Typically, the subunit composition of core bacterial RNAP is represented as α2ββʹω, but in previous work we identified an additional small subunit called ε was present in B. subtilis RNAP in addition to ω19. However, holoenzyme preparations from the strain (LK637, Δ δ; Methods) used in this study lacked ε (Supplementary Fig. 1a) and so the core structure is presented lacking this subunit (although ε is present in the structure of RNAP core in complex with HelD, see below). Previous studies have also shown deletion of ε causes no detectable phenotype or change in gene expression profiles, and RNAP core preparations lacking ε have indistinguishable activity compared to those that do contain it19,20. Comparison of EC and RNAP-HelD complexes showed no significant structural differences in the region where ε binds and so in Fig. 1b the ε binding site is indicated as a dotted circle, and in Supplementary Fig. 4a ε is shown as it is clear that RNAP isolated from B. subtilis is a heterogeneous mixture of core (α2ββ’ω) ± δ, ε, and HelD, in addition to multiple different σ factors21.
The EC is 150 Å × 112 Å × 123 Å (L × W × H, Table 1), and is broadly comparable to the dimensions of core/elongation complexes from other species (157 × 153 × 136 Å; E. coli 6ALF, 183 × 107 × 115 Å; Mycobacterium smegmatis 6F6W, and 170.1 × 110.1 × 127.8 Å; Thermus thermophilus 2O5I)22–24, although it appears to be more slender and elongated than the roughly globular E. coli, and shorter than the M. smegmatis and T. thermophilus enzymes due to the lack of insertion sequences (Supplementary Fig. 4).
Due to the high level of sequence conservation amongst RNAPs, the overall structure of the EC was similar to those from other organisms and largely consistent with homology models used in previous work on structure/function studies with B. subtilis RNAP25–28. However, modelling had been unable to establish the structure of the ~180 amino acid βln5 insertion in the β2 lobe. The β2 lobe is one of the least well-conserved regions of bacterial RNAPs, and is a hot-spot for the presence of lineage-specific insertions17 (Supplementary Fig. 4a). The only other region that was significantly different to other bacterial RNAPs was a 10 amino acid loop from β E696-G705 that protrudes from the bottom of the enzyme (Fig. 1c, Supplementary Fig. 4a). Refinement and building sequence into the resulting density indicated the B. subtilis β2 lobe is a continuous globular structure and that the βln5 insertion increases the size asymmetry between the β lobes compared to other Gram positive RNAPs such as those from M. smegmatis and M. tuberculosis23,29 (Fig. 1b, Supplementary Fig. 4). Searches using DALI30 found no structural matches to the βln5 insertion leaving its function similarly enigmatic to those of most other lineage-specific insertions.
The absence of lineage-specific insertions perhaps helps to account for the additional subunits found in the Firmicutes such as δ and ε, and this and the accompanying paper by Pei et al.31 identify the location of these subunits. This suggestion is potentially supported by examination of the T. thermophilus structure around its βln10 and βln12 insertions that localise to a region very close to the ε binding site. Superimposition of ε into the T. thermophilus EC structure shows steric clashes between ε and the insertions (Supplementary Fig. 4c) raising the possibility that they serve similar functions. The location of ε also corresponds to that of a domain of archaeal and eukaryotic Rpo3/RPB3 subunits associated with enzyme stability (boxed insert, Supplementary Fig. 4a) and it is interesting to note that both B. subtilis (able to grow up to ~52 °C) and the thermophile T. thermophilus (up to ~79 °C) have structural elements/subunits located in this area that links the α2, β, and β’ subunits whereas the mesophilic E. coli and M. smegmatis do not.
The ω subunit in B. subtilis is 67 amino acids vs the 80 amino acid length of E. coli ω. The main structural difference appears to be in the lack of a C-terminal α-helix which is prominent in E. coli RNAP, but lacking in B. subtilis and Mycobacterial structures. As with all other RNAP core and EC structures solved to date, the C-terminal domains of the α subunits were not visible due to the flexible linker connecting the N- and C-terminal domains.
Detailed examination of the elongation complex also revealed important features associated with mechanistic aspects of RNA synthesis. The density for fork-loop 2 (FL2) is well defined, consistent with its role in DNA strand separation on the downstream edge of the transcription bubble. The EC active site is similar in structure to that reported previously for the T. thermophilus and E. coli ECs22,24 (2O5I, and 6ALF, respectively) and is in a post-translocation conformation with the 3′ end of the RNA transcript adjacent to the +1 site, with an unbent bridge-helix (BH) and the trigger-loop (TL) in the open conformation (Fig. 1d). This conformation is consistent with an elongation complex primed to receive an incoming NTP via the secondary channel.
FL2 residue β R498 interacts with the ribose and phosphate moieties of the final base in the non-template DNA strand prior to strand separation and formation of the transcription bubble and likely acts to facilitate formation of the downstream edge of the transcription bubble (Fig. 1d). The template base in the +1 site is held in position for base-pairing with the incoming substrate NTP through interaction with the highly conserved T794 and A795 of the BH, and may also be stabilised through stacking with the base in the −1 position (Fig. 1d). β R496 of FL2 interacts with the phosphodiester backbone of RNA bases 4 and 5 of the new transcript (Fig. 1d). In addition, residues Q469, P520, E521, N524, I528, K924 and K932 of the rifampicin binding pocket of the β subunit form numerous interactions with the newly formed transcript (RNA residues 1–5) as has been previously reported32,33. The salt bridge between β R800 and β’ D245 that closes the primary channel off from the RNA exit channel34,35 is clearly visible confirming that the elements on the upstream side of the transcription bubble, the rudder and lid, that are responsible for facilitating reannealing of the template and non-template strands and guiding RNA into the exit channel are in positions consistent with these assigned roles (Fig. 1e).
Electron density for RNA beyond the 8th nucleotide is poor, preventing further mapping of the transcript up to and through the exit channel. Likewise, density for DNA on the upstream side is poorly defined consistent with conformational flexibility in this region of RNAP26. Structural modelling, and comparison with ECs from other organisms22,24, is consistent with there being sufficient space for a transcription bubble comprising a 9 bp template DNA-RNA hybrid prior to upstream DNA strand re-annealment and entry of the transcript into the exit channel guided by hydrophobic interaction with conserved β’ lid residues V242 and L244 (dotted box, Fig. 1e). The 9th RNA-DNA base pair has likely been degraded by nuclease activity during preparation of the complex. Overall, this structure serves as a valuable resource for structure-function studies with RNAP from the Firmicutes as well as being a reference structure to enable full understanding of the conformational changes involved in transcription complex recycling induced upon binding to HelD (below).
The structure of an RNAP-HelD transcription recycling complex
RNAP-HelD complexes were isolated from a culture of B. subtilis carrying a deletion of the rpoE gene that encodes the δ subunit, shown previously to act synergistically with HelD12 (Supplementary Fig. 1). HelD itself is required for transcription complex recycling, and can perform this function independently of δ12 which is absent in many organisms that contain genes encoding HelD proteins (e.g. Clostridia). The purified complex stimulated transcription ~2-fold, similar to that observed with in vitro assembled complexes12, establishing its biological activity (Fig. 1a).
We determined the structure of the RNAP–HelD complex using single particle cryo-electron microscopy (cryo-EM) to 3.36 Å resolution (Supplementary Fig. 3), followed by atomic modelling (Fig. 2a, Table 1; Supplementary Movie 2). The resulting structure revealed that HelD, which is located on the downstream side of RNAP, has two arm domains that penetrate deep into the primary and secondary channels of RNAP (clamp arm; CA, and secondary channel arm; SCA, respectively, Fig. 2a–c), which account for the strong HelD-RNAP interaction12,13,36. The native RNAP-HelD preparation also contained the RNAP ε subunit19 and showed it bound on the downstream side of RNAP in a concave space between the two α, β, and β’ subunits (Fig. 2a; see Supplementary Fig. 4).
HelD itself has an unusual 4-domain structure (Fig. 3a, b). The first 203 amino acids (aa) form the secondary channel arm (SCA), which is joined to a super-family 1 (SF1) 1 A domain (aa 204–291 and 539–610). In SF1 helicases, domain 1 A is split by the insertion of a 1B domain associated with helicase function37, but in HelD it is split by the clamp arm (CA; aa 292–538). Residues 610–774 form a continuous SF1 2 A domain, which is usually split by a 2B insertion in SF1 helicases, that represents the ‘head’ of HelD. The overall appearance of the protein is that of a torso and head (domains 1 A and 2 A, respectively) flanked by a pair of muscular arms (SCA and CA), giving it a rather thuggish appearance (Fig. 3b, c).
Although HelD is widely distributed amongst Gram-positive bacteria, the distinctive arm domains represent the regions of lowest sequence conservation despite being responsible for the majority of interactions with RNAP as well as for its transcription recycling activity13 (Fig. 3a, Supplementary Figs. 5–7). It is also clear that there are at least two distinct classes of HelD (Classes I and II, Supplementary Figs. 5, 6, Supplementary Table 2); Class I is represented by the B. subtilis protein and is present in the low G + C Gram positives, whereas Class II is represented by the M. smegmatis protein (see accompanying paper by Kouba et al.38), and is present in the high G + C Gram-positives. Some organisms contain multiple copies of HelD (e.g. Lactobacillus plantarum, Class I; Nonomuraea wenchangensis, Class II; Supplementary Fig. 5), and even within the same organism, sequence conservation between the copies is relatively low in the SCA and CA domains (Supplementary Fig. 8). Previous studies showed that HelD in which the SCA (aa 1–203) had been deleted was still capable of binding RNAP, hydrolysing ATP, and binding DNA, but not transcription recycling13. These observations suggest that the function of the arm domains is centred around mechanical work rather than the formation of highly-conserved functionally-significant interprotein interactions.
Despite the clear separation into two classes, sequence alignment allowed the identification of conserved motifs common to all HelD proteins (Fig. 3a; Supplementary Table 2). The transcription recycling function of HelD is dependent on ATP hydrolysis12, with ATP-binding motifs located in the 1 A (torso) domain (cyan residues, Fig. 3a, b). Alteration of the absolutely conserved K239 to A in the Walker A motif resulted in the complete loss of transcription recycling and ATPase activity (Fig. 1a). The remaining conserved motifs form a network of interactions that are mainly centred in the region between the SCA and 1 A domains, with the absolutely conserved residue W137 in a hydrophobic pocket between them (purple residues, Fig. 3a, c). These extensive interactions anchor the SCA to the 1 A domain, helping to couple ATP hydrolysis to mechanical movement of the CA (see below).
HelD causes major conformational changes in RNAP
Comparison of the core elements of the EC and RNAP-HelD structures (α2ββ’ω subunits) shows HelD causes a major conformational change mainly due to the opening of the β’ clamp by the CA, with very little change elsewhere (Fig. 4a, b; Supplementary Movie 3, and see below). PISA39 was used to analyse protein-protein contacts in the RNAP-HelD and elongation complexes (Supplementary Table 3). Complexation with HelD reduces the contact area between RNAP subunits β and β’ by over 6% while other contact areas remain similar, consistent with the extensive conformational change caused to the EC upon binding of HelD.
As part of transcription complex recycling, the elongating RNA as well as the DNA template needs to dissociate from RNAP. RNA passes through the exit channel on the upstream side of RNAP. There was no major conformational change to elements at the entry of the exit channel other than those that are translocated as part of the opening of the β’ clamp (Fig. 4). The translocation of the β’ clamp results in breaking of the conserved salt bridge between β R800 and β’ D245 that is important in guiding RNA into the exit channel34,35, increasing the width of the aperture from 11 to 20 Å (αc–αc; Fig. 4c, Supplementary Movie 3). This separation, along with widening of the primary channel, facilitates RNA exit from the complex.
The most dramatic effect of HelD on RNAP is the widening of the primary channel from 21 to 47 Å between β2 lobe P242 and β’ clamp helix N283, that would cause a loss of contact with DNA in the primary channel, enabling recycling of RNAP (Fig. 4b, c). This is facilitated by the proximity of the CA to the SW5 region of the β’ clamp, that acts as a hinge during clamp movement18,40 (Supplementary Fig. 9, Movie 3).
Detailed examination of the SCA and CA interactions with RNAP enabled us to define the molecular events that occur during transcription complex recycling. Images of the active site region in the EC (Fig. 5a), RNAP-HelD complex (Fig. 5b) and an overlay of the two views (Fig. 5c) shows how HelD SCA insertion via the secondary channel causes distortion of the bridge-helix and trigger-loop as well as steric clashes with nucleic acids. The prokaryotic Gre factors, DksA, and eukaryotic TFIIS are known to bind in the secondary channel of RNAP via a pair of anti-parallel α helices/hairpin loop41–43. The acidic tips of these proteins reside close to, but on the downstream side of the catalytic Mg2+. The SCA of HelD bears superficial similarity to GreB/DskA, but is longer and the tip extends past the catalytic Mg2+ (Supplementary Fig. 10). The acidic tip (D56 and D57) will electrostatically repel the transcript upon penetration of the SCA into the active site, with the SCA causing significant steric clashes with the transcript and template DNA strand when fully inserted (Fig. 5c). The bridge-helix and trigger-loop, are dynamic structures that play a key role in the transcription cycle44; the entry of the SCA into the secondary channel causes partial folding of the open trigger-loop conformation observed in the EC structure and a major distortion of the bridge-helix that would sterically clash with the template DNA in the active site (Fig. 5a–c; Supplementary Movie 3). Thus, the SCA tip itself, in combination with the distortion its insertion causes in the bridge-helix, will result in physical displacement of template DNA and RNA from the active site of RNAP, facilitated by electrostatic repulsion between the acidic SCA tip residues and the transcript.
The fully inserted tip of the SCA is in close proximity to the absolutely conserved active-site β’447NADFDGD453, forming a network of interactions around this motif, but does not directly interact with either the catalytic Mg2+ or the Asp residues that coordinate it (Fig. 5d, Supplementary Table 4). Thus, upon dissociation of HelD, the core RNAP would be competent for re-use in transcription, as seen in the transcription recycling assays in Fig. 1a. Finally, insertion of the SCA into the secondary channel would block NTP entry into the active site.
The salt-bridge and H-bond contacts the CA makes with the β’ clamp are listed in Supplementary Table 4, but the bulk of interactions are made by hydrophobic residues with little sequence conservation between even closely-related genera (Supplementary Table 5, Supplementary Fig. 6a). This region is the location of an insertion that spans across the primary channel towards the active site in Class II HelD proteins (see accompanying paper by Kouba et al.38; Supplementary Fig. 6b). The tip of this insertion has a similar location to the tip of the SCA of B. subtilis Class I HelD and is also acidic, suggesting electrostatic repulsion of nucleic acids is also important in the activity of Class II HelDs. In our Class I HelD structure, the site of this insertion is close to an area of density in the cryo-EM reconstruction that at low threshold values could be consistent with the presence of nucleic acid (Supplementary Fig. 11). Examination of the surface charge of HelD revealed a region of high overall positive charge on the inside of the CA. Superposition with the nucleic acids from the EC show that this positively-charged patch is in a position where it could interact with the downstream dsDNA (Supplementary Fig. 11a), consistent with nucleic acid-binding data13. It is also possible that this patch may be important for interaction with the unstructured negatively-charged C-terminal domain of δ which acts synergistically with HelD during transcription complex recycling12 (see accompanying paper by Pei et al.31). The end of the CA forms a relatively flat ~320 Å2 surface that acts as a platform to push up against the β’ clamp, resulting in loss of contact with the DNA bound in the EC (Figs. 2a–c, 4, Supplementary Movie 3). Therefore, the purpose of the CA appears to involve the opening of the β’ clamp through brute force rather than by the formation of a specific network of conserved interactions.
Movement of the clamp arm of HelD drives conformational change in RNAP
Closer examination of the RNAP–HelD complex using 3D variability analysis (3DVA)45 allowed identification of regions of conformational flexibility that underpin the dynamic processes of HelD activity in transcription recycling. Overall, the region behind SW5 towards the α dimer, including the secondary channel and SCA of HelD, showed little or no
conformational variability, but the primary channel encompassing elements of the β1 and 2 lobes and the β’ clamp did (Fig. 6a, Supplementary Movie 4).
The 3D variability analysis indicates that HelD causes the β’ clamp to open and twist so that the downstream side of RNAP opens slightly (curved cyan arrow, Fig. 6a). At the same time, the β2 lobe moves up (straight cyan arrow, Fig. 6a) along with a slight twisting of the β1 lobe and β flap (Fig. 6a, Supplementary Movie 4). With respect to HelD, there was no change in the SCA tip adjacent to the RNAP active site, but there was lateral movement of the portion located outside the secondary channel towards the β’ jaw (Fig. 6b). This resulted in little, if any, conformational change in the hydrophobic ‘cage’ surrounding the conserved W137 residue. Accordingly, there was relatively little change in the torso (1 A) domain and ATP-binding site, but the head (2 A) domain moved away from the downstream side of RNAP (curved and straight orange arrows Fig. 6a, b, respectively). The CA of HelD rises up and out slightly, causing the upward twist on the downstream side of the β’ clamp (orange arrow, Fig. 6a; Supplementary Movie 4). Therefore, the results of the 3D variability analysis are consistent with the SCA acting as a wedge that permits conformational change through movement of the CA. The CA is located in a position equivalent to an SF1 helicase 1B domain that utilises ATP hydrolysis to undergo conformational changes required for helicase/translocase activity46,47 and ATP binding/hydrolysis is required for release of HelD (see accompanying papers by Kouba et al. and Pei et al.31,38), most likely due to movement of the CA arm, consistent with the observed conformational flexibility in this region.
In our structure, and those of the accompanying papers by Pei et al. and Kouba et al., no density for any NTP could be detected in the ATP binding site, even on addition of ATP or non-hydrolysable analogues. In order to bind ATP, domains 1 A (torso) and 2 A (head) need to rotationally open as observed for SF1 helicases48. Our 3DVA suggests this is most likely via movement of the 2 A (head) domain (Fig. 6a, b). However, the sequence from F183-G190 linking the SCA to the 1 A (torso) domain sterically blocks access to the ATP binding site and this ‘gate’ region will also need to open to allow ATP binding and subsequent ADP release (Supplementary Fig. 12). There is no intramolecular bonding between residues T185-I189 and either the SCA or IA (torso) domain, and this may provide the necessary flexibility for gate opening and closing. Given that the ATP binding site was not accessible in all of the structures that are forcing open the DNA binding clamp of RNAP, gate opening may be an event that occurs on conformational change of the CA during nucleic acid release and RNAP recycling.
Discussion
These results provide the foundations for a model of HelD catalysed transcription recycling (Fig. 6c). HelD is present at low intracellular levels compared to RNAP12,49 and this may help restrict it to targeting ECs that have entered a long-term pause11. The SCA penetrates the secondary channel (Punch), displacing the transcript and template DNA, blocking the catalytic Mg2+ and NTP entry. Through conserved inter-domain interactions with the torso, the SCA anchors HelD on RNAP. HelD then forces the primary channel open through interaction of the CA with the β’ clamp (Uppercut). The action of both arms serves to displace nucleic acids from the active site, through separation of the β-flap and β’-clamp by ~10 Å, and opening the primary channel by ~36 Å. Dissociation of HelD follows conformational change and ATP binding/hydrolysis (see accompanying papers by Kouba et al. and Pei et al.31,38) consistent with SF1 helicase dynamics that are transmitted to the CA46,47, closing of the primary channel, and recycling of RNAP. The conservation of HelD across the Gram positive bacteria indicates this previously unknown mechanism for transcription complex recycling is of considerable importance. Determination of the precise molecular details by which highly diverged structures perform this role represents an exciting new avenue of research.
Methods
Strains
E. coli BL21 (DE3) was used for overproduction of core B. subtilis RNAP, HelD and σA. RNAP holoenzyme (HE) and HelD complexes were purified from B. subtilis LK63750 carrying a deletion to the rpoE gene encoding δ, and a 3′ his tag on the rpoC gene to facilitate purification.
Plasmids
Recombinant B. subtilis RNAP core (α2ββ’ω) has previously been overproduced using a two plasmid system20 although the use of two different plasmids could result in poor yields if one plasmid was present at lower levels than the other during overproduction. To make the process more efficient a single plasmid system was constructed. pNG21920 containing rpoA, rpoB, and rpoC was linearised with NotI, purified and used in a Gibson assembly reaction with a 356 bp gBlock® (IDT, Singapore) construct comprising 40 bp 5′ and 3′ homology to the linearised pNG219 DNA flanking an additional phage T7 promoter, NcoI and XbaI restriction sites, a ribosome binding site and the rpoZ gene. The resulting plasmid was named pNG1256 (sequence and plasmid DNA available through Addgene, ID 149710).
The HelD K239A mutant was constructed by PCR mutagenesis51 of the pHelD-His6 plasmid12. The PCR contained 1X NEB Q5 reaction buffer (B9027), 200 µM dNTPs, 0.5 µM forward primer 5′ GCGGGGCAACATCGGCCGCGCTTCAG 3′, 0.5 µM reverse primer 5′ ATGTTGCCCCGCTGCCAGCCGCTCCC 3′, 10 ng pHelD-His, 0.25 µl Q5 DNA polymerase (NEB) and sterile H2O up to 25 µl total reaction volume. Thermocycling conditions were initiated at 98 °C for 3 min, followed by denaturation at 98 °C for 10 s, annealing at 69 °C for 30 s, extension at 72 °C for 4.5 min for 12 cycles, then a single cycle of annealing at 59 °C for 30 s and a final extension at 72 °C for 30 min. Plasmid sequence (pNG1304) was confirmed by Sanger sequencing (AGRF). Overproduction and purification of the protein was the same as for the native HelD (below).
Plasmid template for the MGA transcription assay consists of a construct containing three strong consensus promoters, Thermus VV1-2/D252, pGP31 from Bacillus phage SPO1 and LacUV5 that directed transcription of an array containing 12 direct repeats of the MGA sequence (5′-GGATCCCGACTGGCGAGAGCCAGGTAACGAATGGATCCTAAAAAC-3′) followed by an E. coli tRNA-trp terminator. This construct was synthesised by GenScript and cloned into the EcoRV site of pUC57-Simple to give pNG1299, (sequence and plasmid available through Addgene, ID 149709). pNG1299 was propagated in E. coli NEB® Stable (C3040H, New England Biolabs Inc) to avoid recombination of the MGA array. Supercoiled plasmid template was prepared as described by53 using the reagents from an ISOLATE II Plasmid Mini Kit (BIO-52057, Bioline).
B. subtilis EC RNAP purification
A seed culture (40 ml) of E. coli BL21(DE3) transformed with pNG1256 was grown in LB supplemented with 100 μg/ml ampicillin at 37 °C to an A600 of 0.5 and was used to inoculate 4 L of auto-induction medium54 supplemented with 100 μg/ml ampicillin. The culture was grown at 30 °C for 30 h, cells harvested by centrifugation at 4,000 × g, 4 °C, 20 min, and washed pellets stored at −80 °C. The frozen cell pellet was resuspended in 100 ml HisA buffer (20 mM KH2PO4 pH7.8, 500 mM NaCl, 20 mM imidazole) supplemented with EDTA-free protease inhibitor cocktail (1 × concentration S8830, Sigma-Aldrich) and 100 μl of 4 mg/ml DNaseI (DN25, Sigma-Aldrich) at 4 °C. Cells were lysed by repeated passage through an Avestin C5 homogeniser at ~20 kPa, and the lysate clarified by centrifugation at 16,000 × g, 4 °C, 20 min.
The resulting supernatant was loaded onto a 5 ml HisTrap FF column pre-equilibrated with HisA buffer. The column was washed with 4% HisB (HisA + 500 mM imidazole), and RNAP eluted with 50% HisB. Following dialysis in QA buffer (20 mM Tris-HCl pH 7.8, 150 mM NaCl, 10 mM MgCl2, 1 mM DTT) the sample was loaded onto a 1 ml MonoQ column pre-equilibrated in TrisA. A gradient of 0–50% TrisA supplemented with 1 M NaCl over 10 ml was used to elute proteins with RNAP core eluting as a peak at ~0.35 M NaCl.
Purified RNAP was dialysed into QA buffer (20 mM Tris-HCl pH 7.8, 150 mM NaCl, 10 mM MgCl2, 1 mM DTT), concentrated with an Amicon® Ultra-15 Centrifugal Filter with a 3 KDa NMWCO (UFC900324, Merck Millipore) and small aliquots snap-frozen in N2(l) and stored at −80 °C. Two different concentrations of purified RNAP were prepared at 23.3 mg/ml and 8.2 mg/ml for storage. Protein and nucleic acid content was determined with a Qubit fluorometer using the protein and HS DNA assays (Supplementary Table 1).
Bacillus subtilis σA purification
σA was overproduced and purified25 with the following modifications; after HisTrap FF column purification, fractions containing σA were dialysed overnight into 50 mM NaH2PO4 pH 8.0, 150 mM NaCl. Dialysate was loaded onto a Mono Q 5/50 GL column (17516601, GE Life Sciences) at a rate 0.5 ml/min, and washed with 50 mM NaH2PO4 pH 8.0, 150 mM NaCl for 10 ml at a flow rate of 0.5 ml/min. σA was eluted with a gradient of 150–500 mM NaCl over 20 ml followed by a step to 1 M NaCl for 3 ml. Fractions containing σA were dialysed into 50 mM NaH2PO4 pH 8.0, 150 mM NaCl and concentrated with an Amicon® Ultra-15 Centrifugal Filter with a 3 kDa NMWCO. Concentrated samples were snap-frozen in N2 (l) and stored at −80 °C before use.
B. subtilis HelD purification
HelD and HelD K239A were overproduced and purified12 with the following modifications; after HisTrap FF column purification fractions containing HelD were dialysed overnight at 4 °C into (20 mM Tris-HCl pH7.8, 10 mM MgCl2, 150 mM NaCl, 5 mM DTT) and applied to a 1 ml HiTrap Heparin HP column (17040601, GE Lifesciences) at a flow rate of 1 ml/min. The column was washed with 5 ml 20 mM Tris-HCl pH7.8, 10 mM MgCl2, 150 mM NaCl) at 1 ml/min and eluted with a gradient of 150 mM-1000 mM NaCl over 20 ml. Fractions containing HelD were dialysed overnight into QA buffer and concentrated using an Amicon® Ultra-15 Centrifugal Filter with a 3 kDa NMWCO (UFC900324, Merck Millipore). The concentrated sample was snap-frozen in N2 (l).
B. subtilis HelD-RNAP complex
B. subtilis LK63750 (Δδ) was grown at 45 °C in LB in baffled flasks to maximise aeration to late exponential phase, cells pelleted by centrifugation, and washed pellets stored at −80 °C. Frozen pellets from 8 L culture were resuspended in 50 ml HisA buffer (above) supplemented with EDTA-free protease inhibitor cocktail (1.8× concentration S8830, Sigma-Aldrich) at 4 °C, and lysed by multiple passage through an Avestin C5 homogeniser at ~25 kPa, and the lysate clarified by centrifugation at 16,000 × g, 4 °C, 20 min.
RNAP was purified using a 5 ml HisTrap FF as above and the eluted sample dialysed overnight against QA buffer (20 mM Tris-HCl pH 7.8, 150 mM, 10 mM MgCl2, 1 mM EDTA, 5 mM DTT) at 4 °C. Following clarification by centrifugation, the overnight dialysate was loaded onto a 1 ml MonoQ column pre-equilibrated in QA buffer. RNAP was eluted using a 13.5 ml 150–500 mM NaCl gradient in QA buffer. RNAP containing fractions were pooled and dialysed overnight at 4 °C against QA buffer prior to loading onto a 1 ml HiTrap Heparin HP column pre-equilibrated in QA buffer. RNAP was eluted as two peaks using a 150–1000 mM NaCl gradient over 15 ml. The first peak corresponded to Holoenzyme (~650 mM NaCl), and the second, minor peak, to the HelD complex (~950 mM NaCl).
Holoenzyme and HelD complex fractions were dialysed separately against buffer QA overnight at 4 °C, concentrated and small aliquots snap-frozen in N2(l) and stored at −80 °C. Holoenzyme fractions were 7.46 mg/ml and HelD complex fractions 2.57 mg/ml.
Transcription assays
An in vitro transcription assay adapted from55 was used to assay the activities of RNAP and RNAP-HelD complexes. Briefly, 20 μl transcription reactions containing 80 nM of core RNAP, 240 nM of σA and 0–2560 nM of HelD or HelD K239A in transcription buffer (40 mM Tris-HCl pH 7.5, 50 mM KCl, 10 mM MgCl2 0.02% (v/v) Triton-X100, 8 mM DTT) were assembled in a well of a black, half area, flat-bottomed, non-binding 96 well microplate (CLS3686, Corning) and were incubated for 15 min at 37 °C with shaking. To initiate the transcription reaction 20 μl of start solution containing 1 mM rNTPs (N0466S, New England Biolabs Inc) and 10 nM pNG1299, in transcription buffer was added. The plate was sealed with a clear adhesive plate seal (WHA-7704-0001, Whatman) and incubated with shaking at 37 °C for 15 min. Reactions were stopped by the addition of 40 μl of ice-cold stop solution (10 mM Tris-HCl pH8.0, 1 mM EDTA, 144 μM malachite green oxalate salt M6880, Sigma-Aldrich) and developed on ice for 5 min. The plate was read using a Pherastar FS (BMG Labtech) using a 610 nm excitation, 675 nm emission optical module. Percentage transcription was calculated relative to a 1:0 RNAP:HelD reaction. Each experiment was performed three times in technical duplicate.
ATPase assays
ATPase activity was determined by malachite green assay56. Malachite green reagent was prepared immediately prior to use by mixing 0.045% (w/v) malachite green with 5% (w/v) ammonium molybdate in 4 M H2SO4 in a 3:1 (v/v) ratio, and passing through a 0.45 µm filter. Reactions were carried out based on methods described in work13 with the following modifications. Reaction mixtures contained 100 pmol of protein, 10 mM ATP, 20 mM Tris-HCl pH 7.8, 10 mM MgCl2, and 150 mM NaCl in a final volume of 100 µl. All reaction components, with the exception of ATP, were assembled and incubated at 25 °C for 5 min to equilibrate. Following incubation, ATP was added to each mixture and the reaction was allowed to proceed for 30 min at 25 °C. Upon completion, 800 µl of malachite green reagent was added to each reaction, incubated for 1 min, followed by addition of 100 µl of 34% (w/v) sodium citrate. The colorimetric change was allowed to develop for 20 min at 25 °C, following which 250 µl of each reaction was transferred to a microplate and absorbance read at 660 nm on a Pherastar FS (BMG Labtech) plate reader. Phosphate released was quantified by comparison to a standard curve constructed from KH2PO4. Reactions were performed in technical duplicates and results averaged across 3 independent assay replicates.
Preparation of cryo-EM grids and cryo-electron microscopy
Between 2 and 2.5 μl of 2.57 mg/ml RNAP-HelD complex or 3.0 mg/ml EC diluted in QA buffer were deposited onto glow-discharged UltrAuFoil 1.2/1.3 or Quantifoil 1.2/1.3 cryo-electron microscopy grids and blotted for 5 s before plunge freezing into liquid ethane using a Mark IV Vitrobot (FEI). Data were collected on a Titan Krios (Thermo Fisher) electron microscope operated at 300 kV and equipped with a Gatan BioQuantum LS 967 energy filter and Gatan K2 Summit detector, operated in unfiltered mode. Data were collected in electron counting mode at a pixel size of 0.84 Å/pixel and a calibrated sample-to-pixel magnification of 59524 x, (microscope user interface listed magnification, 165000 × EFTEM). For the HelD complex, movies were collected as a series of 60 frames with a total accumulated dose of 63.6 e−/Å2. For the EC, movies were collected as a series of 40 frames and a total accumulated dose of 52.2 e−/Å2. For both datasets, data were collected using automated data collection in EPU, with a defocus range of 0.6–2.8 μm. Approximately 60% of the data for both the RNAP HelD complex and the EC were collected at 20° stage tilt to compensate for the effects of preferred particle orientation.
Image processing of the HelD complex
A total of 4331 images (1859 at 0° tilt, 2472 at 20° tilt) were collected for the RNAP-HelD complex. All image processing was performed in RELION 3.157 unless otherwise indicated. Movies were aligned and dose-weighted using MotionCor258 before contrast-transfer function (CTF) estimation was performed on the motion-corrected images using GCTF59. Particle picking was performed in CrYOLO60 using a pre-trained general model which had been refined using a subset of manually picked training data. Particle coordinates were contrast-inverted, normalised and extracted in RELION. Following particle picking, a total of 580,468 particles were extracted and subjected to several rounds of 2D classification for initial cleaning of the particle data, which resulted in a subset of 379,179 particles. Subsequent rounds of 3D classification isolated a smaller subset of 65,356 particles. Due to moderate preferred orientation of the specimen, a large portion of the particles were excluded to minimise resolution anisotropy in the final reconstruction. These particles were then subjected to iterative Bayesian polishing and CTF refinement with higher-order aberration correction in RELION until further processing ceased to yield an increase in resolution. Following refinement of the per-particle motion and CTF parameters, post-processing of the final reconstruction yielded a resolution of 3.36 Å as determined by the Gold-standard Fourier-Shell Correlation (FSC = 0.143) criterion in RELION61. The final refined map was then subjected to density modification and automated model-based based sharpening in Phenix62,63 (see Supplementary Fig. 3).
Following reconstruction in RELION, the final subset of 65,356 particles was subjected to 3D variability analysis in Cryosparc45. 3D variability analysis was performed solving for 3 conformational modes, and the results visualised in ChimeraX64.
Image processing of the elongation complex
A total of 5185 images (2955 at 0° tilt, 2230 at 20° tilt) were collected for the EC. All processing was performed as described for the HelD complex above unless otherwise indicated. Following particle picking, a total of 1,069,336 particles were extracted in RELION and subjected to several rounds of 2D classification to obtain a smaller subset of 355,795 particles. This subset was subjected to several rounds of 3D classification to remove incomplete particles and over-represented orientations to isolate a final subset of 58,854 particles. These particles were then subjected to Bayesian polishing, CTF-refinement and postprocessing as for the HelD complex above. Following postprocessing, the final reconstruction yielded a resolution of 3.36 Å as determined by the GSFSC criterion in RELION. As for the RNAP-HelD complex, the final refined map was then subjected to density modification and automated model-based based sharpening in Phenix62,63 (see Supplementary Fig. 2).
Structure building and refinement
Initial model building for the core RNAP subunits commenced using a homology model generated previously19, which was fitted into the density of the HelD complex using rigid body fitting in CHIMERA followed by molecular-dynamics flexible fitting in NAMD65. The model was subject to cycles of manual model building in COOT66 followed by refinement by phenix.real_space_refine. De novo atomic modelling for the HelD subunit was performed by initial modelling of HelD in Phenix using phenix.map_to_model62, followed by model building in COOT66.
The ε subunit was modelled based on homology with a known structure from Geobacillus stearothermophilus (PDB ID: 4NJC)67. Refinement of the elongation complex commenced using the RNAP-HelD model, which was placed into density using phenix.dock_in_map, followed by cycles of model building in COOT and refinement in phenix.real_space_refine. Density-based sequence was built for nucleic acids. All models were further refined in ISOLDE68 and phenix.real_space_refine until deemed final.
Sequence analysis
B. subtilis HelD sequence (UniProtKB - O32215) was used to identify similar proteins using the NCBI CDART search programme69. Sequences from diverse organisms were selected from the 24426 hits defined as HelD-related helicases for alignment using ClustalX 2.170. Sequences were also selected from the DUF4968 domain-containing protein (74 hits) and the multispecies: DUF4968 domain-containing protein (37 hits) categories for characterisation of their HelD-like sequences. Phylogenetic trees were constructed using NCBI COBALT71, and sequence conservation mapped to structure using ConSurf72.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We would like to thank Profs Nick Dixon and Rick Lewis for helpful comments on the manuscript, and members of our respective laboratories and those of Dr Libor Krasny and Prof Markus Wahl for discussions. P.J.L. acknowledges the assistance of Sarah Pichereau during the purification of RNAP-HelD complexes. This work was supported by grants from the Priority Research Centre for Drug Discovery, University of Newcastle (P.J.L), NUW Alliance (G1801287 to P.J.L., A.J.O and G.T.), and NHMRC (GNT1184012 to G.T.). T.N., M.M., and C.J.D. were funded through PhD scholarships from the Australian Government.
Source data
Author contributions
P.J.L., M.M. and C.J.D. cloned genes, produced proteins/complexes and performed experiments. T.N. prepared and imaged cryo-EM samples with S.H.J.B. and J.B., T.N. processed cryo-EM data with S.H.J.B., P.J.L. and G.T., and built atomic models with A.J.O. and G.T. All authors contributed to the analysis of the data and the interpretation of the results. P.J.L. wrote the manuscript with contributions from the other authors. P.J.L. and G.T. supervised work in their respective groups. P.J.L. conceived and coordinated the project.
Data availability
CryoEM maps have been deposited in the Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb/) under accession codes EMD-21921 (RNAP-HelD) and EMD-21920 (RNAP elongation complex). Structure coordinates have been deposited in the RCSB Protein Data Bank (https://www.rcsb.org/) with accession codes 6WVK (RNAP-HelD) and 6WVJ (RNAP elongation complex). Plasmids pNG1256, pNG1299 and pNG1304 are available from Addgene (https://www.addgene.org) under accession numbers 149710, 149709, and 162488, respectively. Other data supporting the findings of this study are available from the corresponding authors on request. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Yu Zhang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Gökhan Tolun, Email: gokhan_tolun@uow.edu.au.
Peter J. Lewis, Email: Peter.Lewis@newcastle.edu.au
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-020-20157-5.
References
- 1.Pomerantz RT, O’Donnell M. The replisome uses mRNA as a primer after colliding with RNA polymerase. Nature. 2008;456:762–766. doi: 10.1038/nature07527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Pomerantz RT, O’Donnell M. Direct restart of a replication fork stalled by a head-on RNA polymerase. Science. 2010;327:590–592. doi: 10.1126/science.1179595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gupta MK, et al. Protein-DNA complexes are the primary sources of replication fork pausing in Escherichia coli. Proc. Natl Acad. Sci. USA. 2013;110:7252–7257. doi: 10.1073/pnas.1303890110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rocha EPC. The replication-related organization of bacterial genomes. Microbiology. 2004;150:1609–1627. doi: 10.1099/mic.0.26974-0. [DOI] [PubMed] [Google Scholar]
- 5.Adelman K, Lis JT. Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat. Rev. Genet. 2012;13:720–731. doi: 10.1038/nrg3293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Epshtein V, et al. UvrD facilitates DNA repair by pulling RNA polymerase backwards. Nature. 2014;505:372–377. doi: 10.1038/nature12928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ghodke H, Ho HN, van Oijen AM. Single-molecule live-cell imaging visualizes parallel pathways of prokaryotic nucleotide excision repair. Nat. Commun. 2020;11:1477. doi: 10.1038/s41467-020-15179-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ho HN, van Oijen AM, Ghodke H. The transcription-repair coupling factor Mfd associates with RNA polymerase in the absence of exogenous damage. Nat. Commun. 2018;9:1570. doi: 10.1038/s41467-018-03790-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sanders K, et al. The structure and function of an RNA polymerase interaction domain in the PcrA/UvrD helicase. Nucleic Acids Res. 2017;45:3875–3887. doi: 10.1093/nar/gkx074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sikova M, et al. The torpedo effect in Bacillus subtilis: RNase J1 resolves stalled transcription complexes. EMBO J. 2020;39:e102500. doi: 10.15252/embj.2019102500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Saba J, et al. The elemental mechanism of transcriptional pausing. Elife. 2019;8:e40981. doi: 10.7554/eLife.40981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wiedermannova J, et al. Characterization of HelD, an interacting partner of RNA polymerase from Bacillus subtilis. Nucleic Acids Res. 2014;42:5151–5163. doi: 10.1093/nar/gku113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Koval T, et al. Domain structure of HelD, an interaction partner of Bacillus subtilis RNA polymerase. FEBS Lett. 2019;593:996–1005. doi: 10.1002/1873-3468.13385. [DOI] [PubMed] [Google Scholar]
- 14.Hawkins M, et al. Direct removal of RNA polymerase barriers to replication by accessory replicative helicases. Nucleic Acids Res. 2019;47:5100–5113. doi: 10.1093/nar/gkz170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Liu B, Zuo Y, Steitz TA. Structural basis for transcription reactivation by RapA. Proc. Natl Acad. Sci. USA. 2015;112:2006–2010. doi: 10.1073/pnas.1417152112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Xu J, et al. Structural basis for the initiation of eukaryotic transcription-coupled DNA repair. Nature. 2017;551:653–657. doi: 10.1038/nature24658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lane WJ, Darst SA. Molecular evolution of multisubunit RNA polymerases: sequence analysis. J. Mol. Biol. 2010;395:671–685. doi: 10.1016/j.jmb.2009.10.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lane WJ, Darst SA. Molecular evolution of multisubunit RNA polymerases: structural analysis. J. Mol. Biol. 2010;395:686–704. doi: 10.1016/j.jmb.2009.10.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Keller AN, et al. epsilon, a new subunit of RNA polymerase found in gram-positive bacteria. J. Bacteriol. 2014;196:3622–3632. doi: 10.1128/JB.02020-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yang X, Lewis PJ. Overproduction and purification of recombinant Bacillus subtilis RNA polymerase. Protein Expr. Purif. 2008;59:86–93. doi: 10.1016/j.pep.2008.01.006. [DOI] [PubMed] [Google Scholar]
- 21.Helmann JD. Purification of Bacillus subtilis RNA polymerase and associated factors. Methods Enzymol. 2003;370:10–24. doi: 10.1016/S0076-6879(03)70002-0. [DOI] [PubMed] [Google Scholar]
- 22.Kang J, et al. Structural basis of transcription arrest by coliphage HK022 Nun in an Escherichia coli RNA polymerase elongation complex. Elife. 2017;6:e25478. doi: 10.7554/eLife.25478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kouba T, et al. The Core and Holoenzyme Forms of RNA Polymerase from Mycobacterium smegmatis. J Bacteriol. 2019;201:e00583-18. doi: 10.1128/JB.00583-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Vassylyev DG, Vassylyeva MN, Perederina A, Tahirov TH, Artsimovitch I. Structural basis for transcription elongation by bacterial RNA polymerase. Nature. 2007;448:157–162. doi: 10.1038/nature05932. [DOI] [PubMed] [Google Scholar]
- 25.Johnston EB, Lewis PJ, Griffith R. The interaction of Bacillus subtilis sigmaA with RNA polymerase. Protein Sci. 2009;18:2287–2297. doi: 10.1002/pro.239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ma C, et al. RNA polymerase-induced remodelling of NusA produces a pause enhancement complex. Nucleic Acids Res. 2015;43:2829–2840. doi: 10.1093/nar/gkv108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ma C, et al. Inhibitors of bacterial transcription initiation complex formation. ACS Chem. Biol. 2013;8:1972–1980. doi: 10.1021/cb400231p. [DOI] [PubMed] [Google Scholar]
- 28.Yang X, et al. The structure of bacterial RNA polymerase in complex with the essential transcription elongation factor NusA. EMBO Rep. 2009;10:997–1002. doi: 10.1038/embor.2009.155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lin W, et al. Structural Basis of Mycobacterium tuberculosis Transcription and Transcription Inhibition. Mol. Cell. 2017;66:169–179 e168. doi: 10.1016/j.molcel.2017.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Holm L. Benchmarking fold detection by DaliLite v.5. Bioinformatics. 2019;35:5326–5327. doi: 10.1093/bioinformatics/btz536. [DOI] [PubMed] [Google Scholar]
- 31.Pei, H.-H. et al. The δ subunit and NTPase HelD institute a two-pronged mechanism for RNA polymerase recycling. Nat. Comm.10.1038/s41467-020-20159-3. [DOI] [PMC free article] [PubMed]
- 32.Artsimovitch I, et al. Allosteric modulation of the RNA polymerase catalytic reaction is an essential component of transcription control by rifamycins. Cell. 2005;122:351–363. doi: 10.1016/j.cell.2005.07.014. [DOI] [PubMed] [Google Scholar]
- 33.Molodtsov V, et al. X-ray crystal structures of the Escherichia coli RNA polymerase in complex with benzoxazinorifamycins. J. Med. Chem. 2013;56:4758–4763. doi: 10.1021/jm4004889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Naryshkina T, Kuznedelov K, Severinov K. The role of the largest RNA polymerase subunit lid element in preventing the formation of extended RNA-DNA hybrid. J. Mol. Biol. 2006;361:634–643. doi: 10.1016/j.jmb.2006.05.034. [DOI] [PubMed] [Google Scholar]
- 35.Toulokhonov I, Landick R. The role of the lid element in transcription by E. coli RNA polymerase. J. Mol. Biol. 2006;361:644–658. doi: 10.1016/j.jmb.2006.06.071. [DOI] [PubMed] [Google Scholar]
- 36.Delumeau O, et al. The dynamic protein partnership of RNA polymerase in Bacillus subtilis. Proteomics. 2011;11:2992–3001. doi: 10.1002/pmic.201000790. [DOI] [PubMed] [Google Scholar]
- 37.Singleton MR, Dillingham MS, Wigley DB. Structure and mechanism of helicases and nucleic acid translocases. Annu Rev. Biochem. 2007;76:23–50. doi: 10.1146/annurev.biochem.76.052305.115300. [DOI] [PubMed] [Google Scholar]
- 38.Kouba, T. et al. Mycobacterial HelD is a nucleic acids-clearing factor for RNA polymerase. Nat. Comm.10.1038/s41467-020-20158-4. [DOI] [PMC free article] [PubMed]
- 39.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- 40.Gnatt AL, Cramer P, Fu J, Bushnell DA, Kornberg RD. Structural basis of transcription: an RNA polymerase II elongation complex at 3.3 A resolution. Science. 2001;292:1876–1882. doi: 10.1126/science.1059495. [DOI] [PubMed] [Google Scholar]
- 41.Abdelkareem M, et al. Structural basis of transcription: RNA polymerase backtracking and its reactivation. Mol. Cell. 2019;75:298–309 e294. doi: 10.1016/j.molcel.2019.04.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Molodtsov V, et al. Allosteric effector ppGpp potentiates the inhibition of transcript initiation by DksA. Mol. Cell. 2018;69:828–839 e825. doi: 10.1016/j.molcel.2018.01.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Kettenberger H, Armache KJ, Cramer P. Complete RNA polymerase II elongation complex structure and its interactions with NTP and TFIIS. Mol. Cell. 2004;16:955–965. doi: 10.1016/j.molcel.2004.11.040. [DOI] [PubMed] [Google Scholar]
- 44.Brueckner F, Ortiz J, Cramer P. A movie of the RNA polymerase nucleotide addition cycle. Curr. Opin. Struct. Biol. 2009;19:294–299. doi: 10.1016/j.sbi.2009.04.005. [DOI] [PubMed] [Google Scholar]
- 45.Punjani, A. & Fleet, D. J. 3D variability analysis: directly resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM images.Preprint at https://www.biorxiv.org/content/10.1101/2020.04.08.032466v2 (2020). [DOI] [PubMed]
- 46.Stelter M, Acajjaoui S, McSweeney S, Timmins J. Structural and mechanistic insight into DNA unwinding by Deinococcus radiodurans UvrD. PLoS ONE. 2013;8:e77364. doi: 10.1371/journal.pone.0077364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Velankar SS, Soultanas P, Dillingham MS, Subramanya HS, Wigley DB. Crystal structures of complexes of PcrA DNA helicase with a DNA substrate indicate an inchworm mechanism. Cell. 1999;97:75–84. doi: 10.1016/S0092-8674(00)80716-3. [DOI] [PubMed] [Google Scholar]
- 48.Yang W. Lessons learned from UvrD helicase: mechanism for directional movement. Annu. Rev. Biophys. 2010;39:367–385. doi: 10.1146/annurev.biophys.093008.131415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Nicolas P, et al. Condition-dependent transcriptome reveals high-level regulatory architecture in Bacillus subtilis. Science. 2012;335:1103–1106. doi: 10.1126/science.1206848. [DOI] [PubMed] [Google Scholar]
- 50.Rabatinova A, et al. The delta subunit of RNA polymerase is required for rapid changes in gene expression and competitive fitness of the cell. J. Bacteriol. 2013;195:2603–2611. doi: 10.1128/JB.00188-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Liu H, Naismith JH. An efficient one-step site-directed deletion, insertion, single and multiple-site plasmid mutagenesis protocol. BMC Biotechnol. 2008;8:91. doi: 10.1186/1472-6750-8-91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Peredultchuk, M., Vonstein, V. and Demirjian, D. C. Thermus promoters for gene expression. USA patent (1999).
- 53.Pronobis MI, Deuitch N, Peifer M. The Miraprep: a protocol that uses a Miniprep kit and provides maxiprep yields. PLoS ONE. 2016;11:e0160509. doi: 10.1371/journal.pone.0160509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Studier FW. Protein production by auto-induction in high density shaking cultures. Protein Expr. Purif. 2005;41:207–234. doi: 10.1016/j.pep.2005.01.016. [DOI] [PubMed] [Google Scholar]
- 55.Scharf NT, Molodtsov V, Kontos A, Murakami KS, Garcia GA. Novel chemical scaffolds for inhibition of rifamycin-resistant RNA polymerase discovered from high-throughput. Screen. SLAS Disco. 2017;22:287–297. doi: 10.1177/2472555216679994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Lanzetta PA, Alvarez LJ, Reinach PS, Candia OA. An improved assay for nanomole amounts of inorganic phosphate. Anal. Biochem. 1979;100:95–97. doi: 10.1016/0003-2697(79)90115-5. [DOI] [PubMed] [Google Scholar]
- 57.Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife7, e42166 (2018). [DOI] [PMC free article] [PubMed]
- 58.Zheng SQ, et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods. 2017;14:331–332. doi: 10.1038/nmeth.4193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Zhang K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 2016;193:1–12. doi: 10.1016/j.jsb.2015.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Wagner T, et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun. Biol. 2019;2:218. doi: 10.1038/s42003-019-0437-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Rosenthal PB, Henderson R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. J. Mol. Biol. 2003;333:721–745. doi: 10.1016/j.jmb.2003.07.013. [DOI] [PubMed] [Google Scholar]
- 62.Afonine PV, et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr D. Struct. Biol. 2018;74:814–840. doi: 10.1107/S2059798318009324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Terwilliger TC, Ludtke SJ, Read RJ, Adams PD, Afonine PV. Improvement of cryo-EM maps by density modification. Nat. Methods. 2020;17:923–927. doi: 10.1038/s41592-020-0914-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci, 10.1002/pro.3943 (2020). [DOI] [PMC free article] [PubMed]
- 65.Trabuco LG, Villa E, Mitra K, Frank J, Schulten K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure. 2008;16:673–683. doi: 10.1016/j.str.2008.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. Biol. Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
- 67.Keller A, et al. A new subunit of RNA polymerase found in Gram positive bacteria. J. Bacteriol. 2014;196:3622–3632. doi: 10.1128/JB.02020-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Croll TI. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr D. Struct. Biol. 2018;74:519–530. doi: 10.1107/S2059798318002425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Geer LY, Domrachev M, Lipman DJ, Bryant SH. CDART: protein homology by domain architecture. Genome Res. 2002;12:1619–1623. doi: 10.1101/gr.278202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Larkin MA, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
- 71.Papadopoulos JS, Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics. 2007;23:1073–1079. doi: 10.1093/bioinformatics/btm076. [DOI] [PubMed] [Google Scholar]
- 72.Ashkenazy H, et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
CryoEM maps have been deposited in the Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb/) under accession codes EMD-21921 (RNAP-HelD) and EMD-21920 (RNAP elongation complex). Structure coordinates have been deposited in the RCSB Protein Data Bank (https://www.rcsb.org/) with accession codes 6WVK (RNAP-HelD) and 6WVJ (RNAP elongation complex). Plasmids pNG1256, pNG1299 and pNG1304 are available from Addgene (https://www.addgene.org) under accession numbers 149710, 149709, and 162488, respectively. Other data supporting the findings of this study are available from the corresponding authors on request. Source data are provided with this paper.