Skip to main content
eLife logoLink to eLife
. 2018 May 9;7:e36349. doi: 10.7554/eLife.36349

The universally-conserved transcription factor RfaH is recruited to a hairpin structure of the non-template DNA strand

Philipp K Zuber 1, Irina Artsimovitch 2,3, Monali NandyMazumdar 2,3,, Zhaokun Liu 2,3,, Yuri Nedialkov 2,3,§, Kristian Schweimer 1, Paul Rösch 1, Stefan H Knauer 1,
Editor: Andrés Aguilera4
PMCID: PMC5995543  PMID: 29741479

Abstract

RfaH, a transcription regulator of the universally conserved NusG/Spt5 family, utilizes a unique mode of recruitment to elongating RNA polymerase to activate virulence genes. RfaH function depends critically on an ops sequence, an exemplar of a consensus pause, in the non-template DNA strand of the transcription bubble. We used structural and functional analyses to elucidate the role of ops in RfaH recruitment. Our results demonstrate that ops induces pausing to facilitate RfaH binding and establishes direct contacts with RfaH. Strikingly, the non-template DNA forms a hairpin in the RfaH:ops complex structure, flipping out a conserved T residue that is specifically recognized by RfaH. Molecular modeling and genetic evidence support the notion that ops hairpin is required for RfaH recruitment. We argue that both the sequence and the structure of the non-template strand are read out by transcription factors, expanding the repertoire of transcriptional regulators in all domains of life.

Research organism: E. coli

Introduction

NusG/Spt5 proteins are the only transcription factors that coevolved with RNA polymerase (RNAP) since the last universal common ancestor (NandyMazumdar and Artsimovitch, 2015). These proteins have an N-terminal domain (NTD) of mixed α/β topology connected to at least one β-barrel C-terminal domain (CTD) bearing a KOW motif via a flexible linker. The NTD binds across the DNA-binding channel, bridging the RNAP pincers composed of the β' clamp and β lobe domains and locking elongating RNAP in a pause-resistant state (Sevostyanova et al., 2011), a mechanism likened to that of processivity clamps in DNA polymerases (Klein et al., 2011). The CTDs modulate RNA synthesis by making contacts to nucleic acids or to proteins involved in diverse cellular processes; Escherichia coli NusG binds either to termination factor Rho to silence aberrant transcription (Mooney et al., 2009b; Peters et al., 2012) or to ribosomal protein S10 to promote antitermination (Said et al., 2017) and transcription-translation coupling (Burmann et al., 2010).

In addition to housekeeping NusG, diverse bacterial paralogs, typified by E. coli RfaH, activate long operons that encode antibiotics, capsules, toxins, and pili by inhibiting Rho-dependent termination, an activity inverse to that of NusG (NandyMazumdar and Artsimovitch, 2015). To prevent interference with NusG, action of its paralogs must be restricted to their specific targets. Targeted recruitment is commonly achieved through recognition of nucleic acid sequences, for example, by alternative σ factors during initiation. Indeed, all RfaH-controlled operons have 12-nt operon polarity suppressor (ops) signals in their leader regions. RfaH is recruited at ops sites in vitro and in vivo (Artsimovitch and Landick, 2002; Belogurov et al., 2009) through direct contacts with the non-template (NT) DNA strand in the transcription bubble (Artsimovitch and Landick, 2002), a target shared with σ (Sevostyanova et al., 2008). However, E. coli NusG is associated with RNAP transcribing most genes and lacks sequence specificity (Mooney et al., 2009a) arguing against an alternative recognition sites model.

In a working model, off-target recruitment of RfaH is blocked by autoinhibition (Figure 1). RfaH-CTD, unlike the CTDs of all other known NusG/Spt5 proteins, which adopt a β-barrel structure, folds as an α-helical hairpin that masks the RNAP-binding site on the NTD (Belogurov et al., 2007). Contacts with the ops element in the NT DNA are thought to trigger domain dissociation, transforming RfaH into an open, active state in which the NTD can bind to RNAP (Belogurov et al., 2007); consistently, destabilization of the domain interface enables sequence-independent recruitment (Belogurov et al., 2007; Shi et al., 2017). On release, the α-helical CTD spontaneously refolds into a NusG-like β-barrel (Burmann et al., 2012), classifying RfaH as a transformer protein (Knauer et al., 2012). Activated RfaH remains bound to the transcription elongation complex (TEC) until termination (Belogurov et al., 2009), excluding NusG present in 100-fold excess (Schmidt et al., 2016). The β-barrel CTD recruits the 30S subunit of the ribosome to leader sequences that lack Shine-Dalgarno elements via interactions with S10 (Burmann et al., 2012). These interactions could be either maintained throughout translation elongation or broken upon the 70S formation; evidence exists in support of either scenario (Kohler et al., 2017; Saxena et al., 2018). Following TEC dissociation, RfaH has been proposed to regain the autoinhibited state (Tomar et al., 2013), thus completing the cycle.

Figure 1. Life cycle of RfaH.

Figure 1.

Available experimental data demonstrate RfaH recruitment to the ops-paused RNAP in vitro (Artsimovitch and Landick, 2002) and in vivo (Belogurov et al., 2009) via a hairpin in the NT DNA (this work). Belogurov et al. (2007) showed that destabilization of the interdomain interface was required for RfaH switch from the autoinhibited into the active state, and proposed that the RfaH-CTD refolds into a β-barrel upon release. The RfaH-CTD refolding and interactions with S10 were demonstrated by NMR spectroscopy, and functional evidence in support of their role in ribosome recruitment in vivo was reported (Burmann et al., 2012). A hypothesis that the autoinhibited state is regained after RfaH is released from TEC at a terminator has been proposed (Tomar et al., 2013) and awaits testing. The details of RfaH:RNAP contacts that mediate initial recruitment at ops, the molecular mechanism of ribosome recruitment, and hypothetical coupling of transcription and translation by RfaH (Burmann et al., 2012) remain to be investigated.β'CH, β' clamp helices; βGL, β gate loop.

A model of E. coli RfaH bound to Thermus thermophilus TEC was constructed by arbitrarily threading the NT DNA (absent in the X-ray structure) through the TEC (Belogurov et al., 2007). While subsequent functional analysis of RfaH supports this model (Belogurov et al., 2010), the path of the NT DNA and the details of ops:RfaH interactions remain unclear. The NT DNA is flexible in the TEC (Kang et al., 2017) and could be trapped in a state incompatible with productive elongation; RfaH/NusG and yeast Spt5 have been proposed to constrain the NT strand to increase processivity (Crickard et al., 2016; NandyMazumdar et al., 2016). Direct contacts to the NT DNA have been demonstrated recently for Bacillus subtilis NusG (Yakhnin et al., 2016) and Saccharomyces cerevisiae Spt5 (Crickard et al., 2016).

Here we combined structural and functional analyses to dissect RfaH:ops interactions. Our data argue that ops plays two roles in RfaH recruitment: it halts RNAP to aid loading of RfaH and makes specific contacts with RfaH-NTD. Strikingly, we found that a small hairpin extruded from the NT DNA is required for RfaH recruitment, demonstrating how NT DNA flexibility could be harnessed for transcriptional regulation in this and potentially many other systems.

Results

Functional dissection of RfaH:ops interactions

Ubiquity of the ops sequence in RfaH targets implies a key role in RfaH function. First, ops is a representative of class II signals that stabilize RNAP pausing through backtracking, a finding that predates demonstration of direct ops:RfaH interactions (Artsimovitch and Landick, 2000). Native-elongation-transcript sequencing analysis revealed that ops matches the consensus pause signal (Figure 2A) and is one of the strongest pauses in E. coli (Larson et al., 2014; Vvedenskaya et al., 2014). The observation that all experimentally validated E. coli RfaH targets (Belogurov et al., 2009) share a pause-inducing TG dinucleotide (Chan et al., 1997; Vvedenskaya et al., 2014) at positions 11 and 12 (Figure 2A) suggests that delaying RNAP at the ops site may be necessary for loading of RfaH. Second, ops bases are expected to make specific contacts to RfaH-NTD. However, potential interactions with RfaH are restricted to the central 5–6 nts of ops in the NT DNA strand, as these are expected to be exposed on the surface of the ops-paused RNAP (Kang et al., 2017). Third, binding to ops could induce conformational changes in RfaH-NTD that destabilize the interdomain interface to trigger RfaH activation. Finally, pausing at ops could be required for ribosome recruitment, a key step in RfaH mechanism (Figure 1). In the case of RfaH, pausing could favor 30S loading at sites lacking canonical ribosome binding sites either kinetically or by remodeling the nascent RNA.

Figure 2. Contribution of individual ops bases to RNAP pausing and RfaH recruitment.

(A) Consensus pause and E. coli ops sequences. (B) Expression of luxCDABE reporter fused to ops mutants in the absence and presence of RfaH determined in three independent experiments, each with three biological replicates (see source file), is presented as average ± standard deviation. Only the data obtained in the presence of RfaH are plotted; the levels of expression in the absence of RfaH are very low. RfaH effect, the ratio of lux activities observed with and without RfaH, is shown below each mutant. (C) In vitro analysis of ops mutants. Transcript generated from the T7A1 promoter on a linear DNA template is shown on top; the transcription start site (bent arrow), ops element (green box), and transcript end are indicated. Halted A24 TECs were formed as described in Materials and Methods on templates with single substitutions in the ops element. Elongation was restarted upon addition of NTPs and rifapentin in the absence or presence of 50 nM RfaH. Aliquots were withdrawn at 10, 20, 40, 80, 160, 320, 640, and 1280 s and analyzed on 8% denaturing gels. Positions of the paused and run-off transcripts are indicated with arrows. Pause sites within the ops region are numbered relative to the ops consensus sequence and color-coded. Results with WT, C3G, G5A, and G12C ops variants are shown, for all other variants see Figure 2—figure supplement 1. (D) Analysis of RfaH effects in vitro (from (C)). The assays were performed in triplicates. RfaH effects at U11 reflect the antipausing modification of RNAP by RfaH. RfaH effects at G12/C13 reflect RfaH binding to the NT DNA strand, which hinders RNAP escape from ops. Fractions of U11 RNA (left) and G12 +C13 RNAs (right) at 20 s in the absence or the presence of RfaH, presented as average ± standard deviation from three independent experiments. RfaH effects (determined as a ratio of RNA fractions with vs. without RfaH) are shown below the variant. The core ops region is indicated by a black box.

Figure 2—source data 1. In vivo analysis of ops mutants by a lux reporter assay.
DOI: 10.7554/eLife.36349.005
Figure 2—source data 2. In vitro analysis of the effect of ops mutants on RNAP pausing and RfaH recruitment.
DOI: 10.7554/eLife.36349.006

Figure 2.

Figure 2—figure supplement 1. In vitro analysis of ops mutants.

Figure 2—figure supplement 1.

Transcript generated from the T7A1 promoter on a linear DNA template is shown on top; the transcription start site (bent arrow), ops element (green box), and transcript end are indicated. Halted A24 TECs were formed as described in Methods on templates with single substitutions in the ops element. Elongation was restarted upon addition of NTPs and rifapentin in the absence or presence of 50 nM RfaH. Aliquots were withdrawn at 10, 20, 40, 80, 160, 320, 640, and 1280 s and analyzed on 8% denaturing gels. Positions of the paused and run-off transcripts are indicated with arrows. Pause sites within the ops region are numbered relative to the ops consensus sequence and color-coded.

To evaluate the roles of individual ops bases in vivo we used a luciferase (lux) reporter system (Burmann et al., 2012) in which RfaH increases expression ~40 fold with the wild-type (WT) ops (Figure 2B). We constructed reporters with single-base substitutions of all ops positions and measured the lux activity of the mutant reporters in the presence and absence of ectopically-expressed RfaH. The stimulating effect of RfaH was reduced by every ops substitution except for G2C (Figure 2B), with the strongest defects observed for substitutions G5A, T6A, G8C, and T11G. Since T11 is buried in the RNAP active site (Kang et al., 2017), the strong effect of the T11G substitution is consistent with the essential role of pausing in RfaH activity.

To distinguish between the effects of the ops substitutions on RNAP pausing and RfaH binding, we used a defined in vitro system in which RNA chain extension is slowed by limiting NTPs. Figure 2C shows assays on the WT, C3G, G5A, and G12C templates, while representative results with all other variants are presented in Figure 2—figure supplement 1. The effect of RfaH was determined as ratio of RNA fractions in the presence vs. in the absence of RfaH (Figure 2D). On the WT ops template, RNAP paused at C9 and U11. In the presence of RfaH, pausing at U11 was significantly reduced, but strongly enhanced at G12, a well-documented consequence of RfaH recruitment attributed to persistent RfaH-NTD:DNA contacts (Belogurov et al., 2007) and akin to σ-induced delay of RNAP escape from promoters and promoter-like sequences during elongation (Perdue and Roberts, 2011). While C3G and T6A substitutions reduced RfaH recruitment and antipausing activity ~3 fold, G4C, G5A, A7T, and G8C abolished both effects completely (Figure 2D). Neither of these central bases was required for RNAP pausing (Figure 2D and Figure 2—figure supplement 1), consistent with their variability in the consensus pause sequence (Figure 2A). Conversely, the G12C substitution eliminated the pause at U11, making measurements of RfaH antipausing activity unreliable, but did not abrogate RfaH recruitment (Figure 2C,D), suggesting that pausing at U11 is dispensable for RfaH binding when RNAP is transcribing slowly.

Observations that RfaH is recruited to RNAP transcribing the G12C template raised a possibility that recruitment may not be restricted to the U11 position; for example, on this template, RNAP also pauses at the C9 position. To determine whether the entire ops element has to be transcribed to recruit RfaH, we assembled TECs on a scaffold in which RNAP is halted three nucleotides upstream from the ops site and walked them in one-nt steps to the ops pause at U11 (Figure 3). To probe RfaH recruitment, we used footprinting with Exo III. In a post-translocated TEC, RNAP protects 14 bp upstream from the RNAP active site (inferred from the position of the RNA 3’ end) from Exo III, in a pre-translocated TEC – 15 bp (Nedialkov and Burton, 2013). When bound, RfaH alters the trajectory of the upstream DNA duplex to protect additional 6–7 bp of DNA from Exo III (Nedialkov et al., 2018). We observed that RfaH induces a strong block to Exo III at U11 (Figure 3), as expected based on previous studies (Artsimovitch and Landick, 2002). RfaH was also recruited to TECs halted at C9 and G10, but not to G8 TEC in which Exo III was able to digest up to 14 bp of the upstream DNA (Figure 3). We conclude that RfaH can bind to TECs halted two nucleotides ahead of the ops site. This ‘out-of-register’ recruitment may be explained by lateral movements of RNAP, which effectively shift the ops position (Figure 3). In the absence of RfaH, RNAP halted at U11 can backtrack by 2–3 nt and by one nt at G10, whereas C9 TECs are resistant to backtracking (Nedialkov et al., 2018); in all three TECs, the same region of the NT DNA will be accessible to RfaH, at least in a fraction of complexes; see Discussion.

Figure 3. RfaH recruitment to RNAP transcribing through the ops element.

Figure 3.

(A) Schematic of Exo III footprinting of free and RfaH-bound TECs. Numbers indicate the upstream footprint boundaries relative to the RNA 3’ end. (B) The G8 TEC was assembled on the scaffold, with RNA and template (T) DNA strands labeled with [γ32P]-ATP and T4 polynucleotide kinase (PNK), and walked in one-nucleotide steps to C9, G10, and U11 positions in the presence of the matching NTP substrates. (C) RfaH was added to 50 nM, where indicated. Following the addition of Exo III, the reactions were quenched at indicated times (0 represents an untreated DNA control) and analyzed on a 12% urea-acrylamide (19:1) gel in 0.5X TBE. Numbers indicate the distance from the RNA 3’ end. Hypothetical TEC structures are shown below. G8 and C9 complexes are predominantly post-translocated, as indicated by 14 bp protection of the upstream DNA. In G10 TEC, the pre-translocated state (15 bp protection) is observed, and in U11 an additional backtracked state (16 bp protection). Exo III may counteract backtracking; the sensitivity of the nascent RNA in G10 and U11 TECs to GreB-assisted cleavage (Nedialkov et al., 2018) was used to infer the translocation states shown in the schematics.

Structural analysis of RfaH:ops contacts

Strong effects of substitutions of ops bases 3 through 8 on RfaH recruitment but not on RNAP pausing (Figure 2D) support a model in which these nucleotides make direct contacts with RfaH. To visualize the molecular details of RfaH:DNA interactions, we determined a crystal structure of RfaH bound to a 9-nt ops DNA encompassing bases G2 – G10 (ops9) at a resolution of 2.1 Å (Figure 4A, Table 1). The asymmetric unit contains two molecules of the complex, in which RfaH maintains the closed, autoinhibited state typical for free RfaH (Figure 4—figure supplement 1A, (Belogurov et al., 2007). The DNA binds to a basic patch on RfaH-NTD opposite the RNAP/RfaH-CTD binding site and forms a hairpin structure (Figure 4B).

Figure 4. Specific recognition of ops by RfaH.

(A) Crystal structure of the RfaH:ops9 complex with the 2Fo – Fc electron density map contoured at 1 σ. (B) Structure of RfaH:ops9 complex with RfaH shown in surface representation, colored according to its electrostatic potential and ops9 as sticks. (C) Details of RfaH:ops9 interactions. Hydrogen bonds are shown as black dashed lines. RfaH residues that interact with ops are labeled in green. Alanine substitutions of RfaH residues that make base-specific contacts to G5 and T6 via their side chains and that compromise RfaH recruitment (Belogurov et al., 2010) are highlighted in red (strongly defective) and orange (moderately defective). (D) RfaH:ops interactions in solution. [1H, 15N]-HSQC spectra of 110 μM [13C, 15N]-RfaH titrated with 803 μM ops12 DNA. Arrows indicate changes of chemical shifts. Selected signals are labeled. (E) Mapping of normalized chemical shift perturbations observed in (D) on the RfaH:ops9 structure.

Figure 4—source data 1. Analysis of the chemical shift perturbations during the HSQC-titration of 15N-RfaH with ops12.
DOI: 10.7554/eLife.36349.011

Figure 4.

Figure 4—figure supplement 1. Analysis of RfaH:ops interactions.

Figure 4—figure supplement 1.

(A) Superposition of free RfaH (PDB ID 2OUG, blue) and RfaH in the RfaH:ops9 complex (RfaH-NTD, green; RfaH-CTD, cyan), both in ribbon representation. The superposition is based on Cα atoms, yielding a root mean square deviation value of 0.72 Å. (B) Two-dimensional view of ops9:RfaH interactions. Water molecules are shown as cyan spheres. Bases are labeled in blue, amino acids in green. Hydrogen bonds between amino acids and DNA are shown as pink dashed lines, water-mediated hydrogen bonds as blue dashed lines. Van-der-Waals interactions are indicated in red. (C) Crystal packing of RfaH:ops9 crystals (space group P1). The content of one asymmetric unit is shown in color, molecules from neighboring asymmetric units are in grey (all molecules in ribbon representation). The panel shows an enlargement of the arrangement of DNA molecules from neighboring asymmetric units. Bases are shown as sticks (oxygen, red; nitrogen, blue; carbon, according to backbone color) and are labeled. Hydrogen bonds are indicated by dashed lines.
Figure 4—figure supplement 2. Secondary structure of isolated ops9 and RfaH:ops9 interaction in solution.

Figure 4—figure supplement 2.

(A) Downfield region of the 1D [1H] NMR spectra of ops9 DNA measured at the temperatures indicated. The region characteristic for imino protons involved in Watson-Crick base pairs is highlighted in yellow. The inferred secondary structure of ops9 DNA is shown to the right. (B) RfaH:ops12 interaction in solution. [1H, 15N]-HSQC-derived normalized chemical shift perturbations vs. residue position in RfaH. The chemical shift perturbation were obtained from a titration of 15N-RfaH with ops12 (for spectra see Figure 3D). Horizontal lines: significance levels of Δδnorm = 0.04 ppm, black;=0.1 ppm, orange,=0.15 ppm, red. Disappearing signals are highlighted in red. The domain arrangement of RfaH is indicated above.

Table 1. Data collection and refinement statistics.

Data collection
Wavelength (Å) 0.9184
Space group P1
Unit cell parameters
a, b, c (Å) 36.309/43.187/61.859
α, β, γ (°) 80.449/75.485/75.392
Resolution (Å)a 41.55–2.1 (2.2–2.1)
Unique/observed reflectionsa,b 19,931/107,345 (2,633/14,210)
Rsym (%) a,c 6.3 (42.9)
IIa 13.96 (3.47)
Completeness (%)a 97.3 (97.9)
Molecules per asymmetric unit 2
Refinement statistics
Rwork (%)d 18.62
Rfree (%)e 23.34
Number of atoms
Protein 4283
Nucleic acid 574
Water 116
B-factors
Protein 56.062
Nucleic acid 87.427
water 48.058
r.m.s. deviations
Bond lengths (Å) 0.013
Bond angles (°) 1.149

aHighest-Resolution shell values are given in parentheses.

bFriedel mates were not treated as independent reflections.

cRsym = Σh ΣI | Ii(h) - <I(h)> | / ΣhΣiI(h); where I are the independent observations of reflection h.

dRwork = Σh ||Fobs| - |Fcalc|| / Σh |Fobs|.

eThe free R-factor was calculated from 5 % of the data, which were removed at random before the structure was refined.

The DNA:protein interface encompasses 420 Å2. The hairpin loop comprises G4-A7, with T6 flipped out so that its nucleobase is completely exposed. The other nucleobases of the loop make stacking interactions. Flipped T6 inserts into a deep, narrow, positively charged pocket on RfaH-NTD, which is mainly formed by H20, R23, Q24, and R73 located in helices α1 and α2. G5 packs against the positive surface next to this cavity (Figure 4B). RfaH-NTD exclusively contacts nucleotides in the loop region, involving K10, H20, R23, Q24, T68, N70, A71, T72, R73, G74, and V75 (Figure 4C and Figure 4—figure supplement 1B). Some well-ordered water molecules are located in the ops-binding region, but only one participates in the recognition of a base (G4). Base-specific interactions with RfaH-NTD are made by G4, G5, and T6 (Figure 4C and Figure 4—figure supplement 1B); however, only G5 and T6 form a hydrogen-bond network with RfaH-NTD that may underlie sequence-specific recognition. The side chains of K10, H20, R23, and R73 directly interact with the ops DNA (Figure 4C and Figure 4—figure supplement 1B) and no aromatic residues for stacking interactions are located near T6 or G5. Thus, contacts between only two nucleobases and four amino acids mediate specific recognition of ops by RfaH. Observations that single Ala substitutions of each RfaH side chain that makes base-specific contacts to G5 and T6 (Figure 4C) compromise RfaH recruitment to the ops-pausedTEC (Belogurov et al., 2010) argue that the RfaH:DNA contacts observed in the binary ops9:RfaH complex are functionally important.

The stem of the DNA hairpin is formed by base pairs C3:G8 and G2:C10, with T9 being flipped out. The G2:C10 base pair is likely an artifact of crystal packing as the stems of neighboring DNA molecules stack on each other (Figure 4—figure supplement 1C) and could not form in a TEC that contains a 10–11 nt bubble. In contrast, the C3:G8 base pair is compatible with the TEC structure and may be physiologically relevant. C3G and G8C substitutions reduce and abolish RfaH recruitment (Figure 2C,D), yet these bases lack specific contacts with RfaH (Figure 4C), suggesting that a hairpin structure may be necessary.

The NT DNA hairpin is required for RfaH recruitment

To corroborate the crystallographic data, we carried out solution-state NMR analyses. In the [1H]-NMR spectrum of ops9 the single peak at ~13 ppm is characteristic of an imino proton signal of a G or T nucleobase in a DNA duplex, indicating the existence of a hairpin with a single base pair in solution (Figure 4—figure supplement 2A). Next, we titrated 15N-labeled RfaH with WT ops (ops12) and recorded [1H,15N]-HSQC spectra after each titration step (Figure 4D). Mapping of the normalized chemical shift perturbations (Figure 4—figure supplement 2B) on the structure of the RfaH:ops9 complex revealed a continuous interaction surface comprising mainly helices α1 and α2 that perfectly matched the DNA-binding site observed in the crystal structure (Figure 4E). The signals of 15N-RfaH-CTD were not affected during the titration, indicating that binding to the ops DNA is not sufficient to induce domain dissociation.

The above results demonstrate that base pair C3:G8 forms both in solution and in the crystal of the binary ops9:RfaH complex. To evaluate if this hairpin could form in the context of the TEC, we modeled RfaH-NTD bound to the ops-paused TEC (Figure 5A) based on a recent cryo-EM structure of the E. coli TEC (Kang et al., 2017) using our ops9:RfaH structure. Since NusG and its homologs share the RNAP-binding mode (Belogurov et al., 2010; Bernecky et al., 2017; Ehara et al., 2017; Said et al., 2017), the crystal structure of Pyrococcus furiosus Spt5 bound to the RNAP clamp domain (Klein et al., 2011; Martinez-Rucobo et al., 2011) served as a template for modeling. The NT DNA hairpin observed in the ops9:RfaH structure could be readily modeled into the TEC. In the modeled complex, RfaH-NTD binds to the β’ clamp helices (β’CH) so that the β-hairpin of RfaH, consisting of β-strands 3 and 4, may establish stabilizing interactions with the upstream DNA, as proposed for E. coli NusG-NTD (Turtola and Belogurov, 2016).

Figure 5. The role of NT DNA hairpin.

Figure 5.

(A) Model of RfaH-NTD bound to the ops-paused TEC. Surface-accessible NT DNA bases are shown as sticks. (B) The double C3G + G8C substitution partially restores RfaH-dependent recruitment. The assay was done as in Figure 2. The position of an RfaH-induced delay in RNAP escape is shown with a blue bar, solid if delay is enhanced.

To test if DNA secondary structure, rather than the identity of the paired nucleotides, is essential for RfaH recruitment to the TEC, we combined strongly defective C3G and G8C substitutions in a flipped G3:C8 base pair. We found that the double substitution partially restored RfaH recruitment, as reflected by RfaH-induced delay at positions 12/13 (Figure 5B). We conclude that the C3:G8 base pair (i) can form in the ops-paused TEC and (ii) plays an indirect, architectural role in RfaH binding by stabilizing a small DNA loop in which the bases are perfectly positioned to make direct contacts to RfaH-NTD.

Discussion

The consensus pause as a versatile regulator

Our findings portray the consensus pause as a chimeric, versatile target for diverse regulatory proteins. Pausing of RNAP is induced by the conserved flanking sequences and would favor recruitment of regulatory factors kinetically, via widening the time window for engagement of proteins in low abundance. The central region of the consensus pause is highly variable, and the primary and secondary structures of the surface-accessible NT DNA strand could mediate direct and indirect readout by a protein ligand. We hypothesize that, in addition to RfaH homologs which could be expected to use a similar mode of binding, other unrelated proteins may employ the same general principle during their recruitment to the elongating RNAP. Moreover, contacts with the NT DNA strand that persist after recruitment may underpin regulation of RNA chain elongation in all cells.

The role of ops in RfaH recruitment

Our results confirm that the ops element plays several roles in RfaH recruitment. First, consistent with the observation of direct contacts with the NT DNA by crosslinking (Artsimovitch and Landick, 2002), RfaH interacts with ops residues 4 through 7. The interactions are corroborated by previous ‘blind’, that is, uninformed by the structure, functional studies of RfaH-NTD in which substitutions of RfaH residues that interact with ops were found to cause defects in RfaH function (Belogurov et al., 2010). However, the pattern of ops:RfaH-NTD contacts, and in particular the extrusion of the hairpin, have not been anticipated. We propose that when RNAP pauses at the ops site, the NT DNA strand forms a transient hairpin exposed on the surface (Figures 4 and 5). Autoinhibited RfaH interacts with the loop nucleotides (G4 through A7), stabilizing the hairpin and forming a transient encounter complex (Figure 1). We observe that T6 flips into a pocket on RfaH-NTD, apparently a common pattern in NT DNA strand contacts since the RNAP σ and β subunits employ analogous capture mechanisms (Bae et al., 2015; Zhang et al., 2012).

Second, pausing at ops appears to be required for efficient RfaH recruitment. Substitutions of ops residues that reduce pausing compromise RfaH function, even though they do not make direct contacts to RfaH. While the simplest explanation is that pausing simply prolongs the lifespan of the RfaH target, additional roles of pausing could be considered. RNAP backtracks when paused at ops in vitro (Artsimovitch and Landick, 2000), effectively shifting the exposed NT DNA two nucleotides back. RfaH is recruited to RNAP halted two nts upstream from ops (Figure 3), suggesting that backtracking at ops, assuming it occurs in vivo, may be needed to place the ops bases in an optimal position for direct interactions. However, RfaH binds to a scaffold ops TEC locked in the post-translocated state (Nedialkov et al., 2018), arguing that the NT DNA strand may be sufficiently flexible (Kang et al., 2017) to interact with RfaH at several template positions. Although it is also possible that conformational changes that accompany the formation of the paused state may favor RfaH binding to RNAP, recent structures of paused TECs (Guo et al., 2018; Kang et al., 2018) and our observations that RfaH binds to scaffolds in which the RNA strand is present or missing similarly (Artsimovitch and Landick, 2002) do not support this interpretation.

Third, given that recruitment of the isolated RfaH-NTD does not require ops, we considered a possibility that RfaH contacts to ops trigger NTD dissociation from CTD. However, this idea is refuted by our observations that domain interface remains intact in the binary complex, implying that additional interactions with RNAP or nucleic acids relieve autoinhibition. Structural studies of an encounter complex formed when the closed RfaH recognizes ops would be required to address this question.

Finally, pausing at ops may assist in the recruitment of a ribosome, which is thought to be critical for RfaH-mediated activation of its target genes which lack canonical Shine-Dalgarno elements (Burmann et al., 2012). RfaH and NusG make similar contacts to S10 (Burmann et al., 2010; Burmann et al., 2012) and could bridge RNAP and 30S during translation initiation and 70S during elongation; the ops-induced delay could favor the initial RfaH:30S interactions. While a cryo-EM structure of a coupled RNAP:70S complex argues against bridging by NusG or RfaH (Kohler et al., 2017), a recent study supports the role of the experimentally determined NusG:S10 interface (Burmann et al., 2010) in binding to 70S and transcription-translation coupling in vivo (Saxena et al., 2018).

Specific recognition of ops by RfaH

Despite low sequence identity (21% as compared to E. coli NusG-NTD), E. coli RfaH-NTD has the typical fold of all NusG proteins (Figure 6A,B) and is thought to make similar contacts to the β’CH. However, in contrast to sequence-independent NusG, RfaH requires contacts with the ops DNA for recruitment. These interactions are highly specific, as illustrated by strong effects of single base substitutions (Figure 2) and lack of off-target recruitment in the cell (Belogurov et al., 2009). Our present data reveal that the specificity of RfaH:DNA contacts is determined by just a few direct interactions, mediated by a secondary structure in the DNA. We observe that the ops DNA forms a hairpin which exposes the invariant G5 and T6, the only two nucleobases that establish a base-specific hydrogen-bond network with RfaH-NTD (Figure 4C and Figure 4—figure supplement 1B), for specific recognition. In RfaH, the basic patch identified by previous analysis (Belogurov et al., 2010) constitutes the DNA binding site, with only the side chains of K10, H20, R23, and R73 making direct contacts to ops (Figures 4B and 6C). Alanine substitutions of K10, H20, and R73 dramatically compromised the delay of RNAP escape from the ops pause, and thus RfaH recruitment (Figures 4C and 6C, [Belogurov et al., 2010]), in agreement with their base-specific interactions in the RfaH:ops9 structure. The R16A substitution also had a strong defect (Belogurov et al., 2010). However, while one nitrogen atom of the guanidinium group of R16 is in hydrogen bonding distance to the oxygen atom of the G4 base (3.57 Å) in one of the complexes in the asymmetric unit, the distance is larger in the other copy (3.82 Å). Together with the effect of the R16A substitution, this suggests that the R16:G4 interaction may become relevant in the context of the ops TEC, where RfaH is more constrained by RfaH:RNAP interactions. Although R23A substitution compromised RfaH recruitment only slightly, our structure reveals that R23 directly contacts T5 via its guanidinium group. Q13A, H65A, T66A, and T68A variants showed only mild effects, which may be indirect. Q13 could be necessary to position R16, while H65, T66, and T68 may be involved in interactions with the β subunit gate loop (Sevostyanova et al., 2011). High conservation of K10, H20, R23, and R73 residues (Shi et al., 2017) and ops sequences (Belogurov et al., 2009) suggests a common recognition mechanism for all RfaH proteins.

Figure 6. Specificity of RfaH for ops.

Figure 6.

Superposition based on backbone atoms of NusG-NTD (PDB ID 2K06, light blue) and RfaH-NTD (taken from the RfaH:ops9 structure, green; root mean square deviation: 4.3 Å). Both proteins in ribbon representation. (B) Structure-based sequence alignment of NusG and RfaH. RfaH residues whose substitutions for Ala compromise RfaH recruitment (Belogurov et al., 2010) are highlighted in red (strongly defective) and orange (moderately defective). RfaH residues that make base-specific interactions with ops via their side chains are marked by an asterisk. (C) Structure of RfaH-NTD in (left) surface representation colored according to its electrostatic potential (from −3kBT/e, red, to +3 kBT/e, blue) and (right) ribbon representation with residues highlighted in (B) shown as sticks (C atoms, red or orange; N atoms, blue; O atoms, light red). (D) Structure of NusG-NTD (PDB ID 2K06) in (left) surface representation colored according to its electrostatic potential and (right) ribbon representation. Residues corresponding to the amino acids of RfaH highlighted in (B) are shown as sticks (C atoms, pink; N atoms, blue; O atoms, light red).

In contrast, the residues that form the basic patch in RfaH are mostly hydrophobic in E. coli NusG (Figure 6) and are not conserved within the NusG family (Shi et al., 2017), consistent with NusG function as a general transcription factor. However, specific contacts with DNA could explain unusual, pause-enhancing NusG effects on RNA synthesis in some bacteria (Czyz et al., 2014; Sevostyanova and Artsimovitch, 2010; Yakhnin et al., 2016).

Different read-out modes of the NT DNA strand

The flipping out of T6 in the ops element and its insertion into a pocket on RfaH-NTD is reminiscent of a mechanism utilized by σ to recognize the −10 promoter element during initiation (Bae et al., 2015; Zhang et al., 2012). The melted DNA strand is draped across a positively charged surface of σ, with highly conserved −11A and −7T flipped out into deep pockets of σ, whereas nucleotides at positions −10, –9, and −8 are mainly bound via extensive interactions between their sugar-phosphate backbone and σ. In the ops9:RfaH complex only one base, T6, is flipped out, but the neighboring G5 packs against the RfaH-NTD surface and also establishes base-specific interactions.

Although both RfaH and σ employ base flipping to specifically bind their target sequences, their recognition mechanisms differ in key details. While the RfaH:ops interaction relies only on a very limited number of interactions, σ establishes extensive, base-specific contacts. RfaH exhibits only few interactions with the phosphate backbone and recognizes just two bases specifically, whereas σ makes extensive interactions with the phosphate backbone of the NT DNA strand and establishes base-specific contacts not only with −11A and −7T of the −10 region, but also with −6G of the discriminator element (Feklistov and Darst, 2011; Zhang et al., 2012). Furthermore, in contrast to RfaH, σ uses a wedge residue (W433 in E. coli σ70), which rotates into the DNA duplex, mimicking the flipped-out base (Bae et al., 2015), a principle that is commonly used by a variety of proteins to stabilize the extrahelical conformation of a flipped-out base (Davies et al., 2000; Lau et al., 1998; Yang et al., 2009; Yi et al., 2012). In contrast, RfaH requires that the NT DNA folds into a hairpin to position the two central ops nucleotides for specific recognition. The ops hairpin thus constitutes an alternative way of stabilizing a DNA conformation with a flipped-out base.

These differences likely reflect distinct roles of NT DNA:protein interactions in the function of RfaH and σ. Although many examples of σ-dependent pauses that are stabilized by σ contacts to promoter-like elements during elongation have been documented (Perdue and Roberts, 2011), the primary role of σ is to mediate promoter recognition and DNA melting (Feklistov et al., 2014). Interactions with the NT DNA strand are established after initial recruitment to the duplex DNA and are only possible as a result of σ-dependent DNA strand separation. NT DNA:σ interactions are highly specific and utilize the same determinants in promoter and paused complexes (Marr et al., 2001; Zenkin et al., 2007; Zhilina et al., 2012). In contrast, RfaH is recruited to the ops element in a pre-made transcription bubble and relies on different DNA contacts for initial binding and for sequence-independent post-recruitment activity. Thus, ops recognition by RfaH seems to be more similar to sequence readout by σ during σ-induced promoter-proximal pausing than during promoter melting. Overall, base flipping provides an effective means to read sequence as it allows contacts with all atoms of a base and may be a general mechanism to recruit specific transcription factors throughout transcription.

The NT DNA strand as a general target for transcription regulation

A growing body of evidence supports a key role of the NT DNA in the regulation of transcription. NT DNA contacts to the β and σ subunits (Bae et al., 2015; Zhang et al., 2012) determine the structure and stability of promoter complexes, control start site selection, and mediate the efficiency of promoter escape, in part by modulating DNA scrunching (Haugen et al., 2006; NandyMazumdar et al., 2016; Strobel and Roberts, 2015; Winkelman and Gourse, 2017). Upon promoter escape and σ release, the NT DNA loses contacts with RNAP (Kang et al., 2017), except for transient interactions with β that control elongation and pausing (NandyMazumdar et al., 2016; Petushkov et al., 2015; Vvedenskaya et al., 2014). Our results suggest that the NT DNA is sufficiently flexible to adopt stable secondary structures and reveal interesting parallels and differences between DNA recognition by σ and RfaH, which bind to similar sites on transcription complexes via high-affinity interactions with the β’CH (Sevostyanova et al., 2008) and interact specifically with the NT DNA strand via base flipping.

NusG homologs from bacteria and yeast that bind NT DNA specifically may employ similar readout modes, allowing them to exert functions opposing those of E. coli NusG (Crickard et al., 2016; Yakhnin et al., 2016). The available evidence thus suggests that conformational flexibility of the NT DNA and neighboring RNAP elements may produce rich regulatory diversity despite the short length of the exposed NT DNA strand, mediating recruitment of factors that control initiation, elongation, and termination of transcription in all domains of life.

Materials and methods

Key resources table.

Reagent type (species)
or resource
Designation Source or reference Identifiers Additional information
Strain, strain background (E. coli) BL21 (λ DE3) Novagen N/A
Strain, strain background (E. coli) DH5α ΔrfaH (λ DE3) Belogurov et al. (2010) IA lab stock
#149
Recombinant DNA reagent list of recombinant plasmids used Table 2
Sequence-based reagent ops9
GCGGTAGTC
IDT N/A
Sequence-based reagent ops12
GGCGGTAGCGTG
Biomers.net N/A
Sequence-based reagent T7A1 promoter
AAAAAGAGTATTGACTTAAAGTCTAACCTATAGGATACTTACAGCCATCGAGCAGGCAGCGGCAAAGCCATGG
Sigma Aldrich IA lab stock #2536
Sequence-based reagent DN PCR primer
AAATAAGCGGCTCTCAGTTT
Sigma Aldrich IA lab stock #2536
Sequence-based reagent UP PCR primer
AAAAAGAGTATTGACTTAAAG
Sigma Aldrich IA lab stock #2499
Sequence-based reagent R40 RNA oligo
UUUAUCGGCGGUAG
IDT DNA Technologies N/A
Sequence-based reagent NT44 DNA oligo
CACCACCACGCGGGCGGTAGCGTGCTTTTTTCGATCTTCCAGTG
IDT DNA Technologies N/A
Sequence-based reagent T44 DNA oligo
CACTGGAAGATCGAAAAAAGCACGCTACCGCCCGCGTGGTGGTG
IDT DNA Technologies N/A
Peptide, recombinant protein E. coli RfaH (transcription assays, NMR) Belogurov et al. (2007) N/A
Peptide, recombinant protein E. coli RfaH (crystallization) Vassylyeva et al. (2006) N/A
Peptide, recombinant protein E. coli RNA polymerase Svetlov and Artsimovitch, 2015 N/A
Peptide, recombinant protein Exo III nuclease New England Biolabs Cat#: MO206
Peptide, recombinant protein T4 polynucleotide kinase New England Biolabs Cat#: MO0201
Commercial assay or kit QIAquick PCR purification kit Qiagen Cat#: 28104
Commercial assay or kit QIAquick Nucleotide Removal Kit Qiagen Cat#: 28306
Chemical compound, drug (15NH)4SO4 Campro Scientific Cat#: CS01-185_148
Chemical compound, drug D2O Eurisotop Cat#: D216L
Chemical compound, drug ApU Sigma-Aldrich Cat #: A6800
Chemical compound, drug [α−32P]-CTP Perkin Elmer Cat#: BLU008H
Chemical compound, drug Rifapentin Artsimovitch et al., 2005 N/A
Chemical compound, drug PEG monomethyl ether 500 Sigma-Aldrich Cat#: 202487
Chemical compound, drug 4-(2-hydroxyethyl)piperazineethanesulfonic acid (HEPES) for crystallization Sigma-Aldrich Cat#: H4034
Chemical compound, drug MgCl2 for crystallization Merck Cat#: 105833
Chemical compound, drug Glutaraldehyde for crystallization Fluka Cat#: 49629
Chemical compound, drug Tris(hydroxymethyl)aminomethane (Tris) for crystallization Roth Cat#: 4855.3
Chemical compound, drug KCl for crystallization VWR Cat#: 26764.298
Chemical compound, drug Dithiothreitol (DTT) for crystallization Roth Cat#: 6908.1
Chemical compound, drug Perfluoropolyether cryo oil Hampton Research Cat#: HR2-814
Software, algorithm PyMol v. 1.7 The PyMOL Molecular Graphics System, Schrödinger, LLC. https://pymol.org/2/
Software, algorithm COOT Emsley et al. (2010) https://www2.mrc-lmb.cam.ac.uk/personal/pemsley/coot/
Software, algorithm XDS Kabsch, 2010b http://xds.mpimf-heidelberg.mpg.de/
Software, algorithm XDSAPP Sparta et al., 2016 https://www.helmholtz-berlin.de/forschung/oe/np/gmx/xdsapp/index_en.html
Software, algorithm PHASER McCoy et al. (2007)
Software, algorithm PHENIX suite Adams et al. (2010) https://www.phenix-online.org/
Software, algorithm LigPlot Wallace et al. (1995) https://www.ebi.ac.uk/thornton-srv/software/LIGPLOT/
Software, algorithm NMRViewJ One Moon Scientific, Inc. http://www.onemoonscientific.com/nmrviewj
Software, algorithm GraFit v. 6.0.12 Erithacus Software Ltd. http://www.erithacus.com/grafit/
Software, algorithm MatLab v. 7.1.0.183 The MathWorks, Inc. https://de.mathworks.com/products/matlab.html
Software, algorithm ImageQuant GE Healthcare Life Sciences www.gelifesciences.com/
Software, algorithm PISA Server Krissinel and Henrick (2007) http://www.ebi.ac.uk/pdbe/pisa/
Other 24-well VDXm plates with sealant Hampton Research HR3-306

Plasmids

Plasmids are listed in Table 2.

Table 2. Plasmids.

Name Description Source
ops variants
pIA1087 PBADopsWTluxCDABE Burmann et al. (2012)
pZL6 PBADops(G2C)‒luxCDABE This work
pZL7 PBADops(A7T)‒luxCDABE This work
pZL12 PBADops(T11G ‒luxCDABE This work
pZL14 PBADops(G5A)‒luxCDABE This work
pZL21 PBADops(G4C)‒luxCDABE This work
pZL22 PBADops(T6A)‒luxCDABE This work
pZL23 PBADops(G8C)‒luxCDABE This work
pZL24 PBADops(G12C)‒luxCDABE This work
pZL25 PBADops(G1C)‒luxCDABE This work
pZL26 PBADops(C3G)‒luxCDABE This work
pZL27 PBADops(C9G)‒luxCDABE This work
pZL28 PBADops(G10C)‒luxCDABE This work
pIA1286 PBADops(C3G + G8C)‒luxCDABE This work
Gene expression vectors
pVS10 PT7 promoter– E. coli rpoA–rpoB–rpoCHis6rpoZ Belogurov et al. (2007)
pVS12 E. coli rfaH in pTYB1 Vassylyeva et al. (2006)
pIA238 E. coli rfaH in pET28a Artsimovitch and Landick (2002)

Gene expression and protein purification

RfaH used in crystallization experiments and in vitro transcription assays was produced as described (Vassylyeva et al., 2006), as was RfaH used in NMR experiments (Burmann et al., 2012), and RNAP for in vitro transcription assays (Svetlov and Artsimovitch, 2015). All expression plasmids are listed in Table 2.

The purity was checked by SDS-PAGE, the absence of nucleic acids was checked by recording UV/Vis spectra on a Nanodrop ND-1000 spectrometer (PEQLAB, Erlangen, Germany). Concentrations were determined by measuring the absorbance at 280 nm (A280) in a 10 mm quartz cuvette (Hellma, Müllheim, Germany) on a Biospectrometer basic (Eppendorf, Hamburg, Germany).

Isotopic labeling

15N-labeled proteins were obtained from E. coli cells grown in M9 minimal medium containing (15NH4)2SO4 (Campro Scientific, Berlin, Germany) as sole nitrogen source (Meyer and Schlegel, 1983; Sambrook and Russel, 1994). Expression and purification were as described for the production of unlabeled proteins.

Crystallization

RfaH was cocrystallized with ops9 DNA (5’-GCG GTA GTC-3’; IDT, Coralville IA) based on a published condition (Vassylyeva et al., 2006). The protein was dialyzed against crystallization buffer (10 mM tris(hydroxymethyl)aminomethane (Tris)/HCl (pH 7.8), 50 mM KCl, 2 mM DTT). ops9 (20 mM in H2O) was diluted with crystallization buffer and a 5-fold molar excess of MgCl2 before being added to RfaH in a molar ratio of 1:1 (complex concentration 400 μM).

The RfaH:ops9 complex was crystallized by vapor diffusion techniques at 4°C using the hanging-drop setup from a reservoir containing 21% (v/v) PEG monomethyl ether (MME) 550, 44.4 mM 4-(2-hydroxyethyl)−1-piperazineethanesulfonic acid (HEPES) (pH 7.0), 4 mM MgCl2 (2 µl protein:DNA solution +2 µl reservoir). Due to crystal instability crosslinking was carried out prior to harvesting by placing 4 µl of 25% (v/v) glutaraldehyde next to the crystallization drop and resealing the well. After an incubation for 2 hr at 4°C the crystal was immersed in perfluoropolyether (Hampton Research) before being frozen in liquid nitrogen.

Data collection and refinement

Diffraction data were collected at the synchrotron beamline MX-14.1 at Helmholtz-Zentrum Berlin (HZB) at the BESSY II electron storage ring (Berlin-Adlershof, Germany) (Mueller et al., 2015) at 100 K using a Pilatus 6M detector and a wavelength of 0.9184 Å. Data were processed and scaled with XDS (Kabsch, 2010a; Kabsch, 2010b) within the graphical user interface of XDSAPP (Sparta et al., 2016). To obtain initial phases Patterson search techniques with homologous search model were performed by PHASER (McCoy et al., 2007) using free RfaH (PDB ID 2OUG) as search model. To minimize the model bias a simulated annealing energy minimization using the PHENIX program suite (Adams et al., 2010) was performed. Subsequent rounds of model building and refinement were performed using COOT (Emsley et al., 2010) and the PHENIX program suite (Adams et al., 2010).

NMR spectroscopy

NMR experiments were performed on Bruker Avance 700 MHz spectrometer, which was equipped with a cryo-cooled, inverse triple resonance probe. Processing of NMR data was carried out using in-house routines. 2D spectra were visualized and analyzed by NMRViewJ (One Moon Scientific, Inc., Westfield, NJ, USA), 1D spectra by MatLab (The MathWorks, Inc., Version 7.1.0.183). Measurements involving RfaH were conducted at 15°C, measurements with isolated ops9 at temperatures from 4-30°C as indicated. The initial sample volume was 500 μl, if not stated otherwise. The resonance assignments for the backbone amide protons of RfaH was from a previous study (Burmann et al., 2012).

The components in the measurement of the 15N-RfaH:ops12 (5’-GGC GGT AGC GTG-3’; biomers.net GmbH, Ulm, Germany) interaction were in 10 mM K2HPO4/KH2PO4 (pH 7.5), 50 mM KCl, 10% D2O. For the determination of the secondary structure of ops9 (5’-GCG GTA GTC-3’; metabion international AG, Planegg/Steinkirchen, Germany) the DNA was in 20 mM Na2HPO4/NaH2PO4 (pH 7.0), 50 mM NaCl, 1 mM MgCl2, 10% D2O.

Interaction studies with chemical shifts changes in the fast regime on the chemical shift timescale were analyzed by calculating the normalized chemical shift perturbation (Δδnorm) according to Equation 1 for [1H,15N] correlation spectra.

Δδnorm= (Δδ1H)2+[0.1(Δδ15N)]2 (1)

where Δδ is the resonance frequency difference in ppm.

RfaH:ops TEC model

The composite model of RfaH bound to the ops-paused TEC was generated based on an available cryo EM structure of the E. coli TEC (Kang et al., 2017) and the complex of P. furiosus Spt5 bound to the RNAP clamp domain (Martinez-Rucobo et al., 2011). The Spt5:clamp complex was superimposed on the β’ subunit of the E. coli TEC, and then the RfaH:ops9 structure was positioned by superimposing RfaH-NTD on the NTD of Spt5 using COOT (Emsley et al., 2010). Nucleotides 2, 9, and 10 of ops9 were manually moved in COOT (Emsley et al., 2010) to superimpose with the NT strand keeping the C3:G8 base pair intact so that G2 is the first paired nucleotide on the upstream end of the bubble. The sequence of the remaining ops element as well as the corresponding sequences in the T DNA strand and the RNA were adapted.

Luciferase reporter assays

Luciferase reporter assays were performed as described in (Belogurov et al., 2010). A selected lux reporter plasmid (Table 2) was co-transformed with a plasmid containing the rfaH gene (pIA947) or an empty vector (pIA957) into IA149 (ΔrfaH in DH5αDE3) and plated on 100 μg/ml carbenicillin (Carb), 50 μg/ml chloramphenicol (Cam) lysogeny broth (LB) plates. Single colonies were inoculated into 3 ml of LB supplemented with Carb and Cam and incubated at 37°C. Overnight cultures were diluted into fresh LB with the antibiotics to optical density at 600 nm (OD600) ∼0.05 and grown at 37°C for 6 hrs. No induction was required for the PBAD-controlled lux or Ptrc-controlled rfaH, as leaky expression from both these vectors was enough to produce a reproducible signal. Luminescence was measured at approximately equal density for all cultures in triplicates using FLUOstar OPTIMA plate reader (BMG LABTECH, Offenburg, Germany) and normalized for cell density. Three sets of assays were done for each condition, with 3 biological replicates and 6 technical replicates each. We note that low levels of luciferase expression in the absence of RfaH are associated with large errors.

In vitro transcription assays

Templates for in vitro transcription were made by PCR amplifying pIA1087 (WT ops) or the plasmids having ops substitutions (Table 2) with a T7A1 promoter-encoding primer (5’-AAAAAGAGTATTGACTTAAAGTCTAACCTATAGGATACTTACAGCCATCGAGCAGGCAGCGGCAAAGCCATGG-3’) and a complementary downstream primer (DN: 5’-AAATAAGCGGCTCTCAGTTT-3’). A second PCR was performed with primers 5’-AAAAAGAGTATTGACTTAAAG-3’ and DN to reduce the concentration of the unused large primer, followed by purification via a QIAquick PCR purification kit (Qiagen, Valencia, CA). The resulting linear templates contained T7A1 promoter followed by an initial 24 nt T-less transcribed region; the run-off transcript generated on these templates is 79-nt long. Linear DNA template (30 nM), holo RNAP (40 nM), ApU (100 µM), and starting NTP subsets (1 µM CTP, 5 µM ATP and UTP, 10 µCi [α32P]-CTP, 3000 Ci/mmol) were mixed in 100 µl of TGA2 (20 mM Tris-acetate, 20 mM Na-acetate, 2 mM Mg-acetate, 5% glycerol, 1 mM DTT, 0.1 mM EDTA, pH 7.9). Reactions were incubated for 15 min at 37°C; thus halted TECs were stored on ice. RfaH (or an equal volume of storage buffer) was added to the TEC, followed by a 2 min incubation at 37°C. Transcription was restarted by addition of nucleotides (10 µM GTP, 150 µM ATP, CTP, and UTP) and rifapentin to 25 µg/ml. Samples were removed at time points indicated in the figures and quenched by addition of an equal volume of STOP buffer (10 M urea, 60 mM EDTA, 45 mM Tris-borate; pH 8.3). Samples were heated for 2 min at 95°C and separated by electrophoresis in denaturing 8% acrylamide (19:1) gels (7 M Urea, 0.5X TBE). The gels were dried and RNA products were visualized and quantified using FLA9000 Phosphorimaging System, ImageQuant Software, and Microsoft Excel. In vitro transcription assays were carried out in triplicates and averaged.

Exonuclease footprinting

To assemble a scaffold TEC, the RNA primer and the T DNA strand were end-labeled with [γ32P]-ATP using PNK (NEB). Following labeling, oligonucleotides were purified using QIAquick Nucleotide Removal Kit (Qiagen). To assemble a scaffold, RNA and T DNA oligonucleotides were combined in PNK buffer and annealed in a PCR machine as follows: 5 min at 45°C; 2 min each at 42, 39, 36, 33, 30, and 27°C, 10 min at 25°C. 12 pmoles of T/RNA hybrid were mixed with 14 pmoles of His-tagged core RNAP in 30 μl of TB [20 mM Tris-Cl, 5% glycerol, 40 mM KCl, 5 mM MgCl2, 10 mM β-mercaptoethanol, pH 7.9], and incubated at 37°C for 10 min. 15 μl of His-Select HF Nickel Affinity Gel (Sigma Aldrich) was washed once in TB and incubated with 20 μg Bovine Serum Albumin in a 40 μl volume for 15 min at 37°C, followed by a single wash step in TB. The T/RNA/RNAP complex was mixed with the Affinity Gel for 15 min at 37°C on a thermomixer (Eppendorf) at 900 rpm, and washed twice with TB. 30 pmoles of the NT oligonucleotide were added, followed by incubation for 20 min at 37°C, one 5 min incubation with TB-1000 in a thermomixer, and five washes with TB. The assembled TECs were eluted from beads with 90 mM imidazole in a 15 μl volume, purified through a Durapore (PVDF) 0.45 μm Centrifugal Filter Unit (Merck Millipore), and resuspended in TB. The TEC was divided in two aliquots; one was incubated with 100 nM RfaH and the other with storage buffer for 3 min at 37°C. For each time point, 5 μl TEC were mixed with 5 μl of Exo III (NEB, 40 U) and incubated at 21°C. At times indicated in the Figure 3 legend, the reactions were quenched with an equal volume of Stop buffer (8 M Urea, 20 mM EDTA, 1x TBE, 0.5% Brilliant Blue R, 0.5% Xylene Cyanol FF).

Programs

All molecular structures were visualized using The PyMOL Molecular Graphics System (Version 1.7, Schrödinger, LLC.) Superpositions of protein and nucleic acid structures were prepared with COOT (Emsley et al., 2010). Interactions between ops9 and RfaH were analyzed using LigPlot (Wallace et al., 1995). The size of the RfaH:ops9 interface was calculated by the PDBePISA server (Krissinel and Henrick, 2007).

Data availability

Coordinates and structure factor amplitudes of the RfaH:ops9 complex are deposited in the Protein Data Bank under ID code 5OND.

Acknowledgements

We thank Angela Fleig and Ramona Heißmann for technical assistance, Birgitta M Wöhrl, Claus Kuhn, and Andrey Feklistov for helpful discussions, and Dmitri Svetlov for comments on the manuscript. We also thank Michael Weyand, Julian Pfahler, and Clemens Steegborn for collecting diffraction data, the HZB for the allocation of synchrotron radiation beamtime and the technical staff of the MX beamline for support. The work was supported by grants Ro 617/21–1 and Ro 617/17–1 (both to PR.) from the Deutsche Forschungsgemeinschaft, and GM67153 (to IA.) from the National Institutes of Health.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Stefan H Knauer, Email: stefan.knauer@uni-bayreuth.de.

Andrés Aguilera, CABIMER, Universidad de Sevilla, Spain.

Funding Information

This paper was supported by the following grants:

  • Deutsche Forschungsgemeinschaft Ro 617/21-1 to Paul Rösch.

  • National Institutes of Health GM67153 to Irina Artsimovitch.

  • Deutsche Forschungsgemeinschaft Ro 617/17-1 to Paul Rösch.

Additional information

Competing interests

No competing interests declared.

Author contributions

Writing—original draft, Writing—review and editing, Carried out NMR experiments and analyzed data, Crystallized the RfaH:ops9 complex, Wrote the manuscript.

Conceptualization, Supervision, Funding acquisition, Writing—original draft, Project administration, Writing—review and editing, Carried out in vitro transcription assays, Designed and supervised in vitro and in vivo experiments, Wrote the manuscript.

Carried out in vitro transcription assays.

Carried out luciferase reporter assays.

Carried out exonuclease foot printing experiments.

Supervision, Writing—original draft, Designed and supervised the NMR experiments, Wrote the manuscript.

Conceptualization, Supervision, Funding acquisition, Writing—original draft, Project administration, Supervised NMR experiments and crystallographic experiments, Wrote the manuscript.

Conceptualization, Supervision, Visualization, Writing—original draft, Writing—review and editing, Carried out NMR experiments, Solved the crystal structure of the RfaH:ops9 complex, Generated the RfaH:opsTEC model, Designed and supervised NMR and crystallography experiments, Wrote the manuscript.

Additional files

Supplementary file 1. RfaH-opsTEC_model.pdb: PDB file of the RfaH:opsTEC model.
elife-36349-supp1.pdb (2.2MB, pdb)
DOI: 10.7554/eLife.36349.016
Transparent reporting form
DOI: 10.7554/eLife.36349.017

Data availability

Diffraction data have been deposited in PDB under the accession code 5OND. All data generated or analyzed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 2 and 4. The PDB file of the RfaH:ops TEC model has been provided.

The following dataset was generated:

Zuber PK, author; Artsimovitch I, author; Roesch P, author; Knauer SH, author. RfaH from Escherichia coli in complex with ops DNA. 2018 http://www.rcsb.org/pdb/search/structidSearch.do?structureId=5OND Publicly available at the RCSB Protein Data Bank (accession no. 5OND)

References

  1. Adams PD, Afonine PV, Bunkóczi G, Chen VB, Davis IW, Echols N, Headd JJ, Hung LW, Kapral GJ, Grosse-Kunstleve RW, McCoy AJ, Moriarty NW, Oeffner R, Read RJ, Richardson DC, Richardson JS, Terwilliger TC, Zwart PH. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallographica Section D Biological Crystallography. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Artsimovitch I, Landick R. Pausing by bacterial RNA polymerase is mediated by mechanistically distinct classes of signals. PNAS. 2000;97:7090–7095. doi: 10.1073/pnas.97.13.7090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Artsimovitch I, Landick R. The transcriptional regulator RfaH stimulates RNA chain synthesis after recruitment to elongation complexes by the exposed nontemplate DNA strand. Cell. 2002;109:193–203. doi: 10.1016/S0092-8674(02)00724-9. [DOI] [PubMed] [Google Scholar]
  4. Artsimovitch I, Vassylyeva MN, Svetlov D, Svetlov V, Perederina A, Igarashi N, Matsugaki N, Wakatsuki S, Tahirov TH, Vassylyev DG. Allosteric modulation of the RNA polymerase catalytic reaction is an essential component of transcription control by rifamycins. Cell. 2005;122:351–363. doi: 10.1016/j.cell.2005.07.014. [DOI] [PubMed] [Google Scholar]
  5. Bae B, Feklistov A, Lass-Napiorkowska A, Landick R, Darst SA. Structure of a bacterial RNA polymerase holoenzyme open promoter complex. eLife. 2015;4:e08504. doi: 10.7554/eLife.08504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Belogurov GA, Mooney RA, Svetlov V, Landick R, Artsimovitch I. Functional specialization of transcription elongation factors. The EMBO Journal. 2009;28:112–122. doi: 10.1038/emboj.2008.268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Belogurov GA, Sevostyanova A, Svetlov V, Artsimovitch I. Functional regions of the N-terminal domain of the antiterminator RfaH. Molecular Microbiology. 2010;76:286–301. doi: 10.1111/j.1365-2958.2010.07056.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Belogurov GA, Vassylyeva MN, Svetlov V, Klyuyev S, Grishin NV, Vassylyev DG, Artsimovitch I. Structural basis for converting a general transcription factor into an operon-specific virulence regulator. Molecular Cell. 2007;26:117–129. doi: 10.1016/j.molcel.2007.02.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bernecky C, Plitzko JM, Cramer P. Structure of a transcribing RNA polymerase II-DSIF complex reveals a multidentate DNA-RNA clamp. Nature Structural & Molecular Biology. 2017;24:809–815. doi: 10.1038/nsmb.3465. [DOI] [PubMed] [Google Scholar]
  10. Burmann BM, Knauer SH, Sevostyanova A, Schweimer K, Mooney RA, Landick R, Artsimovitch I, Rösch P. An α helix to β barrel domain switch transforms the transcription factor RfaH into a translation factor. Cell. 2012;150:291–303. doi: 10.1016/j.cell.2012.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Burmann BM, Schweimer K, Luo X, Wahl MC, Stitt BL, Gottesman ME, Rösch P. A NusE:NusG complex links transcription and translation. Science. 2010;328:501–504. doi: 10.1126/science.1184953. [DOI] [PubMed] [Google Scholar]
  12. Chan CL, Wang D, Landick R. Multiple interactions stabilize a single paused transcription intermediate in which hairpin to 3' end spacing distinguishes pause and termination pathways. Journal of Molecular Biology. 1997;268:54–68. doi: 10.1006/jmbi.1997.0935. [DOI] [PubMed] [Google Scholar]
  13. Crickard JB, Fu J, Reese JC. Biochemical analysis of yeast suppressor of Ty 4/5 (Spt4/5) reveals the importance of nucleic acid interactions in the prevention of RNA polymerase II arrest. Journal of Biological Chemistry. 2016;291:9853–9870. doi: 10.1074/jbc.M116.716001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Czyz A, Mooney RA, Iaconi A, Landick R. Mycobacterial RNA polymerase requires a U-tract at intrinsic terminators and is aided by NusG at suboptimal terminators. mBio. 2014;5:e00931-14. doi: 10.1128/mBio.00931-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Davies DR, Goryshin IY, Reznikoff WS, Rayment I. Three-dimensional structure of the Tn5 synaptic complex transposition intermediate. Science. 2000;289:77–85. doi: 10.1126/science.289.5476.77. [DOI] [PubMed] [Google Scholar]
  16. Ehara H, Yokoyama T, Shigematsu H, Yokoyama S, Shirouzu M, Sekine SI. Structure of the complete elongation complex of RNA polymerase II with basal factors. Science. 2017;357:921–924. doi: 10.1126/science.aan8552. [DOI] [PubMed] [Google Scholar]
  17. Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallographica Section D Biological Crystallography. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Feklistov A, Darst SA. Structural basis for promoter-10 element recognition by the bacterial RNA polymerase σ subunit. Cell. 2011;147:1257–1269. doi: 10.1016/j.cell.2011.10.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Feklistov A, Sharon BD, Darst SA, Gross CA. Bacterial sigma factors: a historical, structural, and genomic perspective. Annual Review of Microbiology. 2014;68:357–376. doi: 10.1146/annurev-micro-092412-155737. [DOI] [PubMed] [Google Scholar]
  20. Guo X, Myasnikov AG, Chen J, Crucifix C, Papai G, Takacs M, Schultz P, Weixlbaumer A. Structural basis for NusA stabilized transcriptional pausing. Molecular Cell. 2018;69:816–827. doi: 10.1016/j.molcel.2018.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Haugen SP, Berkmen MB, Ross W, Gaal T, Ward C, Gourse RL. rRNA promoter regulation by nonoptimal binding of sigma region 1.2: an additional recognition element for RNA polymerase. Cell. 2006;125:1069–1082. doi: 10.1016/j.cell.2006.04.034. [DOI] [PubMed] [Google Scholar]
  22. Kabsch W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallographica Section D Biological Crystallography. 2010a;66:133–144. doi: 10.1107/S0907444909047374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kabsch W. XDS. Acta Crystallographica. Section D, Biological Crystallography. 2010b;66:125–132. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kang JY, Mishanina TV, Bellecourt MJ, Mooney RA, Darst SA, Landick R. RNA polymerase accommodates a pause RNA hairpin by global conformational rearrangements that prolong pausing. Molecular Cell. 2018;69:802–815. doi: 10.1016/j.molcel.2018.01.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kang JY, Olinares PD, Chen J, Campbell EA, Mustaev A, Chait BT, Gottesman ME, Darst SA. Structural basis of transcription arrest by coliphage HK022 Nun in an Escherichia coli RNA polymerase elongation complex. eLife. 2017;6:e25478. doi: 10.7554/eLife.25478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Klein BJ, Bose D, Baker KJ, Yusoff ZM, Zhang X, Murakami KS. RNA polymerase and transcription elongation factor Spt4/5 complex structure. PNAS. 2011;108:546–550. doi: 10.1073/pnas.1013828108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Knauer SH, Artsimovitch I, Rösch P. Transformer proteins. Cell Cycle. 2012;11:4289–4290. doi: 10.4161/cc.22468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kohler R, Mooney RA, Mills DJ, Landick R, Cramer P. Architecture of a transcribing-translating expressome. Science. 2017;356:194–197. doi: 10.1126/science.aal3059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. Journal of Molecular Biology. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
  30. Larson MH, Mooney RA, Peters JM, Windgassen T, Nayak D, Gross CA, Block SM, Greenleaf WJ, Landick R, Weissman JS. A pause sequence enriched at translation start sites drives transcription dynamics in vivo. Science. 2014;344:1042–1047. doi: 10.1126/science.1251871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lau AY, Schärer OD, Samson L, Verdine GL, Ellenberger T. Crystal structure of a human alkylbase-DNA repair enzyme complexed to DNA: mechanisms for nucleotide flipping and base excision. Cell. 1998;95:249–258. doi: 10.1016/S0092-8674(00)81755-9. [DOI] [PubMed] [Google Scholar]
  32. Marr MT, Datwyler SA, Meares CF, Roberts JW. Restructuring of an RNA polymerase holoenzyme elongation complex by lambdoid phage Q proteins. PNAS. 2001;98:8972–8978. doi: 10.1073/pnas.161253298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Martinez-Rucobo FW, Sainsbury S, Cheung AC, Cramer P. Architecture of the RNA polymerase-Spt4/5 complex and basis of universal transcription processivity. The EMBO Journal. 2011;30:1302–1310. doi: 10.1038/emboj.2011.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. Journal of Applied Crystallography. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Meyer O, Schlegel HG. Biology of aerobic carbon monoxide-oxidizing bacteria. Annual Review of Microbiology. 1983;37:277–310. doi: 10.1146/annurev.mi.37.100183.001425. [DOI] [PubMed] [Google Scholar]
  36. Mooney RA, Davis SE, Peters JM, Rowland JL, Ansari AZ, Landick R. Regulator trafficking on bacterial transcription units in vivo. Molecular Cell. 2009a;33:97–108. doi: 10.1016/j.molcel.2008.12.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Mooney RA, Schweimer K, Rösch P, Gottesman M, Landick R. Two structurally independent domains of E. coli NusG create regulatory plasticity via distinct interactions with RNA polymerase and regulators. Journal of Molecular Biology. 2009b;391:341–358. doi: 10.1016/j.jmb.2009.05.078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Mueller U, Förster R, Hellmig M, Huschmann FU, Kastner A, Malecki P, Pühringer S, Röwer M, Sparta K, Steffien M, Ühlein M, Wilk P, Weiss MS. The macromolecular crystallography beamlines at BESSY II of the Helmholtz-Zentrum Berlin: current status and perspectives. The European Physical Journal Plus. 2015;130:141–150. doi: 10.1140/epjp/i2015-15141-2. [DOI] [Google Scholar]
  39. NandyMazumdar M, Artsimovitch I. Ubiquitous transcription factors display structural plasticity and diverse functions: nusg proteins - Shifting shapes and paradigms. BioEssays : News and Reviews in Molecular, Cellular and Developmental Biology. 2015;37:324–334. doi: 10.1002/bies.201400177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. NandyMazumdar M, Nedialkov Y, Svetlov D, Sevostyanova A, Belogurov GA, Artsimovitch I. RNA polymerase gate loop guides the nontemplate DNA strand in transcription complexes. PNAS. 2016;113:14994–14999. doi: 10.1073/pnas.1613673114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Nedialkov YA, Burton ZF. Translocation and fidelity of Escherichia coli RNA polymerase. Transcription. 2013;4:136–143. doi: 10.4161/trns.25527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Nedialkov Y, Svetlov D, Belogurov GA, Artsimovitch I. Locking the non-template DNA to control transcription. Molecular Microbiology. 2018:e13983. doi: 10.1111/mmi.13983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Perdue SA, Roberts JW. Σ(70)-dependent transcription pausing in Escherichia coli. Journal of Molecular Biology. 2011;412:782–792. doi: 10.1016/j.jmb.2011.02.011. [DOI] [PubMed] [Google Scholar]
  44. Peters JM, Mooney RA, Grass JA, Jessen ED, Tran F, Landick R. Rho and NusG suppress pervasive antisense transcription in Escherichia coli. Genes & Development. 2012;26:2621–2633. doi: 10.1101/gad.196741.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Petushkov I, Pupov D, Bass I, Kulbachinskiy A. Mutations in the CRE pocket of bacterial RNA polymerase affect multiple steps of transcription. Nucleic Acids Research. 2015;43:5798–5809. doi: 10.1093/nar/gkv504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Said N, Krupp F, Anedchenko E, Santos KF, Dybkov O, Huang YH, Lee CT, Loll B, Behrmann E, Bürger J, Mielke T, Loerke J, Urlaub H, Spahn CMT, Weber G, Wahl MC. Structural basis for λN-dependent processive transcription antitermination. Nature Microbiology. 2017;2:17062. doi: 10.1038/nmicrobiol.2017.62. [DOI] [PubMed] [Google Scholar]
  47. Sambrook J, Russel DW. Molecular Cloning - a Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 1994. [Google Scholar]
  48. Saxena S, Myka KK, Washburn R, Costantino N, Court DL, Gottesman ME. Escherichia coli transcription factor NusG binds to 70S ribosomes. Molecular Microbiology. 2018:e13953. doi: 10.1111/mmi.13953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Schmidt A, Kochanowski K, Vedelaar S, Ahrné E, Volkmer B, Callipo L, Knoops K, Bauer M, Aebersold R, Heinemann M. The quantitative and condition-dependent Escherichia coli proteome. Nature Biotechnology. 2016;34:104–110. doi: 10.1038/nbt.3418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Sevostyanova A, Artsimovitch I. Functional analysis of Thermus thermophilus transcription factor NusG. Nucleic Acids Research. 2010;38:7432–7445. doi: 10.1093/nar/gkq623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Sevostyanova A, Belogurov GA, Mooney RA, Landick R, Artsimovitch I. The β subunit gate loop is required for RNA polymerase modification by RfaH and NusG. Molecular Cell. 2011;43:253–262. doi: 10.1016/j.molcel.2011.05.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sevostyanova A, Svetlov V, Vassylyev DG, Artsimovitch I. The elongation factor RfaH and the initiation factor sigma bind to the same site on the transcription elongation complex. PNAS. 2008;105:865–870. doi: 10.1073/pnas.0708432105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Shi D, Svetlov D, Abagyan R, Artsimovitch I. Flipping states: a few key residues decide the winning conformation of the only universally conserved transcription factor. Nucleic Acids Research. 2017;45:8835–8843. doi: 10.1093/nar/gkx523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Sparta KM, Krug M, Heinemann U, Mueller U, Weiss MS. Xdsapp 2.0. Journal of Applied Crystallography. 2016;49:1085–1092. doi: 10.1107/S1600576716004416. [DOI] [Google Scholar]
  55. Strobel EJ, Roberts JW. Two transcription pause elements underlie a σ70-dependent pause cycle. PNAS. 2015;112:E4374–E4380. doi: 10.1073/pnas.1512986112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Svetlov V, Artsimovitch I. Purification of bacterial RNA polymerase: tools and protocols. Methods in Molecular Biology. 2015;1276:13–29. doi: 10.1007/978-1-4939-2392-2_2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Tomar SK, Knauer SH, Nandymazumdar M, Rösch P, Artsimovitch I. Interdomain contacts control folding of transcription factor RfaH. Nucleic Acids Research. 2013;41:10077–10085. doi: 10.1093/nar/gkt779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Turtola M, Belogurov GA. NusG inhibits RNA polymerase backtracking by stabilizing the minimal transcription bubble. eLife. 2016;5:18096. doi: 10.7554/eLife.18096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Vassylyeva MN, Svetlov V, Klyuyev S, Devedjiev YD, Artsimovitch I, Vassylyev DG. Crystallization and preliminary crystallographic analysis of the transcriptional regulator RfaH from Escherichia coli and its complex with ops DNA. Acta Crystallographica Section F Structural Biology and Crystallization Communications. 2006;62:1027–1030. doi: 10.1107/S174430910603658X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Vvedenskaya IO, Vahedian-Movahed H, Bird JG, Knoblauch JG, Goldman SR, Zhang Y, Ebright RH, Nickels BE. Interactions between RNA polymerase and the "core recognition element" counteract pausing. Science. 2014;344:1285–1289. doi: 10.1126/science.1253458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wallace AC, Laskowski RA, Thornton JM. LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions. "Protein Engineering, Design and Selection". 1995;8:127–134. doi: 10.1093/protein/8.2.127. [DOI] [PubMed] [Google Scholar]
  62. Winkelman JT, Gourse RL. Open complex DNA scrunching: a key to transcription start site selection and promoter escape. BioEssays. 2017;39:1600193. doi: 10.1002/bies.201600193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Yakhnin AV, Murakami KS, Babitzke P. NusG is a Sequence-specific RNA polymerase pause factor that binds to the Non-template DNA within the paused transcription bubble. Journal of Biological Chemistry. 2016;291:5299–5308. doi: 10.1074/jbc.M115.704189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Yang CG, Garcia K, He C. Damage detection and base flipping in direct DNA alkylation repair. ChemBioChem. 2009;10:417–423. doi: 10.1002/cbic.200800580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Yi C, Chen B, Qi B, Zhang W, Jia G, Zhang L, Li CJ, Dinner AR, Yang CG, He C. Duplex interrogation by a direct DNA repair protein in search of base damage. Nature Structural & Molecular Biology. 2012;19:671–676. doi: 10.1038/nsmb.2320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Zenkin N, Kulbachinskiy A, Yuzenkova Y, Mustaev A, Bass I, Severinov K, Brodolin K. Region 1.2 of the RNA polymerase sigma subunit controls recognition of the -10 promoter element. The EMBO Journal. 2007;26:955–964. doi: 10.1038/sj.emboj.7601555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Zhang Y, Feng Y, Chatterjee S, Tuske S, Ho MX, Arnold E, Ebright RH. Structural basis of transcription initiation. Science. 2012;338:1076–1080. doi: 10.1126/science.1227786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Zhilina E, Esyunina D, Brodolin K, Kulbachinskiy A. Structural transitions in the transcription elongation complexes of bacterial RNA polymerase during σ-dependent pausing. Nucleic Acids Research. 2012;40:3078–3091. doi: 10.1093/nar/gkr1158. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Andrés Aguilera1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "The universally-conserved transcription factor RfaH is recruited to a hairpin structure of the non-template DNA strand" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Gisela Storz as the Senior Editor. One of the three reviewers, Richard Ebridght, has agreed to reveal his identity.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

Zuber et al., combined structural and biochemical studies to shed light on the mechanism of the transcription antitermination factor RfaH, a representative of the NusG/Spt5 family of universally conserved factors, which expands the impact of this research. The authors present evidence that RfaH is recruited to a transcription elongation complex (TEC) at an operon polarity suppression (ops) site through a combination of sequence-dependent TEC pausing at ops, and sequence-dependent DNA-hairpin formation and base flipping in the transcription-bubble non-template strand of the TEC paused at ops. They demonstrate that the nt strand of the RfaH-binding site, ops, adopts a hairpin conformation with a flipped-out base, allowing for structure- and sequence-specific read-out of transcription signals. This discovery has implications not just for RfaH mechanism of action, but for our understanding of transcription regulation by signals embedded in DNA as well as mechanisms of recruitment of non-specific regulatory factors, such as DNA helicase UvrD. Conclusions are well supported by the reported NMR, and X-ray crystallographic data, as well as in vivo, and in vitro assays and are also consistent with previous mapping and computational discovery of ntDNA-binding site in RfaH NTD (Belogurov et al., 2010; Svetlov and Nudler, 2012). In conclusion, the manuscript provides important structural and functional insights into the mechanism of transcription regulation by RfaH and related factors, suggesting that sequence-dependent pausing and DNA-hairpin formation can function together to mediate sequence-specific recognition of DNA signals during transcription elongation. The findings should be of broad interest to researchers in transcription and transcriptional regulation. It is thus appropriate for publication once the following suggestions are attended.

Essential revisions:

- The functional importance of the observed interactions for binding affinity and binding specificity (subsection “Structural analysis of RfaH:ops contacts”) should be demonstrated experimentally with minimal effort. Although biochemical results are presented to support the inferred DNA secondary structure, no biochemical results are presented to support the inferred protein-DNA interactions. This is an important omission in view of the unusually, potentially worrisomely, small size of the inferred protein-DNA interface: "only 214 A^2.") Just 4 amino acids (K10, H20, R23, and R73) and 2 bases (T6 and G5) are inferred to be important for specificity. Accordingly, a comprehensive analysis -involving substitution of each amino acid by Ala, and quantification of affinities of wild-type and Ala-substituted RfaH derivatives for wild-type, T5-substituted, and G-6-substituted ops derivatives- would be possible.

- The current description of the model for RfaH loading and action is vague (no real Figure legend for Figure 1) and it would be useful to be more concrete both in what is known and what is not known. E.g. in the modelled RfaH-TEC complex, which is very compelling, RfaH is bound to ops in the context of a TEC. The TEC is in a register where U11 would be post-translocated or possibly pre-translocated (assuming G12 has been added). It was believed that the backtracking happens at the U11 pause site (according to Figure 3, Artsimovitch and Landick, 2000) but in that case it is not easy to see how RfaH can bind to ops because the NT DNA should be shifted downstream by at least one, possibly two bases. Do the authors think that RNAP is backtracked at U11? Do the authors think that RfaH binds to this state or do they think it binds to a post-translocated TEC at U11?

- The ExoIII footprints seem to argue against backtracking since the tDNA gets shorter as RNAP progresses towards U11 and the results are not consistent with the schematics at the bottom of Figure 3. It seems that the major ExoIII cleavage products in absence of RfaH are not labelled correctly: G8 should be 14, C9 should be 13, G10 should be 13/12, U11 should be 12. Likewise, in the presence of RfaH, the ExoIII footprints also get shorter (not to the same extent) – major ExoIII cleavage products seem to be 19 for C9, 19/18 for G10 and 18 for U11. These results argue against backtracking because ExoIII gains access to the tDNA strand as RNAP adds bases to the RNA 3'-end. Authors should clarify and discuss this.

eLife. 2018 May 9;7:e36349. doi: 10.7554/eLife.36349.022

Author response


Essential revisions:

- The functional importance of the observed interactions for binding affinity and binding specificity (subsection “Structural analysis of RfaH:ops contacts”) should be demonstrated experimentally with minimal effort. Although biochemical results are presented to support the inferred DNA secondary structure, no biochemical results are presented to support the inferred protein-DNA interactions. This is an important omission in view of the unusually, potentially worrisomely, small size of the inferred protein-DNA interface: "only 214 A^2.") Just 4 amino acids (K10, H20, R23, and R73) and 2 bases (T6 and G5) are inferred to be important for specificity. Accordingly, a comprehensive analysis -involving substitution of each amino acid by Ala, and quantification of affinities of wild-type and Ala-substituted RfaH derivatives for wild-type, T5-substituted, and G-6-substituted ops derivatives- would be possible.

We currently do not have an assay that would allow us to measure RfaH affinities to the EC directly and quantitatively; an EMSA with 32P-labeled RfaH that we used in (Artsimovitch and Landick, 2002) is messy and semi-quantitative. We use an indirect way to assess defects in recruitment by measuring RNAP retention 1 and 2 nucleotides downstream from the ops site (referred to as 12 and 13 in the current manuscript). We assume that the strength of RfaH contacts to the NT strand is reflected in the delay in RNAP escape from the ops site, but this is not a true affinity measurement. Moreover, only a fraction of RNAP is delayed downstream from the ops site. Using this assay, we showed that K10A, H20A, and R73A substitutions had strong defects in delaying RNAP escape from ops, whereas R23A had a moderate effect (Belogurov et al., 2010). Based on these results, we proposed that K10, H20, R23, and R73 directly interact with the ops DNA.

In light of the reviewers’ concerns about the small size of the RfaH:ops9 interaction surface, we recalculated the interface using the PISA server. We found that the interface comprises 420Å2 (average of both complexes in the asymmetric unit), a value updated in the revised manuscript. The crystal structure and the NMR data presented in the current work and a cryoEM structure of RfaH:ops EC obtained in the Darst lab support this interpretation and reveal additional contacts that were not predicted by modeling.

We have already demonstrated the importance of RfaH K10, H20, R23, and R73 (Belogurov et al., 2010) and G5 and T6 ops residues (this work) in RfaH function in vitro and in vivo and we do not yet have an assay to provide adequate quantitative analysis of RfaH binding to its real target, the EC. Thus, we cannot properly carry out a thorough analysis of these interactions in a reasonable time frame.

We agree that a systematic analysis of RfaH:EC interactions would be necessary to validate structural observations, particularly because the details of RfaH:DNA contacts are not identical. However, this cannot be achieved with “a minimal effort” because roles of all determinants, not just RfaH K10, H20, R23, R73 and G5/T6 in ops, need to be probed, and because we need to develop an equilibrium binding assay to probe contributions of RfaH and DNA residues implicated by X-ray and cryoEM structures.

At the same time, a manuscript reporting the cryoEM structure of RfaH-ops EC is expected to be accepted any day now. This manuscript was submitted five months after the results of the current study have been communicated to the authors and went through review at Cell with a lightning speed. Given these developments, we cannot afford a significant delay in publication. Notably, the two manuscripts are complementary because the Darst manuscript, on which IA and YN are coauthors, does not present any functional data on the NT DNA determinants and conformation.

To better describe what is known about RfaH:DNA interactions, we revised the text and figures to summarize the available functional data about the RfaH:DNA contacts. We indicated the demonstrated effects of RfaH substitutions on recruitment to ops (Belogurov et al., 2010) in Figure 4C and Figure 6 and discussed the observed effects of RfaH substitutions in the light of our RfaH:ops9 complex structure.

- The current description of the model for RfaH loading and action is vague (no real Figure legend for Figure 1) and it would be useful to be more concrete both in what is known and what is not known. E.g. in the modelled RfaH-TEC complex, which is very compelling, RfaH is bound to ops in the context of a TEC. The TEC is in a register where U11 would be post-translocated or possibly pre-translocated (assuming G12 has been added). It was believed that the backtracking happens at the U11 pause site (according to Figure 3, Artsimovitch and Landick, 2000) but in that case it is not easy to see how RfaH can bind to ops because the NT DNA should be shifted downstream by at least one, possibly two bases. Do the authors think that RNAP is backtracked at U11? Do the authors think that RfaH binds to this state or do they think it binds to a post-translocated TEC at U11?

Backtracking by 1 and 2-3 nt is observed (by Gre-assisted cleavage) in ECs stalled by substrate deprivation G10 and U11 ECs, respectively, in the absence of RfaH; no backtracking is seen in C9 EC. RfaH inhibits backtracking if added to the preformed ECs because it stabilizes the -10 bp at the upstream fork junction, which has to melt for backtracking to occur, as inferred from an increase in inter-strand DNA crosslinking by psoralen. These data are from a manuscript that has been just accepted for publication by Nedialkov et al.

We assume that in vivo RfaH is recruited at U11 because a strong pause at this position was observed by permanganate footprinting (Leeds and Welch, 1996) and later by NET seq, and because mutations that interfere with RNAP pausing at U11 compromise RfaH function, but we do not know whether this complex is pre-, post-, hybrid- (Guo et al., 2018 and Kang et al., 2018), or reverse-translocated. Because RfaH binds to scaffolds in which the RNA strand is present or missing similarly (Artsimovitch and Landick, 2002), it is likely that RfaH primarily senses the NT DNA conformation during the initial recruitment and binds to the EC at different positions and in different translocation states, owing to the inherent flexibility of the NT DNA. RfaH binds to the U11 ops EC which is backtracked but also to the ops EC in which both RNA and T DNA are post-translocated in the cryoEM structure of RfaH:opsEC; in the latter complex, RNA translocation is forced by the scaffold topology, and the NT DNA forms a hairpin with the C3:G8 bp.

We modified the text and the legend of Figure 1 to provide more details on what is and what is not known about the life cycle of RfaH. We also expanded the discussion of RfaH recruitment in the revised manuscript.

- The ExoIII footprints seem to argue against backtracking since the tDNA gets shorter as RNAP progresses towards U11 and the results are not consistent with the schematics at the bottom of Figure 3. It seems that the major ExoIII cleavage products in absence of RfaH are not labelled correctly: G8 should be 14, C9 should be 13, G10 should be 13/12, U11 should be 12. Likewise, in the presence of RfaH, the ExoIII footprints also get shorter (not to the same extent) – major ExoIII cleavage products seem to be 19 for C9, 19/18 for G10 and 18 for U11. These results argue against backtracking because ExoIII gains access to the tDNA strand as RNAP adds bases to the RNA 3'-end. Authors should clarify and discuss this.

We apologize for a cryptic description of the ExoIII data. We used ExoIII as a reporter of RfaH recruitment: an extension of ExoIII upstream protection is due to RfaH effects on the NT DNA and the upstream DNA duplex trajectories. RfaH induces a strong block at C9, G10, and U11 ECs, consistent with its recruitment at these positions. In Figure 3, ExoIII footprint boundaries are labeled with respect to the RNA 3’ end; in a completely post-translocated EC, RNAP protects 14 upstream bp from ExoIII (9 bp RNA:DNA hybrid + 5 extra bp), in a half- translocated (Guo et al., 2018 and Kang et al., 2018) or pre-translocated EC– 15 bp. G8 EC, in which the upstream RNA is not complementary to the T DNA, is post-translocated (the reviewer is correct and this number should have been 14). C9 is also post-translocated, the T DNA becomes 1 nt shorter, and only 1 predominant species is observed. In G10, even both 14- and 15- products are seen, whereas in U11, an additional backtracked (16) species is seen. Exo III may not be an ideal probe because once ExoIII cleaves the DNA, RNAP cannot go back. We infer the translocation register in these complexes from sensitivity to Gre cleavage (Nedialkov et al., 2018), but the Exo probing also shows that RNAP is reluctant to move forward as it approaches ops.

We expanded the section on ExoIII probing and modified Figure 3 and its legend to make it more clear.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 2—source data 1. In vivo analysis of ops mutants by a lux reporter assay.
    DOI: 10.7554/eLife.36349.005
    Figure 2—source data 2. In vitro analysis of the effect of ops mutants on RNAP pausing and RfaH recruitment.
    DOI: 10.7554/eLife.36349.006
    Figure 4—source data 1. Analysis of the chemical shift perturbations during the HSQC-titration of 15N-RfaH with ops12.
    DOI: 10.7554/eLife.36349.011
    Supplementary file 1. RfaH-opsTEC_model.pdb: PDB file of the RfaH:opsTEC model.
    elife-36349-supp1.pdb (2.2MB, pdb)
    DOI: 10.7554/eLife.36349.016
    Transparent reporting form
    DOI: 10.7554/eLife.36349.017

    Data Availability Statement

    Diffraction data have been deposited in PDB under the accession code 5OND. All data generated or analyzed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 2 and 4. The PDB file of the RfaH:ops TEC model has been provided.

    The following dataset was generated:

    Zuber PK, author; Artsimovitch I, author; Roesch P, author; Knauer SH, author. RfaH from Escherichia coli in complex with ops DNA. 2018 http://www.rcsb.org/pdb/search/structidSearch.do?structureId=5OND Publicly available at the RCSB Protein Data Bank (accession no. 5OND)


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES