Abstract
The RNA polymerase (RNAP) of bacteriophage T7 is a single subunit enzyme that can transcribe DNA to RNA in the absence of additional protein factors. In this work, we present a model of T7 RNAP translocation during elongation. Based on structural information and experimental data from single-molecule force measurements, we show that a small component of facilitated translocation or power stroke coexists with the Brownian-ratchet-driven motions, and plays a crucial role in nucleotide selection at pre-insertion. The facilitated translocation is carried out by the conserved Tyr639 that moves its side chain into the active site, pushing aside the 3′-end of the RNA, and forming a locally stabilized post-translocation intermediate. Pre-insertion of an incoming nucleotide into this stabilized intermediate state ensures that Tyr639 closely participates in selecting correct nucleotides. A similar translocation mechanism has been suggested for multi-subunit RNAPs involving the bridge-helix bending. Nevertheless, the bent bridge-helix sterically prohibits nucleotide binding in the post-transolocation intermediate analog; moreover, the analog is not stabilized unless an inhibitory protein factor binds to the enzyme. Using our scheme, we also compared the efficiencies of different strategies for nucleotide selection, and examined effects of facilitated translocation on forward tracking.
Introduction
During gene transcription (1), RNA polymerases (RNAPs) act as molecular motors (2,3). They move processively along double-stranded (ds) DNA to synthesize a complementary RNA strand from the template DNA strand. The free energy fueling RNAP elongation comes from binding and incorporating nucleotides for RNA synthesis. RNAPs come in two molecular architectures: the single subunit RNAPs from certain viral and mitochondrial species (4), and the multi-subunit RNAPs from bacteria, eukaryotes, and archaea (5,6). The single subunit RNAPs share many biochemical properties with the more common multi-subunit RNAPs, the most essential of which is their two-magnesium ion catalysis mechanism that employs both RNA and DNA polymerases (7,8). However, there appear to be no structural or sequence similarities between the single and multi-subunit RNAPs. Instead, the single subunit RNAPs resemble, in both sequence and structure, the Family I DNA polymerases (DNAPs) (9,10).
The RNAP from bacteriophage T7 is a prototype single subunit RNAP (11–13). It can carry out all transcriptional functions, from initiation and elongation to termination, without additional protein partners. The molecular architecture of the T7 RNAP resembles the hand-like configuration of DNAPs (13,14), with the active site for nucleotide incorporation located on the palm of the hand. Among the single subunit RNAPs and some of DNAPs, the highly conserved O-helix from the fingers subdomain abuts the active site (see Fig. 1, a and b). The O-helix, along with the fingers subdomain, moves between open and closed conformations during each nucleotide addition or elongation cycle. Due to their small size and self-sufficiency, T7 RNAPs are widely used for synthesizing specific transcripts. This RNAP makes an ideal model system to study the transcriptional process in its simplest form and it has been investigated extensively in structural (15–18) and biochemical studies (19–22), and in single-molecule measurements as well (23–26).
During transcription elongation of RNAP, each nucleotide addition cycle can be viewed as taking place in two stages: polymerization and translocation. In the polymerization stage, a new RNA 3′-end is generated from an incoming NTP by phosphoryl transfer and pyrophosphate (PPi) dissociation. In the translocation stage, RNAP moves 1-nt forward on the dsDNA, breaking 1-bp downstream and reannealing 1-bp upstream. At the same time, the 3′-end of the RNA moves along with its template DNA, vacating the active site for the next incoming NTP (see Fig. 1 c). Concurrently, the 5′-end of the RNA is released 1-nt from the DNA-RNA hybrid, resulting in a dynamically growing RNA transcript. In addition to normal elongation and translocation, multi-subunit RNAPs can track either forward or backward (see Fig. 1 c). In forward-tracking (or hypertranslocation), the RNAP hops forward without synthesizing RNA. This can shorten the DNA-RNA hybrid and may lead to termination (22,27). In back-tracking, the 3′-end RNA is extruded (opposite to that in normal translocation) into an exit channel as the RNAP moves backward along the dsDNA (28–31). Back-tracking, however, has not been detected in T7 RNAP.
Most experimental evidence suggests that RNAPs use a Brownian-ratchet mechanism of translocation (32–34) wherein the RNAP fluctuates back and forth equally fast (Brownian) until the incoming NTP binds, preventing backward movement (the ratchet), producing a net forward motion (20,24,35,36). Interestingly, single-molecule force measurements of T7 RNAP showed that there is a slight free energy bias (∼1.3 kBT) toward the post-translocated state compared with the pre-translocated state (24). However, experiments still favor a Brownian ratchet mechanism of translocation over a power-stroke mechanism—the latter requiring a significant free energy drop to drive translocation coupled with the chemical transition (37). Controversially, high-resolution structural studies of T7 RNAP intimate a power-stroke translocation mechanism in which PPi release triggers a rotational pivoting of the O-helix from the closed to open conformation, directly driving the RNAP translocation (13,16).
In this work, we examine the translocation mechanism more closely to resolve these conflicting views. The translocation dynamics take place on a millisecond timescale, too fast to be examined thoroughly in experiments but still much too slow for atomistic molecular dynamics simulations, which are limited to nano- to microseconds. To address this difficulty, we construct a semi-phenomenological model of RNAP elongation combining single-molecule force measurements (24,25) with information from structural studies (16,17). To accomplish this, we regard published high-resolution structures as highly populated intermediate states in the enzymatic cycle. There are, however, more sparsely populated intermediate states that are unlikely to be captured by crystallography. This viewpoint allows us to extrapolate contemporary experimental knowledge to finer timescales, and to provide guidance for further detailed structure-dynamics studies.
To focus on elongation-translocation, we dissect translocation into two parallel paths as follows. The RNAP can either translocate without a free energy bias (as in pure Brownian motion) or the translocation can be locally biased (or facilitated) via pushing by the side chain of Tyr639 that resides on the C-terminal of the O-helix. Tyr639 is highly conserved and is essential for selecting rNTP over dNTP (38). In building the model, we utilize the same set of high-resolution structures from which the power-stroke mechanism was proposed (16). We nevertheless assume that PPi release precedes translocation, and that the O-helix opens partially upon PPi release. These assumptions are supported by recent molecular dynamics simulation studies on DNA polymerase I that is structurally homologous to T7 RNAP (39).
Importantly, the parallel-path scheme introduces an essential degree of freedom that links the two paths (Brownian and facilitated), i.e., fluctuation of the Tyr639 side chain IN and OUT of the active site. This degree of freedom is important for both translocation and transcription fidelity. Under this scheme, the facilitated path terminates in a locally stabilized post-translocated intermediate wherein Tyr639 stays close to the pre-insertion site (i.e., IN). In this position, Tyr639 can screen incoming nucleotides, thus ensuring transcription fidelity. Without this energy stabilization or bias, Tyr639 would frequently fluctuate (i.e., OUT) far from the pre-insertion site, impairing nucleotide selection and hence transcription fidelity.
Another important property of the parallel-path scheme is that it provides a common representation of the translocation-elongation kinetics for both single and multi-subunit RNAPs. Our model of T7 RNAP translocation is similar in some respects to the two-pawl-ratchet model previously proposed for a bacterial multi-subunit RNAP (36). In Fig. 1 b, we show molecular views of the essential structural elements involved in translocation/elongation for both the single subunit T7 RNAP (left) and the multi-subunit RNAP (right). In the multi-subunit RNAPs (40,41), the bridge-helix alternates between bent and straight configurations (36,42), similar to Tyr639 IN and OUT, respectively. The bending of the bridge-helix also appears to assist RNAP translocation, as suggested for Tyr639. Moreover, in the multi-subunit RNAPs the trigger loop folds and unfolds during each cycle, similar to the O-helix closing and opening in T7 RNAP. Hence, the two types of polymerases can carry out analogous functions by different structural elements. We show that, under the common parallel-path scheme, the two systems show comparable—but different—reaction topologies.
Below we first calibrate the model with single-molecule force measurements and then address steady-state properties of T7 RNAP elongation. Details on how we constructed the model shown in Fig. 2 can be found in the Supporting Material. The focus of our work is to explain the functional role of facilitated translocation as the RNAP ratchets along DNA during elongation. Our model also shows how efficient nucleotide selection can be achieved during the elongation cycle, and examines whether facilitated translocation affects forward tracking. The model also suggests that the functional role of translocation is somewhat different for multi-subunit RNAPs.
Model and Results
Fitting elongation rates with single-molecule measurements
To calibrate our model, we fit numerical simulations of our model with single-molecule force measurement data for T7 RNAP elongation (24,25). The measurements were taken at different NTP concentrations, against a mechanical force 5 ≤ F ≤ 15 pN opposing forward translocation of RNAP. The experimental data had been fitted to a simplified three-state scheme, composed of pre-translocated, post-translocated, and NTP-loaded states. The fitting gave a maximum elongation rate (vmax) ∼130 nt/s. It also indicated a ∼1.3 kBT free energy bias during translocation to the post-translocated state (24).
Using the kinetic scheme shown in Fig. 2, we can also fit the force measurement data, as shown in Fig. 3 a (see the Supporting Material for technical details). The most essential feature of the model, compared with the simpler scheme used for fitting experiments (24), is that the translocation is now split into two parallel paths. The path (I → II) with the Tyr639 side chain positioned out of the active site (OUT) is assumed to be diffusive so that the RNAP can move forward and backward equally fast. The frequency of the movements is >∼103 s−1 so that at high NTP concentrations, translocation takes place much faster than the O-helix closing that happens at ∼102 s−1. At high NTP concentrations, the O-helix closing is assumed to be the rate-limiting step. Thus, even though the load force slows down translocation, it may not much affect the overall elongation rate. Along the alternate pathway (I′ → II′), however, the Tyr639 side chain inserts into the active site (IN) at the beginning (I′), pushing the 3′ end of the RNA out of the active site by the end (II′). Hence, Tyr639 directly assists the movement of the RNA-DNA hybrid. According to our fitting, the free energy bias along I′ →II′ is ∼3 kBT (see Table S1 in the Supporting Material: , with α assumed ∼1 kBT and β fitted ∼2 kBT). Consequently, the two parallel paths give an overall energy bias toward the post-translocated state by an amount between 0 and 3 kBT, consistent with the ∼1.3 kBT bias inferred from the experiments (24,25) (according to the standard errors of the measured elongation rates, the overall energy bias would vary around 1 ∼2 kBT, see the Supporting Material).
Steady-state probability distributions of elongation intermediates
We obtained the steady-state probability distributions for all intermediate states in the elongation cycle by simulating RNAP elongation for a long time (100 s for each trajectory), or equivalently by solving the master equation (see the Supporting Material). The probabilities at various NTP concentrations for these intermediate states are shown in Fig. 3 b.
At high NTP concentrations (588 μM by default to compare with experimental data), the RNAP elongates fast (∼100 nt/s). Under this condition the most populated state is the pre-insertion state III (PIII ∼ 57%). Those states that are moderately populated include: the stabilized post-translocated state II′ with Tyr639 IN (PII′ ∼ 16%), the NTP insertion or substrate state IV (PIV ∼11%), and the product state V (PV ∼ 9%). Other intermediates are only marginally populated: the pre-translocated states I or I′ (PI ∼ 3% and PI′ ∼ 1%) and the nonstabilized post-translocated state II with the Tyr639 OUT (PII ∼ 2%). Indeed, those more populated intermediates (II′, III, IV, and V in Fig. 2) had all been crystallized in the previous structural studies of T7 RNAP (16,17). This consistently shows that the captured structures are almost always those abundantly populated intermediates.
In particular, the pre-insertion state III populates most at high NTP concentrations. By contrast, the stabilized NTP insertion state IV is populated much less. The is due to the slow transition III → IV and the fast transition IV → V, such that the populations leave III slowly while passing through IV quickly. This property allows sufficient time for nucleotide selection upon pre-insertion (at III). Note that the stabilized NTP insertion state IV can become the most populated state if a noncatalytic analog of NTP is provided (16,17) (as IV → V inhibited).
Importantly, our results always show that PII′ > PII at post-translocation (see Fig. 3 b), which is essential for maintaining transcription fidelity. Because the Tyr639 side chain fluctuates IN and OUT rapidly (>104 s−1), II′ and II intermediates stay close to local equilibrium. By fitting the single-molecule experimental data we obtained the free energy difference between II and II′ (or I and II′, because EII − EII′ ∼ EI − EII′ ≡ β) ∼2 kBT > 0, giving a population bias of PII′/ PII ∼ 7–8. This explains why the post-translocation structure II′ with Tyr639 IN was detected and crystallized (16), but not the structure II with Tyr639 OUT (see Fig. 2).
In the pre-insertion state III, the Tyr639 side chain was also captured IN rather than OUT (16,17). From the structural comparisons shown in Fig. 1 b (left), we note that the Tyr639 side chain is close to the pre-insertion site when it is IN the active site (II′), whereas the side chain stays far from the pre-insertion site when it is OUT (II). Hence, the performance of nucleotide selection by Tyr639 at the pre-insertion site relies strongly on whether it populates more in the II′ or II state, or on the population bias PII′ > PII. When PII′ ≤ PII (i.e., without the post-translocation energy bias), the pre-inserted nucleotide would not be screened well because Tyr639 frequently stays (OUT) far from the pre-insertion site.
Comparing the ratcheting scheme with multi-subunit RNAPs
As mentioned earlier, using the parallel-path scheme allows us to describe the translocation-elongation kinetics for single and multi-subunit RNAPs in a unified way. The IN and OUT configurations of Tyr639 in the single subunit T7 RNAP correspond to the bent and straight configurations, respectively, of the bridge-helix in the multi-subunit RNAP (36,42,43). Moreover, the closing and opening of the O-helix in T7 RNAP correspond, respectively, to folding and unfolding of the trigger loop in the multi-subunit RNAPs.
There are distinctive features, however, that vary between the two ratcheting schemes (see Fig. S1 in the Supporting Material):
First, our structural examination of T7 RNAP reveals that binding or pre-insertion of an incoming nucleotide can take place without steric hindrance whether the Tyr639 side chain is IN (II′) or OUT (II) of the active site (see Fig. 1 b and Fig. 2, and see Fig. S1 a). Whereas for the multi-subunit RNAP, the nucleotide pre-insertion seems to take place only when the bridge-helix is straight (II) (see Fig. S1 b). The bending region of the bridge-helix (pink surface) has steric clashes with the pre-inserted nucleotide (van der Waals spheres inside the yellow oval), as demonstrated in Fig. 1 b, right.
Second, the configuration II′ is more stabilized than II as we have found for T7 RNAP, likely due to stacking of the side chain of Tyr639 with the end base pair of the RNA-DNA hybrid. Whereas in the multi-subunit RNAP, there is no evidence that the bent configuration of the bridge-helix (II′) is more stabilized than the straight one (II). In fact, a post-translocated state with the bridge-helix straight (II) was captured structurally (43,44), implying that II is likely more stabilized than II′ in the multi-subunit RNAP. A high-energy translocation intermediate captured in complex with an inhibitor is consistent with this idea (45).
In Eqs. S6 and S7 in the Supporting Material, we derive approximate formulas for the elongation rates of both types of RNAPs. Correspondingly, Fig. 3 c shows the curves of elongation rate, v versus the post-translocation free energy bias, β (or β + γ in general when γ ≡ EII − EI ≠ 0) for each type of the RNAPs. For T7 RNAP, we see that increasing the post-translocation free energy bias β beyond 2 kBT does not improve the elongation rate much (e.g., <5% increase) as the rate is already close to its saturation value. On the other hand, in comparison to a pure Brownian ratchet, the 3 kBT (α + β) free energy bias along I′ → II′, or an overall ∼1.3 kBT for both paths, does improve the elongation rate somewhat (e.g., ∼14% increase).
For multi-subunit RNAPs, however, it is likely that β < 0 (EII′ > EII; see Brueckner et al. (43) and Fig. 1 b in that work as well), with the bridge-helix more stabilized in the straight (II) than the bent (II′) configuration at post-translocation. In this case, the elongation rate still stays close to its saturation. Remarkably, the elongation rate can be significantly reduced when the bent configuration (II′) of the bridge-helix becomes more stabilized than the straight conformation (II) by binding some inhibitory factor such that β > 0 (EII > EII′; see Fig. 3 c).
Efficient nucleotide selection before stable insertion
Because T7 RNAP lacks a proof-reading mechanism, accurate nucleotide selection is crucial for transcription fidelity. For a wild-type T7 RNAP, experiments estimated that transcriptional errors occur at an average frequency ∼10−4 (46). If there were only one checkpoint for nucleotide selection in T7 RNAP, a free energy difference between wrong and right nucleotides at this checkpoint would be ΔE ∼ − kBT ln(10−4) ∼10 kBT to achieve this accuracy. Such a large energy disparity can hardly be maintained in a fluctuating protein cavity such as the active site of T7 RNAP. The site is not buried deeply inside the protein, and cocrystallized water molecules are observed close to the site (16) and the water molecules would smear the energy difference between the right and wrong nucleotides (47). Therefore, T7 RNAP would likely have two or more steps, or checkpoints, dedicated to its nucleotide selection.
Using our elongation scheme, we compared the energy efficiencies of several individual selection mechanisms, i.e., the error rates of elongation under different selection methods, using a constant energy penalty for differentiating between right and wrong nucleotides (see the Supporting Material for details). We find that selection methods (No. 1 and No. 2) that take place before the rate-limiting step (nucleotide insertion III → IV) are more efficient than (No. 3 and No. 4) that take place after. For example, using an energy of ∼5 kBT to reject the wrong nucleotide upon pre-insertion (No. 1) or to inhibit its insertion rate (No. 2) can achieve an error rate ∼10−2, whereas using the same amount of energy to reject the wrong nucleotide right after insertion (No. 3) or to inhibit the chemical reaction rate (No. 4) gives an error rate ∼10−1 (see Table S2). The property is closely related to where the rate-limiting transition is located in the elongation cycle. If one makes the chemical transition IV → V rate-limiting, then selections (Nos. 1–3) that take place before IV → V become similarly efficient, whereas the one after (No. 4) is still the least efficient. This happens because a rate-limiting forward transition presumably crosses a large activation barrier that can further hinder the backward transition. The hindered backward transition makes releasing the wrong nucleotide difficult.
Our results also show that, when there is no selection against wrong nucleotides at pre-insertion, or if the selection at pre-insertion costs an energy only at the thermal fluctuation level (1–2 kBT), the elongation rate becomes quite low. For example, the rate drops below 30 nt/s when single selection happens but not at pre-insertion (selections Nos. 2–4, see Table S2), which is much lower than the error-free rate ∼100 nt/s. These results suggest that the pre-insertion step is the most crucial checkpoint in T7 RNAP for achieving efficient nucleotide selection as well as maintaining a sufficiently high elongation rate.
Forward tracking with facilitated translocation
In addition to studying the effect of facilitated translocation on the nucleotide selection at pre-insertion, we also examined whether the facilitated translocation affects forward tracking. We assume that in the stabilized post-translocated state II′, T7 RNAP can randomly switch between regular elongation and forward tracking, i.e., moving forward without synthesizing RNA (illustrated in Fig. 1 c). Consequently, the RNA-DNA hybrid shortens and association between RNAP and the DNA weakens. Forward tracking likely takes place frequently at terminator regions causing intrinsic termination (22,27).
We examined the probability of T7 RNAP forward tracking around two terminator regions: one is T-ϕ, a late terminator found in the T7 genome (48), and the other is a threonine (thr) attenuator in Escherichia coli (49). To take into account sequence dependence during translocation (see the Supporting Material), we evaluated the free energy difference between the post (II) and pre-translocated states (I), γ, by a sum of energies calculated from unzipping/zipping of downstream/upstream of the dsDNA and RNA-DNA hybrid (50). Similarly we evaluated β by the translocation energy difference plus an additional energy contribution from Tyr639 stabilization at II′. We found that the forward tracking can take place at high efficiency at the terminator and at a very low efficiency at nonterminator regions, if one allows the forward rate to decrease exponentially with the stabilization energy of the RNA-DNA hybrid upstream. The forward tracking efficiency was determined by counting the number of extensive forward tracking events (beyond 3 nt) for every 100 trial events.
We then compared the forward-tracking efficiency (with the facilitated path I′ ↔ II′) with that of a pure Brownian ratchet scheme (without the facilitated path I′ ↔ II′). We found that the forward tracking efficiencies are almost identical in both cases. For example, when the Brownian ratchet case is tuned with 48 ± 3% forward tracking efficiency at the thr attenuator, the efficiency is 51 ± 5%, and further increase of the post-translocation energy bias (β + γ) hardly improves the efficiency (∼53 ± 5% maximum). In all these cases, the forward tracking efficiency at the nonterminator region remains at the same low value (∼2% for equally long regions). Similar trends were found also for the T-ϕ terminator. Hence, compared with the pure Brownian translocation case, the facilitated translocation in T7 RNAP does not enhance the likelihood of forward tracking. Note that the forward tracking may not be the mechanism for intrinsic termination (51,52), so the conclusion does not extend in general to termination.
Discussion
A large translocation free energy bias is not an advantage for sequence detection
In our model, T7 RNAP translocation is driven largely—but not completely—by a Brownian ratchet mechanism. Each translocation step involves changes in DNA and RNA structures that depend on sequence stabilities. Therefore, translocation can be utilized by RNAP to sense and respond to sequence signals along dsDNA. When there is no structural element or energy output from RNAP to assist in translocation, its movements are diffusive. The RNAP moves forward and backward equally fast until an incoming NTP binds to prevent the backward movements. Because the RNAP itself does not induce a free energy bias along the translocation path, sequence variations on the DNA can readily be detected, and RNAP can respond to the variation by slowing down, tracking forward or backward. By contrast, a large translocation energy bias can reduce the activation barrier and accelerate the forward movements, while it can mask relatively small energy differences arising from different DNA sequences. In addition, the translocation is not rate-limiting during the elongation cycle, so a large translocation energy bias can hardly improve the overall elongation rate. In our translocation-elongation model of T7 RNAP, for example, the sequence signal is encoded in the translocation energy γ that affects the elongation rate (see Eq. S6 in the Supporting Material). When the post-translocation free energy bias (β + γ) becomes large, the elongation rate increases to saturation independent of the sequence signal (γ). In multi-subunit RNAPs, a large post-translocation free energy bias simply stalls the elongation (see Eq. S7 in the Supporting Material and Fig. 3 c). Hence, a large translocation energy bias of RNAP does not improve elongation rate nor confer an advantage for sequence detection during transcription elongation.
A small post-translocation free energy bias in T7 RNAP aids in transcription fidelity
Our analysis of T7 RNAP data shows that a small free energy bias that stabilizes the post-translocated intermediate can aid in nucleotide selection at pre-insertion, without interfering with sequence detection.
Using experimental data from single-molecule force measurements (24,25), we identified a post-translocation free energy bias of ∼2 kBT imposed by the RNAP. The structural elements that achieve this bias include, but are not necessarily limited to, Tyr639 at the C-terminal end of the O-helix (see Fig. 1). The translocation can take place without a free energy bias (i.e., via Brownian path I → II in Fig. 2), during which the 3′-end of the RNA moves out of the active site without assistance (with the Tyr639 side chain OUT). Alternatively, the side chain of Tyr639 could squeeze IN the active site at the pre-translocated state when the active site is still occupied by the 3′-end of the RNA. Because the 3′-end of the RNA appears quite flexible (53), the squeezing (I → I′) costs but a small amount of free energy (e.g., ∼1 kBT). The unstable intermediate (I′) can be easily driven by ∼3 kBT free energy bias toward a more stabilized post-translocated configuration (II′), in which the 3′-end of the RNA is pushed out of the active site, leaving only the side-chain Tyr639 in occupancy. Indeed, this stabilized intermediate (II′) had been captured in high-resolution structural studies (16), and led to the suggestion that PPi release powers the translocation mechanism. Our studies indicate, however, that the intermediate (II′) results from the small power stroke exerted by Tyr639, not from PPi release. In the post-translocated state, II′ is stabilized by the Tyr639 side chain stacking with the base pair at the end of the RNA-DNA hybrid (see Fig. 2). Due to the side-chain fluctuation, there also exists a nonstabilized, and therefore lowly populated post-translocated state II, in which Tyr639 is OUT. Because the energy changes among the translocation intermediates (I, I′, II′, and II) are small, the translocation mechanism appears to be largely a Brownian ratchet.
Although the post-translocation energy bias appears quite small, there is still a way to distinguish the biased translocation from the pure Brownian case. At high NTP concentrations, the post-translocated intermediate II′ is moderately populated during the elongation cycle. Its population grows as NTP concentration decreases, and becomes dominant at very low NTP concentrations (see Fig. 3 b). By comparison, in the pure Brownian ratchet scheme, both the pre-translocated (I) and the post-translocated states (II) become dominant and equally populated (∼50% each) at low NTP concentrations. Hence, to test whether the translocation is purely Brownian or slightly biased, one should examine whether there exist two equally populated conformations (Brownian) or one dominant conformation (biased) at very low NTP concentrations, close to equilibrium. Because sequence effects can locally bias state populations and interfere with accurate measurements, experimental tests should be conducted when RNAP transcribes homogeneous DNA sequences.
Taking into account both translocation paths, the overall free energy bias of translocation is ∼1.3 kBT (24). This small post-translocation energy bias can actually slightly improve the elongation rate compared to a pure Brownian ratchet, whereas further increasing the energy bias improves the elongation rate very little (see Fig. 3 c). Thus, the major role of the facilitated translocation in T7 RNAP is not to improve the elongation rate, it is to bring about a locally stabilized post-translocated intermediate (II′) before NTP pre-insertion. In this intermediate state, the Tyr639 side chain inserts IN next to the pre-insertion site; its hydroxyl group can subsequently coordinate with a magnesium ion such as to discriminate between ribo- versus deoxy-ribo nucleotides (17) when the nucleotide pre-inserts. As the side chain of Tyr639 is OUT, its hydroxyl group moves farther (∼6 Å; see Fig. 1 b) away from the pre-insertion site and cannot participate in nucleotide selection. In brief, by consuming only a small amount of energy to stabilize the Tyr639 side chain IN the active site, T7 RNAP can guarantee an ∼80∼90% chance of keeping the tyrosine abutting the pre-insertion site for nucleotide selection.
Our studies point out that there are two reasons why the pre-insertion is the most crucial checkpoint of nucleotide selection. First, it appears energetically more efficient to reject the wrong nucleotide before its slow insertion than afterward. This property depends on where the rate-limiting step is located in the nucleotide addition cycle: selection before the rate-limiting transition always seems more efficient than selection after. Second, to maintain both high fidelity (low error rate) and sufficiently high elongation rate, it is necessary to have an energy above the thermal fluctuation level to filter out the incorrect nucleotide at pre-insertion. Exactly how—and by how much—Tyr639 and other structural elements contribute to nucleotide selection awaits further studies.
In our further examination, forward tracking is allowed during elongation via the stabilized post-translocated intermediate (II′). The forward tracking seems to happen at high efficiencies at the terminator sequences. However, we have not identified any substantial effect that the facilitated translocation brings to the forward tracking compared to pure Brownian ratchet translocation (with forward tracking via II). Indeed, increasing the post-translocation energy bias along the facilitated path hardly affects the II′ population in T7 RNAP. However, this property does not necessarily hold for multi-subunit RNAPs as increasing the post-translocation bias does substantially increase the population of II′ as well as inhibit RNAP elongation.
What about translocation and ratcheting in the multi-subunit RNAPs?
In comparison with the single subunit T7 RNAP, we suggest a slightly different ratcheting scheme under our parallel-path scheme for the multi-subunit RNAPs (see Fig. S1). In the multi-subunit RNAPs, bending and straightening of the bridge helix are analogous to IN and OUT fluctuations of the Tyr639 side chain in T7 RNAP. In this alternative ratcheting scheme, however, there are two essential features that distinguish the multi-subunit RNAPs from T7 RNAP:
First, pre-insertion of nucleotide can only happen when the bridge helix is straight (II → III allowed), but not when it is bent (II′ → III forbidden). This is because the bent bridge helix sterically occludes the pre-insertion site (see Fig. 1 b, right). In T7 RNAP, by contrast, the nucleotide can pre-insert at either IN (II′) or OUT (II) configuration of Tyr639 without steric hindrance.
Second, the bridge helix bent configuration in the post-translocated state (II′) does not appear to be stabilized as is the corresponding IN configuration of Tyr639 in T7 RNAP. Indeed, the bent configuration seems to be less stable than the straight configuration of the bridge helix in the post-translocated state because crystal structures of both the post-translocated state (II) and the pre-insertion state (III) of the multi-subunit RNAP were caught with the straight form of the bridge helix (43,44). Whereas in T7 RNAP, it is Tyr639 IN that was structurally captured (II′ and III) but not OUT (16).
Nevertheless, the bent configuration of the bridge helix has been structurally identified when an inhibitory factor was bound to the multi-subunit RNAP. Previously, a toxin-bound translocation intermediate (α-amanitin) was structurally resolved (45). The bridge helix is bent in this structure with the trigger loop in a wedged conformation. The captured structure is likely a stabilized form of a high-energy intermediate in translocation. This intermediate corresponds nicely to II′ in our model (Fig. 2). According to our interpretation, the post-translocation bias β + γ (≡ EII− EII′) < 0 is originally set for the multi-subunit RNAP, while the toxin stabilizes the II′ intermediate such that β + γ > 0 (as in T7 RNAP). However, because the II′ configuration cannot bind incoming nucleotide in the multi-subunit RNAP, it competes with the nucleotide for the binding (pre-insertion) site. In multi-subunit RNAPs, only a few kBT stabilization of II′ can lower the effective concentration of NTP significantly and cause a large drop of the elongation rate (see Fig. 3 c). Therefore, II′ appears to be a target intermediate state for inhibitory factors of elongation in the multi-subunit RNAPs: once the factor stabilizes the state (i.e., by increasing the post-translocation free energy bias), the elongation activity is greatly reduced. Notably, in recent high-resolution studies of a multi-subunit RNAP from a bacterial species (54), a ratcheted state has been identified that is in complex with a transcription inhibitor (Gfh1). This inhibitor protein occludes the channel for NTP entry into the active site. The bridge helix is kinked in the ratcheted state. Accordingly, this structure appears to be a stabilized form of II′ as well. Once II′ becomes more stabilized than the other post-translocated intermediate II (bridge helix straight), II′ dominantly populates at low NTP concentration.
In summary, we have dissected the ratchet mechanism of T7 RNAP during its transcription elongation. A small post-translocation free energy bias (∼2 kBT) is identified and its functional role is proposed: The energy bias creates a locally stabilized post-translocated intermediate in which the highly conserved Tyr639 stays close to the pre-insertion site for nucleotide selection. Without this post-translocation energy bias (equivalently, local stabilization) the tyrosine side chain frequently moves away from the pre-insertion site so that nucleotide selection is less effective, lowering transcription fidelity. Further studies will be necessary to determine if other structural elements participate in nucleotide selection at pre-insertion, how subsequent nucleotide selection proceeds, and whether a similar property exists for other single subunit polymerases.
Remarkably, one can identify analogous functional elements and build an analogous but distinct ratcheting scheme for the multi-subunit RNAPs. The variations of structural features of the RNAPs lead to different reaction topologies. The post-translocation intermediate with the bent bridge-helix does not allow NTP binding. Nor is it destabilized unless some inhibitory factor binds and enhances the post-translocation energy bias. Hence, bending of the bridge-helix, in addition to coordinating backtracking, appears to support inhibitory control (36).
Acknowledgments
J.Y. is supported by National Science Foundation grant No. DMS 1062396. G.O. was supported by National Science Foundation grant No. DMS 0414039.
Footnotes
This is an Open Access article distributed under the terms of the Creative Commons-Attribution Noncommercial License (http://creativecommons.org/licenses/by-nc/2.0/), which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Contributor Information
Jin Yu, Email: jinyu@berkeley.edu.
George Oster, Email: goster@berkeley.edu.
Supporting Material
References
- 1.Jacob F., Monod J. On the regulation of gene activity. Cold Spring Harb. Symp. Quant. Biol. 1961;26:193–211. doi: 10.1101/sqb.1961.026.01.048. [DOI] [PubMed] [Google Scholar]
- 2.Buc H., Strick T. The Royal Society of Chemistry; Cambridge, UK: 2009. RNA Polymerase as Molecular Motors. [Google Scholar]
- 3.Greive S.J., von Hippel P.H. Thinking quantitatively about transcriptional regulation. Nat. Rev. Mol. Cell Biol. 2005;6:221–232. doi: 10.1038/nrm1588. [DOI] [PubMed] [Google Scholar]
- 4.McAllister W.T., Raskin C.A. The phage RNA polymerases are related to DNA polymerases and reverse transcriptases. Mol. Microbiol. 1993;10:1–6. doi: 10.1111/j.1365-2958.1993.tb00897.x. [DOI] [PubMed] [Google Scholar]
- 5.Borukhov S., Nudler E. RNA polymerase: the vehicle of transcription. Trends Microbiol. 2008;16:126–134. doi: 10.1016/j.tim.2007.12.006. [DOI] [PubMed] [Google Scholar]
- 6.Hirata A., Murakami K.S. Archaeal RNA polymerase. Curr. Opin. Struct. Biol. 2009;19:724–731. doi: 10.1016/j.sbi.2009.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sosunov V., Sosunova E., Goldfarb A. Unified two-metal mechanism of RNA synthesis and degradation by RNA polymerase. EMBO J. 2003;22:2234–2244. doi: 10.1093/emboj/cdg193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Steitz T.A., Steitz J.A. A general two-metal-ion mechanism for catalytic RNA. Proc. Natl. Acad. Sci. USA. 1993;90:6498–6502. doi: 10.1073/pnas.90.14.6498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ollis D.L., Brick P., Steitz T.A. Structure of large fragment of Escherichia coli DNA polymerase I complexed with dTMP. Nature. 1985;313:762–766. doi: 10.1038/313762a0. [DOI] [PubMed] [Google Scholar]
- 10.Sousa R., Chung Y.J., Wang B.C. Crystal structure of bacteriophage T7 RNA polymerase at 3.3 Å resolution. Nature. 1993;364:593–599. doi: 10.1038/364593a0. [DOI] [PubMed] [Google Scholar]
- 11.Kochetkov S.N., Rusakova E.E., Tunitskaya V.L. Recent studies of T7 RNA polymerase mechanism. FEBS Lett. 1998;440:264–267. doi: 10.1016/s0014-5793(98)01484-7. [DOI] [PubMed] [Google Scholar]
- 12.Sousa R., Mukherjee S., Kivie M. Progress in Nucleic Acid Research and Molecular Biology. Academic Press; New York: 2003. T7 RNA polymerase; pp. 1–41. [DOI] [PubMed] [Google Scholar]
- 13.Steitz T.A. The structural changes of T7 RNA polymerase from transcription initiation to elongation. Curr. Opin. Struct. Biol. 2009;19:683–690. doi: 10.1016/j.sbi.2009.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Steitz T.A. DNA polymerases: structural diversity and common mechanisms. J. Biol. Chem. 1999;274:17395–17398. doi: 10.1074/jbc.274.25.17395. [DOI] [PubMed] [Google Scholar]
- 15.Tahirov T.H., Temiakov D., Yokoyama S. Structure of a T7 RNA polymerase elongation complex at 2.9 Å resolution. Nature. 2002;420:43–50. doi: 10.1038/nature01129. [DOI] [PubMed] [Google Scholar]
- 16.Yin Y.W., Steitz T.A. The structural mechanism of translocation and helicase activity in T7 RNA polymerase. Cell. 2004;116:393–404. doi: 10.1016/s0092-8674(04)00120-5. [DOI] [PubMed] [Google Scholar]
- 17.Temiakov D., Patlan V., Vassylyev D.G. Structural basis for substrate selection by T7 RNA polymerase. Cell. 2004;116:381–391. doi: 10.1016/s0092-8674(04)00059-5. [DOI] [PubMed] [Google Scholar]
- 18.Durniak K.J., Bailey S., Steitz T.A. The structure of a transcribing T7 RNA polymerase in transition from initiation to elongation. Science. 2008;322:553–557. doi: 10.1126/science.1163433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Brieba L.G., Sousa R. Roles of histidine 784 and tyrosine 639 in ribose discrimination by T7 RNA polymerase. Biochemistry. 2000;39:919–923. doi: 10.1021/bi992324+. [DOI] [PubMed] [Google Scholar]
- 20.Guo Q., Sousa R. Translocation by T7 RNA polymerase: a sensitively poised Brownian ratchet. J. Mol. Biol. 2006;358:241–254. doi: 10.1016/j.jmb.2006.02.001. [DOI] [PubMed] [Google Scholar]
- 21.Anand V.S., Patel S.S. Transient state kinetics of transcription elongation by T7 RNA polymerase. J. Biol. Chem. 2006;281:35677–35685. doi: 10.1074/jbc.M608180200. [DOI] [PubMed] [Google Scholar]
- 22.Zhou Y., Navaroli D.M., Martin C.T. Dissociation of halted T7 RNA polymerase elongation complexes proceeds via a forward-translocation mechanism. Proc. Natl. Acad. Sci. USA. 2007;104:10352–10357. doi: 10.1073/pnas.0606306104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Skinner G.M., Baumann C.G., Hoggett J.G. Promoter binding, initiation, and elongation by bacteriophage T7 RNA polymerase. A single-molecule view of the transcription cycle. J. Biol. Chem. 2004;279:3239–3244. doi: 10.1074/jbc.M310471200. [DOI] [PubMed] [Google Scholar]
- 24.Thomen P., Lopez P.J., Heslot F. Unraveling the mechanism of RNA-polymerase forward motion by using mechanical force. Phys. Rev. Lett. 2005;94:128102. doi: 10.1103/PhysRevLett.94.128102. [DOI] [PubMed] [Google Scholar]
- 25.Thomen P., Lopez P.J., Heslot F. T7 RNA polymerase studied by force measurements varying cofactor concentration. Biophys. J. 2008;95:2423–2433. doi: 10.1529/biophysj.107.125096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kim J.H., Larson R.G. Single-molecule analysis of 1D diffusion and transcription elongation of T7 RNA polymerase along individual stretched DNA molecules. Nucleic Acids Res. 2007;35:3848–3858. doi: 10.1093/nar/gkm332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yarnell W.S., Roberts J.W. Mechanism of intrinsic transcription termination and antitermination. Science. 1999;284:611–615. doi: 10.1126/science.284.5414.611. [DOI] [PubMed] [Google Scholar]
- 28.Wang D., Bushnell D.A., Kornberg R.D. Structural basis of transcription: backtracked RNA polymerase II at 3.4 Ångstrom resolution. Science. 2009;324:1203–1206. doi: 10.1126/science.1168729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Galburt E.A., Grill S.W., Bustamante C. Backtracking determines the force sensitivity of RNAP II in a factor-dependent manner. Nature. 2007;446:820–823. doi: 10.1038/nature05701. [DOI] [PubMed] [Google Scholar]
- 30.Shaevitz J.W., Abbondanzieri E.A., Block S.M. Backtracking by single RNA polymerase molecules observed at near-base-pair resolution. Nature. 2003;426:684–687. doi: 10.1038/nature02191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Komissarova N., Kashlev M. Transcriptional arrest: Escherichia coli RNA polymerase translocates backward, leaving the 3′ end of the RNA intact and extruded. Proc. Natl. Acad. Sci. USA. 1997;94:1755–1760. doi: 10.1073/pnas.94.5.1755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wang H.-Y., Elston T., Oster G. Force generation in RNA polymerase. Biophys. J. 1998;74:1186–1202. doi: 10.1016/S0006-3495(98)77834-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Peskin C.S., Odell G.M., Oster G.F. Cellular motions and thermal fluctuations: the Brownian ratchet. Biophys. J. 1993;65:316–324. doi: 10.1016/S0006-3495(93)81035-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Simon S.M., Peskin C.S., Oster G.F. What drives the translocation of proteins? Proc. Natl. Acad. Sci. USA. 1992;89:3770–3774. doi: 10.1073/pnas.89.9.3770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Abbondanzieri E.A., Greenleaf W.J., Block S.M. Direct observation of base-pair stepping by RNA polymerase. Nature. 2005;438:460–465. doi: 10.1038/nature04268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bar-Nahum G., Epshtein V., Nudler E. A ratchet mechanism of transcription elongation and its control. Cell. 2005;120:183–193. doi: 10.1016/j.cell.2004.11.045. [DOI] [PubMed] [Google Scholar]
- 37.Bustamante C., Keller D., Oster G. The physics of molecular motors. Acc. Chem. Res. 2001;34:412–420. doi: 10.1021/ar0001719. [DOI] [PubMed] [Google Scholar]
- 38.Sousa R., Padilla R. A mutant T7 RNA polymerase as a DNA polymerase. EMBO J. 1995;14:4609–4621. doi: 10.1002/j.1460-2075.1995.tb00140.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Golosov A.A., Warren J.J., Karplus M. The mechanism of the translocation step in DNA replication by DNA polymerase I: a computer simulation analysis. Structure. 2010;18:83–93. doi: 10.1016/j.str.2009.10.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Vassylyev D.G., Vassylyeva M.N., Artsimovitch I. Structural basis for transcription elongation by bacterial RNA polymerase. Nature. 2007;448:157–162. doi: 10.1038/nature05932. [DOI] [PubMed] [Google Scholar]
- 41.Vassylyev D.G., Vassylyeva M.N., Landick R. Structural basis for substrate loading in bacterial RNA polymerase. Nature. 2007;448:163–168. doi: 10.1038/nature05931. [DOI] [PubMed] [Google Scholar]
- 42.Gnatt A.L., Cramer P., Kornberg R.D. Structural basis of transcription: an RNA polymerase II elongation complex at 3.3 Å resolution. Science. 2001;292:1876–1882. doi: 10.1126/science.1059495. [DOI] [PubMed] [Google Scholar]
- 43.Brueckner F., Ortiz J., Cramer P. A movie of the RNA polymerase nucleotide addition cycle. Curr. Opin. Struct. Biol. 2009;19:294–299. doi: 10.1016/j.sbi.2009.04.005. [DOI] [PubMed] [Google Scholar]
- 44.Kettenberger H., Armache K.J., Cramer P. Complete RNA polymerase II elongation complex structure and its interactions with NTP and TFIIS. Mol. Cell. 2004;16:955–965. doi: 10.1016/j.molcel.2004.11.040. [DOI] [PubMed] [Google Scholar]
- 45.Brueckner F., Cramer P. Structural basis of transcription inhibition by α-amanitin and implications for RNA polymerase II translocation. Nat. Struct. Mol. Biol. 2008;15:811–818. doi: 10.1038/nsmb.1458. [DOI] [PubMed] [Google Scholar]
- 46.Huang J., Brieba L.G., Sousa R. Misincorporation by wild-type and mutant T7 RNA polymerases: identification of interactions that reduce misincorporation rates by stabilizing the catalytically incompetent open conformation. Biochemistry. 2000;39:11571–11580. doi: 10.1021/bi000579d. [DOI] [PubMed] [Google Scholar]
- 47.Petruska J., Sowers L.C., Goodman M.F. Comparison of nucleotide interactions in water, proteins, and vacuum: model for DNA polymerase fidelity. Proc. Natl. Acad. Sci. USA. 1986;83:1559–1562. doi: 10.1073/pnas.83.6.1559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Dunn J.J., Studier F.W. Complete nucleotide sequence of bacteriophage T7 DNA and the locations of T7 genetic elements. J. Mol. Biol. 1983;166:477–535. doi: 10.1016/s0022-2836(83)80282-4. [DOI] [PubMed] [Google Scholar]
- 49.Jeng S.T., Gardner J.F., Gumport R.I. Transcription termination in vitro by bacteriophage T7 RNA polymerase. The role of sequence elements within and surrounding a ρ-independent transcription terminator. J. Biol. Chem. 1992;267:19306–19312. [PubMed] [Google Scholar]
- 50.Bai L., Shundrovsky A., Wang M.D. Sequence-dependent kinetic model for transcription elongation by RNA polymerase. J. Mol. Biol. 2004;344:335–349. doi: 10.1016/j.jmb.2004.08.107. [DOI] [PubMed] [Google Scholar]
- 51.Datta K., von Hippel P.H. Direct spectroscopic study of reconstituted transcription complexes reveals that intrinsic termination is driven primarily by thermodynamic destabilization of the nucleic acid framework. J. Biol. Chem. 2008;283:3537–3549. doi: 10.1074/jbc.M707998200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Yager T.D., von Hippel P.H. A thermodynamic analysis of RNA transcript elongation and termination in Escherichia coli. Biochemistry. 1991;30:1097–1118. doi: 10.1021/bi00218a032. [DOI] [PubMed] [Google Scholar]
- 53.Woo H.-J., Liu Y., Sousa R. Molecular dynamics studies of the energetics of translocation in model T7 RNA polymerase elongation complexes. Proteins. Structure, Function, and Bioinformatics. 2008;73:1021–1036. doi: 10.1002/prot.22134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Tagami S., Sekine S.-i., Yokoyama S. Crystal structure of bacterial RNA polymerase bound with a transcription inhibitor protein. Nature. 2010;468:978–982. doi: 10.1038/nature09573. [DOI] [PubMed] [Google Scholar]
- 55.Vassylyev D.G., Sekine S., Yokoyama S. Crystal structure of a bacterial RNA polymerase holoenzyme at 2.6 Å resolution. Nature. 2002;417:712–719. doi: 10.1038/nature752. [DOI] [PubMed] [Google Scholar]
- 56.Humphrey W., Dalke A., Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. 27–28. [DOI] [PubMed] [Google Scholar]
- 57.Rice D.M., Wittebort R.J., Scheraga H.A. Rotational jumps of the tyrosine side chain in crystalline enkephalin. Hydrogen 2NMR line shapes for aromatic ring motions in solids. J. Am. Chem. Soc. 1981;103:7707–7710. [Google Scholar]
- 58.Gillespie D. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comp. Phys. 1976;22:403–434. [Google Scholar]
- 59.Senior A. Catalytic sites of Escherichia coli F1-ATPase. J. Bioenerg. Biomembr. 1992;24:479–483. doi: 10.1007/BF00762365. [DOI] [PubMed] [Google Scholar]
- 60.Erie D.A., Yager T.D., von Hippel P.H. The single-nucleotide addition cycle in transcription: a biophysical and biochemical perspective. Annu. Rev. Biophys. Biomol. Struct. 1992;21:379–415. doi: 10.1146/annurev.bb.21.060192.002115. [DOI] [PubMed] [Google Scholar]
- 61.Zuker M. MFold web server for nucleic acid folding and hybridization prediction. Nucl. Acids Res. 2003;31:3406–3415. doi: 10.1093/nar/gkg595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Sugimoto N., Nakano S., Sasaki M. Thermodynamic parameters to predict stability of RNA/DNA hybrid duplexes. Biochemistry. 1995;34:11211–11216. doi: 10.1021/bi00035a029. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.