Significance
Cellular RNA polymerase (RNAP) carries out RNA synthesis and proofreading reactions utilizing a mobile catalytic domain known as the trigger loop/helix. In cyanobacteria, this essential domain acquired a large Si3 insertion during the course of evolution. Despite its elongated shape and large swinging motion associated with the RNA synthesis reaction, Si3 is effectively accommodated within cyRNAP, with no impact on the catalytic reaction associated with the trigger loop/helix. Understanding the significance of Si3 in cyanobacteria and chloroplasts is expected to reveal unique transcription mechanism in photosynthetic organisms.
Keywords: cyanobacteria, RNA polymerase, transcription, cryo-EM
Abstract
Cyanobacteria and evolutionarily related chloroplasts of algae and plants possess unique RNA polymerases (RNAPs) with characteristics that distinguish them from canonical bacterial RNAPs. The largest subunit of cyanobacterial RNAP (cyRNAP) is divided into two polypeptides, β′1 and β′2, and contains the largest known lineage-specific insertion domain, Si3, located in the middle of the trigger loop and spanning approximately half of the β′2 subunit. In this study, we present the X-ray crystal structure of Si3 and the cryo-EM structures of the cyRNAP transcription elongation complex plus the NusG factor with and without incoming nucleoside triphosphate (iNTP) bound at the active site. Si3 has a well-ordered and elongated shape that exceeds the length of the main body of cyRNAP, fits into cavities of cyRNAP in the absence of iNTP bound at the active site and shields the binding site of secondary channel-binding proteins such as Gre and DksA. A small transition from the trigger loop to the trigger helix upon iNTP binding results in a large swing motion of Si3; however, this transition does not affect the catalytic activity of cyRNAP due to its minimal contact with cyRNAP, NusG, or DNA. This study provides a structural framework for understanding the evolutionary significance of these features unique to cyRNAP and chloroplast RNAP and may provide insights into the molecular mechanism of transcription in specific environment of photosynthetic organisms and organelle.
Cyanobacteria and chloroplasts of algae and higher plants are characterized by oxygen-evolving photosynthesis and are phylogenetically closely related. These genomes are transcribed by a bacterial-type RNA polymerase (cyRNAP and plastid-encoded RNAP, PEP, respectively) aided by transcription initiation σ factors for recognition of specific promoters (1–3). Although cyRNAPs and chloroplast PEPs retain the fundamental functions of bacterial RNAPs, they possess several characteristics that distinguish them from canonical bacterial RNAPs.
First, the largest subunit of cyRNAP is separated into two polypeptides, β′1 and β′2, which are encoded by the rpoC1 and rpoC2 genes, respectively (Fig. 1A). In Synechococcus elongatus, which is the cyanobacterium used for the cryo-EM structural study of RNAP described herein, the 624 residue β′1 and 1,318 residue β′2 subunits correspond to the amino (N)-terminal one-third and the carboxy (C)-terminal two-thirds of the 1,407 residue β′ subunit in Escherichia coli, respectively. A junction between the β′1 and β′2 subunits is positioned before the conserved region E (4, 5). The β′1 subunit contains the clamp and the catalytic double-psi-β-barrel domain coordinating a Mg2+ ion; the β′2 subunit contains the rim helix, bridge helix, trigger loop, and jaw domain.
Second, cyRNAP contains the largest known lineage-specific insertion domain, Si3 (645 residues), which spans approximately half the size of the β2’ subunit and is located in the middle of the trigger loop (Fig. 1A) (6, 7). The trigger loop plays a central role in nucleotide selection, RNA synthesis, and RNA cleavage during proofreading by cellular RNAPs (8). In the absence of nucleotide triphosphate (NTP) substrate, the tip of the trigger loop is located away from the active site (9). Upon binding of complementary incoming NTP (iNTP) at the active site, the trigger loop folds to form a trigger helix containing two α-helices, which extensively interacts with the base and triphosphate groups of iNTP and facilitates the nucleotidyl transfer reaction (8). The Si3 insertion is found in RNAPs of gram-negative bacteria in the middle of the trigger loop (evolutionarily conserved region G; Fig. 1A). Si3 is composed of repeats of the conserved sandwich-barrel hybrid motif (SBHM). E. coli RNAP contains two copies of SBHM (SI Appendix, Fig. S1A) (10), and sequence analysis indicates that up to seven copies of SBHM are present in cyRNAP (6). The structure and function of the Si3 insertion in E. coli RNAP have been well characterized; it is involved in stabilizing the open complex and RNA hairpin-dependent (his) and -independent (ops) transcription pausing (11, 12) and is highly mobile, with its confirmation being dependent on the folded/unfolded state of the trigger loop/helix and binding of transcription factors (Gre, DksA) at the secondary channel of RNAP (13, 14). In addition, structural and functional analyses of Si3 in cyRNAP have recently been initiated. According to the cryo-electron microscopy (cryo-EM) structure of the cyRNAP promoter complex (15), Si3 forms an “arch” with region 2 of the σ factor, the element involved in opening the DNA duplex at the −10 element of the promoter. This arch stabilizes the promoter complex, and its removal affects the fitness and stress resistance of cyanobacteria. Notably, the Si3-σ contact remains intact upon trigger loop refolding into the trigger helix after iNTP addition to the initiation complex with the short RNA transcript. After transition to the elongation phase, it is unknown whether Si3 becomes mobile in the presence of transcription elongation factors such as NusG and how Si3 affects refolding of the trigger helix and the catalytic activity of cyRNAP.
In this work, we structurally and biochemically analyzed cyRNAP elongation complex (EC) to understand the functional importance of Si3 in the elongation phase of transcription. We solved the X-ray crystal structure of Si3 and cryo-EM structures of the cyRNAP EC with NusG in the presence and absence of iNTP bound at the active site.
Results
X-ray Crystal Structure of Thermosynechococcus elongatus BP-1 Si3 (TelSi3).
We investigated the structure of Si3 of the thermophilic cyanobacterium Thermosynechococcus elongatus BP-1 (TelSi3) by X-ray crystallography. The DNA sequence encoding Si3 (residues 345 to 983) was cloned and inserted into a vector for expression in E. coli cells, and the resulting protein was purified to homogeneity. Initial attempts to crystallize TelSi3 were unsuccessful. Limited trypsinolysis revealed that the amino-terminal (N-terminal) 91 residues of TelSi3 are sensitive to proteolysis (SI Appendix, Fig. S2A), indicating flexibility, which potentially hindered crystallization. We then cloned and expressed TelSi3, which lacks the N-terminal 91 residues (TelSi3ΔN, residues 435 to 983) and thus forms large crystals (Fig. 1B) belonging to the P3(2) 21 space group (six TelSi3ΔN copies per asymmetric unit; Fig. 1C). We were unable to generate a TelSi3ΔN model suitable for molecular replacement based on the protein sequence (e.g., by SWISS-MODEL; SI Appendix, Fig. S2C). Therefore, the experimental phase was achieved by the single-wavelength anomalous dispersion (SAD) method using selenomethionine (SeMet)-labeled TelSi3ΔN protein (SI Appendix, Fig. S2B). The 3.2 Å resolution experimental density map allowed us to build the structures of four full-length and two partial models of TelSi3ΔN in the asymmetric unit (SI Appendix, Table S1). The AlphaFold (16) structural prediction for TelSi3ΔN was in close agreement with the X-ray structure, with an RMSD of 1.08 Å (SI Appendix, Fig. S2C).
TelSi3ΔN (150 Å in length and 50 Å in width) is longer than the canonical bacterial RNAP (e.g., 110 × 130 Å: E. coli RNAP) (SI Appendix, Fig. S1B). TelSi3ΔN comprises seven SBHMs (SBHM-2 to SBHM-8). The X-ray crystal structure of the N-terminal region (81 residues) of Si3 from S. elongatus PCC 7942 (15) showed an independently folded SBHM (SBHM-1). This region corresponds to the 91 N-terminal residues of TelSi3 (missing in the crystallized TelSi3ΔN), indicating that cyRNAP contains eight copies of SBHM within Si3 (Fig. 1C and SI Appendix, Fig. S3).
TelSi3 has a swordfish-shaped profile, with distinct “tail,” “fin,” “body,” and “head” subdomains formed by SBHM-1, SBHM-2/8, SBHM-3/4/5, and SBHM-6/7, respectively (Fig. 1D). Notably, the SBHMs in TelSi3 are not structured in a simple tandem arrangement (Fig. 2D and SI Appendix, Fig. S3), in contrast to E. coli Si3, which contains two independently folded SBHMs connected by a short linker (SI Appendix, Fig. S1A) (10). Although each SBHM has a core antiparallel β-sheet topology, connections between the β-sheets vary as the polypeptide chain folds over itself (Fig. 1D). In addition, the sequences of SBHM-1, -6, -7, and -8 are continuous; the others (SBHM-2, -3, -4, and -5) contain structural elements from distant regions of the polypeptide sequence. We assessed conformational flexibility by comparing the four full-length TelSi3ΔN structures from the asymmetric unit using the Si3-fin as a reference for superimposition. This showed substantial conformational variation in the Si3-head, allowing for a 24 Å displacement associated with an 11° rotation (Fig. 1E).
Cryo-EM Structure of the S. elongatus RNAP Elongation Complex with NusG.
To investigate the structure of cyRNAP and the dynamics of Si3 at the elongation stage, we determined the cryo-EM single-particle reconstruction structure of the cyRNAP EC (SI Appendix, Fig. S4 and Table S2). NusG was also included in the EC, as most ECs contain NusG under physiological transcription conditions (17, 18), and physical contact between Si3 and NusG was investigated.
We used recombinant double affinity-tagged S. elongatus cyRNAP to avoid isolation of any chimeric cyRNAP containing E. coli RNAP subunits. EC was assembled by mixing cyRNAP, NusG, and the DNA/RNA scaffold (Fig. 2A). The preferred particle orientation issue of EC-NusG was resolved by adding CHAPSO (final concentration of 0.8 mM) to the sample before application to the cryo-EM grid (19). The cryo-EM structure was determined with an overall resolution of 3 Å, revealing well-defined cryo-EM densities for cyRNAP, the N-terminal domain of NusG (residues 19 to 138), and the DNA/RNA hybrid (Fig. 2B). The densities of the single-stranded nontemplate DNA in the transcription bubble and the single-stranded RNA within the RNA exit channel were traceable due to their respective interactions with NusG and the RNA exit channel (Fig. 2C). The carboxyl-terminal (C-terminal) domains of the α subunits and the Kyrpides–Ouzounis–Woese (KOW) domain of NusG were disordered.
By contacting both the upstream and downstream DNA duplexes, NusG seemed to maintain a 90° bend in the DNA centered at the RNAP active site (SI Appendix, Fig. S5A), which may stabilize the DNA/RNA holding of cyRNAP. To evaluate the role of NusG, we immobilized reconstituted ECs on agarose beads and challenged the complex with 300 mM NaCl in the absence of NusG. There was a significant reduction in the proportion of RNA released from the complex compared with the EC in the presence of NusG (SI Appendix, Fig. S5B), indicating its stabilizing effect. Notably, compared with its orthologs from E. coli, Bacillus subtilis, and Mycobacterium tuberculosis, the cyanobacterial NusG possesses a longer and more positively charged loop (residues 110 to 122) within the N-terminal domain. Basic residues (K113, R118, and R120) within this loop extend toward the downstream DNA (K113 and R118) and single-stranded non-template DNA (R120) within the transcription bubble to form salt bridges (SI Appendix, Fig. S5A). Deletion of this cyanobacteria-specific loop (NusGΔ110–122) significantly reduced the stabilizing effect of NusG (SI Appendix, Fig. S5B).
Si3 Runs along the Cavities of cyRNAP and Shields the Binding Site of DksA/Gre Factors.
By fitting the models of RNAP (without Si3), NusG, and the DNA/RNA scaffold, we elucidated a density corresponding to Si3, which extends starting from the trigger loop and then moves below the rim helix (β′2 subunit), running along the lobe/protrusion domains (β subunit), and nearly reaching the upstream DNA (Fig. 2B and Movie S1). The overall structure of cyRNAP is nearly identical to the structures of other bacterial RNAPs, including those of E. coli and M. tuberculosis (16, 20), indicating that Si3 runs along the cavities of RNAP without influencing its general shape or conformation. The crystal structures of Si3 containing both SBHM2-8 and SBHM1 were fitted to their corresponding cryo-EM density. The cryo-EM density of the Si3-head was weak and had a low resolution (Fig. 2B and SI Appendix, Fig. S4E), suggesting its mobility.
Si3-tail is positioned in front of the rim helix (Fig. 3A). Si3-fin is positioned below the rim helix, and the extended SBHM2 loop (residues 463 to 471) fills a gap between the β′2 jaw and β lobe domains. Si3-body is located beside the lobe and protrusion domains of the β subunit, and Si3-head reaches the upstream DNA (Figs. 2B and 3A). Si3-fin contacts the bottom part of the rim helix, but only a few amino acid residues of Si3 contact the main body of RNAP and NusG, suggesting that Si3-tail and Si3-body/head can move their positions without restraint. Si3 spans the entire length of cyRNAP, reaching from the secondary channel to the upstream DNA. However, it likely does not interfere with any basic function of cyRNAP (i.e., DNA binding, RNA elongation, binding of initiation factor σ, or elongation factors NusA and NusG), as it runs along the sidewall of cyRNAP (Figs. 2D and 3A).
During transcription, the secondary channel of all cellular RNAPs, including bacterial RNAPs, serves as the only access route between the active site found in the center of RNAP and the external milieu, serving as an entry point for substrate NTPs and an exit route for the RNA 3′-end during backtracking (prior to RNA cleavage). In cyRNAP, the secondary channel appears to be open enough to allow these functions. In addition to these basic functions, the secondary channel serves as a binding platform for proofreading factors such as Gre and regulatory factors such as DksA, known as secondary channel binding factors (13, 14). These factors use the RNAP rim helix as a primary binding site, after which the coiled-coil domain is inserted to access the active site of RNAP (Fig. 3B). In cyRNAP, Si3-tail and -fin occupy the front and bottom sides of the rim helix, respectively, thereby preventing any potential association of secondary channel binding factors (Fig. 3A).
Dynamic Motion of Si3 Associated with the Transition between the Trigger Loop and Helix during iNTP Binding at the Active Site.
To investigate the Si3 conformational change associated with trigger helix refolding, we prepared an iNTP-bound form of the EC by extending RNA with 3′-deoxy adenosine triphosphate (3′-dATP), which arrested further RNA extension, followed by cytosine triphosphate (CTP) addition as the iNTP (SI Appendix, Fig. S6). The resulting cryo-EM structure was determined at 2.79 Å resolution (SI Appendix, Fig. S6). Although an excess amount of CTP was added to the EC, a substantial population of ECs (~40%) remained unbound to iNTP. However, the iNTP-bound EC could be clearly distinguished from the iNTP-free EC during 3D classification of the cryo-EM data process due to its unique Si3 orientation relative to the main body of cyRNAP associated with iNTP binding (SI Appendix, Fig. S6 B and D). This allowed for a well-defined density map of the cyRNAP active site. In the iNTP-bound EC, the B-site Mg2+ (known as the nucleotide-binding metal) was present at the active site. However, the A-site Mg2+ (known as the catalytic metal) was absent, likely due to the lack of a hydroxyl group at the 3′-end of the RNA. Trigger helix folding establishes several essential contacts between the iNTP and amino acid residues, including β′2-M339 in contact with the nucleobase and β′2-H343 in contact with the β-phosphate group (SI Appendix, Fig. S6D).
Trigger helix folding induces significant motion of Si3 relative to the main body of cyRNAP. Specifically, the trigger helix formation pulls a linker connecting the C-terminal half of the trigger helix and the Si3-fin, and during this process, the tip of the rim helix acts as a pivot point, converting the lateral motion of the linker (~10 Å) into the rotational motion of Si3, resulting in an ~50-Å distance and a 24° swing of Si3-head (Fig. 4 A and B and Movie S2). Si3-body/head swings down from the main body of cyRNAP, ejecting it from the cavity of cyRNAP; thus, the β protrusion domain no longer contacts Si3-body/head in the iNTP-bound EC (Fig. 4A). The swing motion of Si3 couples: 1) closing of the rim helix toward the secondary channel, and 2) establishing new contacts between the β′1-jaw domain and the Si3-fin, which shifts a position of jaw domain toward the secondary channel of cyRNAP. Complete deletion of Si3 did not markedly alter the catalytic properties of cyRNAP (Fig. 4C). Three ECs containing 14, 15, and 16 nucleotide long RNAs (EC14, 15, and 16) were prepared by extending the initial 5′-labeled 13 nt long RNA in the nucleic acid scaffold shown above the summary table. Nucleotide addition, its direct reversal by pyrophosphorolysis and transcript cleavage were performed for the ECs that formed with either wild-type (WT) or Si3-lacking (ΔSi3) cyRNAP. Rates of the NTP addition, pyrophosphorolysis, and RNA hydrolysis were similar between the WT and ΔSi3 cyRNAPs (Fig. 4C and SI Appendix, Fig. S7). The relative rates of these reactions also allowed us to attribute a predominant translocation state to the EC tested because nucleotide addition proceeded from post-translocation, pyrophosphorolysis from pre-translocation and hydrolysis from the backtracked state (scheme on Fig. 4C). Comparison of the rates of these reactions for the three complexes used in the present study suggested that EC14 is mainly stabilized in a post-translocated state (characterized by fast NTP addition), EC15 is mainly pre-translocated (fast pyrophosphorolysis), and EC16 is mainly backtracked/paused (faster hydrolysis), similar to the ECs formed by Thermus thermophilus RNAP (21), which does not contain Si3, on this template. These results imply that Si3 does not influence the catalysis or translocation equilibrium of cyRNAP.
The cryo-EM structure of the cyRNAP-promoter DNA complex containing σA (both from Synechocystis sp. PCC 6803, which is closely related to the S. elongatus PCC 7942 used in this study), promoter DNA, and 4-mer RNA was determined by Shen et al. (15); the results showed that Si3-head contacts σA domain 2. This interaction clamps the single-stranded DNA around the −10 region, stabilizing the open complex and facilitating transcription initiation. Comparison of the structures of the cyRNAP promoter complex (15) with those of the EC (this study) revealed that Si3-body and -head move toward σA domain 2 for interaction but that the other cyRNAP structures, including Si3-tail and -fin and the main body of the RNAP, are nearly identical (Fig. 5A).
Si3 wraps around the main body of cyRNAP, which may facilitate RNAP folding, subunit assembly, and/or maturation to form an active and mature form of RNAP as DNA and a σ factor that enhances reconstitution of E. coli RNAP (23). To test the function of Si3 during cyRNAP assembly and maturation, we performed a refolding experiment with WT, ΔSi3 cyRNAP, and ΔSi3 cyRNAP in combination with the separately expressed and purified Si3 protein (ΔSi3+Si3) (Fig. 5B). The proteins were denatured with 6 M guanidine-HCl and renatured by gradual removal of guanidine-HCl via dialysis against renaturation buffer. The activities of the reconstituted ΔSi3 cyRNAP in the absence and presence of the Si3 protein, as judged by their ability to extend 13-nt long RNA in the assembled duplex with template DNA oligonucleotide, were nearly the same as those of the WT cyRNAP, indicating that Si3 does not play a role in cyRNAP assembly and maturation. This conclusion is supported by the similar yields of recombinant WT and ΔSi3 cyRNAPs routinely isolated from E. coli. Remarkably, however, the separate Si3 protein binds ΔSi3 cyRNAP but not the WT cyRNAP when it is added externally to cyRNAP (Fig. 5C). When complex formation between Si3 and ΔSi3 cyRNAP was assessed by a blue native polyacrylamide gel electrophoresis, a band with a lower mobility similar to that of the WT cyRNAP was observed (Fig. 5C, Lane 4). Interaction between WT cyRNAP and Si3 was not detected, i.e., no complex with lower mobility than that of WT cyRNAP was detected (Lane 5).
Discussion
In this study, we determined the structures of cyRNAP Si3 by X-ray crystallography (Fig. 1), and of cyRNAP EC-NusG with and without iNTP by cryo-EM (Figs. 2 and 4). We investigated the function of Si3 by comparing the catalytic activities of WT and ΔSi3 cyRNAPs. The results of structural and biochemical investigations of cyRNAP showed that when the trigger loop is unfolded, Si3 is accommodated within the cavities of cyRNAP, but it shields the site of secondary channel binding proteins. The iNTP binding at the active site refolds the trigger loop to form the trigger helix, which drastically rearranges the Si3 location, ejecting it out from the cavity of cyRNAP. Remarkably, a minor structural transition between the trigger loop and trigger helix causes a major swinging motion of Si3 (Fig. 4 and Movie S2). The presence of Si3 in the middle of the trigger loop/helix did not affect cyRNAP catalysis under our experimental conditions (Fig. 4C). Because of the large conformational change that occurs during the transcription reaction, changes in cyRNAP activity could be observed when the motion of Si3 is hindered, such as by binding of external factors. This hypothesis could be tested by carrying out an experiment that would limit the Si3 motion. For example, by introducing artificial disulfide cross-linking (24) between the Si3-body and the β lobe domain to impede the motion of Si3 as a result of the iNTP binding at the active site. Further proteomics for searching factors binding Si3, structural, single-molecule, and biochemical studies are required to elucidate its role in regulating transcription by cyRNAP, such as by sensing environmental signals (e.g., trafficking of RNAP or transcription-translation coupling) to optimizing cyRNAP activity. Alternatively, the oscillating motion of Si3 might function as a regulatory signal for cellular processes. Photosynthetic cyanobacteria synchronize their gene expression patterns with diurnal light cycles (25). Conceivably, lack of Si3 movement might trigger initiation of cyRNAP hibernation through binding to cellular factors or its oligomerization during the night. Additionally, Si3 movement might help RNAP propel through the densely packed cytoplasm of cyanobacteria during transcription.
The primary proofreading mechanism employed by RNAP involves backtracking followed by hydrolysis of misincorporated nucleotides at the 3′-end of nascent RNA. This process is significantly enhanced by elongation factors that bind to RNAP secondary channel, such as Gre in bacteria, TFS in archaea, and TFIIS in eukaryotes (26). However, unlike the absolute majority of living organisms, cyanobacteria lack secondary channel binding factors. The intracellular concentration of Mn2+ is two orders of magnitude greater in cyanobacteria than in other bacteria to support photosynthesis (17). It is possible that Mn2+ replaces the catalytic Mg2+ of RNAP and thus promotes misincorporation of NTPs (27, 28). Potentially as a compensating mechanism, cyRNAP has been shown to possess proficient intrinsic proofreading activity (7, 29). However, this intrinsic activity is still approximately 10 times lower than the Gre-stimulated activity of E. coli RNAP. Gre-like factors either emerged after the split of cyanobacteria from their last common ancestor with other bacteria or were subsequently lost. The distinctive characteristics of cyRNAP—the absence of Gre/DksA factors and the split of the largest subunit may be intrinsically linked to Si3 acquisition. The Si3-tail/fin position around the rim helix of RNAP prevents association of secondary channel binding proteins, such as GreA and DksA, with cyRNAP (Fig. 3). As secondary channel binding proteins play critical roles in transcription fidelity and regulation in bacteria, the Si3-GreA/DksA trade-off in cyanobacteria might be advantageous but remains to be fully understood. With Si3 acquisition, β′ increased to 210 kDa in size, and separation of the original rpoC gene into two genes was perhaps beneficial to facilitate expression of cyRNAP subunits. The observed change in the position and mobility of Si3 in cyRNAP ECs compared to those in the promoter complex (Fig. 5A) raises questions about the role of Si3 in promoter escape. Si3 may complicate promoter escape by binding to the σ factor; conversely, its large-range movement upon RNA synthesis may contribute to weakening σ association with core and/or promote σ release at transition to elongation stage.
The structure corresponding to Si3 of cyRNAP has not been found in other bacterial RNAPs. However, the structure and arrangement of the Rpb9 subunit in eukaryotic RNAPII show remarkable similarity to those of the Si3 subunit of cyRNAP (Fig. 3C). Rpb9 is positioned within a cavity between the rim helix and the lobe domain of RNAPII, akin to the Si3-fin of cyRNAP (highlighted in red in cyRNAP and RNAPII). Rpb9 is a unique subunit found only in RNAPII and plays a critical role in enhancing the accuracy of transcription (30). Although both Rpb9 and Si3-tail are located away from the active site of RNAP, their presence may enhance transcription fidelity, which coordinates RNAP confirmation changes such as RNAP swiveling and/or movement of the rim helix during the nucleotide addition cycle (16). The presence of these unique structural features in different types of RNAPs suggests a common mechanism for enhancing transcriptional accuracy and specificity across different organisms. Further investigation of Si3 function at different stages of transcription and under several growth conditions in cyanobacteria will be required to determine the full array of its biological functions.
Experimental Procedures
Protein Preparation.
The DNA fragment encoding Thermosynechococcus elongatus BP-1 Si3 in the β′2 subunit (TelSi3, RpoC2 residues 345 to 983, 69 kDa) was cloned between the NdeI and BamHI sites of the pET15b expression vector to introduce an N-terminal His6-tag, and the protein was overexpressed in E. coli BL21(DE3)/pLysS cells. Transformants were subsequently grown in LB media supplemented with ampicillin (100 µg/mL) and chloramphenicol (25 µg/mL) at 37 °C until the OD600 reached ~0.5, after which protein expression was induced by adding 0.5 mM IPTG for 10 h at 4 °C. The harvested cells were lysed by sonication, and proteins in the soluble fraction were purified by Ni-affinity column chromatography (HisTrap 5 mL column, GE Healthcare). The His6-tag was removed by thrombin digestion (1 µg of thrombin per mg of TelSi3 protein) for 20 h at 4 °C, and the protein was further purified by Q Sepharose column chromatography (GE Healthcare) and gel-filtration column chromatography (HiLoad Superdex75 16/60, GE Healthcare). The purified protein was concentrated to 15 mg/mL and exchanged into buffer containing 10 mM Tris-HCl (pH 8.0), 50 mM NaCl and 0.1 mM EDTA.
Limited Trypsinolysis.
Limited trypsinolysis was used to remove flexible regions from TelSi3, and N-terminal amino acid sequencing was used to identify protein fragments suitable for crystallization. The trypsin digests were carried out in 10 mM Tris–HCl (pH 8), 100 mM NaCl, 5% (v/v) glycerol, 0.1 mM EDTA, and 1 mM DTT. TelSi3 (10 mg/mL) was digested in a 10 µL volume with different amounts of trypsin (5 nM to 5 µM) for 10 min at 25 °C. The reactions were terminated by addition of PMSF. The trypsinized fragments were separated by SDS–PAGE and blotted onto PVDF membranes, and the N-terminal sequences were determined by Edman-based protein sequencing. The TelSi3 fragment containing residues 435 to 938 (TelSi3ΔN, 60 kDa) was PCR subcloned and inserted into the pET15b expression vector between the NdeI and BamHI sites. The protein was overexpressed and purified as described above for full-length TelSi3.
Crystallization.
Initial crystals of TelSi3ΔN were obtained by the hanging-drop vapor diffusion method by mixing equal volumes of the protein solution (20 mg/mL) and crystallization solution [0.1 M sodium citrate (pH 3.5), 0.2 M MgCl2, and 10% PEG6000] and incubating at 4 °C over the same crystallization solution. The large crystals (0.5 × 0.2 × 0.2 mm) used for X-ray data collection were prepared by microseeding by mixing 2 µL of protein solution, 2 µL of crystallization solution [0.1 M sodium citrate (pH 5), 0.4 to 0.6 M MgCl2, 4 to 6% PEG3350 and 50 µg/mL heparin], and 0.2 µL of seed solution. The crystals were then dehydrated by transfer to crystallization solution (without heparin) with increasing concentrations of PEG3350 (in 5% steps) to a final concentration of 20% and incubated for 5 to 10 h. For all procedures, crystal preparation, growth, and dehydration were performed at 4 °C. The crystals were transferred to a crystallization solution with 25% (v/v) propylene glycol as a cryoprotective solution and flash frozen in liquid nitrogen. Selenomethionine-substituted proteins were prepared for SAD analysis by suppressing methionine biosynthesis.
X-ray Data Collection and Crystal Structure Determination.
In addition to the four original methionine residues found in TelSi3ΔN (including an N-terminal methionine residue resulting from cloning into the pET15b vector), three methionine residues were introduced by replacing the leucine residues at 508, 738, and 922 by site-directed mutagenesis to obtain the experimental phase via single-anomalous dispersion (SAD) experiments using SeMet-labeled proteins. The TelSi3ΔN protein with seven selenomethionine residues (TelSi3ΔNMet7) was generated by suppressing methionine biosynthesis during overexpression of the TelSi3ΔNMet7 protein. The protein was purified as described above.
Diffraction data were corrected at National Synchrotron Light Source (NSLS) beamline X25. There are six TelSi3ΔNMet7 molecules in an asymmetric unit of the crystal belonging to the P3(2)21 space group. The crystallographic datasets were processed using HKL2000 (31). With the anomalous signal from SeMet, the experimental phase (figure of merit: 0.273) was calculated using automated structure solution (AutoSol) in PHENIX (32). Density modification yielded a map suitable for manual model building by Coot (33) followed by structure refinement using PHENIX. The final coordinates and structure factors have been deposited in Protein Data Bank (PDB) under the accession codes listed in SI Appendix, Table S1.
Expression and Isolation of the S. elongatus RNAP.
The core enzyme of cyRNAP was overexpressed in E. coli T7Express cells (New England Biolabs) cells transformed with a pET28a expression vector containing the α, β, β′1, β′2, and ω encoding genes (β and β′2 contain a Strep-tag and His-tag, respectively) (15). The cells were grown in LB media supplemented with kanamycin (50 μg/mL) at 37 °C until the OD600 was ~0.6. Afterward, the cells were induced with IPTG (1 mM) and grown overnight at 22 °C. The biomass was harvested and suspended in lysis buffer [50 mM Tris-HCl (pH 8.0), 250 mM NaCl, 10% glycerol, 20 mM imidazole, and 1 mM β-mercaptoethanol and protease inhibitors from Roche according to the manufacturer’s instructions]. The cells were sonicated, lysate centrifuged at 18 k × g, after which the supernatant was collected. The protein was purified at 4 °C sequentially through a HisTrap (5 mL) column and a Strep-Tactin XT (1 mL) column (both from Cytiva). The latter column was washed with three column volumes (CVs) of Buffer W (100 mM Tris-HCl pH 8.0, 150 mM NaCl, 1 mM EDTA). The bound protein was eluted by applying 1 CV of Buffer E (100 mM Tris-HCl pH 8.0, 150 mM NaCl, 1 mM EDTA, and 2.5 mM desthiobiotin). The purified cyRNAP (20 μM) was assessed using SDS–PAGE, dialyzed against Storage Buffer (40 mM Tris–HCl pH 8.0, 200 mM KCl, 1 mM EDTA, 1 mM DTT, and 5% glycerol), and stored at −80 °C.
Cloning, Expression, and Isolation of the S. elongatus NusG and Si3 Proteins.
The NusG was overexpressed in E. coli T7Express cells (New England Biolabs) cells transformed with pET28a expression vector where the gene for the C-terminal His6-tagged S. elongatus NusG was cloned. Cells were grown in LB medium supplemented with kanamycin (50 μg/mL) at 37 °C until OD600 ~0.5 and then induced with IPTG (1 mM) and grown overnight at 22 °C. Culture pellets were sonicated in 50 mL Lysis Buffer (10 mM Tris-HCl pH 7.9, 300 mM KCl, protease inhibitors from Roche according to manufacturer), spun at 18K rpm, and filtered through 0.22 μM syringe filter. Filtered supernatant was subjected to Ni-NTA affinity chromatography in 10 mM Tris-HCl pH 7.9, 600 mM KCl, 5% glycerol with 50 mM imidazole washes and 100 mM imidazole elution. The eluted protein (in 600 mM KCl) was diluted (~100 mM KCl) and applied to a pre-equilibrated with (10 mM Tris-HCl pH 8.0, 100 mM KCl, 5% glycerol) 5 mL Resource Q column, Cytiva. The column was washed with 5 CV of equilibration buffer, and the protein was eluted by applying a linear salt gradient (100-1 M KCl) over 10 CV. The purified NusG (90 μM) was checked using SDS-PAGE, stored in Storage Buffer (40 mM Tris-HCl pH 8.0, 200 mM KCl, 1 mM EDTA, 1 mM DTT, and 5% glycerol) at −80 °C. Cyanobacteria-specific loop of NusG (residues 110 to 122) was deleted by site-directed mutageneses, and the mutant NusG was isolated as the WT protein.
The open reading frame encoding separate full-size Si3 domain was cloned into pET28 vector, overexpressed E. coli T7Express cells (New England Biolabs) as the N-terminal His6-tagged protein, and isolated via Ni-NTA affinity chromatography on HisTrap column, Cytiva, similarly to NusG protein. After affinity chromatography, the protein was dialyzed against the storage buffer (20 Tris-HCl pH 8.0, 200 mM KCl, 1 mM EDTA, 1 mM DTT, and 50% glycerol).
Sample Preparation for Cryo-EM.
The cyRNAP EC with NusG was reconstituted in vitro by mixing 5 μM cyRNAP with equimolar amounts of template DNA and RNA (Fig. 2A) in storage buffer at 37 °C for 10 min, followed by mixing with 7 μM nontemplate DNA and incubating further for 10 min. The resulting EC was mixed with 7 μM NusG and incubated for 10 min at 37 °C. CHAPSO (8 mM) was added to the sample just before vitrification. The iNTP-bound EC was prepared by adding 1 mM 3′-deoxyATP (as an extension terminator after its incorporation at 3′-end RNA) and CTP (as iNTP binding at the active site) to the EC with NusG and incubating for 5 min at 37 °C. Another difference between the EC- and iNTP-bound ECs is the nontemplate DNA used in the scaffold, the latter of which contains complementary transcription bubbles.
Grid Preparation for Cryo-EM.
C-flat Cu grids (CF-1.2/1.3 400 mesh, Protochips) were glow-discharged for 40 s using the PELCO easiGlow™ system prior to application of 3.5 μL of the sample (2.5 to 3.0 mg/mL protein concentration) and plunge-freezing in liquid ethane using a Vitrobot Mark IV (FEI) with 100% chamber humidity at 5 °C.
Cryo-EM Data Acquisition and Processing.
Data were collected using a Titan Krios (Thermo Fisher) microscope equipped with a Falcon IV direct electron detector (Gatan) at Penn State Cryo-EM Facility. Sample grids were imaged at 300 kV, with an intended defocus range of −2.5 to −0.75 μm and a magnification of 75,000× in electron counting mode (0.87 Å per pixel). Movies were collected with a total dose of 45 electrons/Å2. Downstream processing was performed with CryoSPARC (34). The movies were corrected and aligned using patch motion correction followed by patch CTF correction. Particles were picked using a template-based autopicker and multiple rounds of 2D classification to discard bad particles. The 2D classes with EC-NusG particles were selected and used for training the Topaz model (35). The Topaz-extracted particles were subjected to multiple rounds of heterogeneous refinement to remove junk particles. Finally, a nonuniform refinement operation was run on the final set of particles to yield the reconstruction (SI Appendix, Figs. S4 and S6).
Structure Refinement and Model Building.
A model of the cyRNAP core enzyme was constructed by homology modeling using core RNAP from the cryo-EM structure of the Syn 6803 RNAP-σA promoter DNA open complex as a reference model. A model of cyNusG was constructed with the AlphaFold2 gene (36). DNA and RNA models were constructed using the E. coli RNAP elongation complex (PDB: 7MKO) as a guide. The cyRNAP gene was manually fitted into the cryo-EM density map using Chimera (37), followed by rigid body and real-space refinement using Coot (33) and Phenix (38).
In Vitro Transcription in the Assembled Elongation Complexes.
ECs were assembled and immobilized as described (39). Sequences of the oligonucleotides used for the assembly of ECs are shown in Fig.4C. For assembly of ECs used for experiments on Fig. 4C, 13-nt long RNA was radiolabeled at the 5′-end with [γ-32P] ATP and T4 Polynucleotide kinase (New England Biolabs) prior to complexes assembly. Stalled elongation complexes EC14, EC15, and EC16 were obtained by extension of the initial RNA13 in EC13 with 10 μM NTP sets according to the sequence for 5 min and then were washed with TB to remove Mg2+ and NTPs. Reactions were initiated by addition of 10 mM MgCl2 with or without either 1 μM NTPs or 250 μM PPi. Single nucleotide addition and pyrophosphorolysis experiments were performed at 30 °C in transcription buffer (TB) containing 20 mM Tris–HCl pH 6.8, 40 mM KCl, and 10 mM MgCl2, transcript hydrolysis was done in the same buffer except at pH 7.9. After incubation for intervals of time specified on Figures, reactions were stopped with formamide-containing buffer. Products were resolved by denaturing 23% polyacrylamide gel electrophoresis (PAGE) (8 M Urea), revealed by PhosphorImaging (Cytiva) and visualized using ImageQuant (Cytiva) software. Kinetics data were fitted to a single exponential equation using SigmaPlot software by non-linear regression to determine rate constants of the reactions.
Denaturation and Renaturation of cyRNAP.
Denaturation of cyRNAP was performed by incubating the purified protein for 20 min in denaturing buffer containing 20 mM Tris-HCl (pH 7.9), 6 M guanidine-HCl, 5% glycerol, 1 mM EDTA, and 10 mM DTT at 30 °C in a 100 µL volume and with a cyRNAP concentration of 0.5 mg/mL. Recombinant Si3 was included in 2.5 molar excess. The proteins were renatured via overnight dialysis at 7 °C against renaturing buffer containing 20 mM Tris-HCl (pH 7.9), 200 mM KCl, 10% glycerol, 2 mM MgCl2, 10 µM ZnCl2, 1 mM EDTA, and 1 mM DTT. Aliquots of the renaturation mixture and their serial dilutions were used for nucleotide addition experiments on assembled constructs containing template DNA and RNA oligonucleotides. A 13 nt RNA oligonucleotide was radiolabeled at the 5′ end with [γ-32P] ATP and T4 polynucleotide kinase (New England Biolabs) prior to EC assembly. The indicated on the Fig. 5C amount of assembly # mixture was incubated with the RNA-DNA duplex for 5 min at room temperature, then 10 µM GTP was added for 10 min at 30 °C. Reactions were stopped and products analyzed as before.
Complex Formation between the Si3 Protein and Core cyRNAP.
For the binding experiment 150 nM core enzymes and 1.5 µM Si3 proteins were incubated for 10 min at 4 °C in 20 mM Tris-HCl pH 7.9, 40 mM KCl, mixed with loading dye (final concentration is 50 mM BisTris pH 7.2, 50 mM NaCl, 10% glycerol, 0.001% Ponceau S) and resolved on the NativePAGE 3 to 12% Bis-Tris gel, Invitrogen using running buffers prepared according to the manufacturer, for 90 min at 150 V. Gel was fixed with 50% methanol, 10% acetic acid solution, and additionally de-stained by boiling in 8% acetic acid.
Salt Stability of Elongation Complexes.
Elongation complex was assembled using oligos shown in SI Appendix, Fig. S5B. 14 nt RNA in ECs on was radiolabeled at the 3′ end by incorporation of [α-32P] GTP into original 13 nt long RNA. To examine the stability of ECs, ECs bound to the streptavidin sepharose beads, Cytiva via strep tag on β subunit of cyRNAP, were incubated in TB containing 300 mM KCl at 30 °C for times specified on SI Appendix, Fig. S5B. WT or mutant NusGΔ110–122 were added where specified at 1 μM final concentration. Supernatant and total fractions were collected for analysis. Reactions were stopped and products analyzed as before.
Supplementary Material
Acknowledgments
We thank Jean-Paul Armache at Penn State for the technical support. We thank Akira Hirata for the Si3 crystallization and the National Synchrotron Light Source (NSLS) Brookhaven National Laboratory for X-ray data collection. We would like to acknowledge the Penn State Huck Life Science Institutes Cryo-EM Core Facility (RRID:SCR_024456) for use of the Talos Arctica G2 TEM and the Vitrobot Mark IV and Sung Hyun Cho for data collection. We thank Yu Zhang at the Shanghai Institute of Plant Physiology and Ecology for kindly sharing the coordinates of Synechocystis sp. PCC 6803 RNAP. This work was supported by a NIH grant (R35 GM131860 to K.S.M.) and a Biotechnology and Biological Sciences Research Council grant BB/W017385/1 to Y.Y. Research reported in this publication was supported by the Office of the Director, NIH, under award number S10OD026822-01.
Author contributions
Y.Y. and K.S.M. designed research; M.Z.Q., M.I., M.L., R.K.V., and A.R.-B. performed research; M.I., Y.Y., and K.S.M. analyzed data; and Y.Y. and K.S.M. wrote the paper.
Competing interests
The authors declare no competing interest.
Footnotes
This article is a PNAS Direct Submission. I.A. is a guest editor invited by the Editorial Board.
Contributor Information
Yulia Yuzenkova, Email: y.yuzenkova@ncl.ac.uk.
Katsuhiko S. Murakami, Email: kum14@psu.edu.
Data, Materials, and Software Availability
The X-ray crystallographic density map and the refined model have been deposited in Protein Data Bank (https://www.rcsb.org) under accession number 8EMB (40). The cryo-EM density map and the refined model have been deposited in Electron Microscopy Data Bank (https://www.ebi.ac.uk/emdb/) under accession numbers EMD-40874 (iNTP-free EC-NusG) (41) and EMD-42502 (iNTP-bound EC-NusG) (42) and in Protein Data Bank (https://www.rcsb.org) under accession numbers 8SYI (iNTP-free EC-NusG) (43) and 8URW (iNTP-bound EC-NusG) (44). All other data are included in the manuscript and/or supporting information.
Supporting Information
References
- 1.Borner T., Aleynikova A. Y., Zubo Y. O., Kusnetsov V. V., Chloroplast RNA polymerases: Role in chloroplast biogenesis. Biochim. Biophys. Acta 1847, 761–769 (2015). [DOI] [PubMed] [Google Scholar]
- 2.Pfannschmidt T., et al. , Plastid RNA polymerases: Orchestration of enzymes with different evolutionary origins controls chloroplast biogenesis during the plant life cycle. J. Exp. Bot. 66, 6957–6973 (2015). [DOI] [PubMed] [Google Scholar]
- 3.Schneider G. J., Tumer N. E., Richaud C., Borbely G., Haselkorn R., Purification and characterization of RNA polymerase from the cyanobacterium Anabaena 7120. J. Biol. Chem. 262, 14633–14639 (1987). [PubMed] [Google Scholar]
- 4.Xie W. Q., Jager K., Potts M., Cyanobacterial RNA polymerase genes rpoC1 and rpoC2 correspond to rpoC of Escherichia coli. J. Bacteriol. 171, 1967–1973 (1989). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Schneider G. J., Hasekorn R., RNA polymerase subunit homology among cyanobacteria, other eubacteria and archaebacteria. J. Bacteriol. 170, 4136–4140 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lane W. J., Darst S. A., Molecular evolution of multisubunit RNA polymerases: Sequence analysis. J. Mol. Biol. 395, 671–685 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Riaz-Bradley A., James K., Yuzenkova Y., High intrinsic hydrolytic activity of cyanobacterial RNA polymerase compensates for the absence of transcription proofreading factors. Nucleic Acids Res 48, 1341–1352 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Qayyum M. Z., Shin Y., Murakami K. S., Encyclopedia of Biological Chemistry III (Elsevier, 2021), pp. 358–364, 10.1016/b978-0-12-819460-7.00252-8. [DOI]
- 9.Cheung A. C., Cramer P., A movie of RNA polymerase II transcription. Cell 149, 1431–1437 (2012). [DOI] [PubMed] [Google Scholar]
- 10.Chlenov M., et al. , Structure and function of lineage-specific sequence insertions in the bacterial RNA polymerase beta’ subunit. J. Mol. Biol. 353, 138–154 (2005). [DOI] [PubMed] [Google Scholar]
- 11.Kang J. Y., et al. , RNA polymerase accommodates a pause RNA hairpin by global conformational rearrangements that prolong pausing. Mol. Cell 69, 802–815.e805 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Artsimovitch I., Svetlov V., Murakami K. S., Landick R., Co-overexpression of Escherichia coli RNA polymerase subunits allows isolation and analysis of mutant enzymes lacking lineage-specific sequence insertions. J. Biol. Chem. 278, 12344–12355 (2003). [DOI] [PubMed] [Google Scholar]
- 13.Abdelkareem M., et al. , Structural basis of transcription: RNA polymerase backtracking and its reactivation. Mol. Cell 75, 298–309.e294 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shin Y., et al. , Structural basis of ribosomal RNA transcription regulation. Nat. Commun. 12, 528 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Shen L., et al. , An SI3-sigma arch stabilizes cyanobacteria transcription initiation complex. Proc. Natl. Acad. Sci. U.S.A. 120, e2219290120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vishwakarma R. K., Qayyum M. Z., Babitzke P., Murakami K. S., Allosteric mechanism of transcription inhibition by NusG-dependent pausing of RNA polymerase. Proc. Natl. Acad. Sci. U.S.A. 120, e2218516120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mooney R. A., et al. , Regulator trafficking on bacterial transcription units in vivo. Mol. Cell 33, 97–108 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Yakhnin A. V., Kashlev M., Babitzke P., NusG-dependent RNA polymerase pausing is a frequent function of this universally conserved transcription elongation factor. Crit. Rev. Biochem. Mol. Biol. 55, 716–728 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chen J., Noble A. J., Kang J. Y., Darst S. A., Eliminating effects of particle adsorption to the air/water interface in single-particle cryo-electron microscopy: Bacterial RNA polymerase and CHAPSO. J. Struct. Biol. X 1, 100005 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Qayyum M. Z., Molodtsov V., Renda A., Murakami K. S., Structural basis of RNA polymerase recycling by the Swi2/Snf2 family of ATPase RapA in Escherichia coli. J. Biol. Chem. 297, 101404 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yuzenkova Y., et al. , Tagetitoxin inhibits transcription by stabilizing pre-translocated state of the elongation complex. Nucleic Acids Res. 41, 9257–9265 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bochkareva A., Yuzenkova Y., Tadigotla V. R., Zenkin N., Factor-independent transcription pausing caused by recognition of the RNA-DNA hybrid sequence. EMBO J. 31, 630–639 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fukuda R., Ishihama A., Subunits of RNA polymerase in function and structure; Maturation in vitro of core enzyme from Escherichia coli. J. Mol. Biol. 87, 523–540 (1974). [DOI] [PubMed] [Google Scholar]
- 24.Bao Y., Landick R., Obligate movements of an active site-linked surface domain control RNA polymerase elongation and pausing via a Phe pocket anchor. Proc. Natl. Acad. Sci. U.S.A. 118, e2101805118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dong G., Golden S. S., How a cyanobacterium tells time. Curr. Opin. Microbiol. 11, 541–546 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Conaway R. C., Kong S. E., Conaway J. W., TFIIS and GreB: Two like-minded transcription elongation factors with sticky fingers. Cell 114, 272–274 (2003). [DOI] [PubMed] [Google Scholar]
- 27.Niyogi S. K., Feldman R. P., Effect of several metal ions on misincorporation during transcription. Nucleic Acids Res. 9, 2615–2627 (1981). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hurwitz J., Yarbrough L., Wickner S., Utilization of deoxynucleoside triphosphates by DNA-dependent RNA polymerase of E. coli. Biochem. Biophys. Res. Commun. 48, 628–635 (1972). [DOI] [PubMed] [Google Scholar]
- 29.Imashimizu M., Tanaka K., Shimamoto N., Comparative study of cyanobacterial and E. coli RNA polymerases: Misincorporation, abortive transcription, and dependence on divalent cations. Genet. Res. Int. 2011, 572689 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nesser N. K., Peterson D. O., Hawley D. K., RNA polymerase II subunit Rpb9 is important for transcriptional fidelity in vivo. Proc. Natl. Acad. Sci U.S.A. 103, 3268–3273 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Otwinowski Z., Minor W., Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276, 307–326 (1997). [DOI] [PubMed] [Google Scholar]
- 32.Liebschner D., et al. , Macromolecular structure determination using X-rays, neutrons and electrons: Recent developments in Phenix. Acta Crystallogr. D Struct. Biol. 75, 861–877 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Emsley P., Lohkamp B., Scott W. G., Cowtan K., Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Punjani A., Rubinstein J. L., Fleet D. J., Brubaker M. A., cryoSPARC: Algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017). [DOI] [PubMed] [Google Scholar]
- 35.Bepler T., et al. , Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs. Nat. Methods 16, 1153–1160 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jumper J., et al. , Applying and improving AlphaFold at CASP14. Proteins 89, 1711–1721 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Pettersen E. F., et al. , UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004). [DOI] [PubMed] [Google Scholar]
- 38.Afonine P. V., et al. , Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D Struct. Biol. 74, 531–544 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Yuzenkova Y., et al. , Stepwise mechanism for transcription fidelity. BMC Biol. 8, 54 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Murakami K.S., Imashimizu M., Qayyum M.Z., Vishwakarma R.K., Yuzenkova Y., X-ray crystal structure of Thermosynechococcus elongatus Si3 domain of RNA polymerase RpoC2 subunit. PDB. https://www.rcsb.org/structure/8EMB. Deposited 27 September 2022.
- 41.Qayyum M.Z., et al. , Cyanobacterial RNAP-EC. PDB, EMDB. https://www.rcsb.org/structure/8SYI, https://www.ebi.ac.uk/emdb/EMD-40874. Deposited 25 May 2023.
- 42.Qayyum M.Z., et al. , Cyanobacterial RNA polymerase elongation complex with NusG and CTP. PDB, EMDB. https://www.rcsb.org/structure/8URW, https://www.ebi.ac.uk/emdb/EMD-42502. Deposited 26 October 2023.
- 43.Qayyum M.Z., et al. , Cyanobacterial RNAP-EC. Protein Data Bank. https://www.rcsb.org/structure/8SYI. Deposited 25 May 2023.
- 44.Qayyum M.Z., et al. , Cyanobacterial RNA polymerase elongation complex with NusG and CTP. Protein Data Bank. https://www.rcsb.org/structure/8URW. Deposited 26 October 2023.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The X-ray crystallographic density map and the refined model have been deposited in Protein Data Bank (https://www.rcsb.org) under accession number 8EMB (40). The cryo-EM density map and the refined model have been deposited in Electron Microscopy Data Bank (https://www.ebi.ac.uk/emdb/) under accession numbers EMD-40874 (iNTP-free EC-NusG) (41) and EMD-42502 (iNTP-bound EC-NusG) (42) and in Protein Data Bank (https://www.rcsb.org) under accession numbers 8SYI (iNTP-free EC-NusG) (43) and 8URW (iNTP-bound EC-NusG) (44). All other data are included in the manuscript and/or supporting information.