Abstract
Background
Cellular RNA polymerases (RNAPs) are complex molecular machines that combine catalysis with concerted conformational changes in the active center. Previous work showed that kinking of a hinge region near the C-terminus of the Bridge Helix (BH-HC) plays a critical role in controlling the catalytic rate.
Results
Here, new evidence for the existence of an additional hinge region in the amino-terminal portion of the Bridge Helix domain (BH-HN) is presented. The nanomechanical properties of BH-HN emerge as a direct consequence of the highly conserved primary amino acid sequence. Mutations that are predicted to influence its flexibility cause corresponding changes in the rate of the nucleotide addition cycle (NAC). BH-HN displays functional properties that are distinct from BH-HC, suggesting that conformational changes in the Bridge Helix control the NAC via two independent mechanisms.
Conclusions
The properties of two distinct molecular hinges in the Bridge Helix of RNAP determine the functional contribution of this domain to key stages of the NAC by coordinating conformational changes in surrounding domains.
Background
RNA polymerases (RNAPs) play a central role in the regulation of gene expression. Like the majority of the enzymes involved in fundamental biological information-processing functions (for example, replication, transcription, recombination, repair), RNAPs are probably best viewed as intricate molecular machines. The movement of nucleic acid substrates, coupled with various types of active site chemistries, requires a precisely orchestrated sequence of conformational changes of protein domains during the transcription cycle (for recent reviews see [1-4]).
The nanomechanical mechanisms guiding the structural rearrangements of domains within the active site are still very poorly understood. Thus far, models of the fundamental reaction catalyzed by RNAPs, the nucleotide addition cycle (NAC), have predominantly been derived from a series of crystal structures that contain RNAPs as apoenzymes (for example [5-9]), or complexed with various substrates and inhibitors (for example [10-15]). Such structures, revealing (among other features) pre- and post-translocation states of RNAPs, have provided the basis for various hypotheses concerning the molecular mechanism of the NAC [1-4,16,17]. There are, however, two potential shortcomings associated with such approaches. First, in order to 'freeze' the RNAPs in a crystallizable conformation, substrate analogs or inhibitors need to be chosen that stop the reaction cycle at a specific point. This may result in the adoption of 'off-pathway' conformations that do not represent normal enzyme states. A second, more fundamental, problem is that short-lived intermediate structures cannot be captured in crystals because they are thermodynamically or kinetically unstable. Yet, it is likely that an awareness of the existence and functional significance of such intermediates will be required to develop a deeper understanding of the mechanisms operating within molecular machines.
We have designed new experimental tools to complement ongoing structural investigations. Based on the ability to assemble an active RNAPII-like enzyme from recombinant subunits in vitro, it is possible to modify any residue within an intact RNAP to introduce a variety of targeted mutations into any functional domain participating in the NAC [18]. Such a strategy not only allows specific predictions based on available X-ray structures to be tested, but also can be used to explore systematically the functional contributions of individual domains to biochemically detectable activities. New robotic methods that facilitate the labor-intense high-throughput mutagenesis/assembly steps and transcription assays allow this approach to be implemented on a large scale [19]. In a recent study we systematically replaced each of 17 consecutive residues of the Bridge Helix domain with all other 19 possible amino acid side chains [20]. The Bridge Helix, a 35 amino acid α-helix spanning the RNAP active site, controls the flow of nucleic acid substrates and nucleotide precursors through the catalytic site (Figure 1A). Some of the earliest models of the NAC were based on the observation that the C-terminal portion of the Bridge Helix was kinked in some X-ray crystal structures of bacterial RNAPs [5,8], but appeared straight in numerous other crystals of bacterial, archaeal and eukaryotic RNAPs. The existence of two alternative Bridge Helix conformations made it seem likely that a periodic oscillation between straight and kinked conformations would be implicated in translocating RNAPs in single base-pair steps along the DNA template strand [7,10,21-24]. High-throughput mutagenesis of the C-terminal portion of the Bridge Helix provided unexpectedly clear evidence in support of Bridge Helix kinking; mutations destabilizing the normal α-helical conformation in certain positions cause a substantial increase in the specific activity of RNAPs. This phenomenon, referred to as superactivity, relieves a constraint on the catalytic mechanism by increasing the frequency of Bridge Helix isomerization between straight and kinked conformations [20,25].
While the combination of structural observations and mutagenesis data clearly highlights the functional contribution of the C-terminal portion of the Bridge Helix towards controlling the rate of the NAC, the role of the N-terminal portion of the Bridge Helix has thus far remained enigmatic. The primary sequence of this region is exceptionally highly conserved during evolution; for example, the sequences of the N-terminal 15 amino acids are identical between the archaeon Methanocaldococcus jannaschii and humans, and differ by only a single residue from yeast (Figure 1B). Such a high degree of structural conservation over more than two billion years of evolution can be partially accounted for by the fact that the Bridge Helix N-terminus is tightly surrounded by other domains and may therefore be spatially and evolutionarily constrained due to the need to maintain an extensive network of protein-protein interactions (Figure 1A; Additional files 1, 2 and 3; [12,15,25,26]). In apparent agreement with this view, all available X-ray structures of RNAPs show the N-terminal portion of the Bridge Helix in a rigidly α-helical conformation, suggesting the absence of significant conformational changes. For this reason, none of the current models of RNAP function consider the Bridge Helix N-terminus to play any dynamic role during the NAC [1-4,16,17].
New evidence presented here, based on a combination of high-throughput mutagenesis studies and molecular dynamics simulations, demonstrates that such a static view of the Bridge Helix N-terminus is untenable. The results show that this region contains a highly localized molecular hinge, and that the conformation of this site has a substantial influence on the rate of the NAC. In combination with the previously identified C-terminal hinge region, the data reinforces the overarching concept that the Bridge Helix plays a predominantly nanomechanical role during the translocation stage of the NAC by coordinating conformational changes in surrounding domains.
Results
High-throughput mutagenesis reveals evidence for an N-terminal hinge
The N-terminus of the Bridge Helix of the RNAP from the euryarchaeon Methanocaldococcus jannaschii was dissected by high-throughput mutagenesis [19,20]. In this automated approach, each amino acid within the target region (Figure 1B) was replaced with all other 19 residues to reveal local structural requirements. The mutants were then assayed in robotic promoter-independent transcription assays, which provide a consistent measure of the synthetic rate of the NAC and correlate directly with results obtained from a variety of promoter-dependent, abortive- and elongation transcription assays ([18-20]; S. Wiesler and ROJW, unpublished data).
The results reveal unexpected insights into the function of the Bridge Helix N-terminus (Figure 2A). The most eye-catching phenotypes are associated with the mjA' M808 position, which constitutes - both qualitatively and quantitatively - a hotspot for superactivity caused by a chemically diverse range of substitutions. This suggests that the M808 position is structurally very delicately balanced and substitutions with more hydrophilic, bulky hydrophobic, and/or charged residues cause a substantial local disturbance. The degree of superactivity in the strongest N-terminal substitution (mjA' M808-P; >240% superactivity) greatly exceeds the highest level of superactivity displayed by the strongest C-terminal mutant (mjA' S824-P; approximately 170% superactivity; Figure 2A). The observation that the replacement of M808 with proline results in the highest increase in specific activity immediately suggests that the phenotypes are caused predominantly by disruption of the local secondary structure; proline residues destabilize α-helices due to a lack of hydrogen bonding and steric interference with the backbone of the preceding turn [27]. In a manner highly reminiscent of the previously characterized mjA' S824-P substitution phenotype, proline substitutions of the residues immediately surrounding mjA' M808 cause a sharp drop in RNAP catalytic activity, emphasizing the highly localized effect of such conformationally induced changes (Figure 2B; [20]). The locations of two functionally acceptable proline substitutions thus mark the presence of two discrete and separate molecular hinges, that will subsequently be referred to as BH-HN ('Bridge Helix - Hinge N-terminal'; typified by mjA' M808-P) and BH-HC ('Bridge Helix - Hinge C-terminal'; typified by mjA' S824-P). The shared functional property of M808-P and S824-P substitutions, causing maximal levels of superactivity, emphasizes that increased Bridge Helix kinking at the two hinges correlates directly with an increased rate of nucleotide addition [20,25].
Extensive database searches, covering completed genome sequences of a large variety of pro- and eukaryotic species, show that naturally occurring proline substitutions in the Bridge Helix primary sequences are exceptionally rare. The only known instances of proline residues occurring naturally anywhere in the N-terminal part of the Bridge Helix are found in the bacteria Orientia tsutsugamushi [28,29], and certain isolates of Arcobacter butzleri [30] and Bacillus subtilis [31]. In each case, the substituted position is precisely orthologous to mjA' M808 (Figure 2C). In the C-terminal half of the Bridge Helix, the highly divergent plant RNAPIV and RNAPV enzymes display a strong tendency for a proline residue at the position orthologous to archaeal S824 (Figure 2C; [32,33]).
Molecular dynamics reveal a structural basis for the BH-HN hinge mechanism
Insights into the conformational changes of the BH-HC are based on certain X-ray structures of bacterial RNAPs that were fortuitously crystallized in a kinked Bridge Helix conformation [5,8]. Kinking of BH-HC is stabilized by intramolecular interactions between amino acid side chains flanking each side of the hinge ([8,20]; Heindl et al., unpublished observations). The location of the BH-HC and its kinking properties are thus pre-determined by the local amino acid sequence. This raises the question as to whether intrinsic structural features could also account for the molecular mechanism underlying BH-HN function. Inspection of the primary sequence surrounding BH-HN, coupled with insights obtained from molecular dynamic (MD) simulations [34-36], provide a plausible explanation for the structural basis of the BH-HN hinge. Systematic sampling of conformational states at five picoseconds (ps) intervals in a series of 27 independent 200 ps semi-quantitative MD simulations highlights the presence of distinct areas prone to local unfolding, with the most prominent peak centered around mjA' G810 (Figure 3A). Under these simulation conditions, most other regions of the Bridge Helix, including the N- and C-termini, maintain their α-helical conformations at all stages throughout the simulations. Closer inspection of the simulated structures formed by BH-HN kinking reveals a molecular switching mechanism that can be rationalized directly on the basis of the primary amino acid sequence and α-helical geometry (Figure 3B). The kinking of BH-HN critically involves the two glycine residues G809 and G810 that are located immediately C-terminal to M808 and are essentially invariant in all archaeal and eukaryotic RNAPIIs (Figure 1B). Glycine residues display low helix-forming propensity because their high conformational flexibility is entropically unfavourable within geometrically constrained α-helical structures [37,38]. This increased flexibility allows G809 and G810 to flip out of the α-helical conformation to create a flexible hinge (Figure 3B). The flipped conformation is then stabilized further through a variety of non-covalent interactions of M808 with R811 and E812 (Figure 3C). The kinking model is based on elementary structural and thermodynamic principles (as represented by the MD force field), but also strongly supported by the phenotypes of the G809 and G810 mutagenesis series (Figure 2A). In both positions, any residue other than glycine causes a severe reduction in the catalytic activity of RNAP due to an increased helix propensity, which reduces the likelihood of BH-HN kinking. In contrast, another glycine residue located slightly more C-terminal, G813, is noticeably less sensitive to change (Figure 2A). Similarly, the stabilization of BH-HN through van der Waals contacts between M808 and R811 and E812 appears to be relatively non-specific, so that a chemically diverse group of side chains are either acceptable (especially in the R811 position), or result in enhanced stabilization of the BH-HN kink (M808 substitution series). The fundamental requirement for a large side-chain in the M808 position is particularly evident from the fact that certain amino acids with smaller groups (G, A) are not capable of causing the superactivity associated with enhanced BH-HN kinking. It is therefore apparent that the molecular properties of BH-HN, such as its structural stability, are directly determined by the primary amino acid sequence and emerge spontaneously from MD simulations as a property inherent in the primary sequence of the Bridge Helix domain. Mutagenesis changes the biophysical properties of BH-HN, which is, in turn, directly reflected in altered NAC rates (Figure 2A).
BH-HN and BH-HC operate in environments with widely different structural constraints
The presence of two distinct hinges in the Bridge Helix raises the question whether BH-HN and BH-HC are involved in the same mechanism during the NAC. Kinking of either of the two hinges will result in considerable local distortions, predicted to include a spatial redeployment of amino acid side chains and changes in the overall length, flexibility and general topology of the Bridge Helix domain. Kinking of BH-HN could result in altered interactions with adjacent domains, such as the βD-II, Link and F-Loop domains (Figure 1A; Additional files 1, 2 and 3), whereas hinge movements in BH-HC are expected to affect the position and/or mobility of the DNA-RNA hybrid and Trigger Loop conformation (Figure 1A; [20]).
Intriguingly, RNAP IV Bridge Helices also contain an additional two amino acid deletion, which would cause an even more radical change in the Bridge Helix by creating a local 180° twist of the α-helical structure (Figure 2C). This class of mutants combines strictly confined effects (removal of two adjacent side chains) with complex long range effects (realignment of remaining side chains on either side of the deletion to new positions and localized underwinding of the helical structure), which may possibly be coupled with the propagation of stress forces to the N- and C-terminal anchoring points of the Bridge Helix. The radical nature of such twisting mutations serves as an ideal tool to gain a deeper insight into structural constraints acting on the entire Bridge Helix. A series of two-amino acid deletions was prepared (Figure 4A). The results show that the BH-HC region is indeed remarkably resistant to such major conformational changes (Figure 4B). Although X-ray structures provide apparently persuasive arguments for certain residues of the Bridge Helix domain assisting in catalytic functions (for example, residues orthologous to T821 contacting the 3' end of the nascent transcript and/or the incoming rNTP [22]), the deletion phenotypes described here prove that such contributions are either redundant or non-existent in our archaeal system, which is highly conserved in this region. The results furthermore illustrate that, although BH-HC is located much closer than BH-HN to the substrates involved in the NAC, there are surprisingly few topological restraints. In marked contrast, none of the two-amino acid deletions support significant activity in the N-terminal part of the Bridge Helix (Figure 4B), proving that conformational changes are much more restricted in that region (which is consistent with the higher degree of evolutionary identity in the N-terminal half of the Bridge Helix compared to the C-terminal half; Figure 1). The results from the two amino acid deletion scan thus provide a first indication that the local conformational requirements for BH-HN and BH-HC differ quite radically.
The substitutions mjA' M808-P and S824-P provide the strongest pieces of evidence for the existence and functional significance of BH-HN and BH-HC, respectively (Figure 2B[20]). Work in other systems has shown that the physicochemical properties of residues adjacent to prolines contribute considerably to the prevalence of cis/trans peptide bond isomers and therefore strongly influence kink geometry [39-42]. Additional proline substitutions of the positions immediately N- and C-terminal to either M808 or S824 (that is, mjA' A807-P/M808-P; M808-P/G809-P and Q823-P/S824-P; S824-P/G825) revealed examples of further intriguing differences between BH-HN and BH-HC: the double-proline substitution in BH-HN abolished the superactivity caused by M808-P, whereas the presence of an additional proline residue N-terminal to S824 continued to support the superactivity of S824-P (Figure 5B). The activity of the Q823-P/S824-P double mutant proves explicitly that the presence of two proline residues in this particular location of the Bridge Helix is not only compatible with catalytic function, but also compatible with superactivity. Prolyl-proline preferentially adopt an elongated polyproline II structure (87%), or less frequently (13%) a β-turn (Additional file 4A; [43,44]). Either of the structures would cause a substantial local increase in the flexibility of BH-HC.
In order to investigate this unexpected tolerance to the presence of two adjacent prolines in positions 823 and 824 in more detail, a complete substitution series of the residues around position mjA' S824-P was prepared, generating an assortment of systematic double-mutants (Figure 5A; mjA' Q823-X/S824-P and S824-P/G825-X; with X denoting 19 different variants). All substitutions N-terminal to S824-P (i.e. mjA' Q823-X/S824-P) displayed an almost completely invariant degree of extensive superactivity that was indistinguishable from the original S824-P mutant (Figure 5B). This result is remarkable because previous mutagenesis of Q823 revealed a broad spectrum of activity, ranging from substantial loss of function (Q823-C or I) to superactivity (Q823-D or E; [20]). It is evident that in the double-mutants the chemical nature of the side-chain residue in position 823 exerts no further functional influence, presumably because of the major distortion already caused by the proline substitution in position 824. Once such a gross structural alteration has occurred in S824-P, any additional changes in the adjacent N-terminal residue become structurally irrelevant.
Because Q823-P/S824-P displayed no loss of superactivity, the residue immediately N-terminal to the double-proline substitution was also permutated, resulting in variants containing three adjacent substitutions in the BH-HC hinge region (Figure 5A; mjA' A822-X/Q823-P/S824-P). The majority of these substitutions, including the triple proline mutant A822-P/Q823-P/S824-P (Figure 5C; Additional file 4B), still displayed clearly detectable superactivity, albeit at a slightly reduced level. The tolerance of the BH-HC hinge to radical mutagenesis, as previously observed in the two-amino acid deletion screen, is therefore also reflected in the unexpectedly high tolerance to multiple proline substitutions in that region. This geometric freedom is, however, also spatially limited: substitutions in positions C-terminal to S824-P (Figure 5A) were mostly inactive, indicating that despite the structural flexibility of the positions N-terminal to S824-P, the C-terminal positions are functionally constrained (Figure 5D; a similar trend is also apparent in the two amino acid deletion scan data [see Figure 4B]).
BH-HN and BH-HC control activity of the catalytic site differently
Taken together, the extensive body of mutagenesis data leads to the conclusion that the BH-HN and BH-HC structures operate within intramolecular environments that vary widely in their conformational constraints. The chemistry of the NAC is typically catalyzed by two magnesium ions, immobilized non-covalently within the active site (via the 'metal A' and 'metal B' motifs; for example, [45,46]). A complete substitution of Mg2+ by Mn2+ reduces the activity of archaeal RNAP to approximately 40% (data not shown); the reduced activity is probably caused by the slightly larger Mn2+ ions causing suboptimal spacing [47], and Mn2+-catalyzed reactions being more promiscuous in their requirements for alignment of the reactive groups [48]. Intriguingly, assays of superactive BH-HN and BH-HC mutants in the presence of Mn2+ demonstrated that they fell into distinct categories. mjA' M808-D, M808-E and M808-P continued to display superactivity, whereas the activities of S824-P or Q823-P/S824-P only reached wildtype levels under these conditions (Figure 6). We can therefore conclude that the superactive substitutions in the BH-HN and BH-HC regions have different consequences because conformational changes in different parts of the Bridge Helix affect separate stages of the NAC. The reduced catalytic activity caused by the presence of Mn2+ ions in the active site becomes rate-limiting in Q823-D/E or S824-P, whereas M808-P overcomes this limitation to a large extent by stimulating transcription through an independent pathway, most likely involving the β-D II and Link domains.
Discussion
An expanded conformational repertoire of the Bridge Helix domain
The results presented here reveal several new surprising insights, including compelling evidence for the existence of a molecular hinge region in the N-terminal portion of the Bridge Helix and evidence for an unexpectedly large degree of tolerance to radical structural changes in the C-terminal part of this domain. It is apparent that the Bridge Helix domain displays a much greater conformational freedom than anticipated from currently available X-ray structures of RNAPs. Few, if any, of the residues of the Bridge Helix appear to make any specific contribution to catalysis other than through defining the nanomechanical properties intrinsic to the α-helical structure. The implications for mechanistic models aimed at describing the NAC are manifold, ranging from a re-evaluation of the structural basis of the RNAP translocation mechanism, to highlighting the hitherto neglected role of highly conserved domains in the catalytic site, and to obtaining a better understanding of the evolutionary diversity of Bridge Helices in different organisms.
Currently, we have only a limited understanding of the forces acting on the Bridge Helix that could drive localized conformational changes. Attempts to model the full NAC using molecular dynamics studies are severely limited by the large size of multi-subunit RNAPs and the immense computational effort that would be required to simulate the molecular events expected to last from 10s to 100s of milliseconds for the extension of a nascent transcript by a single nucleotide (approximately 30 ms/rNTP incorporation under optimal in vivo conditions; for example, [49]). The study of the intrinsic structural properties of individual domains by fully atomistic computer simulations reveal, however, interesting nanomechanical properties that have functional implications for the RNAP translocation mechanism [50-52]. The Bridge Helix domain contains intrinsically unstable α-helical regions that undergo spontaneous kinking motions, even in the absence of externally applied forces (Figure 3A). At least two of these unstable regions correspond precisely to the biochemically-mapped BH-HN and BH-HC regions. Strategically-placed glycine residues, such as mjA' G809, G810 (for BH-HN) and G825 (for BH-HC) provide the structural basis for forming these molecular hinges, with surrounding residues determining additional kinking parameters, such as the likelihood of kinking and/or the half-life of the kink after its isomerization (Figure 2A; Heindl et al., unpublished observations). Interestingly, the simulations also highlight a potential third structurally labile region near the center of the Bridge Helix domain, spanning residues mjA' Q817 to R820. The relative sensitivity of this sequence to proline substitutions (Figure 2B) suggests, however, that this area of instability behaves functionally differently to BH-HN and BH-HC. It is possible that structural fluctuations in the central part of the Bridge Helix play a more dynamic role in supporting short-lived conformational changes that can either compensate for major structural rearrangements due to BH-HN and BH-HC kinking, or act as a store of 'fast' motions to lubricate the kinking of the hinge regions kinetically [34]. Although such hypotheses are currently beyond experimental proof, it is interesting to note that several, structurally unexplained superactivity mutants map to this central sequence (for example, mjA' D816-N; Q817-S/T/C/K and V819-K;Figure 2A[20]), and that this region is also exceptionally tolerant to radical twisting of the α-helical axis induced by deletions of two-amino acid segments (Figure 4B). Furthermore, data from anisotropic network mode analysis suggests that rigid body movements of the clamp domain may exert forces onto the center of the Bridge Helix via the Switch domains, potentially linking transcription to a ratchet-like translocation mechanism (Additional file 5; [53]). The Bridge Helix thus appears to have evolved specific nanomechanical properties that result in the controlled and highly localized isomerization of its conformation in response to allosteric alterations in the surrounding protein domains and nucleic acid substrates.
The Bridge Helix N-terminus is tightly surrounded by other domains, such as the β-D loop II [54], the experimentally uncharacterized Link domain [25,55], and the F-Loop [26]. As evident from the exceptionally high degree of sequence conservation (Additional files 1A, 2A and 3A), each of these domains is likely to play key roles in the NAC. The β-D II domain is a loop-like structure that interacts extensively in a side-way interaction with the central part of the Bridge Helix, while simultaneously maintaining direct physical contact with the most recently incorporated nucleotide (i-1 position). The interaction between the β-D II domain and nascent transcript also creates an extended binding pocket for the rNTP (additional file 1B, C). Similarly, the highly conserved Link domain is strategically placed to interact with the Bridge Helix N-terminus, β-D II domain, nascent transcript (i-1 and i-2 positions) and the incoming rNTP (Additional file 2B). Finally, an N-terminal extension of the Bridge Helix, the F-Loop, forms an extensive cap-like structure that contacts the Link domain and the tip of the Trigger Loop (Additional file 3B; [26]). The differential response of superactive substitutions in BH-HN and BH-HC to the presence of Mn2+ in the catalytic site supports the view that conformational changes in these regions cause a distinct effect in the catalytic site of RNAP. While the C-terminal Bridge Helix operates predominantly by influencing Trigger Loop conformation, kinking of the N-terminus via BH-HN most likely alters the positions and/or conformations of the β-D II and Link domains, which are in direct physical contact with the nucleotide and nucleic acid substrates.
In vivo occurrence of proline-substituted Bridge Helix hinges
The existence and biological relevance of the experimentally determined hinge positions that tolerate proline substitutions is independently confirmed by a small number of naturally occurring Bridge Helix variants. Considering the large number of genomes sequenced thus far (currently including approximately 1,200 microbial and approximately 800 eukaryotic genomes; http://www.ncbi.nlm.nih.gov/sites/genome/), it can already be stated with confidence that naturally occurring proline substitutions are an exceedingly rare phenomenon. The three bacterial species that contain naturally occurring proline-substitutions in BH-HN represent two diverse bacterial phyla (Figure 7), and each of these phyla contains other closely related species with sequenced genomes, which do not contain any prolines in their Bridge Helices. Proline substitutions in the BH-HN region therefore appear to evolve spontaneously and independently in different bacterial lineages and subsequently remain restricted to individual species or strains. In at least some cases this evolutionary adaptation may be associated with significant simplifications of the transcriptional machinery, such as loss of the ω-RNAP subunit and absence of transcription-coupled repair (O. tsutsugamushi; [28,29]). These proline-containing Bridge Helix variants do not only provide strong and independent in vivo confirmation of the results identified in the high-throughput mutagenesis screen (Figure 2B), but prove that the results obtained in an archaeal model system also apply to the bacterial and eukaryotic domains; Bridge Helix kinking via two structurally independent molecular hinges is a universal mechanism operating across the entire evolutionary spectrum.
Conclusions
The application of a high-throughput in vitro mutagenesis approach to the N-terminal portion of the M. jannaschii bridge helix domain has revealed a range of new insights that could not be anticipated from previously available structural and genetic data. The data sets (Figures 2A, 4B and 5B) clearly illustrate that many of the most interesting insights were derived from substitutions that would almost certainly not have been designed using a rational, structure-led approach (for example, mjA' M808-P; ΔD816/Q817; A822-P/Q823-P/S824-P). Furthermore, widely used methods, based predominantly on alanine-scanning mutagenesis [56], are also limited in their capacity to uncover some of the most interesting phenotypes (Figure 2A). It is therefore clear that automated high-throughput methods for generating site-directed mutants and assaying their phenotypic consequences will play an increasingly important role in exploratory investigations of protein structure/function relationships as part of a diverse strategy aimed at obtaining new insights into complex biological systems [57].
Methods
High-throughput mutagenesis
Combinatorial permutation libraries, containing all 19 variants with codon replacements optimized for expression in E. coli were purchased for mjA' H806, A807, M808, G809, G810, R811, E812, G813, Q823-X/S824-P, A822-X/Q823-P/S824-P, S824-P/G825-X, and S824-P/G825-P/Y826-X from GeneArt (Regensburg, Germany). The mutations, located within a BstBI-SbfI fragment of the codon-optimized C-terminus of the Bridge Helix [19] were transferred to a pET21a bacterial expression vector for the production of full-length, intein-free mjA' subunits. The presence of the desired mutations in the expression constructs was verified by DNA sequencing. DNA constructs containing the two amino-acid deletions across the Bridge Helix were purchased from GeneArt as synthetic gene fragments and transferred to bacterial expression plasmids as described above.
RNAP Factory
The robotic procedures for high-throughput growth of bacterial expression strains, recombinant subunit purification and automated assembly into recombinant RNAPs (the 'RNAP Factory') have been described previously [19,20,58]. Briefly, bacterial constructs expressing the Bridge Helix mutants were transformed into chemically competent Acella cells (ΔendAΔrecA derivatives of E. coli BL21 [DE3]; EdgeBio, Gaithersburg, Maryland, USA). After growth for 16 to 18 hours at 37°C in 24-well plates in autoinduction medium (OverNight Express, Novagen, Nottingham, UK) the bacteria were harvested and used for a centrifugation-based robotic inclusion body purification protocol. The mjA' inclusion bodies were automatically solubilised in the presence of 8 M urea and quantitated at 562 nm with the bicinchoninic acid assay. Recombinant RNAPs containing the mjA' variants were assembled in a 96-well format dialysis cell using an urea-gradient from 6 M to urea-free spanning 16 hours at room temperature [19]. Each mutant subunit was expressed, purified and assembled in vitro at least in quadruplicate to assure consistency and reproducibility. The assembled RNAPs were harvested and used immediately for robotic transcription assays (see below). The assembly efficiencies of key mutants (including, among others, mjA' M808-P; A807-P/M808-P; M808-P/G809-P; S824-P; Q823-P/S824-P; S824-P/G825-P; A822-P/Q823-P/S824-P) were compared to assembly rates achieved with the wildtype mjA' subunit by assaying the reconstituted polymerases at limiting and saturating template DNA concentrations (see Additional file 19 in [20] for details and examples); no differences between wild-type and mutant enzymes were detected. For replacement of Mg2+ ions in the catalytic site with Mn2+ (for the transcription assays shown in Figure 6), the in vitro assembly process was carried out as described above, but with dialysis- and transcription buffers containing 10 mM Mn2-O-acetate instead of Mg2-O-acetate.
In vitro transcription assays
The robotically implemented high-throughput trichloroacetic acid (TCA) precipitation assays, measuring the incorporation of (α-32P) rUTP into TCA-insoluble products, were carried out exactly as previously described [19,20]. Briefly, assay mixtures were incubated for 45 minutes at 70°C in thin-wall PCR plates before precipitating the radiolabeled transcripts by the addition of ice-cold TCA solution. After incubation for 30 minutes at 1°C, the nucleic acid precipitates were collected by vacuum filtration on a 96-GF/F glass fiber filter plate (Whatman, Maidstone, UK) and extensively washed with further aliquots of ice-cold TCA solution. After drying the filters, scintillant (MicroScint-O; Perkin-Elmer, Cambridge, UK) was added and the amount of incorporated (α-32P) rUTP quantified with a microplate counter (TopCount NXT, Packard, Cambridge, UK).
High-throughput molecular dynamics simulation
Molecular dynamics (MD) simulations were performed using GROMACS [59]. In preparation for MD simulations, the archaeal Bridge Helix was modelled on the S. cerevisiae 'active elongation' RNAPII structure (PDB #2E2H) using the SwissModel server in automated mode [60]. The simulation production runs were executed in a fully solvated atomistic production mode without restraints. The energies of the modelled structures were initially minimized in vacuum using GROMACS with an AMBER force field (http://ambermd.org/) on a CPU cluster of the National Grid Service (NGS). During pre-processing the system was warmed to 200K under the control of a Berendsen thermostat with a coupling constant of 1.0 ps. All structures were energy-minimized in pre-equilibrated simulation boxes filled with TP3 water, and sodium and chloride ions were added to a final concentration of approximately 150 mM. For production runs the temperature was increased to 300 K (27°C). The equations of motion were integrated using a step-size of two femtoseconds. The trajectories generated by 27 independent 200 picoseconds simulation runs, were analyzed using STRIDE [61], as implemented in VMD [62]. The frequencies of particular residues adopting a 'coil' conformation during 5 ps analysis windows were plotted relative to the Bridge Helix sequence.
Abbreviations
BH-Hc: molecular hinge located within the carboxy-terminal portion of the Bridge Helix; BH-HN : molecular hinge located within the amino-terminal portion of the Bridge Helix; NAC: nucleotide addition cycle; rNTP: ribonucleotide triphosphate; ps: picosecond; RNAP: RNA polymerase; TCA: trichloroacetic acid.
Supplementary Material
Acknowledgements
This work was supported by a Wellcome Trust project grant [078043/Z/05/Z] to ROJW. I would like to thank Dominic Conquest for help with the subcloning of the M808-X and G813-X expression clone libraries into expression vectors. I would also like to thank Hans Heindl for advice on the molecular dynamics simulations, Noam Weingarten for programming and Tamas Kiss and Gabor Terstyanszki for advice and provision of computing resources on the UK National Grid Service. I also appreciate the helpful comments on the manuscript received from Martin Buck, Patricia Burrows and Simone Wiesler.
References
- Brueckner F, Ortiz J, Cramer P. A movie of the RNA polymerase nucleotide addition cycle. Curr Opin Struct Biol. 2009;19:294–299. doi: 10.1016/j.sbi.2009.04.005. [DOI] [PubMed] [Google Scholar]
- Erie DA, Kennedy SR. Forks, pincers, and triggers: the tools for nucleotide incorporation and translocation in multi-subunit RNA polymerases. Curr Opin Struct Biol. 2009;19:708–714. doi: 10.1016/j.sbi.2009.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Svetlov V, Nudler E. Macromolecular micromovements: how RNA polymerase translocates. Curr Opin Struct Biol. 2009;19:701–707. doi: 10.1016/j.sbi.2009.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kireeva M, Kashlev M, Burton ZF. Translocation by multi-subunit RNA polymerases. Biochim Biophys Acta. 2010;1799:389–401. doi: 10.1016/j.bbagrm.2010.01.007. [DOI] [PubMed] [Google Scholar]
- Zhang G, Campbell EA, Minakhin L, Richter C, Severinov K, Darst SA. A crystal structure of Thermus aquaticus core RNA polymerase at 3.3 Å resolution. Cell. 1999;98:811–824. doi: 10.1016/S0092-8674(00)81515-9. [DOI] [PubMed] [Google Scholar]
- Cramer P, Bushnell DA, Fu J, Gnatt AL, Maier-Davis B, Thompson NE, Burgess RR, Edwards AM, David PR, Kornberg RD. Architecture of RNA polymerase II and implications for the transcription mechanism. Science. 2000;288:640–649. doi: 10.1126/science.288.5466.640. [DOI] [PubMed] [Google Scholar]
- Cramer P, Bushnell DA, Kornberg RD. Structural basis of transcription: RNA polymerase II at 2.8. Å resolution. Science. 2001;292:1863–1876. doi: 10.1126/science.1059493. [DOI] [PubMed] [Google Scholar]
- Vassylyev DG, Sekine SI, Laptenko O, Lee J, Vassylyeva MN, Borukhov S, Yokoyama S. Crystal structure of a bacterial RNA polymerase holoenzyme at 2.6 Å resolution. Nature. 2002;417:712–719. doi: 10.1038/nature752. [DOI] [PubMed] [Google Scholar]
- Korkhin Y, Unligil UM, Littlefield O, Nelson PJ, Stuart DI, Sigler PB, Bell SD, Abrescia NG. Evolution of complex RNA polymerases: the complete archaeal RNA polymerase structure. PLoS Biol. 2009;5:7. doi: 10.1371/journal.pbio.1000102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gnatt AL, Cramer P, Fu J, Bushnell DA, Kornberg RD. Structural basis of transcription: an RNA polymerase II elongation complex at 3.3 A resolution. Science. 2001;292:1876–1882. doi: 10.1126/science.1059495. [DOI] [PubMed] [Google Scholar]
- Westover KD, Bushnell DA, Kornberg RD. Structural basis of transcription: separation of RNA from DNA by RNA polymerase II. Science. 2004;303:1014–1016. doi: 10.1126/science.1090839. [DOI] [PubMed] [Google Scholar]
- Wang D, Bushnell DA, Westover KD, Kaplan CD, Kornberg RD. Structural basis of transcription: role of the trigger loop in substrate specificity and catalysis. Cell. 2006;127:941–954. doi: 10.1016/j.cell.2006.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vassylyev DG, Vassylyeva MN, Perederina A, Tahirov TH, Artsimovitch I. Structural basis for transcription elongation by bacterial RNA polymerase. Nature. 2007;448:157–162. doi: 10.1038/nature05932. [DOI] [PubMed] [Google Scholar]
- Brueckner F, Cramer P. Structural basis of transcription inhibition by alpha-amanitin and implications for RNA polymerase II translocation. Nat Struct Mol Biol. 2008;15:811–818. doi: 10.1038/nsmb.1458. [DOI] [PubMed] [Google Scholar]
- Kaplan CD, Larsson KM, Kornberg RD. The RNA polymerase II trigger loop functions in substrate selection and is directly targeted by alpha-amanitin. Mol Cell. 2008;30:547–556. doi: 10.1016/j.molcel.2008.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Borukhov S, Nudler E. RNA polymerase: the vehicle of transcription. Trends Microbiol. 2008;16:126–134. doi: 10.1016/j.tim.2007.12.006. [DOI] [PubMed] [Google Scholar]
- Vassylyev DG. Elongation by RNA polymerase: a race through roadblocks. Curr Opin Struct Biol. 2009;19:691–700. doi: 10.1016/j.sbi.2009.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werner F, Weinzierl RO. A recombinant RNA polymerase II-like enzyme capable of promoter-specific transcription. Mol Cell. 2002;10:635–646. doi: 10.1016/S1097-2765(02)00629-9. [DOI] [PubMed] [Google Scholar]
- Nottebaum S, Tan L, Trzaska D, Carney HC, Weinzierl RO. The RNA polymerase factory: a robotic in vitro assembly platform for high-throughput production of recombinant protein complexes. Nucleic Acids Res. 2008;36:245–252. doi: 10.1093/nar/gkm1044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan L, Wiesler S, Trzaska D, Carney HC, Weinzierl RO. Bridge helix and trigger loop perturbations generate superactive RNA polymerases. J Biol. 2008;7:40. doi: 10.1186/jbiol98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klug A. Structural biology. A marvellous machine for making messages. Science. 2001;292:1844–1846. doi: 10.1126/science.1062384. [DOI] [PubMed] [Google Scholar]
- Bar-Nahum G, Epshtein V, Ruckenstein AE, Rafikov R, Mustaev A, Nudler E. A ratchet mechanism of transcription elongation and its control. Cell. 2005;120:183–193. doi: 10.1016/j.cell.2004.11.045. [DOI] [PubMed] [Google Scholar]
- Sousa R. Machinations of a Maxwellian demon. Cell. 2005;120:155–156. doi: 10.1016/j.cell.2005.01.003. [DOI] [PubMed] [Google Scholar]
- Kaplan CD, Kornberg RD. A bridge to transcription by RNA polymerase. J Biol. 2008;7:39. doi: 10.1186/jbiol99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weinzierl RO. Nanomechanical constraints acting on the catalytic site of cellular RNA polymerases. Biochem Soc Trans. 2010;38:428–432. doi: 10.1042/BST0380428. [DOI] [PubMed] [Google Scholar]
- Miropolskaya N, Artsimovitch I, Klimasauskas S, Nikiforov V, Kulbachinskiy A. Allosteric control of catalysis by the F loop of RNA polymerase. Proc Natl Acad Sci USA. 2009;106:18942–18947. doi: 10.1073/pnas.0905402106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barlow DJ, Thornton JM. Helix geometry in proteins. J Mol Biol. 1988;201:601–619. doi: 10.1016/0022-2836(88)90641-9. [DOI] [PubMed] [Google Scholar]
- Cho NH, Kim HR, Lee JH, Kim SY, Kim J, Cha S, Kim SY, Darby AC, Fuxelius HH, Yin J, Kim JH, Kim J, Lee SJ, Koh YS, Jang WJ, Park KH, Andersson SG, Choi MS, Kim IS. The Orientia tsutsugamushi genome reveals massive proliferation of conjugative type IV secretion system and host-cell interaction genes. Proc Natl Acad Sci USA. 2007;104:7981–7986. doi: 10.1073/pnas.0611553104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakayama K, Yamashita A, Kurokawa K, Morimoto T, Ogawa M, Fukuhara M, Urakami H, Ohnishi M, Uchiyama I, Ogura Y, Ooka T, Oshima K, Tamura A, Hattori M, Hayashi T. The whole-genome sequencing of the obligate intracellular bacterium Orientia tsutsugamushi revealed massive gene amplification during reductive genome evolution. DNA Res. 2008;15:185–199. doi: 10.1093/dnares/dsn011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stoeva K, Ward BF. Genome mapping of Arcobacter butzleri. FEMS Microbiol Lett. 2006;256:290–297. doi: 10.1111/j.1574-6968.2006.00135.x. [DOI] [PubMed] [Google Scholar]
- Yasumoto K, Liu H, Jeong SM, Ohashi Y, Kakinuma S, Tanaka K, Kawamura F, Yoshikawa H, Takahashi H. Sequence analysis of a 50 kb region between spo0H and rrnH on the Bacillus subtilis chromosome. Microbiology. 1996;142:3039–3046. doi: 10.1099/13500872-142-11-3039. [DOI] [PubMed] [Google Scholar]
- Huang L, Jones AME, Searle I, Patel K, Vogler H, Hubner NC, Baulcombe DC. An atypical RNA polymerase involved in RNA silencing shares small subunits with RNA polymerase II. Nat Struct Mol Biol. 2008;16:91–93. doi: 10.1038/nsmb.1539. [DOI] [PubMed] [Google Scholar]
- Ream TS, Haag JR, Wierzbicki AT, Nicora CD, Norbeck A, Zhu JK, Hagen G, Guilfoyle TJ, Paša-Tolić L, Pikaard CS. Subunit compositions of the RNA-silencing enzymes Pol IV and Pol V reveal their origins as specialized forms of RNA Polymerase II. Mol Cell. 2009;33:1–12. doi: 10.1016/j.molcel.2008.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karplus M, McCammon JA. Dynamics of proteins: elements and function. Annu Rev Biochem. 1983;53:263–300. doi: 10.1146/annurev.bi.52.070183.001403. [DOI] [PubMed] [Google Scholar]
- Karplus M, McCammon JA. Molecular dynamics simulations of biomolecules. Nat Struct Biol. 2002;9:646–652. doi: 10.1038/nsb0902-646. [DOI] [PubMed] [Google Scholar]
- Adcock SA, McCammon JA. Molecular dynamics: survey of methods for simulating the activity of proteins. Chem Rev. 2006;106:1589–1615. doi: 10.1021/cr040426m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pace CN, Scholtz JM. A helix propensity scale based on experimental studies of peptides and proteins. Biophys J. 1998;75:422–427. doi: 10.1016/S0006-3495(98)77529-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- López-Llano J, Campos LA, Sancho J. α-helix stabilization by alanine relative to glycine: roles of polar and apolar solvent exposures and of backbone entropy. Proteins. 2006;64:769–778. doi: 10.1002/prot.21041. [DOI] [PubMed] [Google Scholar]
- Grathwohl C, Wüthrich K. The X-Pro peptide bond as an nmr probe for conformational studies of flexible linear peptides. Biopolymers. 1976;15:2025–2041. doi: 10.1002/bip.1976.360151012. [DOI] [PubMed] [Google Scholar]
- Frömmel C, Preissner R. Prediction of prolyl residues in cis-conformation in protein structures on the basis of the amino acid sequence. FEBS Lett. 1990;277:159–163. doi: 10.1016/0014-5793(90)80833-5. [DOI] [PubMed] [Google Scholar]
- Pal D, Chakrabarti P. Cis peptide bonds in proteins: residues involved, their conformations, interactions and locations. J Mol Biol. 1999;294:271–288. doi: 10.1006/jmbi.1999.3217. [DOI] [PubMed] [Google Scholar]
- Pahlke D, Freund C, Leitner D, Labudde D. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence. BMC Struct Biol. 2005;5:8. doi: 10.1186/1472-6807-5-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hudáky I, Perczel A. Prolylproline unit in model peptides and in fragments from databases. Proteins. 2008;70:1389–1407. doi: 10.1002/prot.21630. [DOI] [PubMed] [Google Scholar]
- Saha I, Chatterjee B, Shamala N, Balaram P. Crystal structures of peptide enantiomers and racemates: probing conformational diversity in heterochiral pro-pro sequences. Biopolymers. 2008;90:537–543. doi: 10.1002/bip.20982. [DOI] [PubMed] [Google Scholar]
- Sosunov V, Sosunova E, Mustaev A, Bass I, Nikiforov V, Goldfarb A. Unified two-metal mechanism of RNA synthesis and degradation by RNA polymerase. EMBO J. 2003;22:2234–2244. doi: 10.1093/emboj/cdg193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sosunov V, Zorov S, Sosunova E, Nikolaev A, Zakeyeva I, Bass I, Goldfarb A, Nikiforov V, Severinov K, Mustaev A. The involvement of the aspartate triad of the active center in all catalytic activities of multisubunit RNA polymerase. Nucleic Acids Res. 2005;33:4202–4211. doi: 10.1093/nar/gki688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Castro C, Smidansky E, Maksimchuk KR, Arnold JJ, Korneeva VS, Götte M, Konigsberg W, Cameron CE. Two proton transfers in the transition state for nucleotidyl transfer catalyzed by RNA- and DNA-dependent RNA and DNA polymerases. Proc Natl Acad Sci USA. 2007;104:4267–4272. doi: 10.1073/pnas.0608952104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang Y, Beaudry A, McSwiggen J, Sousa R. Determinants of ribose specificity in RNA polymerization: effects of Mn2+ and deoxynucleoside monophosphate incorporation into transcripts. Biochemistry. 1997;36:13718–13728. doi: 10.1021/bi971609o. [DOI] [PubMed] [Google Scholar]
- Mason PB, Struhl K. Distinction and relationship between elongation rate and processivity of RNA Polymerase II in vivo. Mol Cell. 2005;17:831–840. doi: 10.1016/j.molcel.2005.02.017. [DOI] [PubMed] [Google Scholar]
- Feig M, Burton ZF. RNA polymerase II flexibility during translocation from normal mode analysis. Proteins. 2010;78:434–46. doi: 10.1002/prot.22560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seibold SA, Singh BN, Zhang C, Kireeva M, Domecq C, Bouchard A, Nazione AM, Feig M, Cukier RI, Coulombe B, Kashlev M, Hampsey M, Burton ZF. Conformational coupling, bridge helix dynamics and active site dehydration in catalysis by RNA polymerase. Biochim Biophys Acta. 2010;1799:575–587. doi: 10.1016/j.bbagrm.2010.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang X, Wang D, Weiss DR, Bushnell DA, Kornberg RD, Levitt M. RNA polymerase II trigger loop residues stabilize and position the incoming nucleotide triphosphate in transcription. Proc Natl Acad Sci USA. 2010;107:15745–15750. doi: 10.1073/pnas.1009898107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yildirim Y, Doruker P. Collective motions of RNA polymerases. Analysis of core enzyme, elongation complex and holoenzyme. J Biomol Struct Dyn. 2004;22:267–280. doi: 10.1080/07391102.2004.10507000. [DOI] [PubMed] [Google Scholar]
- Toulokhonov I, Zhang J, Palangat M, Landick R. A central role of the RNA polymerase trigger loop in active-site rearrangement during transcriptional pausing. Mol Cell. 2007;27:406–419. doi: 10.1016/j.molcel.2007.06.008. [DOI] [PubMed] [Google Scholar]
- Tavormina PL, Reznikoff WS, Gross CA. Identifying interacting regions in the β subunit of Escherichia coli RNA polymerase. J Mol Biol. 1996;258:213–223. doi: 10.1006/jmbi.1996.0244. [DOI] [PubMed] [Google Scholar]
- Cunningham BC, Wells JA. High-resolution epitope mapping of hGH-receptor interactions by alanine-scanning mutagenesis. Science. 1989;244:1081–1085. doi: 10.1126/science.2471267. [DOI] [PubMed] [Google Scholar]
- O'Malley MA, Elliott KC, Haufe C, Burian RM. Philosophies of funding. Cell. 2009;138:611–615. doi: 10.1016/j.cell.2009.08.008. [DOI] [PubMed] [Google Scholar]
- Wiesler SC, Weinzierl ROJ. The Linker domain of basal transcription factor TFIIB controls distinct recruitment and transcription stimulation functions. Nucl Acids Res. 2010. in press . [DOI] [PMC free article] [PubMed]
- Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJ. GROMACS: fast, flexible, and free. J Comput Chem. 2005;26:1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
- Arnold K, Bordoli L, Kopp J, Schwede T. The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics. 2006;22:195–201. doi: 10.1093/bioinformatics/bti770. [DOI] [PubMed] [Google Scholar]
- Heinig M, Frishman D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 2004. pp. W500–502. [DOI] [PMC free article] [PubMed]
- Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- Bern M, Goldberg D. Automatic selection of representative proteins for bacterial phylogeny. BMC Evol Biol. 2005;5:34. doi: 10.1186/1471-2148-5-34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berkholz DS, Krenesky PB, Davidson JR, Karplus PA. Protein Geometry Database: a flexible engine to explore backbone conformations and their relationships to covalent geometry. Nucleic Acids Res. 2010;38:D320–325. doi: 10.1093/nar/gkp1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.