Precise quantification of translation inhibition by mRNA structures that overlap with the ribosomal footprint in N-terminal coding sequences

Amin Espah Borujeni; Daniel Cetnar; Iman Farasat; Ashlee Smith; Natasha Lundgren; Howard M Salis

doi:10.1093/nar/gkx061

. 2017 Feb 1;45(9):5437–5448. doi: 10.1093/nar/gkx061

Precise quantification of translation inhibition by mRNA structures that overlap with the ribosomal footprint in N-terminal coding sequences

Amin Espah Borujeni ¹, Daniel Cetnar ¹, Iman Farasat ¹, Ashlee Smith ¹, Natasha Lundgren ¹, Howard M Salis ^1,^2,^*

PMCID: PMC5435973 PMID: 28158713

Abstract

A mRNA's translation rate is controlled by several sequence determinants, including the presence of RNA structures within the N-terminal regions of its coding sequences. However, the physical rules that govern when such mRNA structures will inhibit translation remain unclear. Here, we introduced systematically designed RNA hairpins into the N-terminal coding region of a reporter protein with steadily increasing distances from the start codon, followed by characterization of their mRNA and expression levels in Escherichia coli. We found that the mRNAs’ translation rates were repressed, by up to 530-fold, when mRNA structures overlapped with the ribosome's footprint. In contrast, when the mRNA structure was located outside the ribosome's footprint, translation was repressed by <2-fold. By combining our measurements with biophysical modeling, we determined that the ribosomal footprint extends 13 nucleotides into the N-terminal coding region and, when a mRNA structure overlaps or partially overlaps with the ribosomal footprint, the free energy to unfold only the overlapping structure controlled the extent of translation repression. Overall, our results provide precise quantification of the rules governing translation initiation at N-terminal coding regions, improving the predictive design of post-transcriptional regulatory elements that regulate translation rate.

INTRODUCTION

A mRNA's translation rate is controlled by its sequence through the collective action of several ribosome-mRNA interactions (1–11), creating a complicated sequence-function relationship. Developing a precise and predictive understanding of these interactions has become essential to controlling protein expression levels for a wide variety of biotechnological applications (12), including biosensor development (13–15), metabolic pathway engineering (16–24) and genetic circuit engineering (25–27). Within this sequence-function relationship, different portions of the mRNA play distinct roles, though most studies have largely focused on how the bacterial ribosome binding site sequence affects the mRNA's translation rate (2,14,28). Beyond the ribosome binding site, it has been established that a mRNA's protein coding sequence can greatly affect its translation initiation rate (3,7,29), though the biophysical rules that govern the extent of its control remains unpredictable and poorly quantified. Here, we apply a learn-by-design approach to systematically elucidate the biophysical rules that govern when a protein coding sequence will repress a mRNA's translation rate, and to precisely quantify the length of the ribosome's footprint prior to translation initiation.

Translation initiation is a rate-limiting step in gene expression whereby the 30S ribosomal subunit binds to a mRNA's standby site, hybridizes to its Shine-Dalgarno sequence (SD), and inserts the coding region into its entry channel to form a 30S initiation complex (30SIC), together with tRNA^fMet and initiation factors (2,30,31). Afterward, the 50S ribosomal subunit is recruited to form a 70SIC, GTP is hydrolyzed, and translation elongation begins (32). Notably, the presence of mRNA structures will inhibit 30SIC formation and repress the mRNA's translation rate, though the magnitude of this effect will depend on the mRNA structures’ locations, thermodynamics and folding kinetics (8,10,11,33–35).

In general, there are three categories of mRNA structures that affect a mRNA's translation rate. First, if a mRNA structure is located in the standby site region, upstream of the Shine-Dalgarno sequence, then the ribosome is not required to unfold the structure to initiate translation (10). Instead, a mRNA structure in the standby site will only affect the mRNA's translation rate by altering the size of the ribosome's ‘landing pad’, quantified by the amount of single-stranded RNA available to bind to the ribosome's platform domain. The biophysical rules that govern how standby site sequence, structure, and geometry control a mRNA's translation rate were previously elucidated through characterization of 136 mRNAs, employing a learn-by-design approach (10). Second, if a mRNA structure sequesters or overlaps with the Shine-Dalgarno sequence (the 16S ribosomal RNA binding site), then it must be unfolded prior to translation initiation, and therefore the energy needed to unfold those structures will destabilize 30SIC formation (11,33). Similarly, any mRNA structures that overlap with the spacer or start codon regions also need to be unfolded prior to translation initiation. For both categories of mRNA structure, their effects on a mRNA's translation initiation rate are well-predicted by the latest version (v2.0) of a statistical thermodynamic model of translation initiation, nicknamed the ‘RBS Calculator’ (10,28).

However, a third category of mRNA structure overlaps with the protein coding region of the mRNA, and it remains unclear when and how such mRNA structures will inhibit translation. The diameter of the ribosome's Entry channel is ∼20 Å wide, and may only accommodate single-stranded RNA, therefore requiring unfolding of the portion of the mRNA that is fed into the channel (30,36). For the 30S ribosome to bind to the mRNA and form a 30SIC complex, the mRNA that spans its footprint must be entirely single-stranded. Therefore, it is expected that any mRNA structure that overlaps with the 30SIC footprint must be unfolded prior to translation initiation. Several studies have carried out in vitro measurements of the ribosome's footprint, finding that the ribosome extends to between 12 and 19 nucleotides into the mRNA's N-terminal coding region (37–39). In contrast, the in vivo characterization of combinatorial libraries of ribosome binding sites and protein coding sequences has indicated that mRNA structures could inhibit translation when located anywhere from –4 to +37 relative to the start codon (3,7,29). In addition, from an evolutionary conservation perspective, the first 5–10 codons (15–30 nucleotides) of a protein coding sequence are biased towards synonymous codons that minimize the formation of mRNA structures (4). More precise in vivo measurements of the ribosome's footprint are needed to accurately predict when an N-terminal mRNA structure will inhibit translation.

In this article, we systematically design a series of synthetic expression cassettes to precisely measure the in vivo ribosomal footprint length in Escherichia coli. We show that N-terminal mRNA structures can have a dramatic effect on the mRNA's translation rate, while quantifying the positional and thermodynamic rules that govern their translation repression. By incorporating these improved quantitative rules into our biophysical model of translation initiation, we greatly improve its ability to accurately predict mRNA translation rates across a range of structurally diverse protein coding sequences.

MATERIALS AND METHODS

Plasmid design and cloning

A series of pFTV1-derived plasmids (ColE1, Cm^R) (11) were designed and constructed to express a modified mRFP1 fluorescent protein reporter with the objective of introducing rationally designed modifications to the N-terminal region of mRFP1 to introduce specific, desired mRNA structures while preventing the formation of undesired, confounding mRNA structures in non-CDS regions. All plasmid variants utilized a σ⁷⁰ constitutive promoter (BioBrick #J23100) to control mRFP1 transcription together with a rationally designed ribosome binding site sequence that has the potential to support a high mRFP1 translation rate in the absence of inhibitory mRNA structures (the no-hairpin control). Specifically, we used the RBS Calculator v2.0 to design a ribosome binding site (RBS) sequence with a 5΄ XbaI restriction site, an upstream 6 base pair hairpin, and a 3΄ NdeI restriction site with a targeted translation rate of 30 000 au on the RBS Calculator's proportional scale. The resulting 5΄ untranslated region (5΄-TCTAGAACCCGCCATATACGGCGGGACACACACAAGGAGACCATATG-3΄) has an accessible standby site (ΔG_standby = 0.06 kcal/mol), a high-affinity Shine-Dalgarno sequence (ΔG_SD-antiSD = –8.68 kcal/mol), a three-nucleotide spacer region (ΔG_spacing = 1.52 kcal/mol), an AUG start codon (ΔG_start = –1.19 kcal/mol), and an overall ribosome binding free energy of ΔG_total = –8.3 kcal/mol, yielding a high predicted translation initiation rate of 30400 au. The purpose of the upstream hairpin is to prevent the formation of mRNA structures that sequester the Shine-Dalgarno sequence; the upstream hairpin itself does not overlap with the Shine-Dalgarno sequence and is not predicted to inhibit translation rate. Twenty seven N-terminal mRFP1 variants were designed to incorporate specific, desired mRNA structures, while preventing the formation of alternative mRNA structures. Each variant CDS sequence contains a 5΄ NdeI site and a 3΄ SacI site.

The starting pFTV plasmid contained a 5΄ XbaI upstream of the RBS and a 3΄ SacI site inside the N-terminal CDS. To construct the plasmids used in this study, the designed ribosome binding site was first inserted into pFTV1 by annealing two complementary oligonucleotides (Integrated DNA Technologies) with XbaI/SacI overhangs and ligating the insert with digested pFTV1 vector, creating an intermediate plasmid pFTV1a. The CDS variants were then inserted into pFTV1a by constructing DNA fragments with NdeI/SacI overhangs, either by annealing oligonucleotides or by PCR assembly and digestion, followed by ligation with digested pFTV1a. All plasmids were transformed into E. coli DH10B cells, followed by sequence verification of isolated clones. All sequences are presented in the Supplementary Data.

Strains, growth and characterization

All mRNA level and single-cell fluorescence measurements were performed on plasmid-harboring E. coli DH10B cells during long-time cultures, similar to a previous study (10). For each construct, isogenic colonies were used to inoculate overnight cultures in 700 ul LB-Miller media (10 g/l tryptone, 5 g/l yeast extract and 10 g/l NaCl) supplemented with 50 ug/ml Cm within a 96-well deep-well plate. To begin the characterization, 10 ul culture was diluted into 190 ul of fresh LB-Miller/Cm media using a 96-well microtiter plate, and incubated at 37°C with high orbital shaking inside a M1000 spectrophotometer (TECAN). OD₆₀₀ absorbances were recorded every 10 min until the OD₆₀₀ reached 0.15, indicating the cells were reaching the mid-exponential phase of growth. At this time, a second 96-well microtiter plate was inoculated by serial dilution using culture from the first plate and fresh LB-Miller/Cm media. In the same way, a third serial dilution was conducted using a third 96-well microtiter plate, yielding a total culture time of ∼24 h where cells are continuously maintained in the exponential phase of growth. For each culture, single-cell mRFP1 fluorescence measurements were performed by collecting 10 ul from the end of the second and third dilution, transferring to a microtiter plate with 200 ul PBS solution with 2 mg/ml kanamycin, and utilizing a Fortessa flow cytometer (BD Biosciences) to record 100 000 single-cell fluorescence levels. All single-cell fluorescence distributions were unimodal. The arithmetic mean of distributions is calculated, and the background autofluorescence of E. coli DH10B cells is subtracted. All reported fluorescence levels are the average of four measurements from cultures carried out on two separate days, and are listed in Supplementary Data.

mRNA level measurements were performed on selected strains by using overnight cultures to inoculate 5 ml Cm-supplemented LB media at 37°C with 300 RPM shaking. Cells were harvested once they reached the mid-late exponential phase of growth, determined by an OD₆₀₀ absorbance of 1.5–2.0, measured using a cuvette-based spectrophotometer (NanoDrop 2000C). The cultures’ total RNA was extracted using the Total RNA Purification kit (Norgen Biotek), followed by non-specific degradation of contaminant DNA using the Turbo DNAse kit (Ambion). Following extraction, cDNA was prepared using the High Capacity cDNA Reverse Transcription kit (Applied Biosystems). Taqman-based qPCR was performed using an ABI Step One real-time thermocycler (Applied Biosystems), utilizing a Taqman probe targeting a non-modified mRFP1 region (5΄-ACCTTCCATACGAACTTT-3΄), a forward primer (5΄-ACGTTATCAAAGAGTTC-3΄), and a reverse primer (5΄-CGATTTCGAACTCGTGACCGTTAA-3΄). Taqman-based RT-qPCR measurements were also performed on 16S rRNA as an endogenous control, and were used to calculate relative mRNA levels from ΔC_t numbers. The Minimum Information for Quantitative Real-Time PCR Experiments (MIQE) can be found in Supplementary Table S1.

A biophysical model of translation initiation

We previously developed a statistical thermodynamic model of ribosome–mRNA interactions (the RBS Calculator v2.0) to predict a mRNA's translation initiation rate (r) from its sequence by calculating the total change in Gibbs free energy (ΔG_total) when the 30S ribosomal subunit binds to a mRNA at a selected start codon (8,10,11). The ribosome's binding free energy is related to the mRNA's translation initiation rate according to Boltzmann's relationship,

(1)

where β is the apparent Boltzmann constant, which has been empirically measured to be 0.45 ± 0.05 mol/kcal (8). The total change in Gibbs free energy is calculated using the following multi-term free energy model,

(2)

where each term quantifies the strengths of the molecular interactions that control translation initiation in the initial state (free, folded mRNA) and in the after state (a 30S initiation complex) as shown in Figure 1A. ΔG_total is the amount of free energy (work) needed to transition the system from the initial to final state. Starting from the initial state, we calculate the amount of free energy needed to fully unfold the mRNA from its minimum-free-energy configuration (ΔG_mRNA). We then calculate the large amount of free energy that can be released when the 30S ribosomal subunit binds to the mRNA by identifying the mRNA structure and ribosome–mRNA interaction that minimizes the free energy of the final state (ΔG_mRNA–rRNA + ΔG_spacing + ΔG_standby + ΔG_start), where ΔG_standby quantifies the energy needed for the ribosome to bind to the standby site upstream of the Shine-Dalgarno sequence (ΔG_standby > 0), ΔG_mRNA–rRNA quantifies how much energy is released when the 16S rRNA hybridizes to the mRNA and the mRNA refolds into non-inhibitory structures (ΔG_mRNA–rRNA < 0), ΔG_spacing quantifies the energetic penalty for stretching or compressing the ribosome, due to non-optimal spacing between the Shine-Dalgarno and start codon (ΔG_spacing > 0), and ΔG_start quantifies the energy released when the tRNA hybridizes to the start codon. The free energy ΔG_mRNA–rRNA can be further broken into the following terms:

(3)

where ΔG_SD-antiSD is the free energy released when the last nine nucleotides of the 16S rRNA hybridize to the mRNA at the Shine-Dalgarno (SD) sequence, ΔG_pre-SD is the free energy released when the mRNA upstream of the SD folds into a non-inhibitory structure, and ΔG_{post-footprint} is the free energy released when the CDS portion of the mRNA beyond the ribosome's footprint region folds into a non-inhibitory structure. We previously developed experimentally validated models for calculating ΔG_standby and ΔG_spacing (10,11). All RNA folding energetics and RNA–RNA hybridization free energies are calculated using semi-empirical RNA free energy models (36,37), provided by the Vienna RNA suite (version 1.8.5) (40). Overall, the biophysical model's predictions have been experimentally validated by characterizing 495 mRNAs with diverse sequences and structures in several Gram-positive and Gram-negative bacterial hosts (9–11,15,41), showing that the thermodynamic model can predict 57% of the mRNAs’ translation rates to within 2-fold, and 83% of the mRNAs’ translation rates to within 5-fold, across a 100 000 proportional scale. A key assumption of the thermodynamic model is that there is sufficient time for the mRNA to refold during cycles of translation initiation. In a recent study, we show that RNA folding kinetics can have a significant effect on a mRNA's translation rate according to a Ribosome Drafting mechanism (8). Here, we use Kinfold to carry out kinetic RNA folding simulations (42) as part of the rational design process to ensure that all studied mRNA structures will rapidly fold to their minimum-free-energy configurations, which will minimize the Ribosome Drafting effect. The biophysical model calculations, Kinfold-calculated RNA folding times, and translation rate predictions for all studied mRNAs at a range of ribosomal footprint lengths are included in Supplementary Data.

Figure 1. — Translation initiation rate is controlled by the overlap between the ribosomal footprint and N-terminal mRNA structures. (A) A biophysical model uses a mRNA's sequence to calculate the ribosome's total binding free energy change, which is then related to the mRNA's translation initiation rate. mRNA structures that overlap with the standby site, with the Shine-Dalgarno sequence, and with the coding sequence have different effects on the ribosome's binding free energy. (B) A baseline no-hairpin control mRNA contains a highly accessible standby site, an optimized Shine-Dalgarno sequence, and an AC-rich CDS fusion that translates a *mRFP1* reporter at a very high rate, while enabling the insertion of new hairpin-forming CDS fusion sequences without the formation of undesired mRNA structures. A schematic of the resulting base-line plasmid is shown. (C) Hairpin-forming sequences were inserted into the N-terminal CDS of *mRFP1* at varying distances from the start codon to determine where mRNA hairpins must be unfolded because they overlap with the ribosomal footprint.

RESULTS

Design of expression systems to measure the effect of mRNA hairpins in N-terminal coding sequences

We applied a learn-by-design approach to investigate how the location of a stable mRNA hairpin inside the N-terminal of a coding sequence influences a mRNA's translation initiation rate. Overall, we expected that mRNA hairpins located inside the ribosomal footprint would inhibit the mRNA's translation rate. More specifically, our sequences were designed to precisely measure the length of the ribosomal footprint in an in vivo physiological environment, and to quantify the magnitude of translation repression according to the mRNA hairpins’ locations and folding free energies.

First, we designed, constructed, and characterized a baseline mRNA sequence that enabled us to measure the effect of hairpin-forming sequences within the mRFP1 N-terminal coding sequence, while avoiding the formation of undesired mRNA structures and ensuring that the ribosome's translation elongation rate does not become a rate-limiting step in the translation process (Figure 1B). The baseline mRNA sequence, called our ‘no hairpin’ control, contains a stable mRNA hairpin upstream of the Shine-Dalgarno sequence, an optimized Shine-Dalgarno sequence that supports a very high translation initiation rate of mRFP1 (30400 au on the RBS Calculator v2.0 scale), and a 39 nucleotide AC-rich in-frame insertion into the mRFP1 coding sequence. To accelerate translation elongation, the AC-rich CDS fusion utilized codons that are predominantly found in natural, highly translated E. coli coding sequences, called ‘fast codons’. To minimize any ribosomal pausing, the AC-rich CDS fusion does not contain any Shine-Dalgarno-like sequences. We applied flow cytometry and RT-qPCR to measure the mRFP1 fluorescence and mRNA level of the no-hairpin control in E. coli DH10B cells to confirm that it indeed expressed the mRFP1 reporter at a very high level (Figure 2A).

Figure 2. — Translation initiation is inhibited based on the position of N-terminal mRNA structures. Measured mRFP1 fluorescence levels when either (**A, C**) short mRNA hairpins or (**B, D**) long mRNA hairpins were inserted into the mRFP1 coding sequence at designated positions. (A, B) Measured single-cell fluorescence distributions from mRNAs with N-terminal RNA hairpins at designated positions, compared to the no-hairpin mRNA control. BG, background autofluorescence. (C, D) Average fluorescence levels from mRNAs with N-terminal hairpins at labeled positions. The mRFP1 fluorescence level of the no-hairpin control is denoted by a dashed line. (E, F) The measured mRNA levels of selected expression systems with either (E) short hairpins or (F) long hairpins, normalized with respect to the measured mRNA level of the no-hairpin control. Fluorescence data points and error bars are the average and standard deviation of four cytometry measurements on two separate days. mRNA level data points and error bars are the average and standard deviation of two to eight measurements on up to three separate days. All measurements are listed in Supplementary Data.

We then designed, constructed, and characterized sets of mRNAs where we inserted hairpin-forming sequences into the mRFP1 N-terminal CDS, systematically varying the location of their 5΄ end from +4 to +40 nucleotides after the start codon (Figure 1C). The first set of sequences formed short mRNA hairpins with a 6 bp duplexed stem. To create in-frame CDS fusions, the hairpins’ loop lengths were varied from six to eight nucleotides. Accordingly, when they are inserted into the N-terminal CDS region at varying positions, their calculated folding free energies ranged from –9.5 to –11 kcal/mol. The second set of sequences formed long mRNA hairpins with a 11 bp bulged stem and six to eight nt loop lengths. Their calculated folding free energies ranged from –17.2 to –18.6 kcal/mol when inserted at varying positions within the N-terminal mRFP1 CDS region. When designing these hairpin-forming sequences, we minimized the introduction of confounding variables, such as changes in translation elongation rates and mRNA stability, by ensuring that all hairpin-forming sequences utilized fast codons, and by preventing the appearance of any mRNA duplexes above 8 base pairs to minimize mRNA degradation (43). For each mRNA, we also carried out 1000 kinetic RNA folding simulations using Kinfold (42), and calculated their average folding time, to ensure that all hairpin-forming sequences could rapidly fold to their minimum-free-energy structure. We then characterized the steady-state mRFP1 fluorescence levels and mRNA levels during long-time E. coli DH10B cultures maintained in the exponential growth phase to determine how each N-terminal hairpin-forming sequence affected the mRFP1 translation initiation rate and mRNA stability (Materials and Methods).

mRNA hairpins that overlap with the ribosomal footprint substantially inhibit translation initiation

Characterization of the designed mRNAs revealed an exceedingly clear and quantitative relationship between the locations of the mRNA hairpins and mRFP1 expression levels (Figure 2A). In our designed no-hairpin control mRNA that lacks any inhibitory mRNA structures, the mRFP1 fluorescence level was very high (62027.2 ± 6971.4 au) (Figure 2C and D, dashed lines). However, when a short mRNA hairpin was introduced downstream of the start codon (position +4), the mRFP1 fluorescence level dropped by 2720.2-fold to 22.8 ± 12 au (Figure 2C, bars) coincident with a 5.1-fold drop in mRNA level (Figure 2E, bars), equivalent to a 530-fold decrease in translation rate. Similar magnitudes of translation repression were observed when short mRNA hairpins were positioned at +7, +8, +9 and +10 nucleotides after the start codon. This pattern of translation repression suggests that the ribosome (the 30SIC) is forced to unfold the entirety of these mRNA hairpins in order to insert the mRNA into its entry channel, and that the ribosomal footprint extends at least 10 nucleotides into the N-terminal coding sequence.

Next, as the short mRNA hairpin's position was moved further downstream from the start codon, the magnitude of translation repression smoothly diminished; at positions +11, +12 and +13, the mRFP1 translation rates were repressed by only 122.4-fold, 5.0-fold and 4.1-fold, respectively, compared to the no-hairpin control (Figure 2C). Because the mRNA's translation rate underwent a smooth transition from full to partial repression, these measurements suggest that, when a mRNA hairpin only partially overlaps with the ribosomal footprint, the ribosome need only partially unfold the mRNA hairpin. Below, we will detail calculations that explain how partial unfolding of mRNA hairpins requires less free energy, for example, when a portion of the mRNA hairpin can refold into a less energetic structure.

Finally, when short mRNA hairpins were positioned from +14 to +40, the mRNAs’ translation rates were very similar to the no-hairpin control's translation rate with only minor changes, for example, a 2.1-fold decrease at position +40 and a 2.7-fold increase at position +22. Importantly, these differences in translation rate were not statistically significant (t-test, N = 8, p = 0.227). These measurements show that when a mRNA hairpin does not overlap with the ribosomal footprint, the 30SIC does not need to unfold the hairpin to initiate translation. Overall, we did not observe any significant repression of either translation initiation or translation elongation when short mRNA hairpins were inserted at and beyond position +14.

We next investigated whether the insertion of longer, more stable mRNA hairpins into the N-terminal CDS region would similarly influence the mRNA's translation initiation rate. The measured mRFP1 fluorescence levels and mRNA levels from nine mRNAs with longer mRNA hairpins, systematically inserted at positions +8 to +22 downstream of the start codon, revealed a similar pattern of translation repression and mRNA level changes (Figure 2BD and F). A long mRNA hairpin inserted at position +8 dropped the mRFP1 fluorescence level by 1720.0-fold with a coincident 10.6-fold drop in mRNA level, compared to the no-hairpin control mRNA, yielding a translation repression of 161.1-fold. When the long mRNA hairpins were positioned further downstream, the amount of translation repression decreased relatively smoothly until, at position +16, the translation rate was no longer repressed, and was similar to the translation rate of the no hairpin control. Similar to the previous set of short mRNA hairpins, when long mRNA hairpins were inserted from +16 to +22, they did not cause a statistically significant amount of translation repression, compared to the no-hairpin mRNA control (t-test, N = 3, p = 0.1972).

We also investigated whether modifications to the mRFP1 reporter's N-terminal amino acid sequence could explain the large changes in translation repression, according to three known mechanisms. First, our rational design approach ensured that only fast codons were utilized to construct mRFP1 variants with inserted hairpin structures to avoid the introduction of translation elongation as a rate-limiting step to mRFP1 expression. Indeed, the insertion of short or long mRNA hairpins into positions past position +16 did not appreciably repress the translation of mRFP1 mRNA, indicating that elongating ribosomes were not hindered by the requirement to unfold these mRNA structures during GTP hydrolysis-mediated translocation. Second, the N-end rule is a proteolytic mechanism whereby a protein's N-terminal amino acid residue can lower its half-life (44). In E. coli, N-end rule degradation requires enzymatic removal of the initiating formyl methionine (e.g. by LFTR), ClpS-mediated recognition of a destabilizing amino acid residue now at the N-terminus, and subsequent proteolysis by ClpAP (45). With our rational design approach, we ensured that the second amino acid position in all mRFP1 N-terminal sequences, except for one, contained a lysine, which is considered a destabilizing residue (Supplementary Table S2). All mRFP1 variants with lysine at the second position would have equivalent protein degradation rates. In the one exception, the mRNA with a short hairpin inserted at position +4 expressed mRFP1 with glycine in the second position, which is considered a stabilizing residue. Characterization of this mRNA showed a 2720.2-fold decrease in mRFP1 fluorescence, which is the opposite of a stabilization effect. Overall, the presence of accelerated protein degradation according to the N-end rule can not explain our expression measurements. Third, electrostatic interactions between positively charged amino acid residues and the proteins within the ribosomal exit tunnel have been shown to potentially slow down translation elongation (46) and alter protein folding rates (47). The number of arginines and lysines in the N-terminus of the expressed mRFP1 variants varied from 2 to 5 (Supplementary Table S3) and there was no statistically significant relationship between these counts and the mRFP1 variants’ fluorescence levels (p = 0.66). Moreover, there was no statistically significant relationship considering all charged amino acid residues (p = 0.45). Overall, the very large changes in mRFP1 fluorescence levels could not be explained by the three most predominant mechanisms whereby changes to the mRFP1 N-terminus could alter its rate of translation elongation, protein folding, or protein degradation.

Incorporating mRNA level measurements to determine changes in translation rate

Based on our mRFP1 fluorescence and mRNA level measurements, there was a high degree of coupling between a mRNA's translation rate and its stability (Figure 2). Whenever the mRNA's translation is greatly repressed by an N-terminally located hairpin, the mRNA's level was substantially reduced. In contrast, when either short or long mRNA hairpins were inserted +14 or +16 nucleotides downstream of the start codon, the mRNA's level remained relatively unchanged. These two observations provide support for a previously proposed mechanism whereby actively translated mRNAs, bound by a sufficiently high density of ribosomes, become protected from RNAse activity (48). Quantitative characterization of this relationship remains a topic for a future study. Here, we focused on identifying how the mRNAs’ translation initiation rates were precisely controlled by the positions of the mRNA hairpins and their folding free energies. We calculated the mRNAs’ actual translation rates by dividing the mRFP1 fluorescence measurements by their corresponding measured mRNA levels (Figure 3A and B, red circles and lines), providing a clear relationship between the mRNA hairpins’ positions and translation rate changes. By comparison to the no-hairpin control, we also precisely quantified the apparent amount of work (the free energy change) that the ribosome exerted to unfold the inserted mRNA hairpins (Figure 4C).

Figure 3. — Determination of the Ribosome Footprint Length. Biophysical model predictions using wide range of footprint lengths (0–35 nt) are compared to the measured translation rates (red lines, dots) when either (A) a short 6 bp hairpin or (B) a long 11 bp hairpin are located at designated positions in mRFP1's N-terminal coding section. Model predictions using ribosome footprint lengths of 12, 13 and 14 nt are highlighted (green, black, and blue lines/dots, respectively). (C) A footprint length of 13 nt minimizes the average model error (average ΔΔG_total) across 17 characterized mRNAs shown in parts A and B. (D) Incorporation of the ribosome footprint into biophysical model predictions increases its accuracy for 495 previously characterized mRNAs (R² = 0.66, p = 2.5 × 10⁻¹¹⁶ and R² = 0.78, p = 6.8 × 10⁻¹⁶⁴ for footprint length of 0 and 13 nt, respectively).

Figure 4. — RNA hairpin folding free energy controls translation inhibition. (A) Two mRNA hairpins were inserted at position +13 with different duplex sequences and folding free energies, but similar hairpin sizes. (B) Fluorescence and mRNA level characterization revealed that the mRNA with the more stable hairpin had a lower translation rate, compared to the mRNA with the less stable hairpin. The horizontal dashed line shows the fluorescence and mRNA level for the no-hairpin control mRNA. (C) The apparent amount of free energy needed to unfold N-terminal mRNA hairpins is quantitatively similar to the model-calculated free energy of hairpin unfolding, using a ribosomal footprint of 13 nucleotides (R² = 0.77, p = 1.5 × 10⁻⁶). Yellow and blue circles are the short and long RNA hairpins, respectively. (D and E) The folding free energies of the mRNA regions spanning either (D) from –4 to +37 or (E) from the transcriptional start site (TSS) to +30 correlate poorly with their measured fluorescence levels (R² = 0.24, p = 0.01; and R² = 0.41, p = 3.1 × 10⁻⁴, respectively). +1 is the start codon. (F) The relative changes in the mRNA's expression level (fluorescence level) did not correlate to the differences in folding free energy for the mRNA region spanning from TSS to +96, compared to the no-hairpin control mRNA (R² = 0.06, p = 0.23). Parts D, E, and F show the measured fluorescence levels for all 27 characterized mRNAs in this study.

Precise quantification of the in vivo ribosomal footprint length using biophysical modeling

Next, by comparing these apparent measurements to our biophysical model's predictions, we may determine the length of the ribosomal footprint and the characteristics of the mRNA hairpins’ that control the mRNAs’ translation rate. According to our previously developed biophysical model of translation initiation (Materials and Methods), the length of the ribosomal footprint will have a direct effect on a mRNA's translation initiation according to a single thermodynamic principle: when the ribosome binds to the mRNA and occupies its lowest free energy state, the portion of the mRNA covered by the ribosome's footprint must remain unstructured. This principle is implemented by incorporating a single structural constraint for all model predictions. For example, when a mRNA hairpin fully overlaps with the ribosomal footprint, the model determines that the mRNA hairpin must be fully unfolded by the ribosome, resulting in a reduction in the predicted translation initiation rate. In contrast, when a mRNA hairpin does not overlap with the ribosomal footprint, the model determines that the hairpin remains folded in the ribosome-mRNA's final state, resulting in no change in the predicted translation initiation rate. In between these two cases, when a portion of the mRNA hairpin overlaps with the ribosomal footprint, its effect on the mRNA's translation initiation rate will depend on the amount of overlap and the possibility of another, less stable mRNA structure forming within the downstream region that is not covered by the ribosome's footprint.

Specifically, the model calculates the minimum-free-energy of a mRNA region downstream of the start codon, subject to a structural constraint that prevents the formation of mRNA secondary structures within the length of the ribosomal footprint, but allows the refolding of structures outside the ribosomal footprint region. The free energy calculation is designated ΔG_{post-footprint} and is controlled by a single parameter, the ribosomal footprint length. The value of ΔG_{post-footprint} is used to determine ΔG_mRNA–rRNA, ΔG_total and the mRNA's predicted translation initiation rate according to Equations (1)–(3). In all model calculations, we do not input the entire mRNA sequence, but instead input the 5΄ UTR and a sufficiently large portion of the CDS sequence (at least 75 nucleotides).

We then utilized the developed biophysical model to predict the mRNAs’ translation initiation rates at a range of possible ribosomal footprint lengths to determine the in vivo apparent ribosomal footprint length that best reflects our translation rate measurements (Figure 3). We show the mRNAs’ predicted translation initiation rates at a range of putative ribosomal footprint lengths from 0 to 35 nucleotides long and compare these predictions to the mRNAs’ measured translation rates (Figure 3A and B, red circles). We carried out this comparison for (Figure 3A) nine mRNAs with short hairpins and (Figure 3B) seven mRNAs with long hairpins, where in both cases, we had obtained fluorescence and mRNA level measurements to precisely measure their translation rates. These comparisons sharply identify the in vivo ribosomal footprint length to be 12 or 13 nucleotides long for the short hairpin mRNAs and 13 or 14 nucleotides long for the long hairpin mRNAs. Importantly, in both cases, the model correctly predicts the mRNA's translation rates as the short or long mRNA hairpins’ locations are incrementally shifted. Starting with a hairpin fully overlapping with the ribosomal footprint, and repressing translation, the model is able to predict the sigmoidal increase in translation rate as the mRNA hairpins are shifted into a position that only partially overlaps with the ribosomal footprint. Further shifting of the mRNA hairpin relieves all repression of translation, which is well-predicted by the model.

To further determine the in vivo ribosomal footprint length, we combined the 16 characterized mRNAs with long and short hairpins into a single data-set and compared their measured translation rates to the model's predictions at a range of ribosomal footprint lengths. For each mRNA and ribosomal footprint length, we determined the error in the model's prediction (ΔΔG_total) by first using the measured translation rates to determine the apparent change in binding free energy when the ribosome bound to the mRNA (ΔG_{total,apparent}), and subtracting the model-calculated ΔG_{total,predicted}. We then averaged the absolute value of ΔΔG_total across the 16 characterized mRNAs to show the relationship between this average model error versus ribosome footprint length (Figure 3C). Coincident with our visual qualitative analysis in Figure 3A and B, we found that the model error reached a global minimum of about 2 kcal/mol when the ribosomal footprint was 13 nucleotides long.

Next, we further examined the accuracy of our identified ribosome footprint length on a much larger data-set of 495 previously characterized mRNA sequences expressing different protein reporters (9–11,15,41). In contrast to the carefully designed mRNA sequences in this study, the large data-set of mRNAs have dissimilar 5΄ UTR and protein coding sequences, and their N-terminal coding regions contained mRNA structures with diverse shapes, energies, and positions, providing a stringent test case for the identified ribosome footprint length. We applied the model to predict the translation initiation rates of the 495 mRNAs, using ribosomal footprint lengths of either 0 or 13 nucleotides (Supplementary Data), and compared model predictions with their measured expression levels. Here, in the absence of mRNA level measurements, we assumed that the 495 mRNAs’ translation rates were proportional to their measured reporter expression levels. We found that utilizing the correct ribosome footprint length of 13 nucleotides greatly increased the accuracy of the biophysical model; the average error in the ΔG_total calculation dropped from 6.46 to 2.06 kcal/mol, and consequently increased the accuracy of predicted translation initiation rates (R² = 0.66, p = 2.5 × 10⁻¹¹⁶ using a footprint length of 0 nt, compared to R² = 0.78, p = 6.8 × 10⁻¹⁶⁴ using a footprint length of 13 nt) (Figure 3D). This analysis shows that a correct measurement of the ribosome's footprint length is important to accurately predicting the translation initiation rates of a collection of mRNAs with diverse sequences and structures.

N-terminal RNA hairpins inhibit translation initiation according to their folding free energy

We next investigated the relationship between the RNA structures’ folding free energies and the mRNAs’ translation initiation rates, and how the specific details of the RNA folding calculation will influence this relationship. Several previous studies have observed that reducing the thermodynamic stability of mRNA structures near the beginning of a coding sequence will have the overall qualitative effect of increasing a mRNA's translation rate (3,7,29), though in each study, the thermodynamic calculations were performed on different mRNA regions arriving at different types of correlations. For example, in Kudla et al., the highest correlation between the RNA folding free energy and measured fluorescence levels was observed when the folding calculation was performed specifically on the mRNA region spanning –4 to +37 (+1 is the start codon beginning). Similar analyses were performed in Kosuri et al. and Goodman et al., where they found that the folding free energies of the mRNA regions spanning TSS to +30 or TSS to +96, respectively, had the highest correlation to the mRNAs’ translation rates (TSS is the transcriptional start site and +1 is the start codon beginning). Importantly, the characterized mRNAs in Kudla et al. and Goodman et al. had variable N-terminal coding sequences and constant 5΄ untranslated regions, which are similar to the designed mRNAs characterized in this study. However, based on our measurements here, translation initiation is only inhibited by the presence of an N-terminal mRNA structure when it overlaps with the ribosomal footprint, which is located from +1 to +13. Performing a folding calculation on a much longer mRNA region, which includes both inhibitory and non-inhibitory mRNA structures, may not yield a reproducible correlative relationship on different sets of mRNAs. Therefore, we revisited how these various approaches to the folding calculation affected the observed relationship between RNA folding free energy and measured translation rate.

We first tested the sensitivity of the relationship between RNA unfolding free energy and mRNA translation rate, considering only inhibitory RNA hairpins. To do this, we designed and characterized a new mRNA variant that contains a slightly weakened RNA hairpin positioned exactly at +13 (Figure 4A). We swapped the terminal dinucleotide duplex of the hairpin from GG:CC to UA:AU, changing the hairpin's unfolding free energy from 11 to 7.3 kcal/mol, and thereby making it easier for the ribosome to unfold the dinucleotide overlapping with the ribosomal footprint at position +13. We then measured the fluorescence and mRNA level of the modified mRFP1-expressing mRNA, and found that the mRFP1 reporter expression level increased by 9.5-fold, while the mRNA level marginally increased by 1.5-fold, resulting in a 6.2-fold increase in measured translation rate (Figure 4B). Consistent with this measurement, the biophysical model predicts that weakening the RNA hairpin should have increased the mRNA's translation rate by 5.3-fold. This data provides an additional demonstrative example showing that, when a mRNA structure overlaps with the ribosomal footprint, its unfolding free energy has a large effect on a mRNA's translation rate.

Next, we tested the quantitative relationship between the apparent and predicted folding free energies of inhibitory mRNA structures. We utilized our collection of characterized mRNAs that contain long or short hairpins that fully overlap with the ribosomal footprint and therefore must be fully unfolded by the ribosome to initiate translation. From the translation rate measurements, we calculated the apparent amount of free energy needed to unfold the inhibitory hairpins, using the following equation:

(4)

where X is the measured translation rate of a mRNA with an N-terminal hairpin that overlaps with the ribosomal footprint, and X_ref is the measured translation rate of the no-hairpin control mRNA that does not have any N-terminal inhibitory hairpins. The constant β has been empirically determined to be 0.45 mol/kcal. In Figure 4C, we compared the hairpins’ apparent unfolding free energies to the model's predictions, and found quantitative agreement (R² = 0.77, p = 1.5 × 10⁻⁶), demonstrating that the inhibitory hairpins’ unfolding free energies controlled the mRNAs’ translation rates.

In contrast, when the RNA folding calculation includes both inhibitory and non-inhibitory mRNA structures, we do not observe a quantitative relationship between the mRNA's folding free energy and the mRNA's translation rate. The previously proposed RNA folding calculations, spanning –4 to +37, TSS to +30, or TSS to +96, did not result in clear quantitative relationships (R² = 0.24 for [–4, +37], R² = 0.41 for [TSS, +30], and R² = 0.06 for [TSS, +96]) (Figure 4DE and F). The RNA folding calculations from –4 to +37 or from TSS to +30 will often exclude N-terminal mRNA structures that partially overlap with the ribosomal footprint, while the calculation from TSS to +96 will include both inhibitory and non-inhibitory mRNA structures. These results show that utilizing any constant cutoff in the mRNA folding calculation may exclude the mRNA structures that inhibit translation, or include mRNA structures that have no effect on translation. Instead, the correct RNA folding calculation should only consider mRNA structures that overlap or partially overlap with the ribosomal footprint.

DISCUSSION

In this work, we applied a reductive learn-by-design approach to precisely determine the N-terminal mRNA structures that need to be unfolded by the bacterial ribosome during translation initiation. We designed 27 mRNAs with different N-terminal coding sequences, systematically varying the positioning and energetics of their structures, followed by characterization of their translation rates in E. coli by combining fluorescent protein and mRNA level measurements (Figure 1). We found that protein expression levels were repressed, by up to 2720-fold, when short or long mRNA structures overlapped with the in vivo ribosomal footprint (Figure 2CD). When mRNA level measurements were taken into account, the mRNAs’ apparent translation rates were repressed by up to 530-fold under the same conditions (Figure 2EF). In contrast, when the mRNA structure was located outside the ribosome's footprint, protein expression and translation rate were repressed by <2-fold. By combining our measurements with biophysical modeling, we precisely determined that the ribosomal footprint extends 13 nucleotides past the start codon (Figure 3A–C). By utilizing this improved quantification of the ribosomal footprint length, we showed that our biophysical model could more accurately predict the translation rates of a collection of 495 characterized mRNAs with diverse sequences and structures (Figure 3D). Finally, we determined that the folding energetics of the N-terminal mRNA structures control the mRNAs’ translation rates, but only when the N-terminal mRNA structure overlaps with the ribosomal footprint (Figure 4A–C). Our maximally informative measurements and biophysical modeling calculations are an improvement over previous ‘big data’ studies where correlations between translation rate and various RNA folding energy calculations were observed, but not tested for mechanistic causality (Figure 4D–F).

The measured value of the in vivo ribosomal footprint length is consistent with previous in vitro studies that have utilized a variety of different techniques (37–39). First, Hüttenhofer and Noller applied chemical footprinting on in vitro mRNA-30S ribosome complexes to measure the extent of protection from hydrolysis, and found that the ribosomal footprint extended to +19 nucleotides past the start codon when initiator tRNA^fMet was added, but was shortened to +5 nucleotides when tRNA^fMet was absent (39). Interestingly, inserting a stable mRNA structure from position +10 to +21 resulted in a loss of protection past +5 within the ribosome-mRNA complex. Second, by adding regulatory small RNAs that bind to the N-terminal coding sequence, Bouvier et al. demonstrated that small RNAs can repress translation initiation if their binding site overlaps with the ribosomal footprint, which was measured to be 14 ± 2 nucleotides, depending on the small RNA, referred more generally as the first five codon rule (38). By conducting in vitro toeprinting of ribosome-mRNA complexes with antisense DNA oligos, they also found that 30S ribosome–mRNA complex formation was strongly inhibited when DNA oligos covered the mRNA at or before position +12. Third, by applying optical tweezers to monitor the ribosome-catalyzed unfolding of mRNA structures, Qu et al. found that the ribosomal footprint was 12 ± 2 nucleotides, and that during translation elongation, mRNA structures were unwound through a combination of biased thermal ratcheting and mechanical opening (37). Interestingly, the ribosome's ability to mechanically open mRNA structures, utilizing GTP hydrolysis, ensures a minimum rate of translation elongation regardless of the mRNA structure's stability. However, prior to translation initiation, GTP hydrolysis does not take place and therefore the ribosome relies on biased thermal ratcheting to unwind mRNA structures. This distinction explains why the folding free energies of mRNA structures within the N-terminal coding sequence have a significant effect on a mRNA's translation initiation rate, but not its translation elongation rate.

Overall, our integrated computational design and experimental approach enabled us to elucidate and quantify the physical rules that govern when the ribosome unfolds N-terminal mRNA structures inside cells and how their unfolding energetics controls the mRNA's translation initiation rate. The quantification of these rules improved our ability to predict a mRNA's translation initiation rate according to its sequence, and thereby accelerates the rational design of mRNAs, riboswitches, and other post-transcriptional regulatory elements that manipulate translation initiation rates for useful purposes (15,25,41).

Supplementary Material

Supplementary Data

Click here for additional data file.^{(622.7KB, zip)}

ACKNOWLEDGEMENTS

Contributions: A.E.B. and H.M.S designed the study, analyzed results, and wrote the manuscript. A.E.B. and H.M.S developed the model. A.E.B., D.P.C. I.F., A.S. and N.L. conducted the experiments.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Air Force Office of Scientific Research [FA9550-14-1-0089]; Office of Naval Research [N00014-13-1-0074]; NSF Career Award [CBET-1253641 to H.M.S.]. Funding for open access charge: Air Force Office of Scientific Research.

Conflict of interest statement. None declared.

REFERENCES

1. Gorochowski T.E., Ignatova Z., Bovenberg R.A., Roubos J.A.. Trade-offs between tRNA abundance and mRNA secondary structure support smoothing of translation elongation rate. Nucleic Acids Res. 2015; 43:3022–3032. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Duval M., Simonetti A., Caldelari I., Marzi S.. Multiple ways to regulate translation initiation in bacteria: Mechanisms, regulatory circuits, dynamics. Biochimie. 2015; 114:18–29. [DOI] [PubMed] [Google Scholar]
3. Goodman D.B., Church G.M., Kosuri S.. Causes and effects of N-terminal codon bias in bacterial genes. Science. 2013; 342:475–479. [DOI] [PubMed] [Google Scholar]
4. Bentele K., Saffert P., Rauscher R., Ignatova Z., Blüthgen N.. Efficient translation initiation dictates codon usage at gene start. Mol. Syst. Biol. 2013; 9:675. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Gingold H., Pilpel Y.. Determinants of translation efficiency and accuracy. Mol. Syst. Biol. 2011; 7:481. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Tuller T., Waldman Y.Y., Kupiec M., Ruppin E.. Translation efficiency is determined by both codon bias and folding energy. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:3645–3650. [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Kudla G., Murray A.W., Tollervey D., Plotkin J.B.. Coding-sequence determinants of gene expression in Escherichia coli. science. 2009; 324:255–258. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Espah Borujeni A., Salis H.M.. Translation initiation is controlled by RNA folding kinetics via a ribosome drafting mechanism. J. Am. Chem. Soc. 2016; 138:7016–7023. [DOI] [PubMed] [Google Scholar]
9. Tian T., Salis H.M.. A predictive biophysical model of translational coupling to coordinate and control protein expression in bacterial operons. Nucleic Acids Res. 2015; 43:7137–7151. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Espah Borujeni A., Channarasappa A.S., Salis H.M.. Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites. Nucleic Acids Res. 2013; 42:2646–2659. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Salis H.M., Mirsky E.A., Voigt C.A.. Automated design of synthetic ribosome binding sites to control protein expression. Nat. Biotechnol. 2009; 27:946–950. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Reeve B., Hargest T., Gilbert C., Ellis T.. Predicting translation initiation rates for designing synthetic biology. Front. Bioeng. Biotechnol. 2014; 2:1. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Aghdam E.M., Hejazi M.S., Barzegar A.. Riboswitches: From living biosensors to novel targets of antibiotics. Gene. 2016; 592:244–259. [DOI] [PubMed] [Google Scholar]
14. McKeague M., Wong R.S., Smolke C.D.. Opportunities in the design and application of RNA for gene expression control. Nucleic Acids Res. 2016; 44:2987–2999. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Espah Borujeni A., Mishler D.M., Wang J., Huso W., Salis H.M.. Automated physics-based design of synthetic riboswitches from diverse RNA aptamers. Nucleic Acids Res. 2015; gkv1289. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Trinh C.T., Mendoza B.. Modular cell design for rapid, efficient strain engineering toward industrialization of biology. Curr. Opin. Chem. Eng. 2016; 14:18–25. [Google Scholar]
17. Garcia-Ruiz E., HamediRad M., Zhao H.. Pathway design, engineering, and optimization. Adv. Biochem. Eng. Biotechnol. 2016; doi:10.1007/10_2016_12 [DOI] [PubMed] [Google Scholar]
18. Zhou H., Vonk B., Roubos J.A., Bovenberg R.A., Voigt C.A.. Algorithmic co-optimization of genetic constructs and growth conditions: application to 6-ACA, a potential nylon-6 precursor. Nucleic Acids Res. 2015; 43:10560–10570. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Freestone T.S., Zhao H.. Combinatorial pathway engineering for optimized production of the anti‐malarial FR900098. Biotechnol. Bioeng. 2016; 113:384–392. [DOI] [PubMed] [Google Scholar]
20. Zeitoun R.I., Garst A.D., Degen G.D., Pines G., Mansell T.J., Glebes T.Y., Boyle N.R., Gill R.T.. Multiplexed tracking of combinatorial genomic mutations in engineered cell populations. Nat. Biotechnol. 2015; 33:631–637. [DOI] [PubMed] [Google Scholar]
21. Carbonell P., Currin A., Jervis A.J., Rattray N.J., Swainston N., Yan C., Takano E., Breitling R.. Bioinformatics for the synthetic biology of natural products: integrating across the Design–Build–Test cycle. Nat. Product Rep. 2016; 33:925–932. [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Coussement P., Maertens J., Beauprez J., Van Bellegem W., De Mey M.. One step DNA assembly for combinatorial metabolic engineering. Metab. Eng. 2014; 23:70–77. [DOI] [PubMed] [Google Scholar]
23. Jones J.A., Toparlak Ö.D., Koffas M.A.. Metabolic pathway balancing and its role in the production of biofuels and chemicals. Curr. Opin. Biotechnol. 2015; 33:52–59. [DOI] [PubMed] [Google Scholar]
24. Ng C.Y., Farasat I., Maranas C.D., Salis H.M.. Rational design of a synthetic Entner–Doudoroff pathway for improved and controllable NADPH regeneration. Metab. Eng. 2015; 29:86–96. [DOI] [PubMed] [Google Scholar]
25. Chappell J., Watters K.E., Takahashi M.K., Lucks J.B.. A renaissance in RNA synthetic biology: new mechanisms, applications and tools for the future. Curr. Opin. Chem. Biol. 2015; 28:47–56. [DOI] [PubMed] [Google Scholar]
26. Yang L., Nielsen A.A., Fernandez-Rodriguez J., McClune C.J., Laub M.T., Lu T.K., Voigt C.A.. Permanent genetic memory with>1-byte capacity. Nat. Methods. 2014; 11:1261–1266. [DOI] [PMC free article] [PubMed] [Google Scholar]
27. Brophy J.A., Voigt C.A.. Principles of genetic circuit design. Nat. Methods. 2014; 11:508–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Salis H.M. The ribosome binding site calculator. Methods Enzymol. 2011; 498:19–42. [DOI] [PubMed] [Google Scholar]
29. Kosuri S., Goodman D.B., Cambray G., Mutalik V.K., Gao Y., Arkin A.P., Endy D., Church G.M.. Composability of regulatory sequences controlling transcription and translation in Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:14024–14029. [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Simonetti A., Marzi S., Jenner L., Myasnikov A., Romby P., Yusupova G., Klaholz B., Yusupov M.. A structural view of translation initiation in bacteria. Cell. Mol. Life Sci. 2009; 66:423–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Yusupova G.Z., Yusupov M.M., Cate J., Noller H.F.. The path of messenger RNA through the ribosome. Cell. 2001; 106:233–241. [DOI] [PubMed] [Google Scholar]
32. Wallin G., Kamerlin S.C., Åqvist J.. Energetics of activation of GTP hydrolysis on the ribosome. Nat. Commun. 2013; 4:1733. [DOI] [PubMed] [Google Scholar]
33. de Smit M.H., van Duin J.. Translational initiation on structured messengers: another role for the Shine-Dalgarno interaction. J. Mol. Biol. 1994; 235:173–184. [DOI] [PubMed] [Google Scholar]
34. de Smit M.H., van Duin J.. Translational standby sites: how ribosomes may deal with the rapid folding kinetics of mRNA. J. Mol. Biol. 2003; 331:737–743. [DOI] [PubMed] [Google Scholar]
35. Marzi S., Myasnikov A.G., Serganov A., Ehresmann C., Romby P., Yusupov M., Klaholz B.P.. Structured mRNAs regulate translation initiation by binding to the platform of the ribosome. Cell. 2007; 130:1019–1031. [DOI] [PubMed] [Google Scholar]
36. Takyar S., Hickerson R.P., Noller H.F.. mRNA helicase activity of the ribosome. Cell. 2005; 120:49–58. [DOI] [PubMed] [Google Scholar]
37. Qu X., Wen J.-D., Lancaster L., Noller H.F., Bustamante C., Tinoco I.. The ribosome uses two active mechanisms to unwind messenger RNA during translation. Nature. 2011; 475:118–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
38. Bouvier M., Sharma C.M., Mika F., Nierhaus K.H., Vogel J.. Small RNA binding to 5΄ mRNA coding region inhibits translational initiation. Mol. Cell. 2008; 32:827–837. [DOI] [PubMed] [Google Scholar]
39. Hüttenhofer A., Noller H.. Footprinting mRNA-ribosome complexes with chemical probes. EMBO J. 1994; 13:3892. [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Gruber A.R., Lorenz R., Bernhart S.H., Neuböck R., Hofacker I.L.. The vienna RNA websuite. Nucleic Acids Res. 2008; 36:W70–W74. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Farasat I., Kushwaha M., Collens J., Easterbrook M., Guido M., Salis H.M.. Efficient search, mapping, and optimization of multi‐protein genetic systems in diverse bacteria. Mol. Syst. Biol. 2014; 10:731. [DOI] [PMC free article] [PubMed] [Google Scholar]
42. Flamm C., Fontana W., Hofacker I.L., Schuster P.. RNA folding at elementary step resolution. RNA. 2000; 6:325–338. [DOI] [PMC free article] [PubMed] [Google Scholar]
43. Pertzev A.V., Nicholson A.W.. Characterization of RNA sequence determinants and antideterminants of processing reactivity for a minimal substrate of Escherichia coli ribonuclease III. Nucleic Acids Res. 2006; 34:3708–3721. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Varshavsky A. The N-end rule: functions, mysteries, uses. Proc. Natl. Acad. Sci. U.S.A. 1996; 93:12142–12149. [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Dougan D., Truscott K., Zeth K.. The bacterial N‐end rule pathway: expect the unexpected. Mol. Microbiol. 2010; 76:545–558. [DOI] [PubMed] [Google Scholar]
46. Lu J., Deutsch C.. Electrostatics in the ribosomal tunnel modulate chain elongation rates. J. Mol. Biol. 2008; 384:73–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
47. Sharma A.K., Bukau B., O’Brien E.P.. Physical origins of codon positions that strongly influence cotranslational folding: a framework for controlling nascent-protein folding. J. Am. Chem. Soc. 2016; 138:1180–1195. [DOI] [PubMed] [Google Scholar]
48. Dreyfus M. Killer and protective ribosomes. Progr. Mol. Biol. Transl. Sci. 2009; 85:423–466. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Click here for additional data file.^{(622.7KB, zip)}

[B1] 1. Gorochowski T.E., Ignatova Z., Bovenberg R.A., Roubos J.A.. Trade-offs between tRNA abundance and mRNA secondary structure support smoothing of translation elongation rate. Nucleic Acids Res. 2015; 43:3022–3032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] 2. Duval M., Simonetti A., Caldelari I., Marzi S.. Multiple ways to regulate translation initiation in bacteria: Mechanisms, regulatory circuits, dynamics. Biochimie. 2015; 114:18–29. [DOI] [PubMed] [Google Scholar]

[B3] 3. Goodman D.B., Church G.M., Kosuri S.. Causes and effects of N-terminal codon bias in bacterial genes. Science. 2013; 342:475–479. [DOI] [PubMed] [Google Scholar]

[B4] 4. Bentele K., Saffert P., Rauscher R., Ignatova Z., Blüthgen N.. Efficient translation initiation dictates codon usage at gene start. Mol. Syst. Biol. 2013; 9:675. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5. Gingold H., Pilpel Y.. Determinants of translation efficiency and accuracy. Mol. Syst. Biol. 2011; 7:481. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6. Tuller T., Waldman Y.Y., Kupiec M., Ruppin E.. Translation efficiency is determined by both codon bias and folding energy. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:3645–3650. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7] 7. Kudla G., Murray A.W., Tollervey D., Plotkin J.B.. Coding-sequence determinants of gene expression in Escherichia coli. science. 2009; 324:255–258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] 8. Espah Borujeni A., Salis H.M.. Translation initiation is controlled by RNA folding kinetics via a ribosome drafting mechanism. J. Am. Chem. Soc. 2016; 138:7016–7023. [DOI] [PubMed] [Google Scholar]

[B9] 9. Tian T., Salis H.M.. A predictive biophysical model of translational coupling to coordinate and control protein expression in bacterial operons. Nucleic Acids Res. 2015; 43:7137–7151. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10. Espah Borujeni A., Channarasappa A.S., Salis H.M.. Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites. Nucleic Acids Res. 2013; 42:2646–2659. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11. Salis H.M., Mirsky E.A., Voigt C.A.. Automated design of synthetic ribosome binding sites to control protein expression. Nat. Biotechnol. 2009; 27:946–950. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] 12. Reeve B., Hargest T., Gilbert C., Ellis T.. Predicting translation initiation rates for designing synthetic biology. Front. Bioeng. Biotechnol. 2014; 2:1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] 13. Aghdam E.M., Hejazi M.S., Barzegar A.. Riboswitches: From living biosensors to novel targets of antibiotics. Gene. 2016; 592:244–259. [DOI] [PubMed] [Google Scholar]

[B14] 14. McKeague M., Wong R.S., Smolke C.D.. Opportunities in the design and application of RNA for gene expression control. Nucleic Acids Res. 2016; 44:2987–2999. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15. Espah Borujeni A., Mishler D.M., Wang J., Huso W., Salis H.M.. Automated physics-based design of synthetic riboswitches from diverse RNA aptamers. Nucleic Acids Res. 2015; gkv1289. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16. Trinh C.T., Mendoza B.. Modular cell design for rapid, efficient strain engineering toward industrialization of biology. Curr. Opin. Chem. Eng. 2016; 14:18–25. [Google Scholar]

[B17] 17. Garcia-Ruiz E., HamediRad M., Zhao H.. Pathway design, engineering, and optimization. Adv. Biochem. Eng. Biotechnol. 2016; doi:10.1007/10_2016_12 [DOI] [PubMed] [Google Scholar]

[B18] 18. Zhou H., Vonk B., Roubos J.A., Bovenberg R.A., Voigt C.A.. Algorithmic co-optimization of genetic constructs and growth conditions: application to 6-ACA, a potential nylon-6 precursor. Nucleic Acids Res. 2015; 43:10560–10570. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19] 19. Freestone T.S., Zhao H.. Combinatorial pathway engineering for optimized production of the anti‐malarial FR900098. Biotechnol. Bioeng. 2016; 113:384–392. [DOI] [PubMed] [Google Scholar]

[B20] 20. Zeitoun R.I., Garst A.D., Degen G.D., Pines G., Mansell T.J., Glebes T.Y., Boyle N.R., Gill R.T.. Multiplexed tracking of combinatorial genomic mutations in engineered cell populations. Nat. Biotechnol. 2015; 33:631–637. [DOI] [PubMed] [Google Scholar]

[B21] 21. Carbonell P., Currin A., Jervis A.J., Rattray N.J., Swainston N., Yan C., Takano E., Breitling R.. Bioinformatics for the synthetic biology of natural products: integrating across the Design–Build–Test cycle. Nat. Product Rep. 2016; 33:925–932. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] 22. Coussement P., Maertens J., Beauprez J., Van Bellegem W., De Mey M.. One step DNA assembly for combinatorial metabolic engineering. Metab. Eng. 2014; 23:70–77. [DOI] [PubMed] [Google Scholar]

[B23] 23. Jones J.A., Toparlak Ö.D., Koffas M.A.. Metabolic pathway balancing and its role in the production of biofuels and chemicals. Curr. Opin. Biotechnol. 2015; 33:52–59. [DOI] [PubMed] [Google Scholar]

[B24] 24. Ng C.Y., Farasat I., Maranas C.D., Salis H.M.. Rational design of a synthetic Entner–Doudoroff pathway for improved and controllable NADPH regeneration. Metab. Eng. 2015; 29:86–96. [DOI] [PubMed] [Google Scholar]

[B25] 25. Chappell J., Watters K.E., Takahashi M.K., Lucks J.B.. A renaissance in RNA synthetic biology: new mechanisms, applications and tools for the future. Curr. Opin. Chem. Biol. 2015; 28:47–56. [DOI] [PubMed] [Google Scholar]

[B26] 26. Yang L., Nielsen A.A., Fernandez-Rodriguez J., McClune C.J., Laub M.T., Lu T.K., Voigt C.A.. Permanent genetic memory with>1-byte capacity. Nat. Methods. 2014; 11:1261–1266. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B27] 27. Brophy J.A., Voigt C.A.. Principles of genetic circuit design. Nat. Methods. 2014; 11:508–520. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28. Salis H.M. The ribosome binding site calculator. Methods Enzymol. 2011; 498:19–42. [DOI] [PubMed] [Google Scholar]

[B29] 29. Kosuri S., Goodman D.B., Cambray G., Mutalik V.K., Gao Y., Arkin A.P., Endy D., Church G.M.. Composability of regulatory sequences controlling transcription and translation in Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:14024–14029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B30] 30. Simonetti A., Marzi S., Jenner L., Myasnikov A., Romby P., Yusupova G., Klaholz B., Yusupov M.. A structural view of translation initiation in bacteria. Cell. Mol. Life Sci. 2009; 66:423–436. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B31] 31. Yusupova G.Z., Yusupov M.M., Cate J., Noller H.F.. The path of messenger RNA through the ribosome. Cell. 2001; 106:233–241. [DOI] [PubMed] [Google Scholar]

[B32] 32. Wallin G., Kamerlin S.C., Åqvist J.. Energetics of activation of GTP hydrolysis on the ribosome. Nat. Commun. 2013; 4:1733. [DOI] [PubMed] [Google Scholar]

[B33] 33. de Smit M.H., van Duin J.. Translational initiation on structured messengers: another role for the Shine-Dalgarno interaction. J. Mol. Biol. 1994; 235:173–184. [DOI] [PubMed] [Google Scholar]

[B34] 34. de Smit M.H., van Duin J.. Translational standby sites: how ribosomes may deal with the rapid folding kinetics of mRNA. J. Mol. Biol. 2003; 331:737–743. [DOI] [PubMed] [Google Scholar]

[B35] 35. Marzi S., Myasnikov A.G., Serganov A., Ehresmann C., Romby P., Yusupov M., Klaholz B.P.. Structured mRNAs regulate translation initiation by binding to the platform of the ribosome. Cell. 2007; 130:1019–1031. [DOI] [PubMed] [Google Scholar]

[B36] 36. Takyar S., Hickerson R.P., Noller H.F.. mRNA helicase activity of the ribosome. Cell. 2005; 120:49–58. [DOI] [PubMed] [Google Scholar]

[B37] 37. Qu X., Wen J.-D., Lancaster L., Noller H.F., Bustamante C., Tinoco I.. The ribosome uses two active mechanisms to unwind messenger RNA during translation. Nature. 2011; 475:118–121. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B38] 38. Bouvier M., Sharma C.M., Mika F., Nierhaus K.H., Vogel J.. Small RNA binding to 5΄ mRNA coding region inhibits translational initiation. Mol. Cell. 2008; 32:827–837. [DOI] [PubMed] [Google Scholar]

[B39] 39. Hüttenhofer A., Noller H.. Footprinting mRNA-ribosome complexes with chemical probes. EMBO J. 1994; 13:3892. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B40] 40. Gruber A.R., Lorenz R., Bernhart S.H., Neuböck R., Hofacker I.L.. The vienna RNA websuite. Nucleic Acids Res. 2008; 36:W70–W74. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B41] 41. Farasat I., Kushwaha M., Collens J., Easterbrook M., Guido M., Salis H.M.. Efficient search, mapping, and optimization of multi‐protein genetic systems in diverse bacteria. Mol. Syst. Biol. 2014; 10:731. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B42] 42. Flamm C., Fontana W., Hofacker I.L., Schuster P.. RNA folding at elementary step resolution. RNA. 2000; 6:325–338. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B43] 43. Pertzev A.V., Nicholson A.W.. Characterization of RNA sequence determinants and antideterminants of processing reactivity for a minimal substrate of Escherichia coli ribonuclease III. Nucleic Acids Res. 2006; 34:3708–3721. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B44] 44. Varshavsky A. The N-end rule: functions, mysteries, uses. Proc. Natl. Acad. Sci. U.S.A. 1996; 93:12142–12149. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B45] 45. Dougan D., Truscott K., Zeth K.. The bacterial N‐end rule pathway: expect the unexpected. Mol. Microbiol. 2010; 76:545–558. [DOI] [PubMed] [Google Scholar]

[B46] 46. Lu J., Deutsch C.. Electrostatics in the ribosomal tunnel modulate chain elongation rates. J. Mol. Biol. 2008; 384:73–86. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B47] 47. Sharma A.K., Bukau B., O’Brien E.P.. Physical origins of codon positions that strongly influence cotranslational folding: a framework for controlling nascent-protein folding. J. Am. Chem. Soc. 2016; 138:1180–1195. [DOI] [PubMed] [Google Scholar]

[B48] 48. Dreyfus M. Killer and protective ribosomes. Progr. Mol. Biol. Transl. Sci. 2009; 85:423–466. [DOI] [PubMed] [Google Scholar]

PERMALINK

Precise quantification of translation inhibition by mRNA structures that overlap with the ribosomal footprint in N-terminal coding sequences

Amin Espah Borujeni

Daniel Cetnar

Iman Farasat

Ashlee Smith

Natasha Lundgren

Howard M Salis

Abstract

INTRODUCTION