Abstract
Specific interaction between the start codon, 5’-AUG-3’, and the anticodon, 5’-CAU-3’, ensures accurate initiation of translation. Recent studies show that several near-cognate start codons (e.g. GUG and CUG) can play a role in initiating translation in eukaryotes. However, the mechanism allowing initiation through mismatched base-pairs at the ribosomal decoding site is still unclear at an atomic level. In this work, we propose an extended simulation-based method to evaluate free energy profiles, through computing the distance between each base-pair of the triplet interactions involved in recognition of start codons in eukaryotic translation pre-initiation complex. Our method provides not only the free energy penalty for mismatched start codons relative to the AUG start codon, but also the preferred pathways of transitions between bound and unbound states, which has not been described by previous studies. To verify the method, the binding dynamics of cognate (AUG) and near-cognate start codons (CUG and GUG) were simulated. Evaluated free energy profiles agree with experimentally observed changes in initiation frequencies from respective codons. This work proposes for the first time how a G:U mismatch at the first position of codon (GUG)-anticodon base-pairs destabilizes the accommodation in the initiating eukaryotic ribosome and how initiation at a CUG codon is nearly as strong as, or sometimes stronger than, that at a GUG codon. Our method is expected to be applied to study the affinity changes for various mismatched base-pairs.
Author summary
Ribosomes synthesize proteins according to the sequence of nucleotides (A, U, G, and C) in mRNA, translating three nucleotides (codon) into an amino acid. If the reading frame is shifted, the resulting amino-acid sequence will be totally different. Hence, the translation should start at an exactly determined position of mRNA. This position is usually indicated by “AUG” (start codon), which is recognized by “CAU” anticodon in tRNA and then translated into methionine. However, translation sometimes initiates at another codon such as “GUG” or “CUG”, and its frequency varies depending on the codon. Then, what regulates the possibility of translation-initiation at such a mismatched codon? To answer the question, we employed computer simulation for the structural changes of RNAs and proteins involved in the process. Through the numerical analysis, we estimated how strongly the mRNA (codon) and the tRNA (anticodon) bind to each other and also inferred how they approach. The binding-strength correlates with the initiation frequency observed in experiments, and the approaching pathway could explain the difference. Our result shows the underlying mechanism for the fidelity of translation-initiation, and our method will be applied to the prediction and design of RNA-RNA interactions.
Introduction
The translation reaction, or mRNA-dependent protein synthesis, is catalyzed by the ribosome, the macromolecular ribonucleoprotein complex [1, 2]. During eukaryotic initiation, the ribosome dissociates into the large (60S) and small (40S) subunits, and the latter binds the methionyl initiator tRNA () and mRNA with the help of eukaryotic initiation factors (eIFs) [3, 4]. is recruited by eIF2, a heterotrimeric factor that binds the tRNA in a manner dependent on GTP binding. The resulting ternary complex (TC) binds the 40S subunit in the context of multifactor complex (MFC) with eIFs 1, 3 and 5, forming the 43S preinitiation complex (PIC) [5]. The mRNA is bound by eIF4F through its 5’ cap, which then is recruited to the 40S subunit through eIF3 in mammals and eIF5 in yeast [6]. The 48S pre-initiation complex (PIC) thus formed scans for the start codon in the process called scanning. The PIC is primed for scanning by the eIF5-catalyzed GTP hydrolysis for eIF2; the products, GDP and Pi, stay bound to eIF2 during scanning. The start codon base-pairing with the anticodon in the P-site allows the 40S subunit to stall at the start codon and then join the 60S subunit after most bound eIFs are released in conjunction with Pi release from eIF2 [7, 8]. The resulting 80S initiation complex accepts an amino-acyl tRNA in the A-site to begin the translation elongation cycle.
The fidelity of start codon recognition is regulated by eIF1A and eIF1 that bind the 40S subunit A-site and P-site, respectively [9, 10]. Essentially, these factors regulate the conformational changes of the PIC. Thus, the N-terminal tail of eIF1A interacts with the codon-anticodon base-pairs to stabilize its closed conformation for initiation [11]. By contrast, eIF1 stabilizes its open, scanning-competent conformation by physically impeding the P-site accommodation of the mismatched codon-anticodon base-pairs [5]. Upon start codon selection by the PIC, eIF1 is released to permanently stabilize the closed state [12, 13]. The PIC-bound eIFs directly interact with eIF1 to control the balance of its binding and release, thereby keeping the level of initiation accuracy appropriate for eukaryotic cell function [6, 14–16]; for review, see [17].
Despite the aforementioned mechanisms to ensure the accurate initiation of translation, most eukaryotes tested allow translation initiation from near-cognate start codons (start codons with one-base substitution in the AUG codon), for example, CUG and GUG, at a low frequency [18–21]. However, not all the near-cognate start codons can initiate translation at an equal rate. Among the most puzzling is the observation that GUG serves as a poor initiation site, even though the G residue in the 1st position can potentially wobble-base-pair with the U residue in the anticodon. In agreement with a wobble base-pairing, GUG serves as a normal initiation site in prokaryotes (Bacteria and Archaea), which possess fewer initiation factors than in eukaryotes [17]. Moreover, CUG is considered as the strongest near-cognate start codon in eukaryotes, even though it is not a start codon in prokaryotes [17]. To solve these conundrums, it is crucial to probe the stability of codon-anticodon interactions in the P-site. As it is difficult to experimentally measure these interactions at an atomic resolution, computational analysis offers an effective solution [22, 23]. Specifically, molecular interactions are represented by the free energy profiles, which in this case depend on the nucleotide constitutions of codon and anticodon, and thereby determine the frequency of the bound state. Thus, computational estimation of their free energy profiles provides an insight into the mechanism of translation initiation by various start codons.
Previous molecular dynamics (MD) simulation studies estimated energy gap between AUG and mismatched codons by computing free energy scores of interaction between these codons by free energy perturbation (FEP) [24]. The work identified CUG as the strongest near-cognate start codon. Moreover, it demonstrated eIF1’s contribution in discriminating from near-cognate start codons. However, there are mainly two caveats in the FEP approach. First, except for the conformation of AUG found within the PIC structure, the geometry of near-cognate start codon was conjectured merely through substituting a base with the corresponding base in the AUG. Second, information on the binding process was missing. As single-strands of RNA are flexible [25], their structural changes may be diverse. Although the binding free energy is independent of the transition path and thus could be calculated by the FEP method, it is crucial to estimate structural changes involved in the binding process in order to understand the selective binding mechanism.
In this study, we employed adaptive biasing force (ABF) method [26–28] in order to overcome these problems. This method explores codon-anticodon binding free energy by using systematic reaction coordinates. By using distances of each triplet base-pair as the reaction coordinates, we generated the free energy profiles of base-pairing interactions between anticodon and cognate (AUG) or near-cognate (CUG or GUG) start codons. Our results provide structural insights related to a strong penalty for placing GUG codon in the P-site and a permissiveness of CUG as a potential start codon in eukaryotes.
Materials and methods
Simulation procedure
We referred to previous study of start codon recognition in eukaryotic translation initiation using all-atom molecular dynamics (MD) simulations [24]. To reconstruct codon-anticodon interaction in solution, we employed open pre-initiation complex (PIC) structure (PDB ID: 3J81) determined by CryoEM [11]. To reduce computational cost for MD simulation, we extract atoms within 25 Å from N1 atom in the middle base of anticodon in tRNA molecule [24]. Then, nucleotides were edited to reconstruct PIC models involving target codons in our study (Table 1). When editing the nucleotide (e.g. AUG → GUG), first, atoms except N1 and N9 in base group, sugar group, and phosphate group were deleted. Then, coordinates of missing atoms were inferred. All histidine residues were configured as ϵ-protonated. These molecules were soaked into a 36 Å radius water sphere (Fig 1), neutralized by K+, and added 150 mM KCl. TIP3P water model was employed. VMD [29] was used to infer missing atom coordinates, solvate the model, and visualize the structure throughout the study.
Table 1. Codons modeled in our simulation.
Nucleotide sequence and reference atoms to define the codon-anticodon distance.
Codon | 1st Nucleotide | 2nd Nucleotide | 3rd Nucleotide | |||
---|---|---|---|---|---|---|
AUG | A | N1 and N6 | U | N3 and O4 | G | N1, N2, and O6 |
GUG | G | N1, N2, and O6 | U | N3 and O4 | G | N1, N2, and O6 |
CUG | C | N3, N4, and O2 | U | N3 and O4 | G | N1, N2, and O6 |
Fig 1. PIC structure models and reaction coordinates in this study.
(Left) mRNA and tRNA around the codon-anticodon base-pairs (Table 1) are shown in black and silver, respectively. Distances d1, d2 and d3 used as reaction coordinates are indicated. (Center) rRNA segments (split) are shown in green, and protein components are shown in other colors. (Right) Water molecules are shown in pink, which enclose the system and form a sphere.
All simulations were carried out using NAMD (version 2.13 multi-core) [30]. CHARMM36 force-field (July 2019 update) was used [31, 32]. Multilevel summation method (MSM) electrostatics [33] was employed. Cutoff at 12 Å (with switching from 10 Å) was applied to non-bonded interactions. Temperature and pressure were set at 310 K and 1 atm, respectively; Langevin thermostat (damping coefficient: 5/ps) and Langevin-piston barostat were adopted. All C1’ (nucleotide) and Cα (amino acid) atoms farther than 22 Å from the center of the system (i.e. water sphere) were restrained at their initial positions, and water molecules crossing the boundary of water sphere (radius 36 Å) were restrained. Harmonic potential functions with spring constant 10 pN/Å were adopted as the restraint of molecules. After energy minimization (10,000 steps), the system was equilibrated for 10 ns, and then simulated for 1 μs (time-step: 2 fs); the biasing force was applied only after collecting 200 samples in the bin. Each model (Table 1) was simulated five times.
Multi dimensional adaptive biasing force (ABF) method
We performed adaptive biasing force (ABF) molecular dynamics method [26–28] to evaluate multi-dimensional free energy profiles in terms of d1, d2, and d3, which were defined as the distances of the 1st, 2nd, and 3rd base-pairs in Å, respectively (Table 1). Specifically, these di were evaluated as distance between the centers of hydrogen donor and acceptor atoms of codon and anticodon (Table 1 shows the atoms in a mRNA segment). Each di was sampled over 4.0 ≤ di ≤ 9.0 with bin width Δd = 0.5 Å, and was restrained by harmonic bonds with spring constant 10 pN/Å if di crosses lower boundary (3.0 Å) or upper boundary (10.0 Å).
The Gibbs free energy profile G(d1, d2, d3) with respect to three variables d1 to d3 was obtained through the analysis of ABF results. The probability P(d1, d2, d3) of each state obeys Eq 1:
(1) |
Furthermore, P(d1, d2, d3) was averaged over the simulation trials for each model, and then G(d1, d2, d3) was evaluated as Eq 2:
(2) |
assuming min{G(d1, d2, d3)|4.0 ≤ d1, d2, d3 ≤ 9.0} = 0.
Then, to evaluate the free energy difference between the codon-anticodon bound and unbound states, we defined the free energy scores Gbound and Gunbound, and their gap ΔGbinding as Eq 3:
(3) |
Here we defined the bound and unbound states as ∀i : 4.0 ≤ di ≤ 6.0 and ∀i : 7.0 ≤ di ≤ 9.0, respectively (Fig 2). The distance range for the bound state corresponds to the codon-anticodon (AUG-CAU) structure [11], and that for the unbound state is based on a previous research that described unbound conformation of the complex [34]. Gbound and Gunbound were hence weighted average of G(d1, d2, d3) in the ranges of bound and unbound states, respectively (Eq 3). Note that Gbound, Gunbound, and ΔGbinding were obtained from G(d1, d2, d3) either in each simulation trial or over five trials for the same model.
Fig 2. Binding free energy.
Schematic representation of Gbound, Gunbound, and ΔGbinding (Eq 3).
To visualize the profiles, G(d1, d2, d3) was projected (marginalized) onto 2-dimension, as G(d1, d2) in Eq 4. Projected profile P(d1, d2) was obtained from G(d1, d2).
(4) |
G(d1, d3), G(d2, d3) and P(d1, d3), P(d2, d3) were defined in the same way.
Convergence of free energy profiles
Probability P(d1, d2, d3) was obtained at equal intervals Δτ (= 100 ns) through the ABF simulation. We evaluated the convergence of P(d1, d2, d3) with increasing simulation time steps, as follows. To compare profiles of P(d1, d2, d3) at two different time points (τ1, τ2), we employed squared-error function of two probability distributions. These profiles of different time points τ1 and τ2 were represented by P(d1, d2, d3;τ1) and P(d1, d2, d3;τ2), respectively. Then, the squared-error function L(τ1, τ2) was calculated as Eq 5:
(5) |
The convergence of P(d1, d2, d3) at τ [ns] was tested by L(τ, τ − Δτ) (Fig 3). The convergence of P(d1, d2), P(d2, d3), and P(d1, d3) was evaluated in the same way.
Fig 3. Convergence of free energy profiles with time evolution.
The L(τ, τ − Δτ) of P(d1, d2, d3), P(d1, d2), and P(d2, d3) are shown (see Eq 5; Δτ = 100 ns). The data points are plotted between the two consecutive time points (τ − Δτ) and τ.
Reconstruction of averaged structure
Typical structures, or atomic coordinates corresponding to a specific reaction coordinate (d1, d2, d3), were obtained by averaging the sampled atomic coordinates as follows. Here, we assumed that the reaction coordinate (d1, d2, d3) is represented by () if Eq 6 is satisfied.
(6) |
Then, each atomic coordinate was averaged over all the snapshots (sampled at 10 ps intervals) corresponding to the representative reaction coordinate ().
Results
Binding free energy
We evaluated the free energy scores of bound and unbound states (Gbound and Gunbound; see Eq 3) averaged over five simulation trials (S1 Fig), and presented their difference ΔGbinding in Fig 4. In the case of the cognate start codon AUG, ΔGbinding must be negative to stabilize the initiation of translation, and was indeed ∼ −4 kB T. In contrast, the GUG codon, less frequently used as a start codon [17, 20, 21], showed a positive ΔGbinding ∼ 2 kB T. For the CUG codon, which is considered as a stronger start codon than GUG [17, 20, 21], ΔGbinding showed an intermediate value ∼ 1 kB T. Thus, ΔGbinding accounts for observed initiation rates from the respective start codons. Note that ΔGbinding could alternatively be obtained from individual simulation trajectories (S2 Fig); assuming each trial as a sample, we confirmed the significance of ΔGbinding between AUG and GUG by Welch’s t-test (p = 0.044).
Fig 4. Estimated binding free energy.
ΔGbinding scores (Eq 3; see also Fig 2) are shown. The scores were obtained from P(d1, d2, d3) averaged over five simulation trials for each model.
Free energy landscape for the binding process
To analyze the base-pair binding dynamics in detail, we constructed projected profiles G(d1, d2), G(d1, d3), and G(d2, d3) from G(d1, d2, d3) (see Materials and methods), as shown in Fig 5. We inferred the transition dynamics from the free energy profiles; Fig 6 shows the suggested paths and their schematics.
Fig 5. Projected free energy profiles.
Profiles of G(d1, d2), G(d2, d3), and G(d1, d3) obtained from Eq 4 for each model is shown by contour plots.
Fig 6. Schematics of the base-pair binding dynamics.
Conformational changes inferred from the free energy landscape (Fig 5). The transition path (• is AUG or GUG) is shown by black arrows (thick arrow: frequent transition; thin arrow: less frequent transition). Blue and red dotted lines indicate Watson-Crick or wobble base-pairing, respectively. In left, AUG pairing is described as being stabilized by the first position. Subsequently, the binding propagates into the second and the third position. In right, GUG pairing is described as bistable states; the base-pairing between GU and the corresponding bases in the anticodon, and the base pairing between the 3rd G and the C residue of the anticodon. Thus, a complete base-triplet interaction is unstable.
In the case of the AUG start codon, the following transitions were expected in the AUG-CAU dynamics in equilibrium (Fig 6, left). Starting from the bound state, d3 shows large fluctuations while d1 and d2 show small ones () (Fig 5B and 5C). d1 and d2 are bistable (bound and unbound), and once the 3rd G:C base-pair is broken (large d3), the 2nd U:A base-pair may become unbound ( to the large d2 state) (Fig 5B). Only after that, the 1st A:U base-pair dissociates ( to the large d1 state) (Fig 5A); this is expected to occur less frequently due to the higher barrier than those for and . Starting from the unbound state and reversing the process above, the AUG codon should bind to the CAU anticodon from the side of the 1st A:U base-pair, followed by the 2nd and then 3rd base-pairs (Fig 6, left). This result suggests that the recognition of the 1st A:U base-pair is very important for the accurate start codon recognition, in agreement with the role and location of eIF1 in the P-site [11, 12] (see below).
In the case of the GUG codon (Fig 6, right), d1 and d2 show the transition () between two distinct (metastable) states (both bound and both almost unbound) (Fig 5D), while d3 is mostly high (Fig 5E and 5F). Binding of the 3rd base-pair (transition to lower d3) is possible but less frequent, and simultaneous binding of the 1st and 3rd base-pairs is rare (Fig 5F). As expected, the affinity of the 1st base-pair (wobble G:U) is lower than the case of AUG. This result is consistent with infrequent GUG initiation observed in the previous works [17, 20, 21].
In the case of the CUG codon, many meta-stable states were observed as shown in Fig 5G–5I. Transition paths seem to be more complicated than the AUG and GUG cases. Although concurrent binding of the 2nd and 3rd base-pairs (lower d2 and d3) is possible (Fig 5H), the 1st base-pair cannot form simultaneously with these other base-pairs (Fig 5G and 5I), which makes the CUG pairing unstable compared to AUG base-pairing. Overall, however, the binding free energy ΔGbinding is lower for CUG than for GUG (Fig 4) (see below). Note that, technically, the rugged free energy landscape (Fig 5G–5I) demanded more computational cost for the ABF sampling, as suggested by the slow convergence shown in Fig 3.
Binding dynamics from the structural views
To visualize the bound structures and consider the mechanism underlying the abovementioned results (Figs 5 and 6), we evaluated the averaged structure of the bound state for each model (see Eq 6 in Materials and Methods). The averaged structures and schematics of the codon and anticodon are shown in Fig 7.
Fig 7. Typical bound structures.
Averaged structures corresponding to reaction coordinate (see Eq 6). In the structures on top, nucleotides of the codon (orange) and anticodon (gray) are drawn by thick lines. Red and blue lines are parts of eIF1 and eIF1A, respectively. Note that the structures of GUG and CUG pairing (middle and right) are much less stable than that of AUG pairing (left) (see Fig 4). The schematics on the bottom describe the direction of each base relative to the paired base.
In the case of AUG, the averaged bound-state structure is ordered and tightly bound. It is reasonable, as it is the correct start codon, and the binding free energy is negative (Fig 4). Note that eIF1 and eIF1A molecules (shown in red and blue in Fig 7, respectively) are present near the AUG-CAU base-pair. It was experimentally suggested that these proteins contribute to the accurate start codon recognition [5, 9–13] (see Fig 8).
Fig 8. Binding mechanism conjectured by this study.
mRNA and tRNA are drawn by orange and gray, respectively. Red lines show parts of eIF1. (a) Schematics of binding and unbinding dynamics of AUG pairing in the P-site. (Left) AUG binding is facilitated by the stabilization of the first base-pair, followed by base-pairing of the 2nd and 3rd positions (downward arrows). (Right) AUG unbinding appears to be facilitated by dissociation of the 3rd position (upward arrow) through stretching (thick arrow) imposed by the open (POUT) conformation of the PIC [11]. Here, eIF1 β-hairpin loop (red) is proposed to prevent dissociation of the base-pairing at the 1st position [35]. (b) Averaged structures corresponding to reaction coordinate (see Eq 6), and their schematic representations. Nucleotides of the codon and anticodon, and (Asn-34) and R36 (Arg-36) are drawn by thick lines.
By contrast, the structure of the GUG-CAU base-pair is disordered (Fig 7). The mismatched bases (the 1st G:U) avoid each other (rather than forming a wobble base-pair) and the uracil in tRNA tilts toward the 2nd U:A base-pair. The directions of the 2nd and 3rd base-pairs were consequently affected, resulting in the unstable bound state (Fig 4). Although the projected free energy profile G(d1, d2) (Fig 5D) suggests cooperative binding of the 1st and 2nd base-pairs (Fig 7), the 3rd base-pair is mostly separate, which may prevent the recognition of the GUG start codon.
In the case of CUG, the structure is relatively ordered (Fig 7). Although the 1st C:U base-pair is mismatched, cytosine is smaller than guanine and adenine (purine bases), which may mitigate steric hindrance at the 1st position. As shown in Fig 5G to 5I, many meta-stable conformations are possible, which we propose to be attributed to combinations of bound and unbound conformations of the base-pairs. It is therefore reasoned that some near-bound states can occasionally allow translation initiation at this codon (Fig 7, right).
Discussion
The binding free energy ΔGbinding evaluated by our method was qualitatively consistent with experimental observation, as described in Fig 4. We compared the AUG start codon and two near-cognate start codons (Table 1). Assuming that Gunbound is common for all the models, i.e. the free energy of the unbound state is independent of the codon, Gbound and ΔGbinding are equivalent. The difference, or penalty, of binding free energy (ΔΔG) induced by AUG → GUG and AUG → CUG substitution was ≃ 6 kB T (3.6 kcal/mol) and ≃ 5 kB T (3.0 kcal/mol), respectively. This result is largely consistent with another computational approach using the free energy perturbation (FEP) [24].
In contrast to the previous work, however, our ABF-based approach provided not merely the binding free energy but information on the nucleic acid binding dynamics represented by the free energy landscape (Figs 5 and 6). The free energy profiles shown in Fig 5A–5C suggested an unexpected stability of the 1st A:U base-pair, compared to the 3rd G:C base-pair. According to the free energy profile of AUG binding, dissociation of the triplet base-pairs starts at the 3rd G:C (Fig 6, ). In the open PIC model that is suggested to occur during the scanning process prior to start codon recognition [11], the tRNA is not perpendicularly attached to the mRNA, in contrast to the P-site tRNA positioning during the elongation phase. This conformation appears to allow the 5’-side (i.e. cytosine side) of the anticodon to curve away from the start codon, suggesting a stretching force towards the tRNA side (Fig 8A). We propose that this stretching decreases the affinity of the 3rd G:C base-pair during the scanning process (Fig 8A). In contrast, the affinity of the 1st A:U base-pair is likely to be increased by interaction with eIF1, so is that of the 2nd U:A base-pair by eIF1A, as proposed previously [5, 9–13] (Fig 8A). In strong agreement with the role of eIF1 in stabilizing the 1st A:U base-pair, our averaged simulation structure indeed positions Asn-34 and Arg-36 in its proximity (Fig 8B). In fact, the residues Asn-34:Gly-35:Arg-36, termed β-hairpin loop 1, is absolutely conserved from yeast to human. Mutations altering Asn-34 and Arg-36 display significant increase in UUG initiation [35], in agreement with their crucial role in maintaining open scanning-competent PIC conformation.
The free energy landscape of GUG-anticodon base-pairs (Fig 5D–5F) and its averaged simulation structure in the P-site (Figs 7 and 8B) also suggest that the same structural restriction in turn prevents G:U pairing at the 1st position, that otherwise occurs frequently in its free form. The disordered 3rd G:C base-pair seen with the GUG structure appears to be consistent with this idea (Fig 7). Since we did not observe a strong disorder in CUG-anticodon structure (Fig 7), we propose that the near-cognate start codon usage characteristic of eukaryotic initiation is mostly explained by a strong perturbation on GUG accommodation in the P-site due to steric restriction imposed by eIF1 β-hairpin loop. In agreement with this thesis, the level of CUG initiation is just equivalent to that of GUG initiation in yeast S. cerevisiae [19] (and our personal observations), although the former is significantly stronger than the latter in various distinct contexts in human cells [21].
The adaptive biasing force (ABF) method adopted in this study was previously used to evaluate free energy profiles of molecular dynamics in other biomolecular systems [26–28]. Among many extended ensemble simulation methods, we chose ABF to mitigate the difficulties in studying nucleic acid (DNA and RNA) interactions. An example of such difficulty is encountered when high temperature is applied in replica exchange molecular dynamics [36] that causes complete separation of nucleic acid strands, which cannot revert to the original structure within the simulation timescale. To overcome this problem, extended simulation methods without destroying the structure should be sought, and the ABF can offer a solution. We hope that further development of this approach can also contribute to the improvement of analysis and prediction of RNA structural dynamics [37, 38].
Conclusion
In this study, we proposed a computational method to obtain multi-dimensional free energy profiles for codon-anticodon base-pairing by adaptive biasing force (ABF) molecular dynamics [26–28]. This reaction-coordinate-based analysis method provided the equilibrium profiles of base-pair binding dynamics depending on ribonucleotide sequence—start codon-anticodon base-pairing in this case. Our method successfully detected the changes in the free energy landscape (Figs 5 and 6) induced by site-specific nucleotide substitution in the start codon (e.g. AUG → GUG) and offered a mechanistic explanation for how such changes led to perturbation in initiation frequency from the altered start codons.
Supporting information
Red and blue bars correspond to Gbound and Gunbound (Eq 3), respectively (see the schematics in Fig 2). The scores were obtained from P(d1, d2, d3) averaged over five simulation trials for each model.
(TIF)
Error bars show the mean ± S.E.M. of ΔGbinding obtained from each simulation trajectory. Note that the mean ΔGbinding here is different from the ΔGbinding score in Fig 4, for which P(d1, d2, d3), not G, was averaged over simulation trials; this SI figure is presented only for reference and Fig 4 is physically relevant.
(TIF)
Acknowledgments
The authors are grateful to A. Awazu and C. Singh for fruitful discussions. The computation was carried out using the computer resource offered under the category of General Projects by Research Institute for Information Technology, Kyushu University.
Data Availability
The datasets generated for this study are available from the Zenodo repository doi: 10.5281/zenodo.4577431.
Funding Statement
YT was supported by JSPS KAKENHI Grant Number JP18KK0388, and by JSPS and NRF under the Japan-Korea Basic Scientific Cooperation Program (https://www.jsps.go.jp/, https://www.nrf.re.kr/). KA was supported by K-INBRE program Pilot Grant (https://www.k-inbre.org/), National Institutes of Health [P20 GM103418]; National Institutes of Health R15 grant [GM125671] (https://www.nih.gov/); National Science Foundation Research Grant [1412250] (https://www.nsf.gov/); and JSPS KAKENHI [JP18K19963] (https://www.jsps.go.jp/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Asano K. Translational control. In: Encyclopedia of Systems Biology. Springer; New York; 2013. p. 2278–2282. [Google Scholar]
- 2. Asano K, Ito K. Translation elongation. In: Encyclopedia of Systems Biology. Springer; New York; 2013. p. 2259–2263. [Google Scholar]
- 3. Hinnebusch AG, Dever TE, Asano K. Mechanism of translation initiation in the yeast Saccharomyces cerevisiae. Cold Spring Harbor Monograph Series. 2007;48:225–268. [Google Scholar]
- 4. Sonenberg N, Hinnebusch AG. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell. 2009;136(4):731–745. doi: 10.1016/j.cell.2009.01.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Asano K, Clayton J, Shalev A, Hinnebusch AG. A multifactor complex of eukaryotic initiation factors, eIF1, eIF2, eIF3, eIF5, and initiator tRNAMet is an important translation initiation intermediate in vivo. Genes Dev. 2000;14(19):2534–2546. doi: 10.1101/gad.831800 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Singh CR, Watanabe R, Chowdhury W, Hiraishi H, Murai MJ, Yamamoto Y, et al. Sequential eukaryotic translation initiation factor 5 (eIF5) binding to the charged disordered segments of eIF4G and eIF2β stabilizes the 48S preinitiation complex and promotes its shift to the initiation mode. Mol Cell Biol. 2012;32(19):3978–3989. doi: 10.1128/MCB.00376-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Singh CR, Lee B, Udagawa T, Mohammad-Qureshi SS, Yamamoto Y, Pavitt GD, et al. An eIF5/eIF2 complex antagonizes guanine nucleotide exchange by eIF2B during translation initiation. EMBO J. 2006;25(19):4537–4546. doi: 10.1038/sj.emboj.7601339 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Algire MA, Maag D, Lorsch JR. Pi release from eIF2, not GTP hydrolysis, is the step controlled by start-site selection during eukaryotic translation initiation. Mol Cell. 2005;20(2):251–262. doi: 10.1016/j.molcel.2005.09.008 [DOI] [PubMed] [Google Scholar]
- 9. Passmore LA, Schmeing TM, Maag D, Applefield DJ, Acker MG, Algire MA, et al. The eukaryotic translation initiation factors eIF1 and eIF1A induce an open conformation of the 40S ribosome. Mol Cell. 2007;26(1):41–50. doi: 10.1016/j.molcel.2007.03.018 [DOI] [PubMed] [Google Scholar]
- 10. Saini AK, Nanda JS, Lorsch JR, Hinnebusch AG. Regulatory elements in eIF1A control the fidelity of start codon selection by modulating binding to the ribosome. Genes Dev. 2010;24(1):97–110. doi: 10.1101/gad.1871910 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Hussain T, Llácer JL, Fernández IS, Munoz A, Martin-Marcos P, Savva CG, et al. Structural changes enable start codon recognition by the eukaryotic translation initiation complex. Cell. 2014;159(3):597–607. doi: 10.1016/j.cell.2014.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Llácer JL, Hussain T, Marler L, Aitken CE, Thakur A, Lorsch JR, et al. Conformational differences between open and closed states of the eukaryotic translation initiation complex. Mol Cell. 2015;59(3):399–412. doi: 10.1016/j.molcel.2015.06.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Llácer JL, Hussain T, Saini AK, Nanda JS, Kaur S, Gordiyenko Y, et al. Translational initiation factor eIF5 replaces eIF1 on the 40S ribosomal subunit to promote start-codon recognition. eLife. 2018;7:e39273. doi: 10.7554/eLife.39273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Obayashi E, Luna RE, Nagata T, Martin-Marcos P, Hiraishi H, Singh CR, et al. Molecular landscape of the ribosome pre-initiation complex during mRNA scanning: structural role for eIF3c and its control by eIF5. Cell Rep. 2017;18(11):2651–2663. doi: 10.1016/j.celrep.2017.02.052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Luna RE, Arthanari H, Hiraishi H, Akabayov B, Tang L, Cox C, et al. The interaction between eukaryotic initiation factor 1A and eIF5 retains eIF1 within scanning preinitiation complexes. Biochemistry. 2013;52(52):9510–9518. doi: 10.1021/bi4009775 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Luna RE, Arthanari H, Hiraishi H, Nanda J, Martin-Marcos P, Markus MA, et al. The C-terminal domain of eukaryotic initiation factor 5 promotes start codon recognition by its dynamic interplay with eIF1 and eIF2β. Cell Rep. 2012;1(6):689–702. doi: 10.1016/j.celrep.2012.04.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Asano K. Why is start codon selection so precise in eukaryotes? Translation. 2014;2(1):e28387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Ingolia NT, Lareau LF, Weissman JS. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell. 2011;147(4):789–802. doi: 10.1016/j.cell.2011.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Kolitz SE, Takacs JE, Lorsch JR. Kinetic and thermodynamic analysis of the role of start codon/anticodon base pairing during eukaryotic translation initiation. RNA. 2009;15(1):138–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Wei J, Zhang Y, Ivanov IP, Sachs MS. The stringency of start codon selection in the filamentous fungus Neurospora crassa. J Biol Chem. 2013;288(13):9549–9562. doi: 10.1074/jbc.M112.447177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Tang L, Morris J, Wan J, Moore C, Fujita Y, Gillaspie S, et al. Competition between translation initiation factor eIF5 and its mimic protein 5MP determines non-AUG initiation rate genome-wide. Nucleic Acids Res. 2017;45(20):11941–11953. doi: 10.1093/nar/gkx808 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Karplus M, McCammon JA. Molecular dynamics simulations of biomolecules. Nat Struct Biol. 2002;9(9):646–652. doi: 10.1038/nsb0902-646 [DOI] [PubMed] [Google Scholar]
- 23. Hansson T, Oostenbrink C, van Gunsteren W. Molecular dynamics simulations. Curr Opin Struct Biol. 2002;12(2):190–196. doi: 10.1016/S0959-440X(02)00308-1 [DOI] [PubMed] [Google Scholar]
- 24. Lind C, Åqvist J. Principles of start codon recognition in eukaryotic translation initiation. Nucleic Acids Res. 2016;44(17):8425–8432. doi: 10.1093/nar/gkw534 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Mills JB, Vacano E, Hagerman PJ. Flexibility of single-stranded DNA: use of gapped duplex helices to determine the persistence lengths of poly (dT) and poly (dA). J Mol Biol. 1999;285(1):245–257. doi: 10.1006/jmbi.1998.2287 [DOI] [PubMed] [Google Scholar]
- 26. Darve E, Rodríguez-Gómez D, Pohorille A. Adaptive biasing force method for scalar and vector free energy calculations. J Chem Phys. 2008;128(14):144120. doi: 10.1063/1.2829861 [DOI] [PubMed] [Google Scholar]
- 27. Henin J, Fiorin G, Chipot C, Klein ML. Exploring multidimensional free energy landscapes using time-dependent biases on collective variables. J Chem Theory Comput. 2010;6(1):35–47. doi: 10.1021/ct9004432 [DOI] [PubMed] [Google Scholar]
- 28. Comer J, Gumbart JC, Hénin J, Lelièvre T, Pohorille A, Chipot C. The adaptive biasing force method: Everything you always wanted to know but were afraid to ask. J Phys Chem B. 2015;119(3):1129–1151. doi: 10.1021/jp506633n [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996;14(1):33–38. doi: 10.1016/0263-7855(96)00018-5 [DOI] [PubMed] [Google Scholar]
- 30. Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, et al. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26(16):1781–1802. doi: 10.1002/jcc.20289 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Hart K, Foloppe N, Baker CM, Denning EJ, Nilsson L, MacKerell AD Jr. Optimization of the CHARMM additive force field for DNA: Improved treatment of the BI/BII conformational equilibrium. J Chem Theory Comput. 2012;8(1):348–362. doi: 10.1021/ct200723y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Best RB, Zhu X, Shim J, Lopes PE, Mittal J, Feig M, et al. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone ϕ, ψ and side-chain χ1 and χ2 dihedral angles. J Chem Theory Comput. 2012;8(9):3257–3273. doi: 10.1021/ct300400x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Hardy DJ, Wu Z, Phillips JC, Stone JE, Skeel RD, Schulten K. Multilevel summation method for electrostatic force evaluation. J Chem Theory Comput. 2015;11(2):766–779. doi: 10.1021/ct5009075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Hashem Y, des Georges A, Dhote V, Langlois R, Liao HY, Grassucci RA, et al. Structure of the mammalian ribosomal 43S preinitiation complex bound to the scanning factor DHX29. Cell. 2013;153(5):1108–1119. doi: 10.1016/j.cell.2013.04.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Martin-Marcos P, Nanda J, Luna RE, Wagner G, Lorsch JR, Hinnebusch AG. β-Hairpin loop of eukaryotic initiation factor 1 (eIF1) mediates 40 S ribosome binding to regulate initiator tRNAMet recruitment and accuracy of AUG selection in vivo. J Biol Chem. 2013;288(38):27546–27562. doi: 10.1074/jbc.M113.498642 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Sugita Y, Okamoto Y. Replica-exchange molecular dynamics method for protein folding. Chem Phys Lett. 1999;314(1–2):141–151. [Google Scholar]
- 37. Hamada M. Fighting against uncertainty: an essential issue in bioinformatics. Brief Bioinformatics. 2014;15(5):748–767. doi: 10.1093/bib/bbt038 [DOI] [PubMed] [Google Scholar]
- 38. Waldispühl J, Devadas S, Berger B, Clote P. Efficient algorithms for probing the RNA mutation landscape. PLoS Comput Biol. 2008;4(8):e1000124. doi: 10.1371/journal.pcbi.1000124 [DOI] [PMC free article] [PubMed] [Google Scholar]