Summary
In higher eukaryotes, the mRNA sequence in the direct vicinity of the start codon, called the Kozak sequence (CRCCaugG, where R is a purine), is known to influence the rate of the initiation process. However, the molecular basis underlying its role remains poorly understood. Here, we present the cryoelectron microscopy (cryo-EM) structures of mammalian late-stage 48S initiation complexes (LS48S ICs) in the presence of two different native mRNA sequences, β-globin and histone 4, at overall resolution of 3 and 3.5 Å, respectively. Our high-resolution structures unravel key interactions from the mRNA to eukaryotic initiation factors (eIFs): 1A, 2, 3, 18S rRNA, and several 40S ribosomal proteins. In addition, we are able to study the structural role of ABCE1 in the formation of native 48S ICs. Our results reveal a comprehensive map of ribosome/eIF-mRNA and ribosome/eIF-tRNA interactions and suggest the impact of mRNA sequence on the structure of the LS48S IC.
Graphical Abstract

Highlights
- 
•Cryo-EM structures of LS48S ICs unravel crucial mRNA sequence-dependent interactions 
- 
•After start-codon recognition, eIF1A interacts with the G(+4) position in β-globin mRNA 
- 
•t6A(37) modification mediates tRNAiMet binding to the C(−1) mRNA position 
- 
•The binding of ABCE1 does not affect the conformation of the LS48S IC 
Simonetti et al. present a high-resolution snapshot of the architecture of mammalian late-stage translation initiation complexes prepared in near native conditions. They provide structural insights into the Kozak sequence interactions of two different archetype mRNA sequences with the ribosome during translation initiation.
Introduction
mRNA translation initiation in mammals is more complex than its bacterial counterpart. Indeed, it involves more steps, more initiation factors, and more regulation pathways. One can summarize the overall process in four steps, starting with pre-initiation. During pre-initiation, the ternary complex (TC) is formed by the binding of heterotrimeric eukaryotic initiation factor 2 (eIF2) to one molecule of guanosine triphosphate (GTP) and the initiator methionylated tRNA (tRNAiMet). The TC then binds to the post-recycled ribosomal small subunit (SSU), also called the 40S subunit. TC recruitment is partially mediated by eIFs attached to the 40S, eIF1, eIF1A, and 13-subunit eIF3 complex. This leads to the formation of the 43S pre-initiation complex (PIC). The architecture of the 43S PIC has been investigated structurally at low to intermediate resolutions (Aylett et al., 2015, Erzberger et al., 2014, Hashem et al., 2013).
The second step consists of the recruitment of 5′ capped mRNA and leads to the formation of the 48S initiation complex (IC). This step is mediated by the cap-binding complex, composed of eIF4F, eIF4A, and eIF4B (Gross et al., 2003, Jackson et al., 2010, Marintchev et al., 2009, Rogers et al., 2001).
The third step is the scanning process for the start codon (AUG) in the 5′-to-3′ direction. This step was first investigated structurally in yeast (Hussain et al., 2014, Llácer et al., 2015) using in vitro reconstituted complexes. Upon start-codon recognition, the codon:anticodon duplex is formed between the mRNA and the tRNAiMet, aided by the eIF1A N-terminal tail (NTT) (Hinnebusch, 2011, Llácer et al., 2018, Lomakin and Steitz, 2013). The GTP is hydrolyzed by eIF2γ, eIF1 dissociates from the P site along with eIF1A C-terminal tail (Zhang et al., 2015), and the N-terminal domain (NTD) of eIF5 takes their place on the 40S (Llácer et al., 2018), before the eIF5 NTD dissociates in turn at a stage that remains to be elucidated. This results in the formation of the LS48S IC (which we describe in this work). The arrest of scanning and a cascade of structural rearrangements lead to the sequential dissociation of most eIFs upon the release of inorganic phosphate (Pi), generated from the GTP hydrolysis. eIF3 stays attached to the remaining complex, probably through its peripheral subunits, and leaves at a later stage during early elongation cycles (Beznosková et al., 2013, Beznosková et al., 2015). During all these steps, the post-recycling factor ABCE1 can bind directly to the 40S and act as an anti-ribosomal subunit association factor (Heuer et al., 2017, Kiosze-Becker et al., 2016, Mancera-Martínez et al., 2017).
In the fourth and final step, ABCE1 is replaced by the GTPase eIF5B on helix 14 of 18S rRNA, thus stimulating the joining of the 40S and 60S ribosomal subunits, forming an 80S complex (Fernandez et al., 2013), and eIF1A and eIF5B are released together (Fringer et al., 2007).
The sequences flanking the AUG start-codon region have been identified as crucial for start site selection by the IC (Kozak, 1986, Kozak, 1987a, Kozak, 1987b, Kozak, 1989). The optimal sequence for translation initiation in eukaryotes was named after Marylin Kozak, who first defined the optimal sequence in vertebrates as CRCCaugG, where R stands for a purine (Kozak, 1984, Kozak, 1989). In this motif, modification of certain positions has influence on translation efficiency, such as (−3) and (+4) (Kozak, 1984). As a result, a sequence can be dubbed “strong” or “weak” by considering those positions. It was further shown that the substitution of A(−3) for pyrimidine, or mutations of the highly conserved G(+4), leads to a process known as “leaky scanning,” with bypass of the first AUG and initiation of translation at the downstream start codon (Kozak, 1986, Kozak, 1989, Lin et al., 1993). More recent studies observed a more extreme case of sequence-dependent translation initiation regulation, dubbed “cap assisted,” for certain cellular mRNAs, such as those encoding histone proteins and in particular histone 4 (H4) mRNA (Martin et al., 2011, Martin et al., 2016). Cap-assisted internal initiation of H4 mRNA implies a very minimalistic scanning mechanism, which is possible thanks to the presence of a tertiary structure on the mRNA at the channel entrance. This element assists in placing the start codon very close to the P site almost immediately upon its recruitment through the cap-binding complex.
Despite the tremendous recent advances in understanding this phase of translation, high-resolution structural studies of the initiation process have been conducted by in vitro reconstitution of the related complexes. This approach often requires biologically irrelevant molar ratios of the studied eIFs (Aylett et al., 2015, Erzberger et al., 2014, Hashem et al., 2013, Hussain et al., 2014, Llácer et al., 2015), thus limiting insight into more subtle regulatory pathways. Moreover, the structures of the mammalian (pre)ICs are still at intermediate resolutions, approximating 6 Å (Eliseev et al., 2018, des Georges et al., 2015, Simonetti et al., 2016). Finally, although the role of ABCE1 was experimentally demonstrated as a ribosomal subunit anti-association factor preventing premature binding of the 60S (Heuer et al., 2017), its impact on the 48S complex formation and conformation is still unclear.
Here, we present cryoelectron microscopy (cryo-EM) structures of LS48S ICs formed after recognition of the start codon on two different native and abundant cellular mRNAs, β-globin and H4, presenting variants of the Kozak sequence. Both complexes were prepared and isolated in near native conditions from rabbit reticulocyte lysate (RRL) (Figure 1A). Although initiation regulation may differ mechanistically between these two archetype mRNAs, our structures provide a high-resolution snapshot of the Kozak sequence-dependent variable interactions in the LS48S IC in mammals.
Figure 1.
Overall Structure of β-Globin and H4 Late-Stage 48S Initiation Complexes
(A) mRNA sequences used to form and purify the β-globin and the H4 ICs. Only the sequences near the AUG codon are represented, and main differences in the Kozak sequence are indicated in bold.
(B) Semiquantitative mass spectrometry analysis of the eIFs in both ICs, indicating the abundance of each eIF on the basis of the spectra count normalized. The two rounds of normalization were carried out using the total number of eIFs and estimated number of trypsin cleavage sites (see STAR Methods). The normalized spectra counts (NSCs) are presented as heatmaps, with cold colors indicating low abundance and warm colors indicating high abundance. The higher abundance of eIF2 proteins might be due to the excess of a free TC in the sample. The black star points out a large number of NSCs for eIF2Bδ, which is caused by the detection of three different isoforms of this protein. Small stars indicate the values of the coefficients of variation calculated for each NSC. In the analysis, the NSC for ABCE1 is not included, as it is a factor present also in other stages of translation than initiation.
(C and D) Segmented cryo-EM reconstructions of the β-globin IC seen from (C) solvent, beak and (D) platform sides, respectively. The reconstruction shows 40S (yellow), eIF2γ (orange), eIF2α (purple), tRNAiMet (magenta), mRNA (red), eIF1A (sky blue), and ABCE1 (green).
(E and F) Same as (C) and (D) but for the H4 IC. Boxed blowups represent the codon:anticodon duplexes in all shown reconstructions with their respective atomic models fitted in the corresponding electron densities.
Results
Overall Structure of the Mammalian 48S Initiation Complex
The complexes were prepared using a modified version of our approach (Simonetti et al., 2016; see STAR Methods) that consists of stalling the LS48S IC in RRL using GMP-PNP (a non-hydrolyzable analog of GTP) on the two target cellular mRNAs: mouse histone H4 mRNA (suboptimal Kozak) and human β-globin mRNA (stronger Kozak) (Figure 1A). These mRNAs were transcribed and capped from BC007075 cDNA (GenBank: BC007075.1 for β-globin) and X13235 cDNA (GenBank: X13235.1 for H4). The advantage of this approach is the ability to prepare ICs bound on different mRNAs of interest directly in nuclease-treated cell extract in the absence of the endogenous mRNAs, allowing a study of regulatory aspects of the process in natural abundance levels of native eIFs at physiological molar ratios.
The composition of both complexes was investigated using mass spectrometry (MS) (Figure 1B). Our analysis reveals the incorporation in both complexes of all eIFs expected to be present after start-codon recognition (eIF1A, eIF2α, eIF2β, eIF2γ, eIF3 complex, and ABCE1). As expected, extremely poor numbers of peptides and spectra for eIF1 were detected in either complex, corroborating that our complexes are at a late stage after start-codon recognition and eIF1 dissociation.
In parallel, we subjected our prepared complexes to structural analysis using cryo-EM. The structure of the β-globin LS48S IC (3.0 Å, 29% of the total number of 40S particles; Figures 1C, 1D, and S1A–S1H) shows mRNA, 40S, eIF1A, TC, and ABCE1. For the H4 mRNA 48S complex (3.5 Å, 6.5% of the total number of particles; Figures 1E, 1F, and S1P–S1W), the main reconstruction shows mRNA, 40S, and the TC. We attribute the lower percentage of H4 LS48S IC formation to contamination with 60S subunits (∼30%) (Figure S1W).
Interestingly, both our cryo-EM structures and MS/MS analysis show that the H4 LS48S IC displays a significant reduction in the presence of eIF1A, leaving only residual density for its presence in the cryo-EM reconstruction (Figures 1E and S2A). Similar observation can be made for ABCE1 in the H4 LS48S IC. Our reconstructions show also another class of IC with eIF3 that is described later.
Accommodation of the Start Codon in the Late-Stage LS48S IC
In both our reconstructions, the codon:anticodon duplex is clearly formed, characterizing the cognate start-codon recognition (Figures 1C, 1E, 2A, and 2B). AUG codons of both mRNAs face the (34)CAU(36) of anticodon stem-loop (ASL) tRNAiMet, within hydrogen-bonding distances (∼2.7 Å). In the case of β-globin mRNA, the codon:anticodon interaction is stabilized further by the NTT of eIF1A (Lys7 interacts with the ribose of G[+3] from mRNA; Figure 2D). The tail also interacts with the tRNAiMet A(35) between Gly8 and Gly9. With few exceptions, this eIF1A NTT is highly conserved among eukaryotes (Figure 2D). Recent fluorescence anisotropy with yeast reconstituted PICs (Llácer et al., 2018) demonstrated that eIF1A binds with lesser affinity to a near cognate start codon (UUG) compared with a cognate AUG. Along the same lines, only very residual density for eIF1A can be observed in the H4 LS48S IC structure (Figure S2A) (discussed below), which reflects its weaker binding affinity after the start-codon recognition at this late stage to the 48S complex.
Figure 2.
Key Interactions Surrounding the Start-Codon Recognition Sites in β-Globin and H4 LS48S ICs
(A) Ribbon representation of the atomic model of β-globin LS48S IC viewed from the intersubunit side.
(B) Codon:anticodon base-pairing view in both mRNA complexes; left: β-globin; right: H4.
(C) eIF1A (sky blue) interaction with the mRNA in the β-globin IC (left panel), compared with the corresponding region in the H4 IC, which is mostly free of eIF1A (right panel).
(D) Close-up of the eIF1A N-terminal tail (colored in cyan) showing its intricate interactions with tRNA and mRNA; stacking of C1696 on tip of tRNAiMet. The nucleotides involved in the interactions are colored green.
(E) Interaction network of the tRNAi with ribosomal proteins uS13 and uS19 (colored salmon). Residues involved in the interactions are colored cyan in uS13 and uS19 and green in the tRNAi. For eIF1A, uS13, and uS19, sequence alignments of the concerned interacting regions from eight representative eukaryotic species are shown below the panels in black boxes and the described residues are indicated by colored frames.
Hs, Homo sapiens; Mm, Mus musculus; Dr, Danio rerio; Dm, Drosophila melanogaster; Ce, Caenorhabditis elegans; Nc, Neurospora crassa; Sc, Saccharomyces cerevisiae; At, Arabidopsis thaliana.
C1696 of 18S rRNA is stacked on the C(34) base at the very tip of the tRNAiMet ASL that is paired to G(+3) of both β-globin and H4 mRNA (Figure 2D, shown only for β-globin complex). This contact between C1696 and C(34) is also found in the yeast partial 48S PIC (py48S IC) (Hussain et al., 2014), and it occurs even in the absence of any mRNA (des Georges et al., 2015). This stacking interaction may partly explain the difference in recruitment of initiator tRNA between bacteria and eukaryotes. In bacteria, the initiator tRNA is recruited directly at the P site-accommodated start codon, whereas in eukaryotes, the tRNAiMet is recruited at the pre-initiation stage of the complex, before the attachment of mRNA into its channel. The tRNAiMet ASL also interacts with the C-terminal tails of 40S ribosomal head protein uS19 (Figures 2E and S3B) through its Arg140 that contacts A(35).
We then compared the overall conformation of the 40S between both complexes, and we observed that in the β-globin IC, the head of the SSU is tilted downward by ∼2° and swiveled toward the solvent side by ∼3° compared with its counterpart in the H4 IC (Figure S2C). We attribute these subtle conformational changes to the dissociation of eIF1A in H4 LS48S IC, after the start-codon recognition, due to the loss of contacts between eIF1A and the 40S head.
Interaction Network of the Kozak Sequence (−4 to +4) with 40S and Initiation Factors
The (+4) position, occupied mainly by a G in eukaryotic mRNAs, plays a pivotal role (Kozak, 1984, Kozak, 1986). Our reconstructions demonstrate the structural importance of this position to both mRNAs. In the β-globin LS48S IC, the highly conserved Trp70 from eIF1A is trapped between the mRNA G(+4) position and the A1819 from h44 18S rRNA of the A site by stacking interactions (Figure 2C). Interestingly, the interaction of the (+4) mRNA position with h44 was shown by cross-linking studies (Pisarev et al., 2006). Our β-globin LS48S IC structure also shows the proximity of uS19 C-terminal tail to the (+4) mRNA position, which can also be corroborated by several cross-linking studies (Bulygin et al., 2005, Pisarev et al., 2006, Pisarev et al., 2008). In H4 mRNA a U is at position (+4), so the stacking interaction with eIF1A appears weaker than when a G is present. Moreover, nucleotides A1818 and A1819 have even more scant densities, indicating their undetermined conformations, probably linked to this poor stacking (Figure 2C). Our reconstructions therefore suggest the structural importance of the (+4) position in the interaction with eIF1A.
Another crucial position in the Kozak consensus sequence is at (−3), often occupied by an adenine (Figure 1A). This nucleotide in both complexes shows several contacts with ribosomal proteins and initiation factors, including salt bridge interaction between A(−3) base and a side chain of Arg55 from domain 1 (D1) of eIF2α (Figure 3A), which was reported previously in the py48S IC structure (Hussain et al., 2014, Llácer et al., 2018). However, in the yeast structure, the A(−3) base is in the syn conformation, and in both our mammalian ICs, the adenine is in the anti conformation. Noteworthy, the near cognate yeast mRNA present in the py48S IC structure (Llácer et al., 2018) contains adenines at positions (−1) and (−2), which in principle could create an ideal stacking context for the A base in (−3), thus explaining this difference in conformation compared with our mRNAs, where these positions are occupied by two cytosines. The (−3) position further interacts with the G957 nucleotide at the 40S platform (Figure 3A), highlighted in earlier studies (Demeshkina et al., 2000). In addition, cross-linking studies of reconstituted mammalian PIC previously demonstrated that eIF2α and uS7 interact with the (−3) nucleotide and uS7 with the (−4) nucleotide (Pisarev et al., 2006, Pisarev et al., 2008). The interaction of uS7 through its β-hairpin was also suggested in the py48S IC structure, because of their proximity in space (Hussain et al., 2014, Llácer et al., 2018). However, in our structures, this interaction cannot be confirmed, as the electron density at this specific region is very disperse, probably because of the flexibility of this part of uS7 (see Discussion; Figure S5A) compared with its other parts.
Figure 3.
Kozak and beyond Kozak Interaction Networks in β-Globin and H4 LS48S ICs
(A) Close-up of the interactions of upstream start-codon nucleotides top-viewed from the head side in the β-globin (left panel) and H4 (right panel) ICs with ribosomal proteins eS26 and uS11, as well as eIF2α D1 domain, tRNAi, and 18S rRNA. mRNA (−4) position contact with His80 of eS26 is highlighted with dashed-line circle. The distances between atom N1 of His80 eS26 and amine groups of A(−4) (β-globin) and C(−4) (H4) are 3.7 and 3.2 Å, respectively. G1203 and G957 of 18S rRNA stacking and interaction with C(−1) and A(−3), respectively, of both mRNAs are shown.
(B) mRNA entry channel seen from the beak side with close-up of the interactions with uS3, eS30, and h16 of 18S rRNA.
(C) mRNA exit channel seen from the solvent side with close-up on the mRNA contacts with ribosomal proteins eS26 and eS28.
(B) and (C) show an example of β-globin LS48S IC. The nucleotides involved in the interactions are indicated in green and residues in cyan. Respective sequence alignments are shown in black boxes from eight representative eukaryotic species on the right of the figure panels.
Position (−4) of both mammalian mRNAs interacts with ribosomal protein eS26 through its His80. However, we have found that in the case of the β-globin mRNA, position (−4) is a cytosine and appears to interact mildly with eS26 His80 (Figure 3A, left panel), as its weak density suggests. Whereas when this position is an adenine, as in the H4 mRNA, a stronger stacking interaction occurs, which could further participate in stabilizing the mRNA in its channel (Figure 3A, right panel). Consequently, the mRNA in this latter case adopts a slightly different conformation. A possible result of this difference is the observed tighter interaction with eIF2α Arg55 residue from domain D1 (Figure 3A), as its density is better defined in H4 than in β-globin.
Finally, upstream residues near the start codon in our complexes are in contact with 18S rRNA including G1203 from the head rRNA, which interacts with the phosphate of A(+1) and stacks with C(−1) of both mRNAs (Figure 3A).
eIF1A Interaction with the β-Globin mRNA Sequence and the 18S rRNA
In addition to the aforementioned contact with the start codon and G(+4), eIF1A can potentially establish several interactions at more distal positions in the β-globin mRNA sequence, closer to the mRNA entrance channel. Indeed, Arg12, Lys67, and Lys68 in eIF1A are in close proximity to C(+7), G(+6), and U(+5) (Figure 4A). eIF1A NTT also interacts with the 18S rRNA (Gly9 and Lys10 with C1696 and Lys16 and Asn17 with C1327) (Figures 2D and 4A). Other contacts involve the loops of the eIF1A OB domain with the 40S near the A site (Figures 4B–4D); namely, Asn44 and Arg46 are in contact with A1817–A1819 and C1705 from h44 of 18S rRNA (Figure 4C); moreover, Lys64 and Arg62 contact G604 and C605 of h18 18S rRNA (Figure 4D). In addition, Arg82, Tyr84, and Gln85 of eIF1A contact Glu58, Leu91, and Gly56 of ribosomal protein uS12 (Figure 4D); finally, Asp83 is in contact with Arg82 of eS30 (Figure 4D). Altogether, the previously mentioned interactions might depend on the mRNA sequence, and perhaps they can have influence on the stability the cognate start-codon duplex with its anticodon by the NTT of eIF1A (residues Lys7, Gly8, Gly9, and K10).
Figure 4.
eIF1A Interactions in the A Site of β-Globin LS48S IC
(A) eIF1A (dark blue) N-terminal tail interactions with mRNA of downstream start-codon nucleotides and tRNAi. The nucleotides involved in the interactions are indicated in green and residues in cyan.
(B) eIF1A OB-domain interactions with mRNA and 40S.
(C) Close-up of interactions of eIF1A (dark blue) with h44 of 18S rRNA (nucleotides are colored green).
(D) Zoom in on eIF1A (dark blue) interactions with h18 of 18S rRNA (gold) and ribosomal proteins uS12 and eS30 (salmon). The nucleotides and residues of uS12 and eS30 involved in the interactions are indicated in green and eIF1A residues in cyan. Respective sequence alignments are shown in black boxes.
Of note, eIF1A NTT was shown to interact with eIF5 (Luna et al., 2012, Luna et al., 2013), but because of its clear involvement in the start codon:anticodon duplex, we suggest that this eIF5 interaction occurs during the pre-initiation phase and very shortly after the recruitment of the mRNA.
mRNA Interactions with the 48S beyond the Kozak Sequence
The mRNA density at distal positions from the Kozak sequence appears disperse when filtered to high resolution, suggesting overall flexibility at both the entrance and the exit of the channel (local resolution of ∼6 to ∼9 Å). Nevertheless, several contacts can be observed at the entrance and exit sites of the mRNA channel of the β-globin and H4 LS48S ICs. These interactions are common to both complexes and could be more site specific than they are sequence specific.
At the entrance of the mRNA channel during this late stage of the initiation process, the mRNA extensively interacts with conserved residues of the 40S ribosomal proteins uS3 and eS30 and with rRNA h16 in positions spanning from +10 to ∼+20 through ionic and hydrophobic interactions (Figure 3B). For instance, the conserved Arg117 of the head protein uS3 contacts the mRNA at the channel entrance. This residue was recently indicated as important for stabilizing the PIN closed state of the 48S in yeast IC (Llácer et al., 2018) and for the initiation accuracy in the presence of suboptimal Kozak sequence by in vivo assays in yeast (Dong et al., 2017). The contribution of this charged residue of uS3 contacting the mRNA is partially corroborated by cross-links in a previous study (Pisarev et al., 2008). More globally, charged amino acid residues from uS3 helix α (residues 117–128) are in close proximity to nucleotides from positions (+14) to (+18) (Figure 3B). Moreover, residues from a β-hairpin (residues 142–146) can potentially contact bases of the nucleotides C(+9) and C(+10) of the mRNA, forming hydrophobic and salt bridge interactions. For ribosomal protein eS30, Lys126 is in close distance to the bases of G(+12) and A(+13). The proximity of A(+13) of mRNA to Ala133 of uS5 can also be noted (Figure 3B).
On the other side at the mRNA exit channel, we can observe the exit of both β-globin and H4 mRNAs from their respective 48S ICs (Figure 3C). The 5′ untranslated region (5′ UTR) for β-globin mRNA is substantially longer than for H4 (50 and 9 nt, respectively) (Figure 1A; see STAR Methods). We compared the mRNA exit channels of both complexes below the ribosomal head protein RACK1, which unambiguously shows the expected larger 5′ UTR for β-globin LS48S IC compared with H4 (Figure S2B). We were able to spot several possible contacts of ribosomal proteins at the exit site with mRNA nucleotides in both LS48S IC, including eS28 (Arg66 with A[−5] and Arg67 with A[−7]) as well as eS26 (Ile41 with C[−8], Arg42 with A[−9], and Arg100 with both these nucleotides) (Figure 3C), in agreement with previous cross-linking results (Pisarev et al., 2006, Pisarev et al., 2008).
Interactions of 48S Initiation Complex with the tRNAiMet
The overall accommodation of the mammalian ASL resembles its yeast counterpart found in the PIN state (Hussain et al., 2014, Llácer et al., 2018). The 48S IC-tRNAiMet interaction network is summarized in Figure S3B. In both ICs, we can observe a density attached to A(37), in which we can model the threonylcarbamyol group forming a t6A modification (Figures 3A and 5A). This modification mediates the binding of t6A(37) to the 2′OH of C(−1) in the mRNA and therefore can further stabilize start-codon recognition. It is tempting to suggest that C(−1):t6A(37) interaction is required for efficient translation in mammals. This mRNA C(−1) position is conserved in higher eukaryotes, as revealed by quantitative sequence analysis (Grzegorski et al., 2014), and forms part of the Kozak sequence. Interestingly, the electron density for this modification is even stronger in the H4 mRNA complex than in β-globin, even at a lower resolution (3.5 Å). We therefore suspect that this interaction could be more important in the case of suboptimal Kozak sequences, where this modification could compensate for the loss of some interactions with the 48S IC, compared with a stronger Kozak mRNA. The same interaction in the case of A at (−1) position is not excluded, but its nature and conformation will be different.
Figure 5.
Initiator tRNA Anticodon Stem-Loop (ASL) Interactions with the LS48S IC
(A) Comparison of the tRNAi modified t6A(37) interaction with mRNA (−1) position in mammalian β-globin (light pink) and H4 (red) and yeast (Llácer et al., 2018) (dark purple) initiation complexes. The contact with (−1) mRNA position is labeled by black solid line, black dots, and gray solid line for β-globin, H4, and optimal yeast mRNA sequence, respectively. In dashed sky blue circle, comparison of the conformation of highly conserved A(−3) mRNA position: anti in mammalian IC and syn in yeast.
(B) Interaction of the modified m1acp3Ψ1244 of the 18S rRNA (colored green, overlapping bottom panel) with the C(34) of the tRNAi and ribosomal protein uS9 (overlapping top panel).
(C) Close-up of the interactions of C(32) and C(33) with Arg146 of uS9.
(D) Close-up of the interaction of the ASL cytosines from the conserved G-C base pairs with eIF2α domain D1. Respective sequence alignments are shown in black boxes and interacting residues in colored frames.
Despite the universal presence of the t6A hypermodified base in all organisms, and a crucial role in translation efficiency (Pollo-Oliveira and de Crécy-Lagard, 2019, Thiaville et al., 2016), it has only recently been shown that the modification directly contributes to AUG recognition accuracy. In the py48S-eIF5 IC structure at a resolution of 3.5 Å (Llácer et al., 2018), t6A of the initiator tRNAMet was suggested to enhance codon:anticodon base pairing by interacting with A(−1) and by a stacking on the downstream base pair involving A(+1). Thanks to our mammalian LS48S IC at 3.0 Å, we can clearly observe the threonylcarbamyol group in a different conformation, placing the carboxyl group within hydrogen-bonding distance (2.7 Å) of the 2′OH group of C(−1) (Figure 5A). However, this modification does not appear to stack over the downstream base pair as previously suggested. Of note, in yeast there is a preference for an A at position (−1) (Dvir et al., 2013, Kozak, 1986), while it is a C(−1) in mammals.
Furthermore, C(34), which is a part of the anticodon, is stabilized by the modified U1244 (U1248 in human) of helix31 of rRNA (m1acp3Ψ, 1-methyl-3-[3-amino-3-carboxypropyl] pseudouridine) (Maden, 1990, Taoka et al., 2018; Figure 5B), which was previously reported in the py48S-eIF5 (modified U1119) (Llácer et al., 2018). In addition, the neighboring nucleotides, C(32) and C(33), are in contact with C-terminal arginine Arg146 of ribosomal head protein uS9 (Figure 5C).
Aside from its role in codon:anticodon stabilization, uS19 and uS13 were found to contact other parts of the tRNAiMet through their highly conserved C-terminal tails (Figures 2E and S3B). Thr136 side chain of uS19 interacts with the guanine backbone of three conserved G-C base pairs in ASL that are crucial for stabilization of the IC in eukaryotes (Dong et al., 2008). Residues Thr145 and Gly147 of uS13 are in contact with the phosphate groups of U28 and G29 (Figure 2E). Consistent with previous reports (Hussain et al., 2014, Llácer et al., 2018), several residues from domain D1 of eIF2α (Arg57, Ser58, Asn60, and Lys61) interact with the cytosines backbones of three conserved G-C base pairs of the ASL (phosphate groups of C39–41) (Figures 5D and S3B).
ABCE1 Binding to the Initiation Complex Is NTP Dependent
ABCE1 (called Rli1 in yeast) is a conserved NTP-binding cassette ABC-type multi-domain protein that plays a role in translation initiation as well as translation termination and ribosome recycling (Becker et al., 2012, Preis et al., 2014, Khoshnevis et al., 2010, Pisarev et al., 2010, Shoemaker and Green, 2011, Young et al., 2015). It contains two nucleotide-binding domains (NBDs), where the two NTP molecules bind. Its N-terminal NBD contains two iron-sulfur clusters [4Fe-4S]2+ (Barthelme et al., 2007, Barthelme et al., 2011, Karcher et al., 2008). In our β-globin LS48S IC structure, the NTP-binding cassette of ABCE1 displays lower local resolution (between 3.5 and 5 Å; Figures 6A and S1D) compared with the average resolution, likely because of the flexibility of the NBDs. In H4 LS48S IC structure, we observe only a residual density of ABCE1 (Figure S1L), which can be caused by a slightly different conformation of h44 18S rRNA in the absence of eIF1A.
Figure 6.
Atomic Model of ABCE1 in the LS48S IC
(A and E) Ribbon representations of ABCE1 (green) in its electron density with NTP-binding pockets framed in blue (NBD1) and pink (NBD2) seen from different side views.
(B) Blowups on the NTP pockets NBD1 (left, blue) and NBD2 (right, pink). Although in the purification conditions GMP-PNP was used, ATP molecules were modeled in the electron densities obtained.
(C) Mixed ribbon-and-stick representation of Fe-S binding domain atomic model fitted into its electron density.
(D) Close-up of Fe-S binding domain interactions with 40S ribosomal protein uS12 and h44 of 18S rRNA.
(F) Close-up of NBD1 interactions with the 40S.
(G) Close-up on NBD2 interactions with the 40S. The nucleotides involved in the interactions are indicated in light green and protein residues in cyan. Respective sequence alignments are shown in black boxes.
(H) Comparison between the β-globin (pink surface) and β-globin•GMP-PNP+ATP (gray surface) LS48S IC reconstructions without (left superimposition) and with (right superimposition) eIF3. The panel shows that the addition of ATP triggers the dissociation of ABCE1, probably after its hydrolysis. No further conformational changes between both complexes, with and without ATP, can be detected.
The iron-sulfur (Fe-S) binding domain presents a higher local resolution than the NBDs (between 3 and 4 Å; Figures 6B and S1D). Therefore, we can clearly distinguish the Fe-S clusters in the cryo-EM density map as well as the presence of two bound nucleotides that probably represent GMP-PNP that was used to stall the ICs by blocking eIF2γ (Figures 6A–6C).
NBD1 contacts five nucleotides in the 18S rRNA helix14 (U478, A455, A454, C453, and C452) through residues located in a helix-loop-helix motif (Ser150) and in the hinge-N (Arg306 and Asn310) (Figures 6F, 6G, and S4A). NBD2 also makes contacts to nucleotides A455, A454, and C453 through the hinge-C residues Lys584 and Ile583, conserved in higher eukaryotes (Figures 6G and S4A). Residues Arg566 and Arg567 from NBD2 are in close proximity to the 18S rRNA and can potentially be involved in the interaction with the latter, as suggested by their conservation (Figure S4A). Moreover, NBD1 residues Pro265 and Asp266 interact with residues from the C-terminal helix of ribosomal protein eS24 such as Gly128 (Figure 6F).
As for the Fe-S binding domain, it interacts with the 18S rRNA (Arg7 from strand β1 to C471, Lys20 to G1718, Pro66 to A1719, and Asn74 to G470) (Figure 6D). In addition, ABCE1 interacts with ribosomal protein uS12 residues Ile50, Leu52, and Ile75 via a hydrophobic pocket formed by helices α2, α3, and cluster (II) (residues Pro30, Val31, Ile56, and Ile60) (Figure 6D). This is consistent with previous structural and cross-linking studies in yeast and archaeal complexes (Heuer et al., 2017, Kiosze-Becker et al., 2016, Nürenberg-Goloub et al., 2020).
In order to investigate the effect of ABCE1 binding on the structure and composition of the LS48S IC, we purified β-globin complexes using our choice strategy (Simonetti et al., 2016, and this work) supplemented with 10 mM of ATP, thus taking advantage of the ability of ABCE1 to hydrolyze ATP, in contrast to the obligate GTPase eIF2. The β-globin-ATP 48S IC was then analyzed using cryo-EM and yielded two main LS48S IC reconstructions at ∼14 and ∼10 Å that differ in the presence and absence of eIF3, respectively (Figures 6H, S1X, and S1Y). The addition of ATP most likely causes the replacement of the GMP-PNP molecules in the NBD pockets by ATP molecules that was then hydrolyzed by ABCE1. Our structures clearly reveal the dissociation of ABCE1 as a result of ATP addition, consistent with recent structural and biophysical studies (Gouridis et al., 2019, Heuer et al., 2017, Kiosze-Becker et al., 2016). Aside from the absence of ABCE1, the global structure of the β-globin-ATP LS48S IC is identical to its higher resolution counterpart without ATP (Figure 6H), thus very likely excluding a direct active role of ABCE1 in the assembly of the IC. It is reasonable to assume that in the cell, ABCE1 undergoes on/off cycles to the IC in an ATP-dependent manner, as we have previously suggested (Mancera-Martínez et al., 2017). However, these results do not contradict the demonstrated function for ABCE1 as an anti-ribosomal-subunit association factor.
eIF3 in the Late-Stage 48S Initiation Complex
Nearly 15% (∼5% of the total particle count) of the particles corresponding to the β-globin LS48S IC structures contain a density for eIF3 at the solvent side. After extensive particle sorting and refinement, a reconstruction of the β-globin LS48S IC showing eIF3 was obtained at a resolution of ∼3.6 Å (Figures S1I–S1O), thus allowing verification of the recognition of the start codon for this specific class (Figures 7A and 7B). The eIF3a and eIF3c subunits (i.e., those that bind directly to the 40S) are mostly resolved at a resolution ranging from 3.5 to 4.5 Å, enabling a model of the exact residues in interaction with the 40S subunit (Figures 7C–7G). We find that eIF3a in β-globin LS48S IC shows several contacts with 40S ribosomal body proteins, with eIF3a residues Asn10, Lys13, Arg14, and Phe18 interacting with eS1 residues Asp77, Asn76, Asp191, and Pro190, respectively (Figures 7F and 7G). In addition, eIF3c residues Asn388, Arg340, Asn384, Gly341, Lys343, and Arg450 contact the 40S ribosomal protein eS27 via residues Glu75, Thr61, Gln65, and Cys59 (Figure 7E). Moreover, residues Lys342, Lys343, Thr391, and Tyr392 interact with nucleotides G925, C1112, and U1116, the latter two being a part of the apical loop of expansion segment 7 (ES7).
Figure 7.
Cryo-EM Reconstruction of the eIF3-Containing Class of the β-Globin LS48S IC
(A) Segmented map showing electron density of the eIF3 core (rose) attached to the 48S viewed from the platform side.
(B) Codon:anticodon base-pairing view in LS48S-eIF3 ICs (identical to both β-globin and H4).
(C) Ribbon representation of the atomic model of LS48S-eIF3 IC seen from the platform side.
(D) Blowup of the mRNA channel exit, seen from the platform side. mRNA 5′ UTR cannot be modeled because of the low local resolution of the cryo-EM reconstruction in this region, so we propose an extrapolation of the mRNA 5′ UTR trajectory showing possible interactions with eIF3d and eIF3a subunits (residues colored cyan).
(E) Close-up of the eIF3c (navy blue) interaction with 40S: h26 (ES7) of 18S rRNA and eS27.
(F) Close-up of the eIF3a (coral) interaction with eS1.
(G) Mixed ribbon-and-stick representation of eIF3 core interactions with 40S. The nucleotides involved in the interactions are indicated in chartreuse and protein residues in cyan. Respective sequence alignments are shown in black boxes, with interacting residues highlighted by colored frames.
(H) Summary of the mRNA interactions with mammalian LS48S IC. The ribosomal proteins are colored orange and 18S rRNA elements in yellow. The mRNA contacts critical for recognition of optimal or suboptimal Kozak context are highlighted by gray frames.
The eIF3d subunit structure is at lower resolution compared with the eIF3 octamer core. Nevertheless, secondary structure elements can clearly be depicted when filtered to a lower resolution (6 Å), which enabled the fitting of its partial crystal structure in our density (Lee et al., 2016). The modeled eIF3d displays contacts with several ribosomal head proteins: helix α12 contacts the N-terminal loop of uS7, the loop between β9 and β10 contacts RACK1 (loop located between strands β6D and β7A), and β strand loops and the “RNA gate” insertion contacts eS28 (strand β3, loops between helices α8 and α9).
Despite the low local resolution of this particular subunit, our structure provides clues to the demonstrated interactions of eIF3d and eIF3a with the mRNA 5′ UTR (Figure 7D). Thus, shown by the residual electron density traces, the 5′ UTR of numerous mRNAs such as β-globin can possibly interact with different parts of eIF3d: N-terminal loop and strand β2 (residues S166–E172) and a loop between β11 and β12 (residues Asn513 and Lys514) (Figure 7D). These residues of eIF3d are better conserved in higher eukaryotes, indicating a possible species-specific regulation. An interaction with 5′ RNA terminus recognition motif was also previously reported (Lee et al., 2016). More insight into the interaction patterns of eIF3d with different mRNA 5′ UTRs in the context of translation initiation will be an important goal in future studies.
Finally, the mRNA 5′ UTR of β-globin can also interact with eIF3a (residues Gln6, Arg7, Arg41, Gln44, and Lys45) (Figure 7D). Similarly to eIF3d, these residues are conserved mainly among higher eukaryotes (Figure 7G). It was previously shown that the eIF3a-PCI domain (a domain with a common fold for proteasome, COP9, initiation factor 3) is critical for stabilizing mRNA binding at the exit channel (Aitken et al., 2016). However, because of the low local resolution of the β-globin mRNA 5′ UTR in our reconstruction, we do not exclude other patterns of interaction. H4 LS48S IC also shows the residual presence of eIF3 core, but particle sorting reveals a reconstruction containing eIF3 (Figure S1W) at only intermediate resolution because of the small number of particles.
Discussion
Our cryo-EM structures reveal in detail the accommodation of two native mRNA sequences encoding either β-globin or H4 in the context of the late-stage mammalian IC. In the presented IC structures, we did not identify any density corresponding to eIF1. Combined with codon:anticodon complex formation and several conformational changes characteristic of the stage after the start-codon recognition, we have dubbed our complexes “late-stage 48S IC” (LS48 IC). Initiation on β-globin and H4 mRNA may undergo different regulatory processes, as previously reported (Martin et al., 2011, Martin et al., 2016), but in our structures we analyze only the mRNA nucleotide interactions of the Kozak sequences (Figure 7H; Table S2), without dwelling on the exact regulation mechanism that may be in part influenced by the different interaction patterns that we observe. We therefore believe that our two archetype mRNA sequences are representative of native cellular mRNAs incorporating different Kozak sequences, as their observed interactions are purely the result of sequence differences and unrelated to specific regulatory pathways.
In the Kozak sequence, position (+4) appears to play a role in both mRNAs, where it is a G in β-globin and a U in H4. At this position, the crucial interaction with Trp70 of eIF1A appears to be weaker in the case of H4 IC compared with β-globin IC, as indicated by our MS/MS normalized spectral counts and the cryo-EM reconstructions. We suggest that the poor abundance of eIF1A in the H4 LS48S IC cryo-EM reconstruction (Figures 1, S1L, and S2A) might not suggest a negligible role for this initiation factor in suboptimal Kozak consensus mRNAs during scanning. Rather, it simply shows its weaker interaction in the complex after start-codon recognition, supported by our semiquantitative MS/MS analysis. As expected, the biggest decrease in spectral counts was observed for eIF1A in the H4 LS48S IC compared with its β-globin counterpart (Figure 1B). The weak aforementioned stacking interaction with mRNA purine at (+4) position is likely to be the reason of the affinity drop and subsequent weaker interaction between eIF1A and the 48S in the case of H4, after scanning and the recognition of the start codon. This observation is consistent with the suggested special translation initiation mechanism for H4 (Martin et al., 2011, Martin et al., 2016). It was suggested that H4 mRNA undergoes an unconventional “tethering mechanism,” whereby the ribosomes are tethered directly on the start codon without scanning (Martin et al., 2011). This particular mechanism was proposed to occur thanks to the presence of two secondary structure elements present downstream of the start codon in H4 mRNA (position +19), contacting h16 of rRNA (Martin et al., 2016). In this region, our H4 LS48S IC structure shows disperse densities that cannot be interpreted. We believe that eIF1A is present in the H4 complex during pre-initiation and the short scanning process, and only after recognition of the start codon can the affinity be affected by the mRNA Kozak context.
Regarding the NTT of eIF1A, it is present in the A site starting from the scanning process, as shown by our structure and also yeast 43S PIC and closed-48S ICs (Llácer et al., 2015, Llácer et al., 2018). It is important to emphasize the binding of eIF1A on the H4 IC at a certain stage, because we can observe residual electron density for this initiation factor in the H4 LS48S IC (Figures S1L andS2A). This observation tends to validate the unconventional very short scanning mechanism proposed for H4 mRNA (Martin et al., 2011), yielding in the faster accommodation of the start codon compared with β-globin mRNA. Nevertheless, in the early initiation steps, eIF1A may interact with eIF5, as demonstrated by biophysical studies of in vitro purified proteins (Luna et al., 2013).
The importance of positions (+4) and (−3) of mRNA has been pointed out in previous studies (reviewed by Kozak, 1989), but the significant involvement of position (−4) in mammals was not highlighted. Similarly to py48S-eIF5 IC (Llácer et al., 2018), our structures show that when this position is A, it can be stabilized by residue His80 of eS26. This residue is highly conserved in eukaryotes (Figure 3A), therefore showing a universal mode of interaction. The role of eS26 (and eS28) in the accommodation of the 5′ UTR was also highlighted by chemical cross-linking studies performed with 80S ribosome assembled on H4 mRNA (Martin et al., 2016). Remarkably, in the case of the yeast IC, it was shown that the interaction between the A nucleotide at the position (−4) and His80 of eS26 does occur through stacking (Llácer et al., 2018), in contrast to our study where we can show C and mainly A (−4) stacked below the His80 of eS26. This variation is probably due to the different sequence between our mRNA and those of yeast, where positions −1 to −3 are occupied by A, which favors more stacking between the nucleotides bases and consequently more twist (Figure 5A). Indeed, the kink in E/P site in py48S-eIF5 IC (Llácer et al., 2018) is sharper than in the case of mammalian 48S IC (Figure S3A). Interestingly, a previous cross-linking study in the context of the human 80S ribosome highlighted the eS26 binding to G and U nucleotides in region (−4) to (−9) of the mRNA (Graifer et al., 2004). In a more recent biochemical study in yeast, it was shown that the mRNAs bound to the ribosomes depleted in eS26 (RpS26 in Saccharomyces) translate poorly compared with those enriched in eS26 (Ferretti et al., 2017). It was also reported in the same study that RpS26 is necessary for preferential translation of mRNAs with A at position (−4) and not G, showing that the interaction is very specific and not simply purine/pyrimidine dependent. Therefore, the His80 eS26 recognition is likely optimal for mRNA sequences containing A at this position. In yeast, however, the nucleotide context surrounding the start codon is less critical but shares with mammals the importance of the (−3) position (Cavener and Ray, 1991, Kozak, 1986). Further studies on translation of mRNAs containing mutations at these positions will help unveil the mechanism of scanning in mammals and will shed light on the leaky scanning mechanism.
The tails of uS13 and uS19 have been previously shown to make direct interactions with the ASL of the peptidyl-site-exit-site (P/E) tRNA in presence of elongation factor G-ribosome complex in a pre-translocation state in prokaryotes (Zhou et al., 2013). To the best of our knowledge, these proteins have not yet been reported to be particularly involved in the initiation process in eukaryotes. Nevertheless, our structures are supported by earlier cross-linking studies of human 80S ribosome showing that the tail of uS19 is located closer to the decoding site than that of prokaryotic S19 (Graifer et al., 2004). In the case of S. cerevisiae uS13 and uS19, the C-terminal parts are not conserved, compared with human protein homologs (Figure 2E), and they have never been observed to interact with the tRNAiMet (Hussain et al., 2014, Llácer et al., 2018). Other fungi, such as N. crassa, possess very similar sequences to mammalian counterparts and probably would demonstrate similar interactions to tRNAiMet, as shown by our structures. Moreover, and in contrast to yeast, N. crassa possesses a mammalian-like eIF3. This is in line with the recent genome-wide mapping of Kozak impact in the fungal kingdom, showing the particularity of start-codon sequence context in S. cerevisiae compared with other fungi (Wallace et al., 2019).
The ribosomal protein uS7 is located close in space to position (−3) of mRNA, but we cannot confirm its interaction with this nucleotide, because of the lack of the density for this part of the protein. Of note, this is the only flexible region in this protein structure (Figure S5A), containing the crucial β-hairpin in the case of bacterial and yeast initiation (Visweswaraiah and Hinnebusch, 2017, Wimberly et al., 1997). This region in yeast contains the glycine-stretch GGGG (residues 150–153), whereas in human it is GRAG (residues 129–132). Genetic experiments on single-point mutants of this β-hairpin demonstrated almost unchanged phenotype for human-like G151R and G152A mutations (Visweswaraiah et al., 2015), but the G151S mutation was lethal. More recent work showed by using genetic and biochemical approaches that uS7 modulates start-codon recognition by interacting with eIF2α domains in yeast (Visweswaraiah and Hinnebusch, 2017). The residues implicated in the described interactions are highly conserved and are also present in mammals (Figure S5B). Therefore, we speculate that the effect of the studied substitutions of uS7 might result in similar phenotypes in mammals.
In comparison with the py48S-eIF5N structure (Llácer et al., 2018), the electron density of eIF5 NTD was not observed in our complexes, although MS analysis revealed some residual presence of eIF5 only in the β-globin LS48S IC. This may suggest that the presented LS48S ICs were trapped between eIF1 dissociation and eIF5 NTD binding (which would represent the structure corresponding to the intermediate state between p48S-closed and p48S-5N, according to Llácer et al., 2018). Another reason might be the weaker affinity of eIF5 to the IC at this stage, as both of our complexes are purified directly from RRL without supplementation of any factors, which is overcome when the complex is assembled in vitro with higher molar ratios.
After the dissociation of the TC (GTP-driven binding), eIF5B is recruited to the IC at the exact binding site of ABCE1. As shown by Wang et al. (2019), the time between binding of eIF5B and association of 60S is very short (∼0.59 s), and because of its dynamic nature, we most likely would not be able to stall the 48S-eIF5B complex using our protocol. Indeed, only a small number of MS spectra for eIF5B was recorded.
Compared with the yeast 40S ribosome-ABCE1 post-splitting complex (Heuer et al., 2017), we do not observe any large structural differences. However, NTP pocket 1 appears to be more open than NTP pocket 2 (Figures 6A and 6E, colored frames), and it is similar to the “open state” found by X-ray crystallography (Karcher et al., 2008). A recent single molecule-based fluorescence resonance energy transfer (smFRET) study on archaeal ABCE1 showed that two NTP sites are in asymmetric dynamic equilibrium and both NTP sites can exist in different conformations (Gouridis et al., 2019). Therefore, we propose that in LS48S IC, the ABCE1 is present in its asymmetric conformer, where NTP pocket 1 is in the open state, whereas pocket 2 is in the closed state. The position of ABCE1 in the IC suggests steric incompatibility with the human re-initiation factor eIF2D (Weisser et al., 2017). Indeed, the winged helix (WH) domain of eIF2D was found to interact with the central part of h44 ribosomal RNA in the absence of ABCE1, at the exact position of the ABCE1 Fe-S cluster (I) in our LS48S IC (Figure S4B). This cluster also shows sequence similarity to the C-terminal SUI domain of eIF2D, found to be located in the re-initiation complex at the top of h44 rRNA (Figure 4A; Weisser et al., 2017).
Regarding the relatively low abundance of eIF3 in our complexes, we believe that after the LS48S complex formation, eIF3 simply detaches from the 40S, probably during grid preparation, as has been consistently observed in structural studies of analogous complexes. The superposition of the eIF3 octamer to our previous structure of the in vitro reconstituted 43S PIC (des Georges et al., 2015) showed high structural similarity (RMSD 1.3 Å over all atoms). Consistent with previous study (des Georges et al., 2015), our structure reveals a sizable unassigned density at the mRNA channel exit, interacting mainly with eIF3a and eIF3c (Figures S6B and S6C). Because of its location, it is tempting to attribute this unassigned density to the 5′ UTR of mRNA, but its presence in 43S PIC (des Georges et al., 2015), which does not contain any mRNA, strongly contradicts this assignment. Thus, following our previous suggestion, we believe that this density belongs mainly to flexible segments of eIF3d. Finally, the eIF3 b-i-g module is not visible in our structure, but in the case of py48S-eIF5 IC structure (Llácer et al., 2018), it was demonstrated that these subunits relocate together to the solvent site upon start-codon recognition.
Conclusions
Our cryo-EM structure at 3.0 Å represents the highest resolution reconstruction of a mammalian translation IC to date. It refines our understanding of the architecture of late-stage IC and provides structural insights into the Kozak sequence role in canonical cap-dependent translation initiation. The data presented here demonstrate different interaction networks of the mRNA within the IC on the basis of its sequence. Furthermore, our results demonstrate that the binding of ABCE1 does not affect the conformation of the 48S IC. Finally, our structure reveals the molecular details of the mammalian eIF3 core interactions with the 40S at near atomic resolution.
STAR★Methods
Key Resources Table
| REAGENT or RESOURCE | SOURCE | IDENTIFIER | 
|---|---|---|
| Chemicals, Peptides, and Recombinant Proteins | ||
| Guanosine 5′-[β,γ-imido]triphosphate (GMP-PNP) | Merck | Car#148892-91 | 
| Protease inhibitor cocktail tablets | Roche | Cat#11873580001 | 
| RNasin® Ribonuclease Inhibitors | Promega | Cat#N251B | 
| Rabbit Reticulocyte Lysate nuclease-treated | Promega | Cat#L416A | 
| ScriptCap m7G Capping System | Epicenter | Cat#SCCE0625 | 
| Deposited Data | ||
| Structure of β-globin LS48S IC | This Paper | PDB ID: 6YAL | 
| Structure of β-globin LS48S+eIF3 IC | This Paper | PDB ID: 6YAM | 
| Structure of H4 LS48S IC | This Paper | PDB ID: 6YAN | 
| Cryo-electron microscopy map of β-globin LS48S IC | This Paper | EMDB ID: EMD-10760 | 
| Cryo-electron microscopy map of β-globin LS48S+ eIF3 IC | This Paper | EMDB ID: EMD-10761 | 
| Cryo-electron microscopy map of H4 LS48S IC | This Paper | EMDB ID: EMD-10762 | 
| Cryo-electron microscopy map of β-globin LS48S IC + ATP | This Paper | EMDB ID: EMD-10763 | 
| Cryo-electron microscopy map of β-globin LS48S+eIF3 IC + ATP | This Paper | EMDB ID: EMD-10764 | 
| Recombinant DNA | ||
| plasmid for β-globin mRNA | (Simonetti et al., 2016) | N/A | 
| plasmid for H4 mRNA | This Paper | N/A | 
| Software and Algorithms | ||
| SCIPION | (de la Rosa-Trevín et al., 2016) | http://scipion.i2pc.es | 
| Molecular Dynamic Flexible Fitting | (Trabuco et al., 2008) | https://www.ks.uiuc.edu/Research/mdff/ | 
| CTFFIND4 | (Rohou and Grigorieff, 2015) | https://grigoriefflab.janelia.org/ctf | 
| RELION | (Scheres, 2012) | https://www2.mrc-lmb.cam.ac.uk/groups/scheres/impact.html | 
| UCSF Chimera | (Pettersen et al., 2004). | https://www.cgl.ucsf.edu/chimera/ | 
| MotionCor | (Zheng et al., 2017) | https://emcore.ucsf.edu/ucsf-motioncor2 | 
| PHENIX 1.9.1692 | (Adams et al., 2010) | http://www.phenix-online.org/ | 
| Phenix.ERRASER | (Chou et al., 2016) | |
| Visual Molecular Dynamics | (Humphrey et al., 1996) | https://www.ks.uiuc.edu/Research/vmd/ | 
| Scalable Molecular Dynamics 2 | (Phillips et al., 2005) | http://www.phenix-online.org/ | 
Lead Contact and Materials Availability
Further information and requests for reagents should be sent to the Lead Contact, Yaser Hashem (yaser.hashem@u-bordeaux.fr). All unique/stable reagents generated in this study are available from the Lead Contact without restriction.
Method Details
In vitro transcription and capping
Human β-globin mRNA was prepared as previously described (Simonetti et al., 2016). The template for mouse H4–12 mRNA (375 nt; accession number X13235) was generated by PCR amplification extended on its 3′ end with a 5′-(CAA)9CAC-3′ tail from plasmid containing the gene synthetized by Proteogenix. The PCR product purification and in vitro transcription of mouse H4–12 mRNA were performed as described for the preparation of β-globin mRNA. The pure transcripts were capped using the ScriptCap m7G Capping System (Epicenter). Radiolabelled transcripts were obtained by substituting the GTP from the kit with [α32P]GTP.
Sample preparation for Cryo-EM
β-globin 48S IC and H4 48S IC were isolated from nuclease-treated rabbit reticulocyte lysate (RRL) (Promega L4960), as previously described (Simonetti et al., 2016) with the main difference that a lower concentration of Mg2+ was used for ribosome complex assembly as detailed below. Prior to complexes assembly, the reaction mix in a final volume of 83 μL has been prepared by adding 21 μL of the Amino Acid Mixture (for a final concentration of 0.13 mM) and 80 units of RNasin (Promega Ref. number N2511) to 60 μL of RRL. The mix was incubated at 30°C for 5 minutes to reactivate the ribosomes. In a final volume of 157 μl, complex assembly has been obtained by adding 13 μg of mRNA, guanylyl imidodiphosphate (GMP-PNP) to final concentration of 5 mM and Mg(OAc)2 to keep the final concentration of free Mg2+ at 0.5 mM. The reaction is incubated for further 5 minutes at 30°C. β-globin and H4 mRNA assembled complexes were separate on 5%–25% linear sucrose gradient (in buffer containing 25 mM HEPES-KOH [pH 7.6], 79 mM KOAc, 0.5 mM Mg(OAc)2, and 1 mM DTT) by centrifugation at 36,000 rpm in a SW41Ti rotor for 4.5 h at 4°C. Moreover, we have optimized the gradient fraction collection using BioComp Piston Gradient Fractionator devise. The formation of translation initiation complexes has been monitored following the ribosome profile via the UV absorbance (optical density [OD] at 260 nm) and the radioactivity profile of the 32P-labeled globin or H4 mRNA. Fractions containing β-globin/48S IC or H4/48S IC were centrifuged at 108,000 rpm (S140AT Sorvall-Hitachi rotor) for 1 h at 4°C and the ribosomal pellet was dissolved in a buffer containing 10 mM HEPES-KOH pH 7.4, 50 mM KOAc, 10 mM NH4Cl, 5 mM Mg(OAc)2 and 2 mM DTT.
Grids preparation and data collection parameters
The grids were prepared by applying 4 μL of each complex at ∼70 nM to 400 mesh holey carbon Quantifoil 2/2 grids (Quantifoil Micro Tools). The grids were blotted for 1.5 s at 4°C, 100% humidity, using waiting time 30 s, and blot force 4 (Vitrobot Mark IV). The data acquisitions were performed for the β-globin•GMP-PNP and H4•GMP-PNP ICs on a Titan Krios S-FEG instrument (FEI) operated at 300 kV acceleration voltage and at a nominal underfocus of Δz = ∼0.5 to ∼3.5 μm using the CMOS Summit K2 direct electron detector 4,096 × 4,096 camera and automated data collection with SerialEM (Mastronarde, 2003) at a nominal magnification of 59,000 x. The K2 camera was used at super-resolution mode and the output movies were binned twice resulting in a pixel size of 1.1Å at the specimen level (the calibrated magnification on the 6.35 μm pixel camera is 115,455 x). The camera was setup to collect 20 frames and frames 3 to 20 were aligned. Total collected dose is ∼26 e-/Å2. In addition, Cryo-EM images of the β-globin•GMP-PNP+ATP were collected on a Polara Tecnai F30 cryo-transmission electron microscope (FEI instruments) operated at 300 keV acceleration voltage and at a nominal underfocus of Δz = ∼0.5 to ∼4.0 μm, using a direct electron detector CMOS (Falcon I) 4,096 × 4,096 camera calibrated at a nominal magnification of 59,000 x, resulting in a pixel size of 1.815 Å.
Image processing
SCIPION (de la Rosa-Trevín et al., 2016) package was used for image processing and 3D reconstruction. MotionCor (Zheng et al., 2017) was used for the movie alignment of 8238 movies from the β-globin complex and 8520 movies for the H4 complex. CTFFIND4 (Rohou and Grigorieff, 2015) was used for the estimation of the contrast transfer function of an average image of the whole stack. Particles were selected in SCIPION. Approximately 1,067,000 particles were selected for the β-globin•GMP-PNP IC, 666,000 particles for the H4•GMP-PNP and 200,000 particles for the β-globin•GMP-PNP+ATP IC. RELION (Scheres, 2012) was used for particle sorting through 3D classification via SCIPION, (please refer to Figure S1 for particle sorting details for all three complexes). Selected classes were refined using RELION’s 3D autorefine and the final refined classes were then post-processed using the procedure implemented in RELION applied to the final maps for appropriate masking, B factor sharpening, and resolution validation to avoid over-fitting.
Model building, map fitting and refinement
The four initiation complexes were modeled based on the previous initiation Oryctolagus cuniculus complex (PDBID: 5K0Y) (Simonetti et al., 2016) resolved at 5.8 Å. Adjustments of RNA and proteins were done using the visualization and modeling software UCSF Chimera version 1.12 (build 41623) (Pettersen et al., 2004). Sequences of modeled factors from Oryctolagus cuniculus were retrieved using BLAST (Altschul et al., 1990) tools in the NCBI database (NCBI Resource Coordinators, 2017) using respective template sequence described below. Templates structures were extracted from the PDB (Berman et al., 2007). ABCE1 from Saccharomyces cerevisiae 40S complex (PDB ID: 5LL6 chain h) (Heuer et al., 2017) was used as template to thread ABCE1 of Oryctolagus cuniculus in Swiss-model (Biasini et al., 2014) webservice. The core of initiation factor 3 (eIF3) composed of subunits A, C, E, F, H, K, L and M was extracted from corresponding mammalian eIF3 (PDB ID: 5A5T) (des Georges et al., 2015) and further rmodelled using (Neupane et al., 2019). Initiation factor 3D (eIF3d) from Nasonia vitripennis (PDB ID: 5K4B) (Lee et al., 2016) was the template to thread eIF3d of Oryctolagus cuniculus in Swiss-model. Eukaryotic Initiation factor 1A (eIF1A) template was extracted from Saccharomyces cerevisiae 48S pre-initiation complex (PDB ID: 3JAP chain i) (Llácer et al., 2015) and thread into Oryctolagus cuniculus in Swiss-model. The ternary complex (TC) was affined from Oryctolagus cuniculus initiation complex (PDB ID: 5K0Y) (Simonetti et al., 2016). Both messenger RNAs (globin and H4) were modeled using modeling tools of Chimera. Refinements were done on all four complexes in their corresponding maps. The refinement workflow followed four major steps that applied to all initiation complexes. First, a Molecular Dynamic Flexible Fitting (MDFF) (Trabuco et al., 2008) ran for 200000 steps with gscale of 1 (potential given to the density map to attract atoms in their density). The trajectories reached a plateau of RMSD curve around frame 160 for the four complexes. A minimization followed the trajectories to relax the system. MDFF ran on VMD (Humphrey et al., 1996) 1.9.2 coupled with NAMD2 (Phillips et al., 2005) v.1.3. software. Next steps of refinement required the usage of several specialized tools for RNA and proteins geometry included as modules in PHENIX (Adams et al., 2010) version 1.13-2998-000 software. Phenix.ERRASER (Chou et al., 2016) is a specialized tool for RNA refinement and Phenix.real_space_refine is specialized for proteins geometry and density fitting refinement. Finally, a last step of minimization using VMD and NAMD2 was applied. Assessment and validation of our models were done by Molprobity (Chen et al., 2010) webservice. Validation statistics are in Table S1.
Mass spectrometry analysis and data post-processing
Protein extracts were precipitated overnight with 5 volumes of cold 0.1 M ammonium acetate in 100% methanol. Proteins were then digested with sequencing-grade trypsin (Promega, Fitchburg, MA, USA) as described previously (Khusainov et al., 2016). Each sample was further analyzed by nanoLC-MS/MS on a QExactive+ mass spectrometer coupled to an EASY-nanoLC-1000 (Thermo-Fisher Scientific, USA). Data were searched against the rabbit UniprotKB sub-database with a decoy strategy (UniprotKB release 2016-08-22, taxon 9986 Oryctolagus cuniculus, 23086 forward protein sequences). Peptides and proteins were identified with Mascot algorithm (version 2.5.1, Matrix Science, London, UK) and data were further imported into Proline v1.4 software (http://www.profiproteomics.fr/proline). Proteins were validated on Mascot pretty rank equal to 1, and 1% FDR on both peptide spectrum matches (PSM score) and protein sets (Protein Set score). The total number of MS/MS fragmentation spectra was used to quantify each protein from three independent biological replicates (Spectral Count relative quantification). Proline was further used to align the Spectral Count values across all samples. The average of three experiments was normalized in respect to the total number of spectral counts (NSC) for all initiation factors (eIFs) (2807 for β-globin and 2126 for H4). Therefore, the H4 results were multiplied by the normalization factor of 1.32. Then, the multiplicands for different eIFs were added and the results underwent a second normalization according to the number of trypsin sites (> 70% of probability) predicted by PeptideCutter provided by the ExPaSy server (https://web.expasy.org/peptide_cutter). The heatmaps were generated in respect to the NSC. The coefficients of variations were calculated as standard of deviation between the normalized NSC and are presented in percentage (see Figure 1B).
Alignments
The alignments of 8 eukaryotic species (Homo sapiens, Mus musculus, Danio rerio, Drosophila melanogaster, Ceanorhabditis elegans, Neurospora crassa, Saccharomyces cervisiae, Arabidopsis thaliana) shown in the figures, were done using Constraint-based Multiple Alignment Tool (COBALT) (Papadopoulos and Agarwala, 2007) from NCBI and visualized using BoxShade Server (ExPASy).
Quantification and Statistical Analysis
The total number of MS/MS fragmentation spectra was used to quantify each protein from three independent biological replicates (Spectral Count relative quantification). This is a widely used method to quantify a protein across several samples, based on the count of the total number of MS/MS spectra matching on a specific protein sequence. We have used UniProtKB rabbit database (taxon identifier 9986). The average of three experiments was normalized in respect to the total number of spectral counts (NSC) for all eukaryotic initiation factors (2807 for β-globin and 2126 for H4). Furthermore, the second normalization was according to the predicted number of trypsin cleavage sites (with a probability threshold 70%) by the PeptideCutter program (ExPASy server). The coefficients of variations were calculated as standard of deviation (SD) between the normalized NSC and divided by the average number of spectra. The results are presented in four different interval percentages in the heatmaps (see Figure 1B). The quantification details are also included in the legend of Figure 1 and Mass spectrometry analysis and data post-processing section of the Method details.
Data and Code Availability
The atomic coordinates of the β-globin LS48S IC, β-globin LS48S+eIF3 IC and H4 LS48S IC have been deposited in the Protein Data Bank (PDB). The accession numbers for the atomic models of the LS48ICc reported in this paper are: 6YAL, 6YAM and 6YAN, respectively. The cryo-EM maps of β-globin LS48S IC, β-globin LS48S+eIF3 IC, H4 LS48S IC, β-globin LS48S IC + ATP and β-globin LS48S+eIF3 IC + ATP have been deposited in the Electron Microscopy Data Bank (EMDB) with the accession codes: EMD-10760, EMD-10761, EMD-10762, EMD-10763 and EMD-10764, respectively.
Acknowledgments
We thank Gabor Papai and Julio Ortiz Espinoza (IGBMC, Strasbourg, France) for assistance in data acquisition and Franck Martin for providing the plasmid for β-globin mRNA. We also thank the High-Performance Computing Centre of the University of Strasbourg (funded by the Equipex Equip@Meso project) for information technology (IT) support and the staff of the proteomic platform of Strasbourg-Esplanade for conducting the nanoLC-MS/MS analysis (funded by Laboratoires d'Excellence [LABEX]: ANR-10-LABX-0036 NETRNA). The MS instrumentation was granted from Université de Strasbourg (IdEx 2015 Equipement mi-lourd). We thank Alan G. Hinnebusch for critical reading of the manuscript and Cameron Mackereth for numerous useful comments. We acknowledge Israel S. Fernández for providing us with the molecular model of eIF3 core. This work was supported by Agence Nationale de la Recherche (ANR) grants ANR-14-ACHN-0024 @RAction program “ANR CryoEM80S,” ANR-10-LABX-0036_NETRNA, and ERC-2017-STG #759120 “TransTryp” (to Y.H.).
Author Contributions
A.S. conducted sample preparation and optimization for cryo-EM study and MS analysis. Y.H. and A.S. performed cryo-EM experiments. Y.H., E.G., and A.S. interpreted the data. Y.H. and E.G. carried out data processing and structural analysis and wrote the manuscript with input from all authors. A.B. performed the atomic modeling. L.K. performed the MS experiments. All authors read and commented the manuscript. Y.H. directed the research.
Declaration of Interests
The authors declare no competing interests.
Published: April 7, 2020
Footnotes
Supplemental Information can be found online at https://doi.org/10.1016/j.celrep.2020.03.061.
Supplemental Information
References
- Adams P.D., Afonine P.V., Bunkóczi G., Chen V.B., Davis I.W., Echols N., Headd J.J., Hung L.-W., Kapral G.J., Grosse-Kunstleve R.W. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aitken C.E., Beznosková P., Vlčkova V., Chiu W.L., Zhou F., Valášek L.S., Hinnebusch A.G., Lorsch J.R. Eukaryotic translation initiation factor 3 plays distinct roles at the mRNA entry and exit channels of the ribosomal preinitiation complex. eLife. 2016;5:e20934. doi: 10.7554/eLife.20934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- Aylett C.H.S., Boehringer D., Erzberger J.P., Schaefer T., Ban N. Structure of a yeast 40S-eIF1-eIF1A-eIF3-eIF3j initiation complex. Nat. Struct. Mol. Biol. 2015;22:269–271. doi: 10.1038/nsmb.2963. [DOI] [PubMed] [Google Scholar]
- Barthelme D., Scheele U., Dinkelaker S., Janoschka A., Macmillan F., Albers S.V., Driessen A.J.M., Stagni M.S., Bill E., Meyer-Klaucke W. Structural organization of essential iron-sulfur clusters in the evolutionarily highly conserved ATP-binding cassette protein ABCE1. J. Biol. Chem. 2007;282:14598–14607. doi: 10.1074/jbc.M700825200. [DOI] [PubMed] [Google Scholar]
- Barthelme D., Dinkelaker S., Albers S.-V., Londei P., Ermler U., Tampé R. Ribosome recycling depends on a mechanistic link between the FeS cluster domain and a conformational switch of the twin-ATPase ABCE1. Proc. Natl. Acad. Sci. U S A. 2011;108:3228–3233. doi: 10.1073/pnas.1015953108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Becker T., Franckenberg S., Wickles S., Shoemaker C.J., Anger A.M., Armache J.P., Sieber H., Ungewickell C., Berninghausen O., Daberkow I. Structural basis of highly conserved ribosome recycling in eukaryotes and archaea. Nature. 2012;482:501–506. doi: 10.1038/nature10829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berman H., Henrick K., Nakamura H., Markley J.L. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 2007;35:D301–D303. doi: 10.1093/nar/gkl971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beznosková P., Cuchalová L., Wagner S., Shoemaker C.J., Gunišová S., von der Haar T., Valášek L.S. Translation initiation factors eIF3 and HCR1 control translation termination and stop codon read-through in yeast cells. PLoS Genet. 2013;9:e1003962. doi: 10.1371/journal.pgen.1003962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beznosková P., Wagner S., Jansen M.E., von der Haar T., Valášek L.S. Translation initiation factor eIF3 promotes programmed stop codon readthrough. Nucleic Acids Res. 2015;43:5099–5111. doi: 10.1093/nar/gkv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biasini M., Bienert S., Waterhouse A., Arnold K., Studer G., Schmidt T., Kiefer F., Gallo Cassarino T., Bertoni M., Bordoli L., Schwede T. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 2014;42:W252–W258. doi: 10.1093/nar/gku340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bulygin K., Chavatte L., Frolova L., Karpova G., Favre A. The first position of a codon placed in the A site of the human 80S ribosome contacts nucleotide C1696 of the 18S rRNA as well as proteins S2, S3, S3a, S30, and S15. Biochemistry. 2005;44:2153–2162. doi: 10.1021/bi0487802. [DOI] [PubMed] [Google Scholar]
- Cavener D.R., Ray S.C. Eukaryotic start and stop translation sites. Nucleic Acids Res. 1991;19:3185–3192. doi: 10.1093/nar/19.12.3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen V.B., Arendall W.B., 3rd, Headd J.J., Keedy D.A., Immormino R.M., Kapral G.J., Murray L.W., Richardson J.S., Richardson D.C. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chou F.C., Echols N., Terwilliger T.C., Das R. RNA structure refinement using the ERRASER-Phenix pipeline. Methods Mol. Biol. 2016;1320:269–282. doi: 10.1007/978-1-4939-2763-0_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de la Rosa-Trevín J.M., Quintana A., Del Cano L., Zaldívar A., Foche I., Gutiérrez J., Gómez-Blanco J., Burguet-Castell J., Cuenca-Alba J., Abrishami V. Scipion: a software framework toward integration, reproducibility and validation in 3D electron microscopy. J. Struct. Biol. 2016;195:93–99. doi: 10.1016/j.jsb.2016.04.010. [DOI] [PubMed] [Google Scholar]
- Demeshkina N., Repkova M., Ven’yaminova A., Graifer D., Karpova G. Nucleotides of 18S rRNA surrounding mRNA codons at the human ribosomal A, P, and E sites: a crosslinking study with mRNA analogs carrying an aryl azide group at either the uracil or the guanine residue. RNA. 2000;6:1727–1736. doi: 10.1017/s1355838200000996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- des Georges A., Dhote V., Kuhn L., Hellen C.U.T., Pestova T.V., Frank J., Hashem Y. Structure of mammalian eIF3 in the context of the 43S preinitiation complex. Nature. 2015;525:491–495. doi: 10.1038/nature14891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dong J., Nanda J.S., Rahman H., Pruitt M.R., Shin B.-S., Wong C.-M., Lorsch J.R., Hinnebusch A.G. Genetic identification of yeast 18S rRNA residues required for efficient recruitment of initiator tRNA(Met) and AUG selection. Genes Dev. 2008;22:2242–2255. doi: 10.1101/gad.1696608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dong J., Aitken C.E., Thakur A., Shin B.S., Lorsch J.R., Hinnebusch A.G. Rps3/uS3 promotes mRNA binding at the 40S ribosome entry channel and stabilizes preinitiation complexes at start codons. Proc. Natl. Acad. Sci. U S A. 2017;114:E2126–E2135. doi: 10.1073/pnas.1620569114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dvir S., Velten L., Sharon E., Zeevi D., Carey L.B., Weinberger A., Segal E. Deciphering the rules by which 5′-UTR sequences affect protein expression in yeast. Proc. Natl. Acad. Sci. U S A. 2013;110:E2792–E2801. doi: 10.1073/pnas.1222534110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eliseev B., Yeramala L., Leitner A., Karuppasamy M., Raimondeau E., Huard K., Alkalaeva E., Aebersold R., Schaffitzel C. Structure of a human cap-dependent 48S translation pre-initiation complex. Nucleic Acids Res. 2018;46:2678–2689. doi: 10.1093/nar/gky054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erzberger J.P., Stengel F., Pellarin R., Zhang S., Schaefer T., Aylett C.H.S., Cimermančič P., Boehringer D., Sali A., Aebersold R., Ban N. Molecular architecture of the 40S⋅eIF1⋅eIF3 translation initiation complex. Cell. 2014;158:1123–1135. doi: 10.1016/j.cell.2014.07.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fernandez I.S., Bai X.-C., Hussain T., Kelley A.C., Lorsch J.R., Ramakrishnan V., Scheres S.H.W. Molecular architecture of a eukaryotic translational initiation complex. Science. 2013;342:1240585. doi: 10.1126/science.1240585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferretti M.B., Ghalei H., Ward E.A., Potts E.L., Karbstein K. Rps26 directs mRNA-specific translation by recognition of Kozak sequence elements. Nat. Struct. Mol. Biol. 2017;24:700–707. doi: 10.1038/nsmb.3442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fringer J.M., Acker M.G., Fekete C.A., Lorsch J.R., Dever T.E. Coupled release of eukaryotic translation initiation factors 5B and 1A from 80S ribosomes following subunit joining. Mol. Cell. Biol. 2007;27:2384–2397. doi: 10.1128/MCB.02254-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gouridis G., Hetzert B., Kiosze-Becker K., de Boer M., Heinemann H., Nürenberg-Goloub E., Cordes T., Tampé R. ABCE1 controls ribosome recycling by an asymmetric dynamic conformational equilibrium. Cell Rep. 2019;28:723–734.e6. doi: 10.1016/j.celrep.2019.06.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graifer D., Molotkov M., Styazhkina V., Demeshkina N., Bulygin K., Eremina A., Ivanov A., Laletina E., Ven’yaminova A., Karpova G. Variable and conserved elements of human ribosomes surrounding the mRNA at the decoding and upstream sites. Nucleic Acids Res. 2004;32:3282–3293. doi: 10.1093/nar/gkh657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gross J.D., Moerke N.J., von der Haar T., Lugovskoy A.A., Sachs A.B., McCarthy J.E.G., Wagner G. Ribosome loading onto the mRNA cap is driven by conformational coupling between eIF4G and eIF4E. Cell. 2003;115:739–750. doi: 10.1016/s0092-8674(03)00975-9. [DOI] [PubMed] [Google Scholar]
- Grzegorski S.J., Chiari E.F., Robbins A., Kish P.E., Kahana A. Natural variability of Kozak sequences correlates with function in a zebrafish model. PLoS ONE. 2014;9:e108475. doi: 10.1371/journal.pone.0108475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hashem Y., des Georges A., Dhote V., Langlois R., Liao H.Y., Grassucci R.A., Hellen C.U.T., Pestova T.V., Frank J. Structure of the mammalian ribosomal 43S preinitiation complex bound to the scanning factor DHX29. Cell. 2013;153:1108–1119. doi: 10.1016/j.cell.2013.04.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heuer A., Gerovac M., Schmidt C., Trowitzsch S., Preis A., Kötter P., Berninghausen O., Becker T., Beckmann R., Tampé R. Structure of the 40S-ABCE1 post-splitting complex in ribosome recycling and translation initiation. Nat. Struct. Mol. Biol. 2017;24:453–460. doi: 10.1038/nsmb.3396. [DOI] [PubMed] [Google Scholar]
- Hinnebusch A.G. Molecular mechanism of scanning and start codon selection in eukaryotes. Microbiol. Mol. Biol. Rev. 2011;75:434–467. doi: 10.1128/MMBR.00008-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humphrey W., Dalke A., Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996;14:33–38, 27–28. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- Hussain T., Llácer J.L., Fernández I.S., Munoz A., Martin-Marcos P., Savva C.G., Lorsch J.R., Hinnebusch A.G., Ramakrishnan V. Structural changes enable start codon recognition by the eukaryotic translation initiation complex. Cell. 2014;159:597–607. doi: 10.1016/j.cell.2014.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson R.J., Hellen C.U.T., Pestova T.V. The mechanism of eukaryotic translation initiation and principles of its regulation. Nat. Rev. Mol. Cell Biol. 2010;11:113–127. doi: 10.1038/nrm2838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karcher A., Schele A., Hopfner K.P. X-ray structure of the complete ABC enzyme ABCE1 from Pyrococcus abyssi. J. Biol. Chem. 2008;283:7962–7971. doi: 10.1074/jbc.M707347200. [DOI] [PubMed] [Google Scholar]
- Khoshnevis S., Gross T., Rotte C., Baierlein C., Ficner R., Krebber H. The iron-sulphur protein RNase L inhibitor functions in translation termination. EMBO Rep. 2010;11:214–219. doi: 10.1038/embor.2009.272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khusainov I., Vicens Q., Bochler A., Grosse F., Myasnikov A., Ménétret J.-F., Chicher J., Marzi S., Romby P., Yusupova G. Structure of the 70S ribosome from human pathogen Staphylococcus aureus. Nucleic Acids Res. 2016;44:10491–10504. doi: 10.1093/nar/gkw933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiosze-Becker K., Ori A., Gerovac M., Heuer A., Nürenberg-Goloub E., Rashid U.J., Becker T., Beckmann R., Beck M., Tampé R. Structure of the ribosome post-recycling complex probed by chemical cross-linking and mass spectrometry. Nat. Commun. 2016;7:13248. doi: 10.1038/ncomms13248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak M. Point mutations close to the AUG initiator codon affect the efficiency of translation of rat preproinsulin in vivo. Nature. 1984;308:241–246. doi: 10.1038/308241a0. [DOI] [PubMed] [Google Scholar]
- Kozak M. Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell. 1986;44:283–292. doi: 10.1016/0092-8674(86)90762-2. [DOI] [PubMed] [Google Scholar]
- Kozak M. At least six nucleotides preceding the AUG initiator codon enhance translation in mammalian cells. J. Mol. Biol. 1987;196:947–950. doi: 10.1016/0022-2836(87)90418-9. [DOI] [PubMed] [Google Scholar]
- Kozak M. An analysis of 5′-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 1987;15:8125–8148. doi: 10.1093/nar/15.20.8125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak M. The scanning model for translation: an update. J. Cell Biol. 1989;108:229–241. doi: 10.1083/jcb.108.2.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee A.S., Kranzusch P.J., Doudna J.A., Cate J.H.D. eIF3d is an mRNA cap-binding protein that is required for specialized translation initiation. Nature. 2016;536:96–99. doi: 10.1038/nature18954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin F.T., MacDougald O.A., Diehl A.M., Lane M.D. A 30-kDa alternative translation product of the CCAAT/enhancer binding protein alpha message: transcriptional activator lacking antimitotic activity. Proc. Natl. Acad. Sci. U S A. 1993;90:9606–9610. doi: 10.1073/pnas.90.20.9606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Llácer J.L., Hussain T., Marler L., Aitken C.E., Thakur A., Lorsch J.R., Hinnebusch A.G., Ramakrishnan V. Conformational differences between open and closed states of the eukaryotic translation initiation complex. Mol. Cell. 2015;59:399–412. doi: 10.1016/j.molcel.2015.06.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Llácer J.L., Hussain T., Saini A.K., Nanda J.S., Kaur S., Gordiyenko Y., Kumar R., Hinnebusch A.G., Lorsch J.R., Ramakrishnan V. Translational initiation factor eIF5 replaces eIF1 on the 40S ribosomal subunit to promote start-codon recognition. eLife. 2018;7:1–33. doi: 10.7554/eLife.39273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lomakin I.B., Steitz T.A. The initiation of mammalian protein synthesis and mRNA scanning mechanism. Nature. 2013;500:307–311. doi: 10.1038/nature12355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luna R.E., Arthanari H., Hiraishi H., Nanda J., Martin-Marcos P., Markus M.A., Akabayov B., Milbradt A.G., Luna L.E., Seo H.C. The C-terminal domain of eukaryotic initiation factor 5 promotes start codon recognition by its dynamic interplay with eIF1 and eIF2β. Cell Rep. 2012;1:689–702. doi: 10.1016/j.celrep.2012.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luna R.E., Arthanari H., Hiraishi H., Akabayov B., Tang L., Cox C., Markus M.A., Luna L.E., Ikeda Y., Watanabe R. The interaction between eukaryotic initiation factor 1A and eIF5 retains eIF1 within scanning preinitiation complexes. Biochemistry. 2013;52:9510–9518. doi: 10.1021/bi4009775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maden B.E.H. The numerous modified nucleotides in eukaryotic ribosomal RNA. Prog. Nucleic Acid Res. Mol. Biol. 1990;39:241–303. doi: 10.1016/s0079-6603(08)60629-7. [DOI] [PubMed] [Google Scholar]
- Mancera-Martínez E., Brito Querido J., Valasek L.S., Simonetti A., Hashem Y. ABCE1: A special factor that orchestrates translation at the crossroad between recycling and initiation. RNA Biol. 2017;14:1279–1285. doi: 10.1080/15476286.2016.1269993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marintchev A., Edmonds K.A., Marintcheva B., Hendrickson E., Oberer M., Suzuki C., Herdy B., Sonenberg N., Wagner G. Topology and regulation of the human eIF4A/4G/4H helicase complex in translation initiation. Cell. 2009;136:447–460. doi: 10.1016/j.cell.2009.01.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin F., Barends S., Jaeger S., Schaeffer L., Prongidi-Fix L., Eriani G. Cap-assisted internal initiation of translation of histone H4. Mol. Cell. 2011;41:197–209. doi: 10.1016/j.molcel.2010.12.019. [DOI] [PubMed] [Google Scholar]
- Martin F., Ménétret J.F., Simonetti A., Myasnikov A.G., Vicens Q., Prongidi-Fix L., Natchiar S.K., Klaholz B.P., Eriani G. Ribosomal 18S rRNA base pairs with mRNA during eukaryotic translation initiation. Nat. Commun. 2016;7:12622. doi: 10.1038/ncomms12622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mastronarde D.N. SerialEM: a program for automated tilt series acquisition on Tecnai microscopes using prediction of specimen position. Microsc. Microanal. 2003;9:1182–1183. [Google Scholar]
- NCBI Resource Coordinators Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2017;45(D1):D12–D17. doi: 10.1093/nar/gkw1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Neupane R., Pisareva V.P., Rodríguez C.F., Pisarev A.V., Fernández I.S. A complex IRES at the 5′-UTR of a viral mRNA assembles a functional 48S complex via an uAUG intermediate. bioRxiv. 2019 doi: 10.1101/863761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nürenberg-Goloub E., Kratzat H., Heinemann H., Heuer A., Kötter P., Berninghausen O., Becker T., Tampé R., Beckmann R. Molecular analysis of the ribosome recycling factor ABCE1 bound to the 30S post-splitting complex. EMBO J. 2020 doi: 10.15252/embj.2019103788. Published online February 17, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Papadopoulos J.S., Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics. 2007;23:1073–1079. doi: 10.1093/bioinformatics/btm076. [DOI] [PubMed] [Google Scholar]
- Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- Phillips J.C., Braun R., Wang W., Gumbart J., Tajkhorshid E., Villa E., Chipot C., Skeel R.D., Kalé L., Schulten K. Scalable molecular dynamics with NAMD. J. Comput. Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pisarev A.V., Kolupaeva V.G., Pisareva V.P., Merrick W.C., Hellen C.U.T., Pestova T.V. Specific functional interactions of nucleotides at key -3 and +4 positions flanking the initiation codon with components of the mammalian 48S translation initiation complex. Genes Dev. 2006;20:624–636. doi: 10.1101/gad.1397906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pisarev A.V., Kolupaeva V.G., Yusupov M.M., Hellen C.U.T., Pestova T.V. Ribosomal position and contacts of mRNA in eukaryotic translation initiation complexes. EMBO J. 2008;27:1609–1621. doi: 10.1038/emboj.2008.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pisarev A.V., Skabkin M.A., Pisareva V.P., Skabkina O.V., Rakotondrafara A.M., Hentze M.W., Hellen C.U.T., Pestova T.V. The role of ABCE1 in eukaryotic posttermination ribosomal recycling. Mol. Cell. 2010;37:196–210. doi: 10.1016/j.molcel.2009.12.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pollo-Oliveira L., de Crécy-Lagard V. Can protein expression be regulated by modulation of tRNA modification profiles? Biochemistry. 2019;58:355–362. doi: 10.1021/acs.biochem.8b01035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Preis A., Heuer A., Barrio-Garcia C., Hauser A., Eyler D.E., Berninghausen O., Green R., Becker T., Beckmann R. Cryoelectron microscopic structures of eukaryotic translation termination complexes containing eRF1-eRF3 or eRF1-ABCE1. Cell Rep. 2014;8:59–65. doi: 10.1016/j.celrep.2014.04.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rogers G.W., Jr., Richter N.J., Lima W.F., Merrick W.C. Modulation of the helicase activity of eIF4A by eIF4B, eIF4H, and eIF4F. J. Biol. Chem. 2001;276:30914–30922. doi: 10.1074/jbc.M100157200. [DOI] [PubMed] [Google Scholar]
- Rohou A., Grigorieff N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 2015;192:216–221. doi: 10.1016/j.jsb.2015.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scheres S.H.W. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shoemaker C.J., Green R. Kinetic analysis reveals the ordered coupling of translation termination and ribosome recycling in yeast. Proc. Natl. Acad. Sci. U S A. 2011;108:E1392–E1398. doi: 10.1073/pnas.1113956108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonetti A., Brito Querido J., Myasnikov A.G., Mancera-Martinez E., Renaud A., Kuhn L., Hashem Y. eIF3 peripheral subunits rearrangement after mRNA binding and start-codon recognition. Mol. Cell. 2016;63:206–217. doi: 10.1016/j.molcel.2016.05.033. [DOI] [PubMed] [Google Scholar]
- Taoka M., Nobe Y., Yamaki Y., Sato K., Ishikawa H., Izumikawa K., Yamauchi Y., Hirota K., Nakayama H., Takahashi N., Isobe T. Landscape of the complete RNA chemical modifications in the human 80S ribosome. Nucleic Acids Res. 2018;46:9289–9298. doi: 10.1093/nar/gky811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thiaville P.C., Legendre R., Rojas-Benítez D., Baudin-Baillieu A., Hatin I., Chalancon G., Glavic A., Namy O., de Crécy-Lagard V. Global translational impacts of the loss of the tRNA modification t6A in yeast. Microb. Cell. 2016;3:29–45. doi: 10.15698/mic2016.01.473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trabuco L.G., Villa E., Mitra K., Frank J., Schulten K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure. 2008;16:673–683. doi: 10.1016/j.str.2008.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Visweswaraiah J., Hinnebusch A.G. Interface between 40S exit channel protein uS7/Rps5 and eIF2α modulates start codon recognition in vivo. eLife. 2017;6:1–22. doi: 10.7554/eLife.22572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Visweswaraiah J., Pittman Y., Dever T.E., Hinnebusch A.G. The β-hairpin of 40S exit channel protein Rps5/uS7 promotes efficient and accurate translation initiation in vivo. eLife. 2015;4:e07939. doi: 10.7554/eLife.07939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wallace E., Maufrais C., Sales-Lee J., Tuck L., De Oliveira L., Feuerbach F., Moyrand F., Natarajan P., Madhani H.D., Janbon G. Start codon context controls translation initiation in the fungal kingdom. bioRxiv. 2019 doi: 10.1101/654046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J., Johnson A.G., Lapointe C.P., Choi J., Prabhakar A., Chen D.H., Petrov A.N., Puglisi J.D. eIF5B gates the transition from translation initiation to elongation. Nature. 2019;573:605–608. doi: 10.1038/s41586-019-1561-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weisser M., Schäfer T., Leibundgut M., Böhringer D., Aylett C.H.S., Ban N. Structural and functional insights into human re-initiation complexes. Mol. Cell. 2017;67:447–456.e7. doi: 10.1016/j.molcel.2017.06.032. [DOI] [PubMed] [Google Scholar]
- Wimberly B.T., White S.W., Ramakrishnan V. The structure of ribosomal protein S7 at 1.9 A resolution reveals a β-hairpin motif that binds double-stranded nucleic acids. Structure. 1997;5:1187–1198. doi: 10.1016/s0969-2126(97)00269-4. [DOI] [PubMed] [Google Scholar]
- Young D.J., Guydosh N.R., Zhang F., Hinnebusch A.G., Green R. Rli1/ABCE1 recycles terminating ribosomes and controls translation reinitiation in 3’UTRs in vivo. Cell. 2015;162:872–884. doi: 10.1016/j.cell.2015.07.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang F., Saini A.K., Shin B.S., Nanda J., Hinnebusch A.G. Conformational changes in the P site and mRNA entry channel evoked by AUG recognition in yeast translation preinitiation complexes. Nucleic Acids Res. 2015;43:2293–2312. doi: 10.1093/nar/gkv028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng S.Q., Palovcak E., Armache J.-P., Verba K.A., Cheng Y., Agard D.A. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods. 2017;14:331–332. doi: 10.1038/nmeth.4193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou J., Lancaster L., Donohue J.P., Noller H.F. Crystal structures of EF-G - Ribosome complexes trapped in intermediate states of translocation. Science. 2013;340:1236086. doi: 10.1126/science.1236086. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The atomic coordinates of the β-globin LS48S IC, β-globin LS48S+eIF3 IC and H4 LS48S IC have been deposited in the Protein Data Bank (PDB). The accession numbers for the atomic models of the LS48ICc reported in this paper are: 6YAL, 6YAM and 6YAN, respectively. The cryo-EM maps of β-globin LS48S IC, β-globin LS48S+eIF3 IC, H4 LS48S IC, β-globin LS48S IC + ATP and β-globin LS48S+eIF3 IC + ATP have been deposited in the Electron Microscopy Data Bank (EMDB) with the accession codes: EMD-10760, EMD-10761, EMD-10762, EMD-10763 and EMD-10764, respectively.







