Abstract
Protein folding in vivo begins during synthesis on the ribosome and is modulated by molecular chaperones that engage the nascent polypeptide. How these features of protein biogenesis influence the maturation pathway of nascent proteins is incompletely understood. Here, we use hydrogen–deuterium exchange mass spectrometry to define, at peptide resolution, the cotranslational chaperone-assisted folding pathway of Escherichia coli dihydrofolate reductase. The nascent polypeptide folds along an unanticipated pathway through structured intermediates not populated during refolding from denaturant. Association with the ribosome allows these intermediates to form, as otherwise destabilizing carboxy-terminal sequences remain confined in the ribosome exit tunnel. Trigger factor binds partially folded states without disrupting their structure, and the nascent chain is poised to complete folding immediately upon emergence of the C terminus from the exit tunnel. By mapping interactions between the nascent chain and ribosomal proteins, we trace the path of the emerging polypeptide during synthesis. Our work reveals new mechanisms by which cellular factors shape the conformational search for the native state.
Subject terms: Biochemistry, Structural biology, Protein folding
The authors follow the folding dynamics of a nascent protein trapped during its synthesis, showing how the ribosome and a molecular chaperone shape the pathway of protein folding.
Main
Our understanding of the physical principles underlying the refolding of small proteins in isolation is continually improving1. De novo protein folding in vivo differs in that it begins in the context of translation. Unlike folding from denaturant, folding on the ribosome is coupled to vectorial synthesis (proceeding from the amino terminus to the C terminus), such that not all elements of the nascent protein chain (NC) are simultaneously available for folding. As a result, structural motifs or domains may form sequentially during translation2–4, a mechanism that can mitigate interdomain misfolding5–7. The ribosome also places physical constraints on folding. The exit tunnel, which is narrow and negatively charged, limits the conformational space initially accessible to the nascent polypeptide8–10, and detailed investigations of several model proteins have shown that proximity to the ribosome surface can thermodynamically destabilize complete domains, thereby delaying the onset of folding11–15. Nevertheless, there is abundant indirect evidence for partial folding of incomplete domains at intermediate points during translation3,9,16–20. How cotranslational folding proceeds during domain synthesis is incompletely understood.
The ribosome associates with additional factors that regulate folding. Most prominent among these in bacteria is the highly abundant ribosome-bound chaperone Trigger factor (TF), which engages the majority of nascent E. coli proteins21. Although not essential under normal growth conditions, TF deletion is lethal in the absence of buffering by the Hsp70 chaperone DnaK22. Various, sometimes paradoxical functions have been ascribed to TF. These include inhibiting aggregation23, promoting folding and assembly24,25, delaying cotranslational assembly26, destabilizing folded domains27 and favoring post-translational folding28,29. How exactly chaperones modify de novo folding on the ribosome is unclear.
The inherent structural heterogeneity of the nascent protein, especially in the context of the size and complexity of the ribosome (with associated chaperones), poses a substantial technical challenge to probing local NC conformation. As a result, folding pathways on the ribosome are ill-defined compared to refolding from denaturant, and whether translation fundamentally modifies the mechanism of protein maturation remains controversial16,30–34. Here, we use proteomics, biophysical measurements and hydrogen–deuterium exchange (HDX) mass spectrometry (MS) to study cotranslational protein folding at the peptide level. Our approach resolves subtle local conformational differences between ribosome-bound biogenesis intermediates, while simultaneously reporting on the dynamic behavior and interactions of ribosomal proteins and bound chaperones.
Using dihydrofolate reductase (DHFR) as a model, we show that the ribosome grants the NC access to an efficient folding route that is inaccessible during refolding from denaturant. Although the vectorial character of protein synthesis prevents simultaneous folding of all elements of the DHFR β-sheet, a central subdomain in DHFR behaves as a novel independent folding unit during translation. This is enabled by association with the ribosome and occurs while the NC is bound by the chaperone TF. Together, our data show how the ribosome and TF collaborate to define the structural progression of a nascent protein.
Results
Preparation of cotranslational folding intermediates
As a model for studying protein biogenesis, we used E. coli DHFR, an essential oxidoreductase enzyme with close homologs in all domains of life35. DHFR is a single-domain monomeric protein comprising 159 amino acids (aa), with a central discontinuous eight-stranded β-sheet and four flanking α-helices (Fig. 1a,c). During refolding from denaturant in vitro, DHFR undergoes global collapse on the microsecond timescale, and the order of strands in the β-sheet motif is established within 6 ms36. Importantly, folding of DHFR is substantially faster than its synthesis on the ribosome, which would require ~8–16 s (~10–20 codons translated per second37). Stalling the ribosome is therefore not expected to distort NC conformational sampling that occurs during normal translation.
To sample DHFR cotranslational folding intermediates along the pathway of vectorial synthesis, we prepared stalled ribosome–NC complexes (RNCs) representing snapshots of folding in vivo (Fig. 1b)38–40. The NC sequence was truncated at the boundaries between discrete structural motifs, also considering the annotation of DHFR 'subdomains' (adenosine binding subdomain, ABD (residues 38–106); discontinuous loop subdomain, DLD (residues 1–37 and 107–159)). Translation was stalled by encoding, C-terminal to each DHFR fragment, an 8 aa ribosome stall-inducing sequence that is resistant to folding-induced release41. As a control, we prepared a construct consisting of the complete DHFR sequence separated from the stall site by an unstructured C-terminal linker of 50 aa, such that the full-length DHFR is a total of 58 residues from the peptidyl transfer center (denoted FL + 58RNC). Prior force profile analyses suggested that DHFR stalled at this linker length can fold into a conformation capable of binding the inhibitor methotrexate17. Note that ~30 residues in an extended conformation are required to span the exit tunnel42.
RNCs purified from E. coli were homogeneous, stable and insensitive to puromycin, indicating that the ribosomes were accurately stalled (Extended Data Fig. 1a–f). MS confirmed the presence of the NC and all ribosomal proteins at close to the expected stoichiometries, as well as the absence of significant contamination (Extended Data Fig. 2a,b). The exception was the chaperone TF, which copurified predominantly with RNCs containing the first 106 or 126 residues of DHFR (1–106RNC and 1–126RNC; Extended Data Figs. 1c and 2c).
Purified RNCs were exposed to deuterated buffer for 10 s, 100 s or 1,000 s and the entire system was digested into peptides. The peptide mixture was separated by liquid chromatography and ion mobility, and the relative deuterium uptake of peptides from all proteins in the system was measured by mass spectrometry (Fig. 2a and Supplementary Data 1). The degree of deuterium incorporation indicates protein conformation, as backbone hydrogens are protected from exchange when involved in stable secondary structure, buried in the core of a folded protein or at a protein–protein interface43. Comparative HDX measurements are therefore a sensitive probe of the local environment of amide hydrogens. Although different factors can influence HDX rates, quantitative comparison with appropriate reference samples can readily distinguish folded versus unfolded and native versus non-native states at the peptide level44–47.
We first analyzed FL + 58RNC, in which DHFR was expected to be completely emerged from the ribosome and folded. A total of 34 peptides from the NC covering 94% of the sequence could be followed, despite the analytical complexity of the system (Extended Data Fig. 3a). For comparison, we analyzed isolated DHFR (here called FL DHFR). Deuterium uptake for DHFR peptides was almost identical between FL + 58RNC and FL DHFR (Fig. 2b, Extended Data Fig. 3b and Supplementary Data 1), indicating that DHFR, fully translated yet still coupled to the ribosome, is essentially natively folded. Two exceptions were subtle protection from exchange of peptide 63–92 in the NC, and deprotection of peptide 94–117, both apparent at longer deuterium exposure times. These differences may arise from weak interactions between the ribosome surface and folded DHFR, discussed below. Overall, these data demonstrate that our approach allows for the analysis of HDX in an extremely complex mixture and can accurately report on local NC conformation even in the background of the entire ribosomal protein complement.
Cotranslational folding pathway of DHFR
To define the folding dynamics of DHFR on the ribosome, we compared peptide-resolved HDX at different NC lengths. FL DHFR and FL + 58RNC served as fully folded references, and deuterium uptake at three exchange times provided a fingerprint for the native conformation. We initially focused on the N-terminal region including part of strand β1 in the DLD (residues 5–30), which presented a set of overlapping peptides common to all RNCs. As a representative example, deuteration of peptide 9–28 as a function of NC length and deuterium exposure time is shown in Fig. 3a. Note that the same behavior was observed in multiple peptides covering this region of DHFR (Fig. 3b). At short chain lengths (1–37RNC), the N-terminal region is protected relative to the maximally deuterated control but is readily distinguishable from the same sequence in native DHFR, which was much more protected at 10 s deuteration. Non-native protection at short chain lengths may be a result of interactions with the ribosome or reflect transient non-native structure stabilized by the exit tunnel4. Although substantially deprotected relative to native DHFR, the N-terminal region was not completely deuterated in the RNCs, as judged by comparison to deuteration of a synthetic peptide comprising residues 1–37 (1–37peptide), which was maximally exchanged at the earliest labeling time. Elongation of the NC to extend the N terminus fully beyond the exit tunnel (1–64RNC) resulted in further deuteration. The N-terminal region remained highly deuterated with little variation in uptake, even as the NC extended to 106 and 126 residues during synthesis of the ABD. Folding of the DLD is therefore delayed until the complete subdomain is available and the C terminus is released from the ribosome, or artificially extended beyond the tunnel through a linker as in FL + 58RNC. Notably, this folding pathway is different than the in vitro pathway of refolding of DHFR from denaturant, in which the central eight-stranded β-sheet that spans the two subdomains is established early in the folding pathway48.
Extending this analysis to additional peptides across the RNCs, each reporting local folding information, allowed us to reconstruct the complete cotranslational folding pathway of DHFR. Peptide-resolved HDX for each different length RNC is shown in Fig. 3b, Extended Data Fig. 4a,b and Supplementary Data 1, and folding information is mapped onto the structure of native DHFR in Fig. 3c. Although not all peptides were uniformly detected across stalled RNCs of different lengths, clear patterns of HDX protection corresponding to NC folding were apparent. As described above, the N-terminal region including strand β1 is not folded when initially exposed outside the exit tunnel (1–37RNC), with deuteration levels close to the maximally deuterated control. In 1–64RNC, residues 25–36 emerge from the tunnel and fold into a native helix α1 belonging to the DLD, while the N-terminal region remains unstructured. Synthesis of the remainder of the ABD (1–106RNC) allows β2–β4 to exit the ribosome and acquire native-like protection from HDX. This protection was not a result of association with TF, as discussed in subsequent sections. Folding of the ABD completes when 126 residues of DHFR have been synthesized (1–126RNC), allowing β5 and α4 to coalesce with the remainder of the ABD outside the exit tunnel. At this point, β6 is still occluded in the tunnel, precluding folding of the adjacent β1 in the DLD. Synthesis and release of the C-terminal strand β8 from the ribosome triggers the final folding of the DLD, including the N-terminal segments (FL DHFR and FL + 58RNC).
The ribosome stabilizes a cotranslational folding intermediate
Unlike the DLD, peptides corresponding to the β-sheet of the ABD became progressively protected from HDX during translation, requiring neither the sequence context of FL DHFR nor even the complete subdomain (Fig. 3c). By contrast, previous work has shown that fragments of DHFR produced by chemical cleavage are disordered in isolation49. To test whether the ribosome modulates the conformation of the nascent ABD, we expressed isolated fragments corresponding to the ABD in E. coli. Only the longest fragment, consisting of residues 1–126 (herein 1–126stop), was soluble, and this was contingent on maintaining it as a fusion protein, with monomeric ultrastable GFP (muGFP50) at the N terminus. HDX MS showed that 1–126stop was substantially deprotected relative to native DHFR (Fig. 3d and Supplementary Data 1). High levels of exchange were observed for the N-terminal region, as expected in the absence of C-terminal strands β7 and β8 that complete the DLD. The β-strands (β2–β4) and peripheral helices comprising the core of the ABD were also strongly deprotected relative to FL DHFR. The ABD, therefore, fails to fold into a stable structure in isolation, although native-like folding of this subdomain is supported on the ribosome in the context of an RNC.
We asked why incomplete chains of DHFR can fold on the ribosome but not in free solution. We noticed that residues 93–118 near the C terminus of 1–126stop were maximally deuterated even at short deuteration times, indicative of structural disorder (Fig. 3d and Supplementary Data 1). In the corresponding RNC, however, this sequence is sterically confined in the ribosome exit tunnel (Figs. 1b and 3c). Therefore, we considered whether, in the absence of the ribosome, a region of C-terminal disorder may destabilize folded DHFR. Previous work has shown that unstructured termini can generate an entropic force that modulates protein conformation51. To explore this idea, we used FL + 58RNC as a model. FL + 58RNC consists of FL DHFR tethered to the peptidyl transfer center (PTC) by a flexible 50 aa linker that is largely occluded in the exit tunnel (Fig. 1b). Our HDX analysis showed that DHFR is natively folded in this context and almost indistinguishable from the isolated domain (Fig. 2b and Extended Data Fig. 3b,c). We reasoned that releasing the protein from the ribosome would remove the protective effect of the exit tunnel and allow an unstructured terminus to disrupt the conformation of DHFR. To test this idea, we replaced the stalling sequence with a stop codon, resulting in FL + 50stop. HDX MS analysis of FL + 50stop confirmed that the linker was highly dynamic, as expected (Extended Data Fig. 4c–e and Supplementary Data 1). Furthermore, we observed extensive deprotection in the DLD and ABD relative to native FL DHFR, consistent with increased structural dynamics (Fig. 3e and Extended Data Fig. 4c–e). Combined, our observations suggest a model whereby domain instability induced by unstructured C termini is rescued by confinement in the ribosome exit tunnel, potentially facilitating the folding of structured intermediates during translation.
DHFR can fold close to the ribosome
Proximity to the ribosome surface thermodynamically destabilizes some full-length nascent proteins relative to the native state of the ribosome11–13,15, prompting us to consider whether tethering to the ribosome might alter the conformation of FL DHFR. HDX analysis of FL + 58RNC did not reveal substantial differences in conformation compared to isolated DHFR (Fig. 2b). We therefore prepared an RNC with a 38-residue linker (FL + 38RNC), designed to bring DHFR closer to the ribosome surface. The folded core of DHFR in FL + 38RNC was similar in protection to isolated DHFR, although peripheral loops and part of the DLD were affected (Fig. 4a, Extended Data Fig. 5 and Supplementary Data 1). Furthermore, both FL + 58RNC and FL + 38RNC were enzymatically active (Fig. 4b). Therefore, close proximity to the ribosome exit tunnel does not substantially disrupt the conformation of folded wild-type DHFR.
Remarkably, FL + 38RNC and FL + 58RNC were at least twofold to fourfold more active than isolated DHFR (Fig. 4b and Extended Data Fig. 6a,b). Vmax and, importantly, KMDHF were higher for the RNC, indicating that this effect is not explained by errors in measurement of enzyme concentration (Fig. 4b and Extended Data Fig. 6d). The results were also not explained by tube adsorption (Extended Data Fig. 6a). The extent of activity stimulation at fixed substrate concentration may have been underestimated because RNC concentration was calculated by assuming 100% occupancy of NCs on ribosomes (See Methods). This was bona fide DHFR activity, as it was fully inhibited by methotrexate, and neither empty ribosomes nor intermediate-length RNCs were active (Fig. 4b and Extended Data Fig. 6e). We observed the same effect for an RNC with a different linker sequence, indicating that hyperactivity is not an artefact of linker chemistry (Extended Data Fig. 6c). Notably, the region near the N terminus of DHFR that is protected in FL + 38RNC includes the 'Met20 loop' at the folate binding site of DHFR, the dynamics of which are known to strongly influence catalysis35 (Fig. 4a). Thus, physical interactions with the ribosome may alter the active site of DHFR without inducing global unfolding.
To test the threshold linker length for folding of DHFR on the ribosome, we prepared FL + 28RNC. This RNC was devoid of oxidoreductase activity (Fig. 4b) and copurified with TF (Extended Data Fig. 2c). TF was not responsible for the lack of activity, as FL + 28RNC purified from a TF-free background was also inactive (Extended Data Fig. 6c). We reasoned that a 28 aa linker is insufficient to expose the C terminus of DHFR outside the exit tunnel, precluding folding of the DLD. Indeed, DHFR activity could be restored by releasing the NC from FL + 28RNC with EDTA and RNase (Fig. 4b). This observation is consistent with our interpretation, based on HDX MS, that DHFR folding completes post-translationally when the C terminus emerges from the tunnel (Fig. 3a–c). To test whether DHFR in FL + 28RNC is folding-competent before release from the ribosome, we added a peptide comprising the C-terminal 10 aa of DHFR (peptide C10) in trans. Peptide C10 activated FL + 28RNC in a concentration-dependent manner, whereas a scrambled-sequence control had no effect (Fig. 4b and Extended Data Fig. 6c). None of the peptides influenced the activity of isolated FL DHFR. TF was not required for reactivation, as TF-free FL + 28RNC was similarly reactivated (Extended Data Fig. 6c). Peptide C10 did not release the NC from ribosomes, nor did it displace TF (Extended Data Fig. 6f). DHFR is therefore poised to complete folding upon emergence of the C terminus from the ribosome exit tunnel, and neither close proximity to the ribosome surface nor association with TF prevent acquisition of the native state.
Interaction of TF with DHFR on the ribosome
We next sought to determine the role of TF in the cotranslational folding of DHFR. TF is a ribosome-associated chaperone with a three-domain architecture (Fig. 5a). The ribosome binding domain (RBD) contains a conserved ribosome-interaction motif52, and the substrate binding domain (SBD) is required for chaperone function off the ribosome53. The role of the peptidyl-prolyl isomerase domain (PPD) in de novo folding is enigmatic54. Given that chaperone–RNC complexes are highly dynamic, details of the interaction between TF and NCs have eluded structural characterization55,56, although site-specific photocrosslinking showed that NCs contact all three domains of TF56,57.
TF copurified with 1–106RNC and 1–126RNC, coinciding with the synthesis of the complete ABD (Extended Data Figs. 1c and 2c). To characterize the interaction between TF and nascent DHFR, we analyzed deuterium uptake for endogenous TF in the RNC samples and compared these data to HDX MS measurements of isolated TF. We followed 181 peptides for TF in each condition, with 99.5% sequence coverage (Extended Data Fig. 7a and Supplementary Data 1). A potential confounding factor is that isolated TF is weakly dimeric (dimerization Kd ~1 µM) but binds ribosomes as a monomer57–59. To account for this effect, we used both wild-type TF and a constitutively monomeric variant60 as reference controls for isolated TF. HDX MS confirmed that the monomeric variant was deprotected relative to wild-type TF at sites that are normally buried by the dimer interface (Extended Data Fig. 7b and Supplementary Data 1).
Compared to isolated TF, TF that was bound to 1–106RNC and 1–126RNC was very strongly protected from HDX in the ribosome-interacting motif of the RBD, as expected52 (Fig. 5b). We also observed protection of additional regions in all three domains of TF, which we attribute to interaction with nascent DHFR (Fig. 5b). Strong protection was observed in the two 'arms' of the SBD, with intermediate protection in the RBD, 'neck' region and catalytic site in the PPD. Analysis of the binding interface revealed a mix of hydrophobic and hydrophilic surfaces (Fig. 5c). The interaction sites in the RBD and PPD are predominantly hydrophobic. In the SBD, the interface includes the hydrophobic pocket in the crook of arm 2 as well as hydrophilic surfaces in arm 1 and the neck. The hydrophilic part of the neck situated between the arms, previously implicated in substrate engagement off the ribosome61, was not protected from exchange. The protected sites were the same in TF bound to 1–106RNC and 1–126RNC, indicating a common interaction surface. We did, however, detect increased protection of hydrophilic regions in arm 2 (residues 351–374 and 388–409) when TF was bound to the longer NC (Fig. 5B (red ellipses) and Supplementary Data 1).
Nascent DHFR can fold while associated with TF
To determine how TF influences the folding of DHFR on the ribosome, we purified 1–106RNC and 1–126RNC from E. coli lacking TF (ref. 22), resulting in 1–106RNCΔTF and 1–126RNCΔTF. MS confirmed that the absence of TF did not result in other chaperones (for example DnaK, DnaJ or GroEL) copurifing with the RNCs (Extended Data Fig. 2c). Pelleting assays showed that 1–126RNCΔTF was still competent to bind purified TF in vitro, and binding was sensitive to mutation of the ribosome-interacting motif in TF (Extended Data Fig. 8a).
To probe the conformation of the NC without TF, we analyzed the HDX behavior of nascent DHFR in 1–106RNCΔTF and 1–126RNCΔTF (Extended Data Fig. 7c and Supplementary Data 1). A set of peptides reported on the N terminus and β-strands in the ABD enabled us to compare wild-type and ΔTF RNCs. Although some marginal differences in deuterium uptake could be detected in the absence of TF, these were not consistently observed across overlapping peptides. Cotranslational folding of the ABD therefore occurs irrespective of the presence of TF, and TF binding does not explain our observation that the N-terminal region remains unfolded until release of the C terminus from the ribosome (Fig. 3c).
Considering that DHFR folding can occur in the presence of TF, we questioned what features of the NC are recognized by the chaperone. To probe the contribution of electrostatic versus hydrophobic interactions to binding, we tested the salt sensitivity of the TF–RNC interaction. TF preferred 1–106RNC over 1–126RNC when the complexes were purified under high-salt conditions (Extended Data Fig. 2c). By contrast, RNCs purified under low-salt conditions bound similar amounts of TF (Extended Data Fig. 8b). TF binding to 1–126RNC is therefore partially stabilized by electrostatic interactions, unlike binding to 1–106RNC, which is predominantly mediated by hydrophobic contacts and therefore stabilized by high salt. This observation is consistent with the folding-induced burial of the ABD hydrophobic core in 1–126RNC (Fig. 3c) as well as the preference of this NC for binding hydrophilic surfaces on TF (Fig. 5b).
To directly test the contribution of NC folding to TF binding, we introduced destabilizing mutations62 into 1–126RNC (Extended Data Fig. 8c). The mutated RNC bound more TF than wild-type 1–126RNC when purified under high-salt conditions, but the difference was much less pronounced when binding was reconstituted in vitro under low-salt conditions (Extended Data Fig. 8d,e). Thus, TF engages poorly folded intermediates by hydrophobic surfaces. Taken together, these observations indicate that TF uses a composite hydrophobic–hydrophilic interface to accommodate both folded and unfolded NCs, and provide indirect evidence supporting our conclusion that the ABD is natively folded in wild-type 1–126RNC. Our low-salt conditions are in a similar range of ionic strength to the E. coli cytosol (~100–200 mM (ref. 63)). In vivo, TF would thus be expected to bind equally well to NCs exposing different amounts of hydrophobic surface, exploiting different types of interaction in each case.
NC interactions with ribosomal proteins
To identify possible NC–ribosome interaction sites, we compared the HDX of ribosomal proteins in the RNCs to the same proteins in empty 70S ribosomes. We focused on a set of five ribosomal proteins (L4, L22, L23, L24 and L29) that are near or in the exit tunnel and therefore likely to contact the emerging NC (Fig. 6a). Sequence coverage of these proteins was close to 100% (Extended Data Fig. 9a). We identified several sites of HDX protection in ribosomal proteins when NC was present, often dependent on NC length (Fig. 6b–g and Supplementary Data 1).
The lower part of the tunnel is occupied by the stall-inducing sequence. Consequently, the tunnel-exposed loop of L22, which together with L4 forms the constriction site close to the PTC, was protected from exchange at all NC lengths. The corresponding loop in L4 was not protected, consistent with structural data showing that wild-type SecM contacts L22, not L4 (refs. 64,65) (Fig. 6a). Peptide 22–42 of L29, located on the ribosome surface adjacent to the exit port, was protected only in the shortest RNC (1–37RNC) (Fig. 6b). Longer NCs may expose sequences that favor different interaction sites (discussed below) or preferentially interact with TF rather than L29.
Peptide 8–29 of L23, covering the binding site for TF on the ribosome surface52, was strongly protected in 1–106RNC and 1–126RNC (Fig. 6d,e); as expected, given that these RNCs recruit TF (Extended Data Fig. 2c). However, we also detected protection of the TF docking site in 1–64RNC, which does not engage TF (Fig. 6c). In 1–64RNC (but not 1–37RNC), the emerging NC is highly basic (pI ~9.9), perhaps facilitating electrostatic interactions with the acidic TF docking site. Direct competition between the NC and the TF-binding site on the ribosome may be an additional mechanism regulating TF recruitment to RNCs.
Peptide 61–84 of L23, forming a hairpin loop that protrudes into the exit tunnel ~50 Å from the PTC (Fig. 6b, green rectangle), was protected in all RNCs except 1–106RNC and 1–126RNC, which strongly recruit TF (Extended Data Fig. 2c), suggesting the possibility of allosteric communication between the NC and TF through L23 as previously hypothesized66. Removing TF did not result in new protection of the L23 tunnel loop, arguing against reciprocal communication between TF and L23 (Extended Data Fig. 9b,c and Supplementary Data 1).
In FL + 58RNC, none of the proteins surrounding the exit port were protected from HDX relative to empty ribosomes, consistent with the absence of stable interactions with the ribosome surface (Fig. 6g). However, FL + 38RNC protected a loop in L24 (Fig. 6f, green rectangle) that was previously identified to interact with NCs13, which may underlie the modulation of the DHFR conformation that we observe for this RNC (Fig. 4a).
Together, our HDX MS analysis of ribosomal proteins suggests that nascent DHFR does not generically interact with proteins comprising the exit tunnel, but rather samples a biased route during synthesis that is potentially dictated by the chemical properties of the emerging sequence. As previously described for other systems, specific NC–ribosome interactions may further modulate the pathway of cotranslational folding10,67.
Discussion
We analyzed the conformational dynamics of ribosome–chaperone–NC systems using HDX MS. This approach is label-free, yields local information (resolved to peptide level) and simultaneously reports on the structural dynamics of all proteins in the system.
Our results add to the range of folding scenarios previously described to occur at the ribosome3,13,33,34,68. We find that cotranslational folding of DHFR is neither strictly sequential (N terminus to C terminus) nor concerted (all-or-none) but rather proceeds through a combination of both mechanisms. Sequential folding of the middle subdomain poises DHFR for rapid completion of folding in a concerted post-translational step involving both the N terminus and C terminus (Fig. 7). This may be a generic mechanism exploited by proteins with discontinuous domains to minimize the delay between synthesis and acquisition of the native state.
Several studies have found evidence for differences between cotranslational and refolding intermediates6,19,69. During refolding in vitro, elements of the central β-sheet of DHFR fold simultaneously rather than sequentially, and intermediates with a folded ABD are not populated. The ribosome chaperones nascent DHFR in at least two ways. First, by acting as a solubility tag70, the ribosome allows the NC to access conformations that are too aggregation-prone to persist in isolation. Second, occlusion of the dynamic C terminus in the exit tunnel prevents entropic destabilization of vulnerable folding intermediates. Disordered termini can alter the stability and dynamics of folded proteins through excluded volume effects51,71. We reproduce this phenomenon for DHFR and show that the ribosome buffers entropic destabilization during protein synthesis. In this regard, the physical dimensions of the exit tunnel, which are conserved in cytosolic ribosomes72, may have facilitated the evolution of topologically complex folds. Interactions between the NC and ribosome surface (Fig. 6b–g) are also likely to have a role in modulating the stability of cotranslational folding intermediates, as previously shown10,73.
Although the chaperone function of TF off the ribosome is well characterized23,24,53,61,74, its canonical role as a cotranslational chaperone is comparatively poorly understood. Here, we address two outstanding questions: considering that TF binds the majority of nascent proteins, what properties of NCs are recognized, and how does TF engagement influence the conformation of the NC? We find that NCs with little compact structure fail to engage TF, as evidenced by the lack of binding to 1–37RNC and 1–64RNC, as well as FL + 58RNC, which presents an unstructured linker to the chaperone. This argues against the unstructured N terminus being the driver of TF binding to intermediate-length RNCs. Instead, our data indicate that TF prefers partially (1–106RNC, 1–126RNC) or near-fully folded (complemented FL + 28RNC) domains (Fig. 7). Native-like folding is not, however, required for TF binding, shown by the strong recruitment of TF to an RNC exposing a destabilized subdomain. We propose that this remarkable plasticity in binding is achieved by dynamic sampling of an unusually large chaperone–NC interface that includes both hydrophilic and hydrophobic surfaces (Fig. 5). In this model, persistent binding to RNCs would require simultaneous engagement of multiple sites across the TF surface, with ribosome binding by TF contributing to overall avidity. The chemically heterogeneous binding surface accommodates both partially and fully folded domains, allowing continuous engagement of NCs as they mature. Folding may therefore occur while the NC is associated with TF, as has been suggested for refolding off the ribosome61. Importantly, the binding mode we describe allows TF to engage fragile folding intermediates without disrupting the incipient structure. Indeed, we observe that cotranslational folding intermediates of DHFR are essentially identical regardless of the presence of TF.
Interactions with the ribosome surface have been shown to thermodynamically destabilize several full-length nascent domains11,12,14,15 and can stabilize folding intermediates of a full-length immunoglobulin domain relative to the native state73. We find that the conformational dynamics of the folded core of FL DHFR are unperturbed by proximity to the ribosome, and our peptide complementation experiments show that DHFR can fold to an enzymatically active state very close to the ribosome surface (Fig. 7). These observations are not inconsistent with previous findings. Rather, structural destabilization may be local rather than global and may not result in a substantial population of unfolded conformations at equilibrium. As sequence-specific interactions with the ribosome have been implicated in NC destabilization14, it is also likely that any effect is protein-dependent. Our results point to the possibility that ribosome interactions may positively modulate the function of N-terminal domains in multidomain proteins, for example by promoting cofactor loading during synthesis.
Although we chose to analyze in detail only the NC and a subset of ribosomal proteins that directly engage the NC, the high-quality HDX MS dataset we describe here is comprehensive. Our approach could shed new light on other aspects of translation regulation, especially dynamic processes that have been challenging to study using conventional structural biology methods.
Methods
Expression and purification of RNCs
Expression constructs were synthesized by Twist Biosciences. All open reading frames were cloned between a ribosome binding site (TTTGTTTAACTTTAAGAAGGAGA) and 6×His-tag followed by a stop codon in pET21 plasmids with ampicillin resistance. The amino acid sequences encoded by each open reading frame are listed and annotated in Extended Data Table 1. Sequences are listed in Supplementary Table 1.
Extended Data Table 1.
The N-terminal muGFP (green), functioning as an affinity tag, precedes a 3 C cleavage sequence (blue) that is flanked on either side by (GS)2 linkers (bold). The subsequent DHFR sequence fragment is followed by a C-terminal stall-inducing sequence derived from M. succiniciproducens SecM. When describing our RNCs we assume that the ribosome stalls with the penultimate tRNAPro in the A-site and the first 8 residues from the stall-inducing sequence occupying the exit tunnel. In the full-length RNCs an additional disordered linker precedes the stall sequence (bold). In 1-37RNC, the GS linker following the 3 C cleavage sequence is extended to improve capture by the GFP affinity resin. Sites of point mutations are underlined.
Plasmids were used to transform E. coli BL21(DE3) wild-type (NEB) or Δtig (J. Christodoulou, University College London) cells. Bacteria were grown in ZYM 5052 autoinduction media76 with 100 µg ml−1 ampicillin at 37 °C for 18 h. Cells were collected by centrifugation (4,000g, 20 min, 4 °C) and lysed in RNC lysis buffer (Tris-HCl pH 7.5, 150 mM KCl and 10 mM MgCl2) containing 1.25 mg ml−1 lysozyme and subjected to 2× freeze–thaw cycles in the presence of 2.72 Kunitz units per µl DNase (QIAGEN). The soluble fraction was obtained by centrifugation at 20,000g for 20 min, loaded onto a 35% sucrose cushion in high-salt RNC buffer (50 mM HEPES-KOH, pH 7.5, 12 mM Mg(OAc)2, 1 M KOAc and 1 mM dithiothreitol (DTT)) and centrifuged at 4 °C for 2 h at 250,000g (Beckman ultracentrifuge, TLA-110 rotor, tube no. 363305). The ribosome pellet was resuspended overnight at 4 °C in low-salt RNC buffer (50 mM HEPES-KOH, pH 7.5, 12 mM Mg(OAc)2, 100 mM KOAc and 1 mM DTT). The resuspended pellet was applied to in-house-prepared GFP-clamp coupled agarose beads77 and incubated overnight at 4 °C. RNCs were eluted by cleavage with 0.4 mg ml−1 HRV 3C protease for >5 h, followed by a second round of sucrose cushion ultracentrifugation in high-salt RNC buffer (2 h, 250,000g). For low-salt purifications, both sucrose cushions were prepared using low-salt RNC buffer. In all cases, the final pellet was resuspended in low-salt RNC buffer and A260 was measured to estimate the RNC concentration. See also Extended Data Fig. 1a.
Where indicated, purified RNCs were incubated with either 50 µg ml−1 RNaseA (NEB) and 50 mM EDTA, or 2.5 mM puromycin (Santa Cruz Biotechnology) at 20 °C for 20 min.
Expression and purification of isolated DHFR variants
E. coli BL21(DE3) cells were transformed with the plasmid expressing FL DHFR from a T7 promoter (supplied as control plasmid with the NEB PURExpress kit) and selected on an LB agar plate supplemented with 100 µg ml−1 ampicillin. A single colony was used to inoculate 500 ml of ZYM 5052 autoinduction media containing 100 µg ml−1 ampicillin. The culture was grown for 24 h in an orbital shaker set to 37 °C. Cells were collected by centrifugation and resuspended in lysis buffer (25 mM Tris-HCl pH 7.5, 1 mM EDTA and 2 mM DTT). The resuspended cells were supplemented with 1 mM phenylmethanesulfonyl fluoride (PMSF), 0.025 units per ml benzonase, two tablets of EDTA-free protease inhibitor cocktail and then lysed by sonication. The lysate was centrifuged at 48,000g, 4 °C, 45 min, and the supernatant was loaded onto a 6 ml RESOURCE Q column connected to an ÄKTA Pure Protein Purification System. The bound protein was eluted with a linear gradient of 0–500 mM NaCl in lysis buffer. The eluted fractions were examined using NuPAGE 4–12% Bis-Tris gels and the fractions with pure protein were pooled, concentrated using centrifugal filters and injected into a HiLoad 16/600 Superdex 200 pg column that was equilibrated with gel filtration buffer (25 mM Tris-HCl pH 7.5, 200 mM NaCl, 1 mM EDTA and 1 mM DTT). The fractions with pure protein were pooled, concentrated using centrifugal filters, flash-frozen in liquid nitrogen and stored at −80 °C. The concentration of the protein was estimated by absorbance at 280 nm, using an extinction coefficient of 33,585 M−1 cm−1.
FL + 50stop and 1–126stop were generated from FL + 58RNC and 1–126RNC, respectively, by replacing the SecM sequence with a stop codon using Q5 site-directed mutagenesis (NEB). Proteins were expressed as described above for RNCs. Cells were collected by centrifugation as above, and the pellet was resuspended in lysis buffer supplemented with 1 µl ml−1 DNase (Roche), 1.25 mg ml−1 lysozyme and 1 protease inhibitor tablet (Roche), then incubated for 30 min at 4 °C. Cells were lysed by sonication (3× 1 min, 40% amplitude) and the soluble fraction was isolated by centrifugation for 40 min at 20,000g using a JA-25.50 rotor in a Beckman Avanti J-26S XP centrifuge. The supernatant was filtered (0.2 µm) and loaded onto a RESOURCE Q column connected to ÄKTA Pure Protein Purification System. The protein was eluted using a linear gradient to 50% elution buffer (25 mM Tris, pH 7.4 at 4 °C, 1 mM EDTA, 2 mM DTT and 1 M NaCl) in 20 column volumes. Fractions containing muGFP–DHFR fusion proteins were concentrated and the muGFP-tag was cleaved by incubation with 0.4 mg ml−1 3C protease overnight at 4 °C. A second, identical round of purification on a RESOURCE Q column was then performed to separate DHFR and muGFP. Fractions containing cleaved DHFR were pooled, concentrated and purified from remaining contaminants by size-exclusion chromatography using a Superdex75i column in gel filtration buffer. Fractions containing pure protein were pooled, concentrated using centrifugal filters, flash-frozen in liquid nitrogen and stored at −80 °C. Protein concentrations were determined by absorbance at 280 nm.
Expression and purification of TF
TF was expressed with a cleavable N-terminal 6×His-tag from a pPROEX-HTa vector58. TF ΔRBS (F44A/R45A/K46A)52 and monomeric TF (V39E, I76E, I80A)60 were generated using Q5 site-directed mutagenesis (NEB).
E. coli BL21(DE3) cells were transformed with the TF plasmids and selected on LB agar plates supplemented with 100 µg ml−1 ampicillin. A single colony was used to inoculate 1 l of ZYM 5052 autoinduction media containing 100 µg ml−1 ampicillin, which was grown overnight at 37 °C. The cells were collected by centrifugation and resuspended in buffer containing 50 mM Tris-HCl, pH 7.5, 250 mM NaCl, 10% (v/v) glycerol, 2 mM β-mercaptoethanol, 0.025 units per ml benzonase and 0.1 mM PMSF. The resuspended cells were supplemented with 1 mM PMSF and a tablet of EDTA-free protease inhibitor cocktail and then lysed by sonication. The lysate was centrifuged at 48,000g for 45 min at 4 °C and the supernatant was loaded onto a 5 ml HisTrap HP column connected to an ÄKTA Pure Protein Purification System. The bound protein was eluted by washing the column with a linear gradient of 0–500 mM imidazole in lysis buffer. Fractions containing pure protein were pooled, supplemented with 1:100 TEV protease to cleave the His-tag and dialyzed in a buffer containing 20 mM Tris-HCl, pH 7.5, 250 mM NaCl and 2 mM DTT at 4 °C. The protein was passed through a HisTrap column and the flow-through containing the cleaved protein was concentrated using centrifugal filters and injected into a Superdex 200i 10/300 GL column that was equilibrated with 20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 5% (v/v) glycerol and 1 mM DTT. For monomeric TF, a HiPrep 26/60 Sephacryl S-300 HR column was used for the final SEC step. Fractions containing pure protein were pooled, concentrated using centrifugal filters, flash-frozen in liquid nitrogen and stored at −80 °C. The concentration of the protein was estimated using Bradford’s assay78.
Mass spectrometry of RNCs
RNCs were purified as described above in either high-salt RNC buffer or, to preserve salt-sensitive interactions, in low-salt RNC buffer. Proteins were run for 8 mm using NuPAGE 12% Bis-Tris Gel 1.0 mm 12 wells before Coomassie blue staining (Quick Coomassie Stain, Generon). Excised entire 8 mm bands were diced, destained, alkylated and digested with trypsin (modified sequencing grade V5111, Promega). Digests were loaded onto Evotips (Evosep) and tryptic peptides were eluted using the '30SPD' gradient via an Evosep One79 HPLC fitted with a 15-cm C18 column (EV1074) into a Lumos Tribrid Orbitrap mass spectrometer (Thermo Scientific) with a nanospray emitter operated at 2,200 V. The Orbitrap was operated in data-dependent acquisition mode with precursor ion spectra acquired at 120,000 resolution in the Orbitrap and MS/MS spectra in the ion trap at 32% HCD collision energy in TopS mode. Dynamic exclusion was set to ±10 ppm over 15 s, automatic gain control to standard and max. injection time to dynamic. The vendor’s universal method was adopted to schedule the ion trap accumulation times. Raw files were processed using Maxquant80 (maxquant.org) and Perseus81 (maxquant.net/perseus) with a recent download of the Uniprot E. coli reference proteome database together with a common contaminants database. A decoy database of reversed sequences was used to filter false positives, with both peptide and protein false detection rates set to 1%. Quantification of individual E. coli proteins was achieved using iBAQ (intensity-based absolute quantification) values normalized to the mean iBAQ value across all ribosomal proteins in each sample. All proteomics data have been deposited in the ProteomeXchange Consortium through the PRIDE82 partner repository with dataset identifier PXD036784.
Enzyme activity assays
DHFR activity was measured in low-salt RNC buffer with saturating concentrations of NADPH (Sigma-Aldrich, N6505) and DHF (Sigma-Aldrich, D7006) by following NADPH oxidation through the change in absorbance at 340 nm. For each reaction of 100 µl, 25 nM of DHFR was incubated with 100 µM NADPH for 10 min at 20 °C, then 100 µM of DHF was added and the absorbance at 340 nm was immediately recorded for 100 s. To convert the change in absorbance over time into the initial rate, the differential extinction coefficient value of 12.4 mM−1 cm−1 was used. All experiments were performed in triplicate at 21 °C and measured using a Jasco V-550 Spectrophotometer.
Michaelis–Menten parameters were determined by measuring enzyme activity of 25 nM DHFR as a function of increasing concentration of DHF, keeping NADPH constant at 100 µM. DHF was varied from 0.1 µM to 30 µM, and the dilutions were made in low-salt RNC buffer for all samples.
Where indicated, 500 nM methotrexate (Sigma-Aldrich, A6770) was added to the samples 20 min before measurement. Where indicated, peptides corresponding to the C terminus of DHFR were added to the samples and incubated on ice for 40 min before adding the substrates. The incubation time was optimized to ensure that the reaction had reached equilibrium. Peptides (C10, acetyl-SYCFEILERR-amine or Scr, acetyl-RFIERCELYS-amine) were synthesized by the Peptide Chemistry Science Technology Platform (Francis Crick Institute) and reconstituted in low-salt RNC buffer.
Data were plotted and analyzed in GraphPad Prism 9.
Pelleting assays
To measure TF binding to RNCs, 10 µM of purified TF was incubated with 1.5-2 µM RNC at 30 °C for 20 min. The reactions were then loaded onto 35% sucrose cushions prepared in either high-salt or low-salt RNC buffer and subjected to ultracentrifugation for 2 h at 250,000g (Beckman ultracentrifuge, TLA-100 rotor) to separate unbound TF from the ribosomal fraction. The pellet containing ribosomes was washed once, then resuspended in low-salt RNC buffer and analyzed by SDS–PAGE with Coomassie staining, or immunoblot with antibodies against TF (GenScript, A01329; 1:1,000 dilution) and small subunit ribosomal protein S2 (abx110548; 1:1,000 dilution).
HDX MS
In addition to the descriptions below, comprehensive experimental details and parameters are provided in Supplementary Data 1, in the recommended83 tabular format. All HDX MS data have been deposited in the ProteomeXchange Consortium through the PRIDE partner repository82 with dataset identifier PXD036945. Supplementary Data 1 contains all the values used to create figures containing HDX MS data.
Deuterium labeling
Stock concentrations of purified constructs are listed in the Exp. Parameters and Replication tab in the Supplementary Data 1. All isolated proteins or RNCs began in storage buffer as follows. For free FL DHFR and DHFR RNC constructs, 50 mM HEPES-KOH, pH 7.5, 100 mM KOAc, 12 mM Mg(OAc)2 and 1 mM DTT; for FL + 50stop, 25 mM Tris, 1 mM EDTA, 1 mM DTT and 200 mM NaCl; for TF constructs, 20 mM Tris-HCL, pH 7.5, 150 mM NaCl, 5% glycerol and 1 mM DTT; for free ribosomes (obtained from NEB, P0763S), 20 mM HEPES-KOH, pH 7.6, 10 mM Mg(OAc)2, 30 mM KCl and 7 mM β-mercaptoethanol. Proteins were diluted, as needed, to the concentration required for HDX and then labeled with deuterium.
Deuterium labeling was initiated with a 15-fold dilution into labeling buffer (30 μl, 10 mM HEPES-KOH, pH 7.5, 25 mM KOAc, 12 mM Mg(OAc)2, 1 mM DTT and 99.9% D2O). After each labeling time (10 s, 100 s and 1,000 s) at 23 °C, the labeling reaction was quenched with the addition of 20 μl of ice-cold quenching buffer (200 mM potassium phosphate, pH 2.44, 4 M guanidinium chloride, 0.72 M TCEP and H2O), 10 μl 50% immobilized pepsin bead slurry (prepared in-house using POROS beads84 in water with 0.1% formic acid and held on ice for 5 min. After on-ice in-solution digestion, the mixture was spun for 15 s at 16,000g and 4 °C in Corning Costar Spin-X centrifuge tube filters (Sigma-Aldrich, CLS8163-100EA) and then the flow-through was immediately injected into a Waters M-class Acquity UPLC with HDX technology for liquid chromatography–MS analysis. Undeuterated control samples were prepared for each experiment using the same procedure as outlined above but using 10 mM HEPES-KOH, pH 7.5, 25 mM KOAc, 12 mM Mg(OAc)2, 1 mM DTT and 99.9% H2O in place of the labeling buffer. Maximally deuterated samples (maxD) were prepared as previously described85 for both FL DHFR and FL + 50stop.
Liquid chromatography–ion-mobility spectrometry–MS
The cooling chamber of the UPLC system (based on a previous publication86), which housed all the chromatographic elements, was held at 0.0 ± 0.1 °C for the entire time of the measurements. Peptides were trapped and desalted on a VanGuard Pre-Column trap (2.1 mm × 5 mm; ACQUITY UPLC BEH C18, 1.7 μm; Waters, 186002346) for 3 min at 100 μl min−1. Peptides were then eluted from the trap using a 5–35% gradient of acetonitrile over 20 min at a flow rate of 100 μl min−1 and separated using an ACQUITY UPLC HSS T3, 1.8 μm, 1.0 mm × 50 mm column (Waters, 186003535). The back pressure averaged ~12,950 psi at 0 °C and 5% acetonitrile, 95% water, 0.1% formic acid. Mass spectra were acquired using a Waters Synapt G2-Si HDMSE mass spectrometer in ion mobility mode. The mass spectrometer was calibrated with direct infusion of a solution of glu-fibrinopeptide (Sigma-Aldrich, F3261) at 200 fmol μl−1 at a flow rate of 5 μl min−1 before data collection. A conventional electrospray source was used, and the instrument was scanned over a range of 50–2,000 m/z. The instrument configuration was as follows: capillary, 2.5 kV; trap collision energy, 4 V; sampling cone, 40 V; source temperature, 80 °C; desolvation temperature, 175 °C. All comparison experiments were performed under identical experimental conditions such that deuterium levels were not corrected for back-exchange and are therefore reported as relative87. Replicates were technical; that is, independent labeling reactions were performed using the same batch of purified protein. The error in determining the deuterium levels was ±0.25 Da in this experimental setup.
HDX MS data processing
Peptides were identified from replicate HDMSE analyses (as detailed in Supplementary Data 1) of undeuterated control samples using PLGS v.3.0.1 (Waters Corporation). Peptide masses were identified from searches using nonspecific cleavage of a custom database containing the sequences of each DHFR construct (based on wild-type E. coli Uniprot P0ABQ4 that included the linker sequence; see peptide maps in Supplementary Data 1), TF (E. coli Uniprot P0A850) and all protein sequences in the E. coli ribosome as extracted from PDB 4YBB. Searches used the following parameters: no missed cleavages, no PTMs, a low energy threshold of 135, an elevated energy threshold of 35 and an intensity threshold of 500. No false discovery rate control was performed. The peptides identified in PLGS (excluding all neutral loss and in-source fragmentation identifications) were then filtered in DynamX v.3.0 (Waters Corporation), implementing minimum products per amino acid and consecutive product ion cutoffs described in the Exp. Parameters and Replication tab of Supplementary Data 1. Peptides meeting the filtering criteria to this point were further processed by DynamX v.3.0 (Waters Corporation), including manual inspection of each mass spectrum. The relative amount of deuterium in each peptide was determined with the software by subtracting the centroid mass of the undeuterated form of each peptide from the deuterated form, at each time point, for each condition. These deuterium uptake values were used to generate all uptake graphs and difference maps.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41594-024-01355-x.
Supplementary information
Source data
Acknowledgements
We thank J. Christodoulou (University College London) for the kind gift of the Δtig E. coli strain; S. Mouilleron (Francis Crick Institute) for the 3C protease and TEV protease; K. Fadgen (Waters) for technical assistance with HDX MS; and the Peptide Chemistry Science Technology Platform at the Francis Crick institute for peptide synthesis. D.B. is supported by the Francis Crick Institute, which receives its core funding from Cancer Research UK (CC2025, CC1063, CC1065 and CC1068), the UK Medical Research Council (CC2025, CC1063, CC1065 and CC1068) and the Wellcome Trust (CC2025, CC1063, CC1065 and CC1068). For the purpose of open access, the author has applied a CC BY public copyright license to any author accepted manuscript version arising from this submission. J.R.E. acknowledges funding from the National Institutes of Health (R01-CA233978) and the James L. Waters Chair in Analytical Chemistry.
Extended data
Author contributions
F.U.H., J.R.E. and D.B. conceived the project. T.E.W., A.P., A.R., S.S., S.H., S.K. and D.B. conducted the investigation. T.E.W., A.P., A.R., S.S., S.H., J.R.E. and D.B. performed the formal analysis, data curation and visualization: T.E.W., A.P. and D.B. wrote the original draft of the manuscript; all authors reviewed and edited the final draft.
Peer review
Peer review information
Nature Structural & Molecular Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Sara Osman, in collaboration with the Nature Structural & Molecular Biology team. Peer reviewer reports are available.
Funding
Open Access funding provided by The Francis Crick Institute.
Data availability
All mass spectrometry data have been deposited in the ProteomeXchange Consortium through the PRIDE partner repository with dataset identifiers PXD036784 and PXD036945. Proteomic analysis used the Uniprot E. coli reference proteome (UP000000625). Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Thomas E. Wales, Aleksandra Pajak.
Contributor Information
John R. Engen, Email: j.engen@northeastern.edu
David Balchin, Email: david.balchin@crick.ac.uk.
Extended data
is available for this paper at 10.1038/s41594-024-01355-x.
Supplementary information
The online version contains supplementary material available at 10.1038/s41594-024-01355-x.
References
- 1.Dill, K. A. & MacCallum, J. L. The protein-folding problem, 50 years on. Science338, 1042–1046 (2012). [DOI] [PubMed] [Google Scholar]
- 2.Liutkute, M., Maiti, M., Samatova, E., Enderlein, J. & Rodnina, M. V. Gradual compaction of the nascent peptide during cotranslational folding on the ribosome. eLife9, e60895 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Holtkamp, W. et al. Cotranslational protein folding on the ribosome monitored in real time. Science350, 1104–1107 (2015). [DOI] [PubMed] [Google Scholar]
- 4.Agirrezabala, X. et al. A switch from α-helical to β-strand conformation during co-translational protein folding. EMBO J.41, e109175 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Netzer, W. J. & Hartl, F. U. Recombination of protein domains facilitated by co-translational folding in eukaryotes. Nature388, 343–349 (1997). [DOI] [PubMed] [Google Scholar]
- 6.Frydman, J., Erdjument-Bromage, H., Tempst, P. & Hartl, F. U. Co-translational domain folding as the structural basis for the rapid de novo folding of firefly luciferase. Nat. Struct. Biol.6, 697–705 (1999). [DOI] [PubMed] [Google Scholar]
- 7.Liu, K., Maciuba, K. & Kaiser, C. M. The ribosome cooperates with a chaperone to guide multi-domain protein folding. Mol. Cell74, 310–319.e7 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bhushan, S. et al. α-Helical nascent polypeptide chains visualized within distinct regions of the ribosomal exit tunnel. Nat. Struct. Mol. Biol.17, 313–317 (2010). [DOI] [PubMed] [Google Scholar]
- 9.Nilsson, O. B. et al. Cotranslational protein folding inside the ribosome exit tunnel. Cell Rep.12, 1533–1540 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ahn, M. et al. Modulating co-translational protein folding by rational design and ribosome engineering. Nat. Commun.13, 4243 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jensen, M. K., Samelson, A. J., Steward, A., Clarke, J. & Marqusee, S. The folding and unfolding behavior of ribonuclease H on the ribosome. J. Biol. Chem.295, 11410–11417 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Samelson, A. J., Jensen, M. K., Soto, R. A., Cate, J. H. D. & Marqusee, S. Quantitative determination of ribosome nascent chain stability. Proc. Natl Acad. Sci. USA113, 13402–13407 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cabrita, L. D. et al. A structural ensemble of a ribosome–nascent chain complex during cotranslational protein folding. Nat. Struct. Mol. Biol.23, 278–285 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cassaignau, A. M. E. et al. Interactions between nascent proteins and the ribosome surface inhibit co-translational folding. Nat. Chem.13, 1214–1220 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kaiser, C. M., Goldman, D. H., Chodera, J. D., Tinoco, I. & Bustamante, C. The ribosome modulates nascent protein folding. Science334, 1723–1727 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Nilsson, O. B. et al. Cotranslational folding of spectrin domains via partially structured states. Nat. Struct. Mol. Biol.24, 221–225 (2017). [DOI] [PubMed] [Google Scholar]
- 17.Nilsson, O. B., Müller-Lucks, A., Kramer, G., Bukau, B. & von Heijne, G. Trigger factor reduces the force exerted on the nascent chain by a cotranslationally folding protein. J. Mol. Biol.428, 1356–1364 (2016). [DOI] [PubMed] [Google Scholar]
- 18.Farías-Rico, J. A., Ruud Selin, F., Myronidi, I., Frühauf, M. & von Heijne, G. Effects of protein size, thermodynamic stability, and net charge on cotranslational folding on the ribosome. Proc. Natl Acad. Sci. USA115, E9280–E9287 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Clark, P. L. & King, J. A newly synthesized, ribosome-bound polypeptide chain adopts conformations dissimilar from early in vitro refolding intermediates. J. Biol. Chem.276, 25411–25420 (2001). [DOI] [PubMed] [Google Scholar]
- 20.Notari, L., Martínez-Carranza, M., Farías-Rico, J. A., Stenmark, P. & von Heijne, G. Cotranslational folding of a pentarepeat β-helix protein. J. Mol. Biol.430, 5196–5206 (2018). [DOI] [PubMed] [Google Scholar]
- 21.Oh, E. et al. Selective ribosome profiling reveals the cotranslational chaperone action of trigger factor in vivo. Cell147, 1295–1308 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Deuerling, E., Schulze-Specking, A., Tomoyasu, T., Mogk, A. & Bukau, B. Trigger factor and DnaK cooperate in folding of newly synthesized proteins. Nature400, 693–696 (1999). [DOI] [PubMed] [Google Scholar]
- 23.Saio, T., Guan, X., Rossi, P., Economou, A. & Kalodimos, C. G. Structural basis for protein antiaggregation activity of the trigger factor chaperone. Science344, 1250494 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Martinez-Hackert, E. & Hendrickson, W. A. Promiscuous substrate recognition in folding and assembly activities of the trigger factor chaperone. Cell138, 923–934 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Scholz, C., Stoller, G., Zarnt, T., Fischer, G. & Schmid, F. X. Cooperation of enzymatic and chaperone functions of trigger factor in the catalysis of protein folding. EMBO J.16, 54–58 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Shieh, Y.-W. et al. Operon structure and cotranslational subunit association direct protein assembly in bacteria. Science350, 678–680 (2015). [DOI] [PubMed] [Google Scholar]
- 27.Hoffmann, A. et al. Concerted action of the ribosome and the associated chaperone trigger factor confines nascent polypeptide folding. Mol. Cell48, 63–74 (2012). [DOI] [PubMed] [Google Scholar]
- 28.Agashe, V. R. et al. Function of trigger factor and DnaK in multidomain protein folding: increase in yield at the expense of folding speed. Cell117, 199–209 (2004). [DOI] [PubMed] [Google Scholar]
- 29.O’Brien, E. P., Christodoulou, J., Vendruscolo, M. & Dobson, C. M. Trigger factor slows co-translational folding through kinetic trapping while sterically protecting the nascent chain from aberrant cytosolic interactions. J. Am. Chem. Soc.134, 10920–10932 (2012). [DOI] [PubMed] [Google Scholar]
- 30.Tian, P. et al. Folding pathway of an Ig domain is conserved on and off the ribosome. Proc. Natl Acad. Sci. USA115, E11284–E11293 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Guinn, E. J., Tian, P., Shin, M., Best, R. B. & Marqusee, S. A small single-domain protein folds through the same pathway on and off the ribosome. Proc. Natl Acad. Sci. USA115, 12206–12211 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Marsden, A. P. et al. Investigating the effect of chain connectivity on the folding of a beta-sheet protein on and off the ribosome. J. Mol. Biol.430, 5207–5216 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Maciuba, K., Rajasekaran, N., Chen, X. & Kaiser, C. M. Co‐translational folding of nascent polypeptides: multi‐layered mechanisms for the efficient biogenesis of functional proteins. BioEssays43, 2100042 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cassaignau, A. M. E., Cabrita, L. D. & Christodoulou, J. How does the ribosome fold the proteome? Annu. Rev. Biochem.89, 389–415 (2020). [DOI] [PubMed] [Google Scholar]
- 35.Schnell, J. R., Dyson, H. J. & Wright, P. E. Structure, dynamics, and catalytic function of dihydrofolate reductase. Annu. Rev. Biophys. Biomol. Struct.33, 119–140 (2004). [DOI] [PubMed] [Google Scholar]
- 36.Arai, M., Iwakura, M., Matthews, C. R. & Bilsel, O. Microsecond subdomain folding in dihydrofolate reductase. J. Mol. Biol.410, 329–342 (2011). [DOI] [PubMed] [Google Scholar]
- 37.Young, R. & Bremer, H. Polypeptide-chain-elongation rate in Escherichia coli B/r as a function of growth rate. Biochem. J.160, 185–194 (1976). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cassaignau, A. M. E. et al. A strategy for co-translational folding studies of ribosome-bound nascent chain complexes using NMR spectroscopy. Nat. Protoc.11, 1492–1507 (2016). [DOI] [PubMed] [Google Scholar]
- 39.Evans, M. S., Ugrinov, K. G., Frese, M.-A. & Clark, P. L. Homogeneous stalled ribosome nascent chain complexes produced in vivo or in vitro. Nat. Methods2, 757–762 (2005). [DOI] [PubMed] [Google Scholar]
- 40.Rutkowska, A. et al. Large-scale purification of ribosome-nascent chain complexes for biochemical and structural studies. FEBS Lett.583, 2407–2413 (2009). [DOI] [PubMed] [Google Scholar]
- 41.Cymer, F., Hedman, R., Ismail, N. & von Heijne, G. Exploration of the arrest peptide sequence space reveals arrest-enhanced variants. J. Biol. Chem.290, 10208–10215 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lu, J. & Deutsch, C. Folding zones inside the ribosomal exit tunnel. Nat. Struct. Mol. Biol.12, 1123–1129 (2005). [DOI] [PubMed] [Google Scholar]
- 43.Marcsisin, S. R. & Engen, J. R. Hydrogen exchange mass spectrometry: what is it and what can it tell us? Anal. Bioanal. Chem.397, 967–972 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hu, W. et al. Stepwise protein folding at near amino acid resolution by hydrogen exchange and mass spectrometry. Proc. Natl Acad. Sci. USA110, 7684–7689 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Walters, B. T., Mayne, L., Hinshaw, J. R., Sosnick, T. R. & Englander, S. W. Folding of a large protein at high structural resolution. Proc. Natl Acad. Sci. USA110, 18898–18903 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Georgescauld, F. et al. GroEL/ES chaperonin modulates the mechanism and accelerates the rate of TIM-barrel domain folding. Cell157, 922–934 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Balchin, D., Miličić, G., Strauss, M., Hayer-Hartl, M. & Hartl, F. U. Pathway of actin folding directed by the eukaryotic chaperonin TRiC. Cell174, 1507–1521.e16 (2018). [DOI] [PubMed] [Google Scholar]
- 48.Jones, B. E. & Robert Matthews, C. Early intermediates in the folding of dihydrofolate reductase from Escherichia coli detected by hydrogen exchange and NMR. Protein Sci.4, 167–177 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gegg, C. V., Bowers, K. E. & Matthews, C. R. Probing minimal independent folding units in dihydrofolate reductase by molecular dissection. Protein Sci.6, 1885–1892 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Scott, D. J. et al. A novel ultra-stable, monomeric green fluorescent protein for direct volumetric imaging of whole organs using CLARITY. Sci. Rep.8, 667 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Keul, N. D. et al. The entropic force generated by intrinsically disordered segments tunes protein function. Nature563, 584–588 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kramer, G. et al. L23 protein functions as a chaperone docking site on the ribosome. Nature419, 171–174 (2002). [DOI] [PubMed] [Google Scholar]
- 53.Hoffmann, A., Bukau, B. & Kramer, G. Structure and function of the molecular chaperone trigger factor. Biochim. Biophys. Acta1803, 650–661 (2010). [DOI] [PubMed] [Google Scholar]
- 54.Kramer, G. et al. Trigger factor peptidyl-prolyl cis/trans isomerase activity is not essential for the folding of cytosolic proteins in Escherichia coli. J. Biol. Chem.279, 14165–14170 (2004). [DOI] [PubMed] [Google Scholar]
- 55.Deeng, J. et al. Dynamic behavior of trigger factor on the ribosome. J. Mol. Biol.428, 3588–3602 (2016). [DOI] [PubMed] [Google Scholar]
- 56.Merz, F. et al. Molecular mechanism and structure of trigger factor bound to the translating ribosome. EMBO J.27, 1622–1632 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Lakshmipathy, S. K. et al. Identification of nascent chain interaction sites on trigger factor. J. Biol. Chem.282, 12186–12193 (2007). [DOI] [PubMed] [Google Scholar]
- 58.Kaiser, C. M. et al. Real-time observation of trigger factor function on translating ribosomes. Nature444, 455–460 (2006). [DOI] [PubMed] [Google Scholar]
- 59.Patzelt, H. et al. Three-state equilibrium of Escherichia coli trigger factor. Biol. Chem.383, 1611–1619 (2002). [DOI] [PubMed] [Google Scholar]
- 60.Saio, T., Kawagoe, S., Ishimori, K. & Kalodimos, C. G. Oligomerization of a molecular chaperone modulates its activity. eLife7, e35731 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Wu, K., Minshull, T. C., Radford, S. E., Calabrese, A. N. & Bardwell, J. C. A. Trigger factor both holds and folds its client proteins. Nat. Commun.13, 4126 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Bershtein, S., Mu, W. & Shakhnovich, E. I. Soluble oligomerization provides a beneficial fitness effect on destabilizing mutations. Proc. Natl Acad. Sci. USA109, 4857–4862 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Maguire, M. E. & Cowan, J. A. Magnesium chemistry and biochemistry. Biometals15, 203–210 (2002). [DOI] [PubMed] [Google Scholar]
- 64.Zhang, J. et al. Mechanisms of ribosome stalling by SecM at multiple elongation steps. eLife4, e09684 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Gersteuer, F. et al. The SecM arrest peptide traps a pre-peptide bond formation state of the ribosome. Nat. Commun.15, 2431 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Lin, K.-F. et al. Cotranslational protein folding within the ribosome tunnel influences trigger-factor recruitment. Biophys. J.102, 2818–2827 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Guzman-Luna, V., Fuchs, A. M., Allen, A. J., Staikos, A. & Cavagnero, S. An intrinsically disordered nascent protein interacts with specific regions of the ribosomal surface near the exit tunnel. Commun. Biol.4, 1236 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Chen, X., Rajasekaran, N., Liu, K. & Kaiser, C. M. Synthesis runs counter to directional folding of a nascent protein domain. Nat. Commun.11, 5096 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Samelson, A. J. et al. Kinetic and structural comparison of a protein’s cotranslational folding and refolding pathways. Sci. Adv.4, eaas9098 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Schimmele, B., Gräfe, N. & Plückthun, A. Ribosome display of mammalian receptor domains. Protein Eng. Des. Sel.18, 285–294 (2005). [DOI] [PubMed] [Google Scholar]
- 71.Bickel, T., Jeppesen, C. & Marques, C. M. Local entropic effects of polymers grafted to soft interfaces. Eur. Phys. J. E Soft Matter4, 33–43 (2001). [Google Scholar]
- 72.Dao Duc, K., Batra, S. S., Bhattacharya, N., Cate, J. H. D. & Song, Y. S. Differences in the path to exit the ribosome across the three domains of life. Nucleic Acids Res.47, 4198–4210 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Chan, S. H. S. et al. The ribosome stabilizes partially folded intermediates of a nascent multi-domain protein. Nat. Chem.14, 1165–1173 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Mashaghi, A. et al. Reshaping of the conformational search of a protein by the chaperone trigger factor. Nature500, 98–101 (2013). [DOI] [PubMed] [Google Scholar]
- 75.Eisenberg, D., Schwarz, E., Komaromy, M. & Wall, R. Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J. Mol. Biol.179, 125–142 (1984). [DOI] [PubMed] [Google Scholar]
- 76.Studier, F. W. Protein production by auto-induction in high-density shaking cultures. Protein Expr. Purif.41, 207–234 (2005). [DOI] [PubMed] [Google Scholar]
- 77.Hansen, S. et al. Design and applications of a clamp for green fluorescent protein with picomolar affinity. Sci. Rep.7, 16292 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Bradford, M. M. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal. Biochem.72, 248–254 (1976). [DOI] [PubMed] [Google Scholar]
- 79.Bache, N. et al. A novel LC system embeds analytes in pre-formed gradients for rapid, ultra-robust proteomics. Mol. Cell Proteom.17, 2284–2296 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol.26, 1367–1372 (2008). [DOI] [PubMed] [Google Scholar]
- 81.Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods13, 731–740 (2016). [DOI] [PubMed] [Google Scholar]
- 82.Perez-Riverol, Y. et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res.50, D543–D552 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Masson, G. R. et al. Recommendations for performing, interpreting and reporting hydrogen deuterium exchange mass spectrometry (HDX–MS) experiments. Nat. Methods16, 595–602 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Wang, L., Pan, H. & Smith, D. L. Hydrogen exchange–mass spectrometry: optimization of digestion conditions. Mol. Cell Proteom.1, 132–138 (2002). [DOI] [PubMed] [Google Scholar]
- 85.Peterle, D., Wales, T. E. & Engen, J. R. Simple and fast maximally deuterated control (maxD) preparation for hydrogen–deuterium exchange mass spectrometry experiments. Anal. Chem.94, 10142–10150 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Wales, T. E., Fadgen, K. E., Gerhardt, G. C. & Engen, J. R. High-speed and high-resolution UPLC separation at zero degrees Celsius. Anal. Chem.80, 6815–6820 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Wales, T. E. & Engen, J. R. Hydrogen exchange mass spectrometry for the analysis of protein dynamics. Mass Spectrom. Rev.25, 158–170 (2006). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All mass spectrometry data have been deposited in the ProteomeXchange Consortium through the PRIDE partner repository with dataset identifiers PXD036784 and PXD036945. Proteomic analysis used the Uniprot E. coli reference proteome (UP000000625). Source data are provided with this paper.