Abstract
Most proteins begin to fold during biosynthesis on the ribosome. It has been suggested that interactions between the emerging polypeptide and the ribosome surface might allow the ribosome itself to modulate co-translational folding. Here we combine protein engineering and NMR spectroscopy to characterize a series of interactions between the ribosome surface and unfolded nascent chains of the immunoglobulin-like FLN5 filamin domain. The strongest interactions are found for a C-terminal segment that is essential for folding, and we demonstrate quantitative agreement between the strength of this interaction and the energetics of the co-translational folding process itself. Mutations in this region that reduce the extent of binding result in a shift in the co-translational folding equilibrium towards the native state. Our results therefore demonstrate that a competition between folding and binding provides a simple, dynamic mechanism for the modulation of co-translational folding by the ribosome.
Subject terms: Protein folding, Solution-state NMR, Computational models, Ribosome
During polypeptide biosynthesis, a strong interaction can occur between a segment of an emerged, disordered nascent protein and the ribosomal surface. Now, it has been shown that competition between this ribosomal binding and the folding energetics of an immunoglobulin-like domain modulates the mechanism of co-translational folding.
Main
In the cell, most nascent polypeptide chains begin to fold during biosynthesis1–3. In many cases, co-translational folding increases the ability of a protein to efficiently attain its native structure4–7. This may in part be due to the ribosome modulating the conformational ensembles sampled by nascent chains8–11. The ribosome constrains disordered chains close to its charged surface12, and can promote the early formation of compact states during co-translational folding13–16. In general, nascent chains have been found to be destabilized when they are bound to the ribosome5,17–19. This could be linked to interactions between the disordered nascent chain and the ribosome surface20, some of which bear a partial electrostatic character21,22. Ribosome–nascent chain interactions have also been suggested to attenuate co-translational folding rates23, and to compete with co-translational assembly between a nascent chain and its binding partner24. Collectively, these studies point clearly towards a role for the ribosome in shaping the onset of co-translational folding3. However, due in large part to the technical difficulty of measuring the intramolecular equilibria associated with ribosome–nascent chain interactions, a link between the energetics of ribosome interactions and co-translational folding outcomes has not yet been established in quantitative terms.
In this article we study the co-translational folding of FLN5, a 105 residue immunoglobulin-like domain from the tandem repeat protein filamin (Fig. 1a), using SecM-arrested ribosome–nascent chain complexes (RNCs) in which FLN5 is tethered to the ribosome via variable lengths of the following domain, FLN6 (Fig. 1b)20,25. Measurements of the accessibility of a C-terminal cysteine to covalent modification by PEG-maleimide (PEGylation) showed that the entire FLN5 domain emerges beyond the ribosome exit tunnel for linkers comprising at least 31 residues20 (FLN5+31). However, NMR observations demonstrated that FLN5 remains partially unfolded until the linker extends beyond 42 residues. This offset between the emergence of FLN5 and its folding suggests that the ribosome has a destabilizing effect on co-translational folding, which we speculated could be due to interactions between the unfolded nascent chain and the ribosome surface20. We use NMR spectroscopy together with protein engineering, molecular dynamics simulations and PEGylation measurements of nascent chain stability to identify a series of interaction sites on the nascent chain of varying affinities, and their impact on co-translational folding.
Results
Identification of regions of FLN5 RNCs interacting with the ribosome surface
Since the FLN5+31 RNC represents the first point during protein biosynthesis when the entire sequence is available for folding, we have used this biosynthetic snapshot as a starting point to examine how the ribosome might modulate the dynamic properties of a nascent chain. Previously, a comparison of the 2D 1H,15N NMR correlation spectrum of a FLN5+31 RNC against an isolated unfolded variant, Y719E, revealed site-selective line broadening that we interpreted as evidence of such interactions (Fig. 1c,d). From this, we have identified three main regions for investigation here. First, two clusters of aromatic residues, located within the core of the folded domain (Fig. 1a), were identified as displaying substantial broadening; these are referred to here as ‘F3’ (residues F665–F675) and ‘Y3’ (residues Y715–Y727) sites (Fig. 1d, green). Second, strong line broadenings were observed in the mildly basic C-terminal region of FLN5 (residues N728–C747, the ‘C-terminal segment’ (Fig. 1d, red). These observations correlate well with measurements of the interaction between individual amino acids and purified 70S ribosomes, which are strongest for aromatic and basic side chains (Fig. 1e and Extended Data Fig. 1). Third, resonances of the FLN6 tether are broadened to varying extents across FLN5 RNCs of increasing lengths, which, in part, relates to its occlusion within the ribosomal exit tunnel20 (Fig. 1d, cyan).
Interactions of F3 and Y3 aromatic clusters with the ribosome surface
We prepared a series of constructs, termed F3A3, A3Y3 and A3A3, in which the clusters of aromatic residues were mutated to alanine (Fig. 2a and Extended Data Fig. 2). The close overlay between 1H,15N NMR spectra of the isolated variants and disordered FLN5 indicates they are natively unfolded, as expected given large changes to the hydrophobic core (Extended Data Fig. 2c–f). On the ribosome, the 1H,15N correlation spectra of the corresponding FLN5+31 RNC variants showed substantial increases in cross-peak intensities relative to wild-type (wt) FLN5+31 RNC, broadly localized to the alanine mutation sites (Fig. 2b and Extended Data Fig. 3). Residues between A665 and G700 within the FLN5+31 A3A3 RNC variant reached a relative intensity close to 1, indicating that the mobility of this segment of the nascent chain is comparable to that of the isolated unfolded protein. These data demonstrate that the aromatic residues mediate a large proportion of the interactions at these sites. By contrast, in all variants the C-terminal segment of FLN5 (N728–C747) remained broadened beyond detection. This suggests additional interactions within this nascent chain segment (Fig. 2b).
To measure the strength of these aromatic interactions, we developed and acquired sensitivity-optimized measurements of the transverse cross-correlated relaxation (CCR) rate (Extended Data Fig. 4). Interactions of nascent chain segments with the ribosome surface will result in transferred relaxation, and thus an increase in the CCR rate, Δηxy, relative to the isolated unfolded state. The increase, , is proportional to the bound fraction pB, and to the CCR rate of the bound state, . Based on the known rotational correlation time (τC) of the ribosome26, is estimated to be ∼7,000 s−1.
CCR measurements were acquired for FLN5+31 wt, F3A3, A3Y3 and A3A3 RNCs, and the corresponding isolated proteins (Fig. 2c and Extended Data Fig. 4). In general, we observed that increased CCR rates were associated with reduced resonance intensities. In the wt RNCs, increases in ηxy of ∼15 s−1 were measured around the aromatic F3 cluster. Within the Y3 site, residues beyond T714 were broadened beyond detection in wt and A3Y3 RNCs, but examination of flanking residues indicates that ηxy probably increases substantially beyond 15 s−1. Increased relaxation was also observed for residues at the N terminus, which we ascribe to an interaction of the 6xHis tag as previously observed in α-synuclein RNCs21.
We observe some evidence of cooperativity between the F3 and Y3 clusters, both by comparison of resonance intensities and of CCR rates: the elimination of one cluster leads to a small reduction in the interaction of the neighbouring cluster (Fig. 2b,d). However, in quantitative terms the extent of interaction is weak in all cases. Within the F3 cluster, observed increases in CCR rates correspond to ribosome-bound nascent chain populations of ∼0.1% (Fig. 2d). While resonances for the Y3 cluster were strongly broadened in wt and A3Y3, only allowing for a partial quantitation of CCR rates, the ribosome-bound population between residues 708 and 716 also neared ∼0.1%. Although these estimates assume a rigid bound state, even substantial flexibility that results in an order of magnitude decrease in would indicate a bound population of only a few percent. Such weak interactions within these regions are not sufficient to perturb the co-translational folding process.
Interaction of the C-terminal segment with the ribosome surface
We next sought to investigate the interaction of the C-terminal segment (N728–C747) (Fig. 1d). The elimination of the Y3 cluster in the F3A3 and A3A3 variants resulted in newly observable resonances flanking this segment. Careful inspection revealed that small chemical shift perturbations (CSPs) were observed for these resonances (V717–G725, encompassing part of the Y3 region), relative to the isolated protein, and that their magnitude increased towards the (unobserved) ribosome-binding segment (Fig. 3a,b and Extended Data Fig. 5). Similarly, CSPs were observed at the C-terminal end of the binding segment between I748 and A751. Focusing on the A3A3 variant, we found that these CSPs were substantially reduced at high ionic strength (Fig. 3c), suggesting that they are associated with an interaction mediated at least in part by an electrostatic contribution.
We then explored the effect of the RNC linker length on the observed CSPs. We hypothesized that a shorter RNC would experience a higher effective ribosome concentration20 and therefore modulate the extent of binding. Indeed, as the linker length is increased from 26 to 110 amino acids, CSPs were observed to decrease (Fig. 3d) while resonance intensities increased (Extended Data Fig. 5). The CSPs at different lengths were collinear, indicating that changes in the 1H and 15N chemical shifts were strongly correlated (Fig. 3d). These collinear, correlated CSPs are an unambiguous indication that the C-terminal segment is rapidly exchanging between a ribosome-bound and free state, such that the observed chemical shift reflects a population-weighted average of unbound and bound states27. Resonances of the C-terminal residues I749–A751 showed deviations from collinearity at short linker lengths, which we attribute to proximity to the exit tunnel. For this reason, they have been excluded from further analysis. Together, these observations provide compelling evidence for a strong ribosome interaction involving nascent chain residues between Y727 and C747.
Alanine mutations within the F3 and Y3 clusters did not substantially perturb the observed C-terminal CSPs, indicating that there is no detectable cooperativity between interactions of these aromatic clusters and the C-terminal segment (Extended Data Fig. 5). Furthermore, no modulations in chemical shifts or intensities were observed upon varying magnetic field strength (Extended Data Fig. 5). Given the fast chemical exchange behaviour observed, we can infer from the largest frequency difference, Δν, of 30 Hz (for the 15N resonance of E724 at 22.3 T) that the dissociation of the bound state is rapid, with a lifetime much less than 3 ms (τ ≪ 1/4πΔν, that is, koff ≈ τ−1 ≫ 360 s−1).
Impact of the linker sequence composition on FLN5 interactions
We investigated the influence of the FLN6 linker on FLN5–ribosome interactions. We previously reported that replacing all FLN6 residues with a poly(GS) sequence (Fig. 3e) did not perturb co-translational folding equilibria20. Here we find that substituting poly(GS) linkers within A3A3 RNCs led to only small shifts in the CSPs and resonance intensities in the C-terminal segment, indicating that the extent of binding is not greatly perturbed (Fig. 3f and Extended Data Fig. 6), and suggesting little impact of the nature of the tether on the co-translational folding of FLN5.
Quantification and molecular modelling of the C-terminal segment interaction
Having identified a ribosome interaction site in the C-terminal segment, we next examined its molecular basis. Although the C-terminal segment does not contain aromatic residues, it contains three basic side chains (R734, K739, K746) that we hypothesized would contribute towards the electrostatic interactions (Fig. 1e), given the reduced CSPs observed at increased ionic strength (Fig. 3c). To explore this, we designed a FLN5 construct, A3A3E6, in which six residues (surface-exposed within the folded state) were replaced with acidic glutamate residues, thus reversing the net charge within this segment from +1 to −6 (Figs. 1a and 4a). The E6 mutations were found to greatly reduce the magnitude of CSPs in the C-terminal segment (Fig. 4b), indicating that these mutations reduce the affinity of the segment for the ribosome surface. However, some CSPs and line broadening persist (Extended Data Fig. 7), suggesting that the binding interaction of this region is not abrogated completely. Similar reductions in CSPs were also observed between FLN5+31 wt and E6 RNCs, although due to the increased line broadening fewer resonances adjacent to the interacting segment could be resolved (Extended Data Fig. 7).
To quantify the effect of the E6 mutations on the binding interaction, we have analysed further the observed CSPs, which reflect a population-weighted average of unbound and bound states. These residues provide a convenient ‘ruler’ to compare the interactions of different nascent chains with the ribosome. However, to determine the absolute amount of binding, the chemical shift of the fully bound state must be determined. To achieve this, we carried out a global analysis of CSPs observed in the A3A3 and A3A3E6 RNC variants across multiple linker lengths (Fig. 4c). Two assumptions were required for this analysis. First, while the strength of interactions in these variants clearly varies as a function of linker length, we assume that the difference in free energy of binding between variants, (where Ufree and Ubound represent unfolded states with the C-terminal segment unbound and ribosome-bound, respectively), is independent of RNC length. This is equivalent to the assumption that C-terminal segments in both variants experience the same effective ribosome concentration at a given linker length, which is supported by the similar structural and dynamic properties of the A3A3 and A3A3E6 variants: no chemical shift perturbations or differences in 15N R2 relaxation rates are observed beyond the immediate vicinity of the E6 mutations (Extended Data Fig. 8). Second, while unbound resonance positions vary between A3A3 and A3A3E6 RNCs due to local sequence effects, we assume that the chemical shift change upon binding, Δδmax, is the same for both variants. Given this, the observed chemical shift perturbations of four well-resolved resonances in A3A3 and A3A3E6 RNCs were fitted to determine the chemical shift differences between free and bound states, and the difference in affinities of the two variants, (Fig. 4d and Extended Data Fig. 8). These results indicate that at a short RNC length of 26 amino acids, 90% and 22% of the A3A3 and A3A3E6 RNCs are bound to the ribosome, respectively, then at a longer RNC length of 42 amino acids these values decreased to 60% and 7%, respectively, and by 110 amino acids, the interaction essentially disappears in the A3A3E6 variant (Fig. 4e).
Next, we used coarse-grained (CG) molecular dynamics simulations of the A3A3 and A3A3E6 RNCs to probe this interaction further, and to explore the location of nascent chain interaction sites on the ribosome surface (Fig. 4g). The strength of electrostatic interactions between the nascent chain and the ribosome surface in the CG model was calibrated using simulations of the A3A3 FLN5+42 RNC, in order to achieve ∼60% binding of the C-terminal segment (730–746), as determined from our NMR observations (Fig. 4e). Simulations of other RNC lengths (+31, +67 and +110), and of the +42 E6 variant, were then carried out with no further adjustment of parameters. We found that this CG model accurately identified electrostatics-based interactions of the C-terminal segment of the nascent chain that were consistent with the extent of binding measured from our CSP analysis across all lengths (Fig. 4f). The C-terminal binding segment (N728–C747) in the A3A3 FLN5+31 RNC contacted primarily the 23S rRNA region located adjacent to the exit tunnel (mainly helices H24 and H50), and also a loop of the nearby ribosomal protein, uL24 (Fig. 4h). Moreover, for many individual RNA bases in this region, in particular those within helices H24, H47, H50 and H59 (Extended Data Fig. 9), the contact probability correlated strongly with the extent of binding across nascent chain lengths as determined experimentally from our CSP analysis (Fig. 4i). These observations point to a nascent chain interaction site at the ribosome exit vestibule. We note that a similar interaction site is also observed for folded FLN5+47 RNC by cryo electron microscopy28, and that helices H24 and H50 are highly structurally conserved across Bacteria and Eukaryotes. It may be that this region on the ribosomal surface has a role in modulating the co-translational folding of FLN5 and perhaps generally for nascent chains.
Modulation of co-translational folding by C-terminal interactions
In the previous sections, we have developed a detailed description of the interactions formed between unfolded FLN5 nascent chains and the ribosome surface. By stabilizing the unfolded state, such interactions will in general lead to an inhibition of folding (Fig. 5a), and we set out in this section to test this linkage in quantitative terms. Since interactions of the F3 and Y3 aromatic clusters are weak, we have focused our analysis on the strong interactions of the C-terminal segment, which we have previously shown is essential for folding of the domain29. A simple calculation (detailed in the Supplementary Information) relates the amount of binding within the unfolded state (pB) to its change in stability:
1 |
We have used this expression, together with our measurements of interactions in unfolded RNCs of various lengths, to predict the effect on the co-translational folding equilibria of folding-competent FLN5 RNCs in the absence of the destabilizing A3A3 mutations (Fig. 4e, right-hand axis). While this calculation assumes that the A3A3 mutations themselves do not perturb the interaction, we have previously noted that the impact of E6 mutations on interactions of wt and A3A3 RNCs was similar (Extended Data Fig. 7), and that there was no detectable cooperativity between the aromatic clusters and interactions of the C-terminal segment (Extended Data Fig. 5). On this basis, we predict a destabilization of wt FLN5 RNCs by over 1 kcal mol−1 at short linker lengths (Fig. 4e, right-hand axis). In the absence of other changes in thermodynamic stability, we also predict that the E6 variant should shift co-translational folding equilibria towards the folded state (Fig. 5a and Fig. 4e, right-hand axis).
To test the predicted effect of interactions on folding experimentally, we employed a cysteine mass-tagging approach, PEGylation30, as a reporter of folding and co-translational folding equilibria. 2D NMR observations indicated that wt and E6 FLN5 have similar structures both in isolation and as RNCs (Extended Data Fig. 10), and so the mutations C747V and A721C were introduced to replace the native, C-terminal cysteine with one that is unambiguously emerged beyond the exit tunnel at all linker lengths examined here, and that is protected from modification by PEG-maleimide unless the domain is unfolded (Extended Data Fig. 10). The isolated variants, FLN5 C721 V747 (the ‘pseudo-wild-type’) and FLN5 C721 V747 E6, were reacted with PEG-maleimide in increasing concentrations of urea and the extent of protection was determined by electrophoresis. The resulting measurements fitted closely to a two-state unfolding model and indicated that the E6 variant was destabilized relative to the wt by (where N and U represent native and unfolded states), in excellent agreement with equivalent CD measurements (Fig. 5b and Extended Data Fig. 10).
We next subjected FLN5 and FLN5 E6 RNCs to PEGylation following the same protocol (Fig. 5c and Extended Data Fig. 10). As ribosomes dissociate at high concentrations of urea17, we restricted our analysis to measurements below 3 M urea, with the exception of FLN5+110 RNC for which data up to 3.75 M were used (Fig. 5c, dashed lines). The stabilities of FLN5+110 wt and E6 RNCs are indistinguishable from the corresponding isolated proteins, indicating that the difference in thermodynamic stability between isolated FLN5 and FLN5 E6 is conserved on the ribosome at long linker lengths (). However, at short linker lengths, the difference in stability between FLN5 and FLN5 E6 was decreased ( for FLN5+31) (Fig. 5c,d). Consistent with our hypothesis, wt nascent chains therefore become destabilized relative to E6 under strongly interacting conditions. Based on our earlier measurements of bound populations (Fig. 4e), we can predict the change in stability relative to the isolated domains (Fig. 5d), and these predictions closely match direct observations using PEGylation (Fig. 5d). Therefore, we conclude that the C-terminal segment can indeed modulate co-translational FLN5 folding through the competition between binding and folding.
Conclusion
For many domains, co-translational folding has been reported to be destabilized on the ribosome5,17–19. This effect is generally inferred to arise through interactions between emerging nascent chains and the ribosome, which have been observed for a range of different nascent chain sequences12,21–24. However, a direct link between the energetics of interactions and of co-translational folding has not previously been established. In this article we have therefore systematically examined a series of interactions between unfolded FLN5 nascent chains and the ribosome surface, in order to determine their effect on co-translational folding. While some of these interactions, between aromatic clusters and the ribosome surface, are too weak to perturb the energetics of folding substantially, we have identified a strongly interacting C-terminal segment, with over 90% bound at short linker lengths. The length of the sequestered segment (22 amino acids) is longer than the C-terminal truncation that can be tolerated by isolated FLN5 before it unfolds (12 amino acids)29, supporting the crucial role of this segment to enable native structure formation. Importantly, our analysis establishes quantitative agreement between the strength of the observed interactions and the energetics of co-translational folding itself, providing a residue-specific demonstration of the ability of the ribosome surface to directly modulate co-translational folding, effectively acting as a holdase.
A notable consequence of our analysis is that strong interactions are required to appreciably perturb the co-translational folding landscape, for example, a destabilization of 1 kcal mol−1 requires over 80% binding (equation (2)). Such interactions were indeed observed for the C-terminal segment at short linker lengths, but the bound population decreases sharply with increasing linker length, to below 50% at linker lengths beyond 47 amino acids (Fig. 4e). This rapid, short-range effect may provide a mechanism by which the engagement of molecular chaperones with the emerging NC is regulated.
We observe that the molecular determinants of ribosome interactions, that is, positively charged and aromatic residues, are similar to those recognized by other molecule chaperones and processing complexes31. These include signal recognition particle32, the ribosome-associated chaperone trigger factor33–35, and SecB36, both of which function as holdases for nascent polypeptide chains, and other downstream chaperones such as DnaK37. Indeed, over the past few years NMR studies have been instrumental in revealing with exquisite detail the mechanisms underlying substrate recognition and chaperone action by these molecules35,36,38–42.
During biosynthesis, substrates emerge from the ribosome in their high-energy unfolded states, and so in contrast to chaperones that act post-translationally, interactions with holdases carry no energetic cost. This is reflected in the ATP independence of both trigger factor and SecB holdases—as well as in interactions with the ribosome surface itself. We speculate that this short-range holdase activity could have a number of functional roles: sequestration of hydrophobic segments until later residues have been synthesized; to reduce the risk of misfolding, particularly in tandem repeat proteins such as FLN29,43; or to delay folding ahead of co-translational assembly44–46 or the engagement of downstream chaperones such as TF and SecB, which may be involved in secretory pathways35,36. This suggests that the ribosome is more than an inert hub that orchestrates interactions of auxiliary factors and chaperones47, and in fact is itself an active participant in the co-translational folding process. In conclusion, we demonstrate the holdase effect of the ribosome as a bespoke form of regulation over co-translational folding.
Methods
Sample preparation and quality control
DNA constructs of tandem immunoglobulin domains FLN5 and FLN6, and of poly(GS) glycine serine repeat sequence variants and Y719E variants, were described previously20. Aromatic and glutamate mutants were generated using site-directed mutagenesis. A stronger SecM variant was used for preparation of [2H,13CH3-Ile, Leu and Val (ILV)]-labelled RNCs for PEGylation experiments, with sequence ‘FSTPVWIWWWPRIRGPPPPWT’. 15N and [2H,13CH3-ILV]-labelled proteins were grown and purified as described previously49, except in the case of disordered FLN5 variants where the final chromatography step was performed using a HiLoad 16/600 Superdex 200 pg (GE Healthcare) column in the presence of 6 M urea, followed by buffer exchange into Tico buffer25. 15N and [2H,13CH3-ILV]-labelled RNCs were generated in BL21(DE3) Escherichia coli as described previously25, where the only modification was the replacement of the final step of purification (sucrose gradient chromatography) with an additional sucrose cushion step followed by hydrophobic affinity chromatography using a HiTrap FF butyl column (GE Healthcare). The occupancy and integrity of RNC samples were monitored as described previously25.
NMR spectroscopy
NMR data were acquired on 500 and 700 MHz Bruker Avance III, and 800 and 950 MHz Bruker Avance III HD spectrometers, all equipped with TCI cryoprobes. Data were processed and analysed with nmrPipe, Sparky, CCPN Analysis and MATLAB50,51. All samples were prepared at nascent chain concentrations of between 7 and 13 µM in Tico buffer, pH 7.5, containing 10% D2O and 0.001% (w/v) DSS25. For 15N-labelled samples, 1H,15N SOFAST-HMQC experiments52 were recorded with typical acquisition times of 50.4 ms and 29.5 ms in the direct and indirect dimensions, respectively, and a 100 ms recycle delay. Transverse cross-correlated relaxation was measured using 1H,15N BEST-TROSY-CCR experiments (Extended Data Fig. 4), acquired at 950 MHz with a recycle delay of 200 ms and an acquisition time of 49 ms in the direct dimension. For measurements of isolated proteins, a relaxation delay of 111 ms was used, while shorter relaxation delays were used for measurements of RNCs (22 ms for FLN5+31; 44 ms for FLN5+31 F3A3 and FLN5+31 A3A3). An acquisition time of 43 ms was used in the indirect dimension for all measurements except for FLN5+31, for which a 21 ms acquisition time was used. Samples were doped with 15 mM NiDO2A (nickel(II) 1,4,7,10-tetraazacyclododecane-1,7-bis(acetic acid)) to enhance sensitivity (Extended Data Fig. 4)53. All experiments were interleaved with SORDID diffusion measurements54 with a diffusion delay of 190 ms, an acquisition time of 49 ms, and 4 ms trapezoidal gradient pulses with strengths of 0.058 and 0.387 T m−1. Ribosome background labelling was assessed using 15N filtered/edited difference spectroscopy25. Chemical shift assignments for FLN5 variants were verified using 1H,15N NOESY-HSQC and TOCSY-HSQC experiments, with 200 ms and 70 ms mixing times, respectively.
For [2H,13CH3-ILV]-labelled samples, 1H,13C HMQC spectra were acquired at 800 MHz with acquisition times of 100 ms and 7 ms in the direct and indirect dimensions, respectively. Sample integrity was assessed through 13C-edited 1H STE diffusion measurements25 and 1H R2 relaxation measurements (13C-edited and incorporating a filter to select slow relaxing coherences55), in which signals detected after a 100 ms relaxation delay were interpreted as indicating nascent chain release.
1H R2 rates of isolated amino acids (200 µM in D2O) were measured in the presence and absence of 1 µM 70S ribosomes, using a 500 Hz PROJECT pulse train (700 MHz, 283 K)56.
CD spectroscopy
CD spectra were acquired using a Chirascan-plus CD spectrometer (Applied Photophysics). All samples (20 µM) were incubated for ≥3 h at 283 K prior to measurement in Tico buffer (12 mM Hepes, 30 mM NH4Cl, 6 mM MgCl2, pH 7.5). CD signals at 211 nm were fitted globally using MatLab to a two-state unfolding model with a common m value, 〈mD–N〉 (ref. 57):
2 |
where [D] is the denaturant concentration, [D]50% is the midpoint of folding, and α terms represent baselines for native (N) and denatured states. Stabilities were then calculated as ΔGD–N = mD–N[D]50%.
Molecular dynamics
Simulations of FLN5+31, +42, +67 and +110 RNCs were run in Gromacs 4.5.758 using Cα structure-based models generated with SMOG 2.059,60, extended to represent RNA using three beads located at P, C4′ and N3 atoms. A rigid ribosome model was created based on the structure 4ybb61 and including only nascent chain-accessible residues, that is, the exit tunnel and the surrounding surface. The A3A3 variant was modelled by removing all contacts involving the mutated sites, and the simulation temperature was then tuned so that isolated wt and A3A3 variants were folded and unfolded, respectively. Electrostatic interactions were introduced using Debye–Hückel theory62, with parameters chosen to reproduce the experimentally observed bound population of the FLN5+42 RNC and then applied without change to the remaining lengths. Three independent simulations were carried out for each system, with ∼2 × 108 steps per run.
PEGylation
Samples (100–200 pmol isolated proteins or 8 pmol of RNC) were incubated in increasing urea concentrations for at least 2 h at 283 K prior to measurements. Methoxypolyethylene glycol maleimide (PEG-Mal) was added to a final concentration of 2 mM and incubated for 10 min before quenching the reaction with 0.23 M DTT. Samples were separated on denaturing 12% (w/v) polyacrylamide Bis-Tris gels (pH 5.7) which were subsequently Coomassie stained (for isolated proteins) or analysed by Western blot (for RNCs). Quantitation was performed using ImageStudio (Licor). As a control, Western blots of dilutions of a FLN5+47 RNC variant showed excellent linearity (R2 = 0.98) over the concentration range used in analyses (Supplementary Fig. 14). Populations of unfolded (PEGylated) nascent chains were determined relative to the total nascent chain concentration (PEGylated and unPEGylated) and fitted to equation (2) to determine the thermodynamic stability as described above.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41557-021-00796-x.
Supplementary information
Acknowledgements
We acknowledge the use of the UCL Biomolecular NMR Centre, the MRC Biomedical NMR Centre and the IBS NMR facility (Grenoble). We thank Martin Blackledge for helpful discussions. We thank Blanca Echeverría Perez for technical assistance. This work was supported by a Wellcome Trust Investigator Award (to J.C., 206409/Z/17/Z). This work was supported by the Francis Crick Institute through provision of access to the MRC Biomedical NMR Centre. The Francis Crick Institute receives its core funding from Cancer Research UK (FC001029), the UK Medical Research Council (FC001029) and the Wellcome Trust (FC001029). This work was also supported by iNEXT, grant number 653706, funded by the Horizon 2020 programme of the European Commission (to C.A.W. and J.C.), and grant number GB77-14, funded by the Interdisciplinary Centre for Mathematical and Computational Modelling (ICM), University of Warsaw (to T.W. and J.C.).
Extended data
Source data
Author contributions
Conceptualization: A.M.E.C., T.W., C.A.W. and J.C. Methodology: A.M.E.C., T.W., S.H.S.C., L.F.W., L.D.C., C.A.W. and J.C. Investigation: A.M.E.C., T.W., L.F.W., I.V.B., J.O.S., L.D.C. and C.A.W. Visualization: A.M.E.C., T.W., L.F.W. and C.A.W. Funding acquisition: A.M.E.C., T.W., L.D.C., C.A.W. and J.C. Project administration: A.M.E.C., C.A.W. and J.C. Supervision: A.M.E.C., L.D.C., C.A.W. and J.C. Writing—original draft: A.M.E.C., C.A.W. and J.C. Writing—review and editing: A.M.E.C., T.W., S.H.S.C., L.D.C., C.A.W. and J.C.
Data availability
Data supporting the findings of this study are included in the article, source data and Supplementary Information files. Assignments have been deposited in the BMRB under accession codes 51023 and 51028. Source data are provided with this paper.
Code availability
NMR pulse sequences are available on https://github.com/chriswaudby/pp and also as Supplementary Information. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Chemistry thanks Bjoern Burmann and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Christopher A. Waudby, Email: c.waudby@ucl.ac.uk
John Christodoulou, Email: j.christodoulou@ucl.ac.uk.
Extended data
are available for this paper at 10.1038/s41557-021-00796-x.
Supplementary information
The online version contains supplementary material available at 10.1038/s41557-021-00796-x.
References
- 1.Balchin D, Hayer-Hartl M, Hartl FU. In vivo aspects of protein folding and quality control. Science. 2016;353:aac4354–aac4354. doi: 10.1126/science.aac4354. [DOI] [PubMed] [Google Scholar]
- 2.Hartl FU, Hayer-Hartl M. Molecular chaperones in the cytosol: from nascent chain to folded protein. Science. 2002;295:1852–1858. doi: 10.1126/science.1068408. [DOI] [PubMed] [Google Scholar]
- 3.Cassaignau AME, Cabrita LD, Christodoulou J. How does the ribosome fold the proteome? Annu. Rev. Biochem. 2020;89:389–415. doi: 10.1146/annurev-biochem-062917-012226. [DOI] [PubMed] [Google Scholar]
- 4.Kim SJ, et al. Protein folding. Translational tuning optimizes nascent protein folding in cells. Science. 2015;348:444–448. doi: 10.1126/science.aaa3974. [DOI] [PubMed] [Google Scholar]
- 5.Liu K, Rehfus JE, Mattson E, Kaiser CM. The ribosome destabilizes native and non-native structures in a nascent multidomain protein. Protein Sci. 2017;26:1439–1451. doi: 10.1002/pro.3189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Alexander LM, Goldman DH, Wee LM, Bustamante C. Non-equilibrium dynamics of a nascent polypeptide during translation suppress its misfolding. Nat. Commun. 2019;10:2709. doi: 10.1038/s41467-019-10647-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zhang G, Hubalewska M, Ignatova Z. Transient ribosomal attenuation coordinates protein synthesis and co-translational folding. Nat. Struct. Mol. Biol. 2009;16:274–280. doi: 10.1038/nsmb.1554. [DOI] [PubMed] [Google Scholar]
- 8.Waudby CA, Dobson CM, Christodoulou J. Nature and regulation of protein folding on the ribosome. Trends Biochem. Sci. 2019;44:914–926. doi: 10.1016/j.tibs.2019.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.O’Brien EP, Christodoulou J, Vendruscolo M, Dobson CM. New scenarios of protein folding can occur on the ribosome. J. Am. Chem. Soc. 2011;133:513–526. doi: 10.1021/ja107863z. [DOI] [PubMed] [Google Scholar]
- 10.Frydman J, Erdjument-Bromage H, Tempst P, Hartl FU. Co-translational domain folding as the structural basis for the rapid de novo folding of firefly luciferase. Nat. Struct. Biol. 1999;6:697–705. doi: 10.1038/10754. [DOI] [PubMed] [Google Scholar]
- 11.Sander IM, Chaney JL, Clark PL. Expanding Anfinsen’s principle: contributions of synonymous codon selection to rational protein design. J. Am. Chem. Soc. 2014;136:858–861. doi: 10.1021/ja411302m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ellis JP, Bakke CK, Kirchdoerfer RN, Jungbauer LM, Cavagnero S. Chain dynamics of nascent polypeptides emerging from the ribosome. ACS Chem. Biol. 2008;3:555–566. doi: 10.1021/cb800059u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Holtkamp W, et al. Cotranslational protein folding on the ribosome monitored in real time. Science. 2015;350:1104–1107. doi: 10.1126/science.aad0344. [DOI] [PubMed] [Google Scholar]
- 14.Notari L, Martínez-Carranza M, Farías-Rico JA, Stenmark P, von Heijne G. Cotranslational folding of a pentarepeat β-helix protein. J. Mol. Biol. 2018;430:5196–5206. doi: 10.1016/j.jmb.2018.10.016. [DOI] [PubMed] [Google Scholar]
- 15.Kemp G, Kudva R, de la Rosa A, von Heijne G. Force–profile analysis of the cotranslational folding of HemK and filamin domains: comparison of biochemical and biophysical folding assays. J. Mol. Biol. 2019;431:1308–1314. doi: 10.1016/j.jmb.2019.01.043. [DOI] [PubMed] [Google Scholar]
- 16.Liutkute M, Maiti M, Samatova E, Enderlein J, Rodnina MV. Gradual compaction of the nascent peptide during cotranslational folding on the ribosome. eLife. 2020;9:e60895. doi: 10.7554/eLife.60895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Samelson AJ, Jensen MK, Soto RA, Cate JHD, Marqusee S. Quantitative determination of ribosome nascent chain stability. Proc. Natl Acad. Ssci. USA. 2016;113:13402–13407. doi: 10.1073/pnas.1610272113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liu K, Maciuba K, Kaiser CM. The ribosome cooperates with a chaperone to guide multi-domain protein folding. Mol. Cell. 2019;74:310–319.e7. doi: 10.1016/j.molcel.2019.01.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jensen MK, Samelson AJ, Steward A, Clarke J, Marqusee S. The folding and unfolding behavior of ribonuclease H on the ribosome. J. Biol. Chem. 2020;295:11410–11417. doi: 10.1074/jbc.RA120.013909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Cabrita LD, et al. A structural ensemble of a ribosome-nascent chain complex during cotranslational protein folding. Nat. Struct. Mol. Biol. 2016;23:278–285. doi: 10.1038/nsmb.3182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Deckert A, et al. Structural characterization of the interaction of α-synuclein nascent chains with the ribosomal surface and trigger factor. Proc. Natl Acad. Sci. USA. 2016;113:5012–5017. doi: 10.1073/pnas.1519124113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Knight AM, et al. Electrostatic effect of the ribosomal surface on nascent polypeptide dynamics. ACS Chem. Biol. 2013;8:1195–1204. doi: 10.1021/cb400030n. [DOI] [PubMed] [Google Scholar]
- 23.Kaiser CM, Goldman DH, Chodera JD, Tinoco I, Bustamante C. The ribosome modulates nascent protein folding. Science. 2011;334:1723–1727. doi: 10.1126/science.1209740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Marino J, Buholzer KJ, Zosel F, Nettels D, Schuler B. Charge interactions can dominate coupled folding and binding on the ribosome. Biophys. J. 2018;115:996–1006. doi: 10.1016/j.bpj.2018.07.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cassaignau AME, et al. A strategy for co-translational folding studies of ribosome-bound nascent chain complexes using NMR spectroscopy. Nat. Protoc. 2016;11:1492–1507. doi: 10.1038/nprot.2016.101. [DOI] [PubMed] [Google Scholar]
- 26.Lavalette D, Amand B, Pochon F. Rotational relaxation of 70S ribosomes by a depolarization method using triplet probes. Proc. Natl Acad. Sci. USA. 1977;74:1407–1411. doi: 10.1073/pnas.74.4.1407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Jagger AM, Waudby CA, Irving JA, Christodoulou J, Lomas DA. High-resolution ex vivo NMR spectroscopy of human Z α1-antitrypsin. Nat. Commun. 2020;11:1–13. doi: 10.1038/s41467-020-20147-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Javed, A. et al. Visualising nascent chain dynamics at the ribosome exit tunnel by cryo-electron microscopy. Preprint at https://www.biorxiv.org/content/10.1101/722611v3.full (2019).
- 29.Waudby CA, et al. Systematic mapping of free energy landscapes of a growing filamin domain during biosynthesis. Proc. Natl Acad. Sci. USA. 2018;115:9744–9749. doi: 10.1073/pnas.1716252115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lu J, Deutsch C. Pegylation: a method for assessing topological accessibilities in Kv1. 3. Biochemistry. 2001;40:13288–13301. doi: 10.1021/bi0107647. [DOI] [PubMed] [Google Scholar]
- 31.Bose D, Chakrabarti A. Substrate specificity in the context of molecular chaperones. IUBMB Life. 2017;69:647–659. doi: 10.1002/iub.1656. [DOI] [PubMed] [Google Scholar]
- 32.Schibich D, et al. Global profiling of SRP interaction with nascent polypeptides. Nature. 2016;536:219–223. doi: 10.1038/nature19070. [DOI] [PubMed] [Google Scholar]
- 33.Patzelt H, et al. Binding specificity of Escherichia coli trigger factor. Proc. Natl Acad. Sci. USA. 2001;98:14244–14249. doi: 10.1073/pnas.261432298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tomic S, Johnson AE, Hartl FU, Etchells SA. Exploring the capacity of trigger factor to function as a shield for ribosome bound polypeptide chains. FEBS Lett. 2006;580:72–76. doi: 10.1016/j.febslet.2005.11.050. [DOI] [PubMed] [Google Scholar]
- 35.Saio T, Guan X, Rossi P, Economou A, Kalodimos CG. Structural basis for protein antiaggregation activity of the trigger factor chaperone. Science. 2014;344:1250494–1250494. doi: 10.1126/science.1250494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Huang C, Rossi P, Saio T, Kalodimos CG. Structural basis for the antifolding activity of a molecular chaperone. Nature. 2016;537:202–206. doi: 10.1038/nature18965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Rüdiger S, Germeroth L, Schneider-Mergener J, Bukau B. Substrate specificity of the DnaK chaperone determined by screening cellulose-bound peptide libraries. EMBO J. 1997;16:1501–1507. doi: 10.1093/emboj/16.7.1501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Sekhar A, Rosenzweig R, Bouvignies G, Kay LE. Hsp70 biases the folding pathways of client proteins. Proc. Natl Acad. Sci. USA. 2016;113:E2794–E2801. doi: 10.1073/pnas.1601846113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.He L, Sharpe T, Mazur A, Hiller S. A molecular mechanism of chaperone-client recognition. Sci. Adv. 2016;2:e1601625. doi: 10.1126/sciadv.1601625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Libich DS, Tugarinov V, Clore GM. Intrinsic unfoldase/foldase activity of the chaperonin GroEL directly demonstrated using multinuclear relaxation-based NMR. Proc. Natl Acad. Sci. USA. 2015;112:8817–8823. doi: 10.1073/pnas.1510083112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jiang Y, Rossi P, Kalodimos CG. Structural basis for client recognition and activity of Hsp40 chaperones. Science. 2019;365:1313–1319. doi: 10.1126/science.aax1280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Faust O, et al. HSP40 proteins use class-specific regulation to drive HSP70 functional diversity. Nature. 2020;587:489–494. doi: 10.1038/s41586-020-2906-4. [DOI] [PubMed] [Google Scholar]
- 43.Borgia MB, et al. Single-molecule fluorescence reveals sequence-specific misfolding in multidomain proteins. Nature. 2011;474:662–665. doi: 10.1038/nature10099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Shieh Y-W, et al. Operon structure and cotranslational subunit association direct protein assembly in bacteria. Science. 2015;350:678–680. doi: 10.1126/science.aac8171. [DOI] [PubMed] [Google Scholar]
- 45.Shiber A, et al. Cotranslational assembly of protein complexes in eukaryotes revealed by ribosome profiling. Nature. 2018;561:268–272. doi: 10.1038/s41586-018-0462-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bertolini M, et al. Interactions between nascent proteins translated by adjacent ribosomes drive homomer assembly. Science. 2020;371:57–64. doi: 10.1126/science.abc7151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pechmann S, Willmund F, Frydman J. The ribosome as a hub for protein quality control. Mol. Cell. 2013;49:411–421. doi: 10.1016/j.molcel.2013.01.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McCoy AJ, Fucini P, Noegel AA, Stewart M. Structural basis for dimerization of the Dictyostelium gelation factor (ABP120) rod. Nat. Struct. Biol. 1999;6:836–841. doi: 10.1038/12296. [DOI] [PubMed] [Google Scholar]
- 49.Hsu S-TD, et al. Structure and dynamics of a ribosome-bound nascent chain by NMR spectroscopy. Proc. Natl Acad. Sci. USA. 2007;104:16516–16521. doi: 10.1073/pnas.0704664104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Delaglio F, et al. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
- 51.Vranken WF, et al. The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins. 2005;59:687–696. doi: 10.1002/prot.20449. [DOI] [PubMed] [Google Scholar]
- 52.Schanda, P. & Brutscher, B. Very fast two-dimensional NMR spectroscopy for real-time investigation of dynamic events in proteins on the time scale of seconds. J. Am. Chem. Soc.127, 8014–8015 (2005). [DOI] [PubMed]
- 53.Chan SHS, Waudby CA, Cassaignau AME, Cabrita LD, Christodoulou J. Increasing the sensitivity of NMR diffusion measurements by paramagnetic longitudinal relaxation enhancement, with application to ribosome-nascent chain complexes. J. Biomol. NMR. 2015;63:151–163. doi: 10.1007/s10858-015-9968-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Augustyniak R, Ferrage F, Damblon C, Bodenhausen G, Pelupessy P. Efficient determination of diffusion coefficients by monitoring transport during recovery delays in NMR. Chem. Commun. 2012;48:5307–5309. doi: 10.1039/c2cc30578j. [DOI] [PubMed] [Google Scholar]
- 55.Tugarinov V, Ollerenshaw JE, Kay LE. Dipolar dynamic frequency shifts in multiple-quantum spectra of methyl groups in proteins: correlation with side-chain motion. Magn. Reson. Chem. 2006;44:S122–S129. doi: 10.1002/mrc.1819. [DOI] [PubMed] [Google Scholar]
- 56.Aguilar JA, Nilsson M, Bodenhausen G, Morris GA. Spin echo NMR spectra without J modulation. Chem. Commun. 2012;48:811–813. doi: 10.1039/c1cc16699a. [DOI] [PubMed] [Google Scholar]
- 57.Pace CN. Determination and analysis of urea and guanidine hydrochloride denaturation curves. Methods Enzymol. 1986;131:266–280. doi: 10.1016/0076-6879(86)31045-0. [DOI] [PubMed] [Google Scholar]
- 58.Pronk S, et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics. 2013;29:845–854. doi: 10.1093/bioinformatics/btt055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.JK N, et al. SMOG 2: A versatile software package for generating structure-based models. PLoS Comput. Biol. 2016;12:e1004794. doi: 10.1371/journal.pcbi.1004794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Clementi C, Nymeyer H, Onuchic JN. Topological and energetic factors: what determines the structural details of the transition state ensemble and ‘en-route’ intermediates for protein folding? An investigation for small globular proteins. J. Mol. Biol. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
- 61.J N, et al. High-resolution structure of the Escherichia coli ribosome. Nat. Struct. Mol. Biol. 2015;22:336–341. doi: 10.1038/nsmb.2994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Noel JK, Whitford PC. How EF-Tu can contribute to efficient proofreading of aa-tRNA by the ribosome. Nat. Commun. 2016;7:13314. doi: 10.1038/ncomms13314. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data supporting the findings of this study are included in the article, source data and Supplementary Information files. Assignments have been deposited in the BMRB under accession codes 51023 and 51028. Source data are provided with this paper.
NMR pulse sequences are available on https://github.com/chriswaudby/pp and also as Supplementary Information. Source data are provided with this paper.