Significance
In living cells, proteins are produced in a sequential way by ribosomes. This vectorial process allows the growing protein chain to start to fold before translation has been completed. Thereby, cotranslational protein folding can be significantly different than the folding of a full-length protein in isolation. Here, we show how structurally similar repeat domains, normally produced as parts of a single long polypeptide, affect the cotranslational folding of their neighbors. This provides insight into how the cell may efficiently produce multidomain proteins, paving the way for future studies in vivo or with chaperones. We also provide an estimated magnitude of the mechanical force on the nascent chain generated by cotranslational folding, calculated from biochemical measurements and molecular dynamics simulations.
Keywords: protein folding, spectrin, translational arrest peptide, SecM, molecular dynamics
Abstract
Proteins synthesized in the cell can begin to fold during translation before the entire polypeptide has been produced, which may be particularly relevant to the folding of multidomain proteins. Here, we study the cotranslational folding of adjacent domains from the cytoskeletal protein α-spectrin using force profile analysis (FPA). Specifically, we investigate how the cotranslational folding behavior of the R15 and R16 domains are affected by their neighboring R14 and R16, and R15 and R17 domains, respectively. Our results show that the domains impact each other’s folding in distinct ways that may be important for the efficient assembly of α-spectrin, and may reduce its dependence on chaperones. Furthermore, we directly relate the experimentally observed yield of full-length protein in the FPA assay to the force exerted by the folding protein in piconewtons. By combining pulse-chase experiments to measure the rate at which the arrested protein is converted into full-length protein with a Bell model of force-induced rupture, we estimate that the R16 domain exerts a maximal force on the nascent chain of ∼15 pN during cotranslational folding.
Classically, protein folding has been studied in vitro using purified proteins, providing a detailed picture of how the biophysical properties of a given polypeptide chain affect folding pathways. However, studies of purified proteins do not reflect the vectorial nature of biological protein synthesis, where each amino acid is added sequentially on a timescale that allows the growing protein to explore its continually expanding energy landscape (1).
The discovery (2, 3) and engineering (4, 5) of translational arrest peptides (APs) have provided us with a tool to examine how proteins fold cotranslationally. To date, the folding of several small proteins and protein domains has been studied using the force-sensitive AP from the Escherichia coli SecM protein, establishing a method we have called force profile analysis (FPA) (6–15). Recently, we demonstrated that FPA faithfully picks up cotranslational protein folding events observed by direct biophysical measurements (15).
Spectrin, the primary structural component of the cellular cytoskeleton (16–19), is composed of two heterodimers of α- and β-spectrin that come together in a coiled tetramer (20). The 285-kDa nonerythroid αII isoform of spectrin contains 20 three-helix bundle domains, which provide interesting model systems for protein folding. This is because, while they are structurally nearly identical (21), they vary in stability (22–25), folding rate (26, 27), and folding mechanism (28). In particular, folding of the 15th, 16th, and 17th α-spectrin domains (herein called R15, R16, and R17, respectively) has been extensively studied in vitro (24, 27, 29–32).
Previously, using FPA, we demonstrated that R15 and R16 are able to fold cotranslationally and that the point at which this folding begins does not seem to be influenced by the in vitro folding rate (9). Specifically, while R15 folds very quickly in vitro, cotranslational folding of R16 starts at shorter chain lengths than does folding of R15. Indeed, even when the folding core of R15 was substituted into R16, which gives R16 the folding properties of R15 in vitro (33), the protein was still able to fold earlier than R15 during translation. When the minimal, five-residue R15 folding nucleus (34) was substituted into R16, this led to late cotranslational folding, but early folding was restored when this mutant was expressed in tandem with R15, as it would be in vivo. This demonstrated that the mutant R16 variants fold cotranslationally via early intrachain contacts, at chain lengths where the R15 folding nucleus cannot yet form. This was not observed in vitro because the purified mutant R16 protein would preferentially fold using the “fast” R15 nucleus rather than the slower R16 contacts. Cotranslationally, R16 folds using the first available pathway, i.e., the slower R16 folding contacts.
Here, we have extended our studies on spectrin to determine how the presence of neighboring domains affect the folding of R15 and R16. We find that the cotranslational folding of these two domains is differently affected by their respective upstream and downstream neighbor domains, a previously unappreciated feature of spectrin folding. We were also able to observe that the synthesis of native, likely helical, structures downstream of the folding spectrin domain increases the observed folding force and further stabilizes the fold even in the presence of amino acid substitutions that destabilize the final native spectrin structure.
While it has been shown previously that the cotranslational folding of proteins can provide a mechanical force to release SecM-mediated ribosomal stalling (6), the magnitude of the force has never been directly measured by FPA. Using pulse-chase experiments, we have now measured the rate at which arrested protein is converted to full-length protein under standard experimental conditions. These rates were fit to a Bell model (35) for force-induced rupture, using parameters elucidated by molecular dynamics (MD) simulations (12), to provide an estimate of the magnitude of the pulling force exerted on the nascent chain by cotranslational protein folding. The determined force is in good agreement with previously published measurements using optical tweezers (6).
Results
The FPA Assay.
APs are short stretches of polypeptide that interact with the ribosome exit tunnel in such a way that translation is stalled when the last codon in the portion of the mRNA that codes for the AP is located in the ribosomal A-site (36). APs generally have regulatory functions in cells, controlling, e.g., translation initiation on downstream open reading frames in polycistronic mRNAs (36). Interestingly, the stalling efficiency of many APs has been shown to be sensitive to pulling forces exerted on the nascent polypeptide chain, with high pulling force leading to reduced stalling (6, 37, 38). APs can therefore be used as force sensors, reporting on cotranslational events that in one way or another generate pulling forces on the nascent chain (7, 38, 39). The factors influencing such force-generating events were recently examined using molecular simulations and statistical mechanics models, in particular the effect of translation speed (40). Although AP measurements of force do not include the effect of translation speed directly, this could be accounted for by using a suitable kinetic model for coupled folding and translation (41).
In FPA, a force-generating domain in a polypeptide is placed at increasing distances upstream of an AP (Fig. 1A), and the degree of translational stalling is measured for the corresponding series of protein constructs. Fig. 1B shows how FPA can be applied to study the cotranslational folding of soluble protein domains, e.g., tandem spectrin domains. In the construct shown on the left, the chain is long enough for one of the two spectrin domains to have already folded, while the second spectrin domain is largely buried in the exit tunnel and thus cannot fold when the ribosome reaches the C-terminal end of the AP. Therefore, little force is generated on the AP, and stalling is efficient. In the right-hand construct, the chain is so long that both spectrin domains have folded prior to the ribosome reaching the C-terminal end of the AP, and again little force is generated on the AP. In the middle construct, however, the chain is just long enough that the second spectrin domain can begin to fold if the linker that connects it to the AP is stretched beyond its equilibrium length. Under this condition, some of the free energy gained upon folding will be converted to tension in the linker and generate a pulling force on the AP, resulting in reduced stalling and increased production of full-length protein.
Fig. 1.
FPA. (A) Schematic representation of a typical construct. A “force generator,” which here is one or more spectrin domains, is connected by a linker sequence to the E. coli SecM arrest peptide (AP). Following the AP is an unrelated protein sequence from the E. coli LepB protein that is added to allow the arrested (A) and full-length (FL) forms to be separable by SDS/PAGE. (B) Schematic representation of cotranslationally arrested ribosome–nascent chain complexes with increasing linker lengths (Left to Right). Represented here is the R15R16 nL construct (colored as in Fig. 2A). (Left) At short linker lengths, when the folding of the R16 domain (red) has not yet begun, the force acting on the nascent chain is minimal, and the arrest is efficient (F ∼ 0, fFL ∼ 0). (Middle) At linker lengths that allow the folding of the R16 domain, the force acting on the nascent chain is increased and the arrest is less efficient (F > 0, fFL > 0). (Right) At linker lengths that arrest the ribosome after the folding of the R16 domain has already occurred, the force acting on the nascent chain is again minimal (F ∼ 0, fFL ∼ 0). (C) Determination of fFL by SDS/PAGE. Lane 1 is the R15 nL39 construct, and lane 2 is the R15 nL39 full-length control construct. The control construct has the final (critical) proline residue of the AP mutated to alanine, preventing arrest and producing only full-length protein. (D) A plot showing the calculation of Lonset, Lmax, and Lend. The data points used for the calculation are denoted by black, yellow, and gray circles, respectively. The interpolated midpoint between each pair of selected point gives the Lonset (vertical dotted line), Lmax (dashed line), and Lend (dash-dot line) values.
The cotranslational folding process can hence be followed by measuring the amount of full-length and arrested protein product at each linker length (Fig. 1C). Folding transitions will appear as peaks in a “force profile” plot of the fraction full-length protein (fFL) vs. linker length (L) (Fig. 1D) and can be described by the peak amplitude and the linker lengths that define the onset (Lonset), maximum (Lmax), and end (Lend) of the peak, as shown.
Cotranslational Folding of Tandem Spectrin Domains.
In the intact spectrin protein, the C-terminal α-helix in one domain is continuous with the N-terminal α-helix in the next domain (Fig. 2A). Consecutive domains are thus intimately connected to each other and can influence each other’s thermodynamic stability (24, 27, 31, 42). In order to better understand the cotranslational folding of spectrin, we decided to systematically evaluate the effects of upstream and downstream domains on the force profiles of the R15 and R16 spectrin domains by analyzing the two- and three-domain combinations shown in Fig. 2B. For each domain combination, a force profile was recorded by generating a series of constructs where the 17-residue SecM(Ec) AP (FSTPVWISQAQGIRAGP) from the E. coli SecM protein was placed at different linker lengths L from the C-terminal end of the R15 or R16 domain, translating each construct for 20 min in an in vitro translation system (43) in the continuous presence of [35S]methionine, separating the translation products by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS/PAGE), and calculating fFL from the intensities of the bands corresponding to the full-length and arrested forms of the protein. The full amino acid sequences of all constructs are given in SI Appendix, Table S1.
Fig. 2.
Construct design. (A) Structure of the R15 (blue), R16 (red), and R17 (orange) domains of chicken brain α-spectrin (Protein Data Bank ID code 1U4Q) (24). The arrows indicate the domain boundaries and the A, B, and C helices of each domain are labeled. (B) Constructs analyzed in this study. A solid black line represents sequence derived from the LepB protein, and dashed boxes represent truncated spectrin sequences. The ends of the arrested (A) and full-length (FL) forms are indicated.
In order to distinguish which spectrin repeat is being assayed in the tandem constructs, we use the following naming convention. Constructs named “nL” (for LepB linker) are used to follow the folding of most C-terminal spectrin repeat (i.e., R15R16 nL follows the folding of R16) and constructs named “nT” (for Truncated spectrin linker) follow the folding of the penultimate spectrin repeat (i.e., R15R16 nT follows the folding of R15).
The results are shown in Fig. 3 A–C. These include previous measurements collected for R16 nL and R15R16 nL (9) along with additional replicates and newly collected data for R15 nL. To facilitate comparison of the results, the Lonset, Lmax, and Lend values extracted from each force profile are summarized in SI Appendix, Fig. S1; note that Lmax values for force profiles that reach saturation (ƒFL ∼ 1) cannot be accurately determined. As seen in Fig. 3B, the cotranslational folding of the R15 domain is unaffected by the presence of the upstream R14 domain (dark blue and light blue curves), while the presence of the N-terminal part of the downstream R16 domain induces a reduction in Lonset from 36 to 34 residues and a marked increase in the amplitude of the peak (compare the dark blue and purple curves). Thus, the onset of folding of R15 is sensitive to the presence of the early parts of the N-terminal α-helix of the downstream R16 domain but is not affected by the presence of the upstream R14 domain.
Fig. 3.
Cotranslational folding of spectrin domains. (A) Force profiles obtained with the SecM(Ec) AP for the R15 nL and R16 nL constructs. ƒFL is the fraction full-length protein produced, and L is linker length (i.e., the number of amino acid residues between the C-terminal end of the spectrin domain and the C-terminal Pro residue in the AP). Data for R16 nL constructs from ref. 9 along with additional replicates are presented. (B) Force profiles obtained with the SecM(Ec) AP showing the influence of R14 and R16 on the folding of R15. (C) Force profiles obtained with the SecM(Ec) AP showing the influence of R15 and R17 on the folding of R16. Data for R15R16 nL from ref. 9 along with additional replicates are presented. The reason for the abnormally low ƒFL value for the R16R17 nT construct at L = 47 is unknown. (D) Force profiles obtained with the SecM(Ec-sup1) AP showing the influence of R15 and R17 on the folding of R16. Note that the R15R16 nL and the R15R16R17 nT profiles overlap perfectly in the interval L = 23 to 29 residues.
For the R16 domain, the effects of the neighboring R15 and R17 domains are more dramatic (Fig. 3C). The presence of the upstream R15 domain leads to a reduction in Lonset from 30 to 28 residues (compare the red and magenta curves), as shown before (9). The presence of the downstream R17 domain does not appreciably affect Lonset but leads to an increase in amplitude and a shift to a higher Lend value (compare the red and orange curves). Finally, when flanked by both the R15 and R17 domains, the R16 folding transition starts at a lower Lonset and ends at a higher Lend than for the isolated R16 domain (compare the red and brown curves).
In order to more precisely define Lmax for constructs where fFL approaches 1, we substituted two amino acids in the relatively weak SecM(Ec) AP to create the stronger SecM(Ec-sup1) variant (FSTPVWISQAPPIRAGP) (44) in the R16 nL, R15R16 nL, R16R17 nT, and R15R16R17 nT constructs (Fig. 3D). Again, the presence of R15 reduces Lonset and Lmax by approximately two residues (note that the R15R16 nL and the R15R16R17 nT profiles overlap perfectly in the interval L = 23 to 31), and the presence of R17 increases Lend by six to seven residues (SI Appendix, Fig. S2). Notably, the bimodal shape of the R15R16R17 nT profile with peaks at L = 29 and 37 residues is an almost perfect match to the sum of the R15R16 nL and R16R17 nT profiles, SI Appendix, Fig. S2F, suggesting that the R16 part folds first (stabilized by the R15 C helix; cf. Fig. 2A), followed by a second folding event involving the R17 A helix (stabilized by the R16 C helix).
Interestingly, both R16 folding constructs that include parts of R17 (R15R16R17 nT and R16R17 nT) show an increase in fFL at the longest linker length (L = 61 residues) (Fig. 3C). At this point, the entire R17 A helix, the loop, and the very beginning of R17 helix B have been translated (SI Appendix, Table S1). This increase in fFL may thus signal an early interaction between helices A and B in R17.
As a control, we introduced mutations that are known to inhibit folding of R15 and R16 in vitro (45, 46) into the R15 nL, R15R16 nT, R16 nL, R16R17 nT, and R15R16R17 nT constructs (Fig. 4 and SI Appendix, Fig. S3). As expected, fFL values for the “nonfolding” R15(nf) nL and R16(nf) nL constructs were strongly reduced, but surprisingly, fFL values remained high for linker lengths between Lmax and Lend for R15(nf)R16 nT, R16(nf)R17 nT, and R15R16(nf)R17 nT constructs. Since the difference between the nL and nT series of constructs is the presence of a C-terminal “linker” derived from the N-terminal part of R16 or R17, respectively (replacing unstructured segments from LepB), this suggested that the linker itself, perhaps together with C-terminal parts of the upstream spectrin domain, might fold in the nf-mutants.
Fig. 4.
Linker effects in R15 folding. (A) Force profiles for R15 nL (blue, unbroken line) and R15R16 nT (purple, unbroken line) data from Fig. 3B compared with profiles for a R15 nonfolding (nf) mutant (F18D + I55D) in R15(nf) nL constructs (blue, dashed line) and R15(nf)R16 nT constructs (purple, dashed line). (B) R15R16 nT spectrin structure (L = 40) with the A–C helices indicated and the destabilizing mutations F18D and I55D in the R15A and C helices shown as yellow spheres. (C) Graph showing fFL values (L = 40) for constructs with increasingly long R15 parts fused to the N terminus of the 19-residue R16 nT linker (black circles; points 1, 2, 3, 4, and 5). R15 nL L = 39 (point 6) and R15(nf) nL L = 41 (point 7) are shown for comparison. Structural representations of each construct (spectrin helices are colored as in B and LepB linker sequences colored green) surround the plot. For points 1 to 4, the effect of helix-breaking insertions in the R16 A helix (SI Appendix, Table S1) in fFL values are shown as open circles (Gly–Gly), open dotted circles (Pro), and open dashed circles (Gly–Ser–Gly–Ser).
To investigate this possibility, we started from the R15R16 nT, L = 40 construct [whose corresponding nf-mutant has a much higher fFL value than the R15(nf) nL version; Fig. 4A] and successively removed helices A, B, and C from the R15 part (i.e., N-terminal truncations) (Fig. 4B). We found that removing helix R15-A or both helices R15-A and R15-B decreased fFL, but only to 0.47 and 0.36, respectively (Fig. 4C) (compare points 4, to 3, and 2). Only with the removal of all of R15, leaving only 19 amino acids of the R16 A helix, did fFL decrease to baseline (point 1). Thus, the cotranslational formation of a continuous helical structure encompassing parts of the R15 C and R16 A helices appears to cause an increase in fFL at L = 40; indeed, it is known from previous work that helices can form cotranslationally in the exit tunnel (47). The introduction of helix-breaking residues into the middle of the R16 A helix (SI Appendix, Table S1) in the R15-ABC/R16-A (point 4) and R15-BC/R16-A (point 3) constructs decreased the fFL to the same level as R15-C/R16-A (point 2) (Fig. 4C, open circles). Likewise, comparison of constructs where the R16 A helix in the R15R16 nT and R15(nf)R16 nT at L = 40 was replaced by a nonhelical segment from LepB at a comparable length (L = 39 and L = 41, respectively) led to reductions in fFL (compare points 5 and 6 to 4 and 7). Similar behavior was also observed for the R16R17 nT, L = 43 constructs (SI Appendix, Fig. S3), implying that the R17 A helix forms a cotranslational folding intermediate together with the R16 C helix, as suggested above.
We conclude that the cotranslational folding of the R15 domain is affected by its downstream but not by its upstream neighbor domain, and that folding of R16 is affected both by its upstream and downstream neighbors. Different spectrin domains thus not only fold via different folding mechanisms, but their cotranslational folding is also differently impacted by their upstream and downstream neighbor domains, a previously unappreciated feature of spectrin folding. We attribute the contribution of the downstream neighbor to the formation of helical structure encompassing helix C from the upstream domain and helix A from the downstream domain.
A Quantitative Relation Between fFL and Pulling Force.
In all FPA studies published to date, the quantitative relation between the calculated fFL values and the underlying pulling force acting on the nascent chain has remained undefined [although attempts have been made to derive it from simulations or other kinds of theoretical modeling (11, 39)]. Using the PURE translation system, Goldman et al. (6) showed that the interactions between the SecM(Ec) AP and the ribosome exit tunnel can be disrupted by a mechanical force applied through optical tweezers, and that the rate by which the translational stall induced by the SecM(Ec) AP is released, kR, increases in step with the amount of force, F, applied to the nascent chain, in a way that can be approximated by the Bell model (35) for force-induced rupture:
| [1] |
where k0 = 3 × 10−4 s−1 (95% confidence interval [CI]: 0.5 × 10−4 s−1, 20 × 10−4 s−1) is the release rate at zero pulling force, and Δx‡ = 0.4 nm (95% CI: 0.1 nm, 0.8 nm) is the distance to the transition state.
The work by Goldman et al. (6) suggested to us that approximating kR with the rate of conversion of the arrested form of a given construct to the full-length form as measured in a pulse-chase experiment would allow us to estimate the corresponding pulling force using the relation in Eq. 1 between F and kR. To this end, we did pulse-chase experiments on a range of spectrin R16 and ADR1a (7) constructs that have fFL values between 0.2 and 0.9, and fit the pulse-chase data to a first-order kinetic equation (SI Appendix, Fig. S4).
To derive an analytical relation between fFL and F, we reasoned that if fFL were measured at a single delay time (Δt) in a pulse-chase experiment, rather than under the standard continuous-labeling experimental conditions, the release rate (and hence F, according to Eq. 1) could be estimated from the following:
| [2] |
Since in the standard experiment translation can be initiated at any time during the 20-min labeling period, the delay time Δt varies from one ribosome–nascent chain complex to another. Although many factors could in principle contribute to the distribution of delay times, it turns out empirically that using an average delay time Δt = 550 s, approximately equal to half of the total incubation time, describes remarkably well the relation between the standard fFL values (20-min continuous [35S]Met labeling) and the release rates measured by pulse-chase experiments (SI Appendix, Fig. S5).
Combining Eqs. 1 and 2, one can solve for the pulling force F:
| [3] |
To obtain good estimates of the Bell parameters k0 and Δx‡ we started from the values obtained in ref. 6 and performed a local optimization by requiring that k0 (the release rate at zero force) is close to kR measured for the “zero-force” construct R16 nL27 (Fig. 5A), and that the titin I27, spectrin R16, and ADR1a force profiles in ref. 12 are well reproduced by the MD simulation also described in ref. 12. These conditions are both fulfilled by setting k0 = 3 × 10−4 s−1 (i.e., the same value as in ref. 6 and equal to kR for R16 nL27; Fig. 5A) and = 0.65 nm (well inside the 95% CI from ref. 6) (SI Appendix, Fig. S6). With these parameter values and Δt = 550 s, Eq. 3 nicely captures the relation between the force estimated from the measured kR and standard fFL values (Fig. 5B) and can hence be used to predict F from fFL. Over the interval 0.2 < fFL < 0.9, the experimental data are also well approximated by the simple linear fit F = 22 fFL – 2.2 (blue line). Obviously, Eq. 3 holds only for the 17-residue SecM(Ec) AP used here, and not for other APs of different stalling strengths (39). Presumably, Eq. 3 may be applied also in other contexts where the SecM AP is used to measure pulling forces, such as during membrane protein synthesis or translocation of charged residues across energized membranes (38, 39), although the parameters would first need to be verified to hold for in vivo experiments. We note that another approach to determining Bell parameters under a given set of conditions would be to repeat the FPA experiments using multiple APs with different resistance to force. A global fit of fFL obtained for each AP as a function of linker length L would allow simultaneous determination of the forces exerted by the protein (independent of AP) as well as AP-dependent Bell parameters. In SI Appendix, Fig. S7, we illustrate this approach using data for translocon-mediated transmembrane helix insertion into the inner membrane of E. coli from in vivo AP measurements.
Fig. 5.
Release rates and estimation of pulling forces. (A) The rate of release (kR) obtained from pulse-chase experiments (SI Appendix, Fig. S4 and Dataset S2), the fraction full-length protein (fFL) measured under standard experimental conditions (20-min incubation in PURExpress in the continuous presence of [35S]Met), and the pulling force F calculated using Eq. 1 (, ). The constructs are from Kudva et al. (12) and are colored to match those in B and SI Appendix, Figs. S4 and S5. (B) F values calculated from Eq. 1 plotted against the standard fFL values, with constructs colored as in A. The least-squares fit line is indicated by the blue line, and the analytic relation Eq. 3 between F and fFL, assuming an average delay time Δt = 550 s (approximately equal to half the standard incubation time), is shown as a red curve.
The force F estimated from pulse-chase measurements or fFL values represents the constant force that would have the same effect on the escape rate from arrest as the combined effect of the forces from both folded and unfolded states. We can compare this inferred force with the ensemble average force calculated from MD simulations, , where and are the population of, and force exerted by, the unfolded (folded) state in the simulation, respectively. In previous work (12), we calculated both fFL as well as directly from MD simulations. In SI Appendix, Fig. S8A, we show the relation between and fFL from our previous simulations (12), which is in good agreement with the relation between F and fFL calculated from Eq. 3. This agreement also confirms that the pulling forces are small enough that the force F calculated from simulated fFL values using Eq. 3 is close to the ensemble-average force determined by MD simulations; the two forces are not equal in general because of the nonlinear relation between release rate and force, such that states exerting a larger force (e.g., the folded state) contribute disproportionately to the average release rate and hence to F (a direct comparison between from the MD simulations and F calculated from Eq. 3 is given in SI Appendix, Fig. S8B).
We note that the forces exerted by a protein folding as it exits the ribosome are qualitatively different from the tensile forces such as those exerted when proteins fold in atomic force microscopy (AFM) experiments [as have been performed on spectrin before (48)]. Therefore, the force magnitudes probed by the two experiments cannot be directly compared, despite their apparent resemblance. For example, even a small force exerted on the termini of an unfolded protein by an AFM can massively slow the refolding rate because of the large distance the protein must contract against this force in order to fold. The same force magnitude would have a much smaller effect on the refolding rate of a protein attached at one end to the ribosome (11).
Discussion
α-Spectrin contains over 20 repeat domains and is produced as a single, long polypeptide. Previously, we observed that the R16 domain starts to fold while a part of its C-terminal α-helix is still in the ribosome exit tunnel (9). We now find that the cotranslational folding of R16 is affected both by the presence of its upstream R15 domain and when the N-terminal part of the following R17 domain is present in the exit tunnel. In contrast, folding of the R15 domain is not affected by the upstream R14 domain, but starts at shorter linker lengths when the N-terminal end of the following R16 domain is present.
In vitro, R16 is stabilized by ∼1.7 kcal/mol by the presence of R15 (49), meaning that, during cotranslational folding, R16 can “sacrifice” a few interactions in the folding nucleus and start to fold at a shorter linker length in the R15R16 nL constructs (9).
For R15, the situation is a bit different because, in contrast to R16, its folding nucleus involves residues located at the C-terminal end of helix C (45). The presence of the upstream R14 domain thus may have little impact on Lonset, because the whole R15 domain anyway must have emerged from the exit tunnel before folding can start. On the other hand, the R15 C helix is stabilized by the presence of the early parts of the R16 A helix, possibly allowing the folding nucleus to form at a shorter linker length and reducing Lonset in the R15R16 nT constructs.
The picture that emerges is that the spectrin domains fold one after the other as they emerge from the exit tunnel, but not completely independently of one another. As seen for the R16 domain, the presence of an already folded N-terminal upstream neighbor domain not only can increase the thermodynamic stability of the folded state of an emerging domain, but can also allow the emerging domain to start folding while still partly buried in the exit tunnel. Likewise, when the C-terminal α-helix in the emerging domain is extended by a part of the N-terminal α-helix in the following domain, the folding transition can persist to longer linker lengths (as seen for R16) or start at shorter linker lengths (as seen for R15). Finally, the increase in fFL seen for the R16R17 nT and R15R16R17 nT constructs at L = 61 residues (Fig. 3C) is suggestive of an early folding intermediate in R17. Further work is required to investigate this, but it may be possible that the regular helical structure of the spectrin repeats drives a nearly continuous folding reaction, punctuated by periods when the nascent chain is lengthened but no residues are added to the folded part emerging from the ribosome. In this scenario, at most a short stretch of unfolded polypeptide would be exposed outside the ribosome at any one time, which could minimize the need for protection of the nascent chain by chaperones.
Finally, a pulse-chase analysis has allowed us to measure the release rate kR from the stalled state for different R16 and ADR1a constructs, making it possible to estimate the magnitude of the pulling forces exerted on the AP for different linker lengths (Fig. 5A). This in turn makes it possible to derive expressions for how the magnitude pulling force F depends on fFL (as obtained from our standard continuous-labeling experimental protocol) (Eqs. 2 and 3). In general, we find that the folding of protein domains such as spectrin R16 and ADR1a can generate a maximal force of 15 to 20 pN on the AP (Fig. 5), in line with theoretical estimates based on MD simulations (40, 50).
Materials and Methods
All of the enzymes used for molecular biology were obtained from New England Biolabs. The PUREfrex in vitro transcription-translation system was produced by Eurogentec and purchased via BioNordika. The PURExpress in vitro transcription–translation system was purchased from NEB. GeneJet polymerase chain reaction (PCR) clean-up kit was purchased from Thermo Fisher Scientific. Precast NuPAGE gels and running buffers were purchased from Invitrogen. Homemade gels were cast using acrylamide-bisacrylamide mix, Tris, and glycine from VWR. Filter paper for gel drying was from Whatman. All other chemicals were purchased from Sigma.
Cloning.
The cDNAs for spectrin R15 and R16 domains were kindly provided by Jane Clarke, University of Cambridge, Cambridge, United Kingdom, and the spectrin R17 and R14 domains were ordered as DNA fragments from Eurofins Genomics GmbH. All protein sequences used in this publication are presented in SI Appendix, Table S1. We used two different construct types: those with an unrelated linker sequence, labeled nL, and those with a truncated spectrin linker sequence, labeled nT (Fig. 2B). For nL constructs, the respective spectrin domain or domains were cloned into the pET19b plasmid upstream from a Ser–Gly–Ser–Gly sequence attached to a linker sequence derived from the P2 domain of leader peptidase (LepB) followed by the 17-residue AP from the E. coli SecM protein (FSTPVWISQAQGIRAGP) herein referred to as SecM(Ec), and a further 23 amino acids derived from LepB. The shortest constructs (nL/nT 21) include only the SGSG sequence fused to the SecM(Ec) AP. For nT constructs, the respective spectrin domains were cloned into the same pET19b plasmid without a LepB linker sequence, but including the SGSG sequence immediately before the SecM(Ec) AP. The most C-terminal spectrin domain was then sequentially truncated via partially overlapping inverse PCR. For some constructs, a full-length (FL) control was created by changing the critical Pro at the C-terminal end of the SecM AP to Ala, thereby abolishing arrest (51) and yielding only full-length protein. The SecM(Ec-sup1) variants were created by site-directed mutagenesis of the constructs described above. N-terminal truncations of R15 and R16 were carried out by partially overlapping inverse PCR, and insertion of the helix breaking insertions was carried out by site-directed mutagenesis where the inserted residues were included in the primer sequence. All sequences were confirmed by DNA sequencing (Eurofins Genomics GmbH).
In Vitro Expression for FPA.
Expression and analysis was carried out as described previously (9). Briefly, a linear DNA product is created from each construct plasmid by PCR using Q5 polymerase with forward and reverse primers that anneal to the T7 promoter and terminator regions, respectively. Following PCR cleanup (using the manufacturer’s instructions), the product is confirmed by agarose gel electrophoresis. In vitro transcription and translation are carried out in either the PUREfrex of PURExpress commercial systems (mixed according to the manufacturer’s recommendations). One microliter of the PCR product and ∼10 µCi of [35S]methionine are mixed for a 10-µL PUREfrex reaction or 0.5 µL of PCR product and ∼5 µCi of [35S]methionine are mixed for a 5-µL PURExpress reaction, followed by incubation at 37 °C for 20 min at 600 rpm shaking. Translation is halted by the addition of an equal volume of ice-cold 10% trichloroacetic acid (TCA), followed by incubation on ice for at least 30 min. Total protein is sedimented by centrifugation at 4 °C for 5 min at 20,000 × g. The supernatant is carefully removed and the pellet is resuspended in a suitable volume of 1× SDS/PAGE sample buffer (62.5 mM Tris⋅HCl, pH 6.8, 10% glycerol, 2.5% SDS, 5% β-ME, 0.02% bromophenol blue, and 25 mM NaOH [NaOH is added to neutralize any remaining TCA]) by shaking at 37 °C and 1,000 rpm for at least 5 min. The prolyl-tRNA that remains attached due to SecM arrest is digested by the addition of 4 µg of RNase I (2 µL of a 4 µg/µL solution), followed by incubation at 37 °C and 600 rpm for 30 min.
Quantifications.
Following a brief centrifugation to remove any remaining insoluble material, the sample is loaded onto an appropriate SDS/PAGE gel (12% Tris-glycine gels were used for two- and three-spectrin domain constructs and 16% or 18% Tris-tricine gels were used for single-spectrin domains and the N-terminal truncations). Following electrophoresis, the gels are dried onto thick filter paper by heating under vacuum (Bio-Rad model 583 or Hoefer GD 2000), a radioactive molecular weight ladder included in the gel is visualized by spotting the filter paper with a ∼1:1,000 solution of [35S]methionine in 1× SDS/PAGE sample buffer, and the gel is exposed to a phosphorimager screen for 12 to 72 h depending on the strength of the signal. The screen is imaged using a Fujifilm FLA9000 (50-µm pixels), and densitometry analysis on the resultant raw image (TIFF format) file is carried out using FIJI (ImageJ) software. The densitometry values are quantified using our in-house EasyQuant software and the fraction full-length protein, , is calculated from the intensities of the full-length (IFL) and arrested (IA) bands. See SI Appendix, Fig. S9, for examples of gels. Independent replicate in vitro translation reactions were carried out for all spectrin constructs in the folding peaks (Dataset S1 [191 unique constructs and 405 independent data points]). The majority of the data points in the folding peaks were collected in triplicate, with the remaining points collected in duplicate. A small number of data points outside of the folding peaks were collected as single measurements to save costs.
Pulse-Chase Experiments.
The rate at which translation recommences following the arrest of various constructs was measured for five R16 constructs and two ADR1 constructs (7) chosen to represent the range of fFL values measured during typical experimental conditions. Pulse-chase experiments were carried out using the PURExpress system. The 75-µL reactions were mixed according to the manufacturer’s instructions, with the addition of [35S]methionine. Following a 5-min incubation at 37 °C (pulse), an excess of unlabeled methionine was added, and incubation at 37 °C was continued. At discrete time points, a 10-µL aliquot of the reaction was removed and mixed with 15 µL of ice-cold 10% TCA and further processed as above. Three independent replicates were collected for each construct (Dataset S2). The rate calculated is for the conversion of arrested protein to full-length protein:
The rate of this irreversible conversion can be calculated using a first-order equation:
where fA(t) is the fraction arrested protein at a given time, , and fA(0) is the fraction arrested protein at . Prism 8 (GraphPad Software) was used to calculate the nonlinear regression fit of the kinetic equation to each dataset [R16 nL-27, -29, -31, -33, and -37, and ADR1 L25 and 27 (7)]. The rates were used to calculate the force exerted by the folding domain on the arrested nascent chain as detailed in the main text.
Optimization of the Bell Parameters.
The coarse-grained MD simulations of cotranslational folding of R16 and ADR1a constructs in WT, ΔuL23, and ΔuL24 ribosomes used here were originally reported in ref. 12. For each construct and each linker length, these simulations were used to calculate the ensemble average force , where and are the population of, and force exerted by, the unfolded (folded) state in the simulation, respectively. In order to determine the Bell parameters k0 and in Eq. 1 in the main text, we set k0 = 3 × 10−4 s−1 (i.e., the same value as estimated in ref. 6, and equal to kR for the “zero force” construct R16 nL27; Fig. 5A) and explored a range of values around the approximate value = 0.4 nm given in ref. 6. For each value, we calculated an average kR over unfolded and folded states as and then obtained fFL (setting Δt = 550 s) from Eq. 2 in the main text. The simulated fFL values (i.e., the simulated force profiles) obtained in this way were then compared with the experimental force profiles reported in ref. 12 for the I27 and R16 domains expressed with WT, ΔuL23, and ΔuL24 ribosomes, and for the ADR1a domain expressed with WT and ΔuL24 ribosomes. The parameter values k0 = 3 × 10−4 s−1, Δx‡ = 0.65 nm were found to give the best fit between the simulated and experimental force profiles (SI Appendix, Fig. S7).
Data Availability.
The quantified raw data points are provided in Datasets S1 and S2.
Supplementary Material
Acknowledgments
This work was supported by grants from the Knut and Alice Wallenberg Foundation (2012.0282), the Swedish Cancer Society (15 0888), and the Swedish Research Council (621-2014-3713) to G.v.H., and from the Intramural Research program of the National Institute of Diabetes and Digestive and Kidney Diseases of the NIH to P.T. and R.B.B. This work utilized the computational resources of the NIH High-Performance Computing (HPC) Biowulf cluster (https://hpc.nih.gov/). We thank Dr. Jane Clarke and Dr. Adrian Nickson (University of Cambridge, Cambridge, United Kingdom) for providing plasmids with spectrin DNA and Dr. Rickard Hedman for programming and maintenance of the EasyQuant software.
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.1909683117/-/DCSupplemental.
References
- 1.O’Brien E. P., Christodoulou J., Vendruscolo M., Dobson C. M., New scenarios of protein folding can occur on the ribosome. J. Am. Chem. Soc. 133, 513–526 (2011). [DOI] [PubMed] [Google Scholar]
- 2.Nakatogawa H., Ito K., Secretion monitor, SecM, undergoes self-translation arrest in the cytosol. Mol. Cell 7, 185–192 (2001). [DOI] [PubMed] [Google Scholar]
- 3.Narayanan C. S., Dubnau D., Demonstration of erythromycin-dependent stalling of ribosomes on the ermC leader transcript. J. Biol. Chem. 262, 1766–1771 (1987). [PubMed] [Google Scholar]
- 4.Cymer F., Hedman R., Ismail N., von Heijne G., Exploration of the arrest peptide sequence space reveals arrest-enhanced variants. J. Biol. Chem. 290, 10208–10215 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shanmuganathan V., et al. , Structural and mutational analysis of the ribosome-arresting human XBP1u. eLife, 10.7554/eLife.46267 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Goldman D. H. et al., Ribosome. Mechanical force releases nascent chain-mediated ribosome arrest in vitro and in vivo. Science 348, 457–460 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Nilsson O. B. et al., Cotranslational protein folding inside the ribosome exit tunnel. Cell Rep. 12, 1533–1540 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nilsson O. B., Müller-Lucks A., Kramer G., Bukau B., von Heijne G., Trigger factor reduces the force exerted on the nascent chain by a cotranslationally folding protein. J. Mol. Biol. 428, 1356–1364 (2016). [DOI] [PubMed] [Google Scholar]
- 9.Nilsson O. B. et al., Cotranslational folding of spectrin domains via partially structured states. Nat. Struct. Mol. Biol. 24, 221–225 (2017). [DOI] [PubMed] [Google Scholar]
- 10.Farías-Rico J. A., Goetz S. K., Marino J., von Heijne G., Mutational analysis of protein folding inside the ribosome exit tunnel. FEBS Lett. 591, 155–163 (2017). [DOI] [PubMed] [Google Scholar]
- 11.Tian P. et al., Folding pathway of an Ig domain is conserved on and off the ribosome. Proc. Natl. Acad. Sci. U.S.A. 115, E11284–E11293 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kudva R. et al., The shape of the bacterial ribosome exit tunnel affects cotranslational protein folding. eLife 7, e36326 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Farías-Rico J. A., Ruud Selin F., Myronidi I., Frühauf M., von Heijne G., Effects of protein size, thermodynamic stability, and net charge on cotranslational folding on the ribosome. Proc. Natl. Acad. Sci. U.S.A. 115, E9280–E9287 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Notari L., Martínez-Carranza M., Farías-Rico J. A., Stenmark P., von Heijne G., Cotranslational folding of a pentarepeat β-helix protein. J. Mol. Biol. 430, 5196–5206 (2018). [DOI] [PubMed] [Google Scholar]
- 15.Kemp G., Kudva R., de la Rosa A., von Heijne G., Force-profile analysis of the cotranslational folding of HemK and filamin domains: Comparison of biochemical and biophysical folding assays. J. Mol. Biol. 431, 1308–1314 (2019). [DOI] [PubMed] [Google Scholar]
- 16.Winkelmann J. C., Forget B. G., Erythroid and nonerythroid spectrins. Blood 81, 3173–3185 (1993). [PubMed] [Google Scholar]
- 17.Machnicka B., Grochowalska R., Bogusławska D. M., Sikorski A. F., Lecomte M. C., Spectrin-based skeleton as an actor in cell signaling. Cell. Mol. Life Sci. 69, 191–201 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Xu K., Zhong G., Zhuang X., Actin, spectrin, and associated proteins form a periodic cytoskeletal structure in axons. Science 339, 452–456 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Liem R. K., Cytoskeletal integrators: The spectrin superfamily. Cold Spring Harb. Perspect. Biol. 8, a018259 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hill S. A., Kwa L. G., Shammas S. L., Lee J. C., Clarke J., Mechanism of assembly of the non-covalent spectrin tetramerization domain from intrinsically disordered partners. J. Mol. Biol. 426, 21–35 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ipsaro J. J. et al., Crystal structure and functional interpretation of the erythrocyte spectrin tetramerization domain complex. Blood 115, 4843–4852 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.An X. et al., Thermal stabilities of brain spectrin and the constituent repeats of subunits. Biochemistry 45, 13670–13676 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.An X. et al., Conformational stabilities of the structural repeats of erythroid spectrin and their functional implications. J. Biol. Chem. 281, 10527–10532 (2006). [DOI] [PubMed] [Google Scholar]
- 24.Kusunoki H., Minasov G., Macdonald R. I., Mondragón A., Independent movement, dimerization and stability of tandem repeats of chicken brain α-spectrin. J. Mol. Biol. 344, 495–511 (2004). [DOI] [PubMed] [Google Scholar]
- 25.Patra M., Mukhopadhyay C., Chakrabarti A., Probing conformational stability and dynamics of erythroid and nonerythroid spectrin: Effects of urea and guanidine hydrochloride. PLoS One 10, e0116991 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Scott K. A., Batey S., Hooton K. A., Clarke J., The folding of spectrin domains I: Wild-type domains have the same stability but very different kinetic properties. J. Mol. Biol. 344, 195–205 (2004). [DOI] [PubMed] [Google Scholar]
- 27.MacDonald R. I., Cummings J. A., Stabilities of folding of clustered, two-repeat fragments of spectrin reveal a potential hinge in the human erythroid spectrin tetramer. Proc. Natl. Acad. Sci. U.S.A. 101, 1502–1507 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kwa L. G. et al., The folding of a family of three-helix bundle proteins: Spectrin R15 has a robust folding nucleus, unlike its homologous neighbours. J. Mol. Biol. 426, 1600–1610 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rief M., Pascual J., Saraste M., Gaub H. E., Single molecule force spectroscopy of spectrin repeats: Low unfolding forces in helix bundles. J. Mol. Biol. 286, 553–561 (1999). [DOI] [PubMed] [Google Scholar]
- 30.Lenne P. F., Raae A. J., Altmann S. M., Saraste M., Hörber J. K. H., States and transitions during forced unfolding of a single spectrin repeat. FEBS Lett. 476, 124–128 (2000). [DOI] [PubMed] [Google Scholar]
- 31.MacDonald R. I., Pozharski E. V., Free energies of urea and of thermal unfolding show that two tandem repeats of spectrin are thermodynamically more stable than a single repeat. Biochemistry 40, 3974–3984 (2001). [DOI] [PubMed] [Google Scholar]
- 32.Law R. et al., Cooperativity in forced unfolding of tandem spectrin repeats. Biophys. J. 84, 533–544 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wensley B. G. et al., Experimental evidence for a frustrated energy landscape in a three-helix-bundle protein family. Nature 463, 685–688 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wensley B. G. et al., Separating the effects of internal friction and transition state energy to explain the slow, frustrated folding of spectrin domains. Proc. Natl. Acad. Sci. U.S.A. 109, 17795–17799 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bell G. I., Models for the specific adhesion of cells to cells. Science 200, 618–627 (1978). [DOI] [PubMed] [Google Scholar]
- 36.Ito K., Chiba S., Arrest peptides: Cis-acting modulators of translation. Annu. Rev. Biochem. 82, 171–202 (2013). [DOI] [PubMed] [Google Scholar]
- 37.Butkus M. E., Prundeanu L. B., Oliver D. B., Translocon “pulling” of nascent SecM controls the duration of its translational pause and secretion-responsive secA regulation. J. Bacteriol. 185, 6719–6722 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ismail N., Hedman R., Schiller N., von Heijne G., A biphasic pulling force acts on transmembrane helices during translocon-mediated membrane integration. Nat. Struct. Mol. Biol. 19, 1018–1022 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ismail N., Hedman R., Lindén M., von Heijne G., Charge-driven dynamics of nascent-chain movement through the SecYEG translocon. Nat. Struct. Mol. Biol. 22, 145–149 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Leininger S. E., Trovato F., Nissley D. A., O’Brien E. P., Domain topology, stability, and translation speed determine mechanical force generation on the ribosome. Proc. Natl. Acad. Sci. U.S.A. 116, 5523–5532 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.O’Brien E. P., Vendruscolo M., Dobson C. M., Prediction of variable translation rate effects on cotranslational protein folding. Nat. Commun. 3, 868 (2012). [DOI] [PubMed] [Google Scholar]
- 42.Batey S., Randles L. G., Steward A., Clarke J., Cooperative folding in a multi-domain protein. J. Mol. Biol. 349, 1045–1059 (2005). [DOI] [PubMed] [Google Scholar]
- 43.Shimizu Y. et al., Cell-free translation reconstituted with purified components. Nat. Biotechnol. 19, 751–755 (2001). [DOI] [PubMed] [Google Scholar]
- 44.Yap M. N., Bernstein H. D., The plasticity of a translation arrest motif yields insights into nascent polypeptide recognition inside the ribosome tunnel. Mol. Cell 34, 201–211 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Scott K. A., Randles L. G., Clarke J., The folding of spectrin domains II: Phi-value analysis of R16. J. Mol. Biol. 344, 207–221 (2004). [DOI] [PubMed] [Google Scholar]
- 46.Wensley B. G., Gärtner M., Choo W. X., Batey S., Clarke J., Different members of a simple three-helix bundle protein family have very different folding rate constants and fold by different mechanisms. J. Mol. Biol. 390, 1074–1085 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Tu L. W., Deutsch C., A folding zone in the ribosomal exit tunnel for Kv1.3 helix formation. J. Mol. Biol. 396, 1346–1360 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Randles L. G., Rounsevell R. W. S., Clarke J., Spectrin domains lose cooperativity in forced unfolding. Biophys. J. 92, 571–577 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Batey S., Clarke J., The folding pathway of a single domain in a multidomain protein is not affected by its neighbouring domain. J. Mol. Biol. 378, 297–301 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Fritch B. et al., Origins of the mechanochemical coupling of peptide bond formation to protein synthesis. J. Am. Chem. Soc. 140, 5077–5087 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Nakatogawa H., Ito K., The ribosomal exit tunnel functions as a discriminating gate. Cell 108, 629–636 (2002). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The quantified raw data points are provided in Datasets S1 and S2.





