Skip to main content
eLife logoLink to eLife
. 2021 Feb 8;10:e64302. doi: 10.7554/eLife.64302

Residue-by-residue analysis of cotranslational membrane protein integration in vivo

Felix Nicolaus 1,, Ane Metola 1,, Daphne Mermans 1,, Amanda Liljenström 1, Ajda Krč 1,2, Salmo Mohammed Abdullahi 1, Matthew Zimmer 3, Thomas F Miller III 3, Gunnar von Heijne 1,4,
Editors: Randy Schekman5, Olga Boudker6
PMCID: PMC7886326  PMID: 33554862

Abstract

We follow the cotranslational biosynthesis of three multispanning Escherichia coli inner membrane proteins in vivo using high-resolution force profile analysis. The force profiles show that the nascent chain is subjected to rapidly varying pulling forces during translation and reveal unexpected complexities in the membrane integration process. We find that an N-terminal cytoplasmic domain can fold in the ribosome exit tunnel before membrane integration starts, that charged residues and membrane-interacting segments such as re-entrant loops and surface helices flanking a transmembrane helix (TMH) can advance or delay membrane integration, and that point mutations in an upstream TMH can affect the pulling forces generated by downstream TMHs in a highly position-dependent manner, suggestive of residue-specific interactions between TMHs during the integration process. Our results support the ‘sliding’ model of translocon-mediated membrane protein integration, in which hydrophobic segments are continually exposed to the lipid bilayer during their passage through the SecYEG translocon.

Research organism: E. coli

Introduction

Most integral membrane proteins are cotranslationally integrated into their target membrane with the help of translocons such as bacterial SecYEG and YidC, and the eukaryotic Sec61 and EMC complexes (Rapoport et al., 2017; Chitwood et al., 2018). While the energetics of translocon-mediated integration of a transmembrane α-helix (TMH) is reasonably well understood (Hessa et al., 2007), the actual integration process is not, other than in general terms. We have shown that force profile analysis (FPA) – a method in which a translational arrest peptide (AP) engineered into a target protein serves as a sensor to measure the force exerted on a nascent polypeptide chain during translation – can be used to follow the cotranslational folding of soluble proteins and the membrane integration of a model TMH (Ismail et al., 2012; Ismail et al., 2015; Farías-Rico et al., 2018). Here, we have applied FPA and coarse-grained molecular dynamics (CGMD) simulations to follow the cotranslational membrane integration of three multispanning Escherichia coli inner membrane proteins of increasing complexity (EmrE, GlpG, BtuC), providing the first residue-by-residue data on membrane protein integration in vivo.

Results

Force profile analysis

FPA takes advantage of the ability of APs to bind in the upper parts of the ribosome exit tunnel and thereby pause translation when their last codon is in the ribosomal A-site (Ito and Chiba, 2013). The duration of an AP-induced pause is reduced in proportion to pulling forces exerted on the nascent chain (Goldman et al., 2015; Kemp et al., 2020), that is, APs can act as force sensors and can be tuned by mutation to react to different force levels (Cymer et al., 2015a). In an FPA experiment, a series of constructs is made in which a force-generating sequence element (e.g., a TMH) is placed an increasing number of residues away from an AP (reflected in N, the number of residues from the start of the protein to the end of the AP), which in turn is followed by a C-terminal tail (Figure 1a). In constructs where the TMH engages in an interaction that generates a strong enough pulling force F on the nascent chain at the point when the ribosome reaches the last codon of the AP, pausing will be prevented and mostly full-length protein will be produced during a short pulse with [35S]-Met (Figure 1b, middle). In contrast, in constructs where little force is exerted on the AP, pausing will be efficient and more of the arrested form of the protein will be produced (Figure 1b, left and right). The fraction full-length protein produced, fFL = IFL/(IFL+IA), where IFL and IA are the intensities of the bands representing the full-length (FL) and arrested (A) species on an SDS-PAGE gel (Figure 1c and Figure 1—figure supplement 1), can therefore be used as a proxy for F in a given construct (Kemp et al., 2020; Niesen et al., 2018; Leininger et al., 2019). A plot of fFL versus N – a force profile (FP) – thus can provide a detailed picture of the cotranslational process in question, as reflected in the variation in the force exerted on the nascent chain during translation. FPs can be recorded with up to single-residue resolution by increasing N in steps of one residue (corresponding to a lengthening of the nascent chain by ~3 Å).

Figure 1. The force profile assay.

(a) Basic construct. Arrested (A) and full-length (FL) products are indicated. (b) At construct length N1, TMH2 has not yet entered the SecYEG channel and no pulling force F is generated. At N2, TMH2 is integrating into the membrane and F ≫0. At N3, TMH2 is already integrated and F ≈ 0. (c) SDS-PAGE gels showing A and FL products for [35S]-Met labeled and immunoprecipitated EmrE(Cout) (N = 105), GlpG (N = 196), and BtuC (N = 314). Control constructs AC and FLc have, respectively, a stop codon and an inactivating Ala codon replacing the last Pro codon in the arrest peptide (AP). The band just below the A band in the EmrE(Cout) (N = 105) lane most likely represents ribosomes stacked behind the AP-stalled ribosomes (Notari et al., 2018) and is not included in the calculation of fFL. See Figure 1—figure supplement 1 for additional gels.

Figure 1.

Figure 1—figure supplement 1. Gel gallery with selected EmrE (a–e), GlpG (f–l), and BtuC (m–r) constructs.

Figure 1—figure supplement 1.

Full-length (FL) and arrested (A) products are indicated by black and white circles, respectively. Repeat experiments are indicated by single and double lines above the lanes for EmrE (a–e), by short lines beneath the lanes for GlpG (f–i), and are on neighboring gels for BtuC. For BtuC, some full-length (FLc) and arrest (Ac) controls are included (c.f., Figure 1c); these are for the construct immediately to the right of the control construct lanes. GlpG (j, k) show GlpG constructs (indicated by –) and the corresponding LepB-GlpG constructs (indicated by +) for N = 136–196. For N = 141–186, GlpG constructs have the expected full-length band (lower gray circles) that runs slightly faster than the corresponding LepB-GlpG construct (black circles), plus an extra band of unknown provenance that runs more slowly than the full-length LepB-GlpG construct (upper gray circles). The GlpG fFL values shown in Figure 3—figure supplement 1a were calculated by assigning only the lower (dashed magenta curve), or both (dashed green curve), of the bands indicated by gray circles as full-length product. Panel l shows a repeat experiment for the LepB-GlpG constructs included in panels j and k.

EmrE: 4 TMHs, 110 residues

We chose EmrE as an example of a small, relatively simple 4-TMH protein. EmrE is a dual-topology protein, that is, the monomers integrate into the inner membrane in a 50–50 mixture of Nin-Cin and Nout-Cout topologies; two oppositely oriented monomers then assemble into an antiparallel dimer (Chen et al., 2007; Rapp et al., 2007). To avoid potential complications caused by the dual topology, we used EmrE(Cout), a mutant version that adopts the Nout-Cout topology (Rapp et al., 2007), and further used the relatively weak SecM(Ec) AP (Ismail et al., 2012) and included an HA tag for immunoprecipitation (Figure 2a). A series of EmrE(Cout)-AP constructs (see Supplementary file 1 for sequences) was used to obtain the FP shown in Figure 2b (orange curve), at 2–5 residues resolution. Also shown is an FP derived from a CGMD simulation (CGMD-FP, gray; Van Lehn et al., 2015); a hydrophobicity plot (HP) is included in Figure 2—figure supplement 1.

Figure 2. EmrE(Cout).

(a) Construct design. EmrE(Cout) is shortened from the C-terminal end of the LepB-derived linker (dotted), as indicated by the arrow. Cytoplasmic (red) and periplasmic (blue) loops, and lengths of full-length EmrE(Cout), LepB-derived linker, HA tag + arrest peptide (AP), and C-terminal tail, are indicated. Since the 30-residue HA + AP segment is constant in all constructs, the force profile (FP) reflects nascent chain interactions occurring mainly outside the ribosome exit tunnel. (b) FPs for EmrE(Cout) (orange), EmrE(Cout,E14L) (green), EmrE(Cout) with SecM(Ec-sup1) AP (blue), EmrE(Cout, I37I38→NN) (magenta triangles), and coarse-grained molecular dynamics (CGMD-FP) calculated with a −100 mV membrane potential (gray). (c) Effects of mutations in E14 on fFL values for the N values are indicated by arrows in (b). p-values (two-sided t-test): *p < 0.05; **p < 0.01; ***p < 0.001. (d, e) Sequences corresponding to peaks I–IV aligned from their Nstart (d) and Nend (e) values. The + sign indicates 45 residues from the polypeptide transferase center (PTC). Hydrophobic transmembrane helix (TMH) segments are shown in orange and transmembrane α-helices underlined (PDB: 3B5D). Error bars in b and c indicate SEM values.

Figure 2.

Figure 2—figure supplement 1. EmrE(Cout).

Figure 2—figure supplement 1.

As in Figure 2b, but with a hydrophobicity plot (HP) (ΔG) calculated by TOPCONS (3, 50) (gray). Since the HP represents the membrane integration energy, and the force profile (FP) the force generated during integration, the two profiles have been aligned such that peaks in the FP approximately align with maxima in the derivative of the HP.

We have previously shown that a model TMH composed of Ala and Leu residues generates a peak in an FP recorded with the SecM(Ec) AP that reaches half-maximal amplitude (Nstart) when the N-terminal end of the TMH is ~45 residues away from the polypeptide transferase center (PTC) (Ismail et al., 2012), and a recent real-time FRET study of cotranslational membrane integration found that the N-terminal end of the first TMH in a protein reaches the vicinity of the SecYEG translocon when it is 40–50 residues away from the PTC (Mercier et al., 2020). For EmrE(Cout) TMH1, this would correspond to constructs with N ≈ 50. However, the fFL values are hardly above background in this region of the FP. Due to the functionally important E14 residue, TMH1 is only marginally hydrophobic and does not become firmly embedded in the membrane until the protein dimerizes (Seurig et al., 2019). To ascertain whether the lack of a peak in the FP corresponding to the membrane integration of TMH1 is because of its low hydrophobicity, we mutated E14 to L. Indeed, in the FP obtained for EmrE(Cout,E14L) (Figure 2b, green curve), a clear peak appears at the expected chain length Nstart ≈ 50 residues. Mutation E14A yields an fFL value intermediate between EmrE(Cout,E14L) and EmrE(Cout) at N = 55 (Figure 2c), while fFL for the mutants EmrE(Cout,E14D) and EmrE(Cout,E14Q) is the same as for EmrE(Cout).

Peak II has Nstart ≈ 76, corresponding to a situation where the N-terminal end of TMH2 is ~45 residues from the PTC (Figure 2d). The double mutation I37I38→NN in TMH2 reduces fFL at N = 80 and 85 (magenta triangles), as expected. Unexpectedly, however, the E14L, E14A, and E14Q (but not the E14D) mutations in TMH1 increase fFL at N = 85 (Figure 2c), showing that a negatively charged residue (D or E) in position 14 in TMH1 specifically reduces the pulling force generated by TMH2 at N = 85, that is, when about one-half of TMH2 has integrated into the membrane. Likewise, fFL values at N = 115 and 130 (but not at N = 105, included as a negative control) are specifically affected by mutations in E14: at N = 115 (one-half of TMH3 integrated), all four mutations in position 14 increase fFL relative to E14, while at N = 130 (beginning of TMH4 integration) the E14A and E14L mutations decrease fFL (Figure 2c). FPA thus reveals long-range effects of mutations in E14 on three specific steps in the membrane integration of the downstream TMHs. This implies that TMH1 remains in the vicinity of the translocon and that E14 makes specific interactions with residues in the TMH2–TMH4 region during the membrane integration process. Further studies will be required to pinpoint these interactions and understand the role played by the slow dynamics of TMH1 integration (Seurig et al., 2019).

Peak III has Nstart ≈ 102 residues, with the N-terminal end of TMH3 ~45 residues from the PTC (Figure 2d). Peak IV is difficult to locate precisely in the FP because fFL values are high throughout the TMH3–TMH4 region, but is seen at Nstart ≈ 132 residues when the strong SecM(Ec-sup1) AP (Yap and Bernstein, 2009) is used (blue curve), again with the N-terminal end of TMH4 ~45 residues from the PTC (Figure 2d). As shown in Figure 2e, the TMHs cease generating a pulling force when their C-terminal ends are ~45 residues away from the PTC, indicating that they are fully integrated at this point.

GlpG: 6 TMHs, 276 residues

We next studied GlpG, a medium-sized monomeric 6-TMH rhomboid protease with an ~60 residue cytoplasmic N-terminal domain (NTD) (Sherratt et al., 2012; Wang et al., 2006) (Figure 3a),  a protein that allows us to follow the cotranslational folding of a soluble domain and integration of a membrane domain in the same experiment.

Figure 3. GplG.

(a) Construct design, c.f., Figure 2a. The N-terminal LepB fusion is indicated. (b) Force profiles (FPs) for GlpG and LepB-GlpG (N = 131–224) (orange), NTD(F16E) (green), in vitro translated N-terminal domain (NTD) (magenta), and NTD(F16E) (black), LepB-GlpG with SecM(Ec-Sup1) AP (blue), and coarse-grained molecular dynamics (CGMD)-FP calculated with a −100 mV membrane potential (gray). Error bars indicate SEM values. Note that the LepB-GlpG constructs are two residues shorter than the corresponding GlpG constructs but are plotted with the same N values as the latter to facilitate comparison. (c) NTD (PDB ID: 2LEP), with F16 in spacefill. (d) Enlarged FPs for LepB-GlpG with SecM(Ec) AP (orange), SecM(Ec-Ms) AP (green), SecM(Ec-sup1) AP (blue), and GlpG(Y138F139L143→NNN) with SecM(Ec-Ms) AP (magenta). CGMD-FP in gray. (e) Structure of GlpG with the periplasmic surface helix in blue, TMH2 in red, the membrane-associated cytoplasmic segment in cyan, and TMH5 in yellow. Y138F139L143 and G222I223Y224L225 are shown as sticks. (f) LepB-GlpG peak III-a and III-c sequences aligned, respectively, from their Nstart and Nmax values, and the mutant LepB-GlpG(Y138F139L143→NNN) peak III-c sequence aligned from its Nmax value. Hydrophobic transmembrane helix (TMH) segments are shown in orange and transmembrane α-helices (PDB: 2IC8)underlined. The periplasmic surface helix is italicized. AP: arrest peptide; PTC: polypeptide transferase center.

Figure 3.

Figure 3—figure supplement 1. GlpG.

Figure 3—figure supplement 1.

(a) fFL values for peak III obtained for LepB-GlpG fusion constructs (orange) and GlpG constructs calculated either including (dashed green) or excluding (dashed magenta) the slowly migrating band indicated in Figure 1—figure supplement 1j,k in IFL. The two latter are from single measurements. (b) As in Figure 3b, but with a hydrophobicity plot (HP) (ΔG) calculated by TOPCONS (Hessa et al., 2007; Tsirigos et al., 2015) (gray). (c) Sequences corresponding to peaks II–VII aligned based on the Nstart values. The periplasmic surface helix upstream of TMH2 and the hydrophobic patch upstream of TMH5 are in italics. Hydrophobic transmembrane helix (TMH) segments are shown in orange and membrane-embedded α-helices underlined. (d) Sequences corresponding to peaks II–VII aligned based on the Nend values. Hydrophobic TMH segments are shown in orange and membrane-embedded α-helices underlined. PTC: polypeptide transferase center.

The FP is shown in Figure 3b (orange curve). It was obtained at 5-residue resolution, except for the portion N = 168–224, which we measured with single-residue resolution. For unknown reasons, constructs with N ≈ 140–190 residues gave rise to a slowly migrating band on the gel that was difficult to interpret (Figure 1—figure supplement 1j,k); this problem did not arise when the NTD (GlpG residues 1–60) was replaced by residues 1–58 of the LepB protein (Figure 3a), and the corresponding fFL values are shown in the FP (N = 131–224). The LepB part contains an N-terminal, Nout-Cin-oriented TMH (Wolfe et al., 1983; von Heijne, 1989), that interacts with the signal recognition particle Ffh (Schibich et al., 2016) and hence targets the LepB-GlpG constructs to the SecYEG translocon before GlpG TMH1 is translated. This could in principle affect the FP; however, because the C-terminal end of the LepB part is ≥70 residues away from the C-terminal end of the SecM AP in these constructs, LepB is far outside the ribosome exit tunnel and therefore unlikely to exert a strong effect. Indeed, fFL values for GlpG (calculated either including or excluding the slowly migrating band in IFL) and LepB-GlpG are very similar in the peak III region (N = 166–231) of the FP (Figure 3—figure supplement 1a). Nstart and Nend values for peaks II–VII are indicated in Figure 3—figure supplement 1c,d.

Peak I, at Nstart ≈ 84 residues, is conspicuously close to what would be expected for the folding of the NTD from previous studies of cotranslational folding of small globular domains in the ribosome exit tunnel (Farías-Rico et al., 2018). To verify that the peak indeed represents folding of the NTD, we recorded an FP for the NTD by in vitro transcription-translation in the PURE system (Shimizu et al., 2005) and further made a destabilizing point mutation (F16E) in the core of the NTD (Figure 3c). The FP obtained in vitro (magenta) overlaps peak I in the in vivo FP, and the mutation strongly reduces fFL values for peak I both in vivo (green) and in vitro (black). Given that the NTD has a relative contact order of 15% and is predicted to fold on the ms time scale (Plaxco et al., 1998) while the elongation cycle on the ribosome takes ~100 ms/codon (Young and Bremer, 1976), the NTD has ample time to equilibrate between the unfolded and accessible folded states at each elongation step (Kemp et al., 2019). We conclude that the ~60 residue NTD folds inside the ribosome exit tunnel when its C-terminal end is 25–30 residues from the PTC, well before synthesis of the membrane domain has commenced.

Peaks II–VII in the FP correspond reasonably well to the CGMD-FP (gray) and HP (Figure 3—figure supplement 1b). The unexpectedly low Nstart value for peak III seems to be caused by an upstream periplasmic surface helix (Figure 3f) (see below). Likewise, peak VI-a likely reflects the membrane integration of a hydrophobic, membrane-associated cytoplasmic segment located just upstream of TMH5 (Figure 3—figure supplement 1c). In contrast, the unexpectedly high Nstart value for peak IV indicates that integration of TMH3 commences only when its N-terminal end is ~52 residues away from the PTC, possibly because of the tight spacing between TMH2 and TMH3.

As peak III saturates at fFL ≈ 0.9 over a rather wide range, we sought a more detailed view by using the strong SecM(Ec-Sup1) AP (Yap and Bernstein, 2009) (Figure 3b,d, blue) and the medium-strong SecM(Ec-Ms) AP (Farías-Rico et al., 2017) (Figure 3d, green). The SecM(Ec-Sup1) FP allows a precise determination of Nmax = 200, at which point the middle of TMH2 (L155) is located 45 residues from the PTC (Figure 3f). The SecM(Ec-Ms) FP reveals additional detail: peak III is now seen to be composed of three subpeaks III-a, III-b, and III-c. III-a has Nstart = 182, coinciding with the N-terminal end of the periplasmic surface helix reaching 45 resides away from the PTC. For III-b, Nstart ≈ 190, with the N-terminal end of TMH2 ~45 residues from the PTC. The major subpeak III-c at N ≈ 197–204 finally corresponds well to the peak seen in the SecM(Ec-Sup1) and the CGMD FPs, and therefore represents the membrane insertion of the most hydrophobic part of TMH2. Taken together, subpeaks III-b and III-c are reminiscent of the biphasic pulling force pattern previously recorded for a model hydrophobic transmembrane segment using the medium-strong SecM(Ms) AP (Ismail et al., 2012), which is closely related to the SecM(Ec-Ms) AP used here.

We further recorded a SecM(Ec-Ms) FP (magenta) for the triple mutation Y138F139L143→NNN (Figure 3e) that renders the periplasmic surface helix less hydrophobic: the mutation strongly reduces the amplitude of peak III-a, has only a small effect on peak III-b, and both reduces the amplitude and shifts Nstart and Nmax for peak III-c by approximately four residues (Figure 3d,f). Thus, the periplasmic surface helix engages in hydrophobic interactions already during its passage through the translocon, presumably by sliding along a partly open lateral gate (Cymer et al., 2015b). It also adds to the force generated by the membrane integration of TMH2, possibly by partitioning into the periplasmic leaflet of the inner membrane at approximately the same time that TMH2 enters the translocon.

BtuC: 10 TMHs, 326 residues

Finally, we studied BtuC, a vitamin B12 transporter with 10 TMHs, as an example of a large, multispanning protein with a complex fold (Hvorup et al., 2007). In order to improve expression, we added the N-terminal part of LepB to the BtuC constructs (Figure 4a) and used a LepB antiserum for immunoprecipitation. The Nout-Cin orientation of LepB TMH1 ensures that the Nin-Cin topology of BtuC will be maintained, and constructs that we could measure without the LepB fusion gave similar fFL values as those seen for the LepB fusions (Figure 4—figure supplement 1b).

Figure 4. BtuC.

(a) Construct design, cf. Figure 2a. The N-terminal LepB fusion is indicated. N values are calculated from the N-terminus of BtuC. For constructs with N ≥ 298, the C-terminal tail is 75 residues long. Circles indicate constructs for which mutations were made in the corresponding transmembrane helix (TMH) (see Figure 4—figure supplement 2. (b) Force profiles (FPs) for BtuC (orange), BtuC-TMH2 (green), BtuC(R47R56R59→QQQ) (black), BtuC-TMH6 (dark blue), BtuC-TMH8 (blue), BtuC-TMH10 (pink), and CGMD-FP calculated with a −100 mV membrane potential (gray). Error bars indicate SEM values. Note that the BtuC-TMH2, BtuC-TMH6, BtuC-TMH8, and BtuC-TMH10 constructs are plotted with the same N values as the corresponding BtuC constructs to facilitate comparison (i.e., the number of residues between the TMH in question and the last residue of the AP is the same in both types of constructs, see Supplementary file 1). (c) Sequences corresponding to peaks I–XI aligned from their Nstart values. Hydrophobic TMH segments are shown in orange and membrane-embedded α-helices according to the OPM database (Lomize et al., 2012) underlined. Re-entrant loops and surface helices discussed in the text are italicized. (d) Construct design for obtaining FPs of isolated Nout-oriented BtuC TMHs. Dashed segments are derived from LepB. (e) Enlarged FPs for BtuC (orange) and (R47R56R59→QQQ) (black), together with coarse-grained molecular dynamics (CGMD)-FPs calculated with (gray) and without (dashed gray) a −100 mV potential. (f) BtuC TMH9-TMH10, with hydrophobic flanking residues in stick representation (PDB ID: 2QI9). (g) Enlarged FPs for BtuC (orange), isolated TMH6 (residues 187–206; blue), and isolated TMH5-6 (residues 138–206; green). In the latter construct, LepB TMH2 was not included in order to maintain the correct membrane topology of the BtuC TMH5-TMH6 part. The CGMD-FP is in gray. (h) Structure of TMH6 including the upstream periplasmic re-entrant helix and the downstream cytoplasmic surface helix, with hydrophobic flanking residues in stick representation. AP: arrest peptide; PTC: polypeptide transferase center.

Figure 4.

Figure 4—figure supplement 1. BtuC.

Figure 4—figure supplement 1.

(a) As in Figure 4b, but with a hydrophobicity plot (ΔG) calculated by TOPCONS (3, 50) (gray). (b) Close-up view of the BtuC force profile (FP) (N = 30–150; orange), and the corresponding FP obtained with BtuC constructs lacking the N-terminal LepB fusion (green). For the latter, an HA tag was included just upstream of the arrest peptide, and cells were radiolabeled with [35S]-Met for 1 min before trichloroacetic acid precipitation.
Figure 4—figure supplement 2. Mutations in constructs representing peaks in the BtuC force profile (FP).

Figure 4—figure supplement 2.

(a) Sequences of the 67 residues leading up to the end of the arrest peptide (AP) for constructs with the indicated N-values. The constructs are identified by black circles on the BtuC FP in Figure 4b. For each construct, the residues indicated in bold green were simultaneously mutated to Ala. The shaded area encompasses residues located 40–50 residues away from the C-terminal end of the AP in the respective constructs. Hydrophobic transmembrane helix segments are shown in orange and membrane-embedded α-helices underlined. (b) fFL values for the unmutated constructs (orange) and the Ala-replacement mutants (blue). Error bars indicate SEM values, and stars indicate p-values calculated using a two-sided t-test (*p < 0.05; **p < 0.01; ***p < 0.001).
Figure 4—figure supplement 3. BtuC.

Figure 4—figure supplement 3.

As in Figure 4b, with the force profile (FP) for construct BtuC(ΔTMH1-TMH4) in green.

We identified 11 peaks in the FP (Figure 4b, orange), one more than could be accounted for by the 10 TMHs. Since it was not possible to provide an unequivocal match between the BtuC FP and the CGMD-FP (or HP, Figure 4—figure supplement 1a), we did two sets of controls. First, we chose constructs at or near peaks in the FP and CGMD-FP and mutated multiple hydrophobic residues (Leu, Ile, Val, Met) located 40–50 residues from the PTC to less hydrophobic Ala residues (Figure 4—figure supplement 2). The mutations caused significant drops in fFL (p < 0.01, two-sided t-test), except for construct N = 191 that is mutated at the extreme N-terminus of TMH5. The mutation data allowed us to identify the membrane integration of TMHs 1, 2, 3, 4, 5, 7, 8, 9, and 10 with peaks I, II, III, IV, V, VIII, IX, X, and XI, respectively; the overlapping peaks VIII and IX appear to represent the concerted integration of the closely spaced TMH7 and TMH8. However, peak II (corresponding to TMH2) is shifted to unexpectedly high, and peaks V (corresponding to TMH5), X (corresponding to TMH9), and XI (corresponding to TMH10) to unexpectedly low, Nstart values (Figure 4c). To confirm these assignments, we obtained FPs for the isolated TMH2 (dashed green), TMH8 (dashed light blue), and TMH10 (dashed pink) sequences (Figure 4b) by introducing them into the periplasmic domain of LepB such that they maintained their natural Nout-Cin orientation (Figure 4d); the FPs for the individual TMHs overlap the corresponding peaks II, IX, and XI in the full FP. Likewise, an FP obtained for a construct lacking TMH1-TMH4 overlaps the full FP, except that peak V is shifted to a higher Nstart value (Figure 4—figure supplement 3), more in line with the peak seen in the CGMD-FP. The low Nstart value for the Nin-Cout-oriented TMH5 in full-length BtuC may result from an early interaction between a positively charged patch (RFARRHLSTSR) just upstream of TMH5 and negatively charged lipid headgroups (note that only two of the four Arg residues are present in the ΔTMH1-TMH4 construct; Figure 4—figure supplement 3), while the low Nstart values for peaks X and XI are likely caused by the short upstream hydrophobic segments LCGL and LAAALEL (Figure 4c,f), similar to peak III in GlpG.

Remarkably, the N-terminal end of the isolated TMH2 is ~45 residues away from the PTC at Nstart, suggesting that upstream sequence elements present in full-length BtuC delay the integration of TMH2 by ~10 residues (compare II* and II in Figure 4c). The most conspicuous feature in the upstream region of TMH2 is the presence of three positively charged Arg residues, an uncommon occurrence in a periplasmic loop (Heijne, 1986). Indeed, when these residues are replaced by uncharged Gln residues in LepB-BtuC, peak II (dashed black in Figure 4b,e) becomes almost identical to the FP for the isolated TMH2; a similar behavior is seen when the CGMD-FP simulation is run without an electrical membrane potential (Figure 4e). Upstream positively charged residues thus delay the membrane integration of the Nout-oriented TMH2, possibly because of the energetic cost of translocating them against the membrane potential (Ismail et al., 2015), or because they are temporarily retained in the negatively charged exit tunnel (Mercier et al., 2020).

Neither peak VI nor VII seems to represent the integration of TMH6, but instead flanks the location expected from the CGMD-FP and HP and apparently corresponds, respectively, to the membrane insertion of a short periplasmic re-entrant helix and of a short cytoplasmic surface helix (Figure 4c,h). Mutation of three hydrophobic residues to Ala in the latter significantly reduces the amplitude of peak VII (Figure 4—figure supplement 2, construct N = 259). Further, the FP for the isolated TMH6 (Figure 4b,g, dashed dark blue) peaks in the location expected from the CGMD-FP, between peaks VI and VII, and the FP for the isolated TMH5-6 part that includes the re-entrant helix but lacks the downstream surface helix is intermediate between the LepB-BtuC and the TMH6 FPs (Figure 4g, dashed green). Thus, the membrane interactions of the periplasmic re-entrant helix and the cytoplasmic surface helix exert a strong effect on the membrane integration of the intervening TMH6.

Discussion

A detailed view of the cotranslational integration of three multispanning membrane proteins provided here shows that translocating nascent chains experience a distinct transition to a more hydrophobic environment at a distance of ~45 residues from the PTC, generating an oscillating force on the nascent chain that is ultimately transmitted to the PTC and varies in step with the appearance of each TMH in the vicinity of the SecYEG translocon channel. It seems likely that such oscillations can have multiple effects on the translation of membrane proteins, as recently demonstrated for ribosomal frameshifting (Harrington et al., 2020), and may affect protein quality control (Lakshminarayan et al., 2020).

Notably, TMHs also stop generating a force on the nascent chain when their C-terminal end reaches ~45 residues from the PTC, irrespective of whether their orientation is Nout-Cin or Nin-Cout. This is in agreement with the 'sliding’ model of TMH integration (Cymer et al., 2015b), which posits that Nout-Cin TMHs have continuous lipid contact as they slide across the membrane along the open lateral gate in the SecYEG translocon, while Nin-Cout TMHs first partition into the cytoplasmic interface region of the membrane as they exit the ribosome (and therefore generate less pulling force than Nout-Cin TMHs (Cymer et al., 2014) and only insert across the membrane as their polar C-terminal flanking region translocates through the central translocon channel. In both cases, the TMHs are embedded in the membrane (albeit in perpendicular orientations) when their C-terminal end is ~45 residues from the PTC. In the sliding model, the translocon channel serves as a conduit for polar nascent chain segments while hydrophobic segments are always in contact with surrounding lipid, similar to what has been proposed for the YidC/Oxa1 translocon family (He et al., 2020). The lateral gate region in the SecYEG translocon thus in a certain sense mimics the water–bilayer interface environment (Marx and Fleming, 2021).

We also find that the cytoplasmic NTD in GlpG folds already in the ribosome exit tunnel, before the first TMH has been synthesized. Further, the FPs for EmrE, GlpG, and BtuC to a first approximation match those predicted by CGMD calculations, but uncover a much richer picture of the membrane integration process where charged residues and membrane-interacting segments such as re-entrant loops and surface helices flanking a TMH show prominent interactions with the translocon and surrounding lipid. Finally, point mutations in EmrE TMH1 affect the pulling force generated by downstream TMHs in a highly position-dependent manner, suggestive of residue-specific interactions between TMHs during the membrane integration process. Complementing in vitro unfolding/folding studies (Yu et al., 2017; Choi et al., 2019), real-time FRET analyses (Mercier et al., 2020), chemical crosslinking (He et al., 2020), structure determination (Kater et al., 2019), and computational modeling (Lu et al., 2018), high-resolution in vivo FPA can thus help identify the molecular interactions underlying cotranslational membrane protein biogenesis with up to single-residue precision.

Materials and methods

Key resources table.

Reagent type
(species) or resource
Designation Source or reference Identifiers Additional information
Strain, strain background (Escherichia coli) BL21(DE3) Sigma-Aldrich CMC0016 Electrocompetent cells
Strain, strain background (Escherichia coli) MC1061 J Biol Chem. 261:13844–9. PMID:3531212 NA Electrocompetent cells
Other Protein-G-agarose Roche 11243233001 Resin used for immunoprecipitation
Antibody Anti-HA.11 epitope tag antibody (mouse monoclonal) IgG BioLegend Cat# 901533 Used for immunoprecipitation (1 μl of 1 mg/ml, diluted 1:820)
Antibody LepB antibody
(rabbit polyclonal) IgG
Generated in-house NA Used for immunoprecipitation (dilution 1:820)
Recombinant DNA reagent pET Duet-1 (plasmid) Novagen Cat# 71146 Expression plasmid
Recombinant DNA reagent pING1 (plasmid) Gene 34:137–45. PMID:4007491 NA Expression plasmid
Commercial assay, kit GeneJET Plasmid miniprep kit Thermo Fisher
Scientific
RRID: SCR_008452
Cat# 0502 Used to purify plasmids
Commercial assay, kit GeneJET PCR Purification Kit Thermo Fisher
Scientific
Cat# K0701 Used to purify linear fragments for in vitro expression
Commercial assay, kit PURExpress New England Biolabs Cat# E6800L Used for in vitro expression
Chemical compound, drug 35S methionine PerkinElmer Cat#
NEG009T001MC
35S Methionine is incorporated into the protein during in vitro and in vivo translation and aids detection by phosphor imaging
Software, algorithm EasyQuant Developed in-house
Nat Struct Mol Biol. 19:1018–22. PMID: 23001004
Used to quantify relative fraction full length of translated protein from SDS-PAGE

Enzymes and chemicals

All enzymes used in this study were purchased from Thermo Fisher Scientific (USA) and New England Biolabs (USA). Oligonucleotides were from Eurofins Genomics (Germany). DNA isolation/purification kits and precast polyacrylamide gels were from Thermo Fisher Scientific (USA). L-[35S]-methionine was obtained from PerkinElmer (USA). Mouse monoclonal antibody against the HA antigen was purchased from BioLegend (USA). Protein G-agarose beads were manufactured by Roche (Switzerland). All other reagents were from Sigma-Aldrich (Germany).

Cloning and mutagenesis

EmrE

The previously described Nout-Cout-oriented EmrE(Cout) version carrying mutations T28R, L85R, and R106A was engineered in a pETDuet-1 vector (Rapp et al., 2007). A series of constructs was designed by inserting nucleotides downstream of EmrE(Cout) coding for a variable LepB-derived linker sequence (between 4 and 34 residues), the 9-residue long HA tag, the 17-residue long E. coli SecM AP, and a 23-residue long C-terminal tail. The following APs with stalling strengths were used: SecM(Ec) (FSTPVWISQAQGIRAGP), SecM(Ec-Ms) (FSTPVWISQHAPIRGSP, mutations underlined), and SecM(Ec-Sup1) (FSTPVWISQAPPIRAGP, mutations underlined). The LepB-derived linker as well as EmrE(Cout) were truncated 2–5 residues at a time from the C-terminus of the respective sequence. Site-specific DNA mutagenesis was carried out to introduce point mutations E14L, E14A, E14D, and E14Q in EmrE(Cout). All cloning and mutagenesis products were confirmed by DNA sequencing. Different EmrE sequences used in this study are summarized in Supplementary file 1.

GlpG

The gene encoding for GlpG was amplified from the genome of E. coli K-12 MG1655 strain by PCR and assembled together with other sequence elements into the pING1 plasmid (Johnston et al., 1985; Dalbey and Wickner, 1985) by Gibson assembly (in-house). For the longest truncates, a LepB-derived unstructured linker was introduced downstream of the GlpG sequence, followed by an HA tag, a 17-residue long E. coli SecM AP, and a 23-residue long C-terminal tail derived from LepB. Partially overlapping primers were used in around-the-horn PCR (Floor, 2018) to create deletion variants truncating upstream of the HA tag. All the sequences of GlpG deletion variants used in this study are summarized in Supplementary file 1. For LepB-GlpG constructs, 60 N-terminal residues corresponding to the soluble domain were truncated from GlpG and substituted by LepB N-terminal segment comprising TMH1 and a long cytoplasmic loop (1–174 res of LepB). Three different stalling sequences of increasing strength were used: SecM(Ec) (FSTPVWISQAQGIRAGP), SecM(Ec-Ms) (FSTPVWISQHAPIRGSP, mutations underlined), and SecM(Ec-Sup1) (FSTPVWISQAPPIRAGP, mutations underlined). Mutations in SecM(Ec) AP and GlpG folding variants NTD(F16E), GlpG(Y138F139L143→NNN) were engineered using partially overlapping primers in around-the-horn PCR. All cloning and mutagenesis products were confirmed by DNA sequencing.

For in vitro transcription/translation of the soluble NTD domain, constructs of variable length were fused to the SecM(Ec) AP and cloned into the pET19b vector. Folding variant NTD(F16E) was engineered using partially overlapping primers in around-the-horn PCR. pET19b plasmids containing different GlpG variants were used as template to create linear DNA fragments amplified by PCR for each construct using forward and reverse primers that anneal to the T7 promoter and terminator regions, respectively.

BtuC

The previously described pING1 plasmid harboring a truncated LepB sequence with an inserted hydrophobic test segment (6L/13A) followed by a variable LepB-derived linker (between 9 and 43 residues), the 17-residue long E. coli SecM AP, and a C-terminal tail comprising 23 or 75 residues derived from LepB was used to generate all BtuC constructs (Ismail et al., 2012). All BtuC sequences used in this study are summarized in Supplementary file 1. The gene encoding BtuC was amplified from the genome of the E. coli K-12 MG1655 strain by PCR and then engineered to replace 6L/13A using Gibson assembly (in-house) (Gibson et al., 2009). In order to maintain the correct topology of BtuC, the sequence coding for TMH2 of LepB (between residues P58 and P114) was removed by deletion-PCR, resulting in a 177-residue-long sequence upstream of BtuC. A gene sequence encoding 52 residues (part of LepB P2 domain) was introduced downstream of the SecM AP for constructs with N ≥ 298, resulting in an extension of the C-terminal tail from 23 to 75 residues in order to improve protein separation during SDS-PAGE. The LepB-derived linker as well as BtuC were truncated four residues at a time from the C terminus of the respective sequence. Site-specific DNA mutagenesis was carried out to replace three or six hydrophobic residues with Ala residues in TMHs of BtuC and to replace three Arg residues with Gln residues in the periplasmic loop connecting TMH1 and TMH2 (R47R56R59→QQQ). Gene sequences of single TMH and 2-TMH constructs were cloned with the variable linker sequence derived from LepB, and the single TMH constructs were placed in the background containing gene sequences of both LepB TMHs in order to maintain the correct topology. Furthermore, BtuCΔLepB constructs lacking the N-terminal LepB fusion were obtained by deletion of the entire LepB sequence upstream of BtuC, and the 9-residue long LepB-derived linker was replaced with an HA tag for immunoprecipitation. All cloning and mutagenesis products were confirmed by DNA sequencing.

In vivo pulse-labeling analysis

Competent E. coli MC1061 (Dalbey and Wickner, 1986) or BL21 (DE3) cells were transformed with the respective pING1 (BtuC, GlpG) or pET Duet-1 (EmrE) plasmid, respectively, and grown overnight at 37°C in M9 minimal medium supplemented with 19 amino acids (1 μg/ml, no Met), 100 μg/ml thiamine, 0.4% (w/v) fructose, 100 mg/ml ampicillin, 2 mM MgSO4, and 0.1 mM CaCl2. Cells were diluted into fresh M9 medium to an OD600 of 0.1 and grown until an OD600 of 0.3–0.5. Expression from pING1 was induced with 0.2% (w/v) arabinose and continued for 5 min at 37°C. Expression from pET Duet-1 was induced with 1 mM IPTG and continued for 10 min at 37°C. Proteins were then radiolabeled with [35S]-methionine for 2 min (1 min for BtuC constructs lacking the N-terminal LepB fusion) at 37°C before the reaction was stopped by adding ice-cold trichloroacetic acid (TCA) to a final concentration of 10%. Samples were put on ice for 30 min and precipitates were spun down for 10 min at 20,000 g at 4°C in a tabletop centrifuge (Eppendorf, Germany). After one wash with ice-cold acetone, centrifugation was repeated and pellets were subsequently solubilized in Tris-SDS buffer (10 mM Tris-Cl pH 7.5, 2% [w/v] SDS) for 5 min while shaking at 1400 rpm at 37°C. Samples were centrifuged for 5 min at 20,000 g to remove insoluble material. The supernatant was then added to a buffer containing 50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.1 mM EDTA-KOH, 2% (v/v) triton X-100, and supplemented with Pansorbin (Sigma-Aldrich) (BtuC constructs) or Protein-G-agarose (Roche) (all GlpG and EmrE constructs, and BtuC constructs lacking the N-terminal LepB fusion). After 15 min incubation on ice, non-specifically bound proteins were removed by centrifugation at 20,000×g (when Pansorbin was used) or 7000×g (when Protein-G-agarose was used). The supernatant was used for immunoprecipitation of BtuC constructs using Pansorbin and LepB antisera (rabbit) (in-house), or immunoprecipitation of GlpG/EmrE constructs using Protein-G-agarose and Anti-HA.11 Epitope Tag Antibody (mouse) (BioLegend). The incubation was carried out at 4°C while rolling. After centrifugation for 1 min, immunoprecipitates were washed with 10 mM Tris-Cl pH 7.5, 150 mM NaCl, 2 mM EDTA, and 0.2% (v/v) triton X-100 and subsequently with 10 mM Tris-Cl pH 7.5. Samples were spun down again and pellets were solubilized in SDS sample buffer (67 mM Tris, 33% [w/v] SDS, 0.012% [w/v] bromophenol blue, 10 mM EDTA-KOH pH 8.0, 6.75% [v/v] glycerol, 100 mM DTT) for 10 min while shaking at 1400 rpm. Solubilized proteins were incubated with 0.25 mg/ml RNase for 30 min at 37°C and subsequently separated by SDS-PAGE on Bis-Tris gels (Thermo Fisher Scientific). Gels were fixed in 30% (v/v) methanol and 10% (v/v) acetic acid and dried by using a Bio-Rad gel dryer model 583 (Bio-Rad Laboratories, US). Radiolabeled proteins were detected by exposing dried gels to phosphorimaging plates, which were scanned in a Fujifilm FLA-3000 scanner (Fujifilm, Japan). Band intensity profiles were obtained using the FIJI (ImageJ) software and quantified with our in-house software EasyQuant. Ac and/or FLc controls were included in the SDS-PAGE analysis for constructs where the identities of the A and FL bands were not immediately obvious on the gel. Data was generally collected from three independent biological replicates for EmrE and BtuC, and for two or three replicates for GlpG, and averages and SEM were calculated (see Source data 1). Note that for two replicates plotting the average ± SEM is equivalent to plotting the average ± error bars representing the two experimental measurements.

In vitro transcription/translation of GlpG NTD

In vitro transcription/translation was performed using the commercially available PURExpress system (New England Biolabs). Reactions were mixed according to the manufacturer’s recommendations by the addition of 2.2 μl of linear DNA of each construct giving a final volume of 10 μl. Polypeptide synthesis was carried out in the presence of [35S]-methionine at 37°C for 15 min under 700 rpm shaking. Translation was stopped by the addition of TCA to a final concentration of 5% and incubated on ice for at least 30 min. Total protein was sedimented by centrifugation at 20,000 g for 10 min at 4°C in a tabletop centrifuge (Eppendorf, Germany). The pellet was resuspended in 2× SDS/PAGE sample buffer, supplemented with RNaseA (400 μg/ml) to digest the stalled peptidyl-tRNA, and incubated at 37°C for 15 min under 1000 rpm agitation. The samples were resolved on 12% Bis-Tris gels (Thermo Fisher Scientific) in MOPS buffer. Gels were dried on Hoefer GD 2000 dryer (Hoefer, US), exposed to a phosphorimager screen for 24 hr, and scanned using the Fujifilm FLA-9000 phosphorimager for visualization of radioactively labeled protein species.

Molecular dynamics simulations

Computer simulations of cotranslational membrane integration were carried out using a previously developed and validated CGMD model in which nascent proteins are mapped onto CG beads representing three amino acids (Niesen et al., 2017; Niesen et al., 2018). The nascent protein interacts with the Sec translocon and the ribosome via pairwise interactions that depend on the hydrophobicity and charge of the beads of the nascent protein. The interaction parameters are unchanged from previous work (Niesen et al., 2017). The lateral gate of the translocon switches between the open and closed conformations with probability dependent on the difference in free energy between the two conformations. The structures of the ribosome and translocon are based on cryo-EM structures and, aside from the lateral gate of the translocon, are fixed in place during the simulations. The lipid bilayer and cytosol are modeled implicitly. The positions of the nascent protein beads are evolved using overdamped Langevin dynamics with a timestep of 300 ns and a diffusion coefficient of 253 nm2/s. Membrane potentials are included by adding an electrostatic energy term to the simulations, as previously described (Niesen et al., 2018).

To simulate protein translation, new amino acids are added to the nascent chain at a rate of five amino acids per second. Simulations of EmrE, GlpG, and BtuC begin with 12 amino acids translated. Translation continues until the nascent protein reaches the desired length, at which point translation is halted and forces on the C-terminus of the nascent chain are measured every 3 ms for 6 s. This methodology has been found to accurately reproduce experimental FPs (Niesen et al., 2018). Forces are measured starting at a nascent protein length of 18 amino acids for EmrE and BtuC, and 70 for GlpG. The computational force profile (CGMD-FP) is then obtained by measuring the forces at lengths incremented by four amino acids. Simulations at different lengths are performed independently and repeated 100 times. Because the ribosomal exit tunnel is truncated in the CGMD model, a shift in the protein index is required to compare simulated and experimental results. Shifts of −12, –5, and −5 residues are used for EmrE, GlpG, and BtuC CGMD-FPs, respectively. The shifts are estimated by aligning the computational and experimental FPs and are in line with what is expected given the length of the truncated exit tunnel. Variation in the shift may reflect different degrees of compaction of the nascent chain. Although previous work provides a framework to estimate the experimentally observed fraction full length from simulated forces given a specific AP (Niesen et al., 2018; Tian et al., 2018), forces are reported directly to facilitate comparison between experiments performed with different APs.

Protein contact order calculation

The relative contact order for the GlpG NTD was calculated using the Contact Order server at https://depts.washington.edu/bakerpg/contact_order/contact_order.cgi.

Acknowledgements

We thank Dr. Rickard Hedman (Stockholm University) for programming and maintenance of the EasyQuant software. This work was supported by grants from the Knut and Alice Wallenberg Foundation (2017.0323), the Novo Nordisk Fund (NNF18OC0032828), and the Swedish Research Council (621-2014-3713) to GvH, from a Marie Curie Initial Training Network Grant (Horizon 2020, ProteinFactory 642863) to FN, and from NIGMS, National Institutes of Health, (R01GM125063) to TFM and MZ. This work used the Extreme Science and Engineering Discovery Environment (XSEDE) Bridges computer at PSC through allocation TG-MCB160013. XSEDE is supported by National Science Foundation grant number ACI-1548562.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Gunnar von Heijne, Email: Gunnar.von.Heijne@dbb.su.se.

Randy Schekman, University of California Berkeley, United States.

Olga Boudker, Weill Cornell Medicine, United States.

Funding Information

This paper was supported by the following grants:

  • Knut och Alice Wallenbergs Stiftelse 2017.0323 to Gunnar von Heijne.

  • Novo Nordisk Fonden NNF18OC0032828 to Gunnar von Heijne.

  • Vetenskapsrådet 621-2014-3713 to Gunnar von Heijne.

  • National Institutes of Health R01GM125063 to Matthew Zimmer, Thomas F Miller III.

  • Horizon 2020 Framework Programme Protein Factory 642863 to Felix Nicolaus.

  • National Science Foundation ACI-1548562 to Thomas F Miller III.

Additional information

Competing interests

No competing interests declared.

Author contributions

Formal analysis, Supervision, Investigation, Visualization, Methodology, Writing - review and editing.

Formal analysis, Supervision, Investigation, Visualization, Methodology, Writing - review and editing.

Formal analysis, Supervision, Investigation, Visualization, Methodology, Writing - review and editing.

Investigation.

Investigation.

Investigation.

Formal analysis, Funding acquisition, Investigation, Writing - review and editing.

Conceptualization, Funding acquisition, Writing - review and editing.

Conceptualization, Resources, Formal analysis, Supervision, Funding acquisition, Validation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Source data 1. Measured fFL values for all EmrE, GlpG, and BtuC constucts reported in Figures 2, 3 and 4 (and the corresponding Supplementary Figures and Supplementary file 1).
elife-64302-data1.xlsx (80.9KB, xlsx)
Supplementary file 1. Amino acid sequences of EmrE(Cout), GlpG, and BtuC constructs.
elife-64302-supp1.docx (69.9KB, docx)
Transparent reporting form

Data availability

All fFL values measured in this study are included as figures Source Data.

References

  1. Chen YJ, Pornillos O, Lieu S, Ma C, Chen AP, Chang G. X-ray structure of EmrE supports dual topology model. PNAS. 2007;104:18999–19004. doi: 10.1073/pnas.0709387104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Chitwood PJ, Juszkiewicz S, Guna A, Shao S, Hegde RS. EMC is required to initiate accurate membrane protein topogenesis. Cell. 2018;175:1507–1519. doi: 10.1016/j.cell.2018.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Choi HK, Min D, Kang H, Shon MJ, Rah SH, Kim HC, Jeong H, Choi HJ, Bowie JU, Yoon TY. Watching helical membrane proteins fold reveals a common N-to-C-terminal folding pathway. Science. 2019;366:1150–1156. doi: 10.1126/science.aaw8208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Cymer F, Ismail N, von Heijne G. Weak pulling forces exerted on Nin-orientated transmembrane segments during co-translational insertion into the inner membrane of Escherichia coli. FEBS Letters. 2014;588:1930–1934. doi: 10.1016/j.febslet.2014.03.050. [DOI] [PubMed] [Google Scholar]
  5. Cymer F, Hedman R, Ismail N, von Heijne G. Exploration of the arrest peptide sequence space reveals Arrest-enhanced variants. Journal of Biological Chemistry. 2015a;290:10208–10215. doi: 10.1074/jbc.M115.641555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cymer F, von Heijne G, White SH. Mechanisms of integral membrane protein insertion and folding. Journal of Molecular Biology. 2015b;427:999–1022. doi: 10.1016/j.jmb.2014.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Dalbey RE, Wickner W. Leader peptidase catalyzes the release of exported proteins from the outer surface of the Escherichia coli plasma membrane. Journal of Biological Chemistry. 1985;260:15925–15931. doi: 10.1016/S0021-9258(17)36347-0. [DOI] [PubMed] [Google Scholar]
  8. Dalbey RE, Wickner W. The role of the polar, carboxyl-terminal domain of Escherichia coli leader peptidase in its translocation across the plasma membrane. Journal of Biological Chemistry. 1986;261:13844–13849. doi: 10.1016/S0021-9258(18)67097-8. [DOI] [PubMed] [Google Scholar]
  9. Farías-Rico JA, Goetz SK, Marino J, von Heijne G. Mutational analysis of protein folding inside the ribosome exit tunnel. FEBS Letters. 2017;591:155–163. doi: 10.1002/1873-3468.12504. [DOI] [PubMed] [Google Scholar]
  10. Farías-Rico JA, Ruud Selin F, Myronidi I, Frühauf M, von Heijne G. Effects of protein size, thermodynamic stability, and net charge on cotranslational folding on the ribosome. PNAS. 2018;115:E9280–E9287. doi: 10.1073/pnas.1812756115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Floor S. Around-the-Horn PCR and Cloning, protocols.io. 2018 doi: 10.17504/protocols.io.rf2d3qe. [DOI]
  12. Gibson DG, Young L, Chuang RY, Venter JC, Hutchison CA, Smith HO. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nature Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  13. Goldman DH, Kaiser CM, Milin A, Righini M, Tinoco I, Bustamante C. Ribosome mechanical force releases nascent chain-mediated ribosome arrest in vitro and in vivo. Science. 2015;348:457–460. doi: 10.1126/science.1261909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Harrington HR, Zimmer MH, Chamness LM, Nash V, Penn WD, Miller TF, Mukhopadhyay S, Schlebach JP. Cotranslational folding stimulates programmed ribosomal frameshifting in the alphavirus structural polyprotein. Journal of Biological Chemistry. 2020;295:6798–6808. doi: 10.1074/jbc.RA120.012706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. He H, Kuhn A, Dalbey RE. Tracking the stepwise movement of a membrane-inserting protein in vivo. Journal of Molecular Biology. 2020;432:484–496. doi: 10.1016/j.jmb.2019.10.010. [DOI] [PubMed] [Google Scholar]
  16. Heijne G. The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans-membrane topology. The EMBO Journal. 1986;5:3021–3027. doi: 10.1002/j.1460-2075.1986.tb04601.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hessa T, Meindl-Beinker NM, Bernsel A, Kim H, Sato Y, Lerch-Bader M, Nilsson I, White SH, von Heijne G. Molecular code for transmembrane-helix recognition by the Sec61 translocon. Nature. 2007;450:1026–1030. doi: 10.1038/nature06387. [DOI] [PubMed] [Google Scholar]
  18. Hvorup RN, Goetz BA, Niederer M, Hollenstein K, Perozo E, Locher KP. Asymmetry in the structure of the ABC transporter-binding protein complex BtuCD-BtuF. Science. 2007;317:1387–1390. doi: 10.1126/science.1145950. [DOI] [PubMed] [Google Scholar]
  19. Ismail N, Hedman R, Schiller N, von Heijne G. A biphasic pulling force acts on transmembrane helices during translocon-mediated membrane integration. Nature Structural & Molecular Biology. 2012;19:1018–1022. doi: 10.1038/nsmb.2376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ismail N, Hedman R, Lindén M, von Heijne G. Charge-driven dynamics of nascent-chain movement through the SecYEG translocon. Nature Structural & Molecular Biology. 2015;22:145–149. doi: 10.1038/nsmb.2940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Ito K, Chiba S. Arrest peptides: cis-acting modulators of translation. Annual Review of Biochemistry. 2013;82:171–202. doi: 10.1146/annurev-biochem-080211-105026. [DOI] [PubMed] [Google Scholar]
  22. Johnston S, Lee JH, Ray DS. High-level expression of M13 gene II protein from an inducible polycistronic messenger RNA. Gene. 1985;34:137–145. doi: 10.1016/0378-1119(85)90121-0. [DOI] [PubMed] [Google Scholar]
  23. Kater L, Frieg B, Berninghausen O, Gohlke H, Beckmann R, Kedrov A. Partially inserted nascent chain unzips the lateral gate of the Sec translocon. EMBO reports. 2019;20:e48191. doi: 10.15252/embr.201948191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kemp G, Kudva R, de la Rosa A, von Heijne G. Force-Profile analysis of the cotranslational folding of HemK and filamin domains: comparison of biochemical and biophysical folding assays. Journal of Molecular Biology. 2019;431:1308–1314. doi: 10.1016/j.jmb.2019.01.043. [DOI] [PubMed] [Google Scholar]
  25. Kemp G, Nilsson OB, Tian P, Best RB, von Heijne G. Cotranslational folding cooperativity of contiguous domains of α-spectrin. PNAS. 2020;117:14119–14126. doi: 10.1073/pnas.1909683117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lakshminarayan R, Phillips BP, Binnian IL, Gomez-Navarro N, Escudero-Urquijo N, Warren AJ, Miller EA. Pre-emptive quality control of a misfolded membrane protein by Ribosome-Driven effects. Current Biology. 2020;30:854–864. doi: 10.1016/j.cub.2019.12.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Leininger SE, Narayan K, Deutsch C, O'Brien EP. Mechanochemistry in translation. Biochemistry. 2019;58:4657–4666. doi: 10.1021/acs.biochem.9b00260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Lomize MA, Pogozheva ID, Joo H, Mosberg HI, Lomize AL. OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Research. 2012;40:D370–D376. doi: 10.1093/nar/gkr703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Lu W, Schafer NP, Wolynes PG. Energy landscape underlying spontaneous insertion and folding of an alpha-helical transmembrane protein into a bilayer. Nature Communications. 2018;9:4949. doi: 10.1038/s41467-018-07320-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Marx DC, Fleming KG. Local bilayer hydrophobicity modulates membrane protein stability. Journal of the American Chemical Society. 2021;143:764–772. doi: 10.1021/jacs.0c09412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Mercier E, Wintermeyer W, Rodnina MV. Co-translational insertion and topogenesis of bacterial membrane proteins monitored in real time. EMBO. 2020;39:e104054. doi: 10.15252/embj.2019104054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Niesen MJ, Wang CY, Van Lehn RC, Miller TF. Structurally detailed coarse-grained model for Sec-facilitated co-translational protein translocation and membrane integration. PLOS Computational Biology. 2017;13:e1005427. doi: 10.1371/journal.pcbi.1005427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Niesen MJM, Müller-Lucks A, Hedman R, von Heijne G, Miller TF. Forces on nascent polypeptides during membrane insertion and translocation via the sec translocon. Biophysical Journal. 2018;115:1885–1894. doi: 10.1016/j.bpj.2018.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Notari L, Martínez-Carranza M, Farías-Rico JA, Stenmark P, von Heijne G. Cotranslational folding of a pentarepeat β-Helix protein. Journal of Molecular Biology. 2018;430:5196–5206. doi: 10.1016/j.jmb.2018.10.016. [DOI] [PubMed] [Google Scholar]
  35. Plaxco KW, Simons KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. Journal of Molecular Biology. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
  36. Rapoport TA, Li L, Park E. Structural and mechanistic insights into protein translocation. Annual Review of Cell and Developmental Biology. 2017;33:369–390. doi: 10.1146/annurev-cellbio-100616-060439. [DOI] [PubMed] [Google Scholar]
  37. Rapp M, Seppälä S, Granseth E, von Heijne G. Emulating membrane protein evolution by rational design. Science. 2007;315:1282–1284. doi: 10.1126/science.1135406. [DOI] [PubMed] [Google Scholar]
  38. Schibich D, Gloge F, Pöhner I, Björkholm P, Wade RC, von Heijne G, Bukau B, Kramer G. Global profiling of SRP interaction with nascent polypeptides. Nature. 2016;536:219–223. doi: 10.1038/nature19070. [DOI] [PubMed] [Google Scholar]
  39. Seurig M, Ek M, von Heijne G, Fluman N. Dynamic membrane topology in an unassembled membrane protein. Nature Chemical Biology. 2019;15:945–948. doi: 10.1038/s41589-019-0356-9. [DOI] [PubMed] [Google Scholar]
  40. Sherratt AR, Blais DR, Ghasriani H, Pezacki JP, Goto NK. Activity-based protein profiling of the Escherichia coli GlpG rhomboid protein delineates the catalytic core. Biochemistry. 2012;51:7794–7803. doi: 10.1021/bi301087c. [DOI] [PubMed] [Google Scholar]
  41. Shimizu Y, Kanamori T, Ueda T. Protein synthesis by pure translation systems. Methods. 2005;36:299–304. doi: 10.1016/j.ymeth.2005.04.006. [DOI] [PubMed] [Google Scholar]
  42. Tian P, Steward A, Kudva R, Su T, Shilling PJ, Nickson AA, Hollins JJ, Beckmann R, von Heijne G, Clarke J, Best RB. Folding pathway of an Ig domain is conserved on and off the ribosome. PNAS. 2018;115:E11284–E11293. doi: 10.1073/pnas.1810523115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Tsirigos KD, Peters C, Shu N, Käll L, Elofsson A. The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Research. 2015;43:W401–W407. doi: 10.1093/nar/gkv485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Van Lehn RC, Zhang B, Miller TF. Regulation of multispanning membrane protein topology via post-translational annealing. eLife. 2015;4:e08697. doi: 10.7554/eLife.08697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. von Heijne G. Control of topology and mode of assembly of a polytopic membrane protein by positively charged residues. Nature. 1989;341:456–458. doi: 10.1038/341456a0. [DOI] [PubMed] [Google Scholar]
  46. Wang Y, Zhang Y, Ha Y. Crystal structure of a rhomboid family intramembrane protease. Nature. 2006;444:179–180. doi: 10.1038/nature05255. [DOI] [PubMed] [Google Scholar]
  47. Wolfe PB, Wickner W, Goodman JM. Sequence of the leader peptidase gene of Escherichia coli and the orientation of leader peptidase in the bacterial envelope. Journal of Biological Chemistry. 1983;258:12073–12080. doi: 10.1016/S0021-9258(17)44342-0. [DOI] [PubMed] [Google Scholar]
  48. Yap MN, Bernstein HD. The plasticity of a translation arrest motif yields insights into nascent polypeptide recognition inside the ribosome tunnel. Molecular Cell. 2009;34:201–211. doi: 10.1016/j.molcel.2009.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Young R, Bremer H. Polypeptide-chain-elongation rate in Escherichia coli B/r as a function of growth rate. Biochemical Journal. 1976;160:185–194. doi: 10.1042/bj1600185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Yu H, Siewny MG, Edwards DT, Sanders AW, Perkins TT. Hidden dynamics in the unfolding of individual bacteriorhodopsin proteins. Science. 2017;355:945–950. doi: 10.1126/science.aah7124. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Randy Schekman1
Reviewed by: Ian Collinson2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

This is an elegant report on the ubiquitous process of membrane protein insertion, studied in vivo and in vitro using the bacterial model E. coli – exploiting three exemplar membrane proteins of increasing topological complexity. The exploitation of force profile analysis (FPA) developed in the von Heijne lab – now at high resolution – has proved very powerful and complementary to other techniques; such as MD simulations (also deployed here). The outcome is new knowledge about the molecular details of membrane protein insertion. Importantly the paper reveals a number of very interesting aspects of this process.

Decision letter after peer review:

Thank you for submitting your article "Residue-by-residue analysis of cotranslational membrane protein integration in vivo" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Olga Boudker as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Ian Collinson (Reviewer #2).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

This is an elegant report on the ubiquitous process of membrane protein insertion, studied in vivo and in vitro using the bacterial model E. coli – exploiting three exemplar membrane proteins of increasing topological complexity.

Our knowledge about the mechanism of membrane protein insertion is somewhat superficial and the authors are leading the way in this area. This manuscript is a prime example – and thus worthy of publication in close to its present form (certainly no more experiments required).

The exploitation of force profile analysis (FPA) developed in the von Heijne lab – now at high resolution – has proved very powerful and complementary to other techniques; such as MD simulations (also deployed here). The outcome is new knowledge about the molecular details of membrane protein insertion. Importantly the paper reveals a number of very interesting aspects of this process:

Summary:

This is an elegant report on the ubiquitous process of membrane protein insertion, studied in vivo and in vitro using the bacterial model E. coli – exploiting three exemplar membrane proteins of increasing topological complexity.

Our knowledge about the mechanism of membrane protein insertion is somewhat superficial and the authors are leading the way in this area. This manuscript is a prime example – and thus worthy of publication in close to its present form (certainly no more experiments required).

The exploitation of force profile analysis (FPA) developed in the von Heijne lab – now at high resolution – has proved very powerful and complementary to other techniques; such as MD simulations (also deployed here). The outcome is new knowledge about the molecular details of membrane protein insertion. Importantly the paper reveals a number of very interesting aspects of this process:

Revisions:

The following is excerpted from the consultation session which specifies the key points the reviewers feel need to be addressed in your revised submission. I have included the full comments of the reviewers for your consideration. However, these need not be addressed in detail in your the response accompanying the submission of your revised manuscript.

There are a number of technical issues that need to be addressed in a revised text including conclusions that need to be adjusted. If co-translational folding of polypeptide segments in the ribosome tunnel can affect the integration process (which the authors claim occurs), in some cases it is problematic to use Lep fusions to solve technical issues with detection. In those cases, the native polypeptide is no longer the context for this detection issue.

Perhaps the authors could show the force profiles are unaffected by the presence of the fusion proteins? Of course, if the results are affected then the conclusions should be changed accordingly.

However, it may be difficult to show this convincingly given the use of Lep with Btu to allow an interpretation of the protein band pattern. The implication is that Lep affects the process of either integration or folding. Further, the authors claim sequences upstream of the N-terminus may affect integration by co-translational folding as in the case of the NTD region in GlpG. Thus the replacement of Lep for such sequences may interfere with the folding/integration process.

Given the von Heijne group's previous good agreement with MD simulations, perhaps this could approach could be used to access the divergence from native behavior.

Perhaps the current analysis would be acceptable for the constructs that are sufficiently long (For GlpP, N>120; a 58-residue-long LepB portion is completely exposed outside the ribosome and does not engage with SecY). GlpP is probably fine in this regard because LepB-fused constructs are N=131-224. Of course, LepB has a membrane anchor, and we don't know how that might have affected FP of GlpP. They might have some native GlpG data besides those with multiple bands, which might be useful. For BtuC, they show at least some data without LepB fusion (Figure 4—figure supplement 1B). This is nice particularly because these are shorter constructs, which could have been more affected by LepB. Nevertheless, the potential caveats in their interpretations bear correction.

It may be helpful to include raw images and some clarification regarding statistics for the experiments where there were only 2 biological replicates..

The major issues summarized above may be addressed by the authors without additional experiments. Reviewer #1 offers additional experimental tests, but they are not considered essential for a satisfactory revision of this most interesting work.

Reviewer #1:

The manuscript by Nicolaus et al. followed the co-translational integration of three multi-spanning membrane proteins using force profile (FP) analysis, the method previously pioneered by the von Heijne group. This approach uses a C-terminal fusion of a translation arrest peptide to either soluble or membrane proteins of varying truncations. Folding or membrane integration of the nascent chain produces a pulling force, which can be read out from how efficiently the arrest peptide causes translational stalling. Previously the group has analyzed membrane insertion of model single transmembrane helices (TMHs) in detail (Ismail et al., 2012). Now they extended the approach to multi-spanning (and more native) membrane proteins, namely EmrE, GlpG, and BtuC. The authors concluded that their data support the so-called "sliding" model for TMH integration, where each TMH slides down along the open lateral gate of the SecYEG channel. They also present several interesting observations, such as effects of charged residues, re-entrant loops, and surface helices on the timings of TMS integration, although these would require more data to be generalized.

A main weakness of the study is its exclusive reliance on the FP analysis. Coarse-grained MD simulations were also used but rather to confirm observations from the FP analysis. A limitation of the FP analysis is that it only monitors (indirectly) an insertion of the last TMH which is near the SecYEG channel. The analysis does not address what would happen to earlier TMHs (such as folding of TMD, delayed insertion, or possible flipping of TMs) as the polypeptide grows. A second weakness would be that the study adds only rather minor updates (effects of charged residues, re-entrant loops, and surface helices) to the previous model.

Despite the weaknesses, this work provides useful insights and resources to the field, and therefore I consider it to be suitable for publication in eLife with revision. Membrane integration of multi-spanning membrane proteins is a still poorly understood process, largely because there are only very few experimental tools available for researchers to follow the process. The authors' analysis is generally of high quality, and their interpretations are reasonable.

1) The current version of the manuscript is probably difficult for scientists outside the field to follow. It would be better to extend Introduction to provide non-experts with more background information about the pathway, such as the general model for the SecYEG-mediated protein translocation and membrane integration, what is known about membrane protein integration, and what are main questions to address.

2) As in Point #1, a cartoon figure showing the "sliding" model described in Discussion would be helpful to improve readability.

3) Regarding Point #2. Although the authors did not explicitly mention this, the model opposed to the sliding model would be the "in-out" model, where the TMS first goes down along the translocation pore and then partitions into the lipid phase through the open lateral gate. However, in my opinion, mechanistic differences between the two models are rather small, and a boundary between the lateral gate and pore is not clear-cut (also, lipids can partially insert acyl tails into the open lateral gate and contact the polypeptide chain in the pore). The authors should clearly describe why the data fits into the "sliding" model not the "in-out" model. Also "slide along the open lateral gate" is unclear. It is possible that the TMH stays just outside the lateral gate while it slides across the membrane (similar to structures seen in studies of translocation intermediates containing TMHs or signal sequences).

4) In the previous study by Ismail et al., a bi-phasic pulling force has been observed. In the current study, the FPs do not seem to clearly show such a biphasic pattern. Can the authors discuss about this?

5) For some observations, the authors simply say further studies will be required to understand (for instance, long-range residue-residue interactions). Instead, the authors should at least provide a possible explanation or propose a testable hypothesis.

6) “Likewise, peak VI-a likely reflects the membrane integration of a hydrophobic, membrane-associated cytoplasmic segment located just upstream of TMH5, Figure 3—figure supplement 1B. In contrast, the unexpectedly high Nstart value for peak IV indicates that integration of TMH3 commences only when its N-terminal end is ~52 residues away from the PTC possibly because of the tight spacing between TMH2 and TMH3.”. These can be perhaps tested by mutagenesis and inserting a hydrophilic stretch between TMH2 and TMH3. How could the tight spacing between TMH2 and TMH3 cause a delayed insertion of TMH3?

7) The basis for Nstart and Nmax for the magenta (mutant) plot in Figure 3D is not clear because there seem three local peaks. What do these local peaks and minima represent?

8) Generally, the authors only use Nstart positions for interpretations. But interestingly, peak widths are quite variable among TMHs. Is there any correlation between the peak widths and TMH sequences (or their structural properties)?

9) Results final paragraph: It is puzzling why there is no pulling force for TMH6 but two strong peaks before and after. Can the authors offer an explanation for this observation beyond the general statement? Do mutations in the periplasmic re-entrant helix affect the peaks?

Reviewer #2:

This is an elegant report on the ubiquitous process of membrane protein insertion, studied in vivo and in vitro using the bacterial model E. coli – exploiting three exemplar membrane proteins of increasing topological complexity.

Our knowledge about the mechanism of membrane protein insertion is somewhat superficial and the authors are leading the way in this area. This manuscript is a prime example – and thus worthy of publication in close to its present form (certainly no more experiments required).

The exploitation of force profile analysis (FPA) developed in the von Heijne lab – now at high resolution – has proved very powerful and complementary to other techniques; such as MD simulations (also deployed here). The outcome is new knowledge about the molecular details of membrane protein insertion. Importantly the paper reveals a number of very interesting aspects of this process:

– the ability of soluble cytosolic domains to fold while still in the ribosome exit tunnel

– site specific aa substitutions can have long range effects on FPA, indicative of cooperative TMH folding and insertion.

– trans-membranes helices (TMH) very quickly lose their “pulling power”, interpreted as early membrane entry.

The results support the building evidence in favour of a “sliding model” for membrane protein insertion, wherein the incorporating TMHs are always in contact with lipids at the interface between the Sec complex and the bilayer.

I have no substantive concerns about the work. Congratulations to the authors on their study; I understand that it must have been a great deal of work.

Reviewer #3:

Technically, this is a very advanced study. However, there are technical issues with the data that are either ignored or marginalized by the authors, but that can affect the conclusions of the work on individual membrane proteins. This in particular concerns the use of Lep sequences in protein fusions to obtain more "interpretable" data.

Figure 1 displays examples for the three membrane proteins tested how FPA is applied. These are all SDS-PAGE/pulse chase measurements with membrane protein truncates arrested by a translational arrest peptide. Depending on the construct different arrest peptides are used to tune the force window. These type of experiments have been done for a very large set of protein truncates, and these data have been summarized in Figures 2-4 in position versus force graphs. The authors should make the crude data (SDS-PAGE images like in Figure 1C) accessible as an external resource so that the actual data can be inspected.

Data availability: With the current presentation, there is no way to access the quality of the data. That there are ambiguities is already evident from Figure 1C with EmrE. The arrested peptide seems shows up as two bands which signals a problem which is not further discusses. It is not clear whether the lower band is included in determining the intensity of A, nor is it clear what the nature of this protein band is. Since the crude data is not available, it is hard to access whether there are more issues with the data and how this would affect the outcome of the work.

Apparently, with GlpG, constructs with N 140-160 gave rise to multiple bands on SDS-PAGE that were difficult to interpret. This problem was solved by fusion of a portion of the Lep protein to the N-terminus of GlpG. The Lep part resides in the ribosome tunnel with the AP arrested protein constructs. This seems unsatisfactory, as the Lep sequence that is entirely irrelevant to the GlpG integration process. Since this Lep sequence apparently influences that protein band pattern, it is clearly not invariant to the overall process. Thus it is difficult to argue that with the use of the Lep extensions, meaningful data is obtained.

Relating to the above comment, also with BtuC, Lep fusions are being used. If co-translational folding in the ribosome tunnel can affect the forces measured as the authors claim, replacing parts of GlpP and BtuC with Lep will certainly affect the outcome of the work.

The authors argue that co-translational folding of the NTD of GlpG in the ribosome tunnel occurs. With folding the authors probably mean secondary structure formation? In the experimental setup, arrested chains are being used, and a long biochemical processing needs to occur before readout. How can the authors be sure that this folding is also relevant for a true, unobstructed translation/integration process? Here, the physiological relevance of the observation needs to be treated with caution.

eLife. 2021 Feb 8;10:e64302. doi: 10.7554/eLife.64302.sa2

Author response


Revisions:

The following is excerpted from the consultation session which specifies the key points the reviewers feel need to be addressed in your revised submission. I have included the full comments of the reviewers for your consideration. However, these need not be addressed in detail in your the response accompanying the submission of your revised manuscript.

There are a number of technical issues that need to be addressed in a revised text including conclusions that need to be adjusted. If co-translational folding of polypeptide segments in the ribosome tunnel can affect the integration process (which the authors claim occurs), in some cases it is problematic to use Lep fusions to solve technical issues with detection. In those cases, the native polypeptide is no longer the context for this detection issue.

Perhaps the authors could show the force profiles are unaffected by the presence of the fusion proteins? Of course, if the results are affected then the conclusions should be changed accordingly.

However, it may be difficult to show this convincingly given the use of Lep with Btu to allow an interpretation of the protein band pattern. The implication is that Lep affects the process of either integration or folding. Further, the authors claim sequences upstream of the N-terminus may affect integration by co-translational folding as in the case of the NTD region in GlpG. Thus the replacement of Lep for such sequences may interfere with the folding/integration process.

Given the von Heijne group's previous good agreement with MD simulations, perhaps this could approach could be used to access the divergence from native behavior.

Perhaps the current analysis would be acceptable for the constructs that are sufficiently long (For GlpP, N>120; a 58-residue-long LepB portion is completely exposed outside the ribosome and does not engage with SecY). GlpP is probably fine in this regard because LepB-fused constructs are N=131-224. Of course, LepB has a membrane anchor, and we don't know how that might have affected FP of GlpP. They might have some native GlpG data besides those with multiple bands, which might be useful. For BtuC, they show at least some data without LepB fusion (Figure 4—figure supplement 1B). This is nice particularly because these are shorter constructs, which could have been more affected by LepB. Nevertheless, the potential caveats in their interpretations bear correction.

We appreciate the reviewers’ concerns regarding the possible effects of the LepB fusions that we were forced (by expression issues) to make to a certain number of GlpG constructs (N = 131-224) and all of the BtuC constructs. However, we are confident that the LepB part does not have a strong effect on the FPs, for the following reasons:

1) For the LepB-GlpG (N = 131-224) constructs, the segment between Lep TMH1 and GlpG TMH1 is always 71 residues, and the C-terminal end of the LepB part is ≥70 residues away from the PTC and hence far out of the ribosome tunnel. LepB is therefore unlikely to affect the FP in the region of peak II and in longer constructs. Likewise, had the NTD been present instead of the LepB part in these constructs, its C-terminal end would also have been ≥ 70 residues away from the PTC, while its folding transition (as seen in the FP) is completed already when the C-terminal end is ~40 residues from the PTC (construct N = 101, Figure 3B).

2) For the BtuC constructs, the N-terminal LepB fusion part is 177 residues long, and LepB TMH1 is 155 residues away from the N terminus of the BtuC part. The ~150-residue fragment from the LepB periplasmic domain is unlikely to be able to fold, as it does not represent any compact subdomain in the protein. LepB is therefore unlikely to affect the BtuC FP in any major way.

3) Finally, we have been able to measure some points on the GlpG and BtuC FPs both for the wildtype protein and for the corresponding LepB fusions. As can be seen in the new Figure 3—figure supplement 1A (for GlpG) and the already submitted Figure 4—figure supplement 1B (for BtuC), while the fFL values are not completely identical, the shapes of the FPs are only minimally affected.

It may be helpful to include raw images and some clarification regarding statistics for the experiments where there were only 2 biological replicates..

We have included a gallery of gels from repeat experiment for all three proteins in the new Figure 1—figure supplement 1. We haven’t included all gels, as the total number of lanes would be ~1,400 (including all repeats). The statistics for the GlpG constructs for which we only have duplicate measurements is now discussed in Materials and methods. All individual fFL measurements are included in the Supplement.

The major issues summarized above may be addressed by the authors without additional experiments. Reviewer #1 offers additional experimental tests, but they are not considered essential for a satisfactory revision of this most interesting work.

Reviewer #1:

[…]

1) The current version of the manuscript is probably difficult for scientists outside the field to follow. It would be better to extend Introduction to provide non-experts with more background information about the pathway, such as the general model for the SecYEG-mediated protein translocation and membrane integration, what is known about membrane protein integration, and what are main questions to address.

We are not very fond of wordy Introductions and Discussions, and probably err on the short side from time to time. Maybe we’ve read Strunk and White a few times too many: “Vigorous writing is concise. A sentence should contain no unnecessary words, a paragraph no unnecessary sentences, for the same reason that a drawing should have no unnecessary lines and a machine no unnecessary parts.” We’ll be happy to discuss this further, if you want.

2) As in Point #1, a cartoon figure showing the "sliding" model described in Discussion would be helpful to improve readability.

3) Regarding Point #2. Although the authors did not explicitly mention this, the model opposed to the sliding model would be the "in-out" model, where the TMS first goes down along the translocation pore and then partitions into the lipid phase through the open lateral gate. However, in my opinion, mechanistic differences between the two models are rather small, and a boundary between the lateral gate and pore is not clear-cut (also, lipids can partially insert acyl tails into the open lateral gate and contact the polypeptide chain in the pore). The authors should clearly describe why the data fits into the "sliding" model not the "in-out" model. Also "slide along the open lateral gate" is unclear. It is possible that the TMH stays just outside the lateral gate while it slides across the membrane (similar to structures seen in studies of translocation intermediates containing TMHs or signal sequences).

We have considered this suggestion, but feel that putting too much emphasis on the “sliding model” would risk turning the paper into a discussion of the pros and cons of different models rather than providing a first overview of what kinds of fine-level details of the co-translational membrane integration process that FP analysis may offer. We’ll be happy to discuss this further, if you want.

4) In the previous study by Ismail et al., a bi-phasic pulling force has been observed. In the current study, the FPs do not seem to clearly show such a biphasic pattern. Can the authors discuss about this?

In the previous study, the biphasic pulling force was not resolved with the weak SecM(Ec) AP, and the first of the two peaks was hardly visible with the strong SecM(Ec-Sup1) AP. The 2-peak pattern was most clearly seen with the medium-strong SecM(Ms) AP (with peaks at Nstart ≈ 27+19=46 residues and Nstart ≈ 35+19=54 residues). Referring to Figure 3D, it is thus possible that peak III-b in the SecM(Ec-Ms) FP (green) is in fact the first peak in a biphasic peak generated by TMH2. We now mention this possibility in the manuscript.

5) For some observations, the authors simply say further studies will be required to understand (for instance, long-range residue-residue interactions). Instead, the authors should at least provide a possible explanation or propose a testable hypothesis.

At this point, our best guess is that these patterns in the EmrE FPs are explained by specific interactions between E14 in TMH1 and (polar?) residues in downstream TMHs, as stated in the manuscript. Obviously, we will try to identify such interactions in future studies. We have, however, ventured a speculative explanation for how the periplasmic surface helix located just upstream of GlpG TMH2 may affect the FP recorded for the latter.

6) “Likewise, peak VI-a likely reflects the membrane integration of a hydrophobic, membrane-associated cytoplasmic segment located just upstream of TMH5, Figure 3—figure supplement 1B. In contrast, the unexpectedly high Nstart value for peak IV indicates that integration of TMH3 commences only when its N-terminal end is ~52 residues away from the PTC possibly because of the tight spacing between TMH2 and TMH3.”. These can be perhaps tested by mutagenesis and inserting a hydrophilic stretch between TMH2 and TMH3. How could the tight spacing between TMH2 and TMH3 cause a delayed insertion of TMH3?

7) The basis for Nstart and Nmax for the magenta (mutant) plot in Figure 3D is not clear because there seem three local peaks. What do these local peaks and minima represent?

Good point. The discussion of GlpG peak III has been expanded.

8) Generally, the authors only use Nstart positions for interpretations. But interestingly, peak widths are quite variable among TMHs. Is there any correlation between the peak widths and TMH sequences (or their structural properties)?

We show and discuss Nend values for many of the TMHs. Similar to the Nstart values, the Nend values tend to coincide with the C terminus of a TMH reaching ~45 residues from the PTC, hence the peak width also correlates with the length of the TMH.

9) Results final paragraph: It is puzzling why there is no pulling force for TMH6 but two strong peaks before and after. Can the authors offer an explanation for this observation beyond the general statement? Do mutations in the periplasmic re-entrant helix affect the peaks?

Indeed, this is puzzling and will require further study before anything better than mere conjecture can be proposed.

Reviewer #3:

Technically, this is a very advanced study. However, there are technical issues with the data that are either ignored or marginalized by the authors, but that can affect the conclusions of the work on individual membrane proteins. This in particular concerns the use of Lep sequences in protein fusions to obtain more "interpretable" data.

Figure 1 displays examples for the three membrane proteins tested how FPA is applied. These are all SDS-PAGE/pulse chase measurements with membrane protein truncates arrested by a translational arrest peptide. Depending on the construct different arrest peptides are used to tune the force window. These type of experiments have been done for a very large set of protein truncates, and these data have been summarized in Figures 2-4 in position versus force graphs. The authors should make the crude data (SDS-PAGE images like in Figure 1C) accessible as an external resource so that the actual data can be inspected.

See response to Reviewer 1.

Data availability: With the current presentation, there is no way to access the quality of the data. That there are ambiguities is already evident from Figure 1C with EmrE. The arrested peptide seems shows up as two bands which signals a problem which is not further discusses. It is not clear whether the lower band is included in determining the intensity of A, nor is it clear what the nature of this protein band is. Since the crude data is not available, it is hard to access whether there are more issues with the data and how this would affect the outcome of the work.

Additional bands running below the arrested form of the protein have been seen in previous studies and most likely correspond to nascent chains on ribosomes that are stacked up behind the one arrested on the AP. We now mention this in the legend to Figure 1. Also, we now note in the Materials and methods that Ac and/or FLc controls were included in the SDS-PAGE analysis for constructs where the identities of the A and FL bands were not immediately obvious on the gel.

Apparently, with GlpG, constructs with N 140-160 gave rise to multiple bands on SDS-PAGE that were difficult to interpret. This problem was solved by fusion of a portion of the Lep protein to the N-terminus of GlpG. The Lep part resides in the ribosome tunnel with the AP arrested protein constructs. This seems unsatisfactory, as the Lep sequence that is entirely irrelevant to the GlpG integration process. Since this Lep sequence apparently influences that protein band pattern, it is clearly not invariant to the overall process. Thus it is difficult to argue that with the use of the Lep extensions, meaningful data is obtained.

Relating to the above comment, also with BtuC, Lep fusions are being used. If co-translational folding in the ribosome tunnel can affect the forces measured as the authors claim, replacing parts of GlpP and BtuC with Lep will certainly affect the outcome of the work.

As explained above (and now explicitly in the manuscript), the LepB part is far outside the ribosome exit tunnel in the LepB fusion constructs, and, to the extent that we’ve been able to determine, does not have a strong impact on the FPs.

The authors argue that co-translational folding of the NTD of GlpG in the ribosome tunnel occurs. With folding the authors probably mean secondary structure formation? In the experimental setup, arrested chains are being used, and a long biochemical processing needs to occur before readout. How can the authors be sure that this folding is also relevant for a true, unobstructed translation/integration process? Here, the physiological relevance of the observation needs to be treated with caution.

This is a good point, that has been discussed in earlier studies of co-translational folding of soluble protein domains. Specifically, FPA has been shown to give results fully consistent with results obtained by real-time FRET and NMR (JMB 431:1308). Small domains have been shown to be able to fold completely inside the exit tunnel, and the NTD is small enough that it should be able to fit in the distal part of the tunnel. Moreover, as now mentioned in the text, the folding time that can be predicted for the NTD based on its “relative contact order” is ~1 msec, while the elongation cycle on the ribosome takes ~100 msec/codon. The NTD should thus be able to equilibrate between the unfolded and accessible folded states between each elongation step during ongoing translation, just as it will have time to equilibrate when the ribosome pauses on the last codon of the AP. Finally, the NTD peak in Figure 3B is seen both in vivo and in vitro, i.e, in experiments that are carried out in quite different ways.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Source data 1. Measured fFL values for all EmrE, GlpG, and BtuC constucts reported in Figures 2, 3 and 4 (and the corresponding Supplementary Figures and Supplementary file 1).
    elife-64302-data1.xlsx (80.9KB, xlsx)
    Supplementary file 1. Amino acid sequences of EmrE(Cout), GlpG, and BtuC constructs.
    elife-64302-supp1.docx (69.9KB, docx)
    Transparent reporting form

    Data Availability Statement

    All fFL values measured in this study are included as figures Source Data.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES