Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2016 Sep 6;113(38):10559–10564. doi: 10.1073/pnas.1606776113

Energetics of side-chain snorkeling in transmembrane helices probed by nonproteinogenic amino acids

Karin Öjemalm a,1, Takashi Higuchi b,1, Patricia Lara a, Erik Lindahl a,c, Hiroaki Suga b, Gunnar von Heijne a,c,2
PMCID: PMC5035864  PMID: 27601675

Significance

Membrane proteins are central players in all cells, and their structure and function are under intense study. However, we still lack a detailed understanding of the process whereby they are integrated into biological membranes. Most membrane proteins are integrated cotranslationally into the membrane bilayer. Although the energetics that drive membrane protein integration are known in outline, detailed studies are difficult because the naturally occurring amino acids represent only a limited set of side-chain chemistries. Here we use synthetic, nonproteinogenic amino acids engineered into a transmembrane segment to systematically probe the energetics of membrane insertion in a way not possible with the set of natural amino acids.

Keywords: membrane protein, nonproteinogenic amino acids, translocon

Abstract

Cotranslational translocon-mediated insertion of membrane proteins into the endoplasmic reticulum is a key process in membrane protein biogenesis. Although the mechanism is understood in outline, quantitative data on the energetics of the process is scarce. Here, we have measured the effect on membrane integration efficiency of nonproteinogenic analogs of the positively charged amino acids arginine and lysine incorporated into model transmembrane segments. We provide estimates of the influence on the apparent free energy of membrane integration (ΔGapp) of “snorkeling” of charged amino acids toward the lipid–water interface, and of charge neutralization. We further determine the effect of fluorine atoms and backbone hydrogen bonds (H-bonds) on ΔGapp. These results help establish a quantitative basis for our understanding of membrane protein assembly in eukaryotic cells.


Almost all helical integral membrane proteins are inserted cotranslationally into the endoplasmic reticulum (ER) membrane in eukaryotic cells or the cytoplasmic membrane in prokaryotic cells (1). Insertion is mediated by Sec-type translocons: the Sec61 complex in eukaryotes and the homologous SecYEG complex in prokaryotes. The Sec translocon serves as a protein-conducting channel through which polar stretches of polypeptide can be translocated across the membrane, but it can also open laterally toward the surrounding lipid bilayer, thereby allowing membrane integration of hydrophobic transmembrane helices (1).

Using a cotranslational insertion assay, we previously measured the contribution of each of the 20 natural amino acids to the membrane integration efficiency of model transmembrane segments, and derived a “biological” hydrophobicity scale that assigns an apparent free-energy of membrane integration, ΔGappaa(i),j, to each amino acid of type i [aa(i) = A, C, D, E, …, W, Y] as a function of its position j within a transmembrane α-helix (2). To further probe the physicochemical basis for membrane insertion, we also analyzed a series of aliphatic and aromatic nonproteinogenic amino acids, using the same insertion assay (3). This approach made it possible to quantitate the “hydrophobic effect” during membrane protein insertion, giving nonpolar solvation energy parameter values of –10 cal/(mol⋅Å2) and –7 cal/(mol⋅Å2) for aliphatic and aromatic surface area, respectively.

Here, we extend our analysis of nonproteinogenic amino acids to analogs of the positively charged amino acids Arg and Lys. We provide estimates of the influence on the apparent free energy of membrane integration (ΔGapp) of “snorkeling” of charged amino acids toward the lipid–water interface, and of charge neutralization. We further determine the effect of fluorine atoms and backbone H-bonds on ΔGapp.

Results

Membrane-Insertion Assay.

To examine the effects of nonproteinogenic amino acid analogs on Sec61-mediated insertion of model hydrophobic segments (H segments) into the ER membrane, we used a previously described Lep construct (3) translated in vitro in the presence of ER-derived dog pancreas rough microsomes (Fig. 1A). In this assay, the relative extents of mono- and di-glycosylation provide a direct measure of the efficiency of translocon-mediated insertion of a given hydrophobic segment (H segment). A typical experiment is shown in Fig. 1B; note that the suppression efficiency is ∼50% (Fig. 1B, lane 3). An apparent equilibrium constant for membrane insertion of a given H segment, Kapp, is defined as Kapp = I1gs/I2gs, where I1gs and I2gs are the respective intensities of the bands representing the singly and doubly glycosylated forms of the protein (the two uppermost bands in Fig. 1B, lane 4). Kapp is then converted to an apparent free energy difference ΔGapp = −RT ln(Kapp), where R is the gas constant and T the absolute temperature (T = 293 K).

Fig. 1.

Fig. 1.

Membrane-insertion assay. (A) Model H segments are introduced into the Lep host protein between two engineered acceptor sites for N-linked glycosylation (G1, G2). The [4L,15A] and [6L,13A] H segments (with GGPG…GPGG flanking regions) used in the study are shown. The fractions of membrane-inserted and noninserted H segment are determined by quantitation of radiolabeled singly and doubly glycosylated forms of the protein. (B) In vitro translation of a Lep construct with the H segment AAAALALAAAAALALAXAA (X = Arg) in the absence (−) and presence (+) of tRNAsup charged with Arg, and dog pancreas rough microsomes (RMs). □, Truncated Lep protein resulting from termination at the UAG stop codon in the H segment; ■, truncated Lep protein glycosylated on the G1 site; ○, full-length, nonglycosylated Lep protein; ●, full-length Lep protein glycosylated on the G1 site; ●●, full-length Lep protein glycosylated on the G1 and G2 sites. For this construct, ΔGapp = 0.04 kcal/mol.

A single nonproteinogenic amino acid “X” was inserted in defined positions in H segments of composition [6L,12A,1X] or [4L,14A,1X] by introduction of an amber stop codon (TAG) in the corresponding position in the gene, and carrying out in vitro translation in the presence of a suppressor tRNASup charged with the nonproteinogenic amino acid in question. Charging of the tRNASup was done using Flexizyme (4) (Materials and Methods).

Snorkeling in Arginine Analogs.

The guanidinium group on the Arg side chain has such a high pKa that it remains protonated even when buried in a hydrophobic environment (5). To explore the combined effects of charge and side-chain length [i.e., snorkeling (6, 7)] on membrane insertion, we analyzed a series of Arg analogs with increasingly long side chains of three to eight methyl groups, as well as an analog (l-canavanine), where the δ carbon is replaced by an oxygen atom, thereby lowering the pKa of the guanidinium group to 7.0 (8), and two uncharged carbamide analogs with three and four methyl groups in the side chain (Fig. 2).

Fig. 2.

Fig. 2.

ΔGapp profiles for Arg analogs. (A) Arg analogs were introduced into H segments of composition [6L,12A,1X], where X is the Arg analog. (B) Arg analogs were introduced into H segments of composition [4L,14A,1X]. (C) ΔΔGapp values per one added –CH2– between Arg analogs with increasingly long side chains placed in position 10 in the H segment, after subtraction of an aliphatic contribution of −0.26 kcal/mol per –CH2–. (D) Other Arg analogs with reduced side-chain charge discussed in the text. SDs obtained from three independent experiments are indicated.

As seen in Fig. 2A, the shapes of the ΔGapp profiles are similar for all of the Arg analogs with longer side chains, but the profiles are shifted to progressively lower ΔGapp values as the number of methyl groups in the side chain increases. In particular, addition of one methyl group to an Arg side chain placed in position 10 in an H segment of composition [6L,12A,1R] reduces ΔGapp by −0.30 kcal/mol. Going from four to six methyl groups leads to a further reduction by −0.75 kcal/mol (−0.38 kcal/mol per carbon), and from six to seven methyl groups by −0.63 kcal/mol. Longer side chains with seven and eight methyl groups promote >90% insertion of the H segment, precluding the calculation of accurate ΔGapp values, and were therefore reanalyzed in H segments of composition [4L,14A,1R]; going from seven to eight methyl groups reduces ΔGapp by −0.46 kcal/mol when the Arg analog is in position 10 (Fig. 2B). These stepwise reductions in ΔGapp are larger than the value obtained previously for linear aliphatic side chains (−0.26 kcal/mol per carbon) (3). Subtracting the aliphatic contribution, we find that the ΔΔGapp for moving the charged guanidinium group one methyl group (∼1.2 Å) closer to the membrane interface varies, as shown in Fig. 2C, i.e., the polarity gradient in the membrane is maximal in a region located approximately half-way between the center of the membrane and the membrane–water interface. This finding is consistent with neutron and X-ray diffraction measurements of the water distribution across lipid bilayers (9), indicating that the ER membrane has a similar polarity gradient as a model lipid bilayer.

The uncharged carbamide analog of Arg has a ΔGapp that is −0.30 kcal/mol lower than that of Arg when placed in position 10 in the H segment (Fig. 2D). Lengthening of the side chain of the carbamide analog by one methyl group leads to a further reduction of ΔGapp by −0.24 kcal/mol, very similar to the value found for purely aliphatic side chains (3). The most notable difference between the Arg ΔGapp profile and the profiles for the two carbamide analogs, however, is that the latter are almost perfectly symmetric around position 10, whereas the C-terminal half of the Arg profile has lower ΔGapp values than the N-terminal half. This finding is consistent with the previously demonstrated ability of cytoplasmically located positively charged residues to lower ΔGapp (10), a reflection of the so-called “positive inside rule” (11).

The Arg analog l-canavanine, finally, has a substantially lower ΔGapp than Arg (ΔΔGapp = −0.64 kcal/mol) when placed in position 10 (Fig. 2D), but retains the asymmetric shape of the ΔGapp profile. With a pKa of 7.0, the formal charge of l-canavanine at pH 7 is +0.5; apparently, this is enough to make it behave as a positively charged residue, but with a significantly lower cost for membrane insertion than has Arg.

Snorkeling in Lysine Analogs.

Because of the difficulty in shortening the side-chain length of Lys without inducing the formation of lactones, we were unable to investigate such lysine analogs containing different numbers of methyl groups. Instead, we modified the terminal amino group by acetylation and methylation. Acetylation removes the positive charge, and reduces ΔGapp (ΔΔGapp = −0.67 kcal/mol in position 10 in the H segment) (Fig. 3A). Remarkably, shortening of the acetylated Lys side-chain by one to three methyl groups only marginally increases ΔGapp: in position 10, ΔGapp changes by +0.23 kcal/mol when one methyl group is removed (as expected from the reduced aliphatic surface area), but then remains constant upon removal of the second methyl group and even decreases slightly when the third methyl group is removed (Fig. 3A). Thus, for AcLys(C2) and AcLys(C1), the increase in ΔGapp expected from the removal of aliphatic surface area and deeper burial of the polar moiety is compensated for by some other interaction, possibly H-bonding of the side chain to the backbone as often seen for Ser, Thr, and Cys residues (12). To test whether this is a physically plausible explanation, we performed molecular dynamics (MD) simulations of AcLys and AcLys(C2−C4) introduced in position 10 in a [6L,12A,1X] H segment embedded in a palmitoyloleoylphosphatidylcholine (POPC) bilayer. In support of our hypothesis, and as illustrated by the representative snapshots in Fig. 3C, the side chain of AcLys makes H-bonds to interfacial water with a time-averaged occupancy of 0.5. AcLys(C3) is alternately H-bonded to water and to the backbone carbonyl of the Ala residue in position 6, both with time-averaged occupancies of 0.3. For AcLys(C2), the H-bonds to water and backbone have time-averaged occupancies of 0.6 and 0.4, respectively, and AcLys(C1) forms a H-bond only with the backbone with a time-averaged occupancy of 0.9. This result is also in agreement with earlier measurements on Gln and Asn, which differ by one methyl group but have nearly overlapping ΔGapp profiles (2). In fact, the profiles for AcLys(C2), AcLys(C1), Gln, and Asn in H segments of composition [4L,14A,1X] (2) are all very similar (the ΔGapp values for position 10 are within 0.17 kcal/mol), reflecting the similarities in side-chain composition (one oxygen, one nitrogen, two to four methyl groups).

Fig. 3.

Fig. 3.

ΔGapp profiles for Lys analogs. (A) Acetylated Lys analogs were introduced into H segments of composition [4L,14A,1X], where X is the Lys analog. (B) Methylated Lys was introduced into H segments of composition [6L,12A,1X]. SDs obtained from three independent experiments are indicated. (C) Representative snapshots of Lys analogs introduced in position 10 in a [6L,12A,1X] H segment embedded in a POPC bilayer (headgroups cyan, tails gray). Hydrogen bonds are indicated by yellow dashed lines.

As seen in Fig. 3B, methylation of the Lys side chain reduces ΔGapp by only −0.13 kcal/mol in position 10. MD simulations of H segments with Lys and CH3-Lys in position 10 embedded in a POPC bilayer (Fig. 3C) show that the charged amino group snorkels toward the lipid–water interface in both cases, and hence is located in a region where the introduction of additional nonpolar surface area in the form of an extra methyl group contributes little to the overall ΔGapp.

Fluorination Increases the Hydrophobicity of a Transmembrane Segment.

Fluorine is often used in medicinal chemistry, one reason being that its lipophilic character can make drugs better soluble in lipid membranes and thereby improve their ability to penetrate into cells (13). With this motivation in mind, we measured the effect of fluorine on ΔGapp by changing the terminal methyl group on AcLys and AcLys(C2) to –CF3. This modification reduced ΔGapp by −0.47 kcal/mol for AcLys and by −1.25 kcal/mol for AcLys(C2) when the fluorinated residue was placed in position 10 in the H segment (Fig. 4). These values compare reasonably well to the literature value of −1.0 kcal/mol for the relative hydration-free energies of CH4 vs. CHF3 (14, 15). The difference between CF3-AcLys and CF3-AcLys(C2) presumably reflects the deeper burial in the membrane of the –CF3 group in CF3-AcLys(C2). The fact that corresponding AcLys and AcLys(C2) H-segments have almost the same ΔGapp values (Fig. 3A) has the interesting consequence that ΔGapp for CF3-AcLys(C2) H-segments is lower than for the corresponding CF3-AcLys H-segments, despite the longer side chain of the latter.

Fig. 4.

Fig. 4.

ΔGapp profiles for acetylated Lys analogs with a terminal –CF3 group in H segments of composition [4L,14A,1X]. SDs obtained from three independent experiments are indicated.

A Backbone H-Bond Contributes ∼ −0.7 kcal/mol to ΔGapp.

Although the charged and polar side chains studied above do not have much influence on the helical conformation of the peptide backbone of membrane-integrated H segments, proline specifically disrupts helical structure. Proline prevents the formation of backbone H-bonds and kinks the H segment transmembrane helix, and therefore behaves as a polar residue when inserted into an H segment (2). To better understand the energetics of backbone H-bond formation, we tested the Pro analog carboxy-piperidine [Pro(C5)], which has an additional –CH2– group in the side chain. Unexpectedly, the Pro(C5) H segments had higher ΔGapp values than the corresponding Pro H segments for all tested positions except the most C-terminal one (Fig. 5A), despite the larger aliphatic surface area of Pro(C5). MD simulations of H segments dissolved in cyclohexane (to mimic a membrane environment) containing either a Pro or a Pro(C5) residue in position 10 show that Pro(C5), because of its bulkier side chain, is even more disruptive than Pro, causing a more pronounced break in the helix and the loss of 0.8 additional i,i+4 H-bonds (Fig. 5B). The average difference in exposed aliphatic surface area between the simulated helices with Pro or Pro(C5) is 33 Å2, corresponding to a predicted difference in ΔGapp of −0.33 kcal/mol. Experimentally, the difference in ΔGapp between the two H segments is +0.26 kcal/mol, hence the loss of 0.8 main-chain H-bonds (mainly between residues 4 and 8) (Fig. 5B) contributes ∼0.6 kcal/mol to ΔGapp, corresponding to ∼0.7 kcal/mol per H-bond. From previous measurements on natural amino acids in position 10, we know that Val and Thr contribute, respectively, −0.31 and +0.52 kcal/mol to ΔGapp (16); that is, because the surface areas of a –CH3 and an –OH group are similar, the H-bond on Thr contributes ∼0.8 kcal/mol to ΔGapp. The two measures agree that the loss of an H-bond in the interior of the ER membrane is thus associated with an increase in ΔGapp of 0.7–0.8 kcal/mol, in the same range as observed for the contribution of a membrane-buried H-bond to the stability of membrane proteins in detergent solution (0.5–1.0 kcal/mol) (17).

Fig. 5.

Fig. 5.

(A) ΔGapp profiles for Pro and the Pro(C5) analog in H segments of composition [6L,12A,1X]. SDs obtained from three independent experiments are indicated. (B) MD simulations of H segments AAAALALALXLALALAAAA, where “X” is Pro or Pro(C5), dissolved in cyclohexane. Representative snapshots are shown on top, and the fraction of i,i+4 H-bonds to the backbone carboxyl group in position i averaged over the simulation trajectory is shown at the bottom.

Discussion

Nonproteinogenic amino acids are powerful biochemical tools, in part because they make it possible to vary systematically the physico-chemical characteristics of amino acid side chains in ways that the restricted set of natural amino acids does not. In an earlier study, we used nonproteinogenic amino acids to measure nonpolar solvation energy parameters for aliphatic and aromatic surface areas that apply during cotranslational insertion of transmembrane α-helices into the ER membrane (3). We have now used the same approach to study the energetics of membrane insertion of positively charged amino acids, fluorine atoms, and backbone H-bonds.

Our analysis of how charged Arg analogs of increasing side-chain length affects ΔGapp shows that snorkeling toward the membrane–water interface can significantly reduce the cost of burying a charged guanidnium group in the membrane; we estimate that moving the guanidinium group ∼1.2 Å closer to the interface (i.e., a distance corresponding to the addition of one methyl group to the side chain) can lower ΔGapp by up to −0.4 kcal/mol. The maximal effect is seen when the guanidinium group is located approximately half-way between the center of the membrane and the membrane–water interface.

Uncharged carbamide analogs of Arg nicely illustrate both the effect of aliphatic surface area and the strong effect of positively charged residues located near the cytoplasmic end of a transmembrane helix on ΔGapp. When placed in the center of the H segment (position 10), the uncharged carbamide analog reduces ΔGapp by −0.30 kcal/mol compared with a charged Arg residue. In contrast to the ΔGapp profile for Arg, the profile for the carbamide analog is perfectly symmetric, which has the interesting consequence that Arg has a higher cost of membrane insertion than the uncharged carbamide analog in the luminal two-thirds of the H segment, but a lower cost of insertion in the cytoplasmic third. Adding an extra methyl group to the side chain of the carbamide analog reduces ΔGapp by the amount expected from the added aliphatic surface area (−0.24 kcal/mol), while maintaining the symmetric shape of the ΔGapp profile. Reducing the pKa of the Arg side chain by replacing the Cδ by O markedly reduces ΔGapp, especially in the cytosolic half of the H segment where the side chain is likely snorkeling toward the cytosol.

The Lys analogs that we have analyzed behave as expected: acetylation of the side chain (rendering it uncharged) reduces ΔGapp by −0.67 kcal/mol in position 10 of the H segment, twice the amount seen for carbamidation of Arg in the same position. The delocalized charge on the Arg guanidinium group is thus less costly to bury in the membrane than is the charge on the –NH3+ group on Lys. Methylation of the amino group on the Lys side chain (that retains its charge) has only a marginal effect on ΔGapp.

Analogs of acetylated lysine (AcLys) with shortened side chains show an interesting behavior: the removal of one methyl group from AcLys leads to the expected increase of +0.23 kcal/mol in ΔGapp caused by the reduction in aliphatic surface area, but further shortening of the side chain does not cause any further increase in ΔGapp. This finding is reminiscent of our earlier observation that the ΔGapp profiles of Asn and Gln are nearly identical (2), despite the additional methyl group in the Gln side chain. H-bond formation between the shorter side chains and the helix backbone seems to offset the loss of aliphatic surface area in these cases.

Finally, replacing the terminal –CH3 group in the AcLys and AcLys(C2) side chains by –CF3 reduces ΔGapp by −0.5 and −1.2 kcal/mol, respectively, when the residue is in position 10 in the H segment, clearly attesting to the lipophilicity of fluorine in the context of the ER membrane. In addition, a comparison of the energetic effects of introducing either Pro or the Pro(C5) analog allows us to derive a value of 0.7–0.8 kcal/mol for the energy of a backbone H-bond in the membrane environment.

Materials and Methods

Enzymes and Chemicals.

All enzymes were purchased from Fermentas, except Phusion DNA polymerase from Finnzyme and SP6 RNA polymerase from Promega. The QuikChange Site-directed Mutagenesis kit and deoxyribonucleotides were from Stratagene, and the Megashortscript T7 kit was from Ambion. The plasmid pGEM1, the rabbit reticulocyte lysate system and the RNasin were from Promega. Oligonucleotides were from MWG Operon. All chemicals were from Sigma-Aldrich, except DMSO from J.T. Baker Chemicals, ethanol from Kemetyl, and [35S]-methionine from PerkinElmer.

DNA Manipulations.

For cloning, a modified version of the Escherichia coli lepB gene (signal peptidase) in a pGEM1 vector was used (16). This version harbors an engineered test segment (H segment) in the coding region of the P2 domain between a Spe1 cleavage site in codons 226–227 and a Kpn1 cleavage site in codon 253 (WT lepB codon positions) as well as two glycosylation acceptor sites for N-linked glycosylation at codons 96–98 (G1: Asn-Ser-Thr) and codons 258–260 (G2: Asn-Ala-Thr). The sequences of the H-segment’s flanking regions are: QETKENGIRLSETSGGPG-(H segment)-GPGGVPGQQNATWIVPP (Spe1 and Kpn1 cleavage sites underlined). The introduction of amber stop codons (TAG) into the termini of the H-segment-encoding sequence was done by site-directed mutagenesis using Pfu Turbo polymerase. To introduce amber stop codons in or near the middle of the H segment, double-stranded oligonucleotides encoding the H segment with the amber stop (including also GGPG/GPGG) and flanked by N-terminal Spe1 and C-terminal Kpn1 sticky ends were first generated by annealing of two pairs of complementary oligonucleotides with overlapping overhangs (each 18–45 nucleotides long), followed by annealing of the pairs via the complementary overhangs and cloning into the lepB gene between the Spe1 and Kpn1 cleavage sites (16).

Flexizyme dFx and eFx and ML-tRNAAsncta Preparation.

Preparation of Flexizyme (dFx, eFx) and mycobacteriophage L5 (ML)-derived ML-tRNAAsncta (tRNAsup) were done using the same protocol. First, double-stranded DNA templates encoding the RNA species and a N-terminal T7 promotor sequence (18, 19) were generated by PCR extension of annealed overlapping oligonuclotides. DNA templates were then amplified by PCR using primers complementary to both ends of the templates, followed by phenol/chloroform extraction and ethanol precipitation. The DNA was used in a second step for transcription by T7 polymerase using the Ambion Megashortscript T7 kit, and the RNA product was isopropanol precipitated and purified over 12% denaturing PAGE. After cutting out the RNA band, RNA was eluted for 2 h in 0.3 M NaCl, ethanol-precipitated, and dissolved in dH2O (70- to 250-μM final concentration).

Synthesis and Characterization of Amino Acid Derivatives.

Amino acid derivatives were synthesized as described previously (18, 20). For NMR characterization of the amino acid derivatives, see the Supporting Information.

Acylation of ML-tRNAAsncta.

Flexizyme (dFx in all cases, except for aromatic amino acids where eFx was used) and ML-tRNAAsncta (tRNAsup), each 250 μM in 71 mM Hepes-K buffer pH 7.5 (total volume 7 μL), were heated at 95 °C for 2 min and cooled to room temperature over 5 min. One-microliter MgCl2 (200 mM for dFx and 3 M for eFx reactions) and 2 μL of amino acid-DBE (25 mM, nonaromatic amino acids) or amino acid-CME (25 mM, aromatic amino acids) were added. The reaction was carried out for 2 h on ice and then stopped with 40 μM AcONa (0.3M, pH 5.2). RNA was ethanol-precipitated twice, once with a 0.1 M ethanolic AcONa solution (pH 5.2) and once with 95% (vol/vol) ethanol. More detailed information can be found elsewhere (18).

Expression in Vitro and Quantification of Membrane Insertion Efficiency.

All constructs were transcribed for 60 min at 37 °C using a standard SP6 polymerase transcription protocol (21). Resulting mRNA was translated for 80 min at 30 °C in rabbit reticulocyte lysate (40–60 pg mRNA per microliter of translation mix) in the presence of [35S]-Met (370 pCi/μL translation mix), dog pancreas rough microsomes (50 pL/μL translation mix), amino acid mix (each 75 μM/μL translation mix), amino acid-tRNAsup (3.5 pmol/μL translation mix), and RNasin (3 μ/μL translation mix). Translation products were analyzed by SDS/PAGE. Gels were visualized on a Fuji FLA-3000 phosphoimager using the Image Reader 8.1j software and quantified using ImageGauge v3.45 and the Qtiplot 0.9.3-rc2 softwares (22). The degree of membrane integration of each H-segment was calculated as an apparent equilibrium constant between the membrane-integrated and nonintegrated forms: Kapp = f1/f2, where f1 is the fraction of singly and f2 the fraction of doubly glycosylated Lep molecules and the results were then converted to apparent free energies, ΔGapp = −RT lnKapp. All reported ΔGapp values are averages of three or four independent measurements. The degree of suppression of the UAG stop codon obtained in the in vitro translation system was around 50% for the different aminoacylated tRNAsup species, and around 5% when an uncharged tRNAsup was present in the translation mix.

MD Simulations.

The H-bond geometry in nonpolar environments was studied by building models of isolated H segments in ideal α-helical geometry with noncharged termini. Each segment was energy-minimized and solvated in 5.5-nm rhombic dodecahedron unit cells with ∼600 cyclohexane molecules as a simplified model of the bilayer interior. The systems were subject to 10 ns of relaxation with uniform pressure scaling to 1 bar using a Parinello–Rahman barostat (23), temperature scaling to 300 K using separate Bussi thermostats (24) for protein and cyclohexane, and position restraints (1,000 kJ⋅mol−1⋅nm−2) followed by 100 ns of simulations without restraints. All simulations were performed with Gromacs-5.0 (25), interactions were modeled with the Amber99SB-ILDN force field (26), 2-fs time steps were used with constraints on bonds involving hydrogens, and the nonbonded interactions were calculated with 1.0-nm cut-offs combined with partical mesh Ewald long-range electrostatics (27). During analysis, H-bonds were classified as present if the distance between the donor and acceptor was within 0.35 nm and the angle donor-hydrogen acceptor was below 30°. Solvent-accessible surfaces areas were calculated with the double cubic lattice method (28) using 300 dots per sphere and a probe radius of 0.14 nm. Simulations of Lys analog snorkeling were performed by embedding an ideal α-helix with the sequence Ace-GGPGAAAALALALXLALALAAAAGPGG-Nme (with X as the Lys analog) in a hexagonal POPC lipid bilayer with 7-nm side (128 lipids, 60 waters per lipid) using the GROMACS g_membed program. Simulation and analysis settings were as described above, with the Berger force field used for lipids (29). The lipids were relaxed for 50 ns while position restraints were applied to each helix, followed by 250 ns of unrestrained simulations, the first 50 ns of which was also discarded as relaxation.

NMR Characterization of Amino Acid Derivatives

Arg-carbamide-DBE⋅HCl.

1H NMR (500 MHz, D2O) d 1.41 (m, 1H), 1.53 (m, 1H), 1.92 (m, 2H), 3.00 (m, 2H), 4.23 (m, 1H), 5.42 (d, 1H, J = 10.8 Hz), 5.47 (d, 1H, J = 10.8 Hz), 8.63 (s, 2H), 8.99 (s, 1H).

Arg(C4)-carbamide-DBE⋅HCl.

1H NMR [300 MHz, acetone-d6/D2O, 10/1 (vol/vol)] d 1.42–1.71 (m, 4H), 2.04–2.21 (m, 2H), 3.15 (t, 2H, J = 6.6 Hz), 4.40 (t, 1H, J = 6.3 Hz), 5.65 (s, 2H), 8.78 (d, 2H, J = 2.1 Hz), 8.93 (t, 1H, J = 2.1 Hz).

Canavanine-DBE⋅TFA.

1H NMR (300 MHz, D2O) d 2.25–2.50 (m, 2H), 4.06 (t, 2H, J = 6.0 Hz), 4.43 (dd, 1H, J = 6.6, 5.7 Hz), 5.47 (d, 1H, J = 12.6 Hz), 5.61 (d, 1H, J = 12.6 Hz), 8.74 (d, 2H, J = 2.1 Hz), 8.74 (t, 1H, J = 2.1 Hz).

Arg(C4)-DBE⋅HCl.

1H NMR (500 MHz, D2O) d 1.33 (m, 1H), 1.45 (m, 1H), 1.55 (m, 2H), 1.45), 1.95 (m, 2H), 3.07 (m, 2H), 4.25 (dd, 1H, J = 6.1, 6.1 Hz), 5.45 (d, 1H, J = 13.1 Hz), 5.52 (d, 1H, J = 13.1 Hz), 8.67 (s, 2H), 9.05 (s, 1H).

Arg(C6)-DBE⋅HCl.

1H NMR [300 MHz, acetone-d6/D2O, 10/1 (vol/vol)] d 1.20–1.62 (m, 8H), 1.92 (m, 2H), 3.01 (t, 2H, J = 7.2 Hz), 4.20 (dd, 1H, J = 6.3, 5.7 Hz), 5.39 (d, 1H, J = 13.2 Hz), 5.49 (d, 1H, J = 13.2 Hz), 8.64 (d, 2H, J = 2.1 Hz), 9.00 (t, 1H, J = 2.1 Hz).

Arg(C7)-DBE⋅HCl.

1H NMR [300 MHz, acetone-d6/D2O, 10/1 (vol/vol)] d 1.19–1.64 (m, 10H), 2.07 (m, 2H), 3.23 (t, 2H, J = 6.9 Hz), 4.37 (dd, 1H, J = 6.3, 6.3 Hz), 5.59 (d, 1H, J = 13.2 Hz), 5.69 (d, 1H, J = 13.2 Hz), 8.76 (d, 2H, J = 2.1 Hz), 8.98 (t, 1H, J = 2.1 Hz).

Arg(C8)-DBE⋅HCl.

1H NMR [300 MHz, acetone-d6/D2O, 10/1 (vol/vol)] d 1.27–1.65 (m, 12H), 2.08 (m, 2H), 3.24 (t, 2H, J = 7.2 Hz), 4.37 (dd, 1H, J = 6.3, 6.3 Hz), 5.59 (d, 1H, J = 13.2 Hz), 5.71 (d, 1H, J = 13.2 Hz), 8.77 (d, 2H, J = 1.8 Hz), 8.98 (t, 1H, J = 2.1 Hz).

Lys-DBE⋅HCl.

1H NMR (300 MHz, D2O) d 1.35 (m, 2H), 1.61 (m, 2H), 1.90 (m, 2H), 2.94 (t, 2H, J = 7.5 Hz), 4.25 (t, 1H, J = 6.3 Hz), 5.48 (s, 2H), 8.65 (d, 2H, J = 2.1 Hz), 9.02 (t, 1H, J = 2.1 Hz).

CH3-Lys-DBE⋅HCl.

1H NMR (300 MHz, DMSO-d6) d 1.40 (m, 1H), 1.48–1.69 (m, 3H), 1.84–2.07 (m, 2H), 2.60 (s, 3H), 2.74 (m, 2H), 4.17 (m, 1H), 5.55 (s, 2H), 8.12 (br, 3H), 8.77 (d, 2H, J = 2.4 Hz), 8.83 (t, 1H, J = 2.4 Hz), 9.61 (br, 1H), 10.01 (br, 1H).

Proline(C4)-DBE⋅HCl.

1H NMR (300 MHz, acetone-d6) d 1.89 (m, 2H), 2.27 (m, 1H), 2.37 (m, 1H), 4.61 (dd, d, 1H, J = 8.1, 7.5 Hz), 5.49 (s, 2H), 8.62 (d, 2H, J = 2.1 Hz), 9.06 (t, 1H, J = 2.1 Hz).

“Carboxy-piperidine”(C5-Proline)-DBE⋅HCl.

1H NMR (300 MHz, D2O) d 1.50–1.97 (m, 5H), 2.35 (m, 1H), 3.06 (m, 1H), 3.49 (m, 1H), 4.16 (dd, 1H, J = 10.8, 3.6 Hz), 5.50 (s, 2H), 8.67 (d, 2H, J = 2.1 Hz), 9.06 (t, 1H, J = 2.1 Hz).

AcLys (1)-DBE⋅HCl.

1H NMR (500 MHz, DMSO-d6) d 1.68 (s, 3H), 3.57 (m, 2H), 4.22 (m, 1H), 5.42 (d, 1H, J = 13.2 Hz), 5.50 (d, 1H, J = 13.2 Hz), 8.27 (br, 1H), 8.56 (br, 3H), 8.77 (s, 2H), 8.94 (s, 1H).

AcLys (2)-DBE⋅HCl.

1H NMR (500 MHz, DMSO-d6) d 1.80 (s, 3H), 1.90–2.07 (m, 2H), 3.20 (m, 2H), 4.18 (m, 1H), 4.12 (m, 1H), 5.48 (d, 1H, J = 13.2 Hz), 5.52 (d, 1H, J = 13.2 Hz), 8.13 (br, 1H), 8.59 (br, 3H), 8.83 (s, 2H), 8.93 (s, 1H).

AcLys (3)-DBE⋅HCl.

1H NMR (500 MHz, DMSO-d6) d 1.42 (m, 1H), 1.51 (m, 1H), 1.75 (s, 3H), 1.89(m, 2H), 3.00 (m, 2H), 4.16 (m, 1H), 5.34–5.52 (m, 2H), 7.90 (br, 1H), 8.58 (br, 3H), 8.82 (s, 2H), 8.91 (s, 1H).

AcLys-DBE⋅HCl.

1H NMR (500 MHz, DMSO-d6) d 1.23–1.39 (m, 4H), 1.76(s, 3H), 1.90 (m, 2H), 2.98 (m, 2H), 4.12 (m, 1H), 5.50 (s, 2H), 7.89 (br, 1H), 8.62 (br, 3H), 8.73 (s, 2H), 8.79 (s, 1H).

CF3-AcLys(C2)-DBE⋅HCl.

1H NMR (300 MHz, DMSO-d6) d 2.05 (m, 1H), 2.14 (m, 1H), 3.37 (m, 2H), 4.23 (m, 1H), 5.50 (d, 1H, J = 13.2 Hz), 5.52 (d, 1H, J = 13.2 Hz), 8.54 (br, 3H), 8.76 (s, 2H), 8.83 (s, 1H) 9.58 (s, 1H).

Preparation methods for Arg-tRNASup and CF3-AcLys–tRNASup are described in refs. 18 and 20, respectively.

Acknowledgments

This work was supported by grants from the Swedish Foundation for Strategic Research, the European Research Council (ERC-2008-AdG 232648), and the Swedish Cancer Foundation (to G.v.H.); grants from the Swedish Research Council and the Knut and Alice Wallenberg Foundation (to G.v.H. and E.L.); and a grant from the Japan Science and Technology Agency Strategic International Research Cooperative Program and Japan Society for the Promotion of Science Bilateral Join Research Projects (to H.S.).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1606776113/-/DCSupplemental.

References

  • 1.Cymer F, von Heijne G, White SH. Mechanisms of integral membrane protein insertion and folding. J Mol Biol. 2015;427(5):999–1022. doi: 10.1016/j.jmb.2014.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hessa T, et al. Molecular code for transmembrane-helix recognition by the Sec61 translocon. Nature. 2007;450(7172):1026–1030. doi: 10.1038/nature06387. [DOI] [PubMed] [Google Scholar]
  • 3.Öjemalm K, et al. Apolar surface area determines the efficiency of translocon-mediated membrane-protein integration into the endoplasmic reticulum. Proc Natl Acad Sci USA. 2011;108(31):E359–E364. doi: 10.1073/pnas.1100120108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Goto Y, Katoh T, Suga H. Flexizymes for genetic code reprogramming. Nat Protoc. 2011;6(6):779–790. doi: 10.1038/nprot.2011.331. [DOI] [PubMed] [Google Scholar]
  • 5.Fitch CA, Platzer G, Okon M, Garcia-Moreno BE, McIntosh LP. Arginine: Its pKa value revisited. Protein Sci. 2015;24(5):752–761. doi: 10.1002/pro.2647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mishra VK, Palgunachari MN, Segrest JP, Anantharamaiah GM. Interactions of synthetic peptide analogs of the class A amphipathic helix with lipids. Evidence for the snorkel hypothesis. J Biol Chem. 1994;269(10):7185–7191. [PubMed] [Google Scholar]
  • 7.Chamberlain AK, Lee Y, Kim S, Bowie JU. Snorkeling preferences foster an amino acid composition bias in transmembrane helices. J Mol Biol. 2004;339(2):471–479. doi: 10.1016/j.jmb.2004.03.072. [DOI] [PubMed] [Google Scholar]
  • 8.Boyar A, Marsh RE. l-canavanine, a paradigm for the structures of substituted guanidines. J Am Chem Soc. 1982;104(7):1995–1998. [Google Scholar]
  • 9.Wiener MC, White SH. Structure of a fluid dioleoylphosphatidylcholine bilayer determined by joint refinement of X-ray and neutron diffraction data. III. Complete structure. Biophys J. 1992;61(2):434–447. doi: 10.1016/S0006-3495(92)81849-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lerch-Bader M, Lundin C, Kim H, Nilsson I, von Heijne G. Contribution of positively charged flanking residues to the insertion of transmembrane helices into the endoplasmic reticulum. Proc Natl Acad Sci USA. 2008;105(11):4127–4132. doi: 10.1073/pnas.0711580105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Heijne G. The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans-membrane topology. EMBO J. 1986;5(11):3021–3027. doi: 10.1002/j.1460-2075.1986.tb04601.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Gray TM, Matthews BW. Intrahelical hydrogen bonding of serine, threonine and cysteine residues within alpha-helices and its relevance to membrane-bound proteins. J Mol Biol. 1984;175(1):75–81. doi: 10.1016/0022-2836(84)90446-7. [DOI] [PubMed] [Google Scholar]
  • 13.Wang J, et al. Fluorine in pharmaceutical industry: Fluorine-containing drugs introduced to the market in the last decade (2001-2011) Chem Rev. 2014;114(4):2432–2506. doi: 10.1021/cr4002879. [DOI] [PubMed] [Google Scholar]
  • 14.Hine J, Mookerjee PK. The intrinsic hydrophilic character of organic compounds. Correlations in terms of structural contributions. J Org Chem. 1975;40(3):292–298. [Google Scholar]
  • 15.Gough CA, Pearlman DA, Kollman PA. Calculations of the relative free energies of aqueous solvation of several fluorocarbons: A test of the bond potential of mean force correction. J Chem Phys. 1993;99(11):9103–9110. [Google Scholar]
  • 16.Hessa T, et al. Recognition of transmembrane helices by the endoplasmic reticulum translocon. Nature. 2005;433(7024):377–381. doi: 10.1038/nature03216. [DOI] [PubMed] [Google Scholar]
  • 17.Joh NH, et al. Modest stabilization by most hydrogen-bonded side-chain interactions in membrane proteins. Nature. 2008;453(7199):1266–1270. doi: 10.1038/nature06977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Murakami H, Ohta A, Ashigai H, Suga H. A highly flexible tRNA acylation method for non-natural polypeptide synthesis. Nat Methods. 2006;3(5):357–359. doi: 10.1038/nmeth877. [DOI] [PubMed] [Google Scholar]
  • 19.Ohta A, Murakami H, Higashimura E, Suga H. Synthesis of polyester by means of genetic code reprogramming. Chem Biol. 2007;14(12):1315–1322. doi: 10.1016/j.chembiol.2007.10.015. [DOI] [PubMed] [Google Scholar]
  • 20.Morimoto J, Hayashi Y, Suga H. Discovery of macrocyclic peptides armed with a mechanism-based warhead: Isoform-selective inhibition of human deacetylase SIRT2. Angew Chem Int Ed Engl. 2012;51(14):3423–3427. doi: 10.1002/anie.201108118. [DOI] [PubMed] [Google Scholar]
  • 21.Nilsson I, Johnson AE, von Heijne G. How hydrophobic is alanine? J Biol Chem. 2003;278(32):29389–29393. doi: 10.1074/jbc.M212310200. [DOI] [PubMed] [Google Scholar]
  • 22.Hedin LE, et al. Membrane insertion of marginally hydrophobic transmembrane helices depends on sequence context. J Mol Biol. 2010;396(1):221–229. doi: 10.1016/j.jmb.2009.11.036. [DOI] [PubMed] [Google Scholar]
  • 23.Parrinello M, Rahman A. Polymorphic transitions in single-crystals—A new molecular-dynamics method. J Appl Phys. 1981;52(12):7182–7190. [Google Scholar]
  • 24.Bussi G, Zykova-Timan T, Parrinello M. Isothermal-isobaric molecular dynamics using stochastic velocity rescaling. J Chem Phys. 2009;130(7):074101. doi: 10.1063/1.3073889. [DOI] [PubMed] [Google Scholar]
  • 25.Abraham M, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1:19–25. [Google Scholar]
  • 26.Lindorff-Larsen K, et al. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins. 2010;78(8):1950–1958. doi: 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Essmann U, et al. A smooth particle mesh Ewald method. J Chem Phys. 1995;103(19):8577–8593. [Google Scholar]
  • 28.Eisenhaber F, Lijnzaad P, Argos P, Sander C, Scharf M. The double cubic lattice method—Efficient approaches to numerical-integration of surface-area and volume and to dot surface contouring of molecular assemblies. J Comput Chem. 1995;16(3):273–284. [Google Scholar]
  • 29.Berger O, Edholm O, Jähnig F. Molecular dynamics simulations of a fluid bilayer of dipalmitoylphosphatidylcholine at full hydration, constant pressure, and constant temperature. Biophys J. 1997;72(5):2002–2013. doi: 10.1016/S0006-3495(97)78845-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES