Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
. 2019 Nov 11;29(2):469–479. doi: 10.1002/pro.3763

GeTFEP: A general transfer free energy profile of transmembrane proteins

Wei Tian 1, Hammad Naveed 2, Meishan Lin 1, Jie Liang 1,
PMCID: PMC6954696  PMID: 31658402

Abstract

Free energy of transferring amino acid side‐chains from aqueous environment into lipid bilayers, known as transfer free energy (TFE), provides important information on the thermodynamic stability of membrane proteins. In this study, we derived a TFE profile named General Transfer Free Energy Profile (GeTFEP) based on computation of the TFEs of 58 β‐barrel membrane proteins (βMPs). The GeTFEP agrees well with experimentally measured and computationally derived TFEs. Analysis based on the GeTFEP shows that residues in different regions of the transmembrane (TM) segments of βMPs have different roles during the membrane insertion process. Results further reveal the importance of the sequence pattern of TM strands in stabilizing βMPs in the membrane environment. In addition, we show that GeTFEP can be used to predict the positioning and the orientation of βMPs in the membrane. We also show that GeTFEP can be used to identify structurally or functionally important amino acid residue sites of βMPs. Furthermore, the TM segments of α‐helical membrane proteins can be accurately predicted with GeTFEP, suggesting that the GeTFEP is of general applicability in studying membrane protein.

Keywords: beta barrel membrane proteins, hydrophobicity scale, thermodynamical stability, transfer free energy, transmembrane proteins, outer membrane proteins, protein stability

1. INTRODUCTION

Membrane proteins play important roles in cellular metabolism, signaling regulation, and intercellular interactions.1 Knowledge of the thermodynamic stability of membrane proteins is essential for understanding their folding behavior and their structure–function relationship.2, 3, 4, 5 A widely used measure to estimate the stabilities of membrane proteins is the transfer free energies (TFEs), which quantify the free energies of transferring amino acid residues from aqueous environment into lipid bilayers.6, 7, 8, 9, 10, 11

Often called hydrophobicity scales, transfer‐free energies have been measured experimentally based on several model systems. The Wimley–White whole residue scale (WW‐scale) measures TFEs of residue partitioning between water and octanol using a set of peptides as the host of amino acids.8 The biological scale (Bio‐scale) of Hessa et al. measures the free energies required to transfer residues in polypeptides into the ER membrane through the translocon machinery.9, 12 The Moon–Fleming whole protein scale (MF‐scale) measures TFEs of residues from water to the membrane core in the context of a whole β‐barrel membrane protein (βMP).10 These experimentally obtained hydrophobicity scales have led to improved understanding of the structures and functions of membrane proteins13 and have been used in the prediction of transmembrane (TM) segments of membrane proteins.14 These hydrophobicity scales are obtained through the measurement of different systems. Beyond correlation, transformations between them involve additional shifting and/or compression of one of the scales. We refer interested readers to Refs. 13 and 15.

Experimental measurement of TFEs is technically challenging, cumbersome, and costly.16, 17 Complementing experimentally measured TFEs, several hydrophobicity scales have been derived computationally, which can aid in our understanding of the governing principles of membrane protein folding.2, 4, 18 The E Z α and E Z β empirical potentials are knowledge‐based hydrophobicity scales. They have been successfully applied in predicting the positioning of membrane proteins in the lipid bilayer, in discriminating side‐chain decoys, and in identifying protein–lipid interfaces.19, 20 However, these scales obtained from statistical analysis do not consider the physical interactions either between residues from neighboring helices/strands or within the same helix/strand, which are known to be important for membrane protein folding.21, 22 There have also been studies based on molecular dynamics (MD) simulations to calculate TFEs,15, 23, 24 although the choice of the reference state before membrane insertion remains a challenging task.24

Another method of deriving TFEs computationally was developed for βMPs recently.11 This method is based on a statistical mechanical model in a discretized conformational space. It incorporates both intra‐ and inter‐strand interactions in the TM segments of the proteins. It can be used to calculate TFEs of any lipid‐facing residue in the TM segment of a βMP, as long as the number of TM strands of the protein is no more than 12. The computed TFE scale (OmpLA scale) is in excellent agreement with the MF‐scale with a Pearson correlation coefficient r = .90. This scale has been applied successfully to explain how the functional fold and topology of the βMP are determined by the asymmetry of both the Gram‐negative bacterial outer membrane and the TM residues.11 A further algorithmic extension of this method has greatly reduced the computational cost, enabling the calculation of TFEs on all βMP known so far, regardless of their sizes, with little loss of the accuracy.25

In this study, we use the new algorithm25 to compute the depth‐dependent TFE profile of each βMP in a non‐redundant set of 58 βMPs. After examining their overall patterns, we found that there exists a general TFE profile applicable to all βMPs, which we call the General Transfer Free Energy Profile (GeTFEP). Readers can use GeTFEP at http://sts.bioe.uic.edu/getfep/index.html. GeTFEP agrees well with previously measured and computed TFEs. Analysis based on GeTFEP shows that residues in different regions of the TM segment have different roles during the membrane insertion process. Our results further reveal the importance of the sequence pattern of TM segments in stabilizing βMPs in the membrane environment. In addition, we also show that GeTFEP can be used to predict the positioning and orientation of βMPs when embedded in the membrane, with overall results in good agreement with experimental data. Furthermore, we show that the GeTFEP can be used to locate structurally or functionally important sites of βMPs. In addition, TM segments of α‐helical membrane proteins (αMPs) can also be accurately predicted using the GeTFEP, suggesting GeTFEP has general applicability in studying membrane protein.

2. RESULTS

2.1. General transfer free energy profile

2.1.1. Computation of TFE profiles of βMPs

Using the methods described in Ref. 25, we calculate the depth‐dependent TFE profiles for each βMP in a non‐redundant set of 58 βMPs. The proteins in this set have ≤30% pairwise sequence similarity. Briefly, for each βMP, we substituted each lipid‐facing residue in the TM region to the other 19 amino acids. We calculated the TFEs of each amino acid substitution using Ala as the reference. The TFE profile of the protein was then obtained by taking an average of the TFE values of the same amino acid type at the same depth position in the membrane. As an example, Figure 1 shows the computed TFE profile of the protein LptD (PDB ID: http://firstglance.jmol.org/fg.htm?mol=4q35).

Figure 1.

Figure 1

The transfer free energy (TFE) profile of the LptD protein. Each subfigure shows the calculated free energies (y‐axis) of a specific amino acid type when transferred to a certain depth position of the membrane (x‐axis). The depth position of the membrane is discretized and indexed from −4 to +4, starting from the periplasmic side to the extracellular side of the outer membrane. Depth position 0 is at the mid‐plane of the membrane

2.1.2. Derivation of GeTFEP

Although the 58 βMPs are in different oligomerization states, have different sizes (strand numbers) of TM segments, and come from different organisms, their TFE profiles are remarkably similar. Results of clustering their profiles show that the 58 βMPs can be grouped into only one group (with 56 βMPs) and two outliers: α‐ and γ‐hemolysins (PDB ID: http://firstglance.jmol.org/fg.htm?mol=7ahl and http://firstglance.jmol.org/fg.htm?mol=3b07). Details of the clustering method can be found in the Supporting Information.

Unlike the other βMPs, the TM regions of both α‐ and γ‐hemolysins are formed by repeated β‐hairpin (Figure S2B), which make their TFE profiles highly sensitive to the composition of the β‐hairpin and the local interactions of residues within the hairpin (Figures S2C and S2D). Accordingly, we further investigate whether α‐ and γ‐hemolysins have truly different thermodynamic properties than βMPs, or their outlier status is due to the special architecture of repeated β‐hairpins.

We first computed the TFE profiles of artificially generated hemolysin‐like βMPs constructed by repeating each β‐hairpin in our βMP set. Altogether, we computed TFE profiles for 778 artificial hemolysin‐like βMPs. We then sampled from these profiles with replacement and computed the distribution of the distance from each sampled profile to the average profile of all sampled artificial βMPs. The distances from the TFE profiles of both α‐ and γ‐hemolysins to the average profile are at the 80th percentile in the distance distribution (Figure 2b), indicating that α‐ and γ‐hemolysins are not fundamentally different in their thermodynamic properties from other βMPs. Therefore, we conclude that a general TFE profile exists and is applicable to all βMPs, including α‐ and γ‐hemolysins. We derive the GeTFEP by averaging the TFEs of a specific amino acid at the same lipid bilayer depth position for all 58 βMPs (Figure 2c).

Figure 2.

Figure 2

Derivation of the GeTFEP. (a) Results of hierarchical clustering shows that all βMPs in the data set can be grouped into one cluster, except α‐ and γ‐hemolysins (http://firstglance.jmol.org/fg.htm?mol=7ahl and http://firstglance.jmol.org/fg.htm?mol=3b07). (b) The distribution of distance between the sampled TFE profiles of the artificially constructed hemolysin‐like βMPs and their average TFE profile. The distances of both α‐ and γ‐hemolysins are at the 80th percentile of the distribution. (c) The General Transfer Free Energy Profile (GeTFEP) of each residue (blue), and the corresponding curves fitted by third degree polynomials (red)

2.1.3. Comparison with other hydrophobicity scales

We then examine how GeTFEP compares with other hydrophobicity scales. Since most experimentally measured scales are not depth‐dependent, we first compare the scale of the TFEs at the hydrocarbon core position of depth 0 in the GeTFEP with other hydrophobicity scales. We refer this hydrophobicity scale as the mid‐GeTFEP scale. The mid‐GeTFEP scale correlates well with the experimentally measured hydrophobicity scales, having Pearson correlation coefficients r = .83 with the WW‐scale, and r = .92 with the Bio‐scale. It also correlates well with the computational βMP OmpLA scale,11, 25 with r = .90 (Figure S3).

When compared with the experimentally measured MF‐scale of the βMP OmpLA mid‐GeTFEP has a correlation of r = .87. One noticeable difference between mid‐GeTFEP and the MF‐scale is that the TFE value of His is less unfavorable in mid‐GeTFEP (Figure 3a). This is expected since the MF‐scale was measured in acidic condition at pH = 3.8, where His was fully protonated.10 The different value in mid‐GeTFEP likely reflects the property of His in physiological conditions of the outer membrane.

Figure 3.

Figure 3

Comparison between the GeTFEP and the experimentally measured MF‐scale. (a) The mid‐GeTFEP scale agrees well with the MF‐scale, except Pro and His. (b) The depth‐dependent TFEs of Arg and Leu of GeTFEP also agree well with the experimental measurements10

Another notable difference is Pro. It is found that Pro is unfavorable in the membrane environment according to the mid‐GeTFEP scale, while it is found to be favorable according to the MF‐scale (Figure 3a). Pro tends to disrupt the structures of both α‐helix and β‐sheet, and is thermodynamically unfavorable in the non‐polar core of the membrane.26 The value of Pro in the GetFEP‐mid‐scale reflects the general situation.

We then examined the depth‐dependency of the GeTFEP of Arg and Leu, whose experimental results are available.10 Their TFEs at different depth positions of the membrane are in good agreement with the experimentally measured values, with r = .87 for Arg and r = .75 for Leu (Figure 3b), suggesting the GeTFEP captures the depth‐dependency of TFEs of amino acids.

2.2. Insertion of βMP into membrane

2.2.1. βMP insertion as a thermodynamically driven spontaneous process

Upon synthesis in the cytoplasm, βMPs need to be transported across the periplasm and then folded into the outer membrane. As there is no energy source such as ATP in the periplasm, it was suggested that the free energies of βMP folding provide an adequate source to ensure successful periplasm translocation.27 A computational study showed that the TFE of lipid‐facing residues of the hydrophobic core regions are indeed the main driving force for membrane insertion.11 Analysis also showed that lipid‐facing residues in the TM regions of βMPs have clear patterns of amino acid composition.28 However, it is still unclear whether the insertion of βMPs into the membrane is primarily due to the extensive property of the hydrophobicity of lipid‐facing residues, or if the specific pattern of amino acid composition also plays an important role.

To investigate this question, we employed a simplified βMP insertion model based on the concerted folding mechanism proposed in Ref. 29. We ignore the effects of non‐TM loops and discretize the insertion process into 17 steps (Figure 4a). We take the position recorded in the widely used Orientations of Proteins in Membranes (OPM) database30 as the fully inserted position of each βMP. This position is denoted as the reference position 0, and the other positions are indexed accordingly from −8 to +8. βMPs start the insertion process at position −8 from the periplasmic side and become fully inserted into the membrane at position 0. From position 0 to +8, βMPs would translocate across the membrane. We assume that the stability of the TM region of a βMP can be approximated by summarizing TFEs of all lipid‐facing residues in the membrane region. The stability of the βMP at each position was then calculated using the GeTFEP following this additive model. As an example, Figure 4b shows the stability of the protein OmpA (PDB ID: http://firstglance.jmol.org/fg.htm?mol=1bxw) at different insertion positions. Overall, results of all βMPs show a funnel‐like pattern of insertion energy (Figure 4c). Most βMPs (52 of 58) have minimum free energy when they are fully inserted into membranes (position 0, Tables S1 and S2). The funnel‐like pattern indicates that the insertion of βMPs into outer membranes is indeed a spontaneous process, though this process still needs to be catalyzed by the β‐barrel assembly machinery (BAM) complex.31 βMPs become energetically trapped after being fully inserted.

Figure 4.

Figure 4

Illustration of the membrane insertion process of βMPs. (a) The simplified βMP insertion model following Ref. 29. (b) The computed insertion energies of the OmpA protein at different depth positions. The contribution of different regions is color‐coded. (c) Illustration on how free energies change with the position of βMP in the membrane. The dashed red segments show that lipid‐facing residues in the extracellular head group region sometimes become energetically unfavorable

2.2.2. Importance of patterns of TM lipid‐facing residues in membrane insertion

We then examine if the funnel‐like insertion energy pattern arises from the extensive property of the TFEs of the hydrophobic residues alone. We considered only the 52 βMPs whose minimum free energies are at the fully inserted position. We first shuffled the sequences of the β‐strands within the TM segment of each βMP. While the side‐chain direction as well as the interstrand hydrogen bond pairing at each residue position in β‐strands is maintained, all TM residues are permuted. Each βMP is shuffled 2,000 times. We found that it is highly unfavorable to insert the shuffled βMPs into the membrane. This is expected, since the shuffling changes hydrophobicity of TM segments βMPs. Before the shuffling, the ionizable/polar residues were enriched among lumen‐facing residues of βMPs, while lipid‐facing residues were mostly apolar. After the shuffling, they were much more evenly distributed.

We then investigate how insertion energy is affected if only the lipid‐facing residues are shuffled. While the insertion of the shuffled βMPs remains energetically favorable (see Figure S4 for an example), shuffled βMPs are less stable compared to the original βMPs at the fully inserted position for 50 out of 52 βMPs: The insertion energy for the shuffled βMPs is on average 6.36 kcal/mol higher (Table S1). In addition, the fully inserted position (position 0) is no longer the most stable position for 17.4% of the shuffled βMPs (Table S1). These results indicate that the locational patterns of lipid‐facing residues32 in the TM region are arranged for βMPs to gain stability in the membrane environment.

2.2.3. Roles of residues in different TM regions during membrane insertion

The TM segment of a βMP can be divided into three regions, namely, the periplasmic headgroup region, the hydrophobic core region, and the extracellular headgroup region.28 We investigate how these regions contribute to the insertion energy of the βMP. We found that residues in the same regions across all 52 βMPs shared similar patterns in their insertion free energy profile (Figure 4c), indicating that they play similar roles in the insertion process. Among these, lipid‐facing residues of the extracellular headgroup region facilitate the initialization of the insertion process, as they are energetically favorable in the interfacial region on the periplasmic side (position −8 and −7). As insertion proceeds, these residues become less favorable and occasionally unfavorable when they become more embedded in the membrane. At this time, lipid‐facing residues of the hydrophobic core region start to be inserted in the membrane, and strongly drive the insertion process (position −6 to −2). When lipid‐facing residues of the extracellular headgroup region approach the interfacial region of the extracellular side, they become energetically favorable again. At the same time, lipid‐facing residues of the periplasmic headgroup region become inserted (position −1 and 0), and the TFE of the whole βMP reaches its minimum at position 0.

Although lipid‐facing residues of the hydrophobic core region are known to provide the main driving force for membrane insertion of βMPs,11 we found that the TFEs of hydrophobic core region do not reach their minimum when βMPs are fully inserted at position 0 for all 52 βMPs. Upon incorporation of contributions from other regions, the overall TFEs of the whole βMPs indeed reach the minimum at the fully inserted position. The “W” shape of the free energy curves of the two head group regions (the red and green curve in Figure 4c) suggests that lipid‐facing residues in these regions act like “energetic latches” to lock βMPs into their fully inserted position.

2.2.4. Prediction of βMP positioning and orientation in the membrane

GeTFEP can be used to predict the positioning and orientation of βMPs in the membrane, similar to previous studies.19, 20 Here, the membrane is idealized as an infinite slab with a thickness of h. Each βMP is initially positioned in the membrane with its center of mass of the barrel domain at the midplane of the membrane and its barrel axis aligned with the normal direction (z‐axis) of the membrane (Figure 5a). The protein can be rotated around the x‐ and y‐axes with angles θ x and θ y, respectively. The two rotation angles together determine the tilt angle of the protein. The protein can also be translated with a displacement d z. This displacement and the membrane thickness determine the TM segment of the protein. When embedded in the membrane, the lipid‐facing residues of the TM region and the loop residues are used to calculate the total energy of the βMP using the GeTFEP. As an example, Figure 5b shows how rotation angles θ x and θ y affect the stability of the protein BtuB (PDB ID: http://firstglance.jmol.org/fg.htm?mol=1nqe) when the displacement d z and the membrane thickness h are fixed.

Figure 5.

Figure 5

Prediction of positioning and orientation of βMPs. (a) The positioning and orientation of the βMP inside the membrane are determined by the rotation angles θ x and θ y, the translation displacement d z, and the membrane thickness h. (b) The funnel‐like landscape of the stability of the BtuB protein. It shows how rotation angles affect the stability of BtuB when d z and h are fixed

We systematically examine the parameter combination of θ x, θ y, d z, and h. A βMP is predicted to take the position and the orientation when the lowest free energy is reached. The predicted protein tilt angles of all 58 βMPs correlate well (r = .76) with OPM records.30 The average protein tilt angle of 7.3° is consistent with that of 6.2 ± 1.8° recorded in the OPM. The strand tilt angles and the membrane thickness predicted are again in good agreement with experimentally determined results (Table 1).

Table 1.

Comparison between the predicted positioning and orientation and experimental results33 of βMPs

Protein PDB ID Experiment GeTFEP OPM
TM tilt FhuA http://firstglance.jmol.org/fg.htm?mol=2fcp 46.0a 38.2 38.3
(°) OmpA http://firstglance.jmol.org/fg.htm?mol=1bxw 44.5a 40.2 38.7
Membrane FhuA http://firstglance.jmol.org/fg.htm?mol=1fep ≥23.1 23.5 24.3
Thickness OmpF http://firstglance.jmol.org/fg.htm?mol=2omf ∼21.0 22.8 25.2
(Å) BtuB http://firstglance.jmol.org/fg.htm?mol=1nqe ≥20.2 23.0 23.4
a

The experimentally measured tilt angles are the upper bounds of the actual values.33

2.3. Prediction of structurally and functionally important sites of βMPs

While overall the computed TFEs of lipid‐facing residues of βMPs follow the general pattern of the GeTFEP, the TFE values of a specific residue in a particular βMP can deviate significantly from values in the general profile (see Supporting Information for details). Among all 3,500 lipid‐facing residues in the TM segments of all 58 βMPs, we find that 305 or 8.7% of the residues have TFE values that deviate significantly from the GeTFEP. Since lipid‐facing residues are overall the major contributors to the stability of βMPs as discussed above, the deviation from the general profile indicates that the residue is likely to have important roles other than providing stability. To understand the origin of these deviations, we examined three proteins in detail, namely, OmpLA, PagP, and PagL, which have sufficient experimental information. We found that most deviant residues either have functional roles or have local structures quite different from residues in the canonical model of beta barrels (Table 2).

Table 2.

Predicted important sites in OmpLA, PagP, and PagL by deviation analysis

Protein Residue Notes
OmpLA (http://firstglance.jmol.org/fg.htm?mol=1qd6) 38N
40L Substrate binding34
92Y Substrate binding34
116P Interstand neighbor of 92Y and 142H
120L
142H Catalytic site35, 36
156N Catalytic site35, 36
237L
PagP (http://firstglance.jmol.org/fg.htm?mol=1thq) 27I Lateral route from membrane to protein interior37
69L Interact with the out clamp α‐helix37
125L Lateral route from membrane to protein interior37
131L
PagL (http://firstglance.jmol.org/fg.htm?mol=2erv) 108I Ligand binding site38
126H Catalytic site39

Among the deviant residues in OmpLA, 142H and 156N are both in the catalytic triad35, 36 that are essential for its phospholipase activities; 40L and 92Y are the sites where substrates bind.34 Furthermore, the deviant residue 116P interacts with 92Y and 142H through hydrogen bonds. Among the deviant residues in PagP, 69L interacts with the out‐clamp α‐helix of PagP37; 27I and 125L are both at the lateral routes where β‐hydrogen bonding is absent (Figure 6), which ensure that substrates can access the protein interior so that PagP can carry out its enzymatic functions.40 In PagL, the deviant residue 108I is in the ligand‐binding site,38 and 126H is part of the catalytic triad of its enzymatic site.39

Figure 6.

Figure 6

Predicted important sites of PagP. Residues 27I and 125L are at the sites where the hydrogen bonds between the β‐strands are disrupted. 69L has interaction with the out‐clamp α‐helix of PagP

As the calculation of TFEs does not require knowledge of 3D structures of βMPs, our results suggest that deviation analysis can help to discover functional sites and/or structurally anomalous sites using sequence information only. While our analysis is restricted to three proteins due to the limited nature of experimental data, we believe that overall deviant residues play special roles in either performing biological function or in maintaining the unique structural form of βMPs.

2.4. GeTFEP can predict TM region of αMPs

Although the MF‐scale was measured in the βMP system, it was suggested that the scale is also applicable to TM region of the αMPs, since the MF‐scale has a strong correlation with the nonpolar solvent accessible surface areas of the residues.10 We hypothesize that the GeTFEP may also reflect fundamental thermodynamic properties of transferring side‐chains of amino acids to the membrane environment, regardless of whether the residue is in a β‐barrel or an αMP. We carried out the standard hydropathy analysis41 using the Membrane Protein Explorer (MPEx) program42 on 131 αMPs obtained from the MPTopo database.43 Since MPEx uses depth‐independent hydrophobicity scales, we used the mid‐GeTFEP scale for our calculation.

The results show that this simple analysis using the mid‐GeTFEP scale correctly predicts both the TM regions and the numbers of the TM segments for 90 or ∼69% of the 131 αMPs in the data set. This compares favorably to other hydrophobicity scales, including those measured or derived from αMPs (Table 3). For most of the remaining 41 proteins, GeTFEP correctly predicted the TM regions but predicted the numbers of the TM segments incorrectly due to the ambiguity in assignment of whether two consecutive TM segments should be considered as one TM segment (see Figure S5B for an example). Examination of the number of TM residues correctly predicted by the mid‐GeTFEP scale shows that we achieve a precision of ∼85% and a recall of ∼71%, which compares favorably to other hydrophobicity scales (Table 3). These results suggest that the GeTFEP can be used to study the general stability of both α‐helical and βMPs.

Table 3.

Prediction of TM segments and residues αMPs

Hydrophobicity scale αMPs % (#) with TM segs. correctly predicted TM res. precision (%) TM res. recall (%) TM res. F‐measure
WW‐scale 50% (66) 73 75 0.74
Bio‐scale 22% (29) 95 21 0.34
E Z α 49% (64) 71 77 0.74
MF‐scale 48% (63) 77 65 0.70
Pro‐swapped 49% (64) 72 74 0.71
Mid‐GeTFEP 69% (90) 85 71 0.78

The mid‐GeTFEP scale performs better than the other hydrophobicity scales. The first three scales are measured or derived in α‐helical systems, the others in βMPs.

2.4.1. The validity of TFE value of pro in the GeTFEP

We further examine the TFE value of Pro in the mid‐GeTFEP scale, which is qualitatively different from that in the MF‐scale. We swapped the value of Pro from MF‐scale into the mid‐GeTFEP scale, and used this Pro‐swapped scale in the hydropathy analysis. This is reasonable as the mid‐GeTFEP scale is strongly correlated with the MF‐scale and has comparable values. However, we found that the precision of predicting TM residues deteriorates significantly from 85% using the mid‐GeTFEP scale to 72% using the Pro‐swapped scale (Table 3). This result suggests that Pro is more likely to be membrane unfavorable as characterized by the mid‐GeTFEP scale rather than membrane favorable as characterized by the MF‐scale.

Pro is known to disrupt secondary structures. In computationally derived hydrophobicity scale, Pro is always unfavorable inside the membrane environments.19, 20 In the experimentally measured scales such as Bio‐scale,9 and two independent scales measured in the βMP PagP,44, 45 Pro is unfavorable as well. The value of Pro in GeTFEP agrees with these observations.

In MF‐scale, Pro did appear favorable in the membrane. This discrepancy indeed implies the existence of structural degrees of freedom, which is currently unknown. In Ref. 45, it is suggested that whether Pro is favorable is affected by the interplay between the local microenvironment and the torsion strain caused by Pro. However, more experimental data are required to pin down this issue.

3. CONCLUSIONS AND DISCUSSION

In this study, we derived the GeTFEP from a non‐redundant set of 58 βMPs. We showed that the GeTFEP agrees well with previous experimentally measured and computationally derived TFEs. The GeTFEP reveals fundamental thermodynamic properties of amino acid residues inside the membrane environment, and it is useful in the analysis of stability and function of membrane proteins.5

As the lipid membrane bilayer is anisotropic along the bilayer normal,46 a residue at different depth of the membrane will have different interactions with lipid molecules in the environment, resulting in the depth‐dependency of TFEs. A comparison between the GeTFEP and the experimentally measured values of Arg and Leu10 shows that the GeTFEP captures this depth‐dependency well.

In addition, the GeTFEP exhibits asymmetric values between TFEs of residues in the membrane inner leaflet (depth − 4 to 0) and in the outer leaflet (depth 0 to +4, Figure 2c). Most βMPs in our dataset reside in the bacterial outer membrane, whose outer leaflet contains additional complex lipolysaccharides in contrast to its inner leaflet of phospholipids. This asymmetry in membrane composition results in the asymmetry of the TFEs in the GeTFEP. To understand membrane proteins in an environment of symmetric membrane leaflets, we also derived a symmetric TFE profile, named sym‐GeTFEP, by mirroring the TFE values of the inner leaflet side of the GeTFEP (Figure S6). In this study, the sym‐GeTFEP was used to analyze the non‐outer‐membrane βMPs, for example, α‐ and γ‐hemolysins and vibrio cholerae cytolysin.

We explored the energetic contribution of different regions of βMPs during the membrane insertion process. Our analysis showed that the stability of βMPs does not come alone from the extensive property of the hydrophobicity of lipid‐facing residues in the TM segment. Rather, the pattern of the amino acid residues in the TM segment also plays significant roles. Results from the analysis of sequence shuffling show that the patterns and location of amino acid residues are arranged to stabilize βMPs in the membrane environment. Using the GeTFEP, we are also able to predict membrane positioning and orientations of βMPs.

The GeTFEP can also be used to detect structurally or functionally important residues in βMPs. This can be achieved by examination of residues whose TFEs deviate significantly from the GeTFEP. As calculation of TFEs of residues of a specific βMP only requires rough estimation of relative positions between adjacent β‐strands, which can be reliably predicted from the protein sequence,47, 48 computing the TFE deviation, therefore, requires only sequence information. The GeTFEP‐deviation analysis can aid in the discovery of functional sites or structurally important sites in novel βMPs, without requiring knowledge of their 3D structures. In addition, GeTFEP‐based analysis can aid in the design and engineering of novel βMPs.

Furthermore, we demonstrated that GeTFEP can be used to predict TM residues of αMPs. Results showed that GeTFEP performs better than the hydrophobicity scales measured/calculated in αMP systems. To further understand why GeTFEP also works well with αMPs, we systematically calculate the TFEs using a model with interstrand energy terms removed using the same βMP set. We derived a new energy profile with the calculated TFEs, which correlates well with GeTFEP (r = .97, MSE = 0.36). For the task of predicting αMPs via hydropathy analysis, this new profile gives the result of 71% precision and 41% recall in identifying TM residues.

Although the prediction results are reasonable, there is non‐negligible deterioration in performance compared to predictions obtained using GeTFEP (85% precision and 71% recall, Table 3). This suggests that pairwise interactions among TM segments are important factors in membrane protein stability. Clearly, the nature of pairwise interaction among TM segments in α‐helical and in β‐barrel is quite different. Nonetheless, consideration of interstrand interactions of βMPs likely can provide some useful corrections by accounting for the protein–environment interactions of αMPs to some degree. This is reasonable, as the finer details might be less consequential when characterizing these interactions using a single energy value. More sophisticated models and detailed analysis will be required to understand to what extent pairwise interactions in βMPs mimic interactions in light of TFE in the different TM context of αMPs.

Supporting information

Appendix S1: Supporting information

ACKNOWLEDGMENT

This work was supported by NIH R01GM079804, R01CA204962‐01A1, R21AI126308, and R35GM127084.

Tian W, Naveed H, Lin M, Liang J. GeTFEP: A general transfer free energy profile of transmembrane proteins. Protein Science. 2020;29:469–479. 10.1002/pro.3763

Funding information National Institutes of Health, Grant/Award Numbers: R01CA204962‐01A1, R01GM079804, R21AI126308, R35GM127084

REFERENCES

  • 1. Fairman JW, Noinaj N, Buchanan SK. The structural biology of β‐barrel membrane proteins: A summary of recent reports. Curr Opin Struct Biol. 2011;21(4):523–531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Liang J. Experimental and computational studies of determinants of membrane‐protein folding. Curr Opin Chem Biol. 2002;6(6):878–884. [DOI] [PubMed] [Google Scholar]
  • 3. Naveed H, Jackups R, Liang J. Predicting weakly stable regions, oligomerization state, and protein‐protein interfaces in transmembrane domains of outer membrane proteins. Proc Natl Acad Sci U S A. 2009;106(31):12735–12740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Liang J, Naveed H, Jimenez‐Morales D, Adamian L, Lin M. Computational studies of membrane proteins: Models and predictions for biological understanding. Biochim Biophys Acta Biomembr. 2012;1818(4):927–941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Yang Y, Guo R, Gaffney K, et al. Folding‐degradation relationship of a membrane protein mediated by the universally conserved ATP‐dependent protease FtsH. J Am Chem Soc. 2018;140(13):4656–4665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Tanford C. The hydrophobic effect and the organization of living matter. Science. 1978;200(4345):1012–1018. [DOI] [PubMed] [Google Scholar]
  • 7. White SH, Wimley WC. Membrane protein folding and stability: Physical principles. Annu Rev Biophys Biomol Struct. 1999;28:319–365. [DOI] [PubMed] [Google Scholar]
  • 8. Wimley WC, White SH. Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat Struct Biol. 1996;3:842–848. [DOI] [PubMed] [Google Scholar]
  • 9. Hessa T, Kim H, Bihlmaier K, et al. Recognition of transmembrane helices by the endoplasmic reticulum translocon. Nature. 2005;433(7024):377–381. [DOI] [PubMed] [Google Scholar]
  • 10. Preston Moon C, Fleming KG. Side‐chain hydrophobicity scale derived from transmembrane protein folding into lipid bilayers. Proc Natl Acad Sci U S A. 2011;108(25):10174–10177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Lin M, Gessmann D, Naveed H, Liang J. Outer membrane protein folding and topology from a computational transfer free energy scale. J Am Chem Soc. 2016;138(8):2592–2601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Hessa T, Meindl‐Beinker NM, Bernsel A, et al. Molecular code for transmembrane‐helix recognition by the Sec61 translocon. Nature. 2007;450:1026–1030. [DOI] [PubMed] [Google Scholar]
  • 13. MacCallum JL, Tieleman DP. Hydrophobicity scales: A thermodynamic looking glass into lipid‐protein interactions. Trends Biochem Sci. 2011;36(12):653–662. [DOI] [PubMed] [Google Scholar]
  • 14. Deber CM, Wang C, Liu LP, et al. TM finder: A prediction program for transmembrane protein segments using a combination of hydrophobicity and nonpolar phase helicity scales. Protein Sci. 2001;10(1):212–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Ulmschneider MB, Ulmschneider JP, Schiller N, Wallace BA, von Heijne G, White SH. Spontaneous transmembrane helix insertion thermodynamically mimics translocon‐guided insertion. Nat Commun. 2014;5(4863):4863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Moon CP, Kwon S, Fleming KG. Overcoming hysteresis to attain reversible equilibrium folding for outer membrane phospholipase A in phospholipid bilayers. J Mol Biol. 2011;413(2):484–494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Otzen DE, Andersen KK. Folding of outer membrane proteins. Arch Biochem Biophys. 2013;531(1–2):34–43. [DOI] [PubMed] [Google Scholar]
  • 18. Adamian L, Liang J. Prediction of buried helices in multispan alpha helical membrane proteins. Proteins. 2006;63(1):1–5. [DOI] [PubMed] [Google Scholar]
  • 19. Senes A, Chadi DC, Law PB, Walters RFS, Nanda V, DeGrado WF. Ez, a depth‐dependent potential for assessing the energies of insertion of amino acid side‐chains into membranes: Derivation and applications to determining the orientation of transmembrane and interfacial helices. J Mol Biol. 2007;366(2):436–448. [DOI] [PubMed] [Google Scholar]
  • 20. Hsieh D, Davis A, Nanda V. A knowledge‐based potential highlights unique features of membrane α‐helical and β‐barrel protein insertion and folding. Protein Sci. 2012;21(1):50–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Moore DT, Berger BW, DeGrado WF. Protein‐protein interactions in the membrane: Sequence, structural, and biological motifs. Structure. 2008;16(7):991–1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Hedin LE, Öjemalm K, Bernsel A, et al. Membrane insertion of marginally hydrophobic transmembrane helices depends on sequence context. J Mol Biol. 2010;396(1):221–229. [DOI] [PubMed] [Google Scholar]
  • 23. MacCallum JL, Bennett WFD, Tieleman DP. Partitioning of amino acid side chains into lipid bilayers: Results from computer simulations and comparison to experiment. J Gen Physiol. 2007;129(5):371–377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Gumbart J, Roux B. Determination of membrane‐insertion free energies by molecular dynamics simulations. Biophys J. 2012;102(4):795–801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Tian W, Lin M, Naveed H, Liang J. Efficient computation of transfer free energies of amino acids in beta‐barrel membrane proteins. Bioinformatics. 2017;33(11):1664–1671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Senes A, Engel DE, DeGrado WF. Folding of helical membrane proteins: The role of polar, GxxxG‐like and proline motifs. Curr Opin Struct Biol. 2004;14(4):465–479. [DOI] [PubMed] [Google Scholar]
  • 27. Preston Moon C, Zaccai NR, Fleming PJ, Gessmann D, Fleming KG. Membrane protein thermodynamic stability may serve as the energy sink for sorting in the periplasm. Proc Natl Acad Sci U S A. 2013;110(11):4285–4290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Jackups R, Liang J. Interstrand pairing patterns in beta‐barrel membrane proteins: The positive‐outside rule, aromatic rescue, and strand registration prediction. J Mol Biol. 2005;354(4):979–993. [DOI] [PubMed] [Google Scholar]
  • 29. Kleinschmidt JH, den Blaauwen T, Driessen AJ, Tamm LK. Outer membrane protein A of Escherichia coli inserts and folds into lipid bilayers by a concerted mechanism. Biochemistry. 1999;38(16):5006–5016. [DOI] [PubMed] [Google Scholar]
  • 30. Lomize MA, Lomize AL, Pogozheva ID, Mosberg HI. OPM: Orientations of proteins in membranes database. Bioinformatics. 2006;22(5):623–625. [DOI] [PubMed] [Google Scholar]
  • 31. Knowles TJ, Scott‐Tucker A, Overduin M, Henderson IR. Membrane protein architects: The role of the BAM complex in outer membrane protein assembly. Nat Rev Microbiol. 2009;8(3):206–214. [DOI] [PubMed] [Google Scholar]
  • 32. Jackups R, Cheng S, Liang J. Sequence motifs and antimotifs in beta‐barrel membrane proteins from a genome‐wide analysis: The Ala‐Tyr dichotomy and chaperone binding motifs. J Mol Biol. 2006;363(2):611–623. [DOI] [PubMed] [Google Scholar]
  • 33. Lomize AL, Pogozheva ID, Lomize M a, Mosberg HI. Positioning of proteins in membranes: A computational approach. Protein Sci. 2006;15(6):1318–1333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Snijder HJ, Ubarretxena‐Belandia I, Blaauw M, et al. Structural evidence for dimerization‐regulated activation of an integral membrane phospholipase. Nature. 1999;401(6754):717–721. [DOI] [PubMed] [Google Scholar]
  • 35. Dekker N. Outer‐membrane phospholipase A: Known structure, unknown biological function. Mol Microbiol. 2000;35(4):711–717. [DOI] [PubMed] [Google Scholar]
  • 36. Kingma RL, Fragiathaki M, Snijder HJ, et al. Unusual catalytic triad of Escherichia coli outer membrane phospholipase A. Biochemistry. 2000;38(4):10017–10022. [DOI] [PubMed] [Google Scholar]
  • 37. Ahn VE, Lo EI, Engel CK, et al. Crystal Structure of Outer Membrane Enzyme PagP. Nat Struct Biol. 2003;1:980 10.2210/pdb1thq/pdb. [DOI] [Google Scholar]
  • 38. Rutten L, Geurtsen J, Lambert W, et al. Crystal structure of the outer membrane enzyme PagL. Nat Struct Biol. 2003;10:980 10.2210/pdb2erv/pdb.14634627 [DOI] [Google Scholar]
  • 39. Rutten L, Geurtsen J, Lambert W, et al. Crystal structure and catalytic mechanism of the LPS 3‐O‐deacylase PagL from Pseudomonas aeruginosa . Proc Natl Acad Sci U S A. 2006;103(18):7071–7076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Bishop RE. Structural biology of membrane‐intrinsic β‐barrel enzymes: Sentinels of the bacterial outer membrane. Biochim Biophys Acta Biomembr. 2008;1778(9):1881–1896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982;157(1):105–132. [DOI] [PubMed] [Google Scholar]
  • 42. Snider C, Jayasinghe S, Hristova K, White SH. MPEx: A tool for exploring membrane proteins. Protein Sci. 2009;18(12):2624–2628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Jayasinghe S, Hristova K, White SH. MPtopo: A database of membrane protein topology. Protein Sci. 2001;10(2):455–458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Iyer BR, Zadafiya P, Vetal PV, Mahalakshmi R. Energetics of side‐chain partitioning of β‐signal residues in unassisted folding of a transmembrane β‐barrel protein. J Biol Chem. 2017;292:12351–12365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Marx DC, Fleming KG. Influence of protein scaffold on side‐chain transfer free energies. Biophys J. 2017;113:597–604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Pogozheva ID, Mosberg HI, Lomize AL. Life at the border: Adaptation of proteins to anisotropic membrane environment. Protein Sci. 2014;23(9):1165–1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Naveed H, Xu Y, Jackups R, Liang J. Predicting three‐dimensional structures of transmembrane domains of β‐barrel membrane proteins. J Am Chem Soc. 2012;134:1775–1781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Tian W, Lin M, Tang K, Liang J, Naveed H. High‐resolution structure prediction of β–barrel membrane proteins. Proc Natl Acad Sci. 2018;115(7):1511–1516. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix S1: Supporting information


Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES