Skip to main content
PLOS One logoLink to PLOS One
. 2019 May 10;14(5):e0216142. doi: 10.1371/journal.pone.0216142

Different modes of barrel opening suggest a complex pathway of ligand binding in human gastrotropin

Zita Harmat 1, András L Szabó 1, Orsolya Tőke 2, Zoltán Gáspári 1,*
Editor: Pratul K Agarwal3
PMCID: PMC6510414  PMID: 31075121

Abstract

Gastrotropin, the intracellular carrier of bile salts in the small intestine, binds two ligand molecules simultaneously in its internal cavity. The molecular rearrangements required for ligand entry are not yet fully clear. To improve our understanding of the binding process we combined molecular dynamics simulations with previously published structural and dynamic NMR parameters. The resulting ensembles reveal two distinct modes of barrel opening with one corresponding to the transition between the apo and holo states, whereas the other affecting different protein regions in both ligation states. Comparison of the calculated structures with NMR-derived parameters reporting on slow conformational exchange processes suggests that the protein undergoes partial unfolding along a path related to the second mode of the identified barrel opening motion.

Introduction

Gastrotropin (also known as ileal bile acid-binding protein (I-BABP) or fatty acid-binding protein 6 (FABP6)) [1] is involved in the enterohepatic circulation of bile salts. Being synthetsized in the liver from cholesterol, bile salts are secreted into the proximal small intestine via the gall bladder and then efficiently recycled blood via the hepatic portal circulation [23]. This recycling process ensures that only a small amount of bile salts needs to be synthesised de novo. Gastrotropin is thought to play a role in this recyclinc process via binding interactions occurring within the absorptive epithelial cells of the distal ileum [45] and has an important role in cholesterol homeostasis [1,6].

Gastrotropin belongs to the family of intracellular lipid-binding proteins (iLBPs), a group of small, approximately 15-kDa proteins that bind fatty acids, retinoids, cholesterol, and bile salts [7]. Additionally, iLBPs have been shown to have a role in the stimulation of the transcriptional activity of nuclear hormone receptors [810]. Among the four main groups of the iLBP family, the subfamily of gastrotropin is unique in the sense that it has the capability of binding two [1112] or possibly even three [1314] ligands simultaneously. NMR solution structure of the apo form of human gastrotropin (PDB ID: 1O1U) was determined along with the cholyltaurine bound form (PDB ID: 1O1V) [15]. More recently, the structure of the heterotypic doubly-ligated complex of human gastrotropin with glycocholate and glycochenodeoxycholate has been determined [16]. Similarly to other members of the iLBP family, the structure of human I-BABP is composed of a β-barrel formed by ten antiparallel β-strands (A-J) and two α-helices (I-II). The binding cavity of ~1000 Å3 is located inside of the β-barrel [17]. Ligand binding in human gastrotropin exhibits positive cooperativity [11], which has been shown to be governed by the hydroxylation pattern of the bound bile salts [18]. Accordingly, hydrogen bonding networks have been shown to have a key mediatory role in positive binding cooperativity [19]. Besides the observed communication between the two binding sites, di- and trihydroxy bile salts display a site preference upon binding in each other’s presence [20]. As it is apparent from the comparison of apo and holo human gastrotropin structures (Fig 1), bile salt binding is accompanied by large conformational changes in the E-F and G-H protein regions as well as in the C/D-turn and the proximate helical cap [4,13]. Importantly, NMR relaxation measurements suggest that in the apo form, the ground state is in slow exchange with a low-populated ‘invisible’ conformer resembling some structural features of the the ligand-bound form [21]. Intriguingly, residues undergoing a conformational fluctuation on the μs-ms time scale can be grouped into a ‘slower’ and a ‘faster’ cluster, which appear to be spatially separated. Specifically, while the ‘slower’ cluster involves part of the helical region, the C/D-turn, and the proximate B and D β-strands in the N-terminal half, the ‘faster’ cluster comprises segments of the EFGH protein region in the C-terminal half [21].

Fig 1. Ribbon representation highlighting the differences between the apo (PDB ID: 1O1U model 7) and the holo (PDB ID: 2MM3 model 1) form of human gastrotropin.

Fig 1

The figure was prepared with UCSF Chimera [49].

As the binding site of gastrotropin is located in the interior of the protein, the mechanism of ligand entry is an important issue to be investigated. The most widely accepted scenario for the protein family is formulated in the ‘portal hypothesis’, stating that access of ligands to the protein interior is governed by the C/D and E/F-turn regions together with the C-terminal part of helix II [2224]. Based on NMR structural and dynamic studies, a conformational selection mechanism of ligand binding involving an equilibrium between a closed and a more open protein state has been suggested for both the human ileal [16] and the chicken liver BABP analogues [25]. In line with the NMR spectroscopic analysis of internal motions, molecular dynamics simulations show evidence of correlated motions in human gastrotropin and in the absence of ligands indicate a partial unfolding of the E-F protein region [26].

To improve our understanding of the mediatory role of internal motions in human gastrotropin-bile salt interaction, we generated conformational ensembles consistent with experimentally obtained NMR structural and dynamic data published earlier [21] using a methodology described earlier [27] and performed ligand docking to obtain an atomic-level insight into the binding mechanism. Our results reveal different conformational rearrangements in the protein that are suggested to correspond to motions characteristic of different time scales indicating a complex mechanism of bile salt entry.

Materials and methods

Ensemble molecular dynamics simulations with NMR restraints

Molecular dynamics calculations were performed using GROMACS version 4.5.5. [2829] modified to handle S2 order parameters as well as pairwise averaging of NOE distance restraints over replicas [30], as proposed for the MUMO (Minimal Under-restraining Minimal Over-restraining) approach [31]. The OPLS-AA force field [32] and the TIP3P water model [33] was used for all molecular dynamics simulations described below.

For modeling the apo structure of gastrotropin, we chose model 7 of PDB entry 1O1U [15] based on its highest PRIDE-NMR score [34] among the deposited models. As an initial model of the holo structure we used model 1 of the PDB entry 2MM3. Ligand topologies for glycocholic acid (GCA, PDB ligand ID: GCH) and glycochenodeoxycholic acid (GCDA, PDB ligand ID: CHO) were generated with the TopolGen script and corrected manually for atom types where necessary as well as with an in-house Perl script to reassign hydrogen atoms to the charge groups defined by the heavy atoms they are connected to.

NOE restraints were only available for the holo protein (PDB ID: 2MM3). For the apo form, we used restraints from the 2MM3 list that were unviolated in the deposited 1O1U structure as checked with the CoNSEnsX server. Restraints were modified by the removal of stereospecificity and rounding the restrained distance up to the next integer Å, creating 1 Å wide ‘bins’ from 4 to 10 Å.

Chemical shifts for the apo structure were obtained from BMRB (BMRB ID: 19843) and for the holo structure directly from the authors. S2 values for the apo and holo structures measured at 283, 291, 298, and 313 K were taken from [21].

After generating a topology using the OPLS-AA force field and TIP3P water model, the molecule was put into a cubic box, followed by energy minimization with conjugate gradient method for 5000 number of steps with 0.001 ps step length. The maximum force was set to 200. In the next step, the molecule was solvated and then one of the water molecules was replaced by a Na+ ion to ensure the neutrality of the system. After that, another energy minimization was performed using the same parameters, but including the water molecules. In the last step, a short MD simulation was performed using position restraints of 1000 kJ mol-1 nm-2 on the heavy atoms of the protein for 2500 steps with 0.002 ps step size using the LINCS algorithm [35].

For the production runs, eight replicas were simulated in parallel with the OpenMPI environment [36]. Backbone S2 order parameter restraints were applied on the full ensemble and NOE distance restraints were averaged between neighboring replicas, similar to the MUMO (Minimal Under-restraining, Minimal Over-restraining) protocol [31]. The simulations were performed at four temperatures: 283 K, 291 K, 298 K, and 313 K using S2 restraints measured at the corresponding temperatures. With LINCS constraining on bond lengths, a timestep of 2 fs was used to generate runs of 2 ns and 6 ns, totaling 16 and 48 ns for the 8 replicas combined, respectively. Control simulations with the same parametrization but without restraints were also performed. Topology files for the restrained simulations are included in the supplementary material as (S1 File).

In order to generate a larger pool of possible conformations in order to further explore the conformational space, molecular dynamics simulations with only one type of restraint, NOE or S2, or without any restraints were also performed. Accelerated Molecular Dynamics and short (500 ps) Targeted Molecular Dynamics simulations were also performed on the apo structure using the chemical shifts of the holo structure and vica versa in order to achieve transition from one form to the other.

Docking simulations

Docking calculations were performed on selected structures with GCA and GCDA using Schrödinger Glide [37]. The binding sites were defined using the ternary complex structure 2MM3. After importing the structure, it was split to separate molecules. As a next step, either GCA or GCDA was merged with the protein and a mesh grid around the ligand was generated with the ‘Receptor Grid Generation Tool’ using default settings. Docking of the respective ligands was performed using the ‘Ligand docking’ tool with default settings except requiring the inclusion of per-residue interaction scores in the output. To dock the second ligand into the binary complex obtained, the docking result most similar to the pose in the initial 2MM3 structure was merged with the second ligand and used to define the second binding site with the grid generation tool. For each of the four different setups, i.e. GCA, GCDA, GCA+GCDA and GCDA+GCA docking runs, 32 different poses were generated and evaluated. Total energy of the docked complexes was estimated using the MacroModel routine with the OPLS3 force field and water as solvent.

Data analysis

Correspondence to the experimental parameters was analyzed using the CoNSEnsX webserver [34,38] (for S2 order parameters and chemical shift data) as well as in-house Perl scripts (for NOE distance restraints). NOE restraints were evaluated on a per-ensemble basis using r-6 averaging both for intramolecular ambiguity and between members of the ensembles.

For the S2 order parameter correspondences, MUMO simulations and the original PDB ensembles, the corrected S2 values are also displayed. In the correction, those points were excluded from the analysis, which had greater than 0.2 as an absolute value of the difference between the experimental and back-calculated values. For the MUMO simulations, maximum 5 such values were found. All the experimental and back-calculated values are depicted in S1 Fig.

Principal Component Analysis was performed using ProDy [39] and visualized with the NMWIZ module of the program VMD [40]. Structure-based chemical shift calculations were performed with the program SHIFTX2 [41].

The presence or absence of hydrogen bonds in the ensembles was investigated with an in-house Perl program using distance-angle based hydrogen bond identification parameters [4243].

Comparing calculated 15N chemical shift differences with experimentally derived |Δɷ| (15N) values

For each structure, backbone 15N chemical shifts were estimated with Shiftx [41]. For each conformation in the large conformer pool (see above), the absolute value of the difference of the predicted chemical shifts relative to those in each calculated unliganded structure in the MUMO ensembles was calculated. These differences were then compared to experimental |Δɷ| (15N) data derived from CPMG relaxation dispersion NMR measurements for each residue for which it was available [21]. Both correlation and RMSD measures were calculated after normalization to the 0–1 range. As there are |Δɷ| values available for three temperatures and the conformational pool is of a heterogeneous source with no well-defined temperature, the correlation and RMSD values were calculated for all three temperatures and then were averaged for each structure investigated. The structures with highest correlation and lowest RMSD values were selected for analysis.

Results and discussion

The generated ensembles reflect experimental parameters

According to the expectations, experimental S2 parameters are generally better reflected in the MUMO generated ensembles than in the PDB ensembles or the unrestrained ensembles (Table 1). Interestingly, the MUMO ensemble of the apo protein calculated with the S2 parameters of 283 K and the MUMO ensemble of the ternary complex calculated with the S2 parameters of 291 K corresponds only moderately to these data, while all other restrained ensembles show good correspondence. The reason for this is the presence of some extremely low (< 0.3) experimental S2 values, located mostly in turn regions, not reflected in the simulations. Plots for the experimental and the back-calculated S2 values are depicted in S1 Fig.

Table 1. RMSD values, S2 parameters, NOE correspondence, amide N and H chemical shift correlations.

Ensemble size Temperature (K) Backbone RMSD (Å) Backbone S2 correlation Corrected backbone S2 correlation Percentage of violated NOE restraints
(r-6, >0.5Å)
Chemical shift correlation
amide N amide H
1O1U pdb 10 305 0.66+-0 0.502a 0.857 0.00% 0.709 0.596
apo MUMO 168 283 1.01+-0.03 0.675 0.873 0.29% 0.832 0.72
apo MUMO 168 291 1.12+-0.04 0.8 0.841 0.29% 0.828 0.729
apo MUMO 168 298 1.27+-0.06 0.797 0.883 0.12% 0.834 0.738
apo MUMO 168 313 1.45+-0.04 0.963 0.972 - 0.834 0.705
apo unrestrained 48 283 1.46+-0.09 0.307 - - 0.827 0.724
apo unrestrained 48 291 1.45+-0.05 0.351 - - 0.832 0.703
apo unrestrained 48 298 1.53+-0.03 0.301 - - 0.833 0.731
apo unrestrained 48 313 1.5+-0.07 0.6 - - 0.829 0.733
2MM3 pdb 10 293 0.52+-0 0.297b 0.315 0.00% 0.773 0.593
holo MUMO 168 283 1.28+-0.03 0.726 0.756 0.41% 0.784 0.548
holo MUMO 168 291 1.31+-0.07 0.544 0.731 0.46% 0.78 0.589
holo MUMO 168 298 1.46+-0.15 0.884 0.930 0.36% 0.767 0.552
holo MUMO 168 313 1.5+-0.08 0.724 0.870 0.36% 0.758 0.551
holo unrestrained 48 283 1.92+-0.64 0.073 - - 0.782 0.531
holo unrestrained 48 291 2.34+-0.65 0.296 - - 0.777 0.537
holo unrestrained 48 298 1.82+-0.07 0.484 - - 0.772 0.493
holo unrestrained 48 313 2.59+-2.09 0.272 - - 0.775 0.544

a: S2 data of 313 K (highest correlation from the 4 datasets)

b: S2 data of 298 K (highest correlation from the 4 datasets)

The number of ensemble-calculated NOE violations are below 0.5 percent in each of the MUMO ensembles, despite the clearly higher global RMSD values of the MUMO ensembles than those for the original PDB ensembles. Correspondence to the amide N and H chemical shifts are in the same range for the MUMO and the original PDB ensembles.

Gastrotropin ensembles reveal two distinct modes of barrel opening

Principal component analysis (PCA) of the ensembles suggests the presence of two kinds of modes, both corresponding to the opening of the barrel structure, termed ‘Type I’ and Type II’ openings below. Type I opening clearly separates the apo and holo structures along PC1, corresponding to the opening of the barrel between strands F and G. Viewing the structure from the direction of the helices, this apo to holo structural change can be described as a clockwise rotation of the E/F- and G/H-turns accompanied by a lower amplitude counterclockwise rotation of the C/D-turn and helix-II, resulting in the appearance of a large aperture between the E/F- and G/H-turns at the ‘top’ of the barrel. PC2 or Type II opening, in contrast, primarily affects helix-I and the CD-turn, most prominently resulting in the widening of the interhelical gap and the appearance of an opening between strands D and E.

It is notable that the Type II opening motion occurs in both the apo and the holo structures. At higher temperatures, the ensembles occupy a larger region of the conformational space along this particular opening mode (S4 Fig). It should also be noted that this kind of opening is also compatible with the portal hypothesis as described in detail for human liver fatty acid binding protein [4445].

The structural changes can be described in more detail by measuring distances between selected amino acids. In S2A Fig the correlation of Cα distances with each of the two motional modes is plotted for each amino acid pair as a matrix. On the basis of the correlation of these Cα-Cα distances, the most mobile regions corresponding to Type I opening are near the termini and at the D/E-turn region including β strand E itself. Regarding Type II motion, the regions around amino acid 50 (β strand C) and 35 (linker between helix II and β strand B) appear to be the most flexible together moving segments.

Cα-Cα distances displaying the best correlation with Type I opening are between residues 47–69, 48–69, 60–69, 61–69, 62–69, 63–69 (corr. -0.96) as well as 66–70, 67–70 (corr. 0.96). Regarding Type II motion, the best correlated Cα distances are between residues 16–58, 17–58, and 18–58 (corr. -0.92). The listed Cα atom-atom distances are mapped on the structure in S2B and S2C Fig. Our results suggest that the increasing distance between β strands C and E or D and E are correlated with Type I opening, and the increasing distance of helix I from the top of the barrel (around the C/D.turn) correlates with Type II opening. Note that the residues displaying the largest displacement relative to the average structure do not necessarily coincide with the ones exhibiting the largest changes in interatomic distances.

While the experimentally determined apo structure (1O1U) is clustered with the dynamic ensembles for the apo state, the NMR structure of the ternary complex (2MM3) is located between the apo and holo ensembles. This indicates that our ensembles of the complex state exhibit a more pronounced opening along PC1 than the structure obtained by conventional NMR calculations. The phenomenon that dynamic ensembles, corresponding reasonably well to experimental data ‘magnify’ the differences between different states has been observed in previous works [46]. This magnification is likely a consequence of the ensemble-based treatment of NOE restraints allowing more conformational freedom than conventional structure calculations. It is also the consequence of the different balance between the force field and experimental restraints than in conventional structure calculation methods.

The apo and holo states exhibit characteristic differences in their hydrogen bond pattern as well (Fig 2B), (Table 2) and (S1 Table). As shown previously, hydrogen bonds form an extensive network in human I-BABP [19,21]. According to our calculations, the most significant differences in hydrogen bond occurence include the formation of one and breaking of two intrastrand hydrogen bonds upon transition from the apo to the holo state, consistent with a specific mode of barrel opening between strands E and F. Interestingly, hydrogen bonds with ligands (purple lines on Fig 2B) are present only in a few conformations, which may indicate a loose ligand binding as a result of dynamically changing hydrogen bonds. Notable are the hydrogen bonds of Thr73, where the γ1 OH group forms an intraresidue hydrogen bond in the holo state that is not present in the apo form. This particular residue in the E/F-turn has been suggested to have a key role in a conformational selection mechanism of ligand binding together with proximate residues in the EFGH region of human I-BABP [16].

Fig 2. Description of the Type I and Tye II motions.

Fig 2

(A) PCA (Principal Component Analysis) scatter plot of the simulated and experimentally determined conformer pool. (B) Hydrogen bonds with the largest changes between the apo and holo states according to the MUMO ensembles. Black numbers denote amino acid residues, black letters denote atoms, secondary structure elements are labeled with green letters. Red lines represent H-bonds characteristic of the apo form, blue lines represent those formed mainly in the holo form and purple lines indicate H-bonds between amino acids and the ligands. Note the central role of Thr73 in the hydrogen bond network. (C) Structural movements along PC1: barrel opening (D) Structural movements along PC2 (E) Square fluctuation of Cα atoms in the two PCA modes: PC1 (Type I motion, purple), PC2 (Type II motion, orange). The previously measured experimental kex values indicating two distinct clusters of residues involved in slow conformational exchange processes are depicted as different gray areas corresponding to the three different temperatures (283 K, 287 K, 291 K) of the measurements. As only about 30–40 amino acids have displayed ms timescale motion with measurable kex values [21], a continuous depiction is used to guide the eye to highlight the regional differences.

Table 2. List of the most significantly changing hydrogen bonds in the course of the MUMO simulations.

Bond (amino acid number and atom identifier) Protein state Donor region Acceptor region
64 N– 69 OG apo D-E E
64 N– 70 OD1 holo D-E E
71 N– 79 O apo E F
73 N– 73 OG1 holo E-F E-F
73 N– 77 O holo E-F E-F
73 OG1–72 O apo E-F E
81 N– 69 O apo F E
81 N– 79 O Mostly holo F F
90 N– 101 O mostly apo G H

Residues involved in the two opening modes coincide with different exchange rates along the sequence as determined by NMR

Comparing the regions affected by the motions with NMR-derived conformational exchange data, it is apparent that there is a coincidence of the region affected by Type II opening and the the NMR-reported ‘slow’ cluster located in the N-terminal half of the protein [16] (Fig 2E). Although kex parameters derived from CPMG relaxation dispersion NMR measurements report on a motion occuring on a much slower μs-ms time scale than reflected by the S2 restraints used in our simulations, we suggest that the observed Type II barrel opening is related to the slow conformational exchange revealed by NMR relaxation dispersion analysis. Specifically, the fast motions could set the stage for slower, larger-amplitude motions in the protein along a similar opening mode. The structural transition on a different time scale is also consistent with the temperature-dependence of the observed motions, i.e. a more even distribution of conformers along the Type II mode at higher temperatures. Importantly, the presence of fast motion along this mode in both the apo and the holo states suggests that Type II motions may have a role in both ligand uptake and release.

The hidden “holo-like” conformation in the apo state is partially unfolded

Previous NMR investigations of human gastrotropin have identified the presence of an invisible state that is in slow exchange with the observable apo state [4]. Moreover, it was suggested that this state exhibits holo-like structural features [16, 21]. In order to get a deeper insight into the nature of this conformer, we generated a pool of conformers and selected structures that might be representative of the higher energy state based on the differences in chemical shifts relative to the apo state when compared with the NMR-derived |Δɷ| (15N) values between the ground and higher energy states of IBABP. We note that with the availability of only backbone 15N chemical shift differences the structural information on the invisible state remains to be rather limited.

Apparently, the identified conformers with best correspondence to the experimental data are scattered around a large conformational space (Fig 3). Their common characteristics is that they are closer to the holo than to the apo state, which is in agreement with the previously proposed holo-like characteristics of the sparsely populated excited state indicated by NMR dynamic measurements [16]. Importantly, some of the conformers show a more pronounced Type II-like opening than the MUMO ensembles. The principal components in Fig 3. are not direclty corresponding to those in Fig 2. The HD3 and HD5 structures, being close in the PCA plot, show different degree of Type II-like opening in their helical region. Nevertheless, we consider this aspect to be the most relevant as Type II opening is clearly identifable in the MUMO ensembles calculated with a substantial amount of experimental data, in contrast to other motions identified in unrestrained simulations only.

Fig 3. PCA scatter plot of the apo MUMO (red dots) and holo MUMO (blue dots) ensembles along with the conformer pool (purple hollow squares) used to select the structures best corresponding to the NMR-derived invisible state.

Fig 3

Structures with a mean correlation between |Δɷ| (15N) values and calculated chemical shift differences above a threshold of 0.35 are shown with black dots (left panel). Structures with an RMSD between |Δɷ| (15N) values and calculated chemical shift differences lower than 0.00603 are depicted with green dots (right panel). Selected structures are also depicted and linked to their corresponding points in the PCA scatter plots. These hidden conformations are termed HD1-HD5.

As shown in Fig 4, secondary structure of the simulated conformers is diverse around the boundary of the α-helical and β-strand elements. In some structures almost all of the α- helical and β-strand elements are partially unfolded. The structures assumed to be the ‘holo-like’ apo conformations have low helical and β-strand content. The E-F region is the most susceptible to unfolding, in accordance with recent reports [26]. Taken together, these observations suggest that the transition from the apo to the holo state, instead of being a simple physical opening along the shortest route, is rather a complex succession of conformational rearrangements proceeding through a partially unfolded intermediate involving a loosened helical and C/D-turn regions, resembling in part the observed ‘Type II’ mode of motions.

Fig 4. Secondary structure of the conformations inferred from our simulations (rows).

Fig 4

Each column represents one amino acid. Extended β-strands are colored yellow, α-helices are brown, the rest of the residues are colored black. (A) All of the conformations. (B) The high correlation conformations (subset of conformations of panel A). (C) Conformations with lowest RMSD (another subset of panel A). The analysis was performed with DSSPCont [50]. Note the shortening of secondary structure elements in some structures, especially in B) and C).

Docking simulations support cooperativity of ligand binding

In order to further characterize the mechanism of ligand binding, we performed docking simulations into selected structures obtained in our calculations.

Based on the PCA analysis, four structures were selected representing extreme states along Type I and Type II opening, respectively. Additional three structures, regarded as the best models of the invisible state in slow exchange with the apo form were also included.

In general, the most favorable complexes were obtained when GCA was docked first, followed by the docking of GCDA (box diagrams in Fig 5). This scenario did not result in a successful ternary complex for only one of the proposed hidden structures, HD1, corresponding to an intermediate position between the apo and holo ensembles along Type I opening. Ligand binding provides the highest stabilization for the partially unfolded structures corresponding to a larger opening along a motion resembling Type II opening. Comparing the relative estimated energies of the corresponding apo structures (colored bars in Fig 5) suggests a complex energetic landscape where conformational states and ligand binding contribute to stability in an interdependent manner. Our results are compatible with a scenario where ligand entry occurs in an open, partially unfolded state followed by subsequent structural compaction, completing a transition along Type I rearragement along a pathway including a different, Type II-like opening.

Fig 5. Relative energies of docked structures relative to the ligand-free conformations.

Fig 5

Differences between the starting conformations of the molecular dynamics-derived structures relative to 2MM3 are depicted with colored bars. White boxes: only GCDA docked, stripped white boxes: GCDA docked first, GCA docked second, gray boxes: only GCDA docked, stripped gray boxes: GCA docked first, GCDA docked second. HM denotes a representative conformer from the holo MUMO ensemble, HN (holo narrow) and HW (holo wide) are selected extreme structures from the holo MUMO ensembles corresponding to Type II opening. In addition, three from the high correlation hidden conformers are selected (HD1, HD3 and HD5).

It should be noted that we do not regard the docking energies reported here as reliable ones in their absolute values, but as ones that can offer some insight to the binding process when used in a comparative evaluation. More elaborated methods, like free energy calculations [47] on the different complexes or umbrella sampling [48] could yield more precise and reliable estimates of the energetics of the binding process. It should be noted, however, that for the partially unfolded structures it is not trivial to set up and run such simulations and thus these are outside the scope of the present study, which focuses on ensembles generated on the basis of experimental data.

Comparison of ligand positions in the different structures (Table 3) and the total energies of the complexes (Fig 5) might lead to the conclusion that open gastrotropin structures can bind ligands with a high structural versatility while maintaining high affinity, although additional investigations are needed to fully prove this statement. These above results suggest that ligands might undergo dynamic reposition even in the binary and ternary complexes.

Table 3. A summary of the docking simulations conducted on three specific input structures showing the number of successful calculations as well as the number of cases where the ligand binds in an orientation similar to that observed in the 2MM3 structure.

Input structure Successful docking of GCDA poses into an empty protein Successful docking of GCA poses into a protein containing GCDA Successful docking of GCA poses into an empty protein Successful docking of GCDA poses into a protein containing GCA
HM 32 (22) 32 (24) 32 (13) 32 (25)
HW 32 (9) 30 (25) 32 (14) 32 (20)
HN 32 (12) 32 (29) 32 (32) 32 (23)
HD1 32 (23) 30 (28) 32 (0) 0 (0)
HD3 32 (13) 32 (29) 32 (21) 32 (24)
HD5 32 (5) 32 (30) 32 (28) 32 (7)

HM (a representative conformer from the holo MUMO ensemble), HW (holo wide), HN (holo narrow), corresponding to selected extreme conformations along Type II opening in the holo MUMO ensembles (Fig 2D). HD1, HD3 and HD5are selected hidden conformers (see Fig 3).

Conclusions

We have generated structural ensembles that are in agreement with available NMR parameters reporting on the structure and fast time-scale dynamics of human gastrotropin. The two types of barrel opening identified are in agreement with previous observations of the iLBP family. We propose a refined model of ligand entry that is compatible with the portal hypothesis, namely, that the structural transition from the apo to the holo state, termed Type I opening, proceeds along an indirect route involving partial unfolding of the helical cap structure. In our model this unfolding is related to and facilitated by another mode of barrel opening, termed Type II, that is present in both the apo and holo states.

Supporting information

S1 Fig. Experimental and back-calculated S2 order parameters of different ensembles.

(A) Apo MUMO ensembles by amino acids

(B) Apo MUMO ensembles as correlation plots

(C) Holo MUMO ensembles by amino acids

(D)Holo MUMO ensembles as correlation plots

(E) Unrestrained ensembles by amino acids

(F) Unrestrained ensembles as correlation plots

The PDB ensembles are depicted in all panels as references. All the plots were generated by the CoNSEnsX webserver.

(PDF)

S2 Fig. Cα atom distance correlations.

(A) Cα atom distance correlation matrix with PC1 (above diagonal) and PC2 (below diagonal) for each amino acid pair. Helical regions are labeled with beige rectangles, β strand regions with gray rectangles

(B) The highest (red, brown lines) and lowest (blue, gray lines) distances depicted on the structure correalted with PC1

(C) The lowest (blue, gray lines) distances depicted on the structure correalted with PC2.

(JPG)

S3 Fig. Investigation of the docked frames.

(A) Tanimoto distances of the simulation frames and the docked frames based on ligand contact data (see text)

(B) Comparison of the average of the number of ligand heavy atoms (vertical axis) being closer, than 4 Å to each amino acid (horizontal axis) of CHO (blue all frames, black docked frames).

(C). Comparison of the average of the number of ligand heavy atoms (vertical axis) being closer, than 4 Å to each amino acid (horizontal axis) of GCH: green simulated frames, red docked frames. Correlations are listed on the top.

(JPG)

S4 Fig. Histograms of PC2 (Type II motion) of the MUMO ensembles along PC1 and PC2.

(JPG)

S1 Table. The number of frames in each simulation, where the respective hydrogen bonds are present.

The first column contains the specification of the hydrogen bonds in the structures as: a_<donor_residue_number>_<donor_atom_name>_a_<acceptor_residue_number>_<acceptor_atom_name>.

(PDF)

S1 Movie. The movement along the first PCA mode PC1.

(MPG)

S2 Movie. The movement along the second PCA mode PC2.

(MPG)

S1 File. Topology files for the the 6 ns MUMO simulations.

(ZIP)

S1 Appendix. Main features of protein-ligand interactions are similar in the simulated and docked structures.

(DOCX)

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was supported by: ZG: National Research, Development and Innovation Office – NKFIH, grant no. NF104198, http://nkfih.gov.hu/; OT: National Research, Development and Innovation Office – NKFIH, grant no. K109035, http://nkfih.gov.hu/; and OT: Hungarian Academy of Sciences, János Bolyai Research Fellowship, http://www.mta.hu. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Smathers RL, Petersen DR. The human fatty acid-binding protein family: evolutionary divergences and functions. Hum Genomics. 2011. March;5(3):170–191. 10.1186/1479-7364-5-3-170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Zwicker BE, Agellon LB. Transport and biological activities of bile acids. Int J Biochem Cell Biol. 2013. July;45(7):1389–98. 10.1016/j.biocel.2013.04.012 [DOI] [PubMed] [Google Scholar]
  • 3.Trauner M, Boyer JL. Bile salt transporters: molecular characterization, function, and regulation. Physiol Rev. 2003. April;83(2):633–71. 10.1152/physrev.00027.2002 [DOI] [PubMed] [Google Scholar]
  • 4.Horváth G, Király P, Tárkányi G, Toke O. Internal motions and exchange processes in human ileal bile acid binding protein as studied by backbone (15)N nuclear magnetic resonance spectroscopy. Biochemistry. 2012. March 6;51(9):1848–1861. doi: 10.1021/bi201588q [DOI] [PubMed] [Google Scholar]
  • 5.Lin MC, Kramer W, Wilson FA. Identification of cytosolic and microsomal bile acid-binding proteins in rat ileal enterocytes. J Biol Chem. 1990. September 5;265(25):14986–14995. [PubMed] [Google Scholar]
  • 6.Alrefai WA, Gill RK. Bile acid transporters: structure, function, regulation and pathophysiological implications. Pharm Res. 2007. October;24(10):1803–1823. 10.1007/s11095-007-9289-1 [DOI] [PubMed] [Google Scholar]
  • 7.Veerkamp JH, Maatman RG. Cytoplasmic fatty acid-binding proteins: their structure and genes. Prog Lipid Res. 1995;34(1):17–52. 10.1016/0163-7827(94)00005-7 [DOI] [PubMed] [Google Scholar]
  • 8.Sessler RJ, Noy N. A ligand-activated nuclear localization signal in cellular retinoic acid binding protein-II. Mol Cell. 2005. April 29;18(3):343–353. 10.1016/j.molcel.2005.03.026 [DOI] [PubMed] [Google Scholar]
  • 9.Ayers SD, Nedrow KL, Gillilan RE, Noy N. Continuous nucleocytoplasmic shuttling underlies transcriptional activation of PPARgamma by FABP4. Biochemistry. 2007. June 12;46(23):6744–6752. 10.1021/bi700047a [DOI] [PubMed] [Google Scholar]
  • 10.Nakahara M, Furuya N, Takagaki K, Sugaya T, Hirota K, Fukamizu A, Kanda T, Fujii H, Sato R. Ileal bile acid-binding protein, functionally associated with the farnesoid X receptor or the ileal bile acid transporter, regulates bile acid activity in the small intestine. J Biol Chem. 2005. December 23;280(51):42283–42289. 10.1074/jbc.M507454200 [DOI] [PubMed] [Google Scholar]
  • 11.Tochtrop GP, Richter K, Tang C, Toner JJ, Covey DF, Cistola DP. Energetics by NMR: site-specific binding in a positively cooperative system. Proc Natl Acad Sc U S A. 2002. February 19;99(4):1847–1852. 10.1073/pnas.012379199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Eliseo T, Ragona L, Catalano M, Assfalg M, Paci M, Zetta L, Molinari H, Cicero DO. Structural and dynamic determinants of ligand binding in the ternary complex of chicken liver bile acid binding protein with two bile salts revealed by NMR. Biochemistry. 2007. November 6;46(44):12557–12567. 10.1021/bi7013085 [DOI] [PubMed] [Google Scholar]
  • 13.Turpin ER, Fang HJ, Thomas NR, Hirst JD. Cooperativity and site selectivity in the ileal lipid binding protein. Biochemistry. 2013. July 9;52(27):4723–4733. 10.1021/bi400192w [DOI] [PubMed] [Google Scholar]
  • 14.Kouvatsos N, Thurston V, Ball K, Oldham NJ, Thomas NR, Searle MS. Bile acid interactions with rabbit ileal lipid binding protein and an engineered helixless variant reveal novel ligand binding properties of a versatile beta-clam shell protein scaffold. J Mol Biol.2007. August 31;371(5):1365–1377. 10.1016/j.jmb.2007.06.024 [DOI] [PubMed] [Google Scholar]
  • 15.Kurz M, Brachvogel V, Matter H, Stengelin S, Thüring H, Kramer W. Insights into the bile acid transportation system: the human ileal lipid-binding protein-cholyltaurine complex and its comparison with homologous structures. Proteins. 2003. February 1;50(2):312–328. 10.1002/prot.10289 [DOI] [PubMed] [Google Scholar]
  • 16.Horváth G, Bencsura Á, Simon Á, Tochtrop GP, DeKoster GT, Covey DF, Cistola DP, Toke O. Structural determinants of ligand binding in the ternary complex of human ileal bile acid binding protein with glycocholate and glycochenodeoxycholate obtained from solution NMR. FEBS J. 2016. Feb;283(3):541–555. 10.1111/febs.13610 [DOI] [PubMed] [Google Scholar]
  • 17.Banaszak L, Winter N, Xu Z, Bernlohr DA, Cowan S, Jones TA. Lipid-binding proteins: a family of fatty acid and retinoid transport proteins. Adv Protein Chem. 1994;45:89–151. 10.1016/S0065-3233(08)60639-7 [DOI] [PubMed] [Google Scholar]
  • 18.Tochtrop GP, Bruns JL, Tang C, Covey DF, Cistola DP. Steroid ring hydroxylation patterns govern cooperativity in human bile acid binding protein. Biochemistry. 2003. October 14;42(40):11561–11567. 10.1021/bi0346502 [DOI] [PubMed] [Google Scholar]
  • 19.Toke O, Monsey JD, DeKoster GT, Tochtrop GP, Tang C, Cistola DP. Determinants of cooperativity and site selectivity in human ileal bile acid binding protein. Biochemistry. 2006. January 24;45(3):727–737. 10.1021/bi051781p [DOI] [PubMed] [Google Scholar]
  • 20.Tochtrop GP, DeKoster GT, Covey DF, Cistola DP. A single hydroxyl group governs ligand site selectivity in human ileal bile acid binding protein. J Am Chem Soc. 2004. September 8;126(35):11024–11029. 10.1021/ja047589c [DOI] [PubMed] [Google Scholar]
  • 21.Horváth G, Egyed O, Toke O. Temperature dependence of backbone dynamics in human ileal bile acid-binding protein: implications for the mechanism of ligand binding. Biochemistry. 2014. August 12;53(31):5186–5198. 10.1021/bi500553f [DOI] [PubMed] [Google Scholar]
  • 22.Hodsdon ME, Cistola DP. Discrete backbone disorder in the nuclear magnetic resonance structure of apo intestinal fatty acid-binding protein: implications for the mechanism of ligand entry. Biochemistry. 1997. February 11;36(6):1450–1460, 10.1021/bi961890r [DOI] [PubMed] [Google Scholar]
  • 23.Jenkins AE, Hockenberry JA, Nguyen T, Bernlohr DA. Testing of the portal hypothesis: analysis of a V32G, F57G, K58G mutant of the fatty acid binding protein of the murine adipocyte. Biochemistry. 2002. February 12;41(6):2022–2027. 10.1021/bi015769i [DOI] [PubMed] [Google Scholar]
  • 24.Ragona L, Pagano K, Tomaselli S, Favretto F, Ceccon A, Zanzoni S, D'Onofrio M, Assfalg M, Molinari H. The role of dynamics in modulating ligand exchange in intracellular lipid binding proteins. Biochim Biophys Acta. 2014. July;1844(7):1268−1278. 10.1016/j.bbapap.2014.04.011 [DOI] [PubMed] [Google Scholar]
  • 25.Cogliati C, Ragona L, D'Onofrio M, Günther U, Whittaker S, Ludwig C, Tomaselli S, Assfalg M, Molinari H. Site-specific investigation of the steady-state kinetics and dynamics of the multistep binding of bile acid molecules to a lipid carrier protein. Chemistry. 2010. October 4;16(37):11300–11310. 10.1002/chem.201000498 [DOI] [PubMed] [Google Scholar]
  • 26.Horváth G, Biczók L, Majer Z, Kovács M, Micsonai A, Kardos J, Toke O. Structural insight into a partially unfolded state preceding aggregation in an intracellular lipid-binding protein. FEBS J. 2017. November;284(21):3637–3661. 10.1111/febs.14264 [DOI] [PubMed] [Google Scholar]
  • 27.Ángyán AF, Gáspári Z. Ensemble-based interpretations of NMR structural data to describe protein internal dynamics. Molecules. 2013. August 30;18(9):10548–10567. 10.3390/molecules180910548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJ. GROMACS: fast, flexible and free. J Comput Chem. 2005. December;26(16):1701–1718. 10.1002/jcc.20291 [DOI] [PubMed] [Google Scholar]
  • 29.Pronk S, Páll S, Schulz R, Larsson P, Bjelkmar P, Apostolov R, Shirts MR, Smith JC, Kasson PM, van der Spoel D, Hess B, Lindahl E. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics. 2013. April 1;29(7):845–854. 10.1093/bioinformatics/btt055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Fizil Á, Gáspári Z, Barna T, Marx F, Batta G. "Invisible" conformers of an antifungal disulfide protein revealed by constrained cold and heat unfolding, CEST-NMR experiments and molecular dynamics calculations. Chemistry. 2015. March 23;21(13):5136–5144. 10.1002/chem.201404879 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Richter B, Gsponer J, Várnai P, Salvatella X, Vendruscolo M. The MUMO (minimal under-restraining minimal over-restraining) method for the determination of native state ensembles of proteins. J Biomol NMR. 2007. February;37(2):117–135. 10.1007/s10858-006-9117-7 [DOI] [PubMed] [Google Scholar]
  • 32.Kony D, Damm W, Stoll S, Van Gunsteren WF. An improved OPLS-AA force field for carbohydrates. J Comput Chem. 2002. November 30;23(15):1416–1429. 10.1002/jcc.10139 [DOI] [PubMed] [Google Scholar]
  • 33.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983. April;79(2):926–935. 10.1063/1.445869 [DOI] [Google Scholar]
  • 34.Ángyán AF, Szappanos B, Perczel A, Gáspári Z. CoNSEnsX: an ensemble view of protein structures and NMR-derived experimental data. BMC Struct Biol. 2010. October 29;10:39 10.1186/1472-6807-10-39 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A Linear Constraint Solver for Molecular Simulations. J Comput Chem. 07 December 1998;18(12):1463–1472. [DOI] [Google Scholar]
  • 36.Gabriel E, Fagg GE, Bosilca G, Angskun T, Dongarra JJ, Squyres JM, Sahay V, Kambadur P, Barrett B, Lumsdaine A, Castain RH, Daniel DJ, Graham RL, Woodall TS. Open MPI: Goals, Concept and Design of a Next Generation MPI Implementation In: Kranzlmüller D, Kacsuk P, Dongarra J, editors. Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI Lecture Notes in Computer Science, vol 3241 pp 97–104; 2004; Springer, Berlin, Heidelberg, Germany: 10.1007/978-3-540-30218-6_19 [DOI] [Google Scholar]
  • 37.Friesner RA, Murphy RB, Repasky MP, Frye LL, Greenwood JR, Halgren TA, Sanschagrin PC, Mainz DT. Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. J Med Chem. 2006. October 19;49(21):6177–6196. 10.1021/jm051256o [DOI] [PubMed] [Google Scholar]
  • 38.Dudola D, Kovács B, Gáspári Z. CoNSEnsX+ Webserver for the Analysis of Protein Structural Ensembles Reflecting Experimentally Determined Internal Dynamics. J Chem Inf Model. 2017. August 28;57(8):1728–1734. 10.1021/acs.jcim.7b00066 [DOI] [PubMed] [Google Scholar]
  • 39.Bakan A, Meireles LM, Bahar I. ProDy: Protein Dynamics Inferred from Theory and Experiments. Bioinformatics. 2011. June 1;27(11):1575–1577. 10.1093/bioinformatics/btr168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996. February;14(1):33–38. 10.1016/0263-7855(96)00018-5 [DOI] [PubMed] [Google Scholar]
  • 41.Han B, Liu Y, Ginzinger SW, Wishart DS. SHIFTX2: significantly improved protein chemical shift prediction. J Biomol NMR. 2011. May;50(1):43–57. 10.1007/s10858-011-9478-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Xu D, Tsai CJ, Nussinov R. Hydrogen bonds and salt bridges across protein-protein interfaces. Protein Eng Des Sel. 1997. September;10(9):999–1012. 10.1093/protein/10.9.999 [DOI] [PubMed] [Google Scholar]
  • 43.Baker EN, Hubbard RE. Hydrogen bonding in globular proteins. Prog Biophys Mol Biol. 1984;44(2):97–179. 10.1016/0079-6107(84)90007-5 [DOI] [PubMed] [Google Scholar]
  • 44.Cai J, Lücke C, Chen Z, Qiao Y, Klimtchuk E, Hamilton JA. Solution structure and backbone dynamics of human liver fatty acid binding protein: fatty acid binding revisited. Biophys J. 2012. June 6;102(11):2585–2594. 10.1016/j.bpj.2012.04.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lücke C, Zhang F, Hamilton JA, Sacchettini JC, Rüterjans H. Solution structure of ileal lipid binding protein in complex with glycocholate. Eur J Biochem. 2000. May;267(10):2929–2938. 10.1046/j.1432-1327.2000.01307.x [DOI] [PubMed] [Google Scholar]
  • 46.Czajlik A, Kovács B, Permi P, Gáspári Z. Fine-tuning the extent and dynamics of binding cleft opening as a potential general regulatory mechanism in parvulin-type peptidyl prolyl isomerases. Sci Rep. 2017. Mar 16;7:44504 10.1038/srep44504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Abel R, Wang L, Mobley DL, Friesner RA. A Critical Review of Validation, Blind Testing, and Real- World Use of Alchemical Protein-Ligand Binding Free Energy Calculations. Curr Top Med Chem. 2017;17(23):2577–2585. 10.2174/1568026617666170414142131 [DOI] [PubMed] [Google Scholar]
  • 48.You W, Tang Z, Chang CA. Potential Mean Force from Umbrella Sampling Simulations: What Can We Learn and What Is Missed? J Chem Theory Comput. 2019. March 14 10.1021/acs.jctc.8b01142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE. UCSF Chimera—a visualization system for exploratory research and analysis. J Comput Chem. 2004. October;25(13):1605–1612. 10.1002/jcc.20084 [DOI] [PubMed] [Google Scholar]
  • 50.Andersen CA, Palmer AG, Brunak S, Rost B. Continuum secondary structure captures protein flexibility. Structure. 2002. February;10(2):175–184. 10.1016/S0969-2126(02)00700-1 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Experimental and back-calculated S2 order parameters of different ensembles.

(A) Apo MUMO ensembles by amino acids

(B) Apo MUMO ensembles as correlation plots

(C) Holo MUMO ensembles by amino acids

(D)Holo MUMO ensembles as correlation plots

(E) Unrestrained ensembles by amino acids

(F) Unrestrained ensembles as correlation plots

The PDB ensembles are depicted in all panels as references. All the plots were generated by the CoNSEnsX webserver.

(PDF)

S2 Fig. Cα atom distance correlations.

(A) Cα atom distance correlation matrix with PC1 (above diagonal) and PC2 (below diagonal) for each amino acid pair. Helical regions are labeled with beige rectangles, β strand regions with gray rectangles

(B) The highest (red, brown lines) and lowest (blue, gray lines) distances depicted on the structure correalted with PC1

(C) The lowest (blue, gray lines) distances depicted on the structure correalted with PC2.

(JPG)

S3 Fig. Investigation of the docked frames.

(A) Tanimoto distances of the simulation frames and the docked frames based on ligand contact data (see text)

(B) Comparison of the average of the number of ligand heavy atoms (vertical axis) being closer, than 4 Å to each amino acid (horizontal axis) of CHO (blue all frames, black docked frames).

(C). Comparison of the average of the number of ligand heavy atoms (vertical axis) being closer, than 4 Å to each amino acid (horizontal axis) of GCH: green simulated frames, red docked frames. Correlations are listed on the top.

(JPG)

S4 Fig. Histograms of PC2 (Type II motion) of the MUMO ensembles along PC1 and PC2.

(JPG)

S1 Table. The number of frames in each simulation, where the respective hydrogen bonds are present.

The first column contains the specification of the hydrogen bonds in the structures as: a_<donor_residue_number>_<donor_atom_name>_a_<acceptor_residue_number>_<acceptor_atom_name>.

(PDF)

S1 Movie. The movement along the first PCA mode PC1.

(MPG)

S2 Movie. The movement along the second PCA mode PC2.

(MPG)

S1 File. Topology files for the the 6 ns MUMO simulations.

(ZIP)

S1 Appendix. Main features of protein-ligand interactions are similar in the simulated and docked structures.

(DOCX)

Data Availability Statement

All relevant data are within the manuscript and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES