Skip to main content
The Journal of Chemical Physics logoLink to The Journal of Chemical Physics
. 2010 Oct 19;133(15):155103. doi: 10.1063/1.3496997

Free energy surfaces for the interaction of D-glucose with planar aromatic groups in aqueous solution

Jakob Wohlert 1,2, Udo Schnupf 1, John W Brady 1,a)
PMCID: PMC2980541  PMID: 20969429

Abstract

Multidimensional potentials of mean force for the interactions in aqueous solution of both anomers of D-glucopyranose with two planar aromatic molecules, indole and para-methyl-phenol, have been calculated using molecular dynamics simulations with umbrella sampling and were subsequently used to estimate binding free energies. Indole and para-methyl-phenol serve as models for the side chains of the amino acids tryptophan and tyrosine, respectively. In all cases, a weak affinity between the glucose molecules and the flat aromatic surfaces was found. The global minimum for these interactions was found to be for the case when the pseudoplanar face of β-D -glucopyranose is stacked against the planar surfaces of the aromatic residues. The calculated binding free energies are in good agreement with both experiment and previous simulations. The multidimensional free energy maps suggest a mechanism that could lend kinetic stability to the complexes formed by sugars bound to sugar-binding proteins.

INTRODUCTION

Many proteins, such as sugar-binding proteins,1, 2 or enzymes with carbohydrate substrates, such as lysozyme3 or cellulases,4 have been designed by evolution to bind sugars, but the affinity of sugar molecules for such proteins is relatively weak. Lectins, which are the most common class of sugar-binding proteins, bind monosaccharides with binding constants around103M−1,5, 6 which corresponds to binding free energies on the order of 10kBT at physiological temperatures. For oligosaccharides the typical binding constants are higher, up to106–107M−1.7 This is still fairly low and constitutes a challenge to both biochemists and nature itself in the design of carbohydrate-recognizing proteins.5

The binding sites in sugar-binding proteins often feature the flat side chains of the aromatic amino acids phenylalanine (Phe), tyrosine (Tyr), or particularly, tryptophan (Trp).8 The extended active sites of cellulases, for example, contain several such residues spaced so as to interact with several sugar rings in cellulose.8 In complexes of these proteins with their substrates, the carbohydrate rings are in many cases known from X-ray diffraction studies9 to stack their hydrophobic, aliphatic protons against the flat hydrophobic faces of these aromatic side chains. This binding motif for carbohydrate binding sites is sufficiently widespread as to suggest a special affinity between sugar rings and these aromatic side chains.10, 11

There are relatively few studies investigating carbohydrate-aromatic interactions in isolation. Fernández-Alonzo et al.12 used NMR to demonstrate that monosaccharides (methyl β-galactoside) and aromatic rings (benzene) do indeed form complexes in solution and used ab initio calculations to quantify the interaction energies to around 12 kJ∕mol (in vacuum). The same strategy was employed by Vandenbussche et al.,10 who also used NMR in conjunction with ab initio and molecular mechanics modeling to study the same carbohydrate-aromatic complex. They further used titration to estimate the binding affinity to be very low, below1M−1. They also found a significant hydrophobic component in their results, since the affinity disappeared in other polar solvents than water.

Preliminary MD simulation studies of D-glucose interacting with a model peptide in solution have indeed found a highly specific and localized interaction site for D-glucose molecules adjacent to the planar face of tryptophan, with an estimated interaction energy of about −2.5kJ∕mol, equal to kBT at room temperature.13 Moreover, a recent experimental study of the thermal denaturation of synthetic peptides containing a tryptophan residue and a covalently bound saccharide, positioned so as to be able to stack against each other in the folded state, also estimated this interaction energy to be of the same magnitude, −3.35 kJ∕mol.14 This shows the importance of X-H⋯π interactions for carbohydrate recognition, but also, not surprisingly, that these interactions alone are not sufficient to account for the affinities of real binding sites, which are an order of magnitude larger. Clearly, some cooperativity at the binding site is present, usually attributed to hydrogen bonding,5, 10, 14 which helps desolvate and stabilize the sugar at the binding site. There is also some cooperativity present between binding sites, in that the binding affinities for oligosaccharides to multivalent sugar-binding proteins are greater that the sum of the individual affinities. This effect has been coined the cluster glycoside effect.7

Given that sugar-aromatic interactions seem to play a crucial role for carbohydrate recognition, as indicated by the widespread occurrence of this stacking motif in sugar-binding proteins, it would be of use to have a more complete description of the energetics of the interactions of sugar molecules with such planar functional groups. In this paper, we report the calculation of the potentials of mean force (pmf) for the interactions of both anomers of D-glucopyranose with two planar aromatic molecules, indole and para-methyl-phenol, which serve as models for the side chains of Trp and Tyr, respectively. Figure 1 illustrates the geometries of these molecules. The approach employed used symmetry considerations to identify the relevant degrees of freedom of the bound complex and molecular dynamics (MD) simulations with umbrella sampling to construct multidimensional pmfs, which are three dimensional descriptions of the Gibbs free energy as a function of both separation distance and relative orientations of the two molecules.

Figure 1.

Figure 1

The molecules modeled in the present study: (a) indole, (c) para-methyl-phenol, (b) α-D -glucopyranose, and (d) β-D -glucopyranose. Both glucose molecules are shown with their primary alcohol groups in the gt conformation (Ref. 16).

METHODS

The goal of the present study was to calculate the binding free energy of two species, AandB, forming a complex AB in solution. A possible way to proceed is to consider the equilibrium constant, Kb, of the binding reactionA+BAB. It can be expressed asKb=[AB]∕[A][B], where[A], [B], and [AB] are the equilibrium concentrations of A andB, and the bound complexAB, respectively. From this, the binding free energy ΔG is calculated fromΔG=−kBT ln(CKb), where kB is Boltzmann's constant, Tis the temperature, and C is a standard concentration, usually 1M, which corresponds to1∕1.661 nm−3.

From a classical statistical mechanical point of view, ΔGcan be expressed as

ΔG=RT(C8π2ZAB+SZSZA+SZB+S)+PΔV, (1)

where R is the universal gas constant, ZXare configuration integrals overA, B, and S (solvent) degrees of freedom. If the volume change ΔV upon binding is small, the pressure-volume term in Eq. 1 is negligible. Following Gilson et al.,15 ΔG can be written in a form which is more convenient for computer simulations,

ΔG=kBT ln(C8π2VBdrdξeW(r,ξ)kBT), (2)

where W(r,ξ) is the potential of mean force, or work function, which represents the work needed to bring molecule A from a point far away in the solution to(r,ξ). Note that W by construction includes all relevant contributions to the configuration integrals, including solvent effects. The integral is taken overVB, which is the volume associated with the bound state complexAB. In Eq. 2, W is assumed to be a function only of the six coordinates that describes the position and orientation of molecule A relative to moleculeB. This approximation rests on the assumption that the molecules’ internal degrees of freedom do not change significantly upon binding, and thus do not contribute to the binding free energy. The validity of this assumption may be questioned, especially for larger molecules like proteins which may undergo quite profound conformational changes upon ligand binding∕unbinding. For monosaccharides, which are relatively small molecules, the main concern would be rotations of the primary alcohol.16 If such changes in the conformational space for this particular degree of freedom do not significantly contribute to the total free energy, the assumption seems justified.

Even with the given approximations, the configurational integral must be evaluated over six degrees of freedom, which for most biomolecular systems poses a significant challenge in terms of simulation times needed for satisfactory convergence. Usually, one proceeds from this point by making some approximations regarding the relative motion of A andB. In an isotropic, infinitely dilute system, Ais assumed to rotate freely in the reference frame ofB, and the integration over the Euler angles can be carried out directly.17 For the case of spherical symmetry, the simplest possible case, the work function is a function of only one variable, the distancer. The free energy then becomes18

ΔG=kB ln(4πC0rCdreW(r)kBT), (3)

where rc is some cut off radius that distinguishes the bound state from the free state. The value selected for rc is somewhat arbitrary, but a common choice is to select it to be where W flattens out, showing no significant structure that could be attributed to a bound state.

Other approaches include the adoption of a model for the coordinates of the bound state, usually a harmonic or quasiharmonic approximation,17, 19 or combinations of restrained MD and thermodynamic decoupling methods.20 The selected approximations must, of course, be justified by existing symmetries of the system under study. Assuming spherical symmetry will for most biomolecular systems greatly overestimate the available phase space volume of the bound state, but leads on the other hand to fairly straightforward calculations. Assuming that A rotates completely freely also seems like a very crude approximation in most cases. It should however be possible to identify which degrees of freedom are the most relevant, for a particular type of problem, and eliminate the rest.

In the present case, a monosaccharide A interacts with a flat aromatic moleculeB, at close distances presumably being restricted to a stacked arrangement. Clearly, this problem does not exhibit spherical symmetry, nor does A rotate freely with respect to B at very close distances. Considering the system to be composed of two flat, circular disks as in Fig. 2, from symmetry considerations, three coordinates are sufficient to describe their relative positions and orientations. Using spherical coordinates for their relative position, one coordinate is the center of mass (COM) distance, r=|rArB|, while θ is the angle that |rArB| makes with the normal n̑B to moleculeB. The second angle, ϕ, is not needed, and can be integrated over directly. For their orientation, only one Euler angle β is needed, which is the angle the normal to the plane of moleculeA, n̑A, makes withn̑B. The remaining two Euler angles, usually denoted α andγ, do not need to be considered, and can be integrated out. This leads to a work function which is a function of three variables, W=W(r,θ,β), and the free energy of binding becomes

ΔG=kBT ln(πCdrdθdβr2 sinθ sinβeW(r,θ,β)kBT), (4)

where r runs from 0 torc, and the angles from 0 to2π.

Figure 2.

Figure 2

Definition of the reaction coordinates.

The work function W(r,θ,β) can be obtained from MD simulations via the probability distributionρ(r,θ,β), which is the probability of finding the system in the state described by the three coordinates, as

W(r,θ,β)=kBT ln[ρ(r,θ,β)]. (5)

This probability distribution can be calculated from a sufficiently long simulation simply by counting states. Such an approach would converge very slowly, however, and it is usually better to instead use some sort of biased simulation. In the present work, umbrella sampling along the COM separation was employed, using a harmonic umbrella potential21 of the form

U(r)=k(rri)2, (6)

with r=|rArB| as before, and i ranging from 1 to the total number of simulationsN. Each of the trajectories from the N simulations were decomposed into Nθ×Nβ probability distributions, where Nθ and Nβ are the number of grid points in θ and β space, respectively. All probability distributions belonging to the same θ-β-pair were combined into a single, unbiased, probability distribution using WHAM,22 and the corresponding free energy profile was calculated using Eq. 5. Although the system is contoured along one coordinate, the COM distance, the procedure leads to an approximation of the three dimensional work functionW(r,θ,β). Finally, the free energy of binding, ΔG˚, can be calculated by numerical integration of Eq. 4.

Molecular dynamics simulations

The MD simulations in this work were performed with the GROMACS 4.0 simulation software,23 using the CHARMM22 (Ref. 24) and CSFF (Ref. 25) force fields, as implemented in GROMACS,26 for the solutes, and the CHARMM implementation of the TIP3P potential for the water.27 The integration was done using a leap-frog algorithm with a 2 fs time step. All bonds involving hydrogen atoms were constrained to their equilibrium values using P-LINCS,28 and water molecules were kept completely rigid using SETTLE.29 Nonbonded electrostatic interactions were treated with PME (Ref. 30) using a cut off of 1.0 nm for the real space part, and Lennard-Jones interactions were shifted to zero between 1.0 and 1.2 nm. The simulations were run at a constant temperature of 298 K, using velocity rescaling,31 and constant pressure using a Parrinello–Rahman barostat with reference pressure of 1 atm.32

As described above, the system was made subject to a biasing harmonic potential along a coordinate defined by the center-of-mass separation between the two solutes, r [see Eq. 6]. The force constant k was set to 836.8 kJ∕mol (Ref. 21) and ri ranged from 0.4 to 1.2 nm in 0.05 nm increments. Each sampling window was simulated for 5 ns, with the exception for simulations at the very shortest distances (0.4–0.5 nm), which were run for a total of 25 ns to enhance the sampling. Since r is a one dimensional reaction coordinate in three dimensional space, a Jacobian correction of 2kBT ln(r) was added to the pmfs33 to make them level off to a constant value at large distances.

Starting coordinates for the solutes were generated using CHARMM (Ref. 34) and subsequently placed into a cubic simulation box with side lengths 2.6 nm, which was filled with TIP3P water. The system was then subjected to energy minimization followed by a short 100 ps equilibration run in which all solute atoms were harmonically constrained, to allow for the water to equilibrate around the solutes, and for pressure and temperature to settle.

RESULTS AND DISCUSSION

Figure 3 displays the radially averaged pmfs for each of the four cases studied; that is, for both of the anomers of D-glucopyranose (αandβ) paired with the flat aromatic molecules indole and para-methyl-phenol, which are models for the side chains of the amino acids tryptophan and tyrosine, respectively. These functions are averaged over all possible relative orientations of the two molecules (that is, all values of β andθ) at each separation distance. As can be seen, all four pmf functions suggest a weak affinity of both glucose anomers for the flat aromatic surfaces. The calculated well depths are only between 0.9 and 1.5 kJ∕mol, or around one-halfkBT. The statistical errors, however, which were calculated using a bootstrap procedure described in Hub and de Groot,35 are reassuringly small, of the order of ±9.6%. For each glucose anomer, there is little difference in the interaction with these two flat surfaces. However, the pmfs show a striking difference in the nature of the interaction for the two different anomers. The β-D -glucopyranose anomer, on both indole and para-methyl-phenol, gives a narrow first minimum, centered around approximately 0.45 nm, which is essentially the contact distance for a stacked conformation, with a second, much broader minimum around 0.6–0.7 nm. The α-D -glucopyranose anomer, however, exhibits only a very broad minimum around 0.6 nm, on both aromatic residues. This distance is clearly too large to be associated with direct stacking interactions, but is rather, as will be seen below, the separation for a conformation in which the glucose molecules are significantly tilted with respect to the planar residue. This difference in interaction geometry is consistent with the results of recent simulations of glucose interacting with a tryptophan residue in a model peptide,13 where more optimal stacking was exhibited by the β-anomer, with its hydrophobic H1-H3-H5 triad face [see Fig. 1d].

Figure 3.

Figure 3

Potentials of mean force as function of center-of-mass separation for α-D -glucopyranose-indole (solid black), β-D -glucopyranose-indole (dashed black), α-D -glucopyranose-para-methyl-phenol (solid gray), and β-D -glucopyranose-para-methyl-phenol (dashed gray).

By decomposition of the one dimensional work function into three explicit reaction coordinates as described in the previous section, three dimensional free energy surfaces were calculated as a function of the COM separation between each molecular pair and the two angles θ and β describing their relative orientation. Representations of these free energies are displayed in Figs. 45, where the free energy is plotted as a function of two variables r and the cosine of the angle θ for two different values of the angleβ, cos(β)≅1, for which the two solutes are more or less planar, and cos(β)≅0, for which the glucose molecule is tilted 90° with respect to the aromatic molecule. The free energy surfaces for all four systems look similar. For cos(β)≅1 there are two deep wells located at r≅0.45 nm and cos(θ)≅±1. It is easy to see that this corresponds to a situation where the glucose is stacked on top of the aromatic molecule in an almost completely flat arrangement, and that the two minima originate from the two different faces of the glucose. For β-D -glucopyranose, the depths of the two minima are the same, which is reasonable since the two faces differ by only one axial aliphatic proton (see Fig. 1). The depths of these minima, which are also the global minima, are about −10 kJ∕mol, relative to the separated state. For α-D -glucopyranose, the minimum at cos(θ)≅−1 is only about half as deep as the minimum at cos(θ)≅+1, around −5 kJ∕mol. That there is a difference is not unexpected, since α-D -glucopyranose has its anomeric hydroxyl group in an axial position, i.e., more or less perpendicular to the mean plane of the molecule (Fig. 2). This difference means that the anomeric group in one of the arrangements is pointing directly toward the aromatic molecule, and in the other, it is pointing away, allowing it to hydrogen bond to solvent. That these stacked arrangements correspond to free energy minima could broadly be understood as arising from hydrophobic association freeing structured water molecules from these weakly hydrated faces.

Figure 4.

Figure 4

Free energy surfaces for glucose-aromatic systems: α-D -glucopyranose-indole (a), β-D -glucopyranose-indole (b), α-D -glucopyranose-para-methyl-phenol (c), and β-D -glucopyranose-para-methyl-phenol (d). Energies are given as a function of separation r and cos(θ), for two values of cos(θ); cos(β)≅0 (left) for which the glucose is tilted with respect to the aromatic molecule, and cos(β)≅1 (right) which corresponds to a planar arrangement (see Fig. 2 and text).

Figure 5.

Figure 5

Free energy surfaces for glucose-aromatic systems: α-D -glucopyranose-indole (a), β-D -glucopyranose-indole (b), α-D -glucopyranose-para-methyl-phenol (c), and β-D -glucopyranose-para-methyl-phenol (d). Energy contours are in kJ∕mol.

For cos(β)≅0 there are also two minima at cos(θ)≅±1, which corresponds to an arrangement where the glucose is still more or less on top of the aromatic residue, but tilted 90° with respect to its normal. These minima are located further out than in the stacked arrangement, atr≅0.6 nm. Furthermore, they are much shallower than the one for cos(β)≅1, only −3 to −5 kJ∕mol, with the more negative value for the α-D -glucopyranose. A qualitative interpretation for the shallow minimum corresponding to perpendicular approach is more difficult to visualize than in the hydrophobic stacking case.

It should be noted that the lowest energy state, in all cases, is the stacked, flat geometry, which has an energy considerably lower than the fully separated state. This result is also apparent in the one dimensional pmfs shown in Fig. 3, but only in those for β-D -glucopyranose. For α-D -glucopyranose, the global minimum in the pmf instead coincides with the minimum associated with a tilted arrangement. Of course, the one dimensional pmf is an average over many relevant degrees of freedom and may therefore not convey all important features of the total free energy landscape.

The contour plots in Fig. 5 illustrate that while the lowest energy geometry for the glucose molecules relative to either aromatic partner is to be stacked against its face withθ≅0°, pulling these two molecules directly apart would require surmounting a significant barrier as a vacuum was initially created between the two surfaces until they had separated enough to allow water molecules to fill the space between them. For this reason, it can be seen that the lowest energy pathway for separation involves displacing the two molecules off-center, following a diagonal path of decreasing cos(θ) and increasing r on the contour plots until cos(θ) is almost 0 and r is approximately 0.9 nm, after which the molecules can separately more freely, particularly in the case of the β-D -glucopyranose anomer. In the restricted environment of a sugar-binding site in a protein, where the sugar molecule presumably is stabilized by hydrogen bonds to polar residues and caged by steric interactions, this lateral displacement may not be possible, forcing the sugar ligand to directly surmount such a barrier to escape, perhaps giving the protein-ligand complex greater kinetic stability than might be expected from the magnitude of the association energy alone. It is possible that such displacement could be facilitated by conformational changes in the protein, particularly if triggered by some signal at the site where the sugar ligand is to be given up. If so, such an arrangement would be more functional than a high absolute binding free energy, which would work against giving up the sugar when required.

As seen from Figs. 345, little significant difference can be found in the binding of the glucose molecules to the faces of the two different aromatic molecules, indicating that they are principally serving as flat hydrophobic surfaces rather than as specific asymmetric sites. Thus, hydrogen bonding between the sugar and the N–H or O–H functional groups of the aromatic systems did not serve to anchor the sugar in a particular arrangement on the surface, somewhat justifying the assumption of treating these molecules as symmetric disks, and ignoring the degrees of freedom describing the anisotropic placement of these groups. Inspection of the calculated trajectories found that no persistent hydrogen bonding took place between the two solute molecules in any of the calculations at any distance. This is in agreement with Laughrey et al.,14 who concluded that hydrogen bonding contributed to the total interaction energy with, at most, 0.4 kJ∕mol.14 In the simulations, the rotameric conformational distribution of the exocyclic hydroxymethyl group for most separations was an approximately 60:40 partitioning between gg andgt,16 respectively, as it would be for free glucose both in simulations25 and from NMR experiments.36 However, for a COM distance of around 0.4 nm, this primary alcohol conformational equilibrium shifted to 40:60, perhaps as a result of steric crowding when this exocyclic group is pointing axially down toward the aromatic surface. This shift also shows that hydrogen bonding between the molecules is of little importance, since this group would presumably have to be in this conformation to have any chance of making even a strained hydrogen bond to the aromatic hydroxyl, in the case of the para-methyl-phenol. Presumably even a strained hydrogen bond to the N–H group of the indole in a stacked arrangement would be topologically unlikely.

Figure 6 illustrates the variation of the potential of mean force with θ and β at two different values of the COM separationrc, 0.42 nm, corresponding to near van der Waals contact, and 0.62 nm, for β-D -glucopyranose interacting with indole. As can be seen, for the short COM separation the free energy landscape is complex. This is reasonable since the motion (rotation∕tilting) of the sugar molecule is restricted by steric interactions, and only one position is energetically preferred, with both |cos(θ)| and |cos(β)| being approximately 1 (see the left contour plot in Fig. 6). This particular position represents parallel stacking of the sugar against the aromatic surface (relative orientation “A” in Fig. 6). The barrier for motion away from this orientation is more than 25 kJ∕mol before other local minima can be reached, which are both more shallow and higher in energy. On the other hand, at larger COM distances the rotation of the sugar molecule is less restricted and it takes less energy to tilt the sugar against the aromatic ring plane and the energy landscape becomes flatter. This effect can be seen in the contour plot shown on the right in Fig. 6, which shows that there is no preferred parallel stacking, but rather tilting of the sugar molecule (relative orientations “B” and “C” in Fig. 6), with practically no energy barrier (compared tokBT) for moving between different values ofθ.

Figure 6.

Figure 6

Contour plots of the θ-β free energy landscape for β-D -glucopyranose interacting with indole, for two different values ofrc, 0.42 and 0.62 nm, along with illustrations of the relative orientations represented by three specific points, indicated on the plots by the corresponding letters. All energy contours are in kJ∕mol.

Finally, estimates of the standard free energy of binding ΔG were calculated using the methods described above, both by assuming spherical symmetry and integrating the one dimensional pmfs over the bound region [Eq. 3], and by integrating the three dimensional free energy according to Eq. 4. The results are shown in Table 1, for different choices of the cut off radius rc for the bound region, along with the corresponding one dimensional pmf well depth for comparison. The data in Table 1 show that there is a strong dependence of ΔG on the selected cutoff distancerc, around a factor of 2 between the shortest and the longest cut offs. Considering the somewhat arbitrary nature of the choice ofrc, it is difficult to make any precise statements regarding the absolute values ofΔG. Unfortunately, this is a characteristic of systems that are very weakly bound. Since the free energy landscape is relatively flat everywhere, even regions where the work function W is close to zero will give rise to non-negligible contributions toΔG. For more strongly bound systems, contributions from regions of large negative W will dominate, and ΔG will not depend so strongly upon the choice ofrc. However, it seems that it is at least possible to say that α-D -glucopyranose prefers para-methyl-phenol over indole, while β-D -glucopyranose prefers indole over para-methyl-phenol. This is somewhat consistent with Laughrey et al.,14 who see an increased affinity of glucose for a tryptophan compared to a phenol. Furthermore, by changing the equatorial hydroxyl group at C4 to axial, thereby transforming the glucose to a galactose, the affinity for Trp decreases.

Table 1.

Standard free energies of binding, calculated using different values for the cutoff for the bound region, rc. The first value is calculated using Eq. 4 and the approximation described in the previous section. The second value is calculated using Eq. 3 assuming spherical symmetry. ΔWare the well depths of the one dimensional pmfs in Fig. 3. All values are given in kJ∕mol.

  rc=0.85 nm rc=0.9 nm rc=0.95 nm ΔW
α-glucose-indole −0.8∕−1.6 −1.3∕−1.9 −1.8∕−2.2 −0.9
β-glucose-indole −1.1∕−1.6 −1.5∕−1.9 −1.8∕−2.2 −1.2
α-glucose– p -methyl-phenol −1.7∕−1.8 −2.1∕−2.1 −2.4∕−2.4 −1.4
β-glucose– p -methyl-phenol −0.6∕−1.6 −1.1∕−1.9 −1.5∕−2.3 −1.0

Finally, it should be noted that in the present model, rotations aboutϕ, α, and γ were neglected, essentially treating both the aromatic and sugar molecules as symmetric disks. Of course, this constitutes an approximation, especially since the glucose molecule is not symmetric with respect to rotations around its normal, and, strictly speaking, not even flat. There are certainly conformations that are more favorable than others even along these coordinates, and how that would affect the calculated ΔG is not investigated in this work. With present available resources, such approximations were necessary since convergence of the data would become a critical issue if an attempt was made to decompose the pmf into all six internal coordinates. Each additional coordinate would require an estimated increase in the simulation lengths by approximately an order of magnitude.

CONCLUSIONS

Surveys of the reported crystal structures for proteins that bind glucose reveal that nearly all of the sugar-binding sites in such proteins contain amino acids with aromatic side chains, such as Phe, Tyr, and Trp. These residues are believed to promote sugar-binding and selectivity by allowing the nearly flat face of the sugar molecule to interact in a stacked arrangement with the aromatic side chain. Recent experimental studies14 as well as MD simulations13 give a magnitude of the binding energy between glucose and such aromatic residues of aboutkBT.

The present calculation found an energetically small but significant binding affinity for glucose molecules for the hydrophobic faces of both indole and para-methyl-phenol. These affinities apparently arise from hydrophobic association, with virtually no contribution from hydrogen bonding, and are not strongly dependent on the structural features of the aromatic groups, apart from their extended flat surfaces. Absolute binding free energies were estimated to be 0.6–2.4 kJ∕mol, i.e., less thankBT, and were found to depend strongly on the cutoff radius used to define the bound state. However, no large differences between the different solutes were found. These values correlate well with previous experimental10, 14 and MD (Ref. 13) simulation values, but are considerably lower than most ab initio results.12 These numbers constitute extremely weak affinities, and it is clear that some other element is needed to reach the binding free energies of real sugar-binding sites, which typically are an order of magnitude greater. Hydrogen bonds between the polar groups that line sugar molecule and polar protein residues are probably needed to help desolvate the sugar, but do not contribute significantly to the total binding free energy, since hydrogen bonds to water are likely to be of the same magnitude from an energetic point of view. It is worth noting, however, that in many sugar-binding proteins, at least one of the hydrogen bonding side chains in the binding site is often a charged acid group, with much stronger potential hydrogen bonding interaction energies.1, 37

On the other hand, the multidimensional free energy maps presented here show that while the sugar-aromatic complex could be easily disrupted in aqueous solution by the molecules simply sliding sideways as they move apart, if forced to move straight out in a perpendicular fashion from a stacked arrangement, more significant kinetic barriers oppose the separation. Such sideways motions might be restricted in the binding sites of proteins, e.g., due to hydrogen bonds that could help stabilize the bound complex, thereby giving protein-sugar ligand interactions significant kinetic stability without requiring strong thermodynamic complexation.

ACKNOWLEDGMENTS

We thank Ben Widom, Phil Mason, and Lars Berglund for helpful discussions. This project was supported by a grant from the National Institutes of Health (Grant No. GM63018). J.W. thanks the Sweden-America Foundation for financial support.

References

  1. Quiocho F. A., Pure Appl. Chem. 61, 1293 (1989). 10.1351/pac198961071293 [DOI] [Google Scholar]
  2. Quiocho F. A., Vyas N. K., and Spurlino J. C., Trans. Am. Crystallogr. Assoc. 25, 23 (1991). [Google Scholar]
  3. Durek T., Torbeev V. Y., and Kent S. B. H., Proc. Natl. Acad. Sci. U.S.A. 104, 4846 (2007). 10.1073/pnas.0610630104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Boraston A. B., Bolam D. N., Gilbert H. J., and Davies G. J., Biochem. J. 382, 769 (2004). 10.1042/BJ20040892 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ferrand Y., Crump M. P., and Davis A. P., Science 318, 619 (2007). 10.1126/science.1148735 [DOI] [PubMed] [Google Scholar]
  6. Meynier C., Feracci M., Espeli M., Chaspoul F., Gallice P., Schiff C., Guerlesquin F., and Roche P., Biophys. J. 97, 3168 (2009). 10.1016/j.bpj.2009.09.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Lundquist J. J. and Toone E. J., Chem. Rev. (Washington, D.C.) 102, 555 (2002). 10.1021/cr000418f [DOI] [PubMed] [Google Scholar]
  8. Divne C., Stahlberg J., Reinikainen T., Ruohonen L., Pettersson G., Knowles J. K. C., Teeri T. T., and Jones T. A., Science 265, 524 (1994). 10.1126/science.8036495 [DOI] [PubMed] [Google Scholar]
  9. Sujatha M. S., Sasidhar Y. U., and Balaji P. V., Protein Sci. 13, 2502 (2004). 10.1110/ps.04812804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Vandenbussche S., Díaz D., Fernández-Alonso M. C., Pan W., Vincent S. P., Cuevas G., Cañada F. J., Jiménez-Barbero J., and Bartik K., Chem. Eur. J. 14, 7570 (2008). 10.1002/chem.200800247 [DOI] [PubMed] [Google Scholar]
  11. Martin J. N., Munoz E. M., Schwergold C., Souard F., Asensio J. L., Jiménez-Barbero J., Cañada J., and Vicent C., J. Am. Chem. Soc. 127, 9518 (2005) 10.1021/ja050794n [DOI] [PubMed] [Google Scholar]; Mazik M., Cavga H., and Jones P. G., J. Am. Chem. Soc. 127, 9045 (2005) 10.1021/ja043037i [DOI] [PubMed] [Google Scholar]; Espinosa J. F., Montero E., Vian A., García J. L., Dietrich H., Schmidt R. R., Martín-Lomas M., Imberty A., Cañada F. J., and Jiménez-Barbero J., J. Am. Chem. Soc. 120, 1309 (1998). 10.1021/ja972291q [DOI] [Google Scholar]
  12. Fernández-Alonso M. C., Cañada F. J., Jiménez-Barbero J., and Cuevas G., J. Am. Chem. Soc. 127, 7379 (2005). 10.1021/ja051020+ [DOI] [PubMed] [Google Scholar]
  13. Mason P. E., Lerbret A., Saboungi M. -L., Neilson G. W., Dempsey C. E., and Brady J. W., “Glucose interactions with model peptide,” Proteins: Struct., Funct., Bioinf. (submitted). [DOI] [PMC free article] [PubMed]
  14. Laughrey Z. R., Kiehna S. E., Riemen A. J., and Waters M. L., J. Am. Chem. Soc. 130, 14625 (2008). 10.1021/ja803960x [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gilson M. K., Given J. A., Bush B. L., and McCammon J. A., Biophys. J. 72, 1047 (1997). 10.1016/S0006-3495(97)78756-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Marchessault R. H. and Peréz S., Biopolymers 18, 2369 (1979). 10.1002/bip.1979.360180925 [DOI] [Google Scholar]
  17. Lee M. S. and Olson M. A., Biophys. J. 90, 864 (2006). 10.1529/biophysj.105.071589 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Shoup D. and Szabo A., Biophys. J. 40, 33 (1982) 10.1016/S0006-3495(82)84455-X [DOI] [PMC free article] [PubMed] [Google Scholar]; Jorgensen W. L., J. Am. Chem. Soc. 111, 3770 (1989). 10.1021/ja00192a057 [DOI] [Google Scholar]
  19. Swanson J. M. J., Henchman R. H., and McCammon J. A., Biophys. J. 86, 67 (2004). 10.1016/S0006-3495(04)74084-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Woo H. -J. and Roux B., Proc. Natl. Acad. Sci. U.S.A. 102, 6825 (2005). 10.1073/pnas.0409005102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Beutler T. C. and van Gunsteren W. F., J. Chem. Phys. 100, 1492 (1994). 10.1063/1.466628 [DOI] [Google Scholar]
  22. Kumar S., Rosenberg J. M., Bouzida D., Swendsen R. H., and Kollman P. A., J. Comput. Chem. 13, 1011 (1992). 10.1002/jcc.540130812 [DOI] [Google Scholar]
  23. Hess B., Kutzner C., van der Spoel D., and Lindahl E., J. Chem. Theory Comput. 4, 435 (2008). 10.1021/ct700301q [DOI] [PubMed] [Google Scholar]
  24. MacKerell A. D., Bashford D., Bellott M., Dunbrack R. L., Evanseck J. D., Field M. J., Fischer S., Gao J., Guo H., Ha S., Joseph-McCarthy D., Kuchnir L., Kuczera K., Lau F. T. K., Mattos C., Michnick S., Ngo T., Nguyen D. T., Prodhom B., Reiher W. E., Roux B., Schlenkrich M., Smith J. C., Stote R., Straub J., Watanabe M., Wiorkiewicz-Kuczera J., Yin D., and Karplus M., J. Phys. Chem. B 102, 3586 (1998). 10.1021/jp973084f [DOI] [PubMed] [Google Scholar]
  25. Kuttel M., Brady J. W., and Naidoo K. J., J. Comput. Chem. 23, 1236 (2002). 10.1002/jcc.10119 [DOI] [PubMed] [Google Scholar]
  26. Bjelkmar P., Cuendet M. A., Hess B., and Lindahl E., J. Chem. Theory Comput. 6 (2), 459 (2010). 10.1021/ct900549r [DOI] [PubMed] [Google Scholar]
  27. Jorgensen W. L., Chandrasekhar J., Madura J. D., Impey R. W., and Klein M. L., J. Chem. Phys. 79, 926 (1983) 10.1063/1.445869 [DOI] [Google Scholar]; Durell S. R., Brooks B. R., and Ben-Naim A., J. Phys. Chem. 98, 2198 (1994). 10.1021/j100059a038 [DOI] [Google Scholar]
  28. Hess B., J. Chem. Theory Comput. 4, 116 (2008). 10.1021/ct700200b [DOI] [PubMed] [Google Scholar]
  29. Miyamoto S. and Kollman P. A., J. Comput. Chem. 13, 952 (1992). 10.1002/jcc.540130805 [DOI] [Google Scholar]
  30. Darden T., York D., and Pedersen L., J. Chem. Phys. 98, 10089 (1993) 10.1063/1.464397 [DOI] [Google Scholar]; Essmann U., Perera L., Berkowitz M. L., Darden T., Lee H., and Pedersen L. G., J. Chem. Phys. 103, 8577 (1995). 10.1063/1.470117 [DOI] [Google Scholar]
  31. Bussi G., Donadio D., and Parrinello M., J. Chem. Phys. 126, 014101 (2007). 10.1063/1.2408420 [DOI] [PubMed] [Google Scholar]
  32. Parrinello M. and Rahman A., J. Appl. Phys. 52, 7182 (1981). 10.1063/1.328693 [DOI] [Google Scholar]
  33. Trzesniak D., Kunz A. -P. E., and van Gunsteren W. F., ChemPhysChem 8, 162 (2007). 10.1002/cphc.200600527 [DOI] [PubMed] [Google Scholar]
  34. Brooks B. R., C. L.BrooksIII, MacKerell J. A. D., Nilsson L., Petrella R. J., Roux B., Won Y., Archontis G., Bartels C., Boresch S., Caflisch A., Caves L., Cui Q., Dinner A. R., Feig M., Fischer S., Gao J., Hodoscek M., Im W., Kuczera K., Lazaridis T., Ma J., Ovchinnikov V., Paci E., Pastor R. W., Post C. B., Pu J. Z., Schaefer M., Tidor B., Venable R. M., Woodcock H. L., Wu X., Yang W., York D. M., and Karplus M., J. Comput. Chem. 30, 1545 (2009). 10.1002/jcc.21287 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hub J. S. and de Groot B. L., Biophys. J. 91, 842 (2006). 10.1529/biophysj.106.081406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nishida Y., Hori H., Ohrui H., and Meguro H., J. Carbohydr. Chem. 7, 239 (1988). 10.1080/07328308808058917 [DOI] [Google Scholar]
  37. Jeffrey G. A. and Saenger W., Hydrogen Bonding in Biological Structures (Springer-Verlag, Berlin, 1991). [Google Scholar]

Articles from The Journal of Chemical Physics are provided here courtesy of American Institute of Physics

RESOURCES