Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Oct 30.
Published in final edited form as: Phys Chem Chem Phys. 2015 Oct 28;17(43):29262–29270. doi: 10.1039/c5cp04668h

Cation-π Interactions: Computational Analyses of the Aromatic Box Motif and the Fluorination Strategy for Experimental Evaluation

Matthew R Davis a, Dennis A Dougherty a
PMCID: PMC4627595  NIHMSID: NIHMS731164  PMID: 26467787

Abstract

Cation-π interactions are common in biological systems, and many structural studies have revealed the aromatic box as a common motif. With the aim of understanding the nature of the aromatic box, several computational methods were evaluated for their ability to reproduce experimental cation-π binding energies. We find the DFT method M06 with the 6-31G(d,p) basis set performs best of several methods tested. The binding of benzene to a number of different cations (sodium, potassium, ammonium, tetramethylammonium, and guanidinium) was studied. In addition, the binding of the organic cations NH4+ and NMe4+ to ab initio generated aromatic boxes as well as examples of aromatic boxes from protein crystal structures were investigated. These data, along with a study of the distance dependence of the cation-π interaction, indicate that multiple aromatic residues can meaningfully contribute to cation binding, even with displacements of more than an angstrom from the optimal cation-π interaction. Progressive fluorination of benzene and indole was studied as well, and binding energies obtained were used to reaffirm the validity of the “fluorination strategy” to study cation-π interactions in vivo.

Introduction

The importance of noncovalent interactions in determining macromolecular structure and molecular associations in biology is well appreciated. Nevertheless, it remains a challenging task to confirm the importance of any particular noncovalent interaction, and even more difficult to quantify one. Different issues arise in attempting to evaluate hydrophobic effects, hydrogen bonds, or ion-pairing interactions, as each has its own unique characteristics. For some time we have been evaluating a particular noncovalent interaction – the cation-π interaction – in a range of proteins, most typically neuroreceptors and ion channels.14 Only in recent years has structural information become available for such systems. Remarkably, the available structures point to a general structural motif for cation-π interactions, the so-called aromatic box.2, 57 This is formed by 3 to 5 aromatic side chains that can encapsulate the positive charge of a cationic molecule. This molecular cage has been seen in many receptors and in other binding sites for cationic guests. It is not known what the advantage of this motif is, nor is it clear to what extent 3–5 aromatics can simultaneously contribute to the binding of a cation.

Even if a structure is available and it shows a close contact between an ion and a π system, it cannot be assumed that a functionally significant noncovalent interaction exists. Over the past 20 years we have addressed this issue using non-canonical amino acid mutagenesis.4, 8 The aromatic of interest (the side chain of a phenylalanine (Phe), tyrosine (Tyr), or tryptophan (Trp)) is progressively fluorinated. Fluorine is well known to be deactivating in a cation-π interaction, and its effects are typically additive. One thus expects a correlation between protein function and/or ligand binding and degree of fluorination if a cation-π interaction is important. In a number of systems we have found a linear trend between the activation of a receptor by a cationic ligand and the calculated binding of a sodium ion to a series of fluorinated aromatic rings (indoles to mimic the side chain of Trp or benzenes to mimic Phe/Tyr). We considered this compelling evidence for a cation-π interaction.

This “fluorination strategy” is surprisingly general. Linear plots have been seen in over 30 cases, spanning a range of proteins and ligand types. Drug-like molecules with widely differing structures have been studied, including quaternary ammonium ions (acetylcholine) and protonated amines, including primary (glycine, GABA, serotonin), secondary (epibatidine, cytidine, varenicline) and tertiary (nicotine). In addition, more complex cations such as granisetron, ondansetron,9 and the guanidinium toxin tetrodotoxin (TTX)10 have shown linear fluorination plots. In contrast, a study of another guanidinium compound, meta-chlorophenyl biguanide (mCPBG) binding to the 5-HT3 (serotonin) receptor showed behavior that was difficult to interpret.11 In all cases we compared experimental data to the binding of Na+ to the appropriate aromatics. While it may be reasonable to assume that a primary ammonium ion (RNH3+) is well modeled by Na+, more complex ions such as a quaternary ammonium or a guanidinium show much different charge distributions (Figure 1) and so may display different binding behaviors.

Fig 1.

Fig 1

Cations examined in this study. (a) the ammonium ion, (b) the tetramethylammonium ion, and (c) the guanidinium ion. Pictured are molecular structures and potential energy surfaces (Geometry optimized M06/6-31G(d,p) ranging from +400 (red) to +700 (blue) kJ/mol.

To address this issue we have computationally evaluated fluorination effects on cation-π interactions involving the more complex cations ammonium (NH4+), tetramethylammonium (NMe4+), and guanidinium (Figure 1). Substituent effects on cation-π interactions and related noncovalent interactions involving benzene have been the subject of several recent investigations, including some with very high levels of theory.1214 These studies have revealed some unanticipated effects in such noncovalent interactions. The more modest goals of the present work involve the trends in cation-π binding energies in response to progressive fluorination for several combinations of cation and aromatic. When constrained to a cation-π binding geometry, these larger cations mimic the trends seen with Na+ as probe ion.

Methods

All calculations were performed using Spartan 1415 unless otherwise stated.

Calculating Cation-π Energies

Cation-π interactions to benzene and derivatives were evaluated with full geometry optimization at M06/6-31G(d,p)16 with energies calculated using equation 1:

BE=(E+Eπ)ETOT (1)

where ETOT is the total complex energy, E+ is the energy of the cation alone, and Eπ is the energy of the aromatic system, whether it is a single aromatic or an aromatic box bereft of a cation. In this form, a positive BE signifies a favorable interaction.

Distance Dependence

Distance dependence data were obtained by constraining the distance of the ion in question to benzene and performing a constrained geometry-optimized M06/6-31G(d,p) calculation. This distance constraint was increased iteratively from the starting (optimal) geometry, and the binding energy calculated using equation 1. These data were fit using the KaleidaGraph™ software (Synergy Software).

Aromatic Box Calculations

Ab initio aromatic boxes with a complexed ion were generated using Spartan 14. Geometry-minimized (M06/6-31G(d,p) ≡ M06/6-31G**) structures were obtained for ammonium bound to 3 or 4 benzene molecules and for tetramethylammonium binding to 3, 4, or 5 benzene molecules. The binding energies were obtained using equation 1, where Eπ is the energy of the aromatic box without the tetramethylammonium ion.

Existing aromatic box motifs were imported from the Protein Data Bank using Gaussview 517, and protons were added to the structures. Cation-π binding energies were calculated using Gaussian09.18 Aromatic boxes were generated by deleting backbone and β atoms of the amino acids as well as the noncationic portion of acetylcholine or trimethyllysine (to generate NMe4+ in both cases). The cation-π binding energy was computed by taking the single-point energy (M06/6-31G(d,p)) with and without the tetramethylammonium ion. Individual cation-π interactions between tetra-methylammonium and aromatic residues were found by deleting all but the cation and aromatic group of interest and computing a single point energy.

Fluorination Series

In most cases, full geometry optimization was performed for complexes of cations to fluorinated aromatics. In some cases, however, this resulted in geometries that would not be considered cation-π interactions. In such cases, fluorines were appended to the benzene ring with C-F bond distances set to values determined by optimization of the isolated aromatic system. This allowed determination of a cation-π binding energy with the cation at the position appropriate to the benzene-ion complex. We will refer to this as the single-point binding energy. The structures were then allowed to relax to the lowest-energy geometry in the gas phase. The binding energy itself was calculated using equation 1.

Several cation-π structures represented relatively shallow local minima, and were challenging to isolate. For the three separate orientations of the tetramethylammonium ion (one, two and three methyl groups facing the benzene), starting structures were the molecular mechanics-minimized structure. This enforced a symmetry on the structure that was maintained while optimizing at a higher level of theory. To achieve a stacked conformation for guanidinium-benzene, the distance from the center of the guanidinium molecule to three meta carbons of benzene was constrained to 3.4 Å and the (constrained) equilibrium geometry was calculated at M06/6-31G(d,p). The final stacked geometry was obtained using this restricted geometry as a starting point for full geometry optimization.

Results

Choice of Computational Method

We considered several different levels of theory to find which offers the best combination of accuracy and computational efficiency. Since one goal of the present work is to enable evaluation of larger systems related to protein crystal structures, we did not consider more computationally expensive correlational methods such as CCSD, nor did we consider overly large basis sets of the sort that have often been used to evaluate various methods. We considered four prototype cation-π interactions for which experimental gas phase data are available (Table 1).1922 Some time ago, we computed the binding of Na+ to a number of simple aromatic systems.23 We found that a relatively simple level of theory – HF/6-31G(d,p) - performed remarkably well. As shown in Table 1, it remains the most accurate level of theory considered for both Na+ and K+ binding to benzene. HF does, however, perform poorly on the larger ions. MP2 performed inconsistently depending on the ion studied. Also, several years ago we evaluated the most popular DFT method at the time – B3LYP – and found the results to be disappointing. Subsequent studies have confirmed that B3LYP is not especially effective for noncovalent interactions without complex dispersion correcting potentials.2429

Table 1.

Binding Energies to Benzene (kcal/mol)a

Ion HF
6-31G(d,p)
HF
6-311+G(d,p)
MP2
6-31G(d,p)
MP2
6-311+G(d,p)
B3LYP
6-31G(d,p)
B3LYP
6-311+G(d,p)
M06
6-316(d,p)
M06
6-311+G(d,p)
Expt.
Na+ 27.1 23.2 29.9 24.9 28.4 23.8 26.8 22.2 28.0
K+ 19.3 15.6 23.8 19.5 20.8 16.1 20.3 16.3 19.2
NH4+ 15.3 14.5 19.0 19.2 17.5 15.8 19.5 18.2 19.3
NMe4+ 6.6 5.6 11.3 11.7 7.7 6.0 10.8 10.0 9.4
Errorb 1.9 4.3 2.2 1.5 1.4 3.6 0.98 2.6
a

Computed values are ΔE; experimental values are ΔH. All data in the gas phase.

b

Mean absolute error of the four values.

In recent years, a wide array of DFT methods has been developed, each targeting a specific computational challenge. Several methods have been specifically parameterized for noncovalent interactions, including M06-L, M06, M06-2X and others. A previous comparison of over a dozen DFT methods showed that M06 and M06-2X both performed well.16 For cation-π interactions, we find that M06 performs very well (Table 1), even with a modest basis set. The most significant effect is a substantial improvement in the binding to tetramethylammonium. Surprisingly, M06-2X overestimated the binding energies to benzene of Na+ and K+ by 2 and 4 kcal/mol, respectively, and so we did not consider it further. Clearly, when considering DFT methods one must make a choice on a case-by-case basis. Interestingly, for all the methods considered except MP2, the larger 6-311+G(d,p) basis set gave poorer results than 6-31G(d,p). This likely reflects a cancelation of effects to some extent, but the results are consistent across the various cations considered. Given the results of Table 1, we have settled on M06/6-31G(d,p) as the optimal level of theory. It gives quite good results for all the systems considered. We note that the simpler HF/6-31G(d,p) method reproduces all the trends seen with M06; the largest differences are seen in the quantitative results for tetramethylammonium and guanidinium ions.

Cation-π interactions with Benzene

The cation-π binding energies and optimized geometries were calculated at M06/6-31G(d,p) for a number of different ions. For the simple metal ions Na+ and K+ as well as NH4+, the cation binds to benzene in one conformation: directly centered in the middle of the benzene ring (en face). To evaluate the tetramethylammonium-benzene interaction, the complex was minimized in three separate cation-π geometries: with one, two, and three methyl groups en face (Figure 2). Geometries with more methyl groups en face had higher binding energies, consistent with previously reported work.30

Fig. 2.

Fig. 2

Examples of large cations binding to benzene. (a) Tetramethylammonium ions binding with one, two and three methyl groups towards the face of benzene. (b) Guanidinium ions binding in T-shaped and stacked conformations. (c) Tetramethylammonium binding to indole. Cation-π binding energies computed at M06/6-31G(d,p).

Cation-π energies of guanidinium ions to benzene were calculated in two separate conformations: T-shaped and stacked (see Figure 2). As has been seen in other studies,31, 32 T-shaped interactions have a much stronger binding energy. Interestingly, according to M06/6-31G(d,p) the guanidinium ion adopts a propeller shape in the stacked conformation, with the amine groups adopting a 25 degree torsion angle from the plane of the cation. All cation-π binding energies are listed in Table 2.

Table 2.

Cation-π Binding Energies (kcal/mol)a

Benzene F-Benzene F2-Benzene F3-Benzene
1,3 1,4 1,2,3 1,3,5
Sodium 26.8 22.7 19.2 18.9 16.1 15.6
Potassium 20.3 17.7 14.6 14.4 12.0 11.7
Ammonium 19.5 16.2 12.8 12.9 10.1 10.1
Tetramethylammonium (1) 6.5 5.3b 4.3b - 2.9b 3.0
Tetramethylammonium (2) 8.7 7.2b 5.5b 5.7 4.0b -
Tetramethylammonium (3) 10.8 9.0b 7.1b 7.0 5.3b 5.4
Guanidinium (T-Shaped) 14.9 12.3 10.4 9.8 7.0b 7.9
Guanidinium (Stacked) 8.6 7.2b 5.8b - 4.6b 5.1
a

M06/6-31G(d,p) calculations; full geometry optimization unless otherwise noted.

b

Binding energy obtained via single point method

Distance Dependence

In biological systems cations might bind to aromatic rings at non-optimal geometries (the optimum being essentially van der Waals contact), due to other constraints imposed by the protein and/or ligand. This might be especially true for the aromatic box motif, as discussed below. As such, it is important to probe non-optimal binding geometries. To evaluate this situation, we have determined the distance dependence of a cation-π interaction to benzene for Na+, K+, NH4+, and tetramethylammonium (1-methyl down). As shown in Figure 3, similar trends are seen, and the cation-π interaction distance dependence is not overly steep. For all three cations, displacement by a full Å from the optimal distance results in roughly a 40% drop in binding energy. When fit to a 1/rn function, n values of 1.9, 2.0, 2.8, and 2.8 are obtained for Na+, K+, NH4+, and tetramethylammonium, respectively. These results indicate that even a non-optimal cation-π interaction can contribute significantly to ligand binding or protein stability.

Fig. 3.

Fig. 3

Distance dependence of cation-π binding for sodium, potassium, ammonium, and NMe4+ (one methyl down) ions. Binding energy is relative to the optimal binding, and distance is from the center of the benzene to Na+ or K+; N atom of ammonium; or methyl C of tetramethylammonium. Complexes with very short distances deviate from this electrostatic plot and were not included in the fitting.

The Aromatic Box

In recent years, it has become apparent that there is a common motif for a biological cation-π binding site – the so-called aromatic box.2, 57 This is formed by 3 – 5 aromatic side chains that encapsulate the positive charge of a cationic molecule (Figures 4, 5). An early and especially compelling example was seen in the ACh binding proteins (AChBP), which provide excellent models for the nicotinic acetylcholine receptor (nAChR) binding site.33, 34 Five aromatics form a box around the quaternary ammonium of ACh and around cationic sites of other ligands in this much discussed system (Figure 5a). Another frequently seen box is found in proteins that bind a trimethyllysine (KMe3+) side chain of a histone protein.3539 The KMe3+ structure is a common and critical motif as part of the “histone code”. A number of crystal structures of transcriptional regulators bound to KMe3+ have appeared, and all show an “aromatic box” that binds the quaternary ammonium ion through cation-π interactions (Figure 5b).

Figure 4.

Figure 4

Geometry-optimized complexes of four benzenes to (a) one tetramethylammonium and (b) one ammonium ion. Binding energies computed at M06/6-31G(d,p).

Figure 5.

Figure 5

The aromatic box motif binding sites of (a) ACh binding to AChBP (PDB: 3WIP).34 Residues are labeled according to their conventional designation in the nAChR. (b) Trimethyllysine binding to the polycomb chromodomain from Drosophila histone H3 protein (PDB: 1PFB).43 Distances (Å) are from a carbon to the ring centroid; the calculated optimal distance for such an interaction is 3.6Å.

We first probed the aromatic box by studying the simultaneous binding of 4 benzene rings to a tetramethylammonium.40 As seen in Table 3 and Figure 4a, a tetramethylammonium can easily bind to multiple benzenes simultaneously. The total binding energy is 39.2 kcal/mol (Table 3). Each pairwise interaction has three methyl groups en face, and so four optimal interactions of this sort would produce a binding energy of 43.2 kcal/mol (4 × 10.8; listed as “Theoretical” in Table 3). The individual binding energies are nearly optimal, as evidenced by the “Sum” column in Table 3. Similar results are seen for 2 or 3 or 5 benzenes (Supporting Information), although the 5th benzene binds less tightly than the others.

Table 3.

Binding energies (kcal/mol) to a 4-benzene aromatic box

Binding Energy Theoreticala Sumb Discrepancyc
NH4+ 59.6 78.0 71.1 18.4
NMe4+ 39.2 43.2 42.3 4.0
a

Four times the binding energy for a single ion-benzene complex.

b

Four times the binding energy of a single cation-box aromatic.

c

The difference between the theoretical binding energy and the calculated binding energy.

Receptors that bind primary ammonium ions (RNH3+) such as GABA or serotonin also have a cluster of aromatic amino acids at the agonist binding site, so we considered the binding of ammonium to an aromatic box. Again, four benzenes can fit comfortably around an ammonium ion with no obvious steric conflicts (Figure 4b). Experimental studies of K+, which is similar in size to ammonium, have shown that multiple benzenes can complex the ion.21, 41, 42 The total binding energy of ammonium to four benzenes is 59.6 kcal/mol, but now the individual interactions are not optimal; the Sum value (four times the binding energy of a single cation-box aromatic) differs considerably from the true binding energy. The even larger difference between the theoretical value and the binding energy could reflect a small negative cooperativity due to conflicts between induced dipoles generated by the binding. Such induced dipoles are expected to be smaller when tetramethylammonium is the ion, consistent with the smaller deviation seen in Table 3.

The aromatic box motif found in biological systems uses Phe, Tyr, and Trp, although there is a general bias toward Tyr and Trp.32 Based on studies using Na+ as a probe ion,44 Trp is expected to provide the strongest cation-π binding. Indeed, calculated binding energies for tetramethylammonium (3 methyls down) to benzene (Phe), phenol (Tyr), and indole (Trp) are 10.8, 12.5, and 15.1 kcal/mol, respectively. As in previous studies, the cation is significantly offset from the center of the phenol,45 and it lies over the six-membered ring of the indole.32

We replaced the ACh of AChBP with tetramethyl-ammonium and evaluated the total binding energy as well as the interaction to each individual ring (Figure 5a, Table 4). We find a total binding energy of 28.9 kcal/mol, with four of the five aromatics contributing significantly. The largest contribution comes from TrpB, the residue that has been shown experimentally to make a functionally important cation-π interaction in most nAChRs.4

Table 4.

Binding energies (kcal/mol) of tetramethylammonium to protein aromatic boxes.a

AChBP Histone binding Protein
Aromatic Box 28.9 Aromatic Box 32.3
Predictedb 67.7 Predictedb 42.7
TrpB 8.3 Trp50 13.9
TyrC2 8.0 Trp47 12.0
TyrC1 7.5 Tyr26 9.0
TrpD 4.8
TyrA 2.3

Sum 30.9 Sum 34.9
a

M06/6-31G(d,p) calculations; single-point energies.

b

Binding energy if all interactions were optimal and independent.

We applied a similar analysis to the binding of KMe3+ to a histone binding protein (Figure 5b).43 We find a total binding energy of 32.3 kcal/mol for the tetramethylammonium. Trp47 and Trp50 contribute the bulk of the binding energy, but Tyr26 is still significant (Table 4). Based on the distance dependence of tetramethylammonium binding from Figure 3, it is evident that all eight potential methyl•••aromatic interactions of Figure 5b contribute to the total binding energy. For both protein structures, the calculated total binding energy is slightly less than the sum of the individual interactions, as was seen in the model structures. Also for both systems, the total binding energy is much less than what would be expected if all cation-π interactions were optimal; deviations of 39 and 10 kcal/mol are seen for AChBP and the histone binding protein, respectively.

Fluorination Studies

In our experimental studies of a possible cation-π interaction to a Phe or a Tyr,4 we have used 4-F-Phe, 3,5-F2-Phe, and 3,4,5-F3-Phe as our cation-π interaction probes, while for Trp we use the 5-, 5,7-, 5,6,7-, and 4,5,6,7-fluorotryptophans. As such, we have focused on the analogous benzenes and indoles for our computational studies (Tables 2 and 5). For the most part these studies were straightforward: the ion remained roughly centered over the ring and the cation-π binding energy progressively decreased as fluorines were introduced.

Table 5.

Cation-π Binding Energies (kcal/mol)a

Indole F-Indole F2-Indole F3-Indole F4-Indole
5 5,7 5,6,7 4,5,6,7
Sodium 32.7 28.8 25.7 22.5 18.9
Tetramethylammonium (3) 15.2 13.5 11.9 10.1 9.1b
a

M06/6-31G(d,p) calculations; full geometry optimization unless otherwise noted

b

Binding energy obtained via single point method

The exceptions occurred with the larger ions tetramethylammonium and guanidinium binding to the fluorobenzenes and highly fluorinated indoles. In these cases full geometry optimization led to edge-on binding, with the cation attracted to the fluorine(s). Full details of these results are presented in the Supporting Information.

While these edge-on structures are interesting with regard to possible gas phase studies, the focus of the present work is the binding of cationic ligands to proteins, where structural constraints are likely to discourage edge-on geometries. We developed two strategies to determine the effect of fluorination on such constrained systems. First, we simply added fluorines to the ion-benzene complex (which is in an en face cation-π geometry) and performed single point calculations. Alternatively, we could use the symmetrical 1,4- and 1,3,5-fluorobenzenes, for which the ions stayed centered over the ring. The two strategies produced similar results, with the expected trends on progressive fluorination.

Comparison of fluorination calculations with in vivo data

As noted above, we previously compared experimental results for a fluorination study with binding energies for Na+ to the analogous aromatic.4 We can now preform more realistic comparisons. For primary amines, the previously described cation-π interaction between glycine and fluorinated phenylalanine residues in the glycine receptor46 was re-evaluated using the results for the binding of an ammonium ion to fluorinated benzenes. As shown in Figure 6, a linear trend between agonist potency and cation-π binding energy is still obtained.

Figure 6.

Figure 6

Cation-π binding plots for (a) Glycine at the glycine receptor using computed cation-π binding energies for ammonium ions to fluorinated benzenes using in vivo data collected for activation of Glycine receptor mutants46 and (b) acetylcholine at the α4β2 nicotinic acetylcholine receptor (nAChR) using in vivo data collected for activation of α4β2 nAChR mutants47 using cation-π energies for tetramethylammonium ions binding to fluorinated indoles.

We have several demonstrations of strong cation-π interactions between ACh and nicotinic ACh receptors.47 The relevant calculation then is tetramethylammonium binding to fluorinated indoles. Figure 6 shows such an analysis, considering tetramethylammonium binding to the indole ring with three methyl groups en face (Figure 6). In each case, the calculated cation-π binding energies line up well with experimental results, confirming that the results of such an analysis of receptor binding are independent of the nature of the probe cation.

Discussion

The importance of the cation-π interaction in stabilizing protein structures and mediating protein-ligand interactions is well-established, and computational methods have always played an important role in evaluating cation-π interactions. Here we evaluated several modern computational protocols to find one that could be conveniently applied by experimentalists to evaluate potential cation-π interactions in biological structures. Gratifyingly, we find that a not overly complex method – M06/6-31G(d,p) – provides very good results on systems for which experimental data are available. For larger cations such as tetramethylammonium it provides a significant improvement over HF methods. Note that we did not employ counterpoise corrections or other methods to account for likely basis set superposition error (BSSE). It may well be that there is a compensation effect, where the relatively small basis set and the neglect of BSSE roughly compensate for each other. In the end, we feel the method espoused here is quite adequate for evaluating cation-π interactions.

The aromatic box motif is common in protein cation-π binding sites. This suggests that multiple aromatics can simultaneously have favorable interactions with a cation, although it seems likely that not all interactions would be optimal in a protein structure. This prompted us to revisit the issue of the distance dependence of the cation-π interaction. As in an earlier, preliminary study,1 we find that the distance dependence is not steep (Figure 3). For example, it could be argued that the cation-π interaction should be treated as an ion-quadrupole interaction. Ideally, an ion-quadrupole interaction should show a 1/r3 distance dependence for the binding energy. However, the cation-π interaction is better modeled by a 1/r2 distance dependence for the prototype Na+ and K+ ions. For larger ions a roughly 1/r3 distance dependence is seen, likely reflecting the increased importance of polarizability for the larger ions. In any case, displacement of the ion by a full angstrom from van der Waals contact (the optimal binding arrangement) diminishes the interaction by less than half.

We have taken two approaches to evaluating the aromatic box motif. First, we evaluated the binding of multiple benzenes to both tetramethylammonium and ammonium ions under optimal conditions. We find that four aromatics can easily bind simultaneously to either ion. For tetramethylammonium, the total binding energy is close to what would be expected if all four individual benzene tetramethylammonium interactions were optimal and independent. For ammonium there is a significant deviation from expectation based on four optimal ammonium•••benzene interactions. Mostly, this is because the individual interactions are not optimal; clearly the ammonium ion is a bit too small to perfectly bind four benzenes simultaneously. There is a second, non-additivity factor that we attribute to a clash between induced dipoles in the four benzenes. Nevertheless, the global conclusion is that multiple aromatics can readily contribute to the binding of a cation through cation-π interactions.

We next considered two exemplary aromatic boxes from protein structures: the acetylcholine binding protein (AChBP) and a Polycomb chromodomain binding to a peptide model of trimethyllysine from the histone H3 protein. In each case we truncated the ligand (ACh or trimethyllysine) to a tetramethylammonium. We anticipated that all contacts between tetramethylammonium and an aromatic of the box would not be optimal, and indeed that is the case. As shown in Table 4, the “predicted” binding energy – that which would arise if each cation-π interaction was optimal – is substantially larger than the actual binding energy. Recalling that an optimal cation-π interaction for tetramethylammonium to Tyr or Trp would be 12.5 or 15.1 kcal/mol, it is clear that each interaction evaluated individually is weaker than optimal. The histone binding protein has three interactions approaching the ab initio-generated geometries and interaction energies, whereas some components of the AChBP box make only relatively minor contributions to the overall binding energy. Nevertheless, it is clear that a ligand can benefit simultaneously from multiple cation-π interactions to components of an aromatic box. Gratifyingly, the largest interaction in the AChBP is to TrpB, which is the residue most commonly involved in a functionally significant cation-π interaction in studies of nAChRs.

We also considered the extent to which the total binding energy deviates from the sum of the individual interactions. In each case, the deviation is not large, suggesting a relative lack of cooperativity (positive or negative) in the biological aromatic boxes.

We conclude that while the positioning of the aromatics in the biological structures is not optimal, a cation can still make energetically significant cation-π interactions to multiple amino acid side chains. The shallow distance dependence of the cation-π interaction presaged this observation.

While not as large as is theoretically possible, total binding energies for the two aromatic boxes are quite large. Like most noncovalent interactions, the cation-π interaction is attenuated on moving from the gas phase to a condensed medium, although the attenuation is not as large as is typically associated with a hydrogen bond or an ion pair, since the desolvation penalty is less severe.48 Additionally, these calculations do not include any adjustments that the protein structure must make to adapt to the ligand; here we are considering perfectly preorganized binding sites.

It is also interesting that, despite the presence of 5 cation-π interactions, the AChBP shows a smaller binding energy than the histone binding protein with 3 cation-π interactions, because the cation-π interactions in the latter are individually much stronger. Experimentally, AChBP binds ACh with a ~4mM affinity,33 while affinities on the order of 0.3 mM have been determined for the binding of KMe3+ to a histone binding protein.49

An additional goal of the present work was to determine whether previous experimental studies of complex biological receptors – in which cation-π binding to a receptor was correlated to the affinity of a Na+ to a substituted aromatic ring – would be altered if more realistic models of the biological ligand were used. The ligands studied included simple protonated amines, (RNH3+), ACh and related compounds (RNMe3+), and derivatives of guanidinium. As such, we have evaluated the binding of NH4+, NMe4+, and guanidinium to a series of fluorinated benzenes and indoles.

As noted above, several highly detailed analyses of noncovalent interactions involving aromatics have appeared in recent years.49 These have emphasized direct interactions between the ion and the substituent, an interaction that might be expected to favor geometries in which the ion migrates away from the center of the ring toward the substituent(s). We find that for the simpler ions (Na+, K+, and ammonium) the ion stays over the center of the ring as fluorines are introduced. In contrast, for tetramethylammonium and guanidinium, edge-on geometries are competitive with and in some cases superior to the cation-π geometries for multiply fluorinated aromatics (Supporting Information). However, when in a cation-π binding geometry – achieved either through constraints or by using symmetrically substituted benzenes – tetramethylammonium and guanidinium follow the same patterns as the smaller ions.

Our results allow more reasonable modeling of the experimental studies. We find, perhaps not surprisingly, that the linear trends between receptor activation and cation-π binding energy arise regardless of the precise nature of the cation used in the calculations. These more realistic calculations do, however, allow some interesting quantitative comparisons to be made. We have several studies of ACh binding to a series of fluorinated Trp derivatives. The largest effect we have seen is a 540-fold drop in affinity on going from Trp to F4-Trp, corresponding to 3.7 kcal/mol.4 Our calculations for tetramethylammonium show a corresponding drop of 6.1 kcal/mol. For a simple ammonium ion, our biggest effect has been seen for GABA binding to the GABAA receptor, with a 16,500-fold drop in affinity on going from Phe to 3,4,5-F3-Phe, corresponding to 5.75 kcal/mol.4 Our calculations on ammonium show a corresponding drop of 8.4 kcal/mol. We consider these results to be quite encouraging. As noted above, one might expect some attenuation of a cation-π interaction on going from the gas phase to a receptor binding site. The fact that the trend between the two studies is in the right direction, with the protein binding being roughly 60% of the gas phase binding in both cases, suggests that the present computational model may be able to provide semi-quantitative analyses of protein cation-π binding sites.

Conclusions

The cation-π interaction, common in protein structures, can be modeled well using the recently-developed density functional M06 and the modest 6-31G(d,p) basis set. Calculations at this level of theory show that in the gas phase, there is a relatively shallow distance dependence for the cation-π interaction. This allows multiple interactions to contribute to ligand binding, providing a rationale for the aromatic box motif commonly seen in protein cation-π interaction binding sites. In addition, we examined cation-π binding to the AChBP and histone binding protein aromatic boxes, and reproduce in vivo observations.

All the ions studied here show a linear fluorination trend when in a cation-π (en face) binding geometry, enforced either by computational constraints or symmetrical substitution patterns. This supports the use of fluorination plots as a means to evaluate potential cation-π interactions.

Supplementary Material

Graphical Abstract
Supplementary Information

Acknowledgements

We thank the NIH (NS 34407) for support of this work.

Footnotes

Electronic Supplementary Information (ESI) available: Description of binding substituent effect: Figures S1, S2, S3, and S4; Tables S1, S2, S3. See DOI: 10.1039/x0xx00000x

Notes and references

  • 1.Dougherty DA. Science. 1996;271:163–168. doi: 10.1126/science.271.5246.163. [DOI] [PubMed] [Google Scholar]
  • 2.Dougherty DA. Acc Chem Res. 2013;46:885–893. doi: 10.1021/ar300265y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ma JC, Dougherty DA. Chem. Rev. 1997;97:1303–1324. doi: 10.1021/cr9603744. [DOI] [PubMed] [Google Scholar]
  • 4.Van Arnam EB, Dougherty DA. Journal of Medicinal Chemistry. 2014;57:6289–6300. doi: 10.1021/jm500023m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Meyer EA, Castellano RK, Diederich F. Angewandte Chemie International Edition. 2003;42:1210–1250. doi: 10.1002/anie.200390319. [DOI] [PubMed] [Google Scholar]
  • 6.Salonen LM, Ellermann M, Diederich F. Angew Chem Int Ed Engl. 2011;50:4808–4842. doi: 10.1002/anie.201007560. [DOI] [PubMed] [Google Scholar]
  • 7.Cheng J, Goldstein R, Gershenson A, Stec B, Roberts MF. Journal of Biological Chemistry. 2013;288:14863–14873. doi: 10.1074/jbc.M113.466532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dougherty DA, Van Arnam EB. ChemBioChem. 2014;15:1710–1720. doi: 10.1002/cbic.201402080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Duffy NH, Lester HA, Dougherty DA. ACS chemical biology. 2012;7:1738–1745. doi: 10.1021/cb300246j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Santarelli VP, Eastwood AL, Dougherty DA, Horn R, Ahern CA. Journal of Biological Chemistry. 2007;282:8044–8051. doi: 10.1074/jbc.M611334200. [DOI] [PubMed] [Google Scholar]
  • 11.MIles TF., Ph. D California Institute of Technology. 2015 [Google Scholar]
  • 12.Parrish RM, Sherrill CD. Journal of the American Chemical Society. 2014;136:17386–17389. doi: 10.1021/ja5101245. [DOI] [PubMed] [Google Scholar]
  • 13.Wheeler SE, Bloom JWG. The Journal of Physical Chemistry A. 2014;118:6133–6147. doi: 10.1021/jp504415p. [DOI] [PubMed] [Google Scholar]
  • 14.Wheeler SE, Houk KN. Journal of the American Chemical Society. 2009;131:3126–3127. doi: 10.1021/ja809097r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.I. C. Wavefunction Inc [Google Scholar]
  • 16.Zhao Y, Truhlar D. Theoretical Chemistry Accounts. 2008;120:215–241. [Google Scholar]
  • 17.Roy Dennington TK, Millam John. Shawnee Mission, KS: Semichem, Inc; 2009. [Google Scholar]
  • 18.Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Scalmani G, Barone V, Mennucci B, Petersson GA, Nakatsuji H, Caricato M, Li X, Hratchian HP, Izmaylov AF, Bloino J, Zheng G, Sonnenberg JL, Hada M, Ehara M, Toyota K, Fukuda R, Hasegawa J, Ishida M, Nakajima T, Honda Y, Kitao O, Nakai H, Vreven T, Montgomery JA, Jr, Peralta JE, Ogliaro F, Bearpark MJ, Heyd J, Brothers EN, Kudin KN, Staroverov VN, Kobayashi R, Normand J, Raghavachari K, Rendell AP, Burant JC, Iyengar SS, Tomasi J, Cossi M, Rega N, Millam NJ, Klene M, Knox JE, Cross JB, Bakken V, Adamo C, Jaramillo J, Gomperts R, Stratmann RE, Yazyev O, Austin AJ, Cammi R, Pomelli C, Ochterski JW, Martin RL, Morokuma K, Zakrzewski VG, Voth GA, Salvador P, Dannenberg JJ, Dapprich S, Daniels AD, Farkas Ö, Foresman JB, Ortiz JV, Cioslowski J, Fox DJ. Wallingford, CT, USA: Gaussian, Inc.; 2009. [Google Scholar]
  • 19.Meot-Ner (Mautner) M, Deakyne CA. J Am. Chem. Soc. 1985;107:469–474. [Google Scholar]
  • 20.Meot-Ner M, Deakyne CA. Journal of the American Chemical Society. 1985;107:474–479. [Google Scholar]
  • 21.Sunner J, Nishizawa K, Kebarle P. The Journal of Physical Chemistry. 1981;85:1814–1820. [Google Scholar]
  • 22.Guo BC, Purnell JW, Castleman AW. Chemical Physics Letters. 1990;168:155–160. [Google Scholar]
  • 23.Mecozzi S, West AP, Dougherty DA. Journal of the American Chemical Society. 1996;118:2307–2308. [Google Scholar]
  • 24.Grimme S. Journal of Computational Chemistry. 2004;25:1463–1473. doi: 10.1002/jcc.20078. [DOI] [PubMed] [Google Scholar]
  • 25.Johnson ER, DiLabio GA. Chemical Physics Letters. 2006;419:333–339. [Google Scholar]
  • 26.Johnson ER, Wolkow RA, DiLabio GA. Chemical Physics Letters. 2004;394:334–338. [Google Scholar]
  • 27.Kristyán S, Pulay P. Chemical Physics Letters. 1994;229:175–180. [Google Scholar]
  • 28.Torres E, DiLabio GA. The Journal of Physical Chemistry Letters. 2012;3:1738–1744. doi: 10.1021/jz300554y. [DOI] [PubMed] [Google Scholar]
  • 29.Tsuzuki S, Lüthi HP. The Journal of Chemical Physics. 2001;114:3949–3957. [Google Scholar]
  • 30.Pullman A, Berthier G, Savinelli R. Journal of Computational Chemistry. 1997;18:2012–2022. [Google Scholar]
  • 31.Duffy EM, Kowalczyk PJ, Jorgensen WL. Journal of the American Chemical Society. 1993;115:9271–9275. [Google Scholar]
  • 32.Gallivan JP, Dougherty DA. Proceedings of the National Academy of Sciences. 1999;96:9459–9464. doi: 10.1073/pnas.96.17.9459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sixma TK, Smit AB. Annual review of biophysics and biomolecular structure. 2003;32:311–334. doi: 10.1146/annurev.biophys.32.110601.142536. [DOI] [PubMed] [Google Scholar]
  • 34.Olsen JA, Balle T, Gajhede M, Ahring PK, Kastrup JS. PLoS ONE. 2014;9:e91232. doi: 10.1371/journal.pone.0091232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hughes RM, Wiggins KR, Khorasanizadeh S, Waters ML. Proceedings of the National Academy of Sciences. 2007;104:11184–11188. doi: 10.1073/pnas.0610850104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Jacobs SA, Khorasanizadeh S. Science. 2002;295:2080–2083. doi: 10.1126/science.1069473. [DOI] [PubMed] [Google Scholar]
  • 37.Li H, Ilin S, Wang W, Duncan EM, Wysocka J, Allis CD, Patel DJ. Nature. 2006;442:91–95. doi: 10.1038/nature04802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Peña PV, Davrazou F, Shi X, Walter KL, Verkhusha VV, Gozani O, Zhao R, Kutateladze TG. Nature. 2006;442:100–103. doi: 10.1038/nature04814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Taverna SD, Li H, Ruthenburg AJ, Allis CD, Patel DJ. Nat Struct Mol Biol. 2007;14:1025–1040. doi: 10.1038/nsmb1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Cheng J, Gong Z, Zhu W, Tang Y, Li W, Li Z, Jiang H. Journal of Physical Organic Chemistry. 2007;20:448–453. [Google Scholar]
  • 41.Cabarcos OM, Weinheimer CJ, Lisy JM. The Journal of Chemical Physics. 1998;108:5151–5154. [Google Scholar]
  • 42.Koyanagi GK, Bohme DK. International Journal of Mass Spectrometry. 2003;227:563–575. [Google Scholar]
  • 43.Min J, Zhang Y, Xu RM. Genes Dev. 2003;17:1823–1828. doi: 10.1101/gad.269603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Mecozzi S, West AP, Jr, Dougherty DA. Proc. Natl. Acad. Sci. USA. 1996;93:10566–10571. doi: 10.1073/pnas.93.20.10566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Campo-Cacharrón A, Cabaleiro-Lago EM, Carrazana-García JA, Rodríguez-Otero J. Journal of Computational Chemistry. 2014;35:1290–1301. doi: 10.1002/jcc.23623. [DOI] [PubMed] [Google Scholar]
  • 46.Pless SA, Hanek AP, Price KL, Lynch JW, Lester HA, Dougherty DA, Lummis SCR. Molecular Pharmacology. 2011;79:742–748. doi: 10.1124/mol.110.069583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Xiu X, Puskar NL, Shanata JA, Lester HA, Dougherty DA. Nature. 2009;458:534–537. doi: 10.1038/nature07768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Gallivan JP, Dougherty DA. J Am. Chem. Soc. 2000;122:870–874. [Google Scholar]
  • 49.van Ingen H, van Schaik FMA, Wienk H, Ballering J, Rehmann H, Dechesne AC, Kruijzer JAW, Liskamp RMJ, Timmers HTM, Boelens R. Structure. 2008;16:1245–1256. doi: 10.1016/j.str.2008.04.015. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Graphical Abstract
Supplementary Information

RESOURCES