Skip to main content
Acta Naturae logoLink to Acta Naturae
. 2009 Apr;1(1):124–127.

The Role of Stacking Interactions in Complexes of Proteins with Adenine and Guanine Fragments of Ligands

T V Pyrkov 1,2,*,, D V Pyrkova 1, E D Balitskaya 1,3, R G Efremov 1
PMCID: PMC3347508  PMID: 22649598

INTRODUCTION

The biological function of proteins is closely connected to interactions with their ligands and substrates. Proteins acting as receptors and enzymes bind these small molecules. Knowledge of the molecular mechanisms of protein-ligand interactions, particularly in the spatial structure of the protein-ligand complex, is a prerequisite for understanding the structure-functional properties of proteins and their role in biochemical pathways in the living cell. The availability of such a structure serves as a basis for rational drug design projects and greatly assists the search for new inhibitors (the ligands of certain protein-targets in an organism).

Experimental tools for determining the spatial structure of proteins and their complexes with ligands (such as X-ray crystallography or NMR spectroscopy) have particular limitations. Even if the structure of a protein is available, determining the structure of its complex with ligands may be experimentally demanding. Problems with purification and crystallization become especially difficult in studies of transmembrane proteins, which include a biologically important class of G-protein coupled receptors. However, recent successes in determining the structure of beta-adrenergic and adenosine receptors [1] are cause for optimism.

The technical difficulties restraining experimental methods stimulated computational molecular modeling. One of them (molecular docking) is a method aimed at predicting the spatial structure of a protein-ligand complex by docking a ligand molecule into the known atomic-resolution structure of a protein-binding site and estimating the reliability of the results. Nowadays, molecular docking has become an integral part of both fundamental studies aimed at understanding the structure-functional role of protein amino acids and applied drug-design programs [2, 3].

Docking approaches are further improved by implementing new algorithms of the conformational search and new scoring functions (methods to estimate the free energy of ligand binding). Scoring functions may include either components of molecular mechanics force fields [2] or empirical terms, e.g. hydrogen bonds described by their geometrical parameters [4]. In this work we studied stacking interactions, which usually are not properly taken into account in widely used scoring functions.

The Parameters of Stacking Interactions

Of all the various types of interactions in biomolecular complexes (such as hydrogen bonds, salt bridges, etc.), the stacking of aromatic substances deserves special attention. Most drugs include aromatic fragments in their chemical structure, and stacking often plays a notable role in their recognition by protein-targets. We have recently shown that an explicit account of stacking in scoring functions increases the efficiency of ATP docking [5]. The aromatic interactions were identified by the mutual orientation of two cycles described by geometrical parameters: the height h and displacement d of one cycle relative to the other, and the angle α between their planes (Fig. 1).

Fig. 1.

Fig. 1.

Geometrical parameters used to describe a stacking contact between two aromatic rings. Displacement (d) and height (h) are calculated for the center of one aromatic ring relative to another ring's plane. Angle α is calculated as the angle between the normal vectors of both rings.

However, the range of these parameters, which corresponds to the presence or absence of a stacking contact, is still not very well defined and usually taken as arbitrary [6, 7]. Defining it more accurately would assist in developing more efficient scoring functions and should increase the prediction quality of the spatial structures of protein-ligand complexes by molecular modeling methods. With this aim in view, we performed an analysis of the spatial structures of protein-ligand complexes determined experimentally with atomic resolution where ligands contained adenine or guanine as a substructure.

One well-known example of stacking interactions is the parallel packing of purine and pyrimidine nucleobases in DNA [8, 9]. Some aromatic compounds tend to orient perpendicular to each other (T-shaped stacking), as has been shown for amino acids in proteins [7, 10] and for model systems of carbon aromatic cycles (benzene and naphtalene) [1114]. Besides, such compounds participate in cation-π interactions, where a positively charged group interacts with the negatively charged cloud of aromatic π-electrons [1517].

Taking all that into account, we analyzed the distribution of geometrical parameters h, d, and α for contacts of adenine and guanine moieties of ligands with the aromatic side chains of receptor amino acids Phe, Tyr, Trp, and His, as well as with the positively charged guanidine group of Arg and amino group of Lys. The results obtained for guanine are presented in Fig. 2.

Fig. 2.

Fig. 2.

The distribution of aromatic rings and positively charged side chain groups of amino acids around the guanine moiety of various ligands in complexes with protein receptors. The color red corresponds to cos2α = 0.6 – 1.0 (parallel orientation), green corresponds to cos2α = 0.0 – 0.4 (T-shaped orientation), and yellow corresponds to intermediate geometry. Here, α is the angle between the planes of both rings. For Lys this value is not defined.

It can be seen that two distinct orientations are typical for Phe: parallel and perpendicular to the guanine plane (Fig. 2, shown in red and green, respectively). The displacement d lies in the same range 0–3 Å for both types of contacts. Meanwhile, they clearly differ in the value of height h, which is less than 4.5 for parallel Å and less than 5.5 Å for perpendicular orientation. Similar distributions were obtained for Tyr, Trp, and His, though the data are scarcer in these cases. However, the T-shaped contact is not as typical for Tyr, Trp, and His as it is for Phe.

Interestingly, the distribution for the planar guanidine group of Arg very much resembles that of Tyr, Trp, and His, where the orientation parallel to guanine is predominant. For the amino group of the Lys side chain, two modes of contacts with guanine were observed: above the plane (cation-π interaction) and in the plane. The latter corresponds to the formation of a hydrogen bond to the heteroatoms of the guanine ring.

Distributions for adenine are similar to those obtained for guanine (data not shown). The results of the presented analysis may be used in developing scoring criteria and applied to rescoring the results of docking or even during the docking procedure.

Guanine-Specific Scoring Function

We demonstrated the efficiency of an explicit account of stacking interactions in a scoring function along with reranking the results of GTP docking to the 14 different proteins that bind this ligand. All structures of GTP-protein complexes, which were generated with the docking procedure, were labeled as either correct or misleading by the value of root-mean-square deviation (rmsd) of the guanine atoms from the reference X-ray structure (see Methods). To estimate the validity of a docking pose, we constructed a number of scoring criteria in the form of a linear combination of the interaction terms. To do that, all GTP-protein complexes were divided into two equal groups: the training and the test sets. The weighting coefficients for these terms were fitted by the linear regression procedure to the binary function, which took on the value of 1 for the correct docking poses and 0 for the seven complexes of the training set. To test the robustness of the new scoring functions, the leave-one-out cross-validation procedure was performed. The relative error of all weighting coefficients of interaction terms was less than 30%, thus indicating the reliability of the results.

New scoring functions were used to analyze the GTP docking poses generated for each complex. The efficiency of the goldscore [18] function implemented in the docking algorithm achieved approximately 50%; it ranked correct poses at the top for only four and three complexes out of seven for the training and test sets, respectively (Table 1). Ranking by the value of Tstacking, which describes stacking contacts, was better than by goldscore; their combination (SF1) yields even better results. A similar effect was observed when the term Tstacking was added to the criterion based on lipophilic contacts and hydrogen bonds between the protein and the guanine moiety of the ligand (SF2 and SF3, Table 1).

Table 1.

Ranking of the results of GTP docking.

Ranking method Training set, 7 complexes Test set, 7 complexes
Number of complexes for which correct pose was ranked at top Average best rank of correct pose Number of complexes for which correct pose was ranked at top Average best rank of correct pose
goldscore 4 4.7 3 12.1
Tstack 5 2.3 3 7.0
SF1 = Р 1.3 + 0.21 × Tstack + 0.016 × goldscore 5 1.7 4 6.9
SF2 = 0.06 + 0.007 × Tlipophilic+ 0.43 × Th-bond 5 2.3 4 6.9
SF3 = 0.05 + 0.004 × Tlipophilic+ 0.39 × Th-bond + 0.22 × Tstack 6 2.0 5 6.4

Of the proposed scores, the SF3 is the most efficient. The number of complexes for which the correct pose was ranked at the top by SF3 considerably exceeds that of goldscore. Also, the average best rank of the correct docking pose improves; i.e. the quality of ranking increases uniformly for all complexes. This can also be seen from the results of ranking for each complex (Fig. 3).

Fig. 3.

Fig. 3.

Efficiency of ranking the docking poses for GTP-protein complexes by the SF3 (red) and by goldscore (green) for the training (A) and the test (B) sets. The best rank of a correct GTP pose is shown.

CONCLUSIONS

The analysis of structural data for complexes of proteins with ligands containing adenine or guanine moiety yielded a more accurate definition of the geometrical parameters of stacking interactions with aromatic side chains of receptor amino acids. Reranking the results of GTP docking demonstrated that an explicit account of stacking in scoring criteria provides a more efficient estimation of the reliability of the structure of the protein-ligand complex predicted with molecular modeling approaches. The obtained results can be further applied to a broader class of nucleobase-containing ligands.

METHODS

The structures of complexes of proteins with adenine- and guanine-containing ligands were taken from the Brookhaven Protein Data Bank (PDB) [19]. The PDBlig web server [20] was used to identify those PDB entries that contain a ligand with a purine nucleobase (adenine or guanine as a substructure). Structures with modified nucleobases and entries that contain nucleic acids other than simple nucleotides or nucleosides were omitted. Finally, to reduce the redundancy of the set of protein-ligand complexes, a multiple sequence alignment was carried out using the Clustalw program [21]. After that, all complexes with the same ligand were clustered according to similarity in a protein sequence and the structure with the best resolution from each cluster was retained.

GTP docking was performed using the GOLD [18] program with the goldscore scoring function. The parameters of the docking procedure were taken as default. For each complex, 60 docking poses were generated. An rmsd cut-off 2.5 Å over the coordinates of guanine atoms was used to judge whether a pose was correct or misleading.

The surface area of the hydrophobic GTP-protein contact (as a measure of hydrophobic interactions, Tlipophilic) was calculated based on the concept of Molecular Hydrophobicity Potential (MHP) using the PLATINUM web server [22]. A MHP-scale shift of +0.2 was applied to the ligand surface to achieve a more realistic distribution of the hydrophobic/hydrophilic properties of GTP.

The term Th-bond is a binary function that takes on the value of 1 when a hydrogen bond network of guanine and a particular motif is formed; if not, it takes on a value of 0. Such motifs were hydrogen bonds between the guanine atoms N1, N2, and O6 and residues i, i, i, or i, i, i-2.

REFERENCES

  • 1.Hanson M.A., Stevens R.C. Structure. 2009;17:8–14. doi: 10.1016/j.str.2008.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kitchen D.B., Decornez H., Furr J.R., Bajorath J. J. Nat. Rev. Drug. Discov. 2004;3:935–949. doi: 10.1038/nrd1549. [DOI] [PubMed] [Google Scholar]
  • 3.Moitessier N., Englebienne P., Lee D., Lawandi J., Corbeil C.R. Br. J. Pharmacol. 2008;153:S7–S26. doi: 10.1038/sj.bjp.0707515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Eldridge M.D., Murray C.W., Auton T.R., Paolini G.V., Mee R.P. J. Comput. Aided. Mol. Des. 1997;11:425–445. doi: 10.1023/a:1007996124545. [DOI] [PubMed] [Google Scholar]
  • 5.Pyrkov T.V., Kosinsky Y.A., Arseniev A.S., Priestle J.P., Jacoby E., Efremov R.G. PROTEINS. 2007;66:388–398. doi: 10.1002/prot.21122. [DOI] [PubMed] [Google Scholar]
  • 6.Deng Z., Chuaqui C., Singh J. J. Med. Chem. 2004;47:337–344. doi: 10.1021/jm030331x. [DOI] [PubMed] [Google Scholar]
  • 7.Chakrabarti P., Bhattacharyya R. Prog. Biophys. Mol. Biol. 2007;95:83–137. doi: 10.1016/j.pbiomolbio.2007.03.016. [DOI] [PubMed] [Google Scholar]
  • 8.Gotoh O. Adv. Biophys. 1983;16:1–52. doi: 10.1016/0065-227x(83)90007-2. [DOI] [PubMed] [Google Scholar]
  • 9.Sponer J., Leszczynski J., Hobza P. J. Biomol. Struct. Dynam. 1996;14:117–135. doi: 10.1080/07391102.1996.10508935. [DOI] [PubMed] [Google Scholar]
  • 10.Chelli R., Gervasio F.L., Procacci P., Schettino V. J. Am. Chem. Soc. 2002;124:6133–6143. doi: 10.1021/ja0121639. [DOI] [PubMed] [Google Scholar]
  • 11.Tsuzuki S., Honda K., Uchimaru T., Mikami M., Tanabe K. J. Am. Chem. Soc. 2001;124:104–112. doi: 10.1021/ja0105212. [DOI] [PubMed] [Google Scholar]
  • 12.Small D., Zaitsev V., Jung Y., Rosokha S.V., Head-Gordon M., Kochi J.K. J. Am. Chem. Soc. 2004;126:13850–13858. doi: 10.1021/ja046770i. [DOI] [PubMed] [Google Scholar]
  • 13.Sato T., Tsuneda T., Hirao K. J. Chem. Phys. 2005;123:104307–104317. doi: 10.1063/1.2011396. [DOI] [PubMed] [Google Scholar]
  • 14.Malathy Sony S.M., Ponnuswamy M.N. Crystal Growth and Design. 2006;6:736–742. [Google Scholar]
  • 15.Waters M.L. Curr. Opin. Chem. Biol. 2002;6:736–741. doi: 10.1016/s1367-5931(02)00359-9. [DOI] [PubMed] [Google Scholar]
  • 16.Meyer E.A., Castellano R.K., Diederich F. Angew. Chem. Int. Ed. 2003;42:1210–1250. doi: 10.1002/anie.200390319. [DOI] [PubMed] [Google Scholar]
  • 17.Tewari A.K., Dubey R. Bioorg. Med. Chem. 2008;16:126–143. doi: 10.1016/j.bmc.2007.09.023. [DOI] [PubMed] [Google Scholar]
  • 18.Jones G., Willett P., Glen R.C., Leach A.R., Taylor R.D. J. Mol. Biol. 1997;267:727–748. doi: 10.1006/jmbi.1996.0897. [DOI] [PubMed] [Google Scholar]
  • 19.Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E. Nucleic. Acid. Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chalk A.J., Worth C.L., Overington J.P., Chan A.W. J. Med. Chem. 2004;47:3807–3816. doi: 10.1021/jm040804f. [DOI] [PubMed] [Google Scholar]
  • 21.Larkin M.A., Blackshields G., Brown N.P., Chenna R., McGettigan P.A., McWilliam H., Valentin F., Wallace I.M., Wilm A., Lopez R., Thompson J.D., Gibson T.J., Higgins D.G. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
  • 22.Pyrkov T.V., Chugunov A.O., Krylov N.A., Nolde D.E., Efremov R.G. Bioinformatics. 2009 doi: 10.1093/bioinformatics/btp111. [DOI] [PubMed] [Google Scholar]

Articles from Acta Naturae are provided here courtesy of National Research University Higher School of Economics

RESOURCES