Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
. 2004 Oct;13(10):2725–2735. doi: 10.1110/ps.04878904

A united residue force-field for calcium–protein interactions

Mey Khalili 1, Jeffrey A Saunders 1,3, Adam Liwo 1,2, Stanislaw Ołdziej 1,2, Harold A Scheraga 1
PMCID: PMC2286548  PMID: 15388862

Abstract

United-residue potentials are derived for interactions of the calcium cation with polypeptide chains in energy-based prediction of protein structure with a united-residue (UNRES) force-field. Specific potentials were derived for the interaction of the calcium cation with the Asp, Glu, Asn, and Gln side chains and the peptide group. The analytical expressions for the interaction energies for each of these amino acids were obtained by averaging the electrostatic interaction energy, expressed by a multipole series over the dihedral angles not considered in the united-residue model, that is, the side-chain dihedral angles χ and the dihedral angles λ for the rotation of peptide groups about the Cα•••Cα virtual-bond axes. For the side-chains that do not interact favorably with calcium, simple excluded-volume potentials were introduced. The parameters of the potentials were obtained from ab initio quantum mechanical calculations of model systems at the Restricted Hartree-Fock (RHF) level with the 6–31G(d,p) basis set. The energy surfaces of pairs consisting of Ca2+-acetate, Ca2+-propionate, Ca2+-acetamide, Ca2+-propionamide, and Ca2+-N-methylacetamide systems (modeling the Ca2+-Asp, Ca2+-Glu, Ca2+-Asn, Ca2+-Gln, and Ca2+-peptide group interactions) at different distances and orientations were calculated. For each pair, the restricted free energy (RFE) surfaces were calculated by numerical integration over the degrees of freedom lost when switching from the all-atom model to the united-residue model. Finally, the analytical expressions for each pair were fitted to the RFE surfaces. This force-field was able to distinguish the EF-hand motif from all potential binding sites in the crystal structures of bovine α-lactalbumin, whiting parvalbumin, calbindin D9K, and apo-calbindin D9K.

Keywords: protein structure prediction, united-residue force-field, calcium cation binding, EF-hand motif


Calcium-binding proteins play many biological roles. They are involved in regulation, membrane-related events, and muscle movement and act as biological switches (Frausto da Silva et al. 1991; Bertini et al. 1994; Lippard and Berg 1994; Hanna and Doudna 2000). Metal cations, in general, bind to centers of high hydrophilicity and reduce the enthalpy of a system upon binding (Frausto da Silva et al. 1991). All metal ions bind to a shell of polar hydrophilic residues surrounded by a shell of nonpolar residues (Yamashita et al. 1990). However, the binding process probably occurs in a step-wise manner, in which the hydrated metal ion initially positions itself in the binding site and, later, some of its inner-shell water molecules are replaced by negatively charged side-chains (Dudev and Lim 2003).

It is not clear whether metal ions help proteins to fold or whether they are incorporated into the proteins after they have already formed their three-dimensional structure. Although there have been numerous attempts to design calcium-binding sites in proteins (Yang et al. 2002, 2003), there has been very little effort to locate the most probable binding site in a protein. Locating potential binding sites in a mean-field model is still more difficult because, in most such models, atomic details of the side-chains are eliminated and the side-chains are represented by spheres or ellipsoids. However, charges and the orientation of the side-chain are very important in the process of metal binding. Here, we present an effort to reproduce the correct charge and geometry distribution of the side-chains in a united-residue model. Such a model can help in incorporating metal ions efficiently in the folding process, especially because most metal-binding proteins are large multichain subunits, and it is currently difficult to simulate their folding by using an all-atom potential. To try to fold these proteins with a united-residue force-field that can include cations in the folding process might shed light on the roles that cations play in folding proteins. We start with the calcium cation because, in contrast to transition-metal cations, it forms complexes of predominantly electrostatic type, which are easier to model compared with covalent bonding interactions as occur with cations such as zinc. Because we treat the system at the mean-field level, we consider only the total energetic effect of replacing water molecules in the coordination sphere of a calcium cation and not the detailed kinetics of this process.

The calcium cation shows a greater tendency to form complexes with negatively charged carboxylate groups than with water molecules (Katz et al. 1996). It binds directly to the side-chains of the hydrophilic residues and to the carbonyl groups of the backbone peptide groups as opposed to indirect binding via a metal-bound water molecule (Dudev and Lim 2003; Dudev et al. 2003). The observed total coordination number of calcium in proteins varies from six to eight, with a coordination radius of 1.00 Å and 1.12 Å, respectively (Jernigan et al. 1994; Dudev and Lim 2003; Dudev et al. 2003).

Calcium binding sites can be classified into three categories (Einspahr and Bugg 1984; McPhalen et al. 1991; Pidcock and Moore 2001; Dudev and Lim 2003). Most calcium binding proteins possess a highly conserved structure, the EF-hand motif, that selectively binds calcium (Forsen and Koerdel 1996). This structure corresponds to a short continuous segment of the protein that provides groups for binding (the classical EF-hand motif). A second structure is the same as an EF-hand motif except that one of the groups comes from a distant site. A third structure corresponds to binding sites in which the groups come from different segments in the protein. The EF-hand motif is the most common of them all (Kretsinger and Nockolds 1973). The classical EF-hand motif is a 12-residue Ca2+-binding loop between two helices, forming a conserved helix-loop-helix structure (Kretsinger and Nockolds 1973; Dudev and Lim 2003). This motif contains Asp, Glu, Asn, and Gln residues that, along with the peptide groups, provide sites for calcium binding. In many calcium-binding proteins, Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. In the S100 superfamily, for example, to which calbindin D9K, calmodulin, and troponin C belong, this conformational change is described as a change from the ≪closed≫ conformational state in the absence of Ca2+ to the ≪open&Rt conformational state in its presence (Yap et al. 1999).

Results

The united-residue (UNRES) mean-field energy function, together with its extension to include calcium–polypeptide interactions and the parameterization of these additional terms, are described in the Materials and Methods section (equations 1, 8, 9, 10, and 13), with the parameters being summarized in Table 1 in the Materials and Methods section. In this section, we describe the results of tests of the force-field on bovine α-lactalbumin, whiting parvalbumin, and calbindin D9K.

Table 1.

Parametersaof the Ca2+force-field (equations 8 and 9) obtained by quantum mechanical energy calculations

Group ɛ (kcal/mole) r0 (Å) Wc (kcal/mole • Å) Wdip (kcal/mole • Å2) Wquad1 (kcal/mole • Å3) Wquad2 δ(Å)
Asp 0.1 3.5 −1381.06 283.2 −203.7 3.0 0.12
Glu 0.1 3.5 −1426.87 −98.92 −2454 3.0 0.02
Asn 0.1 3.5 −70.79 1499 3.0 0.05
Gln 0.1 3.5 −93.13 1975 3.0 0.01
Pep 0.1 6.0 109.32 3.901 3.0 0.00

a ɛ and r0 were estimated based on the AMBER force-field, and the rest of the parameters were obtained from RHF/6-31G (d,p) calculations. For Asn, Gln, and the peptide group, wc does not appear because these sites possess zero net charge. For the peptide group, δ = 0 because it is assumed that the calcium cation interacts with the center of mass of the peptide group.

Bovine α-lactalbumin (Protein Data Bank [PDB] access code 1F6S; Chrysina et al. 2000) is an α+β hexamer, each monomer containing 121 residues. Every monomer has one EF-hand motif starting at Asp78 and ending at Asp88. The binding pocket contains only Asp residues. The position of the calcium cation in the native structure and the position found by a systematic search of the optimum binding site and subsequent minimization of the calcium-chain energy (with the polypeptide chain fixed in the native structure; see Materials and Methods) are shown in Figure 1. The root mean square deviation (RMSD) between the force-field position of calcium and its position in the PDB structure is 0.7 Å.

Figure 1.

Figure 1.

Stereo view of the placement of the calcium cation in the native structure (black ball) of a monomer of α-lactalbumin and the optimal location found by the calcium force-field (gray ball). This protein contains one calcium-binding site per monomer. The light gray segments in the figure represent the bond between the Cα and side-chain sites in the interacting residues. The backbone geometry was not optimized.

Whiting parvalbumin (PDB access code 1A75; Declercq et al. 1999) is an α-helical dimeric protein. Each of the monomers has 108 residues and contains two EF-hand motifs. One of the EF-hand motifs starts at Asp53 and ends at Glu62. The other EF-hand motif starts at Asp92 and ends at Glu101. The positions of the two calcium cations in the experimental structure and the positions found by a systematic search of the optimum binding sites and subsequent energy minimization (with the polypeptide chain fixed in the native structure) are shown in Figure 2. The RMSD between the force-field positions of the calciums and those of the PDB structure is 3.37 Å in the first loop and 3.34 Å in the second loop, respectively.

Figure 2.

Figure 2.

Stereo view of the placement of calcium cations in the native structure (black balls) of a monomer of whitting parvalbumin and the optimal locations found by the calcium force-field (gray balls). This protein contains two calcium-binding EF-hand motifs per monomer. The light gray segments in the figure represent the bond between the Cα and side-chain sites in the interacting residues. The backbone geometry was not optimized.

Calbindin D9K (PDB access code 4ICB; Svensson et al. 1992) is an α-helical monomeric protein of 75 residues. This protein is very well studied, and there are many good crystal and NMR structures of the calcium-bound form, the apo-form, and structures with other ions such as magnesium and lanthanium in the PDB. It contains two EF-hand motifs. The first one starts at Glu17 and ends at Glu27. The second one starts at Asn54 and ends at Glu65. The second motif has more interacting residues, and therefore, it is energetically favored by calcium and other cations even though the pockets are comparable in size. The positions of the two calcium cations in the experimental structure and the positions found by systematic search of the optimum binding sites and subsequent energy minimization (with the polypeptide chain fixed in the native structure) are shown in Figure 3. The RMSD between the force-field positions of the calciums and those of the PDB structure is 5.33 Å in the first loop and 7.55 Å in the second loop, respectively.

Figure 3.

Figure 3.

Stereo view of the placement of calcium cations in the native structure (black balls) of calbindin D9K along with the optimal locations found by the calcium force-field (gray balls). The light gray segments in the figure represent the bond between the Cα and side-chain sites in the interacting residues. The backbone geometry was not optimized.

These examples demonstrate that our calcium force-field can correctly locate the EF-hand motif as the best binding site given the experimental geometry of the polypeptide chain. We carried out two more test calculations on calbindin D9K to determine whether the potential can locate the binding site correctly (1) in its apo-form, in which the binding sites have the closed form, and (2) in the native-like structure predicted with the UNRES force-field, in which the binding-site region is distorted because of imperfections of the force-field. In both tests 1 and 2, the geometry of the system containing the polypeptide chain and the calcium cation(s) was energy-minimized (see Materials and Methods).

To carry out test 1, we used the PDB structure of apo-calbindin D9K (PDB access code 1CLB; Skelton et al. 1995). The apo form examined is a mutant of the wild type, in which Phe43 is replaced by Gly. It appears that this mutation does not affect the potential calcium-binding regions, because the mutated residue is not at any of the two EF-hands and it is not a residue that interacts favorably with calcium. The main difference between calbindin D9K and apo-calbindin D9K is in the EF-hand regions. In apo-calbindin D9K, the EF-hands are positioned toward the center of the protein. This form of the EF-hand motif is also known as the ≪closed≫ form. However, when calciums bind to the EF-hands, they push the EF-hands to the outside and the EF-hands adopt the ≪open≫ form (Yap et al. 1999). The opening of the EF-hands upon calcium binding is thought to be responsible for the biological activity of the protein (Yap et al. 1999).

The total backbone α-carbon (Cα) RMSD between the PDB structures of calbindin D9K and apo-calbindin D9K (residues 1–75) is 2.11 Å. The Cα RMSDs between the first set of residues (17–27) and the second set of residues (54–65) in the two EF-hand motifs of the experimental structure of calbindin D9K and apo-calbindin D9K are 1.13 Å and 1.79 Å, respectively. The crystal structure of apo-calbindin D9K had to be relaxed by minimization of the UNRES energy because it had some side-chain clashes when it was converted from the PDB geometry to the UNRES geometry. However, the relaxation procedure increased the RMSD between the relaxed structure of apo-calbindin D9K and the structure of calbindin D9K in the crystal. The increase in RMSD was due mainly to the introduction of three residues to the helix in the region of the first EF-hand motif of apo-calbindin D9K. After the relaxation, the overall Cα RMSD between these structures of apo-calbindin D9K and calbindin D9K was 3.70 Å, and the RMSDs between the first and the second EF-hand motifs were 2.70 Å and 1.82 Å, respectively.

After the introduction of calcium cations into the relaxed apo-structure and minimization of the UNRES energy of the chain, including the positions of the cations, the total Cα RMSD of the final structure from the structure of calbindin D9K in the crystal was 3.21 Å as opposed to 3.70 Å before energy minimization. The RMSDs between the first and the second EF-hand motifs were 2.21 Å and 1.84 Å, respectively, as opposed to 2.70 Å and 1.82 Å of the relaxed structure before calcium placement. The RMSD between the position of calcium in the final calculated structure and in the experimental structure of calbindin D9K in the crystal was 4.39 Å in the first pocket and 7.44 Å in the second pocket. Therefore, introduction of calcium cations and subsequent minimization of the energy of the complex reduced the Cα RMSD between the relaxed apo-form and the native calcium-bound form, especially in the first EF-hand motif, which was distorted by the relaxation procedure. The superposition of the crystal structure of the calcium-bound form (black), the crystal structure of the apo form (red), and the final force-field structure (blue) are shown in shown in Figure 4. It can be seen in the figure that the calcium-bound structure and the final force-field structure are much more similar in the EF-hand motif regions than are the apo structure and the final force-field structure. This shows that our calcium force-field is capable of converting the EF-hand motifs from the ≪closed≫ form in the apo structure to the ≪open≫ form in the calcium-bound structure.

Figure 4.

Figure 4.

Stereo view of the superposition of calbindin D9K (black), apo-calbindin D9K (red), and the apo-calbindin D9K structure (blue) after the calcium placements and minimization of the backbone and calcium positions with the UNRES-calcium force-field. The backbone geometry of apo-calbindin D9K was optimized before addition of calciums. Although the loops are ≪open≫in the native structure of the apo-calbindin D9K, they become ≪closed≫ after the introduction of calcium and represent the geometry of the binding sites in the native structure of calbindin D9K.

To carry out test 2, we used the lowest-energy structure of apo-calbindin D9K, which was obtained by a conformational search of apo-calbindin D9K without the calcium force-field using the UNRES force-field and our conformational space annealing search (CSA) method (Lee et al. 1999). We used the latest parameterization of the UNRES force-field obtained by optimizing the free-energy landscapes of four benchmark proteins: 1E0G, 1E0L, 1GAB, and 1IGD (S. Oldziej, J. Lagiewka, A. Liwo, C. Czaplewski, M. Chinchio, M. Nanias, and H.A. Scheraga, in prep.).

The overall Cα RMSD between this lowest-energy structure and the crystal structure of apo-calbindin D9K was 6.3 Å. The RMSDs between the first and the second EF-hand motifs were 4.17 Å and 3.31 Å, respectively. In this structure, the EF-hand motifs were completely distorted. They did not posses any extended structure, which is the characteristic of the EF-hand motif. Rather, they possessed a significant amount of helical content.

To assess whether one calcium cation is capable of correcting one of the distorted EF-hand motifs, one calcium cation was introduced to this structure. The chain, as well as the position of the calcium cation, was energy-minimized. The final structure obtained from the calcium force-field had an overall Cα RMSD of 5.89 Å from the crystal structure of calbindin D9K. The RMSDs of the first and second EF-hand motif were 3.91 Å and 3.65 Å, respectively. This structure is shown in Figure 5A. It can be seen that although both the overall RMSD, as well as those of the EF-hand motifs, are lower in the final structure than in the initial structure, the calcium cation was not able to correct the distorted EF-hands. It did not enter into any of the EF-hand motifs but stayed in a region between the two distorted regions in the core of the structure.

Figure 5.

Figure 5.

(A) Stereo view of one calcium cation in the lowest energy structure obtained by folding apo-calbindin D9K with CSA, using the force-field of equation 1 and minimizing the total energy of the complex. The binding sites have significant helical content. Because of the distortion of the binding sites, the calcium cation is placed in the center of the protein so that it is close to as many interacting residues as possible. The calcium cation in this placement can interact with acidic/amide containing residues from both binding sites and from the helices. (B) Stereo view of two calcium cations in the structure of A after minimization of the total energy of the complex. The second calcium also ends up in the middle of protein. Further, helices on both sides of the proteins have shifted so that many of the interacting residues are close to the calcium cations. This shows that it is more energetically favorable for this force-field to shift helices than to unwind them in the distorted binding sites.

A closer examination of this preferential site for the calcium cation in Figure 5A shows that it is rich in interacting residues from both of the EF-hand motifs. In an undisturbed EF-hand, most of the interacting residues point toward the center of the EF-hand motif. But here, because of the helical pattern of the distorted EF-hands, the interacting residues alternately point inside and outside of the center of the EF-hands. As a result, some of the interacting residues from both of the EF-hands provide an optimal site between the two EF-hands for the calcium cation to bind.

Initially, we thought that only one calcium cation by itself might not be able to correct the distorted EF-hands. However, after introducing the second calcium cation, the total Cα RMSD of the final structure increased to 6.19 Å, and the RMSDs between the first and the second EF-hands became 3.95 Å and 4.45 Å, respectively. Compared with the original structure, this structure is farther from the crystal structure of calbindin D9K. This structure is shown in Figure 5B. The two calcium cations are in the core of the protein, and they are surrounded by the interacting residues of the EF-hands and the two adjacent helices. They interact with Glu17 and Glu27 of the first EF-hand motif and Glu48, Asp54, Asp58, Glu60, and Asn67 of the second EF-hand motif. Further, the helices on the sides of the EF-hands have shifted so that they provide as many interacting residues to the two calcium cations as possible. This shift of the helices is the reason for the increase in the RMSD and the deviation between this structure and the crystal structure of calbindin D9K.

Discussion

The results obtained in this study demonstrate that our calcium force-field is capable of locating the EF-hand motifs in the calcium-bound and apo forms of proteins. In other words, when the binding site has the correct or nearly correct geometry, the force-field locates the EF-hand motif as the most energetically favorable place for the calcium cation to bind after it has explored all the possible sites that interact with the calcium cation. This force-field can, therefore, be implemented to locate the EF-hand motif when other methods are used to predict the fold. Effort is under way in our laboratory to incorporate the calcium cation in simulated folding.

However, the force-field is not able to correct the topology of a distorted binding site, as can be seen in Figure 5, A and B. The most probable reason for this is that the UNRES force-field is not perfect in reproducing loop geometry; it tends to assign some secondary structure to loops. As can be seen in Figure 5B, the helices from both sides of the loops are arranged to surround the calcium cations by the greatest number of interacting residues. This means that it is energetically more favorable for this force-field to reposition an entire helix than to open a helix turn. However, the calcium-binding constant to α-lactalbumin is of the order of 107, so that the free energy of binding is ~10 kcal/mole (Permyakov et al. 2001); this is enough to unfold two or three turns of a helix.

The possible reasons for the distortion of the geometry of the calcium coordination sphere (i.e., the location of the calcium with respect to the interacting residues in the binding site) obtained with the force-field derived in this work are: (1) The parameterization was not refined by reproducing the geometry of high-resolution crystal structures of calcium-bound peptides; and (2) the multibody terms to express the calcium-interaction in the potential-energy function were absent. Both issues are now being investigated in our laboratory.

Materials and methods

The UNRES energy function

In the UNRES model (Liwo et al. 1997a,b, 1998, 1999a,Liwo et al. b, 2001, 2002, 2004; Lee et al. 1999), a polypeptide chain is represented as a sequence of α-carbon atoms, (Cαs), linked by virtual bonds with attached united side-chains (SCs), and united peptide groups (ps). Each united peptide group is located in the middle of two consecutive α-carbon (Cα) atoms (Fig. 6).

Figure 6.

Figure 6.

The UNRES model of the polypeptide chain with a calcium cation near an interacting residue. Filled circles represent united peptide groups (p); open circles represent the Cα atoms, which serve as geometric points; and ellipsoids represent side-chains. The variables to change the conformation of the polypeptide chain are the virtual-bond angles θ, the virtual-bond dihedral angles γ, and the angles αSC and βSC that define the location of a side-chain with respect to the backbone. For a side-chain with the center of mass at SCi, which is a calcium-binding site (Asp, Glu, Asn, or Gln), the calcium-interaction site, SCi, is shifted by a distance δSCi along the respective Cα•••SC axis, whereas for the remaining side-chains as well as for the peptide groups, the interaction site coincides with the center of mass, SCi. The distance dSCiof a calcium cation from SCi is calculated from its distance dCa2+ from Ciα, the angle αCa2+ between the Ciα•••SCi axis and the Ciα•••Ca2+ vector. ζCa2+ is the angle of rotation of Ca2+ around the Ciα•••SCi axis.

The interaction sites are the united peptide groups and the united side-chains. The α-carbon atoms are geometric points. All virtual-bond lengths (i.e., Cα•••Cα and Cα•••SC) are fixed; the distance between the neighboring Cαs is 3.8 Å, corresponding to trans peptide groups, whereas the side-chain angles (αSC and βSC), the virtual-bond angles (θs), and dihedral angles (λs) can vary. In this work, we introduce calcium cations. For side chains that bind the calcium cation specifically (Asp, Glu, Asn, and Gln), the calcium cation interacts with the carboxyl and carbonyl groups in the side-chain and not with the center of mass of the side-chain. However, UNRES ellipsoids do not contain the details of the side-chains. Therefore, the calcium-interaction site is assumed to lie on the Cα•••SC axis and to be shifted from the SC center of mass by the distance δSC; This new center is called SC′. Therefore, the location of a calcium cation with respect to a favorable amino acid residue i is described by its distance dSC′i from SC′ and the Ca2+•••Ci&agr;•••SCi angle (αCa2+) as shown in Figure 6. The Ca2+ cation can rotate about the Cα•••SC axis by the angle ζCa2+, which, together with dSCi and αCa2+, defines its location with respect to the polypeptide chain. It should be noted that the energy of interaction of the SC site with the calcium does not depend on ζCa2+, because the corresponding expression has cylindrical symmetry with respect to the Cα•••SC axis. For the remaining side-chain types, calcium is assumed to interact with the center of mass of the ellipsoid. For the peptide groups of the backbone, calcium is assumed to interact with the center of mass of the peptide.

UNRES is a physics-based force-field, which is derived as a Restricted Free Energy (RFE) function of a polypeptide chain. The RFE is defined as the free energy of a given coarse-grain conformation obtained by integrating the Boltzmann factor of the all-atom (i.e., the polypeptide chain-plus-solvent) energy over the degrees of freedom that are neglected in the united-residue model (Liwo et al. 1998, 1999b, 2001). The complete UNRES potential-energy function, which also includes the protein-calcium cation interaction terms that are introduced in this work, is expressed by equation 1:

graphic file with name M1.gif (1)

The terms USCiSCj correspond to the mean free energy of hydrophobic (hydrophilic) interactions between the side-chains. These terms implicitly contain the contributions from the interactions of the side-chain with the solvent. The terms USCipj correspond to the excluded-volume potential of the side chain–peptide group interactions. The terms Upipj represent the energy of average electrostatic interactions between backbone peptide groups. The terms Utor and Utord are the torsional and the double-torsional potentials, respectively, for rotation about a given virtual bond or two consecutive virtual bonds. The terms Ub and Urot are the virtual-angle–bending and side chain–rotamer potentials. The terms Ucorr (m) correspond to the correlations (of order m) between peptide-group electrostatic and backbone-local interactions. The last five terms pertain to the calcium–polypeptide chain interactions, which are described in the next section.

The terms USCiSCj, Ub, and Urot were parameterized (Liwo et al. 1997a,b) from the distribution and correlation functions determined from the PDB. Utor, Utord, and Ucorr were based on the cumulant expansion of the RFE of polypeptide chains (Liwo et al. 1998, 1999b, 2001) and parameterized from the RFEs of model systems obtained by high-level MP2/6–31G(d,p) ab initio calculations of model system (Oldziej et al. 2003; Liwo et al. 2004). Finally, the ws are weights of the various energy terms, obtained by optimization of the total potential-energy function to obtain a funnel-like energy landscape of benchmark proteins (Liwo et al. 2002).

Analytical expressions for energy of interaction of calcium cations with polypeptide chains

Binding of the calcium cation to the carboxyl and carbonyl groups is mainly electrostatic in nature. Therefore, to derive analytical expressions for the RFE of interaction of the calcium cation with the Asp, Glu, Asn, and Gln side-chains and with the peptide groups, we assumed that the charge distribution of each of these interaction sites (and, consequently, the interaction energy with the calcium cation) is expressed by a multipole series, which provided simplified analytical expressions for the interaction energy. Subsequently, by averaging these expressions over the secondary degrees of freedom (i.e., the rotation angle ζ of the Ca2+ cation about the Cα-SC axis shown in Fig. 6), we derived approximate analytical expressions for the mean-field interaction energies of the calcium cation with its interaction sites.

The Asp and Glu side-chains bear a net charge of -1e and also possess a quadripole moment, which is necessary to describe the angular dependence of the calcium-side-chain interaction energy. For charged systems, choosing the origin of the reference system in the charge center implies that the dipole moment is zero. However, because it is not know a priori what choice of the origin of the reference system provides the analytical expression best fitting the numerically calculated RFE surfaces, even for charged side-chains, we introduced a dipole moment parallel to the rotation axis. The energy of interaction of the calcium cation with the point charge and with the dipole moment does not depend on the rotation angleζ; therefore, it is necessary to average only the calcium–quadripole interaction energy (Fig. 7). In this figure, for clarity we have shown only the quadruple moment placed at SC′. However, in our derivations, there is also a dipole moment along the Z-axis and a point charge located at SC′.

Figure 7.

Figure 7.

Illustration of the linear quadripole representing the asymmetry of the charge distribution of the carboxylate group and of the definition of its local coordinate system. The site SC′ corresponds to that of Figure 6. q is any point charge (here +2 corresponding to the charge of Ca2+). dSC is the distance between the point charge q and the center of interaction SC′, and α corresponds to the angle that dSC makes with the rotation axis. ζ is the angle of rotation.

For an ionized carboxyl group, we assume that a linear quadripole composed of a charge 2q lies on the axis of rotation at the position SC′ and two charges of -q are positioned symmetrically with respect to the axis of rotation and separated by a distance d from the central charge of 2q, as shown in Figure 7.

The magnitude of the quadripole moment is given by equation 2:

graphic file with name M2.gif (2)

The energy of interaction of the quadripole placed at SC′ with a point charge of magnitude q′ positioned at coordinates x, y, and z is given by equation 3

graphic file with name M3.gif (3)

where dSC′ = x2 + y2 + z2 and is the dielectric constant. Transforming equation 3 from Cartesian coordinates to internal coordinates dSC′, α, ζ such that

graphic file with name M4.gif (4)

we obtain equation 5.

graphic file with name M5.gif (5)

The energy averaged over the rotation angle ζ is expressed by equation 6:

graphic file with name M6.gif (6)

The energy of interaction of a point dipole (with dipole moment p) with a point charge q′ (positioned at a distance dSC′ from the dipole and angle α with respect to the dipole axis) is expressed by equation 7:

graphic file with name M7.gif (7)

To represent the point charge energies, we use the expression for the Coulombic charge–charge interaction energy and finally a Lennard-Jones term to prevent the collapse of the calcium cation on the respective side-chain. Because the distance δSCi is much smaller than dSCi and dCa2+ in Figure 6, we assume that α in Figure 7 is approximately equal to αCa2+ in Figure 6 for simplicity. Taking all this into account, we obtain equation 8 for the energy of interaction of a calcium cation with an aspartate or glutamate side-chain, UAGCa2+.

graphic file with name M8.gif (8)

where X denotes the Asp or the Glu side-chain, ɛ XCa2+ and r0 XCa2+ are the parameters of the Lennard-Jones potential, wc,X is the parameter of the Coulomb term, wdip,X is the parameter of the dipole term, and wquad1,X and wquad2,X are the parameters of the quadripole term, respectively.

The expressions for the interaction energy between the Asn and Gln side-chains and calcium (UAGNCa2+) or the peptide groups and calcium (UpCa2+) are similar. Because these groups are neutral, there is no charge–charge interaction term:

graphic file with name M9.gif (9)

where X denotes the Asn or the Gln side-chain or the peptide group, and the meaning of the symbols is the same as described by the text under equation 8.

The constants of the Lennard-Jones potential ɛ and r0 were estimated from the calcium cation parameters of the AMBER force-field (Pearlman et al. 1995). Determination of other parameters of equations 8 and 9 is described in “Parameterization of the Expressions for Calcium–Polypeptide Interaction Energy.”

The terms USCiCa2+ correspond to the interaction between the calcium cation and any side-chains other than Asp, Glu, Asn, and Gln introduced to avoid clashes of the calcium cation with the side-chains. They are expressed by a Lennard-Jones potential, as given by equation 10.

graphic file with name M10.gif (10)

where

graphic file with name M11.gif (11)
graphic file with name M12.gif (12)

where r0Ca2+ = 3.4725 Å (Mayo et al. 1990), the values of σSCi were obtained from (Liwo et al. 1997a), and the values of ɛSCiCa2+ were all arbitrarily set at 0.2 kcal/mole.

Finally, UCa2+Ca2+ corresponds to the repulsive interaction between two calcium cations, and consists of an excluded-volume Lennard-Jones term and a Coulomb term in equation 13:

graphic file with name M13.gif (13)

where r0Ca2+ is defined in the text under equation 11, ɛCa2+ = 0.05 kcal/mole (Mayo et al. 1990), k = 332 is the conversion factor to express the energy in kilocalories per mole if the distance and charge are expressed in angströms and electron charge units, respectively, qCa2+ = 2e is the charge of the calcium cation, and ɛ is the dielectric constant of water.

Parameterization of the expressions for calcium–polypeptide interaction energy

As in our recent work on the parameterization of the torsional and correlation terms of the UNRES force-field (Oldziej et al. 2003; Liwo et al. 2004), the parameters of the calcium–polypeptide interaction terms were obtained from quantum mechanical calculations on model compounds, followed by fitting the resulting RFE hypersurfaces to the analytical expressions.

The energy landscapes for each of the different interacting residues were obtained by ab initio quantum mechanical calculations at the Restricted Hartree-Fock (RHF) level with the 6–31 G(d,p) basis set. The program GAMESS (Schmidt et al. 1993) was used in all calculations. The systems studied were Ca2+−AcO, Ca2+−PrO, Ca2+−AcNH2, Ca2+−PrNH2, and Ca2+−AcNHMe (where Ac and Pr denote the acetyl and propionyl group, respectively) to model the interactions of the calcium cation with the ionized aspartate and glutamate side-chains, the asparagine and glutamine side-chains, and the peptide group, respectively. The valence geometry of the side-chain sites was not optimized. The bond lengths and angles were taken from ECEPP/3 geometry (Némethy et al. 1992), and the peptide groups were assumed to be planar. The systems are shown in Figure 8, A through E.

Figure 8.

Figure 8.

Settings for the calculation of the RFEs of model systems with quantum mechanical calculations to derive the parameters for the force-field. (A) Ca2+-Asp system, modeled as AcO. d (equivalent to dSCi in Fig. 6) is varied from 3 Å to 7 Å in 1 Å increments. αCa2+ is changed from 0° to 180° in 15° increments. χ1 changes from 0° to 360° in 30° increments and ζCa2+ changes from 0° to 180° in 30° increments. (B) Ca2+-Glu system, modeled as PrO. d (equivalent to dSCi in Fig. 6) is varied from 3 Å to 7 Å in 1 Å increments. αCa2+ is changed from 0° to 180° in 15° increments. χ 1 and χ 2 change from 0° to 360° in 30° increments and ζCa2+ changes from 0° to 180° in 30° increments. (C) Ca2+-Asn system, modeled as AcNH2. d (equivalent to dSCi in Fig. 6) is varied from 3 Å to 7 Å in 1Å increments. αCa2+ is changed from 0° to 360° in 15° increments. χ 1 changes from 0° to 360° in 30° increments and ζCa2+ changes from 0° to 360° in 30° increments. (D) Ca2+-Gln system, modeled as PrNH2. d (equivalent to dSCi in Fig. 6) is varied from 3 Å to 7 Å in 1 Å increments. αCa2+ is changed from 0° to 360° in 15° increments. χ1 and χ2 change from 0° to 360° in 30° increments and ζCa2+ changes from 0° to 360° in 30° increments. (E) Ca2+-peptide system, modeled as AcNHMe. P corresponds to the center of mass of the molecule (equivalent to the peptide centers [p] in Fig. 6). Unlike other systems, AcNHMe is rigid and calcium rotates around it. d is varied from 2 Å to 6 Å in 1 Å increments. αCa2+ is changed from 0° to 180° in 15° increments. ζCa2+ changes from 0° to 180° in 30° increments.

For system X, the variables were the distance of the calcium cation from the center of mass of the system (SC in Fig. 8A–E), d, the angle between the Cα•••SC axis, αCa2+, the angle of rotation of the Ca2+ cation about the Cα•••SC axis, ζCa2+, and the angle χ is the dihedral angle about a bond as shown in Figure 8, A through D. For Ca2+−AcO, Ca2+−PrO, Ca2+−AcNH2, and Ca2+−PrNH2, the energy was evaluated on the following grid: 3Å ≤ d ≤ 7 Å with a 1 Å increment, 0° ≤ αCa2+ ≤ 180° with a 15° increment, 0° ≤ ζ Ca2+ ≤ 180° with a 30° increment, and 0° ≤ χ ≤ 360° with a 30° increment. For Ca2+−AcNHMe, unlike other systems, AcNHMe is assumed to be rigid and the energy was evaluated on the following grid: 2Å ≤ d ≤ 6 Å with a 1 Å increment, 0° ≤ α Ca2+ ≤ 180° with a 15° increment, and 0° ≤ ζ Ca2+ ≤ 180° with a 30° increment (Fig. 8E). Subsequently, the RFE surfaces in d and αCa2+ were calculated by integrating out the remaining degrees of freedom, as given by equation 14:

graphic file with name M14.gif (14)

where n is the number of dihedral angles χ in a system, χ denotes the vector of the dihedral anglesχ, and β = 0.1 (or, in other words, scaling down the energies) to account for the screening of the calcium cation by the solvent and, thereby, decreasing the interaction energy calculated in vacuo. Given the dielectric constant of water (78), the effective value of β at 298 K should be 1.69/78.4 ≅ 0.02. However, we consider short distances here between the calcium cation and the interacting residues, and the effective dielectric constant is smaller; therefore, β should take a value between 0.02 and 1.69, selected here as 0.1. Integration was carried out numerically.

The approximate analytical expressions for calcium–side-chain and calcium–peptide interaction energy were fitted to the RFE calculated numerically for each calcium–side-chain and calcium–peptide system by means of a nonlinear least-squares Marquardt method (Marquardt 1963). The resulting parameters are shown in Table 1. To take the effect of hydration into account, all the weights of the charge–charge (wc,X), charge–dipole (wdip,X), and charge–quadripole (wquad1,X) interaction terms in equations 8 and 9 were divided by the dielectric constant of water (i.e., ɛ = 78).

The weights wAGCa2+, wAGNCa2+, wSCCa2+, wpCa2+, and wCa2+Ca2+ of the energy terms in equation 1 were set arbitrarily so that the calcium–polypeptide chain energy term is about one-fourth of the polypeptide chain energy as represented by the UNRES terms of equation 1. This led to the best performance after the force-field of equation 1 was tested on the crystal structure of a number of calcium-binding proteins, and the placement of the calcium in the binding site was monitored. The weights for each of the energy terms of equation 1 are provided in Table 2.

Table 2.

The weights associated with the energy terms of the force-field of equation 1

Weight Value
wSCSC 1.00000
wSCp 2.07913
wel 0.54841
wtor 1.72954
wtord 1.23544
wb 2.31367
wrot 0.47054
wloc-el(3) 0.87864
wloc-el(4) 1.34109
wloc-el(5) 0.02730
wloc-el(6) 0.00741
wturn(3) 1.49008
wturn(4) 1.20192
wturn(6) 0.02391
wSCCa2 5.00000
wAGCa2 5.00000
wAGNCa2 10.0000
wpCa2 5.00000
wCa2Ca2 10.0000

The subscript for each weight corresponds to the energy term associated with it. For example, wSCp corresponds to the weight of the peptide–side chain interaction energy. The weights wloc-el(3) to wturn(6) are the weights of the Ucorr(m) terms, where m varies from 2 to 6.

Docking calcium cations to calcium-binding proteins

To find the optimal position of the calcium cation(s) in the crystal structure of calcium-bound proteins (from which the calcium cation(s) were removed), we ranked the different conformations obtained by placing the calcium cation in the neighborhood of a given side chain (i.e., Asp, Glu, Asn, or Gln) by energy based on predetermined distributions of dSC′ and α Ca2+ (see below). Subsequently, in each case, the energy of the system was minimized with respect to the calcium coordinates (keeping the polypeptide chain fixed). The conformation with the lowest energy was selected as the one with the calcium bound to the optimal calcium-binding site. To determine the distributions of dSC′ and αCa2+, we used the crystal structures of calcium-bound proteins from the PDB. The list of the proteins is available in the electronic version under the supplementary materials. We obtained the following mean and the variance for the distance distribution and angle distribution: <dSC′> = 4.5 Å and [<dSC′2> − <dSC>2]1/2 = 1.3 Å, whereas <αCa2+> = 80° and [<α2Ca2+> − <αCa2+>2]1/2 = 41°, respectively.

For the apo- and the UNRES-predicted structure, the procedure for searching the optimal binding site was similar to that described above except that all degrees of freedom (i.e., those of the calcium cation and those of the polypeptide chain) were subjected to energy minimization. This allowed the EF-hand motif to adopt the ≪open form≫ conformation upon binding of calcium.

Electronic supplemental material

The PDB code of the proteins that are used to calculate the probability distribution for the placement of calcium near an acidic residue is available in the electronic version.

Acknowledgments

This work was supported by NSF grant no. MCB00-03722. M.K. was an NIH biophysics trainee.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04878904.

Supplemental material: see www.proteinscience.org

References

  1. Bertini, I., Gray, H.B., Lippard, S.J., and Valentine, J.S. 1994. Bioinorganic chemistry. University Science Books, Mill Valley, CA.
  2. Chrysina, E.D., Brew, K., and Acharya, K.R. 2000. Crystal structures of Apo-and Holobovine α-lactalbumin at 2.2 Å resolution reveal an effect of calcium on inter-lope interactions. J. Biol. Chem. 275 37021–37029. [DOI] [PubMed] [Google Scholar]
  3. Declercq, J.P., Evrard, C., Lamzin, V., and Parello, J. 1999. Crystal structure of the EF-hand parvalbumin at atomic resolution (0.91 Å) and at low temperature (100 K). Evidence for conformational multistates within the hydrophobic core. Protein Sci. 8 2194–2204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Dudev, T. and Lim, C. 2003. Principles governing Mg, Ca, and Zn binding and selectivity in proteins. Chem. Rev. 103 773–787. [DOI] [PubMed] [Google Scholar]
  5. Dudev, T., Lin, Y., Dudev, M., and Lim, C. 2003. First-second shell interaction in metal binding sites in proteins: A PDB survey and DFT/CDM calculations. J. Am. Chem. Soc. 125 3168–3180. [DOI] [PubMed] [Google Scholar]
  6. Einspahr, H. and Bugg, C.E. 1984. Metal ions in biological systems, Vol. 17 (ed. H. Sigel), pp. 51–97. Dekker, Basel, Switzerland.
  7. Forsen, S. and Koerdel, J. 1996. Biomolecular structure and dynamics: Experiment and theory. J. Pharmaceut. Biomed. 14 233–246. [DOI] [PubMed] [Google Scholar]
  8. Frausto da Silva, J.J.R. and Williams, R.J.P. 1991. The biological chemistry of the elements: The inorganic chemistry of life. Oxford University Press, New York.
  9. Hanna, R. and Doudna, J. 2000. Metal ions in ribozyme folding and catalysis. Curr. Opin. Chem. Biol 4 166–170. [DOI] [PubMed] [Google Scholar]
  10. Jernigan, R., Raghunathan, G., and Bahar, I. 1994. Characterization of interactions and metal-ion binding-sites in proteins. Curr. Opin. Struct. Biol. 4 256–263. [Google Scholar]
  11. Katz, A.K., Glusker, J.P., Beebe, S.A., and Bock, C.W. 1996. Calcium ion coordination: A comparison with that of beryllium, magnesium and zinc. J. Am. Chem. Soc. 118 5752–5763. [Google Scholar]
  12. Kretsinger, R.H. and Nockolds, C.E. 1973. Carp muscle calcium-binding protein, II: Structure determination and general description. J. Biol. Chem. 248 3313–3326. [PubMed] [Google Scholar]
  13. Lee, J., Liwo, A., and Scheraga, H.A. 1999. Energy based de novo protein folding by conformational space annealing and an off-lattice united-residue force-field: Application to the 10–55 fragment of staphylococcal protein A and to apo-calbindin D9K. Proc. Natl. Acad. Sci. 96 2025–2030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Lippard, S.J. and Berg, J.M. 1994. Principles of bioinorganic chemistry. University Science Books, Mill Valley, CA.
  15. Liwo, A., Oldziej, S., Pincus, M.R., Wawak, R.J., Rackovsky, S., and Scheraga, H.A. 1997a. A united-residue force field for off-lattice protein-structure simulations, I: Functional forms and parameters of long-range side-chain interaction potentials from protein crystal data. J. Comput. Chem. 18 849–873. [Google Scholar]
  16. Liwo, A., Pincus, M.R., Wawak, R.J., Rackovsky, S., Oldziej, S., and Scheraga, H.A. 1997b. A united-residue force field for off-lattice protein-structure simulations, II: Parameterization of short-range interactions and determination of weights of energy terms by Z-score optimization. J. Comput. Chem. 18 874–887. [Google Scholar]
  17. Liwo, A., Kazmierkiewicz, R., Czaplewski, C., Groth, M., Oldziej, S., Wawak, R.J., Rackovsky, S., Pincus, M.R., and Scheraga, H.A. 1998. United-residue force field for off-lattice protein-structure simulations, III: Origin of backbone hydrogen-bonding cooperativity in united-residue potentials. J. Comput. Chem. 19 259–276. [Google Scholar]
  18. Liwo, A., Lee, J., Ripoll, D.R., Pillardy, J., and Scheraga, H.A. 1999a. Protein structure prediction by global optimization of a potential energy function. Proc. Natl. Acad. Sci. 96 5482–5485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Liwo, A., Pillardy, J., Kazmierkiewicz, R., Wawak, R.J., Groth, M., Czaplewski, C., Oldziej, S., and Scheraga, H.A. 1999b. Prediction of protein structure using knowledge-based off-lattice united-residue force field and global optimization methods. Theor. Chem. Accounts 101 16–20. [Google Scholar]
  20. Liwo, A., Czaplewski, C., Pillardy, J., and Scheraga, H.A. 2001. Cumulant-based expressions for the multibody terms for the correlation between local and electrostatic interactions in the united-residue force field. J. Chem.Phys. 115 2323–2347. [Google Scholar]
  21. Liwo, A., Arlukowicz, P., Czaplewski, C., Oldziej, S., Pillardy, J., and Scheraga, H.A. 2002. A method for optimizing potential-energy functions by a hierarchical design of the potential-energy landscape: Application to the UNRES force field. Proc. Natl. Acad. Sci. 99 1937–1942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Liwo, A., Oldziej, S., Czaplewski, C., Kozlowska, U., and Scheraga, H.A. 2004. Parameterization of backbone-electrostatic and multibody contributions to the UNRES force field for protein-structure prediction from ab initio energy surfaces of model systems. J. Phys. Chem. B 108 9421–9438. [Google Scholar]
  23. Marquardt, D.W. 1963. An algorithm for least-square estimation of nonlinear parameters. J. Soc. Indust. Appl. Math. 11 431–441. [Google Scholar]
  24. Mayo, S.A., Olafson, B.D., and Goddard, W.A. 1990. DREIDING: A generic force-field for molecular simulations. J. Phys. Chem. 94 8897–8909. [Google Scholar]
  25. McPhalen, C.A., Strynadka, N.C., and James, M.N. 1991. Calcium binding site in proteins: A structural perspective. Adv. Prot. Chem. 42 77–144. [DOI] [PubMed] [Google Scholar]
  26. Némethy, G., Gibson, K.D., Palmer, K.A., Yoon, C.N., Paterlini, G., Zagari, A., Rumsey, S., and Scheraga, H.A. 1992. Energy parameters in polypeptides, 10: Improved geometrical parameters and nonbonded interactions for use in ECEPP/3 algorithm with application to proline-containing peptides. J. Phys. Chem. 96 6472–6484. [Google Scholar]
  27. Oldziej, S., Kozlowska, U., Liwo, A., and Scheraga, H.A. 2003. Determination of the potentials of mean force for rotation about Cα•••Cα virtual bonds in polypeptides from the ab initio energy surfaces of terminally-blocked glycine, alanine, and proline. J. Phys. Chem. A 107 8035–8046. [Google Scholar]
  28. Pearlman, D.A., Case, D.A., Caldwell, J.W., Ross, W.S., Cheatham, T.E., De-Bolt, S., Ferguson, D., Seibel, G., and Kollman, P. 1995. ≪AMBER≫: A package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules. Comp. Phys. Commun. 91 1–42. [Google Scholar]
  29. Permyakov, S.E., Uversky, V.N., Veprintsev, D.B., Cherskaya, A.M., Brooks, C.L., Permyakov, E.A., and Berliner, L.J. 2001. Mutating aspartate in the calcium-binding site of α-lactalbumin: Effects on the protein stability and cation binding. Prot. Eng. 14 785–789. [DOI] [PubMed] [Google Scholar]
  30. Pidcock, E. and Moore, G.R. 2001. Structural characteristics of protein binding sites for calcium and lanthanide ions. J. Biol. Inorg. Chem. 6 479–489. [DOI] [PubMed] [Google Scholar]
  31. Schmidt, M.W., Baldridge, K.K., Boatz, J.A., Elbert, S.T., Gordon, M.S., Jensen, J.H., Koseki, S., Matsunaga, N., Nguyen, K.A., Su, S., et al. 1993. General atomic and molecular electronic structure system. J. Comput. Chem. 14 1347–1363. [Google Scholar]
  32. Skelton, N.J., Koerdel, J., and Chazin, W.J. 1995. Determination of the solution structure of apo calbindin D9K by NMR spectroscopy. J. Mol. Biol. 249 441–462. [DOI] [PubMed] [Google Scholar]
  33. Svensson, L.A., Thulin, E., and Forsen, S. 1992. Proline cistrans isomers in calbindin D9K, observed by x-ray crystallography. J. Mol. Biol 223 601–606. [DOI] [PubMed] [Google Scholar]
  34. Yamashita, M.M., Wesson, L., Eisenman, G., and Eisenberg, D. 1990. Where metal ions bind in proteins. Proc. Natl. Acad. Sci. 87 5648–5652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Yang, W., Lee, H.W., Hellinga, H., and Yang, J.J. 2002. Structural analysis, identification, and design of calcium-binding sites in proteins. Proteins 47 344–356. [DOI] [PubMed] [Google Scholar]
  36. Yang, W., Jones, L.M., Isley, L., Yiming, Y., Lee, H.W., Wilkins, A., Liu, Z.R., Hellinga, H.W., Malchow, R., Ghazi, M., et al. 2003. Rational design of a calcium-binding protein. J. Am. Chem. Soc. 125 6165–6171. [DOI] [PubMed] [Google Scholar]
  37. Yap, K.L., Ames, J.B., Swindells, M.B., and Ikura, M. 1999. Diversity of conformational states and changes within the EF-hand protein superfamily. Proteins 37 499–507. [DOI] [PubMed] [Google Scholar]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES