Abstract
Computing the volume occupied by individual atoms in macromolecular structures has been the subject of research for several decades. This interest has grown in the recent years, because weighted volumes are widely used in implicit solvent models. Applications of the latter in molecular mechanics simulations require that the derivatives of these weighted volumes be known. In this article, we give a formula for the volume derivative of a molecule modeled as a space-filling diagram made up of balls in motion. The formula is given in terms of the weights, radii, and distances between the centers as well as the sizes of the facets of the power diagram restricted to the space-filling diagram. Special attention is given to the detection and treatment of singularities as well as discontinuities of the derivative.
Keywords: molecular dynamics|implicit solvent models|spheres|Delaumay triangulation
It is widely believed that characterizing the geometry of a protein molecule is essential for understanding its folding process as well as its interactions with other biomolecules and small ligands. Among all geometric measures, volume is probably the most fundamental property to study. The relevance of calculations of molecular volume is seen in the correlation of volume, or change in volume, with experimental factors such as energetics (1) as well as with physical chemical properties of the molecule such as charge, temperature, and pressure (2, 3). There is a large amount of current research that focuses on the calculations of standard atomic volumes (4–6). These atomic volumes have been considered as parameters that can be used to measure the quality of a three-dimensional protein structure (7) as well as probes to quantify structural variations in proteins that occur during evolution (8). Atomic volumes have also been used in packing (9–11) and solvation-energy calculations (12, 13). The latter energy function is central to computational biology, because solvation is an important factor determining the structure and thermodynamics properties of molecules in aqueous solution. All solvent effects on a molecule can be included in an effective solvation potential, W = Welec + Wnp, in which the first term accounts for the molecule–solvent electrostatics polarization, and the second term accounts for the molecule–solvent van der Waals interactions and for the formation of a cavity in the solvent. Welec is usually described by a continuum model such as the generalized Born model (13, 14). Interestingly, an essential element of such a model is the knowledge of the volume associated with the individual atoms of the solute. Many solvation models describe the nonpolar part of the solvation energy Wnp as a weighted sum of the solvent-exposed or -accessible surface area of each atom of the solute (15, 16). There is evidence, however, that for small solute the hydrophobic term Wnp is not proportional to the surface area (17) but rather to the excluded volume of the molecule (18). A volume-dependent solvation term was originally introduced by Gibson and Scheraga (19) as the hydration shell model. Biomolecular simulations with implicit solvent that rely on iterative energy calculations such as energy minimization and molecular dynamics require efficient, exact algorithms for the evaluation of the solvation-energy function and its derivatives. In this article, we provide a complete geometric description of the derivatives of the weighted volume of a molecule when its atoms are in motion.
Lee and Richards (20) define the accessible surface of a protein as the van der Waals surface of the molecule expanded by the radius of the solvent sphere about each atom center. The solvent-excluded volume is the volume enclosed by the accessible surface. Computational methods that evaluate the solvent-excluded volume can be divided into approximate and exact methods. Most of the approximate methods rely on numerical integration (21–23), whereas most analytical methods (9, 12, 24–27) point out that overlap volumes of four or more atoms do not occur in real molecules. An alternative approach for computing the volume and studying the packing of balls was originally developed by Voronoi (28) and later was applied to protein by Richards (9). It has been used successfully for the calculation of protein constituents, the description of protein motions, and the analysis of cavities in proteins (2, 8, 29, 30). A drawback of this approach is that it requires information on the first shell of hydration so that polyhedra may be defined for atoms at the surface of the molecule. It was also developed for equal-size balls and its applications to proteins with atoms that have different radii required approximations (30). Pavani and Ranghino (22) proposed a method for computing the volume of a molecule by using the inclusion–exclusion principle. In their implementation, only intersections of up to three balls were considered. This idea was generalized to any levels of intersections by Gibson and Scheraga (26), applying a theorem from mathematics that higher-order unions and intersections can always be reduced to lower-order unions and intersections (31). Doing the reduction correctly, however, remains computationally difficult and expensive. The alpha-shape theory solves this problem exactly by using Delaunay triangulations and their filtrations (32). Alpha shapes have been used to compute the surface area and volume of proteins as well as for detecting and measuring cavities in proteins (33). Fig. 1 provides a simple overview of the application of the alpha-shape theory to compute the volume of a protein.
Figure 1.
Computing geometric properties of a molecule. As is common in biology, atoms are treated as intersecting balls, the union of which forms the space-filling diagram. Gerstein et al. (8) and Richards (9) proposed to study this union by using the Voronoi diagram (or more exactly the power diagram since the balls have different radii). That diagram divides the space into convex cells, one per atom. In the two-dimensional example shown here, the edges that separate these cells are shown as solid inside and dotted outside the union of disks. Note that the power cells of some surface atoms extend to infinity. The superimposed Delaunay triangulation (thick solid and dashed lines) is the dual of the power diagram obtained by drawing a line segment between two ball centers if their convex cells share a common edge. Despite their different appearance, the Delaunay triangulation and the power diagram contain exactly the same information. The key to connecting the power diagram to the molecule is to consider the intersection of the two; this defines a convex decomposition of the space-filling diagram (i.e. the light shaded area inside the disks, divided into convex regions by the solid part of the Voronoi edges). The dual of this decomposition is the dual complex (thick solid lines and shaded triangles). The dual complex is a subset of the Delaunay triangulation and contains all simplices (tetrahedra, triangles, and edges) that correspond to overlapping atoms. As demonstrated in ref. 32, the dual complex contains all information about a molecule required to compute its surface and volume. In this article we show that the same dual complex can be used to compute the derivative of the volume.
The distinction between approximate and exact computation also applies to existing methods for computing the derivatives of the solvent-excluded volume with respect to atomic coordinates (34–36). All these methods have to take a large number of singularities into account, where approximations are usually required (36). This problem of singularities is even more acute for surface-area calculation (37, 38). The alpha-shape theory proposes a robust solution to this problem by implementing arbitrary precision arithmetic to avoid numerical problems and systematically resolving all singularities without explicitly perturbing the positions of the ball centers. The latter method is referred to as Simulation of Simplicity (39). In this article we describe an extension of the alpha-shape method that implements the weighted volume derivative theorem described below to provide an efficient, robust, exact computation of the derivatives of volumes. There is an inherent difficulty in using a potential based on volume for energy minimization or molecular dynamics. Although the volume is continuous in the position of the atoms, its derivatives may not be. We examine this issue within the framework of the alpha-shape method and relate discontinuities to combinatorial changes in the subcomplex of the Delaunay triangulation that is dual to the space-filling diagram of the molecule.
Background and Results introduces the background and states the main result of this paper. Derivation uses geometric arguments to prove the formula for the weighted-volume derivative. Discontinuities and Implementation discuss the continuity of the derivatives and the implementation of the theorem, and Discussion concludes the article.
Background and Results
As common in biology, we model an atom as a ball and a molecule as the union of a finite collection of such balls. This union is referred to as the space-filling diagram of the molecule. In this section we explain how we approach the problem of computing the derivatives of the weighted volume of a three-dimensional space-filling diagram.
Geometric Structures.
Let Bi = (zi, ϱi) be the ball with center zi and radius ϱi for 0 ≤ i < n and call F = ∪i Bi a space-filling diagram. The corresponding state is the point z ∈ ℝ3n that lists the 3n center coordinates in sequence. The weighted volume of F is a map W:ℝ3n → ℝ, and its derivative at z is a linear map DWz:ℝ3n → ℝ defined by DWz(t) = 〈w, t〉, where w ∈ ℝ3n is the gradient of W at z. Specifying the derivative is equivalent to giving the gradient.
Denote the sphere bounding Bi by Si. The power distance of a point x ∈ ℝ3 from Si is πi(x) = ∥x − zi∥2 − ϱ
, and the power cell of Si is the set of points Pi for which Si minimizes the power distance. We assume or simulate the generic case, in which every power cell is a three-dimensional convex polyhedron, and every facet, edge, and vertex belongs to exactly two, three, and four power cells. The power diagram consists of all power cells and their facets, edges, and vertices. As illustrated in Fig. 1, the power diagram decomposes the space-filling diagram into convex cells of the form Pi ∩ F, and these cells share facets, edges, and vertices, which are the intersections of F with the facets, edges, and vertices of the power diagram. The dual complex K of that decomposition consists of all vertices, edges, triangles, and tetrahedra that are convex hulls of sphere centers with cells that intersect in nonempty cells, facets, edges, and vertices.
Sizes and Distances.
We use fractions to express the sizes of geometric entities in the decomposition of the space-filling diagram. For example, βi = vol(Pi ∩ F)/vol(Bi) is the fraction of the ith ball that belongs to its power cell. The volume of F = ∪i Bi is therefore (4π/3) ∑i βiϱ
, and given real weights αi, the weighted volume is W = (4π/3) ∑i αiβiϱ
. The formula for the derivative of W needs fractions of disks, which we now discuss. Two spheres intersect in a circle Sij = Si ∩ Sj, which bounds a disk Bij. The fraction of the disk that belongs to the power diagram is
![]() |
1 |
As explained in ref. 32, it is straightforward to compute these fractions from the dual complex K of F by using the principle of inclusion–exclusion. More details on how to perform this calculation are provided in Appendix. We will also need the area of the facet Pi ∩ Pj ∩ F, which we compute as βijAij. An expression for Aij = area(Bij) in terms of radii and distances between centers of spheres will be given in Derivation (see Eq. 7).
The derivative of W also requires the distance between the line Lij passing through the centers zi and zj and the line Lijk, the points of which have equal power distance from Si, Sj, and Sk. This distance δij,k is equal to the distance between the center zij of Sij and the point yijk on Lijk closest to zij. We note that yijk is the unique point in the plane of zi, zj, and zk that has equal power distance to all three spheres.
Averages.
The derivative of W also refers to the weighted-average vector from the center to the boundary of a facet Pi ∩ Pj ∩ F. The weight is the infinitesimal contribution to the area of the facet as we rotate the vector. That weight is constant over each edge and each arc in the boundary of the facet. We therefore compute the weighted-average vector as the weighted sum of the average vectors of the edges and arcs, with the weights being the areas of the subtended sectors. The average vector from the center to an edge xy = Pi ∩ Pj ∩ Pk ∩ F is the vector to the midpoint, Vxy = (x + y)/2 − zij. Consider next an arc ab on the circle Sij, and let ϕ = ϕab be half the angle, as in Fig. 2. Assuming the center is the origin, the radius is 1, and the arc midpoint lies on the abscissa, we get the average by integration
![]() |
2 |
To give a formula without the normalizing assumptions, we use the midpoint Mab of the arc ab. The average vector is Vab = vab(Mab − zij).
Figure 2.
The average of the solid arc ab lies on the line connecting the center zij and the midpoint Mab of the arc.
In general, the facet Pi ∩ Pj ∩ F has a boundary that consists of several edges and arcs. The weighted-average vector from the center to that boundary is the weighted sum of the individual averages,
![]() |
3 |
where hxy = δij,k is the distance from zij to the line spanned by xy, and ϱij is the radius of Sij. The first sums range over all edges xy, and the second sums range over all arcs ab of the facet. Note that the denominator is twice the area of the facet. We provide a method to compute the weighted-average vector using inclusion–exclusion in Appendix.
Main Result.
The main result of this article is a complete description of the weighted-volume derivative. We need some notation. The weight of the ith ball is αi ∈ ℝ, which may be positive or negative. We write ζij = ∥zi − zj∥ for the distance between two centers and Uij = (zi − zj)/ζij for the unit vector between the same. Recall also the definitions of βij, Aij, and Vij given above.
Weighted-Volume Derivative Theorem.
The derivative of the weighted volume of the space-filling diagram with state z is DWz(t) = 〈w, t〉, where
![]() |
4 |
for 0 ≤ i < n. The sum is over all edges zizj in K.
We illustrate the formula with a small example consisting of three spheres that intersect in two points, the projection of which along the shared edge of the power diagram is shown in Fig. 3. In the unweighted case we have αi = 1, wij = 1, and xij = 0 for all i and j. The derivative thus simplifies to a sum of vectors between the centers: ∑ βijAijUij. As shown by Csikós (40), this formula generalizes to dimensions different from three. We note that DW is not everywhere continuous. Specifically, it may not be continuous when two or more spheres coincide or three or more spheres intersect in a common circle (see Discontinuities).
Figure 3.
Three equal-sized balls with different weights of 1.0, 2.0, and 3.0. For each pair we get components of the gradient normal to (dotted) and parallel to (dashed) the separating facet. The solid vectors are the sums of the components as well as the portions of the gradient that correspond to the different balls.
Derivation
In this section we derive the formula claimed in the Weighted-Volume Derivative Theorem. It is the sum of contributions from locally direction-preserving and locally distance-preserving components of the motion.
Direction-Preserving Motion.
A direction is determined by two spheres and is preserved if one sphere stays put and the other moves along that line. To study the volume derivative for such a motion, we restrict out attention to two spheres Si and Sj with nonempty intersection Sij and define F = Bi ∪ Bj. Let ζi and ζj be the signed distances of the centers from the plane of Sij, as illustrated in Fig. 4. The sum of the two distances is ζij = ζi + ζj = ∥zi − zj∥. We have ϱ
− ϱ
= ζ
− ζ
= ζij(ζi − ζj), and therefore
![]() |
5 |
and
![]() |
6 |
It is geometrically obvious that the weighted-volume derivative is the area of the disk bounded by Sij times the sum of the weighted derivatives of ζi and ζj. Because the square radius of the disk is ϱ
= ϱ
− ζ
, the area of the disk is
![]() |
7 |
where the product ranges over the four different ways of assigning signs. The derivatives of ζi and ζj with respect to ζij can be computed from Eqs. 5 and 6, and the weighted-volume derivative is
![]() |
8 |
![]() |
Alternatively, we could work analytically, compute the weighted-volume contributions of the two balls by integration, and get dW/dζij by differentiation. The result is the same.
Figure 4.
The two spheres intersect in a circle with center zij and radius ϱij.
Distance-Preserving Motion.
The distance between the centers of two spheres Si and Sj is preserved if one rotates about the other. We consider what happens to the facet Pi ∩ Pj ∩ F that separates the two corresponding cells. As illustrated in Fig. 5, that facet is the intersection of a disk and a convex polygon. The boundary consists of edges and circular arcs. Let Ti normal to Uij be the motion vector of Si, and let Ri,j be the line normal to both vectors that passes through the center zij of the disk. The infinitesimal motion of the facet consists of a rotation about Ri,j composed with a translation along Ti, but only the rotation has a nonzero contribution to the volume derivative. On one side of the line we trade volume of Bi for that of Bj, and on the other side we do it the other way round. The net gain or loss is the accumulation of the contributions of the sectors over the edges and arcs in the boundary of the facet. Here we count a sector over an edge negative if the line defined by the edge separates zij and the facet.
Figure 5.
Infinitesimal motion of a facet between two balls Bi and Bj. The two bounding spheres intersect in the dotted circle Sij with center zij, perpendicular to the edge zizj. The (shaded) facet Pi ∩ Pj ∩ F between the corresponding two power cells is the intersection of the disk bounded by Sij with the convex polygon common to the two cells. The solid boundary of the facet consists of edges and circular arcs. The only relevant component of the infinitesimal motion of the facet is a rotation around the line Ri,j perpendicular to the edge zizj and the motion vector Ti. On one side of this line (dark shaded area), we trade volume of Bi for that of Bj, and on the other side (light shaded area) we do it the other way around.
We first consider an edge xy defined by Si, Sj, and a third sphere Sk. We compute the derivative of the signed volume swept out by the triangle zijxy with respect to the rotation angle θ,
![]() |
9 |
where hxy = δij,k is the distance from zij to the line passing through x and y, and Vxy = (x + y)/2 − zij is the vector to the midpoint. Note that the result is two thirds the area of the triangle times the scalar product divided by the distance between the two ball centers.
Second, we consider a circular arc ab, which is part of the circle Sij. We compute the derivative of the volume swept out by the disk sector zijab with respect to θ,
![]() |
10 |
where 2ϕab is the angle of the arc, and Vab is the average vector from the center to the arc. Again, the result is two thirds the area of the sector times the scalar product divided by the distance between the two ball centers. As explained in Background and Results, Vab = vab(Mab − zij), where Mab is the midpoint of the arc and vab = (sin ϕab)/ϕab.
Assembling the Relations.
Let t be the velocity vector of the state z and Ti = [t3i+1, t3i+2, t3i+3]T the velocity vector of zi. Because the derivative is linear, we can decompose the motion into components and add the contributions. For every ordered pair i, j, we consider the direction-preserving component 〈Uij, Ti〉Uij and the distance-preserving component Ti − 〈Uij, Ti〉Uij of Ti.
The contribution of the direction-preserving component is the fraction of the disk Bij that belongs to the power diagram times what is given in Eq. 8. This is βijAijwij〈Uij, Ti〉, where wij is given in Weighted-Volume Derivative Theorem. The contribution of the distance-preserving component is the weight difference times two thirds the area times the weighted average vector divided by the distance between the ball centers. The factor 2/3 is necessary to change the factor 1/2, which is used in the area computation of sectors, to 1/3, which arises in the volume computation of cones. The contribution is then xijβijAij〈Vij, Ti〉, where xij is given in Weighted-Volume Derivative Theorem. It is also the weight difference times the sum of contributions over all edges, given in Eq. 9, plus the sum of contributions over all arcs, given in Eq. 10. Adding the terms for all ordered pairs gives
![]() |
11 |
for the weighted-volume derivative. We can write this more succinctly as DWz(t) = 〈w, t〉, where w is as defined in Weighted-Volume Derivative Theorem. We have βij = 0 unless zizj is an edge of K, which implies that the summation can be limited to the edges in K.
Discontinuities
The weighted-volume derivative, DW, maps a state z to the gradient w = wz of W at z. From the formula in Weighted-Volume Derivative Theorem, we glean that DW is continuous except at states at which either two or more spheres coincide or three or more spheres intersect in a common circle. The set of such states is a (3n − 3)-dimensional subset of ℝ3n, which should be compared with the (3n − 1)-dimensional subset at which the weighted-area derivative is discontinuous. To see that there are only these two types of discontinuities, we note that Aij is everywhere continuous, xij, wij, and Uij are continuous except when zi = zj, and βij and Vij are continuous as long as the facet Pi ∩ Pj ∩ F varies continuously. States at which that facet changes abruptly either have zi = zj or contain at least one sphere, other than Si and Sj, that contains the circle Sij.
Every state z at which DW is discontinuous has the property that every open neighborhood contains states with combinatorially different dual complexes. In other words, when z passes through a discontinuity of DW, then the dual complex passes a moment at which it may undergo a combinatorial change. Note, however, that not every combinatorial change in the dual complex corresponds to a discontinuity in DW.
Implementation
To compute the volume and its derivatives with respect to the coordinates of the center of the balls, we first need the dual complex of the space-filling diagram. For that purpose, we have written a new version of the alpha-shape software, alphavol, specific to molecular simulation applications. alphavol first performs a regular triangulation of the set of centers of the balls by using the algorithm described by Edelsbrunner and Shah (41). Singularities are resolved by using the Simulation of Simplicity (39), whereas numerical imprecisions are avoided by using arbitrary precision arithmetic. The software is kept efficient through the application of a floating-point filter. The dual complex is computed by using the same care for singularities and numerical uncertainties. alphavol includes implementations of the volume and weighted-volume derivative formulas. Computation of the volume and its derivatives of a 250-residue protein by using alphavol requires ≈0.37 sec on a 1,000-Mhz Pentium processor. The 0.37 sec roughly breaks down to 0.11 sec for computing the regular triangulation, 0.03 sec for generating the dual complex, 0.17 sec to compute the weighted volume, and 0.06 sec to compute the weighted-volume derivatives.
The weighted-volume derivative formula, Eq. 4, was tested against the numerically estimated derivatives,
![]() |
12 |
where z3i+j is the jth coordinate of the ith atom center. The average relative difference between analytical and numerical derivatives was computed as a relative root mean square error:
![]() |
13 |
Using δ = 0.0001, we find μ(W) equal on the average to 5.1 × 10−8 and never larger than 9 × 10−8 for a collection of 100 proteins varying in size from 50 residues or ≈400 atoms to 500 residues or ≈4,000 atoms. The good agreement between the analytical and numerical derivatives of the weighted volume shows that our approach and implementation are correct. alphavol is available online at http://biogeometry.duke.edu/software/proshape.
Discussion
The software alphavol, which computes the weighted volume of a union of balls and its derivatives with respect to the coordinates of the centers of the balls, provides a fast, accurate, and robust method for computing the interaction of water with nonpolar atoms of a molecule in a hydration shell model (19). ALPHAVOL implements analytical formulas for computing the weighted volume and its derivatives and explicitly deals with the problem of discontinuities. We have made preliminary steps toward inserting alphavol into the molecular dynamics software encad (42) and gromacs (43), but it is too early to say anything about the corresponding results.
Measuring the volume occupied by individual atoms in macromolecular structures has been the subject of scientific investigation for several decades. In recent years, this interest has increased, because volumes of specific atom types are used in a wide range of implicit solvent models. One of the most widespread uses of atomic volumes in molecular mechanics calculations is in the generalized Born model of electrostatic solvation (14), where the atom volume appears in the calculation of the desolvation of one atom by another atom. In the present implementation of the generalized Born model, ball overlaps are ignored, corrected by scaling factors, or dealt with by using a Gaussian representation of the ball. We expect that the formalism introduced in this article for computing volumes and their derivatives that explicitly and accurately deal with ball overlaps should prove useful for the generalized Born model.
Voronoi polyhedra have been widely used to compute the volume occupied by the atoms of proteins (4, 6–9) and recently DNA (5) in solution. One difficulty in applying the Voronoi-polyhedra approach to macromolecular structures is that the treatment of atoms at the surface requires special attention because their power cells are infinite. Possible solutions to this problem are the construction of a layer of solvent molecules around the macromolecule (44) or limiting the power diagram to the space-filling diagram (29, 33). This article describes the latter approach and extends it by providing exact expression for the derivatives of the atomic volume. These derivatives are expected to be useful for including potential based on packing in molecular mechanics or molecular dynamics calculation.
The mathematical tools used in this paper to express the volume derivative of a space-filling diagram have also been used to compute the weighted surface-area derivatives (R. Bryant, H.E., P.K., and M. Levitt, unpublished data).
Acknowledgments
This work was supported in part by National Science Foundation Grant CCR-00-86013. P.K. acknowledges support from the Department of Energy (DE-F603-95ER62135).
Appendix
In this section we explain how to compute the fractions βij and the weighted-average vectors Vij needed to express the weighted-volume derivative. The descriptions are relevant only in the context of implementing the derivative once the dual complex K of the space-filling diagram is known. We begin by introducing a notion of neighborhood within K: The star of a simplex τ ∈ K is the collection of simplices that contain τ, including τ itself, and the link of τ is the collection of faces of simplices in the star that are disjoint of τ. Writing the face relation between simplices with a smaller equal symbol, we can express these definitions more formally:
![]() |
14 |
![]() |
15 |
For example, the star of an interior edge contains the edge together with a ring of triangles and tetrahedra around the edge. The link is the cycle of vertices and edges that surrounds the ring away from the interior edge; for each triangle in the star it contains the opposite vertex, and for each tetrahedron it contains the opposite edge.
Computing βij.
Recall that Bij is the disk spanned by the circle Sij, which is the intersection of the spheres Si and Sj. Formally, Bij is the set of points x ∈ ℝ3 for which πi(x) = πj(x) ≤ 0. The number we are after is the fraction of that disk that belongs to the power diagram,
![]() |
16 |
Given the dual complex K, we can use inclusion–exclusion to compute βij from simple pieces of the disk that are dual to simplices in the star of the edge zizj. Let B
be the segment of the disk on zk's side of the bisector. More formally, B
is the set of points x ∈ Bij for which πk(x) ≤ πi(x) = πj(x). The general results in ref. 32 imply that βij can be computed by subtracting the segments B
from the disk Bij and adding back their pairwise intersections,
![]() |
17 |
where ρij is the radius of Bij. The first sum ranges over all vertices zk, and the second ranges over all edges zkzl in the link of zizj in K.
Computing Vij.
Recall that Vij is the weighted-average vector from the center zij of the disk Bij to the boundary of the facet that is the portion of the disk belonging to the Voronoi diagram. The weight of each vector is its infinitesimal contribution to the area of the facet, which is constant along each edge and each arc. Hence, Vij can be computed as the weighted sum of the average vectors of the edges xy and the arcs ab of the facet,
![]() |
18 |
where hxy is the distance from zij to the line spanned by the edge xy, and 2ϕab is the angle of the arc ab. The first sums in the numerator and the denominator range over all edges xy, and the second sums range over all arcs ab of the facet. We note that the denominator is twice the area of the facet, which is 2βijAij. We thus focus on the numerator, which we will express as an alternating sum using again inclusion–exclusion over the link of the edge zizj in K. We start with the average vector to the boundary of the entire disk, which is Nij = 0. From this we subtract the contributions of disk segments and we add back contributions of pairs of disk segments.
Case 1 (disk segments).
Let x and y be the points at which the sphere Sk intersects the circle Sij. The disk segment B
is thus bounded by the edge xy and the arc ab connecting a = x with b = y along the circle as illustrated in Fig. 6 Left. We add the contribution of the edge and subtract the contribution of the arc. The former is twice the area of the triangle times the vector to the midpoint, which is 2ϱ
cos ϕabsin ϕabVxy = ϱ
sin(2ϕab)Vxy. Assuming ϕab ≠ ±90°, the average vector of the arc can be computed as Vab = Vxy(tan ϕab)/ϕab. This expression is undefined if ϕab = ±90°, in which case we may go back to Eq. 2 and get Vab = (2/π)(Mab − zij), where Mab is the midpoint of the arc. The weight of this vector is twice the area of the disk sector defined by ab and is given by 2ϱ
ϕab. The accumulated contribution of the disk segment therefore is
![]() |
19 |
where we take the contribution of the arc positive and that of the edge negative, in preparation for subtracting N
in the final sum.
Figure 6.
The weighted average vector N
for a disk segment (Left) and the vector N
for the intersection of a pair of disk segments (Right). Using inclusion–exclusion, we can compute the contribution of the union of the two disk segments as N
+ N
− N
(see Eq. 20).
Case 2 (pairs of disk segments).
Let x, y and u, v be the endpoints of the edges and arcs bounding the disk segments B
and B
, and define B
= B
∩ B
. We only need to consider the case in which the endpoints alternate along the circle. As shown in Fig. 6 Right, the intersection point of the two edges lies inside the disk, and the boundary of B
consists of one arc and two edges that both end at that intersection point. Let N
be the weighted sum of average vectors taking the contribution of the arc positive and those of the edges negative, as before.
We get the weighted average vector over the boundary of the facet Pi ∩ Pj ∩ F as
![]() |
20 |
where Nij = 0, the first sum ranges over all vertices zk, and the second sum ranges over all edges zkzl in the link of zizj in K.
References
- 1.Privalov P L, Gill S G. Adv Protein Chem. 1988;39:191–234. doi: 10.1016/s0065-3233(08)60377-0. [DOI] [PubMed] [Google Scholar]
- 2.Richards F M. Annu Rev Biophys Bioeng. 1977;6:151–176. doi: 10.1146/annurev.bb.06.060177.001055. [DOI] [PubMed] [Google Scholar]
- 3.Morild E. Adv Protein Chem. 1981;34:93–166. doi: 10.1016/s0065-3233(08)60519-7. [DOI] [PubMed] [Google Scholar]
- 4.Tsai J, Taylor R, Chothia C, Gerstein M. J Mol Biol. 1999;290:253–266. doi: 10.1006/jmbi.1999.2829. [DOI] [PubMed] [Google Scholar]
- 5.Nadassy K, Tomás-Oliveira I, Alberts I, Janin J, Wodak S J. Nucleic Acids Res. 2001;29:3362–3376. doi: 10.1093/nar/29.16.3362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tsai J, Gerstein M. Bioinformatics. 2002;18:985–995. doi: 10.1093/bioinformatics/18.7.985. [DOI] [PubMed] [Google Scholar]
- 7.Pontius J, Richelle J, Wodak S J. J Mol Biol. 1996;264:121–136. doi: 10.1006/jmbi.1996.0628. [DOI] [PubMed] [Google Scholar]
- 8.Gerstein M, Sonnhammer E L L, Chothia C. J Mol Biol. 1994;236:1067–1078. doi: 10.1016/0022-2836(94)90012-4. [DOI] [PubMed] [Google Scholar]
- 9.Richards F M. J Mol Biol. 1974;82:1–14. doi: 10.1016/0022-2836(74)90570-1. [DOI] [PubMed] [Google Scholar]
- 10.Rashin A A, Iofin M, Honig B. Biochemistry. 1986;25:3619–3625. doi: 10.1021/bi00360a021. [DOI] [PubMed] [Google Scholar]
- 11.Pettitt B M, Karplus M. Chem Phys Lett. 1985;121:194–201. [Google Scholar]
- 12.Kang Y K, Nemethy G, Scheraga H A. J Phys Chem. 1987;91:4105–4109. [Google Scholar]
- 13.Still W C, Tempczyk A, Hawley R C, Hendrikson T. J Am Chem Soc. 1990;112:6127–6129. [Google Scholar]
- 14.Bashford D, Case D A. Annu Rev Phys Chem. 2000;51:129–152. doi: 10.1146/annurev.physchem.51.1.129. [DOI] [PubMed] [Google Scholar]
- 15.Eisenberg D, McLachlan A. Nature. 1986;319:199–203. doi: 10.1038/319199a0. [DOI] [PubMed] [Google Scholar]
- 16.Ooi T, Oobatake M, Nemethy G, Scheraga H. Proc Natl Acad Sci USA. 1987;84:3086–3090. doi: 10.1073/pnas.84.10.3086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Simonson T, Brünger A T. J Phys Chem. 1994;98:4683–4694. [Google Scholar]
- 18.Lum K, Chandler D, Weeks J D. J Phys Chem B. 1999;103:4570–4577. [Google Scholar]
- 19.Gibson K, Scheraga H. Proc Natl Acad Sci USA. 1967;58:420–427. doi: 10.1073/pnas.58.2.420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lee B, Richards F M. J Mol Biol. 1971;55:379–400. doi: 10.1016/0022-2836(71)90324-x. [DOI] [PubMed] [Google Scholar]
- 21.Rowlinson J S. Mol Phys. 1963;6:517–524. [Google Scholar]
- 22.Pavani R, Ranghino G. Comput Chem. 1982;6:133–135. [Google Scholar]
- 23.Gavezotti A. J Am Chem Soc. 1983;105:5220–5225. [Google Scholar]
- 24.Richmond T J. J Mol Biol. 1984;178:63–89. doi: 10.1016/0022-2836(84)90231-6. [DOI] [PubMed] [Google Scholar]
- 25.Connolly M L. J Am Chem Soc. 1985;107:1118–1124. [Google Scholar]
- 26.Gibson K, Scheraga H. Mol Phys. 1987;62:1247–1265. [Google Scholar]
- 27.Petitjean M. J Comput Chem. 1994;15:507–523. [Google Scholar]
- 28.Voronoi G F. J Reine Angew Math. 1908;134:198–287. [Google Scholar]
- 29.Gellatly B J, Finney J L. J Mol Biol. 1982;161:305–322. doi: 10.1016/0022-2836(82)90155-3. [DOI] [PubMed] [Google Scholar]
- 30.Goede A, Preissner R, Frömmel C. J Comput Chem. 1997;18:1113–1123. [Google Scholar]
- 31.Kratky K W. J Phys A Math Gen. 1978;11:1017–1024. [Google Scholar]
- 32.Edelsbrunner H. Discrete Comput Geom. 1995;13:415–440. [Google Scholar]
- 33.Liang J, Edelsbrunner H, Fu P, Sudhakar P V, Subramaniam S. Proteins Struct Funct Genet. 1998;33:1–29. [PubMed] [Google Scholar]
- 34.Kundrot C E, Ponder J W, Richards F M. J Comput Chem. 1991;12:402–409. [Google Scholar]
- 35.Gogonea V, Osawa E. J Mol Struct THEOCHEM. 1994;311:305–324. [Google Scholar]
- 36.Gogonea V, Osawa E. J Comput Chem. 1995;7:817–842. [Google Scholar]
- 37.Perrot G, Cheng B, Gibson K D, Vila J, Palmer K A, Nayeem A, Maigret B, Scheraga H A. J Comput Chem. 1992;13:1–11. [Google Scholar]
- 38.Wawak R J, Gibson K D, Scheraga H A. J Math Chem. 1994;15:207–232. [Google Scholar]
- 39.Edelsbrunner H, Mücke E P. Am Chem Soc Trans Graphics. 1990;9:66–104. [Google Scholar]
- 40.Csikós B. Discrete Comput Geom. 1998;20:449–461. [Google Scholar]
- 41.Edelsbrunner H, Shah N R. Algorithmica. 1996;15:223–241. [Google Scholar]
- 42.Levitt M, Hirshberg M, Sharon R, Daggett V. Comput Phys Common. 1995;91:215–231. [Google Scholar]
- 43.Lindahl E, Hess B, van der Spoel D. J Mol Mod. 2001;7:306–317. [Google Scholar]
- 44.Finney J L. J Mol Biol. 1975;96:721–732. doi: 10.1016/0022-2836(75)90148-5. [DOI] [PubMed] [Google Scholar]



























