Geometric Measures of Large Biomolecules: Surface, Volume and Pockets

Paul Mach; Patrice Koehl

doi:10.1002/jcc.21884

. Author manuscript; available in PMC: 2012 Nov 15.

Published in final edited form as: J Comput Chem. 2011 Aug 8;32(14):3023–3038. doi: 10.1002/jcc.21884

Geometric Measures of Large Biomolecules: Surface, Volume and Pockets

Paul Mach ¹, Patrice Koehl ^2,^*

PMCID: PMC3188685 NIHMSID: NIHMS308577 PMID: 21823134

Abstract

Geometry plays a major role in our attempt to understand the activity of large molecules. For example, surface area and volume are used to quantify the interactions between these molecules and the water surrounding them in implicit solvent models. In addition, the detection of pockets serves as a starting point for predictive studies of biomolecule-ligand interactions. The alpha shape theory provides an exact and robust method for computing these geometric measures. Several implementations of this theory are currently available. We show however that these implementations fail on very large macromolecular systems. We show that these difficulties are not theoretical; rather, they are related to the architecture of current computers that rely on the use of cache memory to speed up calculation. By rewriting the algorithms that implement the different steps of the alpha shape theory such that we enforce locality, we show that we can remediate these cache problems; the corresponding code, UnionBall has an apparent Inline graphic (n) behavior over a large range of values of n (up to tens of millions), where n is the number of atoms. As an example, it takes 136 seconds with UnionBall to compute the contribution of each atom to the surface area and volume of a viral capsid with more than five million atoms on a commodity PC. UnionBall includes functions for computing the surface area and volume of the intersection of two, three and four spheres that are fully detailed in an appendix. UnionBall is available as an OpenSource software.

Keywords: space-filling diagrams, surface area, volume, pockets, macromolecules

1 Introduction

Cellular functions rest mostly on the activity of two types of large bio-molecules, namely proteins and nucleic acids. A fundamental finding that has shaped over the last forty years of studies of large molecules is that geometry plays a major role in our attempt to understand their activities. This paper emphasizes the former, i.e. the connection between geometry and chemistry. In particular we focus on measuring the shapes of molecules and detecting their cavities and pockets.

Significance of shape

The idea that shape defines function is a general concept from physical chemistry. Molecular structure or shape and chemical reactivity are highly correlated as the latter depends on the positions of the nuclei and electrons within the molecule. Indeed, chemists have long used three-dimensional plastic and metal models to understand the many subtle effects of structure on reactivity and have invested in experimentally determining the structure of important molecules. A common concrete model representing molecular shape is a union of balls, in which each ball corresponds to an atom. Properties of the molecule are then expressed in terms of properties of the union. For example, the putative active sites of an enzyme are detected as cavities and the interaction between a protein and its environment is quantified through the surface area and/or volume of the union of balls [1–4]. The most common use of molecular shape however is found in the quantification of the hydrophobic effect. For this, Lee and Richards introduced the concept of the solvent-accessible surface [5]. They computed the accessible area of each atom in both the folded and extended state of a protein, and found that the decrease in accessible area between the two states is greater for hydrophobic than for hydrophilic atoms. These ideas were refined by Eisenberg and McLachlan [1], who introduced the concept of a solvation free energy for large biomolecules, computed as a weighted sum of the accessible areas of all their atoms i. It is not clear, however, which surface area should be used to compute this solvation energy [6–8]. There is also some evidence that for small solute, the hydrophobic term is not proportional to the surface area [8], but rather to the solvent excluded volume of the molecule [9]. Current models for the non-polar part of the solvent energy include both a surface-based term and a volume-based term [10]. Within this debate on the exact form of the solvation energy, there is however a consensus that it depends on the geometry of the biomolecule under study, more specifically on its volume and surface area. In what follows, we discuss how these geometric measures are usually computed for a union of balls.

Geometric measures of biomolecules

The original approach of Lee and Richards [5] computed the accessible surface area by first cutting the molecule with a set of parallel planes. The intersection of a plane with an atomic ball, if it exists, is a circle which can be partitioned into accessible arcs on the boundary and occluded arcs in the interior of the union. The accessible surface area of atom i is the sum of the contributions of all its accessible arcs, computed approximately as the product of the arc length and the spacing between the plane. This method was originally implemented in the program ACCESS [5]. Shrake and Rupley [11] refined Lee and Richards’ method and proposed a Monte Carlo numerical integration of the accessible surface area. Their method placed 92 points on each atomic sphere, and determined which points were accessible to solvent (not inside any other sphere). Efficient implementations of this method include applications of look-up tables [12], of vectorized algorithm [13] and of parallel algorithms [14]. Similar numerical methods have been developed for computing the volume of a union of balls [15–18].

The surface area and/or volume computed by numerical integration over a set of points, even if closely spaced, is not accurate and cannot be readily differentiated. To improve upon the numerical methods, analytical approximations to the accessible surface area have been developed, which either treat multiple overlapping balls probabilistically [19–21] or ignore them altogether [22, 23]. While these approaches are approximative, they are fast and lead to differentiable geometric measures. In addition, they are well suited for hardware acceleration on graphics processing units [24].

Better analytical methods describe the molecule as a union of pieces of balls, each defined by their center, radius, and arcs forming their boundary, and subsequently apply analytical geometry to compute the surface area and volume [25–29]. For example, Pavani and Ranghino [16] proposed a method for computing the volume of a molecule by inclusion-exclusion. In their implementation, only intersections of up to three balls were considered. Petitjean however noticed that practical situations for proteins frequently involve simultaneous overlaps of up to six balls [28]. Subsequently, Pavani and Ranghino’s idea was generalized to any number of simultaneous overlaps by Gibson and Scheraga [30] and by Petitjean [28], applying a theorem that states that higher-order overlaps can always be reduced to lower-order overlaps [31]. Doing the reduction correctly remains however computationally difficult and expensive. The Alpha Shape Theory solves this problem using Delaunay triangulations and their filtrations, as described by Edelsbrunner [32].

The distinction between approximate and exact computation also applies to existing methods for computing the derivatives of the volume and surface area of a molecule with respect to its atomic coordinates [33–36]. In the case of the derivatives of the surface area, computationally efficient methods were implemented in the MSEED software by Perrot et al. [37] and in the SASAD software by Sridharan et al. [38]. All these methods introduce approximations to deal with singularities caused by numerical errors or by discontinuities in the derivatives [35, 37, 39].

Note that the complexity of the computation of the area and volume of a union of balls, the problems of singularities encountered when computing their derivatives, and the inherent existence of discontinuities have led to the development of alternative geometric representations of molecules. We cite for example the Gaussian description of molecular shape, that allows for easy analytical computation of surface area, volume and derivatives [40, 41], as well as the molecular skin that provides a smooth definition of the surface of a molecule [42].

Detecting pockets and cavities in biomolecular structure

The problem of detecting and measuring internal cavities of biomolecules is very popular as these cavities often serve as leads for drug design as they correspond to putative binding sites for these drugs. Most solutions to this problem rely heavily on geometry. They can be divided into three categories: (i) the grid based methods, (ii) the probe sphere detection and (ii) the analytical methods.

In the grid based method, the molecule is positioned in a 3D Cartesian grid whose vertices are then sorted into two groups: those that are covered by a protein atom and those that are not. The latter are further characterized as being inside a pocket if they satisfy some geometric conditions (such as being inside and at a distance greater than the radius of a water molecule from the convex hull of the biomolecule). The measures of these pockets (volume and surface area) are then computed by Monte Carlo integration over their corresponding grid points. POCKET [43], LIGSITE [44], LigandFit [45], PocketPicker [46], and McVol [18] are cavity-detecting programs that implement the grid-based method.

The probe sphere method proceeds by placing probe spheres that are tangent to the surfaces of two atoms of the biomolecules and then reducing their radii to eliminate overlaps with neighboring atoms; all remaining spheres whose radii exceed a minimal cutoff value (usually 1 Å) are used to define the pockets and cavities. This method was originally implemented in the program SURFNET [47] and later modified in the programs PASS [48] and PHECOM [49]. Interestingly, the grid and probe sphere methods were recently combined in the program POCASA [50].

The alpha shape theory combined with the discrete flow concept was the first analytical method proposed for detecting and measuring inaccessible cavities [4] as well as pockets [51, 52] in biomolecules. It has been extended since to detect channels between inner cavities and the outside [53]. The program CAVE implements a complementary approach in which the boundary of the pockets are directly triangulated, forming the so-called enveloping triangulation [54]

This work

Edelsbrunner and colleagues have developed analytical methods based on the alpha shape theory for computing the metrics of a union of balls, including surface area, volume, their derivatives with respect to the Cartesian coordinates of the centers of the balls and the detection and measurement of pockets [32, 55–57]. These methods have been implemented in different software packages, such as AlphaShape, CASTp [52, 58] and AlphaVol [57]. Most of these softwares however have not been recently updated; in addition, they have been written using generic algorithms that work fine for small molecules but have not been tested on vary large molecular systems (i.e. with more than one million atoms). In this paper, we show that these algorithms lead to inefficient programs for these very large systems. In response, we describe a new efficient implementation of these methods in an open source software package, UnionBall; this new implementation allows us to characterize and quantify the geometry of a molecular system with more than sixteen million atoms is less than eight minutes CPU time on a single processor running at 3.15 GHz. We also propose new geometric derivations of the equations that give the surface areas and volumes of the intersection of two, three and four balls, as well as their derivatives with respect to inter-atomic distances. The paper is organized as follows. The next section provides a brief description of the alpha shape theory and its application to measuring a union of balls. The following section describes our implementation of this theory in the program UnionBall; it includes testing on a set of large virus capsids. Appendix A covers the geometry of the intersections of two, three and four balls while appendix B describes the geometry of a tetrahedron.

2 Measuring Union of Balls

2.1 Surface area and volume of a union of balls

Given a collection P_i of N three dimensional sets, the volume and the surface area of the union of P can be computed using the principle of inclusion-exclusion. That is, the volume and surface area of the union ∪ P can be expressed as an alternating sum of volumes and surface areas of the common intersections of the subsets of P,

M (\cup P) = \sum_{i = 1}^{N} M (P_{i}) - \sum_{i < j} M (P_{i} \cap P_{j}) + \sum_{i < j < k} M (P_{i} \cap P_{j} \cap P_{k}) - \sum_{i < j < k < l} M (P_{i} \cap P_{j} \cap P_{k} \cap P_{l}) + \dots

(1)

where Inline graphic stands for the volume of the union of sets or the area of its boundary . There are two issues that need to be solved to make these equations computationally tractable. Firstly, we need to have a consistent way to reduce significantly the number of terms in the Inclusion-Exclusion formula; brute force application of this formula would lead to an algorithm with exponential running time, as the total number of terms in in formula in 2^N − 1, with each term corresponding to the measure of the intersection of at most N balls. Secondly, we need analytical formula for computing the non-empty intersection of sets. The next two sections overview solutions to these two issues when the sets are 3D balls.

2.1.1 A simplified inclusion-exclusion formula for union of balls

The Alpha Shape Theory provides a method for reducing significantly the number of terms in the inclusion-exclusion formula applied to unions of balls. It is based on the concept of Voronoi decompositions and Delaunay triangulations and their filtrations, as described by Edelsbrunner [32]. Note that the concept of using the Voronoi decomposition and Delaunay triangulation to simplify the inclusion-exclusion formula was originally introduced by Naiman and Wynn [59].

Voronoi decomposition and dual complex

Let us consider a finite set of spheres S_i with centers z_i and radii r_i and let B_i be the ball bounded by S_i. We define the square distance between a point x and a sphere S_i as $π_{i} (x) = {| | x - z_{i} | |}^{2} - r_{i}^{2}$ . This distance definition allows for varying radii for the spheres.

The Voronoi region of S_i consists of all points x at least as close to S_i as to any other sphere: V_i = {x ∈ ℝ³ | π_i(x) ≤ π_j(x)}. The Voronoi region of S_i is a convex polyhedron obtained as the common intersection of finitely many closed half-spaces, one per sphere S_j ≠ = S_i. These half-spaces are defined as follows. If S_i and S_j intersect in a circle then the plane bounding the corresponding half-spaces passes through that circle. The union of all Voronoi regions V_i defines the Voronoi diagram of the union of spheres; this union covers the whole space. The intersection of the Voronoi diagram with the union of balls B_i decomposes this union into convex regions of the form B_i ∩ V_i, as illustrated in figure 1. The boundary of each such region consists of spherical patches on S_i and planar patches on the boundary of V_i. The spherical patches separate the inside from the outside and the planar patches decompose the inside of the union.

Given a finite set of disks, the Voronoi diagram decomposes the plane into regions, one per disk, such that any point in the region assigned to disk *S_i* is closer to that disk than to any other disk, where the distance to *S_i* is defined as ${| | x - z_{i} | |}^{2} - r_{i}^{2}$ . In the drawing, we restrict the Voronoi diagram (dashed lines) to within the portion of the plane covered by the disks (magenta) and get a decomposition of the union into convex regions. The dual Delaunay triangulation is obtained by drawing edges between circle centers of neighboring Voronoi regions. The dual complex is a subset of the Delaunay triangulation, limited to the edges (in blue) and triangles (light green) whose corresponding Voronoi regions intersect within the union of disks.

The Delaunay triangulation is the dual of the Voronoi diagram, obtained by drawing an edge between the centers of S_i and S_j if the two corresponding Voronoi regions share a common face. Furthermore, we draw a triangle connecting z_i, z_j and z_k if V_i, V_j and V_k intersect in a common line segment, and we draw a tetrahedron connecting z_i, z_j, z_k and z_ℓ if V_i, V_j, V_k and V_ℓ meet at a common point. Assuming general position of the spheres, there are no other cases to be considered. We refer to this as the generic case; it is important to mention that it is rare in practice because of limited precision. Nevertheless, it is possible to simulate a perturbation of the union of balls that restores the generic case [60]. This method, referred to as simulation of simplicity, consistently unfold potentially complicated degenerate cases to non-degenerate ones.

Let us limit the construction of the Delaunay triangulation to within the union of balls. In other words, we draw a dual edge between the two vertices z_i and z_j only if B_i ∩ V_i and B_j ∩ V_j share a common face, and similarly for triangles and tetrahedra. The result is a sub-complex of the Delaunay triangulation which we refer to as the dual complex K of the set of spheres.

Area and volume formulas

A simplex τ in the dual complex can be interpreted abstractly as a collection of balls, one ball if it is a vertex, two if it is an edge, etc. In this interpretation, the dual complex is a system of sets of balls. We write vol ∩ τ for the volume of the intersection of the balls in τ. This is exactly the term we would see in an inclusion-exclusion formula for the volume of the union of balls, ∪_i B_i. As proved in [32, 59], the inclusion-exclusion formula that corresponds to the dual complex gives the correct volume of a union of balls, as well as the correct area of its boundary.

We state the corresponding theorems for the case in which the contribution of each ball B_i is weighted by a constant α_i, yielding the weighted volume Inline graphic of the union of balls and weighted area of its boundary. When the coefficients α_i correspond to atomic solvation parameters, these two terms estimate the solvation free energy of the molecule represented by the union of balls. Let τ_i be the simplex corresponding to the ball B_i, τ_ij the simplex formed by the edge between the balls B_i and B_j, τ_ijk the triangle corresponding the the three balls B_i, B_j and B_k, and finally τ_ijkl the tetrahedron defined by the four balls B_i, B_j, B_k and B_l. then:

Weighted Volume Theorem

A_{W} (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} α_{i} A_{i} - \sum_{τ_{i j} \in K} (α_{i} A_{i; j} + α_{j} A_{j; i}) + \sum_{τ_{ijk} \in K} (α_{i} A_{i; j k} + α_{j} A_{j; i k} + α_{k} A_{k; i j}) - \sum_{τ_{ijkl} \in K} (α_{i} A_{i; jkl} + α_{j} A_{j; ikl} + α_{k} A_{k; ijl} + α_{l} A_{l; ijk})

(2)

and

Weighted Volume Theorem

V_{W} (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} α_{i} V_{i} - \sum_{τ_{i j} \in K} (α_{i} V_{i; j} + α_{j} V_{j; i}) + \sum_{τ_{ijk} \in K} (α_{i} V_{i; j k} + α_{j} V_{j; i k} + α_{k} V_{k; i j}) - \sum_{τ_{ijkl} \in K} (α_{i} V_{i; jkl} + α_{j} V_{j; ikl} + α_{k} V_{k; ijl} + α_{l} V_{l; ijk})

(3)

Here Inline graphic is the volume of the ball B_i, is the contribution of B_i to the volume of the intersection of the balls B_i and B_j, etc. Similar definitions are used for the surface areas .

These results are direct extensions of the Area and Volume Theorems derived by Edelsbrunner [32, 57]; they overcome past difficulties by implicitly reducing higher-order to lower-order overlaps. An added advantage of these formulas is that the balls in each term form a unique geometric configuration so that the analytic calculation of the volume can be done without case analysis [32, 57].

As a side note, it is interesting that the dual complex is not the only simplicial complex that leads to a minimal inclusion exclusion formulas: Attali and Edelsbrunner have shown that it is possible to construct a family of such complexes, that are characterized by the independence of their simplices and by geometric realizations with the same underlying space as the dual complex [61].

2.1.2 Angle weighted inclusion-exclusion formula for union of balls

Even though the equations described above are minimal, it is possible to find even shorter expressions for the weighted areas and volumes if non integer coefficients are considered. This is what is referred to as the short inclusion-exclusion method and is described in detail in [32]. In this method, the area and volume are expressed as the sums of the contributions of intersections of at most three balls, with angular coefficients. The corresponding expressions for the weighted areas and weighted volumes are:

Short Weighted Area Formula

A_{W} (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} α_{i} γ_{i} A_{i} - \sum_{τ_{i j} \in K} γ_{i j} (α_{i} A_{i; j} + α_{j} A_{j; i}) + \sum_{τ_{ijk} \in K} γ_{ijk} (α_{i} A_{i; j k} + α_{j} A_{j; i k} + α_{k} A_{k; i j})

(4)

and

Short Weighted Volume Formula

\begin{array}{l} V_{W} (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} γ_{i} α_{i} V_{i} - \sum_{τ_{i j} \in K} γ_{i j} (α_{i} V_{i; j} + α_{j} V_{j; i}) \\ + \sum_{τ_{ijk} \in K} γ_{ijk} (α_{i} V_{i; j k} + α_{j} V_{j; i k} + α_{k} V_{k; i j}) \\ + \sum_{τ_{ijkl} \in K} (α_{i} vol (F_{i}) + α_{j} vol (F_{j}) + α_{k} vol (F_{k}) + α_{l} vol (F_{l})) \end{array}

(5)

where F_i is the fraction of the Voronoi region of S_i delimited by the planes defined by the triangles Δz_iz_jz_k, Δz_iz_jz_l and Δz_iz_kz_l. We show in appendix A how to compute the volume of F_i.

The coefficients γ are the normalized exposed angles of the simplices [57]; they are given by:

γ_{i} = 1 - \sum_{j, k, l ∣ τ_{ijkl} \in K} \frac{Ω_{i}}{4 π}

(6)

γ_{i j} = 1 - \sum_{k, l ∣ τ_{ijkl} \in K} \frac{φ_{i j}}{2 π}

(7)

γ_{ijk} = 1 - \sum_{l ∣ τ_{ijkl} \in K} \frac{1}{2}

(8)

where Ω_i is the solid angle at vertex z_i and φ_ij is the dihedral angle associated with the edge z_iz_j in the tetrahedron defined by z_i, z_j, z_k and z_l. These coefficients can be interpreted as the fraction of solid angle (for a vertex), of dihedral angle (for an edge) or face of triangle that remains accessible in the dual complex. All edges and triangles that are fully buried have zero contribution in equations 4 and 5. In parallel, tetrahedra in the dual complex that are fully buried do not contribute to the area, and only contribute their volume (which is easer to compute than the volume of the intersection of balls) in the volume formula.

Note that in the special case that the weights of all atoms are equal to 1, these equations give the surface area and volume of a union of balls and can be written:

A (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} γ_{i} A_{i} (B_{i}) - \sum_{τ_{i j} \in K} γ_{i j} A (B_{i} \cap B_{j}) + \sum_{τ_{ijk} \in K} γ_{ijk} A (B_{i} \cap B_{j} \cap B_{k})

(9)

and

V (\underset{i}{\cup} B_{i}) = \sum_{τ_{i} \in K} γ_{i} V (B_{i}) - \sum_{τ_{i j} \in K} γ_{i j} V (B_{i} \cap B_{j}) + \sum_{τ_{ijk} \in K} γ_{ijk} V (B_{i} \cap B_{j} \cap B_{k}) + \sum_{τ_{ijkl} \in K} vol (τ_{ijkl})

(10)

2.1.3 Area and volume derivatives

We are interested in the derivatives of the area and the volume of a union of N balls with respect to their positions. We have recently derived expressions for these derivatives with respect to the Cartesian coordinates of the center of the balls [55, 56]. We revisit this problem here and propose new expressions for the derivatives with respect to the distances between the center of these balls; these distances represent internal coordinates for the system that are rotationally invariant.

Derivatives with respect to internal distances

The volume of a union of balls and area of its boundary are fully characterized by the simplified, angle-weighted inclusion-exclusion equations 5 and 4, respectively. In appendix A, we show that all terms included in these two formulas can be expressed as functions of the radii of the balls and the distances between their centers. We compute the derivatives of the volume and area with respect to these distances algebraically. Note that the derivatives with respect to the distance r_ab between the centers z_a and z_b of the two balls B_a and B_b is non zero if and only if the edge z_az_b belongs to the dual complex. We get:

Weighted Area Derivative Theorem

\frac{δ A_{W}}{δ r_{a b}} = \sum_{τ i \in K} α_{i} A_{i} \frac{δ γ_{i}}{δ r_{a b}} - γ_{a b} α_{a} \frac{δ A_{a; b}}{δ r_{a b}} - γ_{a b} α_{b} \frac{δ A_{b; a}}{δ r_{a b}} - \sum_{τ_{i j} \in K} \frac{δ γ_{i j}}{δ r_{a b}} (α_{i} A_{i; j} + α_{j} A_{j; i}) + \sum_{i ∣ τ_{abi} \in K} γ_{abi} (α_{a} \frac{δ A_{a; b i}}{δ r_{a b}} + α_{b} \frac{δ A_{b; a i}}{δ r_{a b}} + α_{i} \frac{δ A_{i; a b}}{δ r_{a b}})

(11)

and

Weighted Volume Derivative Theorem

\begin{array}{l} \frac{δ V_{W}}{δ r_{a b}} = \sum_{τ i \in K} α_{i} V_{i} \frac{δ γ_{i}}{δ r_{a b}} - γ_{a b} α_{a} \frac{δ V_{a; b}}{δ r_{a b}} - γ_{a b} α_{b} \frac{δ V_{b; a}}{δ r_{a b}} - \sum_{τ_{i j} \in K} \frac{δ γ_{i j}}{δ r_{a b}} (α_{i} V_{i; j} + α_{j} V_{j; i}) \\ + \sum_{i ∣ τ_{abi} \in K} γ_{abi} (α_{a} \frac{δ V_{a; b i}}{δ r_{a b}} + α_{b} \frac{δ V_{b; a i}}{δ r_{a b}} + α_{i} \frac{δ V_{i; a b}}{δ r_{a b}}) \\ + \sum_{i, j ∣ τ_{abij} \in K} (α_{a} \frac{δ vol (F_{a})}{δ r_{a b}} + α_{b} \frac{δ vol (F_{b})}{δ r_{a b}} + α_{i} \frac{δ vol (F_{i})}{δ r_{a b}} + α_{j} \frac{δ vol (F_{j})}{δ r_{a b}}) \end{array}

(12)

In the specific case that all weights are equal to 1, the derivative of the volume is:

Volume Derivative Theorem

\frac{δ V}{δ r_{a b}} = \sum_{τ i \in K} V_{i} \frac{δ γ_{i}}{δ r_{a b}} - γ_{a b} \frac{δ V_{a; b}}{δ r_{a b}} - γ_{a b} \frac{δ V_{b; a}}{δ r_{a b}} - \sum_{τ_{i j} \in K} \frac{δ γ_{i j}}{δ r_{a b}} (V_{i; j} + V_{j; i}) + \sum_{i ∣ τ_{abi} \in K} γ_{abi} (\frac{δ V_{a; b i}}{δ r_{a b}} + \frac{δ V_{b; a i}}{δ r_{a b}} + \frac{δ V_{i; a b}}{δ r_{a b}}) + \sum_{i, j ∣ τ_{abij} \in K} \frac{δ vol (τ_{abij})}{δ r_{a b}}

(13)

Note that there are no terms involving the derivatives of γ_abi as those are independent of distances.

Formulas for the derivatives of the different terms Inline graphic , , , and vol (F_i) are straightforward from their analytical expressions (see appendix A). The angular coefficient γ_i of a vertex z_i is computed over all tetrahedra of K that contain i. if z_i is such that it belongs to at least one tetrahedron of K that also contains z_a and z_b, then:

\frac{δ γ_{i}}{δ r_{a b}} = - \frac{1}{4 π} \sum_{j ∣ τ_{ijab} \in K} (\frac{δ φ_{i j}}{δ r_{a b}} + \frac{δ φ_{i a}}{δ r_{a b}} + \frac{δ φ_{i b}}{δ r_{a b}})

In all other cases, $\frac{δ γ_{i}}{δ r_{a b}} = 0$ . Similarly,

\frac{δ γ_{i j}}{δ r_{a b}} = - \frac{1}{2 π} \frac{δ φ_{i j}}{δ r_{a b}}

if τ_ijab ∈ K, and 0 otherwise.

The derivatives of the dihedral angles φ_ij of a tetrahedron with respect to its edge lengths are given in appendix B. Finally, the volume derivative formula (13) also includes the derivatives of the volume of a tetrahedron with respect to its edge lengths, whose expressions are also given in appendix B.

Derivatives with respect to Cartesian coordinates

Once the derivatives with respect to internal coordinates are available, derivatives with respect to Cartesian coordinates are easily computed using the chain rule:

Cartesian Derivative Theorem The gradients a and v ∈ ℝ³ⁿ of the area and volume derivatives are

\begin{array}{l} [\begin{matrix} a_{3 i + 1} \\ a_{3 i + 2} \\ a_{3 i + 3} \end{matrix}] = \sum_{j \neq i} \frac{δ A}{δ r_{i j}} u_{i j} \\ [\begin{matrix} v_{3 i + 1} \\ v_{3 i + 2} \\ v_{3 i + 3} \end{matrix}] = \sum_{j \neq i} \frac{δ V}{δ r_{i j}} u_{i j} \end{array}

where u_ij = (z_i − z_j)/r_ij is the unit vector in the direction of the edge z_iz_j.

2.2 Voids and Pockets

A full description of how to detect and measure pockets in a union of balls based on the alpha shape theory is available in [51]. Briefly, the concept of pockets is ultimately connected to the notion of a continuous flow field defined on the Delaunay triangulation of these balls. Let T be the set of tetrahedra in the Delaunay triangulation and T = T ∪ τ_∞ where τ_∞ is a dummy element representing the complement of the triangulation in Inline graphic . The flow relation ’➢’ with τ ➢ σ is defined by:

τ and σ share a common triangle Δ, and
The interior of τ and the orthogonal center z_τ of τ lie on different sides of the plane defined by Δ.

where the orthogonal center z_τ is the center of the smallest ball that is orthogonal to all four balls whose centers are the vertices of τ.

If τ ➢ σ, τ is a predecessor of σ and σ is a successor of τ. σ ∈ T is a sink if it has no successors; in other words, a tetrahedron is a sink if and only if it contains its orthogonal center. Sinks are important since they are responsible for the formation of voids: if H is a void of the union of balls then at least one tetrahedron in H is a sink.

By definition, pockets consist of the Delaunay tetrahedra that do not belong to the dual complex K and are not ancestors of τ_∞. The only type of pockets without connection to the outside are the voids. All other pockets connect to the outside at one or more places, called mouth. Figure 2 illustrates these concepts on a simple two-dimensional example.

The dual complex of the union of disks is shown in red; all triangles in the Delaunay complex that do not belong to the dual complex are referred to as empty. Acute empty triangles contain their orthocenters: they correspond to sinks. We identify them with large blue dots to mark the position of the orthocenter. The obtuse empty triangles either flow to these acute triangles or to the outside (”infinity”). Triangles III, IV and V (shown in light blue) for example flow to infinity: they do not define pockets. The remaining triangles can be partitioned into two groups: region I is completely surrounded by the union of disks and therefore defines a void, while region II is connected to the outside by one mouth, and is referred to as a pocket.

The surface area and volume of a pocket are easily computed by first identifying their tetrahedra and their faces that belong to the dual complex followed by the application of simplified inclusion-exclusion formula similar to those used for measuring the dual complex (see [57] for details).

3 Algorithm & Implementation

AlphaVol is our original software package that implemented the Alpha Shape theory for measuring biomolecules [57]; its origins lie in the Alpha Shape package [62]. AlphaVol takes as input a set of balls in ℝ³, each specified by the coordinates of its center and the radius. In the case of biomolecules, this set is extracted from the corresponding PDB file using one of several standard sets of van der Waals radii. The computation is performed through four successive tasks:

Step 1. Construct the Delaunay triangulation.
Step 2. Extract the dual complex.
Step 3. Measure the union using inclusion-exclusion.
Step 4. Detecting and measuring the pockets

AlphaVol uses a standard algorithm from computational geometry for each of these tasks: the incremental flipping algorithm from Edelsbrunner and Shah [63] for computing the Delaunay, an algorithm based on the primitives described by Edelsbrunner [64] for computing the dual complex, our own algorithm for implementing the inclusion-exclusion formula [55, 56] and the algorithm of Edelsbrunner, Facello and Liang [51] for computing pockets. We had made a few small modifications to these algorithms as our interests are mostly measuring biomolecules. For example, we only compute the dual complex and not the full filtration of the Delaunay complex.

AlphaVol showed good performances on a set of small to medium-sized proteins [57]; table 1 illustrates however that the algorithms we have implemented fail, or at least become very slow for vary large system. This is especially true for computing the Delaunay (the capsid corresponding to 2dum is ten times larger than the capsid for 1ihm, however it takes more than 100 times longer to compute its Delaunay triangulation which is not in par with an expected O(n log(n)) time complexity) and even more for detecting and measuring pockets. We have written a new version of the AlphaVol software [57] in which the original algorithms have been either modified or fully rewritten to alleviate these severe drawbacks. In the following we describe these modified algorithms and their implementations. The new program is called UnionBall.

Table 1.

CPU times for measuring biomolecules using AlphaVol

Molecules ^a	Number of atoms	Delaunay	Dual complex	Volume	Pockets
1TIM	2288	0.02	0.01	0.01	0.02
GroEL	66136	1.76	0.35	0.80	23.16
1ihm	677040	78.27	9.39	5.87	2810.00
2dum	5214540	8577.00	99.00	59.30	205818.00

Open in a new tab

The four different molecules are: the chicken triose phosphate isomerase (PDB code 1TIM), the GroEL chaperonin (PDB code 1SX3), a Norwalk virus (PDB code 1ihm; we use the fully reconstructed capsid available at the viperdb web site [65], and the full capsid of a human adenovirus, PDB code 2dum, available at viperdb. For each molecule, we compute the accessible surface area, the corresponding volume inaccessible to solvent, and identify and measure all pockets. Calculation are performed on a single Intel processor running at 3.16 GHz, with 6MB of cache memory and 8 GB of RAM. Computing times are reported in seconds.

We compare the original and modified algorithms as implemented in the corresponding programs AlphaVol and UnionBall for all four steps defined above on a dataset of 285 virus capsids varying in size from sixty thousand atoms to sixteen million atoms; the structures for these capsids were downloaded from the web enabled relational database VIPERdb2 [65]. Note that these capsids are highly symmetric (icosahedral) and repetitive; we do not however make use of these symmetries. For all capsids, we compute the accessible surface area, the corresponding solvent-excluded volume, we detect all pockets, and we compute their volume and surface area. Figure 3 illustrates this process on the Sindbis virus.

The Sindbis virus is an RNA virus, member of the alphavirus; it is transmitted by mosquitoes and is responsible for the Sindbis fever, most common is South and East Africa, the Philipines and Australia. The structure of the full virion consists of two protein capsids (the outside capsid made of glycoproteins and the inner nucleocapsid, a lipid bilayer sandwiched between the two capsids, a set of transmembrane domains that cross the lipid bilayer and connect the two capsids, and the single stranded RNA that occupy the cavity inside the nucleocapsid; it was determined by combination of X-ray crystallography on individual proteins of the capsid and cryoelectron microscopy [78]. All images show here are based on the complete structure of the capsids obtained from the VIPERdb2 database (file 1ld4.vdb); note that this file only includes the proteins (capsids and transmembrane domians). A. Surface of the virus: the outer glycoprotein capsid. B. Cross section through the capsid, showing the outer capsid, the inner nucleocapsid and the transmembrane. C. Cross section of the dual complex corresponding to the two capsids and transmembrane domains. The simplices of this complex define all the terms of the inclusion-exclusion formula needed to compute the volume and surface area of the virus D. The two main pockets identified by UnionBall: the central pocket (in green) occupies the whole region in the center; it includes many large tetrahedra that are cut by the cross section. The second largest pocket (shown in pink) between the two capsids corresponds to the region where the lipid bilayer is found.

3.1 Improved Delaunay computations for large molecular systems

Our implementation of the Delaunay triangulation in the original program AlphaVol was based on the randomized incremental algorithm described in [63]. In this algorithm, the triangulation is constructed incrementally, by adding one sphere at a time. Before starting the construction, the spheres are re-indexed such that S₁, S₂, …, S_n is a random permutation of the spheres as they appear in the input file. Four dummy additional spheres with their centers at infinity are added so that all input spheres are contained in the tetrahedron they define. Let D_i be the Delaunay triangulation of the four spheres at infinity together with S₁, S₂, …, S_i. The algorithm proceeds by iterating three steps:

	for i = 1 to n do
1	find tetrahedron τ ∈ D_i₋₁ that contains z_i;
2	add z_i to decompose τ into four tetrahedra;
3	flip locally non-Delaunay triangles attached to z_i
	endfor.

Open in a new tab

The randomization preprocessing in this algorithm guarantees an expected theoretical running time of O(n log(n) + n²) in the worst case [63]. In practice however, a very different behavior is observed for very large dataset as illustrated in figure 4. This is unfortunately a known problem related to memory access on a computer observed for large dataset. Inherent to their nature, randomized algorithms access the data structures they maintain randomly, and random access wroks poorly with memory hierarchies available on modern computers. Virtual memory operating systems cache recently used data in memory, under the assumption that they are more likely to be used again soon. This assumption is violated by randomized algorithms who consequently perform poorly as the data structure exceeds the cache size.

The running times of AlphaVol (with randomization) and UnionBall (without randomization) for computing the regular Delaunay triangulation.

A simple solution is to insert points in an order which improves locality. Amenta, Choi and Rote [66] developed such a scheme while at the same time maintaining enough randomness so that the algorithm remains theoretically optimal. Their Biased Randomized Insertion Order (BRIO) method was shown to significantly improve performance. Later, Liu and Snoeyink [67] proposed a different method for ordering the points based on the space-filling curve. In their methods, all input points are placed into a 3D grids of N³ bins which are then visited in a Hilbert curve order; this method was found to speed up step 1 (point location). Both BRIO and this method require preprocessing of the data that comes with a computational cost.

Interestingly, the order in which data points are stored in a PDB file is inherently local. In most cases, two consecutive atoms either belong to the same amino acids or to two sequential amino acids that are in contact. Breaks occur for missing data and/or between chains in the case of a multimeric structures. These breaks may lead to non locality; they are however the exception and are not expected to play a significant role. We tested the effect of using the locality provided by the input PDB file by simply removing the randomization step in the algorithm described above. As illustrated in figure 4, this resulted in a significant improvement in performance. Removing the randomization leads to an observed linear dependence of the computing time with respect to the number of weighted spheres considered. We implemented this modification in UnionBall. Note that our initial attempts to implement BRIO and the Hilbert curve ordering did not lead to improved performances (data not shown).

3.2 Improved dual complex construction

Changing the Delaunay algorithm leads to a different ordering of the tetrahedra in the geometric data structure that stores the triangulation; this by itself is expected to result in faster construction of the dual complex. There is however another step that can be improved.

Given the Delaunay triangulation D of the input spheres, we construct the dual complex K ⊆ D by labeling the Delaunay simplices. Specifically, for each simplex τ ∈ D there is a threshold α_τ such that τ ∈ K if and only if $α_{τ}^{2} \leq 0$ . To label the Delaunay simplices we therefore need to decide the signs of their square thresholds. This test can be expressed in terms of the signs of the determinants of small matrices whose entries are center coordinates and square radii of the input spheres. Detailed expressions for these tests can be found in [62, 64]. An important ingredient in this context is the treatment of singularities. Inexact versions of the numerical tests are vulnerable to roundoff errors and can lead to wrong output. Following work in computational geometry [68], we implemented these tests using a so-called floating-point filter that first evaluates the tests approximately, using floating-points arithmetic, and if the results cannot be trusted, switches to exact arithmetic. The difficult part in implementing such a filter comes in the definition of ”trust”. Let us consider for example the determinant:

D = | \begin{matrix} x_{i} & y_{i} & z_{i} & x_{i}^{2} + y_{i}^{2} + z_{i}^{2} - r_{i}^{2} \\ x_{j} & y_{j} & z_{j} & x_{j}^{2} + y_{j}^{2} + z_{j}^{2} - r_{j}^{2} \\ x_{k} & y_{k} & z_{k} & x_{k}^{2} + y_{k}^{2} + z_{k}^{2} - r_{k}^{2} \\ x_{l} & y_{l} & z_{l} & x_{k}^{2} + y_{k}^{2} + z_{l}^{2} - r_{l}^{2} \end{matrix} |

that is needed for computing $α_{τ}^{2}$ for the tetrahedron τ defined by the four vertices z_i, z_j, z_k, and z_l. An upper bound on the error in computing D is given by:

ε (D) = C * C_{\max}^{5} * ε_{0}

where C is a constant that depends on the number of terms in the expansion of D, C_max is the maximum absolute value of all coordinates of the four vertices, and ε₀ is the IEEE machine precision equal to 2⁻⁵³. For large molecules, C_max can be quite large (in the order of several thousands) leading to large values for ε(D). This value however is unnecessarily large for the predicates involved in the construction of the dual complex. A tetrahedron belongs to the dual complex if the four spheres it represents have a common intersection. As such, the distance between any two of these centers cannot be larger than the sum of the radii of the two spheres, typically lower than 10 in the case of molecule, i.e. much smaller than the absolute coordinates of the point. We can use this fact by first centering the simplex under consideration on its orthocenter; the corresponding C_max value is consequently much smaller, leading to smaller ε(D) and consequently to a smaller number of switches to exact arithmetics. We have implemented this modification in UnionBall.

Figure 5 compares the computing times required to construct the dual complex for all virus capsids in our dataset by AlphaVol and UnionBall; the improvement is not as drastic as for computing the Delaunay triangulation but still significant.

The running times of AlphaVol and UnionBall (with a new floating point filter) for constructing the dual complex.

Weighted surface areas, volumes, and their derivatives

In UnionBall, we compute the weight surface area and volume of the union of balls using the short weighted formulas given by equations 4 and 5 as well as the formulas for computing the intersections of one, two and three balls given in appendix A. Note that these formulas only depend on the distances between the centers of the balls, and not on their Cartesian coordinates. All the distances can be precomputed, resulting in a significant speedup. Figure 6 compares the computing times required to measure the dual complex for all virus capsids in our dataset by AlphaVol and UnionBall; we do believe that most of the improvement comes from the better ordering of the tetrahedron resulting from the modified algorithm used for computing the Delaunay triangulation.

The running times of AlphaVol and UnionBall for computing the volume and surface area of the dual complex.

The derivatives of the weighted surface area and volume are computed using equations 11, 12, and 14.

Detecting and measuring voids

As shown in table 1, the detection of voids and cavities as implemented in AphaVol is the most inefficient step in characterizing the geometry of a very large union of balls, with a near N² dependence with respect to the number of balls. The corresponding algorithm was originally designed for generic union of balls. It starts from the master list of tetrahedra in the Delaunay complex, stored in the order in which they belong to the alpha complex and proceeds in two steps (see [51] for more details). Firstly, it computes a depth for each tetrahedron which is the index of its largest successor based on the discrete flow relationship. Secondly, pockets are constructed as sets of tetrahedra represented by a union-find data structure. The initial list of pockets is empty. The process then scans the tetrahedra that do not belong to the the dual complex as they appear in the master list; when it reaches the tetrahedron with index j, all tetrahedra with depth j are added to the union-find structure, each as an individual pocket. When a tetrahedron is added however, the algorithm checks its four direct neighbours; if one of these belong to an existing pocket and the face between the two tetrahedra is not in the dual complex, the two corresponding pockets are joined. The algorithm stops when all tetrahedra have been processed. Each set of tetrahedra in the final union-find structure is deemed a pocket, with the exception of the set containing the dummy tetrahedron τ_∞ which represents the outside.

This algorithm is theoretically optimal; in practice however, it suffers from the same problem of lack of locality when accessing data in memory, leading to significant thrashing. To circumvent this problem, we propose a different approach that is geared towards improved locality. Similar to the original approach, we compute the depth of each tetrahedron in the Delaunay complex; this step is fast as it is local by nature. From this knowledge, we can isolate the tetrahedra that flows to the outside. Each tetrahedron in the Delaunay complex is then assigned a flag, visited, initially set to one if it belongs to the dual complex or flows to outside and zero otherwise. The algorithm then proceeds as follows:

  for all tetrahedra σ in del(B) do
    if visited(σ) = 0 then
      Define new pocket P = {σ}; Define L(P) = {σ}; visited(σ) = 1;
      while |L(P)| ≠ = 0 do
        τ = pop(L(P))
        for all φ ∈ N(τ) with visited(φ)=0 do
          let T be the triangle shared by τ and φ
          if T ∉ K then
            P = P∪{φ}; L(P) = L(P)∪{φ}; visited(φ) = 1
          endif
        endfor
      endwhile
    endif
  endfor.

where N(τ) is the list of (up to) four neighbors of the tetrahedron τ. As written, this algorithm detects the pockets in the union of ball; it can easily be extended to compute their volume and surface area (as a tetrahedron is added to a pocket we also compute its contribution to the geometric measures).

The main element of this algorithm is the list L(P): it enforces spatial locality, which to a first approximation matches with locality in the list of tetrahedron, resulting in a much better usage of cache; figure 7 illustrates this improvement.

The running times of AlphaVol and UnionBall for detecting and measuring voids in large biomolecules.

Characterizing the geometry of large biomolecules

UnionBall incorporates all the modifications presented above. With this new program, it takes 18.4 s and 136.5 seconds to fully characterize the two viruses 1ihm and 2dum, respectively; these numbers are significantly better than the corresponding computing times for AlphaVol, namely 2903s for 1ihm and 214553 seconds for 2dum (see table 1).

The plots in Figure 8 shows the running times per point in millisecond for all four steps performed by UnionBall as a function of the virus size. All four steps show nearly constant behavior over the whole range of virus sizes. Constructing the regular triangulation is the slowest step with an average running time of 13 μ-seconds per points. This running time compares favorably with those reported for five popular codes that compute 3D Delaunay tessalations (figures 3 and 4 in Liu and Snoeyink [67]), even after correcting for the difference in processor speed. Among these five codes, Tess3 is the fastest, with an average running time of 20 μ-second per point, which would correspond to 10 μ-second on the processor we have used; it should be mentioned however that Tess3 performs all calculations using floating points, resulting in some topological mistakes in a few rare cases. Filtering the regular triangulation to construct the dual complex, computing the accessible surface area and volume of the virus, and detecting and measuring the pockets require on average 3 μ-seconds, 7 μ-seconds and 5 μ-seconds, respectively.

The running time of the different steps performed by UnionBall. The timings are computed on a single Intel Xeon processor running at 3.16 GHz with 6MB of cache memory and 8 GB of RAM. UnionBall is written in Fortran, except for all calculation in arbitrary precision arithmetics that are performed with the C library libgmp. The program is compiled with the Intel Fortran and C compilers. Lines correspond to the averages of five runs.

4 Conclusion

The alpha shape theory provides an accurate and robust method for computing the geometric measures of a biomolecule. Among these measures, surface area and volume are used to quantify the interactions between such a biomolecule, and the water surrounding it, in implicit solvent models. In addition, the detection of pockets within a biomolecule and the determination of their sizes serve as a starting point for predictive studies of biomolecule-ligand interactions. Several implementations of the alpha shape theory exist, including our own, AlphaVol [57]; these implementations have mostly been tested on globular proteins or medium-size nucleic acids. In the last few years however, spectacular advances in structural biology have produces an abundance of data on large macromolecular complexes, such as the RNA polymerase transcription complexes [69], the ribosome complexes [70, 71], and full size viruses [65] that contain several millions of atoms. Modeling these large systems is as important as modeling smaller proteins or nucleic acids. We have shown in this paper that the standard implementations of the alpha shape theory fail on such large systems, or at least become impractical as their running times are then unrealistically large. We have shown also that these difficulties are not theoretical; rather, they are related to the architecture of current computers that rely on the use of cache memory based on the principle of locality to speed up calculation. By rewriting the algorithms that implement the different steps of the alpha shape theory such that we improve locality, we have shown that we can remediate these cache problems; the corresponding code, UnionBall has an apparent O(n) behavior over a large range of values of n (up to tens of millions), where n is the number of atoms in the macromolecule. The two critical steps for which the largest improvements are observed are the construction of the Delaunay tessalation and the detection and measurements of the pockets.

The key to improving the construction of the Delaunay complex for large sets of points was to recognize that a fully randomized algorithm is impractical as it brakes the locality principle. We have remediated this problem by biasing the order in which the points are inserted, using the simplest possible scheme, i.e. the order in which the points appear in the PDB structure file. Since biomolecules are basically long chains of monomers that are stored sequentially as they appear along the chain, this natural ordering ensures locality. Obviously, this ”trick” is specific to biomolecular structural data and would not apply to generic data sets. The concept of biasing the order of insertion for constructing the Delaunay complex is however more general and has been implemented before [66, 67].

Further improvements might still be possible if we take into account the nature of the data even more. For example, all the viral capsids that were used as a test set in this study have a large empty cavity in their center (see figure 3). In the process of building the Delaunay complex for such a capsid, the algorithm will cover this cavity with a large number of complicated, elongated tetrahedra. It might be possible to generate a more regular tessalation of this cavity by introducing dummy points in the cavity, following the idea of adding points in a tetrahedral mesh to improve its quality [72]. We are currently exploring this idea.

We conclude this paper by mentioning that UnionBall is available as OpenSource software by contacting P.K.

Acknowledgments

This work derives from a long standing collaboration between P.K. and Prof. Herbert Edelsbrunner; we thank him for his mentorship and guidance. P.K. acknowledges current support from the NIH under contract GM080399.

Appendix A: Measuring the intersections of two, three and four balls

Several formulas have been presented for the volume and surface areas of the intersection of two, three and four spheres with unequal radii (see for example [30, 73, 74]). Here we propose new geometric derivations of these formulas that satisfy a specific constraint, namely we need expressions for the intersections that only depend of the radii of the spheres and the distance between their centers.

Notation

We consider up to four balls B_i, B_j, B_k and B_l whose boundaries are the spheres S_i, S_j, S_k and S_l, respectively. Let z_i and r_i be the center and radius of ball B_i and let r_ij be the distance between z_i and z_j. The intersection between the two balls B_i and B_j is the union of two caps Inline graphic and , illustrated in red and blue respectively in figure A.1. These two caps are connected at the level of the plane that separates the Voronoi cells of S_i and S_j; this plane cuts the two spheres in a circle with center y_i;j and radius r_i;j. We also define the height of spherical cap Inline graphic as h_i;j. It can easily be shown that:

h_{i; j} = r_{i} - \frac{r_{i j}}{2} - \frac{r_{i}^{2} - r_{j}^{2}}{2 r_{i j}}

(A.1)

r_{i; j} = \sqrt{2 r_{i} h_{i; j} - h_{i; j}^{2}}

(A.2)

As above, Inline graphic is the surface area of the sphere S_i; , and are the contributions of S_i to the surface areas of the intersections of S_i and S_j, of S_i, S_j and S_k, and of S_i, S_j, S_k and S_l, respectively:

A_{i; j} = area (C_{i; j}) A_{i; j k} = area (C_{i; j} \cap C_{i; k}) A_{i; jkl} = area (C_{i; j} \cap C_{i; k} \cap C_{i; l})

Similar expressions are used for volumes.

Intersection of two balls

Lemma 1

The intersection between two balls is illustrated in figure A.1. We have:

A_{i; j} = 2 π r_{i} h_{i; j}

(A.3)

V_{i; j} = \frac{1}{3} π h_{i; j}^{2} (3 r_{i} - h_{i; j})

(A.4)

with h_i;j defined in equation A.1.

Proof

Computing the volume and surface area of a spherical cap is a standard textbook problem that has been solved in many forms. Proofs for the formula given above can be found for example at the MathWorld web site (http://mathworld.wolfram.com/SphericalCap.html).

Intersection of three balls

The contribution of B_i to the surface area and volume of the intersection of the three balls B_i, B_j and B_k is defined by the intersection of the two caps Inline graphic and , illustrated in red and blue respectively in panel A of figure A.2. The three spheres S_i, S_j and S_k intersect in two points P_ijk and P_ikj. We consider the tetrahedron T₃ formed by the centers of the three balls and P_ijk (see panel B of figure A.2). The faces of T₃ are labeled z_iz_jz_k, z_iz_jP_ijk, z_iz_kP_ijk and z_jz_kP_ijk with areas s_P, s_k, s_j and s_i, respectively. The areas are computed using Heron’s formula (see appendix A). The dihedral angles corresponding to the edges z_iz_j and z_iz_k are denoted as θ_ij;k and θ_ik;j, respectively, while ψ_i is the dihedral angle corresponding to the edge z_iP_ijk.

Figure A2 — A. Intersection of three balls. B. The core tetrahedron T that defines the intersection of the three balls. *z_i*, *z_j* and *z_k* are the centers of the spheres. *P_ijk* is one of the two points common to the three spheres; as such, it is located at distances *r_i*, *r_j* and *r_k* from *z_i*, *z_j* and *z_k*, respectively.

Lemma 2

The contributions of S_i and B_i to the surface area and volume of the triple intersection are given by:

A_{i; j k} = 2 r_{i} h_{i; j} θ_{i j; k} + 2 r_{i} h_{i; k} θ_{i k; j} - 2 r_{i}^{2} (θ_{i j; k} + θ_{i k; j} + ψ_{i} - π)

(A.5)

V_{i; j k} = \frac{1}{3} r_{i} A_{i; j k} - \frac{1}{3} (r_{i} - h_{i; j}) (2 r_{i} h_{i; j} - h_{i; j}^{2}) (θ_{i j; k} - sin (θ_{i j; k}) cos (θ_{i j; k})) - \frac{1}{3} (r_{i} - h_{i; k}) (2 r_{i} h_{i; k} - h_{i; k}^{2}) (θ_{i k; j} - sin (θ_{i k; j}) cos (θ_{i k; j}))

(A.6)

where the dihedral angles are computed from the edge lengths of the tetrahedron T₃ (see appendix B). Formulas for the contributions of B_j and B_k to the intersection are easily deduced by index permutation on these equations.

Proof

We focus on the geometric proofs of equations A.5 and A.6.

Surface area

Let z_i;j and z_i;k be the points of intersection of the sphere S_i bounding the ball B_i with the lines z_iz_j and z_iz_k, respectively; these two points can be seen as ”centers” of the two caps. P_ijk and P_ikj are the two points that are common to all three spheres. These four points form a spherical quadrangle, with spherical angles β_ij;k, β_ik;j, α_ijk and α_ikj (see figure A.3). Note that this quarangle is symmetric with respect to the plane formed by the centers of the three balls which is also the plane passing by the three points z_i, z_i;j and z_i:k. Consequently, α_ijk = α_ikj.

Figure A3 — Intersection of three spheres *B_i*, *B_j* and *B_k* viewed on the flattened surface of *B_i*. Key to our approach is the spherical quadrangle formed by the two points *P_ijk* and *P_ikj* that are common to the three spheres and by the ”centers” of the caps, *z_i;j* and *z_i;k*.

The spherical angle β_ij;k is the dihedral angle between the plane Δz_iz_i;jP_ijk and the plane Δz_iz_i;jP_ikj. Because of the symmetry with respect to the plane containing the three centers, and because z_i;j belongs to the line z_iz_j, we find:

β_{i j; k} = 2 * dihed (Δ z_{i} z_{j} z_{k}, Δ z_{i} z_{j} P_{ijk}) = 2 θ_{i j; k}

Similarly, β_ik;j = 2θ_ik;j and α_ijk = φ_i.

We compute the surface area Q of this spherical quadrangle in two different ways. First, we use the formula for the area of a polygon on a sphere ( $A = R^{2} (\sum_{i = 1}^{n} θ_{i} - (n - 2) π)$ , where R is the radius of the sphere, n the number of vertices in the polygon, and θ_i the internal angle at vertex i):

Q = r_{i}^{2} (2 θ_{i j; k} + 2 θ_{i k; j} + 2 α_{i} - 2 π)

(A.7)

Second, we observe that the area of the quadrangle can be decomposed as:

+ the area A₁ of the sector of the cap that is delimited by the two arcs z_i;jP_ijk and z_i;jP_ikj
+ the area A₂ of the sector of the cap that is delimited by the two arcs z_i;kP_ijk and z_i;kP_ikj,
− the area of the intersection as it appears twice.

Therefore

Q = A_{1} + A_{2} - A_{i; j k}

(A.8)

The surface areas A₁ and A₂ are the fraction of the surface areas of the caps Inline graphic and covered by the angles 2θ_ij and 2θ_ik

\begin{array}{l} A_{1} = 2 r_{i} h_{i; j} θ_{i j; k} \\ A_{2} = 2 r_{i} h_{i; k} θ_{i k; j} \end{array}

(A.9)

where h_i:j and h_i:k are the heights of the two caps.

Combining equations A.7, A.8 and A.9, we validate equation A.5.

Volume

To compute the contribution Inline graphic of ball B_i to the volume of the intersection of the three balls, we consider the sector of B_i that joins its center z_i to the sphere sector whose surface is . The volume V_s of this sector can be computed in two different ways:

First, the volume V_s of a sector is given as r_iA/3, where r_i is the radius of the ball and A is the area of the sector on the surface of the ball:
$V_{s} = \frac{1}{3} r_{i} A_{i; j k}$ (A.10)
Second, the same sector can be divided into three parts (see panel A in figure A.4): two fractions of cones (filled in red and blue), and the region B_i;jk, whose volume is (shown in green):
$V_{s} = vol (F_{i j; k}) + vol (F_{i k; j}) + V_{i; j k}$ (A.11)

The volume of F_ij;k is:
$vol (F_{i j; k}) = \frac{1}{3} (r_{i} - h_{i; j}) {A S}_{i j; k}$ (A.12)

where AS_ij;k is the area of the base of F_ij;k, i.e. the area of the disk of intersection between B_i and B_j covered by the cap C_i;k (see panel B in figure A.4). AS_ij;k is computed as the difference between the area of the disk covered by 2θ_ik and the triangle Δy_i;jP_ijkP_ikj.
${A S}_{i j; k} = r_{i; j}^{2} (θ_{i j; k} - sin θ_{i j; k} cos θ_{i j; k})$ (A.13)

where r_i;j is the radius of the disk (see equation A.2). Note that this formula is valid even if the disk sector covers the disk center. Similar expressions are derived for the volume of F_ik;j.

Figure A4 — A The plane passing through the centers *z_i*, *z_j* and *z_k* of the three balls. *v_i;j* and *v_i;k* are the distances between the center *z_i* and the Voronoi planes separating i and j, and i and k, respectively, while *y_i;j* and *y_i;k* are the points of intersection between the edges *z_iz_j* and *z_iz_k* with these two planes. The contribution *B_i;jk* of *B_i* to the intersection of the three balls is shown in green. The sector joining *z_i* to *B_i;jk* is the key to computing its volume. This sector can be divided into three parts: *B_i;jk* itself, and two fractions of cones *F_ij;k* and *F_ik;j*, filled in blue and red, respectively. B. Projected view on the plane identified with arrows on panel A, i.e. the Voronoi plane between balls *B_i* and *B_j*. The base of the fraction of cone *F_ij;k* is shown filled in blue.

Combining equations A.10 to A.13, we validate equation A.6.

Intersection of four balls

Let B_i, B_j, B_k and B_l be the four balls with a common intersection. Their centers define a tetrahedron T₄ with faces T_i, T_j, T_k and T_l and corresponding areas s_i, s_j, s_k and s_l, respectively, defined such that z_a ∉ T_a for all a = i, j, k, l. We denote the dihedral angle of T₄ between the two faces that share the edge z_iz_i as φ_ij. We also define the solid angle subtented by the vertex z_i as Ω_i. These angles can be computed from the edge lengths of the tetrahedron (see appendix B). Note that:

Ω_{i} = φ_{i j} + φ_{i k} + φ_{i l} - π

Lemma 3

The contribution of B_i to the surface area and volume of the intersection of the four balls is defined by the intersection of the three caps Inline graphic , and :

A_{i; jkl} = \frac{Ω_{i}}{4 π} A_{i} - \frac{φ_{i j}}{2 π} A_{i; j} - \frac{φ_{i k}}{2 π} A_{i; k} - \frac{φ_{i l}}{2 π} A_{i; l} + \frac{1}{2} A_{i; j k} + \frac{1}{2} A_{i; j l} + \frac{1}{2} A_{i; k l}

(A.14)

V_{i; jkl} = \frac{Ω_{i}}{4 π} V_{i} - \frac{φ_{i j}}{2 π} V_{i; j} - \frac{φ_{i k}}{2 π} V_{i; k} - \frac{φ_{i l}}{2 π} V_{i; l} + \frac{1}{2} V_{i; j k} + \frac{1}{2} V_{i; j l} + \frac{1}{2} V_{i; k l} - vol (F_{i})

(A.15)

where

\begin{array}{l} vol (F_{i}) = \frac{1}{6} (r_{i} - h_{i; j}) r_{i; j}^{2} \frac{2 cos θ_{i j; k} cos θ_{i j; l} - ({cos}^{2} θ_{i j; k} + {cos}^{2} θ_{i j; l}) cos φ_{i j}}{sin φ_{i j}} \\ + \frac{1}{6} (r_{i} - h_{i; k}) r_{i; k}^{2} \frac{2 cos θ_{i k; j} cos θ_{i k; l} - ({cos}^{2} θ_{i k; j} + {cos}^{2} θ_{i k; l}) cos φ_{i k}}{sin φ_{i k}} \\ + \frac{1}{6} (r_{i} - h_{i; l}) r_{i; l}^{2} \frac{2 cos θ_{i l; j} cos θ_{i l; k} - ({cos}^{2} θ_{i l; j} + {cos}^{2} θ_{i l; k}) cos φ_{i l}}{sin φ_{i l}} \end{array}

(A.16)

and the angles θ have been defined above for the different intersections of three balls.

Proof

We focus on the geometric proofs of equations A.14 and A.15.

Surface area

Let us consider the spherical triangle ST = {z_i;jz_i:kz_i:l} whose vertices are the intersections of the edges z_iz_j, z_iz_k and z_iz_l with the sphere S_i. The spherical angle at vertex z_i;j is the dihedral angle between the planes defined by z_iz_i;jz_k and z_iz_i;jz_l; it is therefore the dihedral angle between the planes z_iz_jz_k and z_iz_jz_l, i.e. φ_ij. Similarly, the spherical angles at vertices z_i:k and z_i:l are φ_ik and φ_il, respectively. We compute the area A_tot of the spherical triangle ST using two approaches:

Firstly, we use the formula for the area of a spherical polygon:
$A_{tot} = r_{i}^{2} (φ_{i j} + φ_{i k} + φ_{i l} - π) = A_{i} \frac{Ω_{i}}{4 π}$ (A.17)

where is the surface area of sphere S_i, and Ω_i the solid angle subtented by the vertex z_i of T₄.
Secondly, A_tot is computed using an inclusion-exclusion formula: it is the sum of the surface areas A_j, A_k and A_l of sectors of the three spherical caps C_i:j, C_i:k and C_i:l, minus the surface areas A_jk, A_jl and A_kl of the intersections of these sectors, plus the surface area of their triple intersection (see panel B in figure A.5).

The surface areas of the sectors are:
$A_{j} = \frac{φ_{i j}}{2 π} A_{i; j} A_{k} = \frac{φ_{i k}}{2 π} A_{i; k} A_{l} = \frac{φ_{i l}}{2 π} A_{i; l}$ (A.18)

By noticing that the arc circle z_i;az_i:b cuts the intersection between the two caps C_i:a and C_i:b in half for all a ≠ b = j, k, l, we get:
$A_{j k} = \frac{1}{2} A_{i; j k} A_{j l} = \frac{1}{2} A_{i; j l} A_{k l} = \frac{1}{2} A_{i; k l}$ (A.19)

Finally, the surface area of the triple intersection is . Therefore,
$A_{tot} = \frac{φ_{i j}}{2 π} A_{i; j} + \frac{φ_{i k}}{2 π} A_{i; k} + \frac{φ_{i l}}{2 π} A_{i; l} - \frac{1}{2} A_{i; j k} - \frac{1}{2} A_{i; j l} - \frac{1}{2} A_{i; k l} + A_{i; jkl}$ (A.20)

Figure A5 — A Projected view on the flattened surface of *S_i*. Key to our approach is the spherical triangle ST formed by the ”centers” of the caps, *z_i;j z_i:k* and *z_i;l*, corresponding to the points of intersection of *S_i* with the lines *z_iz_j*, *z_iz_k* and *z_iz_l*, respectively. B. The surface area of ST is computed using an inclusion exclusion formula.

Combining the two equations A.17 and A.20, we validate equation A.14.

Volume

Let us now consider the sector of the ball B_i obtained by joining its center z_i to the spherical triangle ST = {z_i;jz_i:kz_i:l} (see panel A of figure A.6). Similar to the computation of the surface area, we compute the volume V_tot of this sector using two approaches:

Firstly, the volume of a ball sector is equal to 1/3 times the radius of the ball times the surface area of the sector:
$V_{tot} = \frac{1}{3} r_{i} A_{tot} = V_{i} \frac{Ω_{i}}{4 π}$ (A.21)
Secondly, we divide the sector into two regions: the region F_i delimited by the tetrahedron T₄ and the three Voronoi planes that separates B_i from B_j, B_k and B_l, and the region G_i delimited by these three planes and the sphere S_i (panel A of figure A.6). The volume of G_i is computed using the same inclusion-exclusion formula that was used for the surface area of ST:
$vol (G_{i}) = \frac{φ_{i j}}{2 π} V_{i; j} + \frac{φ_{i k}}{2 π} V_{i; k} + \frac{φ_{i l}}{2 π} V_{i; l} - \frac{1}{2} V_{i; j k} - \frac{1}{2} V_{i; j l} - \frac{1}{2} V_{i; k l} + V_{i; jkl}$ (A.22)

therefore,
$V_{tot} = \frac{φ_{i j}}{2 π} V_{i; j} + \frac{φ_{i k}}{2 π} V_{i; k} + \frac{φ_{i l}}{2 π} V_{i; l} - \frac{1}{2} V_{i; j k} - \frac{1}{2} V_{i; j l} - \frac{1}{2} V_{i; k l} + V_{i; jkl} + vol (F_{i})$ (A.23)

Combining the two equations A.21 and A.23, we validate equation A.15.

The volume of F_i is computed as the sum of the volumes of the three pyramids with apex z_i and bases on the Voronoi planes relative to B_i (see figure A.6):

vol (F_{i}) = \frac{1}{3} (r_{i} - h_{i; j}) area (A_{1}) + \frac{1}{3} (r_{i} - h_{i; k}) area (A_{2}) + \frac{1}{3} (r_{i} - h_{i; l}) area (A_{3})

(A.24)

The surface area of the base A₁ is computed as the difference between the area of the triangles Δy_i;jD_lD and ΔD_kDP_ijkl (see panel B in figure A.6):

\begin{array}{l} area (A_{1}) = \frac{1}{2} d_{l}^{2} tan φ_{i j} - \frac{1}{2} {(\frac{d_{l}}{cos φ_{i j}} - d_{k})}^{2} \frac{1}{tan φ_{i j}} \\ = \frac{2 d_{k} d_{l} - (d_{k}^{2} + d_{l}^{2}) cos φ_{i j}}{sin φ_{i j}} \end{array}

(A.25)

where d_k = r_i;j cos θ_ij;k and d_l = r_i;j cosθ_ij;l (see figure A.4). Similar equations are derived for the areas of A₂ and A₃. Combining these equations with equation A.24 validates equation A.16.

Note that:

F_{i} + F_{j} + F_{k} + F_{l} = vol (T_{4})

(A.26)

Angle weighted inclusion-exclusion formula for union of balls

Equation A.14 implies that the surface area of the intersection of four spheres is a linear combination of the surface areas of the individual spheres and of the intersections of two and three spheres, where the linear coefficients are related to the six dihedral angles of the tetrahedron formed by the centers of the four spheres. The same relationship exists for the volume of the intersection of four balls, with additional terms for the intersection of the tetrahedron formed by the center of the four balls and their Voronoi cells. Replacing the corresponding equations (A.14 and A.15) in the Weighted Volume and Weighted Area Theorems, we get the simplified inclusion-exclusion formulas 4 and 5.

Appendix B: The geometry of a tetrahedron

Let us consider the tetrahedron T defined by the four vertices P₁, P₂, P₃ and P₄. The four faces of this tetrahedron are T₁ = ΔP₂P₃P₄, T₂ = ΔP₁P₃P₄, T₃ = ΔP₁P₂P₄, and T₄ = ΔP₁P₂P₃ and their areas are s₁, s₂, s₃ and s₄, respectively. We denote the dihedral angle with respect T_i and T_j for i ≠ j = 1, 2, 3, 4 as θ_ij. The edge between P_i and P_j has length l_ij, for i ≠ j = 1, 2, 3, 4.

Surface area and volume

The Cayley-Menger matrix M associated with T is given by:

M = (\begin{matrix} 0 & l_{12}^{2} & l_{13}^{2} & l_{14}^{2} & 1 \\ l_{12}^{2} & 0 & l_{23}^{2} & l_{24}^{2} & 1 \\ l_{13}^{2} & l_{23}^{2} & 0 & l_{34}^{2} & 1 \\ l_{14}^{2} & l_{24}^{2} & l_{34}^{2} & 0 & 1 \\ 1 & 1 & 1 & 1 & 0 \end{matrix}) .

(B.1)

We also define the submatrix M_i,j of M obtained by deleting its i – th row and j – th column.

The volume of the tetrahedron T and the surface areas of its faces can be expressed in terms of the determinants of these matrices:

vol {(T)}^{2} = \frac{1}{288} det (M)

(B.2)

s_{i}^{2} = - \frac{1}{16} det (M_{i, i})

(B.3)

Dihedral angles

The well-known relationship between the volume of a tetrahedron and any of its dihedral angle [75]

V = \frac{2}{3 l_{i j}} s_{i} s_{j} sin (θ_{i j})

(B.4)

cannot be used directly to compute the latter as it does not distinguish if the angle is obtuse or not. We use instead a result referred to as the law of cosine of dihedrals [76, 77]:

cos θ_{i j} = \frac{{(- 1)}^{i + j} det (M_{i j})}{16 s_{i} s_{j}}

(B.5)

for 1 ≤ i < j ≤ 4. Combining these two equations, we get:

cot θ_{i j} = \frac{cos θ_{i j}}{sin θ_{i j}} = \frac{2}{3} \frac{{(- 1)}^{i + j} l_{i j} det (M_{i j})}{V}

(B.6)

Derivatives of the volume of a tetrahedron

Lemma 4

Let T be a non degenerate tetrahedron whose volume is V. The derivative of V with respect to the length of the edge P_aP_b is given by:

\frac{δ V}{δ l_{a b}} = \frac{1}{6} l_{a b}^{2} cot θ_{a b}

(B.7)

Proof

The Cayley-Menger matrix M of a non degenerate tetrahedron T is invertible (if it is not invertible, its determinant is 0 and the volume of the tetrahedron is 0). Let us call M⁻¹ the inverse of M. Using Jacobi’s formula for the differential of a determinant, we get:

\begin{array}{l} \frac{δ det (M)}{δ l_{a b}} = det (M) Tr (M^{- 1} \frac{δ M}{δ l_{a b}}) \\ = 4 l_{a b} det (M) {(M^{- 1})}_{a b} \end{array}

(B.8)

where (M⁻¹)_ab is the element of the matrix M⁻¹ at row a and column b: this element is the co-factor of M corresponding to the positions (a, b), i.e.:

{(M^{- 1})}_{a b} = {(- 1)}^{a + b} \frac{det (M_{a b})}{det (M)} .

(B.9)

Therefore,

\frac{δ det (M)}{δ l_{a b}} = {(- 1)}^{a + b} 4 l_{a b} det (M_{a b})

(B.10)

Then we have:

\begin{array}{l} \frac{δ V}{δ l_{a b}} = \frac{1}{576 V} \frac{δ det (M)}{δ l_{a b}} \\ = {(- 1)}^{a + b} \frac{l_{a b}}{144 V} det (M_{a b}) \end{array}

(B.11)

Combining this equation with the equations B.4 and B.5, we validate equation B.7.

Derivatives of the surface areas of the faces of a tetrahedron

Jacobi’s formula can also be used to compute the derivatives of the surface areas of the faces of a tetrahedron, based on equation B.3. It is easier however to expand the determinant:

s_{i}^{2} = \frac{1}{16} (4 l_{k l}^{2} l_{j l}^{2} - {(l_{k l}^{2} + l_{j l}^{2} - l_{j k}^{2})}^{2})

(B.12)

Then:

\begin{matrix} \frac{δ s_{i}}{δ l_{j k}} = \frac{l_{j k}}{4 s_{i}} (l_{k l}^{2} + l_{j l}^{2} - l_{j k}^{2}) \\ \frac{δ s_{i}}{δ l_{j l}} = \frac{l_{j l}}{4 s_{i}} (l_{k l}^{2} + l_{j k}^{2} - l_{j l}^{2}) \\ \frac{δ s_{i}}{δ l_{k l}} = \frac{l_{k l}}{4 s_{i}} (l_{j k}^{2} + l_{j l}^{2} - l_{k l}^{2}) \end{matrix}

(B.13)

and

\frac{δ s_{i}}{δ l_{a b}} = 0

(B.14)

if a = i or b = i.

Derivatives of the dihedral angles

Deriving equation B.6 with respect to the length l_ab of the edge P_aP_b, we get:

- (1 + {cot}^{2} θ_{i j}) \frac{δ θ_{i j}}{δ l_{a b}} = δ_{i j; a b} \frac{cot θ_{i j}}{l_{i j}} + \frac{2}{3} \frac{{(- 1)}^{i + j} l_{i j}}{V} \frac{δ det (M_{i j})}{δ l_{a b}} - \frac{l_{a b}^{2} cot θ_{i j} cot θ_{a b}}{6 V}

(B.15)

where δ_ij;ab is 1 if the pair (i, j) is equal to the pair (a, b) and equal to 0 otherwise.

All terms in this equation are known except for the derivatives of det (M_ij). While we could use Jacobi’s formula to compute these derivatives, it is easier to expand the determinant:

det (M_{i j}) = 2 r_{i j}^{2} (r_{i k}^{2} + r_{i l}^{2} - r_{k l}^{2}) - (r_{i j}^{2} + r_{i k}^{2} - r_{j k}^{2}) (r_{i j}^{2} + r_{i l}^{2} - r_{j l}^{2})

(B.16)

Its derivatives with respect to each edge length are then straightforward.

Contributor Information

Paul Mach, Email: mach@math.ucdavis.edu, Graduate Group in Applied Mathematics, University of California, Davis, CA 95616.

Patrice Koehl, Email: koehl@cs.ucdavis.edu, Department of Computer Science and Genome Center, University of California, Davis, CA 95616.

References

1.Eisenberg D, McLachlan AD. Nature (London) 1986;319:199–203. doi: 10.1038/319199a0. [DOI] [PubMed] [Google Scholar]
2.Ooi T, Oobatake M, Nemethy G, Scheraga HA. Proc Natl Acad Sci (USA) 1987;84:3086–3090. doi: 10.1073/pnas.84.10.3086. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Liang J, Edelsbrunner H, Fu P, Sudhakar PV, Subramaniam S. Proteins: Struct Func Genet. 1998;33:1–17. [PubMed] [Google Scholar]
4.Liang J, Edelsbrunner H, Fu P, Sudhakar PV, Subramaniam S. Proteins: Struct Func Genet. 1998;33:18–29. [PubMed] [Google Scholar]
5.Lee B, Richards FM. J Mol Biol. 1971;55:379–400. doi: 10.1016/0022-2836(71)90324-x. [DOI] [PubMed] [Google Scholar]
6.Wood RH, Thompson PT. Proc Natl Acad Sci (USA) 1990;87:946–949. doi: 10.1073/pnas.87.3.946. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Tunon I, Silla E, Pascual-Ahuir JL. Protein Eng. 1992;5:715–716. doi: 10.1093/protein/5.8.715. [DOI] [PubMed] [Google Scholar]
8.Simonson T, Brünger AT. J Phys Chem. 1994;98:4683–4694. [Google Scholar]
9.Lum K, Chandler D, Weeks JD. J Phys Chem B. 1999;103:4570–4577. [Google Scholar]
10.Wagoner J, Baker N. Proc Natl Acad Sci (USA) 2006;103:8331–8336. doi: 10.1073/pnas.0600118103. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Shrake A, Rupley JA. J Mol Biol. 1973;79:351–371. doi: 10.1016/0022-2836(73)90011-9. [DOI] [PubMed] [Google Scholar]
12.Legrand SM, Merz KM. J Comp Chem. 1993;14:349–352. [Google Scholar]
13.Wang H, Levinthal C. J Comp Chem. 1991;12:868–871. [Google Scholar]
14.Futamura N, Alura S, Ranjan D, Hariharan B. IEEE Trans Parallel Dist Syst. 2004;13:544–555. [Google Scholar]
15.Rowlinson JS. Mol Phys. 1963;6:517–524. [Google Scholar]
16.Pavani R, Ranghino G. Computers and Chemistry. 1982;6:133–135. [Google Scholar]
17.Gavezzotti A. J Am Chem Soc. 1983;105:5220–5225. [Google Scholar]
18.Till M, Ullmann GM. J Mol Model. 2010;16:419–429. doi: 10.1007/s00894-009-0541-y. [DOI] [PubMed] [Google Scholar]
19.Wodak SJ, Janin J. Proc Natl Acad Sci (USA) 1980;77:1736–1740. doi: 10.1073/pnas.77.4.1736. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Hasel W, Hendrikson TF, Still WC. Tetrahed Comp Method. 1988;1:103–106. [Google Scholar]
21.Cavallo LJK, Fraternali F. Nucl Acids Res. 2003;31:3364–3366. doi: 10.1093/nar/gkg601. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Street AG, Mayo SL. Folding & Design. 1998;3:253–258. doi: 10.1016/S1359-0278(98)00036-4. [DOI] [PubMed] [Google Scholar]
23.Weiser J, Shenkin PS, Still WC. J Comp Chem. 1999;20:217–230. [Google Scholar]
24.Dynerman D, Butzlaff E, Mitchell J. J Comput Biol. 2009;16:523–537. doi: 10.1089/cmb.2008.0157. [DOI] [PubMed] [Google Scholar]
25.Richmond TJ. J Mol Biol. 1984;178:63–89. doi: 10.1016/0022-2836(84)90231-6. [DOI] [PubMed] [Google Scholar]
26.Connolly ML. J Am Chem Soc. 1985;107:1118–1124. [Google Scholar]
27.Dodd LR, Theodorou DN. Mol Phys. 1991;72:1313–45. [Google Scholar]
28.Petitjean M. J Comp Chem. 1994;15:507–523. [Google Scholar]
29.Irisa M. Comp Phys Comm. 1996;98:317–338. [Google Scholar]
30.Gibson KD, Scheraga HA. Mol Phys. 1987;62:1247–1265. [Google Scholar]
31.Kratky KW. J Phys A: Math Gen. 1978;11:1017–1024. [Google Scholar]
32.Edelsbrunner H. Discrete Comput Geom. 1995;13:415–440. [Google Scholar]
33.Kundrot CE, Ponder JW, Richards FM. J Comp Chem. 1991;12:402–409. [Google Scholar]
34.Gogonea V, Osawa E. J Mol Struct (Theochem) 1994;311:305–324. [Google Scholar]
35.Gogonea V, Osawa E. J Comp Chem. 1995;16:817–842. [Google Scholar]
36.Cossi M, Mennucci B, Cammi R. J Comp Chem. 1996;17:57–73. [Google Scholar]
37.Perrot G, Cheng B, Gibson KD, Vila J, Palmer KA, Nayeem A, et al. J Comp Chem. 1992;13:1–11. [Google Scholar]
38.Sridharan S, Nicholls A, Sharp KA. J Comp Chem. 1994;16:1038–1044. [Google Scholar]
39.Wawak RJ, Gibson KD, Scheraga HA. J Math Chem. 1994;15:207–232. [Google Scholar]
40.Grant JA, Pickup BT. J Phys Chem. 1995;99:3503–3510. [Google Scholar]
41.Weiser J, Shenkin PS, Still WC. J Comp Chem. 1999;20:688–703. doi: 10.1002/(SICI)1096-987X(199905)20:7<688::AID-JCC4>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
42.Edelsbrunner H. Discrete Comput Geom. 1999;21:87–115. [Google Scholar]
43.Levitt D, Banaszak L. J Mol Graph. 1992;10:229–234. doi: 10.1016/0263-7855(92)80074-n. [DOI] [PubMed] [Google Scholar]
44.Hendlich M, Rippmann F, Barnickel G. J Mol Graph Model. 1997;15:359–363. doi: 10.1016/s1093-3263(98)00002-3. [DOI] [PubMed] [Google Scholar]
45.Venkatachalam C, Jiang X, Oldfield T, Waldman M. J Mol Graph Model. 2003;21:289–307. doi: 10.1016/s1093-3263(02)00164-x. [DOI] [PubMed] [Google Scholar]
46.Weisel M, Proschak E, Schneider G. Chem Central J. 2007;1:7. doi: 10.1186/1752-153X-1-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Laskowski R. J Mol Graph. 1995;13:323–330. doi: 10.1016/0263-7855(95)00073-9. [DOI] [PubMed] [Google Scholar]
48.Brady G, Stouten P. J Comput Aided Mol Des. 2000;14:383–401. doi: 10.1023/a:1008124202956. [DOI] [PubMed] [Google Scholar]
49.Kawabata T, Go N. Proteins: Struct Func Genet. 2007;68:516–529. doi: 10.1002/prot.21283. [DOI] [PubMed] [Google Scholar]
50.Yu J, Zhou Y, Tanaka I, Yao M. Bioinformatics. 2010;26:46–52. doi: 10.1093/bioinformatics/btp599. [DOI] [PubMed] [Google Scholar]
51.Edelsbrunner H, Facello MA, Liang J. Discrete Appl Math. 1998;88:83–102. [Google Scholar]
52.Liang J, Edelsbrunner H, Woodward C. Prot Sci. 1998;7:1884–1897. doi: 10.1002/pro.5560070905. [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Yaffe E, Fishelovitch D, Wolfson H, Halperin D, Nussinov R. Nucl Acids Res. 2008;36:W210–W215. doi: 10.1093/nar/gkn223. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Busa J, Hayryan S, Hu C-K, Skrivanek J, Wu M-C. J Comp Chem. 2009;30:346–357. doi: 10.1002/jcc.21060. [DOI] [PubMed] [Google Scholar]
55.Edelsbrunner H, Koehl P. Proc Natl Acad Sci (USA) 2003;100:2203–2208. doi: 10.1073/pnas.0537830100. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Bryant R, Edelsbrunner H, Koehl P, Levitt M. Discrete Comput Geom. 2004 [Google Scholar]
57.Edelsbrunner H, Koehl P. Discrete and Computational Geometry (MSRI Publications) 2005;52:243–275. [Google Scholar]
58.Dundas J, Ouyang Z, Tseng J, Binkowski A, Turpaz Y, Liang J. Nucl Acids Res. 2006;34:W116–W118. doi: 10.1093/nar/gkl282. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Naiman D, Wynn H. Annals of Stat. 1992:43–76. [Google Scholar]
60.Edelsbrunner H, Mücke EP. ACM Trans Graphics. 1990;9:66–104. [Google Scholar]
61.Attali D, Edelsbrunner H. Discrete Comput Geom. 2007;37:59–77. [Google Scholar]
62.Edelsbrunner H, Mücke EP. ACM Trans Graphics. 1994;13:43–72. [Google Scholar]
63.Edelsbrunner H, Shah NR. Algorithmica. 1996;15:223–241. [Google Scholar]
64.Edelsbrunner H. Weighted alpha shapes Technical Report UIUC-CS-R-92-1760. Comput. Sci. Dept., Univ. Illinois; Urbana, Illinois: 1992. [Google Scholar]
65.Carrillo-Tripp M, Shephered C, Borelli I, Venkataram S, Lander G, Natarajan P, Johnson J, III, CB, Reddy V. Nucl Acids Res. 2009;37:D436–D442. doi: 10.1093/nar/gkn840. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Amenta N, Choi S, Rote G. Proc. 19th ACM Sympos. Comput. Geom; 2003. pp. 211–219. [Google Scholar]
67.Liu Y, Snoeyink J. Discrete and Computational Geometry (MSRI Publications) 2005;52:439–458. [Google Scholar]
68.Fortune S, VanWyk CJ. ACM Trans Graph. 1996;15:223–248. [Google Scholar]
69.Cramer P, Bushnell DA, Kornberg RD. Science. 2001;292:1863–1876. doi: 10.1126/science.1059493. [DOI] [PubMed] [Google Scholar]
70.Wimberly BT, Brodersen DE, Clemons WM, Jr, Morgan-Warren RJ, Carter AP, Vonrhein C, et al. Nature (London) 2000;407:327–39. doi: 10.1038/35030006. [DOI] [PubMed] [Google Scholar]
71.Ban N, Nissen P, Hansen J, Moore PB, Steitz TA. Science. 2002;289:905–20. doi: 10.1126/science.289.5481.905. [DOI] [PubMed] [Google Scholar]
72.Shewchuk J. Proc. 14th Ann. Sympos. Comput. Geom; 1998. pp. 86–95. [Google Scholar]
73.Gibson KD, Scheraga HA. Mol Phys. 1988;64:641–644. [Google Scholar]
74.Edelsbrunner H, Fu P. Measuring space filling diagrams and voids Technical Report UIUC-BI-MB-94-01. Beckman Inst., Univ. Illinois; Urbana, Illinois: 1994. [Google Scholar]
75.Lee J. J Korea Soc Math Educ Ser B: Pure Appl Math. 1997;4:1–6. [Google Scholar]
76.Yang L, Zhang J. Metric equations in geometry and their applications Technical Report IC/89/281. International Center for Theoretical Physics; Trieste, Italy: 1989. [Google Scholar]
77.Yang L, Zeng Z. In: Proc ADG2006, LNAI. Botana F, Recio T, editors. Vol. 4869. 2007. pp. 203–211. [Google Scholar]
78.Zhang W, Mukhopadhyay S, Pletnev S, Baker T, Kuhn R, Rossmann M. J Virol. 2002;76:11645–11658. doi: 10.1128/JVI.76.22.11645-11658.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.Eisenberg D, McLachlan AD. Nature (London) 1986;319:199–203. doi: 10.1038/319199a0. [DOI] [PubMed] [Google Scholar]

[R2] 2.Ooi T, Oobatake M, Nemethy G, Scheraga HA. Proc Natl Acad Sci (USA) 1987;84:3086–3090. doi: 10.1073/pnas.84.10.3086. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Liang J, Edelsbrunner H, Fu P, Sudhakar PV, Subramaniam S. Proteins: Struct Func Genet. 1998;33:1–17. [PubMed] [Google Scholar]

[R4] 4.Liang J, Edelsbrunner H, Fu P, Sudhakar PV, Subramaniam S. Proteins: Struct Func Genet. 1998;33:18–29. [PubMed] [Google Scholar]

[R5] 5.Lee B, Richards FM. J Mol Biol. 1971;55:379–400. doi: 10.1016/0022-2836(71)90324-x. [DOI] [PubMed] [Google Scholar]

[R6] 6.Wood RH, Thompson PT. Proc Natl Acad Sci (USA) 1990;87:946–949. doi: 10.1073/pnas.87.3.946. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Tunon I, Silla E, Pascual-Ahuir JL. Protein Eng. 1992;5:715–716. doi: 10.1093/protein/5.8.715. [DOI] [PubMed] [Google Scholar]

[R8] 8.Simonson T, Brünger AT. J Phys Chem. 1994;98:4683–4694. [Google Scholar]

[R9] 9.Lum K, Chandler D, Weeks JD. J Phys Chem B. 1999;103:4570–4577. [Google Scholar]

[R10] 10.Wagoner J, Baker N. Proc Natl Acad Sci (USA) 2006;103:8331–8336. doi: 10.1073/pnas.0600118103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Shrake A, Rupley JA. J Mol Biol. 1973;79:351–371. doi: 10.1016/0022-2836(73)90011-9. [DOI] [PubMed] [Google Scholar]

[R12] 12.Legrand SM, Merz KM. J Comp Chem. 1993;14:349–352. [Google Scholar]

[R13] 13.Wang H, Levinthal C. J Comp Chem. 1991;12:868–871. [Google Scholar]

[R14] 14.Futamura N, Alura S, Ranjan D, Hariharan B. IEEE Trans Parallel Dist Syst. 2004;13:544–555. [Google Scholar]

[R15] 15.Rowlinson JS. Mol Phys. 1963;6:517–524. [Google Scholar]

[R16] 16.Pavani R, Ranghino G. Computers and Chemistry. 1982;6:133–135. [Google Scholar]

[R17] 17.Gavezzotti A. J Am Chem Soc. 1983;105:5220–5225. [Google Scholar]

[R18] 18.Till M, Ullmann GM. J Mol Model. 2010;16:419–429. doi: 10.1007/s00894-009-0541-y. [DOI] [PubMed] [Google Scholar]

[R19] 19.Wodak SJ, Janin J. Proc Natl Acad Sci (USA) 1980;77:1736–1740. doi: 10.1073/pnas.77.4.1736. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Hasel W, Hendrikson TF, Still WC. Tetrahed Comp Method. 1988;1:103–106. [Google Scholar]

[R21] 21.Cavallo LJK, Fraternali F. Nucl Acids Res. 2003;31:3364–3366. doi: 10.1093/nar/gkg601. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Street AG, Mayo SL. Folding & Design. 1998;3:253–258. doi: 10.1016/S1359-0278(98)00036-4. [DOI] [PubMed] [Google Scholar]

[R23] 23.Weiser J, Shenkin PS, Still WC. J Comp Chem. 1999;20:217–230. [Google Scholar]

[R24] 24.Dynerman D, Butzlaff E, Mitchell J. J Comput Biol. 2009;16:523–537. doi: 10.1089/cmb.2008.0157. [DOI] [PubMed] [Google Scholar]

[R25] 25.Richmond TJ. J Mol Biol. 1984;178:63–89. doi: 10.1016/0022-2836(84)90231-6. [DOI] [PubMed] [Google Scholar]

[R26] 26.Connolly ML. J Am Chem Soc. 1985;107:1118–1124. [Google Scholar]

[R27] 27.Dodd LR, Theodorou DN. Mol Phys. 1991;72:1313–45. [Google Scholar]

[R28] 28.Petitjean M. J Comp Chem. 1994;15:507–523. [Google Scholar]

[R29] 29.Irisa M. Comp Phys Comm. 1996;98:317–338. [Google Scholar]

[R30] 30.Gibson KD, Scheraga HA. Mol Phys. 1987;62:1247–1265. [Google Scholar]

[R31] 31.Kratky KW. J Phys A: Math Gen. 1978;11:1017–1024. [Google Scholar]

[R32] 32.Edelsbrunner H. Discrete Comput Geom. 1995;13:415–440. [Google Scholar]

[R33] 33.Kundrot CE, Ponder JW, Richards FM. J Comp Chem. 1991;12:402–409. [Google Scholar]

[R34] 34.Gogonea V, Osawa E. J Mol Struct (Theochem) 1994;311:305–324. [Google Scholar]

[R35] 35.Gogonea V, Osawa E. J Comp Chem. 1995;16:817–842. [Google Scholar]

[R36] 36.Cossi M, Mennucci B, Cammi R. J Comp Chem. 1996;17:57–73. [Google Scholar]

[R37] 37.Perrot G, Cheng B, Gibson KD, Vila J, Palmer KA, Nayeem A, et al. J Comp Chem. 1992;13:1–11. [Google Scholar]

[R38] 38.Sridharan S, Nicholls A, Sharp KA. J Comp Chem. 1994;16:1038–1044. [Google Scholar]

[R39] 39.Wawak RJ, Gibson KD, Scheraga HA. J Math Chem. 1994;15:207–232. [Google Scholar]

[R40] 40.Grant JA, Pickup BT. J Phys Chem. 1995;99:3503–3510. [Google Scholar]

[R41] 41.Weiser J, Shenkin PS, Still WC. J Comp Chem. 1999;20:688–703. doi: 10.1002/(SICI)1096-987X(199905)20:7<688::AID-JCC4>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]

[R42] 42.Edelsbrunner H. Discrete Comput Geom. 1999;21:87–115. [Google Scholar]

[R43] 43.Levitt D, Banaszak L. J Mol Graph. 1992;10:229–234. doi: 10.1016/0263-7855(92)80074-n. [DOI] [PubMed] [Google Scholar]

[R44] 44.Hendlich M, Rippmann F, Barnickel G. J Mol Graph Model. 1997;15:359–363. doi: 10.1016/s1093-3263(98)00002-3. [DOI] [PubMed] [Google Scholar]

[R45] 45.Venkatachalam C, Jiang X, Oldfield T, Waldman M. J Mol Graph Model. 2003;21:289–307. doi: 10.1016/s1093-3263(02)00164-x. [DOI] [PubMed] [Google Scholar]

[R46] 46.Weisel M, Proschak E, Schneider G. Chem Central J. 2007;1:7. doi: 10.1186/1752-153X-1-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] 47.Laskowski R. J Mol Graph. 1995;13:323–330. doi: 10.1016/0263-7855(95)00073-9. [DOI] [PubMed] [Google Scholar]

[R48] 48.Brady G, Stouten P. J Comput Aided Mol Des. 2000;14:383–401. doi: 10.1023/a:1008124202956. [DOI] [PubMed] [Google Scholar]

[R49] 49.Kawabata T, Go N. Proteins: Struct Func Genet. 2007;68:516–529. doi: 10.1002/prot.21283. [DOI] [PubMed] [Google Scholar]

[R50] 50.Yu J, Zhou Y, Tanaka I, Yao M. Bioinformatics. 2010;26:46–52. doi: 10.1093/bioinformatics/btp599. [DOI] [PubMed] [Google Scholar]

[R51] 51.Edelsbrunner H, Facello MA, Liang J. Discrete Appl Math. 1998;88:83–102. [Google Scholar]

[R52] 52.Liang J, Edelsbrunner H, Woodward C. Prot Sci. 1998;7:1884–1897. doi: 10.1002/pro.5560070905. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] 53.Yaffe E, Fishelovitch D, Wolfson H, Halperin D, Nussinov R. Nucl Acids Res. 2008;36:W210–W215. doi: 10.1093/nar/gkn223. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] 54.Busa J, Hayryan S, Hu C-K, Skrivanek J, Wu M-C. J Comp Chem. 2009;30:346–357. doi: 10.1002/jcc.21060. [DOI] [PubMed] [Google Scholar]

[R55] 55.Edelsbrunner H, Koehl P. Proc Natl Acad Sci (USA) 2003;100:2203–2208. doi: 10.1073/pnas.0537830100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R56] 56.Bryant R, Edelsbrunner H, Koehl P, Levitt M. Discrete Comput Geom. 2004 [Google Scholar]

[R57] 57.Edelsbrunner H, Koehl P. Discrete and Computational Geometry (MSRI Publications) 2005;52:243–275. [Google Scholar]

[R58] 58.Dundas J, Ouyang Z, Tseng J, Binkowski A, Turpaz Y, Liang J. Nucl Acids Res. 2006;34:W116–W118. doi: 10.1093/nar/gkl282. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R59] 59.Naiman D, Wynn H. Annals of Stat. 1992:43–76. [Google Scholar]

[R60] 60.Edelsbrunner H, Mücke EP. ACM Trans Graphics. 1990;9:66–104. [Google Scholar]

[R61] 61.Attali D, Edelsbrunner H. Discrete Comput Geom. 2007;37:59–77. [Google Scholar]

[R62] 62.Edelsbrunner H, Mücke EP. ACM Trans Graphics. 1994;13:43–72. [Google Scholar]

[R63] 63.Edelsbrunner H, Shah NR. Algorithmica. 1996;15:223–241. [Google Scholar]

[R64] 64.Edelsbrunner H. Weighted alpha shapes Technical Report UIUC-CS-R-92-1760. Comput. Sci. Dept., Univ. Illinois; Urbana, Illinois: 1992. [Google Scholar]

[R65] 65.Carrillo-Tripp M, Shephered C, Borelli I, Venkataram S, Lander G, Natarajan P, Johnson J, III, CB, Reddy V. Nucl Acids Res. 2009;37:D436–D442. doi: 10.1093/nar/gkn840. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R66] 66.Amenta N, Choi S, Rote G. Proc. 19th ACM Sympos. Comput. Geom; 2003. pp. 211–219. [Google Scholar]

[R67] 67.Liu Y, Snoeyink J. Discrete and Computational Geometry (MSRI Publications) 2005;52:439–458. [Google Scholar]

[R68] 68.Fortune S, VanWyk CJ. ACM Trans Graph. 1996;15:223–248. [Google Scholar]

[R69] 69.Cramer P, Bushnell DA, Kornberg RD. Science. 2001;292:1863–1876. doi: 10.1126/science.1059493. [DOI] [PubMed] [Google Scholar]

[R70] 70.Wimberly BT, Brodersen DE, Clemons WM, Jr, Morgan-Warren RJ, Carter AP, Vonrhein C, et al. Nature (London) 2000;407:327–39. doi: 10.1038/35030006. [DOI] [PubMed] [Google Scholar]

[R71] 71.Ban N, Nissen P, Hansen J, Moore PB, Steitz TA. Science. 2002;289:905–20. doi: 10.1126/science.289.5481.905. [DOI] [PubMed] [Google Scholar]

[R72] 72.Shewchuk J. Proc. 14th Ann. Sympos. Comput. Geom; 1998. pp. 86–95. [Google Scholar]

[R73] 73.Gibson KD, Scheraga HA. Mol Phys. 1988;64:641–644. [Google Scholar]

[R74] 74.Edelsbrunner H, Fu P. Measuring space filling diagrams and voids Technical Report UIUC-BI-MB-94-01. Beckman Inst., Univ. Illinois; Urbana, Illinois: 1994. [Google Scholar]

[R75] 75.Lee J. J Korea Soc Math Educ Ser B: Pure Appl Math. 1997;4:1–6. [Google Scholar]

[R76] 76.Yang L, Zhang J. Metric equations in geometry and their applications Technical Report IC/89/281. International Center for Theoretical Physics; Trieste, Italy: 1989. [Google Scholar]

[R77] 77.Yang L, Zeng Z. In: Proc ADG2006, LNAI. Botana F, Recio T, editors. Vol. 4869. 2007. pp. 203–211. [Google Scholar]

[R78] 78.Zhang W, Mukhopadhyay S, Pletnev S, Baker T, Kuhn R, Rossmann M. J Virol. 2002;76:11645–11658. doi: 10.1128/JVI.76.22.11645-11658.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Geometric Measures of Large Biomolecules: Surface, Volume and Pockets

Paul Mach

Patrice Koehl

Abstract

1 Introduction

Significance of shape

Geometric measures of biomolecules

Detecting pockets and cavities in biomolecular structure

This work

2 Measuring Union of Balls

2.1 Surface area and volume of a union of balls

2.1.1 A simplified inclusion-exclusion formula for union of balls

Voronoi decomposition and dual complex

Figure 1. Voronoi decomposition and dual complex.

Area and volume formulas

2.1.2 Angle weighted inclusion-exclusion formula for union of balls

2.1.3 Area and volume derivatives

Derivatives with respect to internal distances

Derivatives with respect to Cartesian coordinates

2.2 Voids and Pockets

Figure 2. Illustration of the discrete flow and pockets in a union of disks.

3 Algorithm & Implementation

Table 1.

Figure 3. Caracterizing the geometry of the Sindbis virus.

3.1 Improved Delaunay computations for large molecular systems

Figure 4.

3.2 Improved dual complex construction

Figure 5.

Weighted surface areas, volumes, and their derivatives

Figure 6.

Detecting and measuring voids

Figure 7.

Characterizing the geometry of large biomolecules

Figure 8.

4 Conclusion

Acknowledgments

Appendix A: Measuring the intersections of two, three and four balls

Notation

Figure A1.

Intersection of two balls

Lemma 1

Proof

Intersection of three balls

Figure A2.

Lemma 2

Proof

Surface area

Figure A3.

Volume

Figure A4. Computing the volume of the intersection of 3 balls.

Intersection of four balls

Lemma 3

Proof

Surface area

Figure A5. Computing the surface area of the intersection of four spheres.

Volume

Figure A6.

Angle weighted inclusion-exclusion formula for union of balls

Appendix B: The geometry of a tetrahedron

Surface area and volume

Dihedral angles

Derivatives of the volume of a tetrahedron

Lemma 4

Proof

Derivatives of the surface areas of the faces of a tetrahedron

Derivatives of the dihedral angles

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases