Abstract
The total free energy of a molecule includes the classical molecular mechanical energy (understood as the free energy in vacuum) and the solvation energy, which is caused by the change of the environment of the molecule (solute) from vacuum to solvent. The solvation energy is important in the study of inter-molecular interactions. In this paper we develop a fast surface-based generalized Born method to compute the electrostatic solvation energy along with the energy derivatives needed for the solvation forces. The most time-consuming computation is the evaluation of surface integrals over an algebraic spline molecular surface (ASMS), and the speedup is achieved by using the nonequispaced fast Fourier transform (NFFT) algorithm. The main results of this paper involve (a) an efficient sampling of quadrature points over the molecular surface using nonlinear patches, (b) fast, nearly linear time estimation of the energy and inter-molecular forces, (c) error analysis, and (d) an efficient implementation combining fast pairwise summation and continuum integration over nonlinear patches.
Keywords: generalized Born, molecular surface, fast summation, error analysis
1. Introduction
Most protein molecules live in an aqueous solvent environment, and their stability depends largely on their configuration and on the solvent. Since the solvation energy term models the interaction between a molecule and the solvent, the computation of the molecular solvation energy (also known as the molecule-solvent interaction energy) is a key issue in molecular dynamics (MD) simulations, as well as in determining inter-molecular binding affinities “in-vivo” for drug screening. Molecular dynamics simulations in which the solvent molecules are explicitly represented at atomic resolution, for example as in the popular package NAMD [1], provide direct information about the important influence of solvation. However, since the solvent atoms far outnumber the solute atoms, a large fraction of the time is spent on computing the trajectories of the solvent molecules, even though the primary focus of the simulation is the configuration and energetics of the solute molecule. Implicit solvent models attempt to considerably lower the cost of computation through a continuum representation (mean-field approximation) of the solvent [2]. In the implicit model, the solvation free energy Gsol, which is the free energy change to transfer a molecule from vacuum to solvent, consists of three components: the energy to form a cavity in the solvent (also known as the hydrophobic interaction), the van der Waals interaction between the molecule and the solvent, and the electrostatic potential energy between the molecule and the solvent (also known as the polarization energy), Gsol = Gcav + Gvdw + Gpol. Based on the Weeks-Chandler-Andersen (WCA) perturbation theory [3, 4], the non-polar solvation energies split into repulsive and attractive parts, Gcav + Gvdw = G(rep) + G(att). In [5], G(rep) is described as a weighted sum of the solvent-accessible surface areas Ai of the atoms. In [31], a volume term pV is added, where p is the solvent pressure parameter and V is the solvent-accessible volume. In [6], the attractive van der Waals dispersion energy G(att) is written as an integral over the solvent region of the dispersive interaction u_i(att)(xi, y) between atom i in the solute and the volume of solvent at y, weighted by a solvent density distribution function θ(y), where ρ0 denotes the bulk solvent density. Hence the non-polar solvation energy is
(1.1)   Gcav + Gvdw = G(rep) + G(att) = Σi γi Ai + pV + ρ0 Σi ∫ u_i(att)(xi, y) θ(y) dy
The electrostatic solvation energy is caused by the induced polarization in the solvent when the molecule is dissolved in the solvent, therefore
(1.2)   Gpol = (1/2) ∫ ρ(r) φreaction(r) dr
where φreaction = φsolvent − φgas-phase, φ(r) and ρ(r) are the electrostatic potential and the charge density at r, respectively.
The Poisson-Boltzmann (PB) model was developed to compute the electrostatic solvation energy by solving the equation −∇·(ε(x)∇φ(x)) = ρ(x) for the electrostatic potential φ. Numerical methods to solve the equation include the finite difference method [7, 8], the finite element method [9, 10], and the boundary element method [11]. However, the PB methods are computationally prohibitive for large molecules such as proteins. As an alternative, (1.2) is approximated by a generalized Born (GB) model, which has the form of a discrete sum [12]
(1.3)   Gpol = −(1/2)(1/εp − 1/εw) Σ_{i,j} qi qj / f_GB
where f_GB = [rij² + Ri Rj exp(−rij²/(4 Ri Rj))]^{1/2}, εp and εw are the solute (low) and solvent (high) dielectric constants, qi and Ri are the charge and effective Born radius of atom i, respectively, and rij is the distance between atoms i and j. The solvation force acting on atom α, which is part of the forces driving the dynamics, is computed as
(1.4)   Fα = −∂Gsol/∂xα
Because the GB calculation is much faster than solving the PB equation, the GB model is widely used in MD simulations. Programs which implement GB methods include CHARMM [13], Amber [14], Tinker [15], and Impact, which is now part of Schrodinger, Inc.’s FirstDiscovery program suite. Even though the GB computation is much faster than the PB model, the computation of the Born radius Ri is still slow. During an MD simulation, the Born radii need to be recomputed frequently at different time steps. Because this part of the computation is so time-consuming, there are attempts to accelerate the MD simulation by recomputing the Born radii at a larger time step. For example, in [16], in a test of a 3 ns GB simulation of a 10-base pair DNA duplex, the time step for computing the Born radii and the long-range electrostatic energy is changed from 1 fs to 2 fs. This reduces the time of carrying out the simulation from 13.84 hours to 7.16 hours. From this example we can see that the calculation of the Born radii takes a large percentage of the total computation time in an MD simulation. In long dynamic runs, however, this decrease in the frequency of evaluating the effective Born radii is not accurate enough to conserve energy, which restricts MD simulations of the protein folding process to small time scales [17]. Hence it is important to calculate the Born radii and the solvation energy both accurately and efficiently.
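To make the cost structure of (1.3) concrete, here is a minimal Python sketch of the pairwise GB sum with the Still et al. form of f_GB, assuming the effective Born radii are already available; the function name, the example charges, coordinates, radii, and the dielectric defaults are illustrative assumptions, not values or code from this paper.

```python
import numpy as np

def gb_polar_energy(q, x, R, eps_p=1.0, eps_w=80.0):
    """Pairwise generalized Born energy (1.3) with the Still et al. f_GB,
    given precomputed effective Born radii R. Plain O(M^2) double sum."""
    M = len(q)
    tau = 0.5 * (1.0 / eps_p - 1.0 / eps_w)
    G = 0.0
    for i in range(M):
        for j in range(M):
            r2 = np.sum((x[i] - x[j]) ** 2)
            f_gb = np.sqrt(r2 + R[i] * R[j] * np.exp(-r2 / (4.0 * R[i] * R[j])))
            G -= tau * q[i] * q[j] / f_gb          # i == j gives the Born self-energy term
    return G

# usage with three hypothetical charges (positions in Angstrom, partial charges in e)
q = np.array([0.4, -0.4, 0.1])
x = np.array([[0.0, 0.0, 0.0], [1.5, 0.0, 0.0], [0.0, 1.2, 0.0]])
R = np.array([1.6, 1.7, 1.5])                      # effective Born radii, hypothetical values
print(gb_polar_energy(q, x, R))
```

The double loop is the O(M²) part that the FMM evaluation mentioned in Section 3.1 accelerates; the expensive step addressed in this paper is producing the radii R themselves.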
In this paper we develop a method for fast computation of the GB solvation energy, along with the energy derivatives for the solvation forces, based on a discrete and continuum model of the molecules (Figure 1.1). An efficient method of sampling quadrature points on the nonlinear patches is given. We also show that the error of the Born radius calculation is controlled by the size of the triangulation mesh and the regularity of the periodic function used in the fast summation algorithm. The time complexity of the force computation is reduced from the original O(MN + M²) to nearly linear time O(N + M + n³ log n + M log M), where M is the number of atoms of the molecule, N is the number of integration points sampled on the surface of the molecule when computing the Born radius of each atom, and n is a parameter introduced in the fast summation algorithm. The fast summation method shows its advantage when it is applied to Born radius calculations for macromolecules, where there can be tens of thousands to millions of atoms, and N can be even larger. In the fast summation method, one only needs to choose an n that is much smaller than M and N to get a good approximation, which makes the new fast-summation-based GB method more efficient.
The rest of the paper is organized as follows: in Section 2 we explain the geometric model that our energy and force computation are based on; we discuss in detail the energy computation in Section 3 and the force computation in Section 4; some implementation results are shown in Section 5; some details such as the fast summation algorithm and the NFFT algorithm are discussed in the appendix.
2. Geometric model
2.1. Gaussian surface
The electron density and shape are used in a similar sense in the literature with respect to the modeling of molecular surfaces or interfaces between the molecule and its solvent. The electron density of atom i at a point x is represented as a Gaussian function exp(β(|x − xi|²/ri² − 1)), where xi and ri are the center and radius of atom i. Considering the isovalue 1, we see that it is attained exactly on the surface of the sphere {x : |x − xi| = ri}. Using this model, the electron density at x due to a protein with M atoms is just a summation of Gaussians:
(2.1)   Σ_{i=1}^{M} exp(β(|x − xi|²/ri² − 1))
where β is a parameter used to control the rate of decay of the Gaussian, known as the blobbiness of the Gaussian. In [18], β = −2.3 with isovalue 1 is indicated as a good approximation to the molecular surface.
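As a small illustration of this model, the Python sketch below evaluates the summed Gaussian density (2.1) at a query point, using the blobbiness β = −2.3 quoted from [18]; the two atoms are hypothetical.

```python
import numpy as np

def gaussian_density(x, centers, radii, beta=-2.3):
    """Summed Gaussian electron density (2.1); the molecular surface is the level set = 1."""
    d2 = np.sum((centers - x) ** 2, axis=1)          # squared distances |x - x_i|^2
    return np.sum(np.exp(beta * (d2 / radii ** 2 - 1.0)))

# two hypothetical atoms (centers in Angstrom, van der Waals radii)
centers = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
radii = np.array([1.5, 1.2])
print(gaussian_density(np.array([1.0, 0.0, 0.0]), centers, radii))   # > 1: point lies inside the surface
```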
2.2. Triangular mesh
The triangular mesh of the Gaussian surface is generated by using the dual contouring method [19, 20]. In the dual contouring method a top-down octree is recursively constructed to enforce that each cell has at most one isocontour patch. The edges whose endpoints lie on different sides of the isocontour are tagged as sign change edges. In each cube that contains a sign change edge, we compute the intersection points (and their unit normals) of the isocontour with the edges of the cube, denoted pi and ni, and compute the minimizer point x in this cube which minimizes the quadratic error function (QEF) [21]: QEF(x) = Σi [ni · (x − pi)]².
Since each sign change edge is shared by either four cubes (uniform grid) or three cubes (adaptive grid), connecting the minimizer points of these neighboring cubes forms a quad or a triangle that approximates the isocontour. We divide the quads into triangles to generate the pure triangular mesh.
2.3. Algebraic spline molecular surface (ASMS)
The triangular mesh is a linear approximation to the Gaussian surface. In our solvation energy computation, we generate another, higher order approximation called the ASMS model (Figure 2.3(f)) based on the triangular mesh to improve accuracy and efficiency [22]. Starting from the triangular mesh, we first construct a prism scaffold as follows. Let [vivjvk] be a triangle of the mesh, where vi, vj, vk are the vertices of the triangle and ni, nj, nk are their unit normals. Define vl(λ) = vl + λnl. Then the prism is defined as Dijk := {p : p = b1vi(λ) + b2vj(λ) + b3vk(λ), λ ∈ Iijk},
where b1, b2, b3 ∈ [0, 1], b1 + b2 + b3 = 1, and Iijk is a maximal open interval such that (i) 0 ∈ Iijk, (ii) for any λ ∈ Iijk, vi(λ), vj(λ) and vk(λ) are not collinear, and (iii) for any λ ∈ Iijk, ni, nj and nk point to the same side of the plane Pijk(λ):= {p: p = b1vi(λ) + b2vj(λ) + b3vk(λ)} (Figure 2.1).
Next we define a function over the prism Dijk in the cubic Bernstein-Bezier (BB) basis:
(2.2)   F(b1, b2, b3, λ) = Σ_{i+j+k=3} bijk(λ) B³ijk(b1, b2, b3)
where B³ijk(b1, b2, b3) = (3!/(i! j! k!)) b1^i b2^j b3^k are the cubic Bernstein polynomials. The ASMS, denoted Γ, is the zero contour of F. The scheme for defining the coefficients bijk is described in detail in [22]. In short, they are defined such that
the vertices of the triangular mesh are points on Γ;
Γ is C1 at the vertices of mesh;
Γ is C1 at the midpoints of the mesh edges.
Later, given the barycentric coordinates of a point (b1, b2, b3) in triangle [vivjvk], we solve the equation F(b1, b2, b3, λ) = 0 for λ by Newton’s method. In this way we can get the corresponding point (x,y,z) on Γ:
(2.3)   (x, y, z)^T = b1 vi(λ) + b2 vj(λ) + b3 vk(λ)
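A minimal Python sketch of this per-point Newton solve is shown below; the cubic F of (2.2) and its λ-derivative are passed in as callables, since their coefficients bijk(λ) come from the construction in [22] and are not reproduced here, and the toy F in the usage lines is only a stand-in.

```python
def lambda_on_patch(F, dF_dlam, b1, b2, b3, lam0=0.0, tol=1e-10, max_iter=30):
    """Solve F(b1, b2, b3, lam) = 0 for lam by Newton's method, starting on the base triangle (lam = 0)."""
    lam = lam0
    for _ in range(max_iter):
        f = F(b1, b2, b3, lam)
        if abs(f) < tol:
            break
        lam -= f / dF_dlam(b1, b2, b3, lam)
    return lam

# usage with a toy F standing in for the cubic BB function (2.2)
F = lambda b1, b2, b3, lam: lam - 0.1 * (b1 * b2 + b2 * b3 + b3 * b1)
dF = lambda b1, b2, b3, lam: 1.0
lam = lambda_on_patch(F, dF, 1/3, 1/3, 1/3)
# the surface point is then (x, y, z) = b1*vi(lam) + b2*vj(lam) + b3*vk(lam) as in (2.3)
print(lam)
```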
We have proved in [22] that the ASMS model is C1 everywhere if the normals of the mesh satisfy certain symmetry conditions. The error between the ASMS and the Gaussian surface is bounded and we have shown that the ASMS converges to the Gaussian surface at the rate of O(h3) where h is the maximum edge length of the mesh.
3. Fast solvation energy computation
3.1. Method
Similarly to other GB models, we use (1.3) as the electrostatic solvation energy function. Before we compute (1.3), we need to first compute the effective Born radius Ri for every atom, which reflects how deeply the charge is buried inside the molecule (Figure 3.1). An atom buried deep in a molecule has a larger Born radius, whereas an atom near the surface has a smaller radius; hence atoms near the surface have a stronger impact on the polarization. Given a discrete van der Waals (vdW) atom model, once we know Ri for each atom, we can compute (1.3) by using the fast multipole method (FMM) [23] with time complexity O(M log M). However, the Born radii computation is not easy and is very time-consuming. There are various ways of computing the Born radius, as summarized in [24]. These methods can be divided into two categories: volume integration based methods and surface integration based methods. In general, the surface integration methods are more efficient than the volume integration methods due to the reduced dimensionality. So we adopt the surface integration method given in [25] to compute the Born radius:
(3.1)   1/Ri = (1/4π) ∫_Γ [(r − xi) · n(r) / |r − xi|⁴] dS
where Γ is the molecule-solvent interface, xi is the center of atom i, and n(r) is the unit normal to the surface at r; we use the ASMS as the model of Γ.
Applying Gaussian quadrature, we compute (3.1) numerically:
(3.2)   1/Ri ≈ (1/4π) Σ_{k=1}^{N} wk (rk − xi) · n(rk) / |rk − xi|⁴
where wk and rk are the Gaussian integration weights and nodes on Γ (Figure 3.2). The rk are computed by mapping the Gaussian nodes of a master triangle to the algebraic patch via the transformation (b1, b2) ↦ r(b1, b2) defined by (2.3). Let (b̂k, ŵk) be one of the Gaussian nodes and weights on the master triangle. Then the corresponding node and weight are rk = r(b̂k) and wk = ŵk |J(b̂k)|, where |J(b̂k)| is the Jacobian determinant of the transformation at b̂k.
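The per-patch sampling can be sketched in Python as follows; `patch_map` and `patch_jacobian` are hypothetical callables standing in for the ASMS transformation (2.3) and its Jacobian determinant, and the 3-point, degree-2 rule quoted in Section 3.3.1 is assumed.

```python
import numpy as np

# 3-point, degree-2 Gaussian rule on the master (canonical) triangle of area 1/2:
# barycentric nodes (2/3, 1/6, 1/6) and permutations, weights 1/6 each.
MASTER_NODES = np.array([[2/3, 1/6, 1/6], [1/6, 2/3, 1/6], [1/6, 1/6, 2/3]])
MASTER_WEIGHTS = np.array([1/6, 1/6, 1/6])

def patch_quadrature(patch_map, patch_jacobian):
    """Map master-triangle nodes/weights to one algebraic patch: the r_k and w_k of (3.2)."""
    nodes = np.array([patch_map(b) for b in MASTER_NODES])                 # r_k on the surface
    weights = MASTER_WEIGHTS * np.array([abs(patch_jacobian(b)) for b in MASTER_NODES])
    return nodes, weights

# toy usage on a flat patch spanned by three vertices (Jacobian = 2 * triangle area, constant)
V = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
flat_map = lambda b: b @ V
flat_jac = lambda b: 2.0 * 0.5
rk, wk = patch_quadrature(flat_map, flat_jac)
print(rk, wk.sum())                                 # weights sum to the patch area (here 0.5)
```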
We formalize (3.2) in two steps. First we split it into two parts:
(3.3)   1/Ri ≈ (1/4π) [ Σ_{k=1}^{N} wk rk · n(rk) / |xi − rk|⁴ − xi · Σ_{k=1}^{N} wk n(rk) / |xi − rk|⁴ ]
Then we split the second summation in (3.3) into three components:
(3.4)   xi · Σ_{k=1}^{N} wk n(rk)/|xi − rk|⁴ = x_i Σ_{k=1}^{N} wk nx(rk)/|xi − rk|⁴ + y_i Σ_{k=1}^{N} wk ny(rk)/|xi − rk|⁴ + z_i Σ_{k=1}^{N} wk nz(rk)/|xi − rk|⁴,  where xi = (x_i, y_i, z_i) and n = (nx, ny, nz).
The first summation in (3.3) and the three summations in (3.4) without the coefficients in front are of the common form:
(3.5)   S(xi) = Σ_{k=1}^{N} ck g(xi − rk),   i = 1, …, M,
with the kernel function g(x) = 1/|x|⁴ and the coefficients ck = wk rk · n(rk), wk nx(rk), wk ny(rk), and wk nz(rk), respectively. (3.5) can be efficiently computed by using the fast summation algorithm introduced in [26] with complexity O(M + N + n³ log n), where n is a parameter used in the fast summation algorithm.
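For reference, a direct O(MN) evaluation of (3.2) looks as follows in Python; this is the baseline that the fast summation of Section 3.2 replaces (the variable names and the single-sphere sanity check are ours).

```python
import numpy as np

def born_radii_direct(atom_centers, quad_points, quad_weights, quad_normals):
    """Direct O(MN) evaluation of the surface quadrature (3.2): returns R_i for each atom."""
    inv_R = np.empty(len(atom_centers))
    for i, xi in enumerate(atom_centers):
        d = quad_points - xi                          # r_k - x_i
        dist4 = np.sum(d * d, axis=1) ** 2            # |r_k - x_i|^4
        inv_R[i] = np.sum(quad_weights * np.sum(d * quad_normals, axis=1) / dist4) / (4.0 * np.pi)
    return 1.0 / inv_R

# sanity check on a single unit sphere sampled coarsely: R should be close to the sphere radius
theta = np.linspace(0.05, np.pi - 0.05, 40)
phi = np.linspace(0.0, 2 * np.pi, 80, endpoint=False)
T, P = np.meshgrid(theta, phi)
pts = np.c_[np.sin(T.ravel()) * np.cos(P.ravel()), np.sin(T.ravel()) * np.sin(P.ravel()), np.cos(T.ravel())]
w = np.sin(T.ravel()) * (theta[1] - theta[0]) * (phi[1] - phi[0])
print(born_radii_direct(np.zeros((1, 3)), pts, w, pts))   # close to 1.0
```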
3.2. Fast summation
The fast summation algorithm is published in [26]. For convenience, we discuss this algorithm in this section briefly. The fast summation algorithm is often applied to compute the summations of the form
(3.6)   S(xi) = Σ_{k=1}^{N} ck g(xi − rk),   i = 1, …, M,
where the kernel function g is a fast decaying function. Cutting off the tail of g, one can assume that the support of g is bounded. In our Born radii computation, since the distance between xi and rk is no less than the smallest radius of the atoms, there is no singularity in g. Without loss of generality, we assume that all the differences xi − rk lie in the box Π = [−1/2, 1/2)³. After duplicating g in the other intervals, g can be extended to a periodic function of period one on ℝ³, and this periodic function can be expanded in a Fourier series:
(3.7)   g(x) = Σ_{ω ∈ I∞} gω e^{2πi ω·x}
where I∞:= {(ω1, ω2, ω3) ∈ ℤ3} and gω = ∫Π g(x)e−2πiω·x dx. We approximate (3.7) by a truncated series:
(3.8)   g(x) ≈ Σ_{ω ∈ In} gω e^{2πi ω·x}
where In := {ω = (ω1, ω2, ω3) ∈ ℤ³ : −n/2 ≤ ωj < n/2, j = 1, 2, 3}. We compute the Fourier coefficients gω numerically by
(3.9)   gω ≈ (1/n³) Σ_{l ∈ In} g(l/n) e^{−2πi ω·l/n}
which is evaluated by the fast Fourier transform (FFT) algorithm with complexity O(n³ log n).
Plugging (3.8) into (3.6), we get
(3.10)   S(xi) ≈ Σ_{ω ∈ In} gω aω e^{2πi ω·xi},   i = 1, …, M,
where
(3.11)   aω = Σ_{k=1}^{N} ck e^{−2πi ω·rk},   ω ∈ In.
(3.10) is computed by using the NFFT algorithm with complexity O(n³ log n + M) and (3.11) is computed by the NFFTT algorithm with complexity O(n³ log n + N). Hence the total complexity of computing (3.6) is O(N + M + n³ log n), which is significantly faster than the trivial O(MN) summation once the number n of Fourier frequencies per dimension is much smaller than M and N. We explain the NFFT algorithm and the NFFTT algorithm in Appendix A and B, respectively.
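The structure of (3.8)-(3.11) is summarized by the following one-dimensional Python sketch; the kernel, the point counts, and the regularization width are illustrative assumptions, and the two transforms marked as the NFFT and NFFTT steps are written as direct sums here, which is exactly what the nonequispaced FFTs replace to reach the stated complexity.

```python
import numpy as np

# 1D illustration of the fast summation (3.6)-(3.11): S(x_i) = sum_k c_k g(x_i - r_k).
M, N, n = 200, 300, 128
rng = np.random.default_rng(0)
x = rng.uniform(-0.2, 0.2, M)                    # targets ("atom centers" x_i)
r = rng.uniform(-0.2, 0.2, N)                    # sources ("quadrature points" r_k)
c = rng.standard_normal(N)                       # coefficients c_k
g = lambda t: 1.0 / (t ** 2 + 0.1 ** 2) ** 2     # smooth, fast-decaying stand-in for 1/|x|^4

# reference: trivial O(MN) summation
S_direct = np.array([np.sum(c * g(xi - r)) for xi in x])

# (3.9): Fourier coefficients g_w of the 1-periodized kernel from n equispaced samples, via FFT
l = np.arange(-n // 2, n // 2)
gw = np.fft.fftshift(np.fft.fft(np.fft.ifftshift(g(l / n)))) / n

# (3.11): a_w = sum_k c_k exp(-2 pi i w r_k)  -- the NFFTT step, here a direct sum
aw = np.exp(-2j * np.pi * np.outer(l, r)) @ c

# (3.10): S(x_i) ~ sum_w g_w a_w exp(2 pi i w x_i)  -- the NFFT step, here a direct sum
S_fast = np.real(np.exp(2j * np.pi * np.outer(x, l)) @ (gw * aw))

print(np.max(np.abs(S_fast - S_direct)) / np.max(np.abs(S_direct)))   # small; decreases as n grows
```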
3.3. Error analysis
The numerical analysis of the error introduced during the computation of (3.1) can be decomposed as follows: (i) a quadrature error EQ; (ii) a “fast computation” error in the evaluation of the quadrature sum itself. The latter error is decomposed into three terms, which correspond to different steps in the numerical procedure: the truncation error EFS introduced when the Fourier series (3.7) is truncated to finitely many terms, the NFFTT errors Eω introduced when the coefficients (3.11) are computed, and the NFFT error ENFFT introduced when (3.10) is finally evaluated by the NFFT algorithm.
Let Ii and Ĩi denote the exact integral and the numerical output of (3.1) for atom i, respectively. Then we have Ii − Ĩi = EQ,i + EFS,i + ENFFT,i + ENFFTT,i.
Let E = (I1 − Ĩ1, …, IM − ĨM)^T. We have
(3.12)   ||E||∞ ≤ ||EQ||∞ + ||EFS||∞ + ||ENFFT||∞ + ||ENFFTT||∞
Next we will analyze each individual error ||EQ||∞, ||EFS||∞, ||ENFFT||∞, and ||ENFFTT||∞.
3.3.1. Quadrature error
Let Γe be one of the algebraic patches on the molecular surface Γ. Suppose Γe is built based on a triangle e:= [vi, vj, vk]. Any point (b1, b2, b3) ∈ e can be mapped to a point r(b1, b2) ∈ Γe. The integration (3.1) over Γe is
(3.13)   ∫_{Γe} (r − xi)·n(r)/|r − xi|⁴ dS = ∫_{Ω0} [(r(b1, b2) − xi)·n(r(b1, b2)) / |r(b1, b2) − xi|⁴] |J(b1, b2)| db1 db2
where Ω0 is the canonical triangle, (b1, b2, b3) are the barycentric coordinates of the points in Ω0, and |J| is the Jacobian. Let f(b1, b2) denote the integrand in (3.13). As we discuss in Appendix C, f(b1, b2) ∈ C∞(Ω0). Suppose we use an s-th order quadrature rule on element e; then
(3.14) |
We expand f(b1, b2) in a Taylor series around a point :
(3.15) |
where Ps(b1, b2) is a polynomial of degree s:
(3.16) |
and the residue Rs is
(3.17) |
Then the error E becomes
Let Wk = max(|wk|), we get
Within Ω0, and , hence
(3.18) |
where denotes . By the chain rule,
According to (2.3), we have . Let hmax be the maximum edge length of the triangular mesh, λmax = max{|λ |}, and h = max(hmax, λmax). Then we have . Similarly, we can get the same bound for the derivatives of x, y, z with respect to b1 and b2. Therefore
(3.19) |
where , and the constant . Noticing that the area of Ω0 is 1/2, we can write
(3.20) |
Even though a greater number of quadrature nodes gives a higher order of accuracy, the increase in computational cost is a limiting factor. Meanwhile, since the ASMS error is of order h³, there is no point in approximating (3.20) to very high order. As a trade-off, we use a two dimensional 3-point Gaussian quadrature over the triangle Ω0, which is of order 2 [27]. So s = 2 and se = 3. The nodes are the barycentric point (2/3, 1/6, 1/6) and its permutations, and the weights are wk = 1/6 for k = 1, 2, 3. Then
(3.21) |
Suppose there are Ne patches on Γ, then |EQ| ≤ 2NeCh3. So we have the same bound
(3.22) |
3.3.2. Fast summation error
According to the fast summation method described in Section 3.2, the Fourier series is truncated into a finite series
where denotes the truncation error of the Fourier series. Hence
(3.23) |
where ,
(3.24) |
with
(3.25) |
and g being the kernel function in the fast summation; in the Born radii calculation, g(x) = 1/|x|⁴. As defined in Section 3.2, Π is bounded and the arguments of g are bounded away from 0. Let ω = (ω1, ω2, ω3). Then we rewrite Σ_{ω∈I∞\In} |gω| as
(3.26) |
By successive integration by parts for each dimension, we get
where m = m1 + m2 + m3 and . Therefore
Let μm = ∫Π |Dmg(x)| dx. We obtain . For the other terms in (3.26) we have the same upper bound. If we assume m1, m2, m3 ≥ 2, then
For m1 = m2 = m3, we have
(3.27) |
Then for (3.24), we have
(3.28) |
In fact, the right hand side of (3.28) is independent of i. Therefore we get
(3.29) |
3.3.3. NFFT error
The error analysis of the NFFT algorithm is thoroughly discussed at the end of Appendix A. This error estimation is derived based on the analysis in [28]. In summary, the NFFT error is split into the aliasing error and the truncation error [28]:
The error bounds of and are
(3.30) |
(3.31) |
where ξ is a 1-periodic window function defined in Appendix A, Cω(ξ) are the Fourier coefficients of ξ, and η is a truncated version of ξ. In the fast summation method (3.10), ||Ĝ||1 = Σ_{ω∈In} |gω aω|, where gω and aω are defined in Section 3.2. Combining (3.30) and (3.31), one obtains
(3.32) |
In [26], the constant C(ξ, m, σ) is given explicitly for several specific choices of the window ξ.
3.3.4. NFFTT error
As we mentioned in Section 3.2, (3.11) is computed by the NFFTT algorithm and the results are then plugged into (3.10) for the subsequent evaluation of the summation. So the NFFTT error ENFFTT is
(3.33) |
where Eω denotes the error of the NFFTT algorithm and gω is the same as is defined in (3.25). Then we have
(3.34) |
with and .
As we discussed in Appendix B, the NFFTT error Eω is decomposed into the aliasing error ( ) and the truncation error . So
where and . Based on the error bounds derived in Appendix B,
(3.35) |
and
(3.36) |
where . Comparing (3.35) with (3.30) and comparing (3.36) with (3.31) yield the error estimation of Eω which is similar to ENFFT:
Hence
(3.37) |
The inequality (3.37) is independent of i, therefore,
(3.38) |
4. Fast solvation force computation
The solvation force acting at the center of atom α, which is part of the forces driving the dynamics, is
(4.1)   Fα = −∂Gsol/∂xα
Partition the solvation energy into polar and non-polar parts:
(4.2)   Gsol = Gpol + Gnp
The non-polar force is proportional to the derivatives of the volume and/or the surface area with respect to the atomic coordinates. There has been previous work on analytically computing the derivatives of the area/volume [31, 32, 33]. To compute the polar force, we first define
(4.3)   fij = [rij² + Ri Rj exp(−rij²/(4 Ri Rj))]^{1/2}
Then
(4.4)   Gpol = −(1/2)(1/εp − 1/εw) Σ_{i,j} qi qj / fij
Differentiating (4.4) with respect to the position xα of atom α, one gets
(4.5)   ∂Gpol/∂xα = (1/2)(1/εp − 1/εw) Σ_{i,j} (qi qj / fij²) ∂fij/∂xα
where
(4.6)   ∂fij/∂xα = (∂fij/∂rij)(∂rij/∂xα) + (∂fij/∂Ri)(∂Ri/∂xα) + (∂fij/∂Rj)(∂Rj/∂xα)
From (4.3), one can easily compute ∂fij/∂rij and ∂fij/∂Ri, which are ∂fij/∂rij = rij (1 − exp(−rij²/(4 Ri Rj))/4) / fij and ∂fij/∂Ri = (Rj + rij²/(4Ri)) exp(−rij²/(4 Ri Rj)) / (2 fij).
∂rij/∂xα is nonzero only if i = α or j = α, in which case it equals ±(xi − xj)/rij. In (4.6) the computation of ∂Ri/∂xα for i = 1, …, M is not trivial. Because Γ depends on the positions of the atoms, it is not easy to compute the derivative of Ri directly from (3.1). To solve this problem, we convert the integration domain back to the volume:
(4.7)   1/Ri = (1/4π) ∫_{ext(Γ)} 1/|r − xi|⁴ dr,  where ext(Γ) denotes the solvent region exterior to Γ.
Then by defining a volumetric density function to distinguish the exterior from the interior of the molecule, we may have an integration domain that is independent of {xi}. One way of defining the volumetric function is given in [34] where they first define a density function for each of the atoms
and then define the volumetric function by following the inclusion-exclusion principle
(4.8) |
There are some nice properties of this model. For example, the exterior region of the molecule is well characterized by ρ = 0 and two atoms i and j are disconnected if for any r ∈ ℝ3, χi(r)χ j(r) = 0. The drawback of this model is that function χ is not smooth, which makes it inapplicable to the derivative computation. Therefore we smoothen χ by introducing a cubic spline near the atom boundary:
(4.9) |
with x = ||r − xi||. The region defined by ρi ≠ 0 is regarded as the interior of atom i, and this region converges to the van der Waals volume of the atom as w goes to 0. In the SES model, two atoms are considered to be completely separated if the distance between their centers is greater than the sum of the radii plus the probe diameter; otherwise they can be connected by the reentrant surface of the rolling probe. By setting w = 1.4 Å, atoms i and j are disconnected in the same sense as in the SES model iff ρi(r)ρj(r) = 0 for any r ∈ ℝ³. In addition to this modification, we neglect the cases where more than four atoms overlap simultaneously. Therefore the molecular volumetric density function becomes
(4.10)   ρ(r) = Σi ρi(r) − Σ_{i<j} ρi(r)ρj(r) + Σ_{i<j<k} ρi(r)ρj(r)ρk(r) − Σ_{i<j<k<l} ρi(r)ρj(r)ρk(r)ρl(r)
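The per-atom density of (4.9) is reproduced below only schematically in Python: the interior value 1, the exterior value 0, and a C1 falloff over the transition width w are kept, but the particular cubic used here is our own stand-in rather than the spline of (4.9), and the two-atom example applies the truncated inclusion-exclusion of (4.10).

```python
import numpy as np

def atom_density(r, xi, ai, w=1.4):
    """Schematic smoothed per-atom density: 1 inside the vdW ball, 0 beyond a_i + w,
    with a C1 cubic falloff in between (a stand-in for the spline in (4.9))."""
    x = np.linalg.norm(r - xi)
    if x <= ai:
        return 1.0
    if x >= ai + w:
        return 0.0
    t = (x - ai) / w                              # t in (0, 1) across the transition shell
    return 1.0 - 3.0 * t ** 2 + 2.0 * t ** 3      # value and slope match at both ends

# two overlapping hypothetical atoms; truncated inclusion-exclusion as in (4.10)
r = np.array([1.0, 0.0, 0.0])
rho = [atom_density(r, np.array([0.0, 0.0, 0.0]), 1.5),
       atom_density(r, np.array([2.0, 0.0, 0.0]), 1.2)]
print(rho[0] + rho[1] - rho[0] * rho[1])          # molecular density rho(r) for this two-atom case
```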
We define the complementary function ρ̄ = 1 − ρ. It is easy to show that within the van der Waals surface (VWS) of the molecule ρ̄ is always 0; beyond the SAS ρ̄ is always 1; and in between, 0 < ρ̄ < 1. Then (4.7) can be rewritten as
(4.11)   1/Ri = (1/4π) ∫_{ℝ³} ρ̄(r) / |r − xi|⁴ dr
Differentiating both sides of (4.11) with respect to xα, one gets
(4.12) |
So
(4.13) |
For the first integral in (4.13),
where j, k, l are the atoms overlapping with atom α, g = 1 − Σj ρj + Σj<k ρjρk − Σj<k<l ρjρkρl, and
with x = ||r − xα||. Noticing that ∂ρα/∂xα is nonzero only if aα < |r − xα| < aα + w, the first integral in (4.13) can be simplified as
(4.14) |
The integration domain of (4.14) is a regular spherical shell of the width w around atom α (Figure 4.1(a)). We switch to the spherical coordinate system:
where (r, θ, φ) ∈ [0, w] × [0, 2π] × [0, π]. We sample r, θ, φ using the 2-point Gaussian quadrature nodes in each dimension. All the atoms in the molecule share the same set of sampling nodes (r, θ, φ).
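A small Python sketch of this shell sampling follows; it assumes the shell around atom α is parameterized as x = xα + (aα + r)(sin φ cos θ, sin φ sin θ, cos φ) with r ∈ [0, w], so that the Jacobian is (aα + r)² sin φ, and it uses the 2-point Gauss-Legendre rule in each dimension.

```python
import numpy as np

def shell_quadrature(x_alpha, a_alpha, w=1.4):
    """2x2x2 Gauss-Legendre quadrature over the spherical shell a_alpha < |x - x_alpha| < a_alpha + w."""
    t, wt = np.polynomial.legendre.leggauss(2)            # nodes/weights on [-1, 1]
    r,  wr  = 0.5 * w * (t + 1.0),     0.5 * w * wt       # affine maps to [0, w],
    th, wth = np.pi * (t + 1.0),       np.pi * wt         # [0, 2*pi],
    ph, wph = 0.5 * np.pi * (t + 1.0), 0.5 * np.pi * wt   # and [0, pi]
    pts, wts = [], []
    for ri, wri in zip(r, wr):
        for ti, wti in zip(th, wth):
            for pi_, wpi in zip(ph, wph):
                rad = a_alpha + ri
                pts.append(x_alpha + rad * np.array([np.sin(pi_) * np.cos(ti),
                                                     np.sin(pi_) * np.sin(ti),
                                                     np.cos(pi_)]))
                wts.append(wri * wti * wpi * rad ** 2 * np.sin(pi_))   # include the Jacobian
    return np.array(pts), np.array(wts)

pts, wts = shell_quadrature(np.zeros(3), a_alpha=1.5)
print(wts.sum(), 4 * np.pi / 3 * ((1.5 + 1.4) ** 3 - 1.5 ** 3))        # ~ shell volume
```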
The second integral in (4.13) is nonzero if i ≡ α. In that case
(4.15) |
We compute each component of (4.15) individually and convert to a surface integration (Figure 4.1(b)) by the divergence theorem:
(4.16)
(4.17)
(4.18)
where the quadrature weights and points (wk, rk) and the unit normals n(rk) are the same as those used in Section 3. We compute (4.16), (4.17), and (4.18) by directly applying the fast summation method with the corresponding coefficients. Since the same algorithm is used in the Born radius derivative calculation, the error analysis is similar to that of the Born radius calculation, except that a quadrature error for the integration over the shell region needs to be added.
To compute the force acting on each of the M atoms, we need to compute (4.15) for i = 1, …, M. By using the fast summation algorithm, the computational complexity of this part is O(N + M + n³ log n), the same as for the energy computation. To compute (4.14), since the shell integration domain is narrow, only a small number of atoms have non-zero densities in this region; therefore, for a fixed α, the complexity of computing (4.14) for i = 1, …, M is O(M). Moreover, since the integrand in (4.14) is very small if atom i and atom α are far apart, we use a cut-off distance d0 in our computation and compute (4.14) only if d(i, α) ≤ d0. Therefore the overall time complexity of computing (4.13) is O(N + M + n³ log n).
5. Results
We compare the polarization energy computed based on the fast summation algorithm and the trivial summation in Table 5.1 for four proteins (PDB ID: 1CGI_l, 1BGX, 1DE4, 1N2C). An ASMS model is constructed for each protein with Ne patches. A three-point Gaussian quadrature is used on each algebraic patch. We also compare the overall computation time of the two methods. As we see from the table, for the small protein 1CGI_l the fast summation method is slower than the trivial summation. However, as the protein size gets larger (e.g. 1BGX, 1DE4, 1N2C), the fast summation is clearly faster than the trivial summation without losing too much accuracy: the relative error ε between the fast summation and the trivial summation remains small. As for the trade-off between efficiency and accuracy, since efficiency is the greater concern in current MD simulation research, the fast-summation-based GB method is preferable to the trivial GB method.
Table 5.1. Polarization energy Gpol, timing, and relative error ε for four proteins, comparing (A) the fast summation based GB method with (B) the trivial summation.

|   | Protein ID | 1CGI_l | 1BGX | 1DE4 | 1N2C |
|---|---|---|---|---|---|
|   | M | 852 | 19,647 | 26,003 | 39,946 |
|   | Ne | 29,108 | 112,636 | 105,288 | 83,528 |
|   | N | 116,432 | 450,544 | 421,152 | 334,112 |
|   | n | 100 | 100 | 100 | 100 |
|   | σ | 2 | 2 | 2 | 2 |
|   | m | 4 | 4 | 4 | 4 |
| A | Gpol | −1380.988 | −19734.848 | −25754.552 | −41408.959 |
| A | timing | 86 | 358 | 863 | 631 |
| B | Gpol | −1343.150 | −19297.528 | −25388.455 | −40675.383 |
| B | timing | 49 | 4327 | 5368 | 9925 |
|   | ε | 2.8% | 2.3% | 1.4% | 1.8% |
In Figure 5.1 we compare Gpol computed by the fast summation based GB and the trivial summation method, along with their computation time, for proteins of various sizes. For all these proteins, we generate ASMS models with the same number of patches (in our test we use 20,000 patches for each protein). We choose the fixed parameters n = 30, m = 4, and σ = 2 for all the proteins. We observe that the Gpol computed by the fastsum GB is close to that computed by the trivial GB method, and the error gets larger as the molecule gets bigger. Even though the error analysis in Section 3.3 does not show that the error depends on the size of the molecule, the analysis is based on the assumption that the kernel function is defined on the domain Π = [−1/2, 1/2)³. To ensure that xi − rk, i = 1, …, M, k = 1, …, N, all lie within this range, we scale the molecule; the larger the molecule, the larger the scaling factor. Later, when we scale back to the original coordinates by multiplying by the scaling factor, the error gets amplified. As we expect, the computation time of the fastsum GB increases as M becomes large, but it remains much lower than that of the traditional GB method.
In Figure 5.2, we compare Gpol computed by the fast summation based GB versus the trivial summation method and the computation time for a test protein 1JPS where we generate the ASMS with different numbers of patches. We use the same values for the parameters n, m, and σ as in the previous test. As shown in the figure, as the triangular mesh becomes denser, the fast summation result converges rapidly to the result of the trivial method but takes less computation time.
For the test proteins 1ANA, 1MAG, 1PPE_l, and 1CGI_l, we compute the solvation force Fα for α = 1, …, M. We show the timing results in Table 5.2. In general, if an atom has a strong solvation force, the atom is easily polarized and hence is likely an active atom. Conversely, if an atom has a weak solvation force, it is more likely to be an inactive atom. For every test protein, after we compute the solvation force for each atom, we sort the forces by magnitude and select the most active and the most inactive atoms. As shown in Figure 5.3, the top 5% most active atoms are rendered in red and the bottom 5% are rendered in blue. This provides a convenient and cheaper alternative to experimental methods for helping biologists quickly locate the active sites of a protein.
Table 5.2. Timing results for the solvation force computation on the test proteins.
Protein ID | M | N | t1 (s) | t2 (s) | Ttotal (s) |
---|---|---|---|---|---|
1ANA | 249 | 6,676 | 66.05 | 0.14 | 66.19 |
1MAG | 544 | 7,328 | 69.58 | 0.23 | 69.81 |
1PPE_l | 436 | 5,548 | 59.55 | 0.56 | 60.11 |
1CGI_l | 852 | 6,792 | 68.71 | 3.27 | 71.98 |
6. Conclusion
We introduce a fast summation based algorithm to calculate the effective Born radii and their derivatives in the generalized Born model of implicit solvation. The algorithm relies on a variation of the formulation for the Born radii and an additional analytical volumetric density function for the derivatives. For a system of M atoms and N sampling points on the molecular surface, the trivial way of computing the Born radii requires O(MN) arithmetic operations, whereas with the aid of the Fourier expansion of the kernel functions of the Born radii (and their derivatives) and the NFFT algorithm, which essentially approximates the complex exponentials in the NDFT by the DFT of a fast decaying smooth window function, the Born radii as well as their derivatives can be obtained at a cost of O(M + N + n³ log n), where n is the number of frequencies per dimension in the Fourier expansion. We show that the error of the algorithm decreases as the mesh gets denser, or as any of the parameters σ, m, n increases. Besides the Born model developed with the Coulomb field approximation, there have been other models for the Born radii evaluation, for example the Kirkwood-Grycuk model [35], where 1/Ri³ = (3/4π) ∫_{ext(Γ)} dr/|r − xi|⁶. This model has recently been applied in the GBr6NL model, which approximates the solvation energy of the nonlinear Poisson-Boltzmann equation [36]. It is interesting to note that we can utilize a similar quadrature point generation via the ASMS and the fast summation algorithm to speed up this GBr6NL computation. In fact, by the divergence theorem, 1/Ri³ = (1/4π) ∮_Γ (r − xi)·n(r)/|r − xi|⁶ dS, and the rest follows similarly to the methods in this paper.
Acknowledgments
This research was supported in part by NSF grant CNS-0540033 and NIH contracts R01-EB00487, R01-GM074258, R01-GM07308. We thank the reviewers, as well as Dr. Rezaul Chowdhury for all the excellent suggestions that have resulted in a considerably improved paper. We also wish to thank several members of our CVC group for developing and maintaining TexMol, our molecular modeling and visualization software tool, which was used in conjunction with our nFFTGB implementation, to produce all the pictures in our paper (http://cvcweb.ices.utexas.edu/software/).
Appendix A. NFFT
The NFFT [28] is an algorithm for fast computation of multivariate discrete Fourier transforms for nonequispaced data in the spatial domain (NDFT1). The NDFT1 problem is to evaluate the trigonometric polynomials
(A.1)   G(xj) = Σ_{ω ∈ In} Ĝω e^{2πi ω·xj},   j = 1, …, M,
where . Without loss of generality, we assume . Instead of computing the summations in (A.1) directly, one can approximate G by a function s(x) which is a linear combination of the shifted 1-periodic kernel function ξ:
(A.2)   s(x) = Σ_{l ∈ Iσn} gl ξ(x − l/(σn))
where Iσn := {l = (l1, l2, l3) ∈ ℤ³ : −σn/2 ≤ lj < σn/2, j = 1, 2, 3} and the gl are coefficients to be determined. We have σ > 1 because of the error estimation discussed in Section 3.3.3.
The kernel function ξ is defined as the 1-periodization of a window function ξ0: ξ(x) = Σ_{r ∈ ℤ³} ξ0(x + r).
Good candidates for ξ0 include Gaussian, B-spline, sinc, and Kaiser-Bessel functions. Expand the periodic kernel function ξ by its Fourier series
(A.3)   ξ(x) = Σ_{ω ∈ ℤ³} Cω(ξ) e^{2πi ω·x}
with the Fourier coefficients Cω(ξ) = ∫_Π ξ(x) e^{−2πi ω·x} dx.
Cutting off the higher frequencies in (A.3), one gets
(A.4)
(A.5)
with the coefficients
(A.6) |
By defining
(A.7) |
one can immediately get
(A.8) |
The next problem is to compute gl. From (A.6), one can compute the coefficients gl which are also coefficients in (A.8) by the discrete Fourier transform
(A.9) |
with complexity O(n3 log n) by the FFT algorithm.
Since the function ξ decays very fast, one can further reduce the computational complexity of (A.8) by cutting off the tail of ξ. Define a function η0:
Construct the one-periodic function η the same way as ξ is constructed:
Replacing ξ with η in (A.8), we obtain that
(A.10) |
where Iσn,m(xj) = {(l1, l2, l3): σnxj,i − m ≤ li ≤ σnxj,i + m, i = 1, 2, 3}. There are at most (2m + 1)3 nonzero terms in (A.10). Therefore the complexity of evaluating (A.10) for j = 1, …, M is O(m3M). Adding the complexity of computing the coefficients gl, the overall complexity of NFFT algorithm is O(n3 log n + m3M).
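The following one-dimensional Python sketch illustrates the approximation chain (A.1) → (A.9) → (A.8)/(A.10) with a Gaussian window; the window shape parameter is an illustrative assumption, the window is left untruncated (i.e. effectively m = σn/2) for simplicity, and the coefficients Cω(ξ) are computed numerically rather than from a closed form.

```python
import numpy as np

# 1D sketch of the NFFT approximation: evaluate (A.1) via the window sum (A.2)/(A.10).
n, sigma, M = 24, 2, 40
N = sigma * n                                        # oversampled grid size sigma*n
rng = np.random.default_rng(1)
x = rng.uniform(-0.5, 0.5, M)                        # nonequispaced nodes in [-1/2, 1/2)
omega = np.arange(-n // 2, n // 2)                   # frequencies in I_n
Ghat = rng.standard_normal(n) + 1j * rng.standard_normal(n)

# reference: direct evaluation of the trigonometric polynomial (A.1)
G = np.exp(2j * np.pi * np.outer(x, omega)) @ Ghat

# 1-periodic Gaussian window xi (untruncated) and its Fourier coefficients C_omega(xi)
def xi(t):
    t = t - np.round(t)                              # wrap into [-1/2, 1/2)
    return np.exp(-(N * t) ** 2 / 4.0)               # assumed window shape
l = np.arange(-N // 2, N // 2)
C = np.array([np.mean(xi(l / N) * np.exp(-2j * np.pi * w * l / N)) for w in omega])

# (A.9): grid coefficients g_l from Ghat/C (this inverse-type DFT is an FFT of size sigma*n in practice)
g = np.array([np.sum((Ghat / C) * np.exp(2j * np.pi * omega * li / N)) / N for li in l])

# (A.8)/(A.10): s(x_j) = sum_l g_l xi(x_j - l/(sigma*n)); truncating xi gives the O((2m+1)^3 M) step
s = np.array([np.sum(g * xi(xj - l / N)) for xj in x])

print(np.max(np.abs(G - s)))                         # small aliasing error
```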
Remark
If we reorganize the above equations, it is not hard to see that, in fact, (A.1) is approximately computed by the expression
(A.11) |
From a linear algebra point of view, equation (A.11) can be written as the product of a matrix and a vector. For example, for a one dimensional NFFT, (A.11) is equivalent to
(A.12) |
with vectors
Ξ is a sparse matrix
F is the classical Fourier matrix
and D is an n × n diagonal matrix with the iith element being . For a multi-dimensional NFFT, it is the same as the 1D case as long as one orders the indices of the multi-dimension into one dimension.
As discussed in [28], in the first approximation (A.8), we see that s is equal to G after its higher frequencies in the Fourier series are cut off. Hence the error introduced in (A.8) which is known as the aliasing error is
(A.13) |
Note that from (A.6), we have the condition G̃ω+iσn = G̃ω, for i ∈ ℤ3 and ω ∈ Iσn. By the definition (A.7), one obtains
(A.14) |
Let . Then
(A.15) |
In the second approximation (A.10), since ξ is replaced by η, the error so caused, known as the truncation error, is
(A.16) |
Thus
(A.17) |
Appendix B. NFFTT
The NFFTT algorithm deals with the fast computation of multivariate discrete Fourier transforms for nonequispaced data in frequency domain (NDFT2):
(B.1)   a(ω) = Σ_{k=1}^{N} ck e^{−2πi ω·rk},   ω ∈ In.
Define a function
(B.2)   A(x) = Σ_{k=1}^{N} ck ξ(x − rk)
where ξ is defined in the same way as in Appendix A. The Fourier series of A(x) is:
(B.3) |
On the other hand,
(B.4) |
Hence we get the relationship of Fourier coefficients of A and ξ:
(B.5) |
Comparing (B.5) with (B.1) one obtains
(B.6) |
It remains to compute Cω (A). By definition,
(B.7)   Cω(A) = ∫_Π A(x) e^{−2πi ω·x} dx
Discretizing the integration in (B.7) by the left rectangular rule leads to
(B.8) |
Replacing ξ with η yields
(B.9) |
where
(B.10) |
To compute ĝl, if one scans the rk list, then for each rk there are at most (2m + 1)3 grid points (l) that contribute nonzero η. Hence, the complexity of computing ĝl is O(m3N). After computing ĝl one can easily evaluate (B.9) by the FFT algorithm at the complexity of O(n3 log n). Lastly the complexity of computing (B.6) is O(n3). So the overall complexity of the NFFTT algorithm is O(m3N +n3 log n).
Remark
Similar to the NFFT algorithm, we may write the one-line formula for computing (B.1) by the NFFTT:
(B.11) |
which in one dimension is equivalent to the linear system:
(B.12) |
with vectors
Matrix Ξ is similar to that defined in Appendix A
F* is the conjugate transpose of the Fourier matrix F, and D is the same as that defined in Appendix A. From the matrix expression, we see why the algorithm is called the “transpose” of NFFT.
Let Eω designate the error of a(ω). Eω can also be split into the aliasing error introduced in (B.8) and the truncation error introduced in (B.9), for ω ∈ Iσn. By taking the Fourier expansion of ξ, we get from (B.8), so
Since
we have,
(B.13) |
By (B.1),
(B.14) |
Define . Then we have
(B.15) |
In (B.9), the truncation error
(B.16) |
which has the bound
(B.17) |
Appendix C. Continuity of f
As defined in Section 3.3.1,
(C.1) |
where r ≠ xi and n = ∇F with F given in (2.2). r(b1, b2, λ) is simply defined in (2.3). In this appendix, we mainly discuss the continuity of n. As derived in [22],
(C.2) |
where
is a nonsingular matrix. Hence n is well defined. Consider ( ):
Let . We have
(C.3) |
where , and
(C.4) |
To show is differentiable, we take the first row of and compute its derivative with respect to x, i.e. ( ) as an example. We write (2.3) in the form of
(C.5) |
Taking the second derivatives of both sides of (C.5) with respect to x, we get
(C.6) |
(C.7) |
(C.8) |
where
So we get
(C.9) |
Using the same method, we can get the other rows of , matrices and by changing Cf, Cg, Ch in (C.9). Therefore is differentiable. Similarly, we can compute the higher order derivatives of and prove that ∈ C∞, thus prove F ∈ C∞ (Ω0), where Ω0 defined in Section 3.3.1 is the canonical triangle. Therefore, as defined in (C.1), f ∈ C∞(Ω0).
Footnotes
This research was supported in part by NSF grant: CNS-0540033, and in part by NIH grants: P20-RR020647, R01-GM074258, R01-GM073087, and R01-EB004873.
Contributor Information
CHANDRAJIT BAJAJ, Email: bajaj@ices.utexas.edu.
WENQI ZHAO, Email: wzhao@ices.utexas.edu.
References
- 1. Phillips J, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel R, Kale L, Schulten K. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289.
- 2. Roux B, Simonson T. Implicit solvent models. Biophys Chem. 1999;78:1–20. doi: 10.1016/s0301-4622(98)00226-9.
- 3. Weeks J, Chandler D, Andersen H. Role of repulsive forces in determining the equilibrium structure of simple liquids. J Chemical Physics. 1971;54:5237–5247.
- 4. Chandler D, Weeks J, Andersen H. Van der Waals picture of liquids, solids, and phase transformations. Science. 1983;220:787–794. doi: 10.1126/science.220.4599.787.
- 5. Eisenberg D, Mclachlan AD. Solvation energy in protein folding and binding. Nature (London). 1986;319:199–203. doi: 10.1038/319199a0.
- 6. Wagoner JA, Baker NA. Assessing implicit models for nonpolar mean solvation forces: The importance of dispersion and volume terms. Proc Natl Acad Sci USA. 2006;103:8331–8336. doi: 10.1073/pnas.0600118103.
- 7. Sharp K. Incorporating solvent and ion screening into molecular dynamics using the finite-difference Poisson-Boltzmann method. J Comput Chem. 1991;12:454–468.
- 8. Kollman PA, Massova I, Reyes C, Kuhn B, Huo S, Chong L, Lee M, Lee T, Duan Y, Wang W, Donini O, Cieplak P, Srinivasan J, Case DA, Cheatham TE. Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc Chem Res. 2000;33:889–897. doi: 10.1021/ar000033j.
- 9. Holst M, Baker N, Wang F. Adaptive multilevel finite element solution of the Poisson-Boltzmann equation I. Algorithms and examples. J Comput Chem. 2000;21:1319–1342.
- 10. Baker N, Holst M, Wang F. Adaptive multilevel finite element solution of the Poisson-Boltzmann equation II. Refinement at solvent-accessible surfaces in biomolecular systems. J Comput Chem. 2000;21:1343–1352.
- 11. Lu B, Zhang D, McCammon JA. Computation of electrostatic forces between solvated molecules determined by the Poisson-Boltzmann equation using a boundary element method. J Chemical Physics. 2005;122:214102–214109. doi: 10.1063/1.1924448.
- 12. Still WC, Tempczyk A, Hawley RC, Hendrickson T. Semianalytical treatment of solvation for molecular mechanics and dynamics. J Am Chem Soc. 1990;112:6127–6129.
- 13. MacKerel AD Jr, Brooks CL III, Nilsson L, Roux B, Won Y, Karplus M. CHARMM: The Energy Function and Its Parameterization with an Overview of the Program. In: The Encyclopedia of Computational Chemistry, volume 1. John Wiley & Sons; Chichester: 1998. pp. 271–277.
- 14. Case D, Cheatham T III, Darden T, Gohlke H, Luo R, Merz K Jr, Onufriev A, Simmerling C, Wang B, Woods R. The Amber biomolecular simulation programs. J Comput Chem. 2005;26:1668–1688. doi: 10.1002/jcc.20290.
- 15. Ren P, Ponder JW. Polarizable atomic multipole water model for molecular mechanics simulation. J Phys Chem B. 2003;107:5933–5947.
- 16. Tsui V, Case DA. Theory and applications of the generalized Born solvation model in macromolecular simulations. Biopolymers. 2001;56:275–291. doi: 10.1002/1097-0282(2000)56:4<275::AID-BIP10024>3.0.CO;2-E.
- 17. Shih AY, Denisov IG, Phillips JC, Sligar SG, Schulten K. Molecular dynamics simulations of discoidal bilayers assembled from truncated human lipoproteins. Biophys J. 2005;88:548–556. doi: 10.1529/biophysj.104.046896.
- 18. Ritchie DW. Evaluation of protein docking predictions using Hex 3.1 in CAPRI rounds 1 and 2. Proteins: Structure, Function, and Genetics. 2003;52(1):98–106. doi: 10.1002/prot.10379.
- 19. Ju T, Losasso F, Schaefer S, Warren J. Dual contouring of hermite data. Proceedings of ACM SIGGRAPH. 2002:339–346.
- 20. Zhang Y, Xu G, Bajaj C. Quality meshing of implicit solvation models of biomolecular structures. Computer Aided Geometric Design. doi: 10.1016/j.cagd.2006.01.008.
- 21. Garland M, Heckbert P. Simplifying surfaces with color and texture using quadric error metrics. IEEE Visualization. 1998:263–270.
- 22. Zhao W, Xu G, Bajaj C. An algebraic spline model of molecular surfaces. ACM Symp Sol Phys Model. 2007:297–302. doi: 10.1109/TCBB.2011.81.
- 23. Greengard L, Rokhlin V. A fast algorithm for particle simulations. J Comput Phys. 1987;73:325–348.
- 24. Feig M, Onufriev A, Lee MS, Im W, Case DA, Brooks C III. Performance comparison of generalized Born and Poisson methods in the calculation of electrostatic solvation energies for protein structures. J Comput Chem. 2004;25:265–284. doi: 10.1002/jcc.10378.
- 25. Ghosh A, Rapp CS, Friesner RA. Generalized Born model based on a surface integral formulation. J Phys Chem B. 1998;102:10983–10990.
- 26. Potts D, Steidl G. Fast summation at nonequispaced knots by NFFTs. SIAM J Sci Comput. 2003;24:2013–2037.
- 27. Dunavant D. High degree efficient symmetrical Gaussian quadrature rules for the triangle. International Journal of Numerical Methods in Engineering. 1985;21:1129–1148.
- 28. Potts D, Steidl G, Tasche M. Fast Fourier transforms for nonequispaced data: A tutorial. In: Modern Sampling Theory: Mathematics and Applications. Birkhauser; 2001. pp. 247–270.
- 29. Beylkin G. On the fast Fourier transform of functions with singularities. Appl Comput Harmon Anal. 1995;2:363–381.
- 30. Jackson JI. Selection of a convolution function for Fourier inversion using gridding. IEEE Trans Med Imag. 1991;10:473–478. doi: 10.1109/42.97598.
- 31. Edelsbrunner H, Koehl P. The weighted-volume derivative of a space-filling diagram. Proc Natl Acad Sci USA. 2003;100:2203–2208. doi: 10.1073/pnas.0537830100.
- 32. Im W, Lee MS, Brooks C III. Generalized Born model with a simple smoothing function. J Comput Chem. 2003;24:1691–1702. doi: 10.1002/jcc.10321.
- 33. Bryant R, Edelsbrunner H, Koehl P, Levitt M. The area derivative of a space-filling diagram. Discrete Comput Geom. 2004;32:293–308.
- 34. Grant JA, Pickup BT. A Gaussian description of molecular shape. J Phys Chem. 1995;99:3503–3510.
- 35. Grycuk T. Deficiency of the Coulomb-field approximation in the generalized Born model: An improved formula for Born radii evaluation. J Chemical Physics. 2003;119:4817–4827.
- 36. Tjong H, Zhou H. GBr6NL: A generalized Born method for accurately reproducing solvation energy of the nonlinear Poisson-Boltzmann equation. J Chemical Physics. 2007;126:195102–195106. doi: 10.1063/1.2735322.