Abstract
We present a fast method for evaluating expressions of the form
$$u_j = \sum_{\substack{i = 1 \\ i \ne j}}^{n} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n,$$
where αi are real numbers, and xi are points in a compact interval of ℝ. This expression can be viewed as representing the electrostatic potential generated by charges on a line in ℝ³. While fast algorithms for computing the electrostatic potential of general distributions of charges in ℝ³ exist, in a number of situations in computational physics it is useful to have a simple and extremely fast method for evaluating the potential of charges on a line; we present such a method in this paper, and report numerical results for several examples.
2010 Mathematics Subject Classification: 31C20 (primary) and 41A55, 41A50 (secondary)
Keywords: Fast multipole method, Chebyshev system, generalized Gaussian quadrature
1. Introduction and motivation
1.1. Introduction.
In this paper, we describe a simple fast algorithm for evaluating expressions of the form
(1)
$$u_j = \sum_{\substack{i = 1 \\ i \ne j}}^{n} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n,$$
where αi are real numbers, and xi are points in a compact interval of ℝ. This expression can be viewed as representing the electrostatic potential generated by charges on a line in ℝ³. We remark that fast algorithms for computing the electrostatic potential generated by general distributions of charges in ℝ³ exist, see for example the Fast Multipole Method [9], whose relation to the method presented in this paper is discussed in §1.2. However, in a number of situations in computational physics it is useful to have a simple and extremely fast method for evaluating the potential of charges on a line; we present such a method in this paper. Under mild assumptions the presented method involves O(n log n) operations and has a small constant. The method is based on writing the potential 1/r as
$$\frac{1}{r} = \int_0^{\infty} e^{-rt}\, dt.$$
We show that there exists a small set of quadrature nodes t1, … , tm and weights w1, … , wm such that for a large range of values of r we have
(2)
$$\frac{1}{r} \;\approx\; \sum_{j=1}^{m} w_j e^{-r t_j};$$
see Lemma 4.5, which is a quantitative version of (2). Numerically the nodes t1, … , tm and weights w1, … , wm are computed using a procedure for constructing generalized Gaussian quadratures, see §5.2. An advantage of representing 1/r as a sum of exponentials is that the translation operator
(3)
$$\frac{1}{r} \;\longmapsto\; \frac{1}{r + r_0}, \qquad r_0 > 0,$$
can be computed by taking an inner product of the weights (w1 , … , wm) with a diagonal transformation of the vector (e−rt1 , … , e−rtm). Indeed, we have
(4)
$$\frac{1}{r + r_0} \;\approx\; \sum_{j=1}^{m} \left( w_j e^{-r_0 t_j} \right) e^{-r t_j}.$$
The algorithm described in §3 leverages the existence of this diagonal translation operator to efficiently evaluate (1).
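To make the representation above concrete, the following short sketch (Python/NumPy; the function name and all parameter choices are ours, not the authors') builds a crude exponential-sum approximation of 1/r by applying composite Gauss–Legendre panels to the truncated integral ∫₀ᵀ e^{−rt} dt. The generalized Gaussian quadratures of §5.2 achieve the same accuracy with far fewer nodes; this sketch only illustrates (2) and the diagonal translation (4).

```python
# Minimal sketch (not the paper's code): approximate 1/r by a sum of exponentials,
# 1/r = int_0^infty exp(-r t) dt, using composite Gauss-Legendre panels on [0, T].
# The generalized Gaussian quadrature of Section 5.2 needs far fewer nodes.
import numpy as np

def exp_sum_rule(rmin, rmax, eps=1e-12, panels=60, order=16):
    """Nodes t and weights w with 1/r ~ sum_k w[k]*exp(-r*t[k]) for r in [rmin, rmax]."""
    T = np.log(2.0 / eps) / rmin                    # truncate the Laplace integral at T
    edges = np.geomspace(0.1 / rmax, T, panels + 1)
    edges[0] = 0.0                                  # first panel starts at t = 0
    xg, wg = np.polynomial.legendre.leggauss(order)
    t = np.concatenate([(b - a) / 2 * xg + (a + b) / 2 for a, b in zip(edges, edges[1:])])
    w = np.concatenate([(b - a) / 2 * wg for a, b in zip(edges, edges[1:])])
    return t, w

t, w = exp_sum_rule(1.0, 1.0e3)
r = np.geomspace(1.0, 1.0e3, 1000)
print("max error of the exponential sum:",
      np.abs(1.0 / r - np.exp(-np.outer(r, t)) @ w).max())

# Diagonal translation (4): shifting r -> r + r0 only rescales the weights.
r0 = 5.0
w_shifted = w * np.exp(-r0 * t)
print("max error after translation:",
      np.abs(1.0 / (r + r0) - np.exp(-np.outer(r, t)) @ w_shifted).max())
```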
1.2. Relation to past work.
We emphasize that fast algorithms for computing the potential generated by arbitrary distributions of charges in ℝ³ exist. An example of such an algorithm is the Fast Multipole Method that was introduced by [9] and has been extended by several authors including [7, 10, 16]. In this paper, we present a simple scheme for the special case where the charges are on a line, which occurs in a number of numerical calculations, see §1.3. The presented scheme has a much smaller runtime constant compared to general methods, and is based on the diagonal form (4) of the translation operator (3). The idea of using the diagonal form of this translation operator to accelerate numerical computations has been studied by several authors; in particular, the diagonal form is used in algorithms by Dutt, Gu and Rokhlin [6], and Yarvin and Rokhlin [22], and was subsequently studied in detail by Beylkin and Monzón [1, 2].
The current paper improves upon these past works by taking advantage of robust generalized Gaussian quadrature codes [4] that were not previously available; these codes construct a quadrature rule that is exact for functions in the linear span of a given Chebyshev system, and can be viewed as a constructive version of Lemma 4.2 of Kreĭn [13]. The resulting fast algorithm presented in §3 simplifies past approaches and has a small runtime constant; in particular, its computational cost is similar to the computational cost of 5-10 Fast Fourier Transforms on data of a similar length, see §5.
1.3. Motivation.
Expressions of the form (1) appear in a number of situations in computational physics. In particular, such expressions arise in connection with the Hilbert transform
$$(\mathcal{H} f)(x) = \frac{1}{\pi}\, \mathrm{p.v.}\!\int_{-\infty}^{\infty} \frac{f(y)}{x - y}\, dy.$$
For example, the computation of the projection Pmf of a function f onto the first m + 1 functions in a family of orthogonal polynomials can be reduced to an expression of the form (1) by using the Christoffel–Darboux formula, which is related to the Hilbert transform; we detail the reduction of Pmf to an expression of the form (1) in the following.
Let p0, p1, p2, … be a family of monic polynomials that are orthogonal with respect to the weight w(x) ≥ 0 on (a, b). Consider the projection operator
$$P_m f(x) = \sum_{k=0}^{m} \frac{p_k(x)}{h_k} \int_a^b p_k(y)\, f(y)\, w(y)\, dy,$$
where
$$h_k = \int_a^b p_k(y)^2\, w(y)\, dy.$$
Let x1 , … , xn and w1 , … , wn be the n > m/2 point Gaussian quadrature nodes and weights associated with the weight function w, and set
(5) |
By construction, the polynomial that interpolates the values u1 , … , un at the points x1 , … , xn will accurately approximate Pmf on (a, b) when f is sufficiently smooth, see for example §7.4.6 of Dahlquist and Björck [5]. Directly evaluating (5) would require Ω(n²) operations. In contrast, the algorithm of this paper together with the Christoffel–Darboux formula can be used to evaluate (5) in O(n log n) operations. The Christoffel–Darboux formula states that
(6)
$$\sum_{k=0}^{m} \frac{p_k(x)\, p_k(y)}{h_k} \;=\; \frac{1}{h_m}\, \frac{p_{m+1}(x)\, p_m(y) - p_m(x)\, p_{m+1}(y)}{x - y},$$
see §18.2(v) of [17]. Using (6) to rewrite (5) yields
(7) |
where we have used the fact that the diagonal term of the double summation is equal to f(xj)/hm. The summation in (7) can be rearranged into two expressions of the form (1), and thus the method of this paper can be used to compute a representation of Pmf in O(n log n) operations.
Remark 1.1. Analogs of the Christoffel–Darboux formula hold for many other families of functions; for example, an analogous identity holds for Bessel functions of the first kind Jν, see [21]. This formula can be used to write a projection operator related to Bessel functions in a form analogous to (7), and the algorithm of this paper can be similarly applied.
Remark 1.2. A simple modification of the algorithm presented in this paper can be used to evaluate more general expressions of the form
$$v_j = \sum_{i=1}^{n} \frac{\alpha_i}{y_j - x_i}, \qquad j = 1, \dots, m,$$
where x1 , … , xn are source points, and y1 , … , ym are target points. For simplicity, this paper focuses on the case where the source and target points are the same, which is the case in the projection application described above.
2. Main result
2.1. Main result.
Our principal analytical result is the following theorem, which provides precise accuracy and computational-complexity guarantees for the algorithm of this paper; the algorithm itself is detailed in §3.
Theorem 2.1. Let x1 < … < xn ∈ [a, b] and α1 , … , αn ∈ ℝ be given. Set
$$u_j = \sum_{\substack{i = 1 \\ i \ne j}}^{n} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n.$$
Given δ > 0 and ε > 0, the algorithm described in §3 computes values $\tilde{u}_1, \dots, \tilde{u}_n$ such that
(8)
$$\left| u_j - \tilde{u}_j \right| \;\le\; \varepsilon \sum_{i=1}^{n} |\alpha_i|, \qquad j = 1, \dots, n,$$
in O(n log(δ−1) log(ε−1) + Nδ) operations, where
(9)
$$N_\delta = \#\left\{ (i, j) : i \ne j \ \text{and} \ |x_i - x_j| < \delta(b - a) \right\}.$$
The proof of Theorem 2.1 is given in §4. Under typical conditions, the presented algorithm involves O(n log n) operations. The following corollary describes a case of interest, where the points x1, … , xn are Chebyshev nodes for a compact interval [a, b] (we define Chebyshev nodes in §4.2).
Corollary 2.1. Fix ε = 10−15, and let the points x1 , … , xn be Chebyshev nodes on [a, b]. If δ = 1/n, then the algorithm of §3 involves O(n log n) operations.
The proof of Corollary 2.1 is given in §4.5. The following corollary states that a similar result holds for uniformly random points.
Corollary 2.2. Fix ε = 10−15, and suppose that x1 , … , xn are sampled uniformly at random from [a, b]. If δ = 1/n, then the algorithm of §3 involves O(n log n) operations with high probability.
The proof of Corollary 2.2 is immediate from standard probabilistic estimates. The following remark describes an adversarial configuration of points.
Remark 2.1. Fix ε > 0, and let x1 , … , x2n be a collection of points such that x1 , … , xn and xn+1, … , x2n are evenly spaced in [0, 2−n] and [1 − 2−n, 1], respectively, that is,
$$x_j = \frac{j - 1}{n - 1}\, 2^{-n} \quad \text{and} \quad x_{n + j} = 1 - 2^{-n} + \frac{j - 1}{n - 1}\, 2^{-n}, \qquad j = 1, \dots, n.$$
We claim that Theorem 2.1 cannot guarantee a complexity better than O(n²) for this configuration of points. Indeed, if δ ≥ 2−n, then Nδ ≥ n²/2, and if δ < 2−n, then log2(δ−1) > n. In either case, n log2(δ−1) + Nδ ≥ n²/2, so the operation count guaranteed by Theorem 2.1 is at least of order n².
This complexity is indicative of the performance of the algorithm for this point configuration; the reason that the algorithm performs poorly is that structures exist at two different scales. If such a configuration were encountered in practice, it would be possible to modify the algorithm of §3 to also operate at two different scales and achieve evaluation in O(n log n) operations.
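For intuition about how Nδ behaves, the following sketch (Python/NumPy; the counting routine and its name are ours) counts the near pairs of (9) with a sliding window over the sorted points. With δ = 1/n, Chebyshev nodes produce a near-linear count, while a two-cluster configuration in the spirit of Remark 2.1 produces a quadratic one.

```python
# Sketch (our helper, not from the paper): count N_delta of (9) in
# O(n log n + N_delta) operations with a sliding window over sorted points.
import numpy as np

def count_near_pairs(x, delta):
    """Number of ordered pairs (i, j), i != j, with |x_i - x_j| < delta*(b - a)."""
    x = np.sort(x)
    cutoff = delta * (x[-1] - x[0])
    count, lo = 0, 0
    for j in range(len(x)):
        while x[j] - x[lo] >= cutoff:
            lo += 1
        count += j - lo              # points strictly within cutoff to the left of x[j]
    return 2 * count                 # count both orderings (i, j) and (j, i)

n = 4000
cheb = np.cos((2 * np.arange(1, n + 1) - 1) * np.pi / (2 * n))
print("Chebyshev nodes:    N_delta =", count_near_pairs(cheb, 1.0 / n))

m = 30                               # two clusters of m points, as in Remark 2.1
clusters = np.concatenate([np.linspace(0.0, 2.0**-m, m),
                           np.linspace(1.0 - 2.0**-m, 1.0, m)])
print("two-cluster points: N_delta =", count_near_pairs(clusters, 2.0**-m))
```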
3. Algorithm
3.1. High level summary.
The algorithm involves passing over the points x1 , … , xn twice. First, we pass over the points in ascending order and compute
(10)
$$u_j^{+} = \sum_{i=1}^{j-1} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n,$$
and second, we pass over the points in descending order and compute
(11)
$$u_j^{-} = \sum_{i=j+1}^{n} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n.$$
Finally, for j = 1, … , n we set
$$u_j = u_j^{+} + u_j^{-}.$$
We call the computation of $u_1^{+}, \dots, u_n^{+}$ the forward pass of the algorithm, and the computation of $u_1^{-}, \dots, u_n^{-}$ the backward pass of the algorithm. The forward pass of the algorithm computes the potential generated by all points to the left of a given point, while the backward pass of the algorithm computes the potential generated by all points to the right of a given point. In §3.2 and §3.3 we give an informal and a detailed description of the forward pass of the algorithm, respectively. The backward pass of the algorithm is identical except that it considers the points in reverse order.
3.2. Informal description.
In the following, we give an informal description of the forward pass of the algorithm, which computes
$$u_j^{+} = \sum_{i=1}^{j-1} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n.$$
Assume that a small set of nodes t1, … , tm and weights w1, … , wm is given such that
(12)
$$\left| \frac{1}{r} - \sum_{k=1}^{m} w_k e^{-r t_k} \right| \;<\; \varepsilon \qquad \text{for all } r \in [\,\delta(b - a),\; b - a\,],$$
where δ > 0 and ε > 0 are given and fixed. The existence and computation of such nodes and weights are described in §4.4 and §5.2, respectively. We divide the sum defining $u_j^{+}$ into two parts:
(13)
$$u_j^{+} = \sum_{i=1}^{j_0} \frac{\alpha_i}{x_j - x_i} \;+\; \sum_{i=j_0+1}^{j-1} \frac{\alpha_i}{x_j - x_i},$$
where j0 = max {i : xj − xi > δ(b − a)}. By definition, the points x1, … , xj0 are all distance at least δ(b − a) from xj. Therefore, by (12),
$$\sum_{i=1}^{j_0} \frac{\alpha_i}{x_j - x_i} \;\approx\; \sum_{k=1}^{m} w_k\, e^{-(x_j - x_{j_0}) t_k} \sum_{i=1}^{j_0} \alpha_i\, e^{-(x_{j_0} - x_i) t_k}.$$
If we define
(14)
$$g_k(j_0) = \sum_{i=1}^{j_0} \alpha_i\, e^{-(x_{j_0} - x_i) t_k}, \qquad k = 1, \dots, m,$$
then it is straightforward to verify that
(15)
$$u_j^{+} \;\approx\; \sum_{k=1}^{m} w_k\, e^{-(x_j - x_{j_0}) t_k}\, g_k(j_0) \;+\; \sum_{i=j_0+1}^{j-1} \frac{\alpha_i}{x_j - x_i}.$$
Observe that we can update gk(j0) to gk(j0 + 1) using the following formula
(16)
$$g_k(j_0 + 1) = e^{-(x_{j_0+1} - x_{j_0}) t_k}\, g_k(j_0) + \alpha_{j_0 + 1}, \qquad k = 1, \dots, m.$$
We can now summarize the algorithm for computing $u_1^{+}, \dots, u_n^{+}$. For each j, we compute $u_j^{+}$ by the following three steps:
(1) Update g1, … , gm as necessary.
(2) Use g1, … , gm to evaluate the potential from the points xi such that xj − xi > δ(b − a).
(3) Directly evaluate the potential from the points xi such that 0 < xj − xi < δ(b − a).
By (16), each update of g1, … , gm requires O(m) operations, and we must update g1, … , gm at most n times, so we conclude that the total cost of the first step of the algorithm is O(nm) operations. For each j = 1, … , n, the second and third steps of the algorithm involve O(m) and O(#{i : 0 < xj − xi < δ(b − a)}) operations, respectively, see (15). It follows that the total cost of the second and third steps of the algorithm is O(nm + Nδ) operations, where Nδ is defined in (9). We conclude that $u_1^{+}, \dots, u_n^{+}$ can be computed in O(nm + Nδ) operations. In §4, we complete the proof of the computational complexity guarantees of Theorem 2.1 by showing that there exist m = O(log(δ−1) log(ε−1)) nodes t1, … , tm and weights w1, … , wm that satisfy (12), where ε > 0 is the approximation error in (12).
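The informal description above translates almost line for line into code. The sketch below (Python/NumPy; this is our illustration, not the authors' Fortran implementation) maintains the quantities gk of (14), applies the diagonal update (16), and evaluates (15); the quadrature nodes and weights are assumed to satisfy (12) on [δ(b − a), b − a].

```python
# Forward pass of Section 3.2 (a sketch with our naming, not the paper's code):
# maintain g_k of (14), update it with the diagonal recurrence (16), and
# evaluate each u_j^+ through the far-field sum (15) plus a direct near field.
import numpy as np

def forward_pass(x, alpha, t, w, delta):
    """u_plus[j] ~ sum_{i<j} alpha[i]/(x[j]-x[i]); x must be sorted increasingly."""
    n, m = len(x), len(t)
    cutoff = delta * (x[-1] - x[0])
    u_plus = np.zeros(n)
    g = np.zeros(m)                  # g_k(j0) of (14), centered at x[p-1]
    p = 0                            # p plays the role of j0
    for j in range(n):
        # absorb every point that lies farther than cutoff to the left of x[j]
        while p < j and x[j] - x[p] > cutoff:
            if p == 0:
                g = alpha[0] * np.ones(m)                          # g_k(1) = alpha_1
            else:
                g = np.exp(-(x[p] - x[p - 1]) * t) * g + alpha[p]  # update (16)
            p += 1
        if p > 0:                    # far field through the exponential sum, see (15)
            u_plus[j] = np.dot(w, np.exp(-(x[j] - x[p - 1]) * t) * g)
        for i in range(p, j):        # near field, evaluated directly
            u_plus[j] += alpha[i] / (x[j] - x[i])
    return u_plus
```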
3.3. Detailed description.
In the following, we give a detailed description of the forward pass of the algorithm, which computes $u_1^{+}, \dots, u_n^{+}$. Suppose that δ > 0 and ε > 0 are given and fixed. We describe the algorithm under the assumption that we are given quadrature nodes t1, … , tm and weights w1, … , wm such that
(17)
$$\left| \frac{1}{r} - \sum_{k=1}^{m} w_k e^{-r t_k} \right| \;<\; \varepsilon \qquad \text{for all } r \in [\,\delta(b - a),\; b - a\,].$$
The existence of such weights and nodes is established in §4.4, and the computation of such nodes and weights is discussed in §5.2. To simplify the description of the algorithm, we assume that x0 = −∞ is a placeholder node that does not generate a potential.
Algorithm 3.1 (forward pass).
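The pseudocode box of Algorithm 3.1 is not reproduced here; the following hedged sketch assembles the full two-pass evaluation of §3.1 from the forward_pass routine sketched after §3.2 and the exp_sum_rule helper from §1.1 (both our constructions): the backward pass is obtained by running the forward pass on the reflected points, and the result is compared against direct O(n²) evaluation.

```python
# Two-pass evaluation of Section 3.1 (sketch; reuses forward_pass and
# exp_sum_rule defined in the earlier sketches).
import numpy as np

def evaluate_potential(x, alpha, t, w, delta):
    """u[j] ~ sum_{i != j} alpha[i]/(x[j]-x[i]) for sorted x."""
    u_fwd = forward_pass(x, alpha, t, w, delta)
    # backward pass: reflecting the axis turns "points to the right" into "points to the left"
    u_bwd = -forward_pass(-x[::-1], alpha[::-1], t, w, delta)[::-1]
    return u_fwd + u_bwd

rng = np.random.default_rng(0)
n = 2000
x = np.sort(rng.uniform(1.0, 10.0, n))
alpha = rng.uniform(0.0, 1.0, n)
delta = 1.0 / n
t, w = exp_sum_rule(delta * (x[-1] - x[0]), x[-1] - x[0])  # valid on [delta*(b-a), b-a]
u = evaluate_potential(x, alpha, t, w, delta)

u_direct = np.empty(n)               # direct O(n^2) evaluation for comparison
for j in range(n):
    d = x[j] - x
    d[j] = np.inf                    # exclude the i = j term
    u_direct[j] = np.sum(alpha / d)
print("max absolute difference from direct evaluation:", np.max(np.abs(u - u_direct)))
```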
Remark 3.1. In some applications, it may be necessary to evaluate an expression of the form (1) for many different weights α1, … , αn associated with a fixed set of points x1, … , xn. For example, in the projection application described in §1.3 the weights α1, … , αn correspond to the function being projected, while the points x1, … , xn are a fixed set of quadrature nodes. In such situations, pre-computing the exponentials $e^{-(x_j - x_{j_0}) t_i}$ used in Algorithm 3.1 will significantly improve the runtime, see §5.1.
4. Proof of Main Result
4.1. Organization.
In this section we complete the proof of Theorem 2.1; the section is organized as follows. In §4.2 we give mathematical preliminaries. In § 4.3 we state and prove two technical lemmas. In §4.4 we prove Lemma 4.5, which together with the analysis in §3 establishes Theorem 2.1. In §4.5 we prove Corollary 2.1, and Corollary 2.2.
4.2. Preliminaries.
Let a < b ∈ ℝ and a positive integer n be fixed, and suppose that a function f : [a, b] → ℝ and points x1 < … < xn ∈ [a, b] are given. The interpolating polynomial P of the function f at x1, … , xn is the unique polynomial of degree at most n − 1 such that
$$P(x_j) = f(x_j), \qquad j = 1, \dots, n.$$
This interpolating polynomial P can be explicitly defined by
(18)
$$P(x) = \sum_{j=1}^{n} f(x_j)\, q_j(x),$$
where qj is the nodal polynomial for xj, that is,
(19)
$$q_j(x) = \prod_{i \ne j} \frac{x - x_i}{x_j - x_i}.$$
We say x1, … , xn are Chebyshev nodes for the interval [a, b] if
(20)
$$x_j = \frac{a + b}{2} + \frac{b - a}{2} \cos\!\left( \frac{(2j - 1)\pi}{2n} \right), \qquad j = 1, \dots, n.$$
The following lemma is a classical result in approximation theory. It says that a smooth function on a compact interval is accurately approximated by the interpolating polynomial of the function at Chebyshev nodes, see for example §4.5.2 of Dahlquist and Björck [5].
Lemma 4.1. Let f ∈ Cn([a, b]), and let x1, … , xn be Chebyshev nodes for [a, b]. If P is the interpolating polynomial for f at x1, … , xn, then
$$\max_{x \in [a, b]} \left| f(x) - P(x) \right| \;\le\; \frac{(b - a)^{n}}{2^{2n - 1}\, n!}\, M_n,$$
where
$$M_n = \max_{\xi \in [a, b]} \left| f^{(n)}(\xi) \right|.$$
In addition to Lemma 4.1, we require a result about the existence of generalized Gaussian quadratures for Chebyshev systems. In 1866, Gauss [8] established the existence of quadrature nodes x1, … , xn and weights w1, … , wn for an interval [a, b] such that
$$\int_a^b f(x)\, dx = \sum_{j=1}^{n} w_j f(x_j)$$
whenever f(x) is a polynomial of degree at most 2n − 1. This result was generalized from polynomials to Chebyshev systems by Kreĭn [13]. A collection of functions f0, … , fn on [a, b] is a Chebyshev system if every nonzero generalized polynomial
$$f(x) = \sum_{k=0}^{n} a_k f_k(x)$$
has at most n distinct zeros in [a, b]. The following result of Kreĭn says that any function in the span of a Chebyshev system of 2n functions can be integrated exactly by a quadrature with n nodes and n weights.
has at most n distinct zeros in [a, b]. The following result of Kreĭn says that any function in the span of a Chebyshev system of 2n functions can be integrated exactly by a quadrature with n nodes and n weights.
Lemma 4.2 (Kreĭn [13]). Let f0, … , f2n−1 be a Chebyshev system of continuous functions on [a, b], and let w : (a, b) → ℝ be a continuous positive weight function. Then, there exist unique nodes x1, … , xn and weights w1, …, wn such that
$$\int_a^b f(x)\, w(x)\, dx = \sum_{j=1}^{n} w_j f(x_j)$$
whenever f is in the span of f0, … , f2n−1.
4.3. Technical Lemmas.
In this section, we state and prove two technical lemmas that are involved in the proof of Theorem 2.1. We remark that a similar version of Lemma 4.3 appears in [18].
Lemma 4.3. Fix a > 0 and t ∈ [0, ∞), and let r1, … , rn be Chebyshev nodes for [a, 2a]. If Pt(r) is the interpolating polynomial for e−rt at r1, … , rn, then
$$\max_{r \in [a, 2a]} \left| e^{-rt} - P_t(r) \right| \;\le\; 4^{-n}.$$
Proof. We have
$$\max_{r \in [a, 2a]} \left| \frac{\partial^n}{\partial r^n} e^{-rt} \right| = t^n e^{-at}.$$
By writing the derivative of tne−ta as
$$\frac{d}{dt}\left( t^n e^{-ta} \right) = t^{n-1} e^{-ta}\left( n - at \right),$$
we can deduce that the maximum of tne−ta occurs at t = n/a, that is,
(21)
$$\max_{t \in [0, \infty)} t^n e^{-ta} = \left( \frac{n}{a} \right)^{n} e^{-n}.$$
By (21) and the result of Lemma 4.1, we conclude that
$$\max_{r \in [a, 2a]} \left| e^{-rt} - P_t(r) \right| \;\le\; \frac{a^n}{2^{2n-1}\, n!} \left( \frac{n}{a} \right)^{n} e^{-n} \;=\; \frac{2\, n^n e^{-n}}{4^{n}\, n!}.$$
It remains to show that 2n^n e^{−n} ≤ n!. Since ln(x) is an increasing function, we have
$$\ln(n!) = \sum_{k=2}^{n} \ln k \;\ge\; \int_1^{n} \ln x \, dx = n \ln n - n + 1.$$
Exponentiating both sides of this inequality gives e n^n e^{−n} ≤ n!, which is a classical inequality related to Stirling's approximation; since e > 2, it follows that 2n^n e^{−n} ≤ n!. This completes the proof. □
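A few lines of Python (our helper names, arbitrary test grids) confirm the 4^{−n} behavior numerically: we interpolate r ↦ e^{−rt} at n Chebyshev nodes on [a, 2a] and compare the worst observed error over a grid in r and t with the bound of Lemma 4.3.

```python
# Quick numerical check of Lemma 4.3 (sketch): Chebyshev interpolation of
# r -> exp(-r*t) on [a, 2a], compared against the bound 4^(-n).
import numpy as np

def cheb_interp_error(a, n, r_grid, t_grid):
    rk = 1.5 * a + 0.5 * a * np.cos((2 * np.arange(1, n + 1) - 1) * np.pi / (2 * n))
    worst = 0.0
    for t in t_grid:
        vals = np.exp(-rk * t)
        for r in r_grid:
            q = np.array([np.prod((r - np.delete(rk, k)) / (rk[k] - np.delete(rk, k)))
                          for k in range(n)])       # Lagrange cardinal polynomials
            worst = max(worst, abs(np.exp(-r * t) - q @ vals))
    return worst

a = 1.0
r_grid = np.linspace(a, 2 * a, 60)
t_grid = np.linspace(0.0, 20.0, 60)
for n in (4, 8, 12, 16):
    print(f"n = {n:2d}:  observed error = {cheb_interp_error(a, n, r_grid, t_grid):.2e}"
          f"   4^-n = {4.0**-n:.2e}")
```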
Lemma 4.4. Suppose that ε > 0 and M > 1 are given. Then, there exist
$$m \;\le\; \left( \lfloor \log_2 M \rfloor + 1 \right)\left( \lfloor \log_4 \varepsilon^{-1} \rfloor + 1 \right)$$
values r1, …, rm ∈ [1, M] such that for all r ∈ [1, M] we have
(22)
$$\left| e^{-rt} - \sum_{j=1}^{m} c_j(r)\, e^{-r_j t} \right| \;\le\; \varepsilon \qquad \text{for all } t \in [0, \infty),$$
for some choice of coefficients c1(r), … , cm(r) that depend on r.
Proof. We construct an explicit set of m := (⌊log2 M⌋ + 1)(⌊log4 ε−1⌋ + 1) points and coefficients such that (22) holds. Set n := ⌊log4 ε−1⌋ + 1. We define the points r1, … , rm by
(23)
$$r_{in + k} = 3 \cdot 2^{\,i-1} + 2^{\,i-1} \cos\!\left( \frac{(2k - 1)\pi}{2n} \right),$$
for k = 1, … , n and i = 0, …, ⌊log2 M⌋, so that r_{in+1}, … , r_{(i+1)n} are Chebyshev nodes for the interval [2^i, 2^{i+1}], and we define the coefficients c1(r), … , cm(r) by
(24)
$$c_{in + k}(r) = \begin{cases} q_{in + k}(r) & \text{if } r \in [2^{i}, 2^{i+1}), \\ 0 & \text{otherwise}, \end{cases}$$
for k = 1, …, n and i = 0, … , ⌊log2 M⌋, where q_{in+k} is the nodal polynomial, see (19), of r_{in+k} with respect to the nodes r_{in+1}, … , r_{(i+1)n}. We claim that
$$\left| e^{-rt} - \sum_{j=1}^{m} c_j(r)\, e^{-r_j t} \right| \;\le\; 4^{-n} \qquad \text{for all } r \in [1, M] \text{ and } t \in [0, \infty).$$
Indeed, fix r ∈ [1, M], and let i0 ∈ {0, … , ⌊log2 M⌋} be the unique integer such that r ∈ [2^{i0}, 2^{i0+1}). By the definition of the coefficients, see (24), we have
$$\sum_{j=1}^{m} c_j(r)\, e^{-r_j t} = \sum_{k=1}^{n} q_{i_0 n + k}(r)\, e^{-r_{i_0 n + k}\, t}.$$
We claim that the right-hand side of this equation is the interpolating polynomial P_{t,i0}(r) for e−rt at r_{i0 n + 1}, … , r_{(i0+1)n}, that is,
$$\sum_{k=1}^{n} q_{i_0 n + k}(r)\, e^{-r_{i_0 n + k}\, t} = P_{t, i_0}(r);$$
indeed, see (18) and (19). Since the points r_{i0 n + 1}, … , r_{(i0+1)n} are Chebyshev nodes for the interval [2^{i0}, 2^{i0+1}], and since i0 was chosen such that r ∈ [2^{i0}, 2^{i0+1}), it follows from Lemma 4.3 that
$$\left| e^{-rt} - P_{t, i_0}(r) \right| \;\le\; 4^{-n}.$$
Since n = ⌊log4 ε−1⌋ + 1, we have 4^{−n} ≤ ε, and the proof is complete. □
Remark 4.1. The proof of Lemma 4.4 has the additional consequence that the coefficients c1(r), … , cm(r) in (22) can be chosen so that they are bounded in absolute value by a small constant, uniformly in r. Indeed, in (24) the coefficients cj(r) are either equal to zero or equal to a nodal polynomial, see (19), for Chebyshev nodes on an interval that contains r, and the nodal polynomials for Chebyshev nodes on an interval [a, b] are bounded by a small constant on [a, b], see for example [18]. The fact that e−rt can be approximated as a linear combination of e−r1t, … , e−rmt with small coefficients means that the approximation of Lemma 4.4 can be used in finite precision environments without any unexpected catastrophic cancellation.
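The construction in the proof of Lemma 4.4 is easy to reproduce numerically. The sketch below (our variable names; ε and M are chosen arbitrarily) builds the dyadic Chebyshev grids of (23), evaluates the piecewise interpolation of (24), and reports the worst error found over random r ∈ [1, M], which by Lemma 4.3 should not exceed 4^{−n} ≤ ε.

```python
# Sketch of the dyadic construction of Lemma 4.4 (our naming): interpolate
# r -> exp(-r*t) at n Chebyshev nodes on each dyadic interval [2^i, 2^(i+1)].
import numpy as np

def dyadic_chebyshev_nodes(M, eps):
    n = int(np.floor(np.log(1.0 / eps) / np.log(4.0))) + 1
    nodes = []
    for i in range(int(np.floor(np.log2(M))) + 1):
        a, b = 2.0**i, 2.0**(i + 1)
        theta = (2 * np.arange(1, n + 1) - 1) * np.pi / (2 * n)
        nodes.append((a + b) / 2 + (b - a) / 2 * np.cos(theta))
    return n, nodes

def interpolate_exp(r, t, n, nodes):
    """Approximate exp(-r*t) by interpolation in r on the dyadic interval containing r."""
    i = min(int(np.floor(np.log2(r))), len(nodes) - 1)
    rk = nodes[i]
    q = np.array([np.prod((r - np.delete(rk, k)) / (rk[k] - np.delete(rk, k)))
                  for k in range(n)])         # nodal polynomials q_k(r), see (19)
    return np.exp(-np.outer(t, rk)) @ q       # sum_k q_k(r) * exp(-r_k * t)

M, eps = 1024.0, 1e-10
n, nodes = dyadic_chebyshev_nodes(M, eps)
t = np.linspace(0.0, 50.0, 200)
worst = max(np.max(np.abs(np.exp(-r * t) - interpolate_exp(r, t, n, nodes)))
            for r in np.random.default_rng(1).uniform(1.0, M, 200))
print(f"{n} nodes per dyadic interval, observed error {worst:.2e} (target {eps:.0e})")
```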
4.4. Completing the proof of Theorem 2.1.
Previously in §3.2, we proved that the algorithm of §3 involves O(nm + Nδ) operations, where m is the number of quadrature nodes in (17). To complete the proof of Theorem 2.1 it remains to show that there exist m = O(log(δ−1) log(ε−1)) points t1, … , tm and weights w1, … , wm that satisfy (17); we show the existence of such nodes and weights in the following lemma, and thus complete the proof of Theorem 2.1. The computation of such nodes and weights is described in §5.2.
Lemma 4.5. Fix a < b ∈ ℝ, and let δ > 0 and ε > 0 be given. Then, there exist nodes t1, … , tm and weights w1, … , wm, with m = O(log(δ−1) log(ε−1)), such that
(25)
$$\left| \frac{1}{r} - \sum_{j=1}^{m} w_j e^{-r t_j} \right| \;<\; \varepsilon \qquad \text{for all } r \in [\,\delta(b - a),\; b - a\,].$$
Proof. Fix a < b ∈ ℝ, and let δ, ε > 0 be given. By the possibility of rescaling r, wj, and tj, we may assume that b − a = δ−1, so that we want to establish (25) for r ∈ [1, δ−1]. By Lemma 4.4, applied with accuracy ε/(2 log(2ε−1)) and M = δ−1, we can choose points r0, … , r2m−1 ∈ [1, δ−1], and coefficients c0(r), … , c2m−1(r) depending on r, such that
(26)
$$\left| e^{-rt} - \sum_{j=0}^{2m-1} c_j(r)\, e^{-r_j t} \right| \;\le\; \frac{\varepsilon}{2 \log(2\varepsilon^{-1})} \qquad \text{for all } t \in [0, \infty).$$
The collection of functions e−r0t, … , e−r2m−1t forms a Chebyshev system of continuous functions on the interval [0, log(2ε−1)], see for example [12]. Thus, by Lemma 4.2 there exist m quadrature nodes t1, … , tm and weights w1, … , wm such that
$$\int_0^{\log(2\varepsilon^{-1})} f(t)\, dt = \sum_{j=1}^{m} w_j f(t_j)$$
whenever f(t) is in the span of e−r0t, … , e−r2m−1t. By the triangle inequality
(27)
$$\left| \frac{1}{r} - \sum_{j=1}^{m} w_j e^{-r t_j} \right| \;\le\; \left| \frac{1}{r} - \int_0^{\log(2\varepsilon^{-1})} e^{-rt}\, dt \right| + \left| \int_0^{\log(2\varepsilon^{-1})} e^{-rt}\, dt - \sum_{j=1}^{m} w_j e^{-r t_j} \right|.$$
Recall that we have assumed r ∈ [1, δ−1]; in particular, r ≥ 1, so it follows that
(28)
$$\left| \frac{1}{r} - \int_0^{\log(2\varepsilon^{-1})} e^{-rt}\, dt \right| = \int_{\log(2\varepsilon^{-1})}^{\infty} e^{-rt}\, dt = \frac{e^{-r \log(2\varepsilon^{-1})}}{r} \;\le\; \frac{\varepsilon}{2}.$$
By (26), the function e−rt can be approximated to error ε/(2log(2ε−1)) in the L∞-norm on [0, log(2ε−1)] by functions in the span of e−r0t, … , e−r2m−1t. Since our quadrature is exact for these functions, we conclude that
(29)
$$\left| \int_0^{\log(2\varepsilon^{-1})} e^{-rt}\, dt - \sum_{j=1}^{m} w_j e^{-r t_j} \right| \;\le\; \frac{\varepsilon}{2}.$$
Combining (27), (28), and (29) establishes (25), which completes the proof. □
4.5. Proof of Corollary 2.1.
In this section, we prove Corollary 2.1, which states that the algorithm of §3 involves O(n log n) operations when x1, … , xn are Chebyshev nodes, ε = 10−15, and δ = 1/n.
Proof of Corollary 2.1. By rescaling the problem we may assume that [a, b] = [−1, 1], so that the Chebyshev nodes x1, … , xn are given by
$$x_j = \cos\!\left( \frac{(2j - 1)\pi}{2n} \right), \qquad j = 1, \dots, n.$$
By the result of Theorem 2.1, it suffices to show that Nδ = O(n log n), where
$$N_\delta = \#\left\{ (i, j) : i \ne j \ \text{and} \ |x_i - x_j| < \tfrac{2}{n} \right\}.$$
It is straightforward to verify that the number of Chebyshev nodes within an interval of radius 1/n around a point −1 < x < 1 is O(1 + (1 − x²)−1/2), that is,
$$\#\left\{ j : |x_j - x| < \tfrac{1}{n} \right\} \;\le\; C \left( 1 + \frac{1}{\sqrt{1 - x^2}} \right)$$
for an absolute constant C > 0. This estimate, together with the fact that the first and last Chebyshev nodes are at distance at least 1/n² from 1 and −1, respectively, gives the estimate
(30)
$$N_\delta \;\le\; C \sum_{j=1}^{n} \left( 1 + \frac{1}{\sqrt{1 - x_j^{2}}} \right).$$
Let π/2 > η > 0 be a fixed parameter; writing xj = cos θj with θj = (2j − 1)π/(2n), direct calculation yields
$$\sum_{j=1}^{n} \frac{1}{\sqrt{1 - x_j^{2}}} \;\le\; \frac{n}{\sin \eta} \;+\; 2 \sum_{j \,:\, \theta_j < \eta} \frac{1}{\sin \theta_j} \;=\; O(n \log n).$$
Combining this estimate with (30) yields Nδ = O(n log n), as was to be shown. □
5. Numerical results and implementation details
5.1. Numerical results.
We report numerical results for two different point distributions: uniformly random points in [1, 10], and Chebyshev nodes in [−1, 1]. In both cases, we choose the weights α1, … , αn uniformly at random from [0, 1], and test the algorithm for
$$n = 1000 \cdot 2^{k}, \qquad k = 0, 1, \dots, 10.$$
We time two different versions of the algorithm: a standard implementation, and an implementation that uses precomputed exponentials. Precomputing exponentials may be advantageous in situations where the expression
(31)
$$u_j = \sum_{\substack{i = 1 \\ i \ne j}}^{n} \frac{\alpha_i}{x_j - x_i}, \qquad j = 1, \dots, n,$$
must be evaluated for many different weights α1, … , αn associated with a fixed set of points x1, …, xn, see Remark 3.1. We find that using precomputed exponentials makes the algorithm approximately ten times faster, see Tables 1, 2, and 3. In addition to reporting timings, we report the absolute relative difference between the output of the algorithm of §3 and the output of direct evaluation; we define the absolute relative difference ϵr between the output of the algorithm of §3 and the output of direct calculation by
(32)
$$\epsilon_r = \max_{1 \le j \le n}\; \frac{\left| \tilde{u}_j - u_j \right|}{\sum_{i \ne j} \left| \alpha_i / (x_j - x_i) \right|},$$
where $\tilde{u}_j$ denotes the output of the algorithm of §3 and uj the directly evaluated sum.
Table 1.
Label | Definition |
---|---|
n | number of points |
tw | time of algorithm of §3 without precomputation in seconds |
tp | time of precomputing exponentials for algorithm of §3 in seconds |
tu | time of algorithm of §3 using precomputed exponentials in seconds |
td | time of direct evaluation in seconds |
ϵr | maximum absolute relative difference defined in (32) |
tf | time of FFT using precomputed exponentials (for time comparison only) |
Table 2.
n | tw | tp | tu | td | ϵr |
---|---|---|---|---|---|
1000 | 0.74 E −03 | 0.18 E −02 | 0.93 E −04 | 0.66 E −03 | 0.19 E −14 |
2000 | 0.19 E −02 | 0.31 E −02 | 0.19 E −03 | 0.25 E −02 | 0.30 E −14 |
4000 | 0.42 E −02 | 0.61 E −02 | 0.43 E −03 | 0.10 E −01 | 0.52 E −14 |
8000 | 0.85 E −02 | 0.10 E −01 | 0.89 E −03 | 0.37 E −01 | 0.72 E −14 |
16000 | 0.18 E −01 | 0.25 E −01 | 0.18 E −02 | 0.14 E +00 | 0.92 E −14 |
32000 | 0.38 E −01 | 0.49 E −01 | 0.37 E −02 | 0.59 E +00 | 0.19 E −13 |
64000 | 0.84 E −01 | 0.98 E −01 | 0.78 E −02 | 0.23 E +01 | 0.21 E −13 |
128000 | 0.16E +00 | 0.19 E +00 | 0.18 E −01 | 0.95 E +01 | 0.35 E −13 |
256000 | 0.37 E +00 | 0.53 E +00 | 0.34 E −01 | 0.40 E +02 | 0.59 E −13 |
512000 | 0.75 E +00 | 0.10 E +01 | 0.71 E −01 | 0.19 E +03 | 0.88 E −13 |
1024000 | 0.17 E +01 | 0.23 E +01 | 0.15 E +00 | 0.81 E +03 | 0.14 E −12 |
Table 3.
n | tw | tp | tu | td | ϵr |
---|---|---|---|---|---|
1000 | 0.54 E −03 | 0.12 E −02 | 0.74 E −04 | 0.60 E −03 | 0.11 E −14 |
2000 | 0.15 E −02 | 0.26 E −02 | 0.15 E −03 | 0.24 E −02 | 0.14 E −14 |
4000 | 0.38 E −02 | 0.51 E −02 | 0.37 E −03 | 0.99 E −02 | 0.39 E −14 |
8000 | 0.83 E −02 | 0.10 E −01 | 0.85 E −03 | 0.38 E −01 | 0.35 E −14 |
16000 | 0.19 E −01 | 0.23 E −01 | 0.17 E −02 | 0.14 E +00 | 0.58 E −14 |
32000 | 0.41 E −01 | 0.48 E −01 | 0.37 E −02 | 0.62 E +00 | 0.89 E −14 |
64000 | 0.98 E −01 | 0.90 E −01 | 0.82 E −02 | 0.24 E +01 | 0.12 E −13 |
128000 | 0.22 E +00 | 0.19 E +00 | 0.23 E −01 | 0.10 E +02 | 0.19 E −13 |
256000 | 0.44 E +00 | 0.47 E +00 | 0.32 E −01 | 0.40 E +02 | 0.26 E −13 |
512000 | 0.84 E +00 | 0.94 E +00 | 0.73 E −01 | 0.19 E +03 | 0.52 E −13 |
1024000 | 0.19 E +01 | 0.19 E +01 | 0.14 E +00 | 0.84 E +03 | 0.64 E −13 |
Dividing by the denominator in (32) accounts for the fact that the calculations are performed in finite precision; any remaining loss of accuracy in the numerical results is a consequence of the large number of addition and multiplication operations that are performed. All calculations are performed in double precision, and the algorithm of §3 is run with ε = 10−15. The parameter δ > 0 is set via an empirically determined heuristic. The numerical experiments were performed on a laptop with an Intel Core i5-8350U CPU and 7.7 GiB of memory; the code was written in Fortran and compiled with gfortran with standard optimization flags. The results are reported in Tables 1, 2, and 3.
To put the run time of the algorithm in context, we additionally perform a time comparison to the Fast Fourier Transform (FFT), which also has O(n log n) complexity. Specifically, we compare the run time of the algorithm of §3 on random data using precomputed exponentials with the run time of an FFT implementation from FFTPACK [20] on random data of the same length using precomputed exponentials. We report these timings in Table 4; we find that the FFT is roughly 5-10 times faster than our implementation of the algorithm of §3. We remark that no significant effort was made to optimize our implementation, and that it may be possible to improve the run time by vectorization.
Table 4.
n | tu | tf |
---|---|---|
1000 | 0.91 E − 04 | 0.16 E − 04 |
2000 | 0.28 E − 03 | 0.37 E − 04 |
4000 | 0.41 E − 03 | 0.44 E − 04 |
8000 | 0.93 E − 03 | 0.85 E − 04 |
16000 | 0.18 E − 02 | 0.24 E − 03 |
32000 | 0.38 E − 02 | 0.41 E − 03 |
64000 | 0.81 E − 02 | 0.88 E − 03 |
128000 | 0.18 E − 01 | 0.19 E − 02 |
256000 | 0.38 E − 01 | 0.59 E − 02 |
512000 | 0.71 E − 01 | 0.12 E − 01 |
1024000 | 0.14 E + 00 | 0.25 E − 01 |
5.2. Computing nodes and weights.
The algorithm of §3 is described under the assumption that nodes t1, … , tm and weights w1, … , wm are given such that
(33)
$$\left| \frac{1}{r} - \sum_{j=1}^{m} w_j e^{-r t_j} \right| \;<\; \varepsilon \qquad \text{for all } r \in [\,\delta(b - a),\; b - a\,],$$
where ε > 0 and δ > 0 are fixed parameters. As in the proof of Lemma 4.5 we note that by rescaling r it suffices to find nodes and weights satisfying
(34)
$$\left| \frac{1}{r} - \sum_{j=1}^{m} w_j e^{-r t_j} \right| \;<\; \varepsilon \qquad \text{for all } r \in [1, M].$$
Indeed, if the nodes t1, … , tm and weights w1, … , wm satisfy (34) with M = δ−1, then rescaling the nodes and weights as in the proof of Lemma 4.5 yields nodes and weights satisfying (33). Thus, in order to implement the algorithm of §3 it suffices to tabulate nodes and weights that are valid for r ∈ [1, M] for various values of M. In the implementation used in the numerical experiments in this paper, we tabulated nodes and weights valid for r ∈ [1, M] for several values of M.
For example, in Tables 5 and 6 we list m = 33 nodes t1, … , t33 and weights w1, … , w33 for which the approximation (34) holds for all r ∈ [1, 1024].
Table 5.
0.2273983006898589D−03, 0.1206524521003404D−02, 0.3003171636661616D−02, |
0.5681878572654425D−02, 0.9344657316017281D−02, 0.1414265501822061D−01, |
0.2029260691940998D−01, 0.2809891134697047D−01, 0.3798133147119762D−01, |
0.5050795277167632D−01, 0.6643372693847560D−01, 0.8674681067847460D−01, |
0.1127269233505314D+00, 0.1460210820252656D+00, 0.1887424688689547D+00, |
0.2435986924712581D+00, 0.3140569015209982D+00, 0.4045552087678740D+00, |
0.5207726670656921D+00, 0.6699737362118449D+00, 0.8614482005965975D+00, |
0.1107074709906516D+01, 0.1422047253849542D+01, 0.1825822499573290D+01, |
0.2343379511131976D+01, 0.3006948272874077D+01, 0.3858496861353812D+01, |
0.4953559345813267D+01, 0.6367677940017810D+01, 0.8208553424367139D+01, |
0.1064261195532074D+02, 0.1396688222191633D+02, 0.1889449184151398D+02 |
Table 6.
0.5845245927410881D−03, 0.1379782337905140D−02, 0.2224121503815854D−02, |
0.3150105276431181D−02, 0.4200370923383030D−02, 0.5431379037435571D−02, |
0.6918794756934398D−02, 0.8763225538492927D−02, 0.1109565843047196D−01, |
0.1408264766413004D−01, 0.1793263393523491D−01, 0.2290557147478609D−01, |
0.2932752351846237D−01, 0.3761087060298772D−01,0.4828044150885936D−01, |
0.6200636888239893D−01, 0.7964527252809662D−01, 0.1022921587521237D+00, |
0.1313462348178323D+00, 0.1685948994092301D+00, 0.2163218289369589D+00, |
0.2774479391081561D+00, 0.3557192797195578D+00, 0.4559662159666857D+00, |
0.5844792718191478D+00, 0.7495918095861060D+00, 0.9626599456939077D+00, |
0.1239869481076760D+01, 0.1605927580173348D+01, 0.2102583514906888D+01, |
0.2811829220697454D+01, 0.3937959064316012D+01, 0.6294697335695096D+01 |
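Assuming that Table 5 lists the nodes t1, … , t33 and Table 6 the corresponding weights w1, … , w33 (the text above introduces them in that order), the following few lines of Python evaluate the exponential sum against 1/r over r ∈ [1, 1024]; the Fortran-style D exponents of the tables become e exponents in Python.

```python
# Check of the tabulated 33-node rule (assuming Table 5 = nodes, Table 6 = weights).
import numpy as np

t = np.array([  # nodes t_1, ..., t_33 from Table 5
    0.2273983006898589e-03, 0.1206524521003404e-02, 0.3003171636661616e-02,
    0.5681878572654425e-02, 0.9344657316017281e-02, 0.1414265501822061e-01,
    0.2029260691940998e-01, 0.2809891134697047e-01, 0.3798133147119762e-01,
    0.5050795277167632e-01, 0.6643372693847560e-01, 0.8674681067847460e-01,
    0.1127269233505314e+00, 0.1460210820252656e+00, 0.1887424688689547e+00,
    0.2435986924712581e+00, 0.3140569015209982e+00, 0.4045552087678740e+00,
    0.5207726670656921e+00, 0.6699737362118449e+00, 0.8614482005965975e+00,
    0.1107074709906516e+01, 0.1422047253849542e+01, 0.1825822499573290e+01,
    0.2343379511131976e+01, 0.3006948272874077e+01, 0.3858496861353812e+01,
    0.4953559345813267e+01, 0.6367677940017810e+01, 0.8208553424367139e+01,
    0.1064261195532074e+02, 0.1396688222191633e+02, 0.1889449184151398e+02])
w = np.array([  # weights w_1, ..., w_33 from Table 6
    0.5845245927410881e-03, 0.1379782337905140e-02, 0.2224121503815854e-02,
    0.3150105276431181e-02, 0.4200370923383030e-02, 0.5431379037435571e-02,
    0.6918794756934398e-02, 0.8763225538492927e-02, 0.1109565843047196e-01,
    0.1408264766413004e-01, 0.1793263393523491e-01, 0.2290557147478609e-01,
    0.2932752351846237e-01, 0.3761087060298772e-01, 0.4828044150885936e-01,
    0.6200636888239893e-01, 0.7964527252809662e-01, 0.1022921587521237e+00,
    0.1313462348178323e+00, 0.1685948994092301e+00, 0.2163218289369589e+00,
    0.2774479391081561e+00, 0.3557192797195578e+00, 0.4559662159666857e+00,
    0.5844792718191478e+00, 0.7495918095861060e+00, 0.9626599456939077e+00,
    0.1239869481076760e+01, 0.1605927580173348e+01, 0.2102583514906888e+01,
    0.2811829220697454e+01, 0.3937959064316012e+01, 0.6294697335695096e+01])

r = np.geomspace(1.0, 1024.0, 5000)
err = np.abs(1.0 / r - np.exp(-np.outer(r, t)) @ w)
print("max absolute error:", err.max(), "  max relative error:", (err * r).max())
```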
The nodes and weights satisfying (34) can be computed by using a procedure for generating generalized Gaussian quadratures for Chebyshev systems together with the proof of Lemma 4.4. Indeed, the construction of Lemma 4.5 is constructive with the exception of the step that invokes Lemma 4.2 of Kreĭn. The procedure described in [4] is a constructive version of Lemma 4.2: given a Chebyshev system of functions, it generates the corresponding quadrature nodes and weights. We remark that generalized Gaussian quadrature generation codes are powerful tools for numerical computation with a wide range of applications. The quadrature generation code used in this paper was an optimized version of [4] recently developed by Serkh for [19].
Acknowledgements.
The authors would like to thank Jeremy Hoskins for many useful discussions. Certain commercial equipment is identified in this paper to foster understanding. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that equipment identified is necessarily the best available for the purpose.
N.F.M. was supported in part by NSF DMS-1903015.
V.R. was supported in part by AFOSR FA9550-16-1-0175 and ONR N00014-14-1-0797.
Contributor Information
ZYDRUNAS GIMBUTAS, National Institute of Standards and Technology, Boulder, CO 80305, USA.
NICHOLAS F. MARSHALL, Department of Mathematics, Princeton University, Princeton, NJ 08540, USA
VLADIMIR ROKHLIN, Program in Applied Mathematics, Yale University, New Haven, CT 06511, USA.
References
- [1].Beylkin Gregory and Monzón Lucas, Approximation by exponential sums revisited, Appl. Comput. Harmon. Anal 28 (2010), no. 2, 131–149. MR2595881 [Google Scholar]
- [2].Beylkin Gregory and Monzón Lucas, On approximation of functions by exponential sums, Appl. Comput. Harmon. Anal 19 (2005), no. 1, 17–48. MR2147060 [Google Scholar]
- [3].Braess Dietrich, Nonlinear approximation theory, Springer Series in Computational Mathematics, vol. 7, Springer-Verlag, Berlin, 1986. MR866667 [Google Scholar]
- [4].Bremer James, Gimbutas Zydrunas, and Rokhlin Vladimir, A nonlinear optimization procedure for generalized Gaussian quadratures, SIAM J. Sci. Comput 32 (2010), no. 4, 1761–1788. MR2671296 [Google Scholar]
- [5].Dahlquist Germund and Björck Åke, Numerical methods, Dover Publications, Inc., Mineola, NY, 2003, Translated from the Swedish by Ned Anderson, Reprint of the 1974 English translation. MR1978058 [Google Scholar]
- [6].Dutt A, Gu M, and Rokhlin V, Fast algorithms for polynomial interpolation, integration, and differentiation, SIAM J. Numer. Anal 33 (1996), no. 5, 1689–1711. MR1411845 [Google Scholar]
- [7].Fong William and Darve Eric, The black-box fast multipole method, J. Comput. Phys 228 (2009), no. 23, 8712–8725. MR2558773 [Google Scholar]
- [8].Gauss CF. Methodus nova integralium valores per approximationem inveniendi, Werke, 3 (1866), 163–196. [Google Scholar]
- [9].Greengard Leslie, The rapid evaluation of potential fields in particle systems, ACM Distinguished Dissertations, MIT Press, Cambridge, MA, 1988. MR936632 [Google Scholar]
- [10].Greengard Leslie and Rokhlin Vladimir, A new version of the fast multipole method for the Laplace equation in three dimensions, Acta numerica, 1997, Acta Numer., vol. 6, Cambridge Univ. Press, Cambridge, 1997, pp. 229–269. MR1489257 [Google Scholar]
- [11].Jakob-Chien Rüdiger and Alpert Bradley K., A fast spherical filter with uniform resolution, Journal of Computational Physics 136 (1997), no. 2, 580–584. [Google Scholar]
- [12].Karlin Samuel and Studden William J., Tchebycheff systems: With applications in analysis and statistics, Pure and Applied Mathematics, Vol. XV, Interscience Publishers John Wiley & Sons, New York-London-Sydney, 1966. MR0204922 [Google Scholar]
- [13].Kreĭn MG, The ideas of P. L. Čebyšev and A. A. Markov in the theory of limiting values of integrals and their further development, Amer. Math. Soc. Transl. (2) 12 (1959), 1–121. MR0113106 [Google Scholar]
- [14].Ma J, Rokhlin V, and Wandzura S, Generalized Gaussian quadrature rules for systems of arbitrary functions, SIAM J. Numer. Anal 33 (1996), no. 3, 971–996. MR1393898 [Google Scholar]
- [15].Martinsson Per-Gunnar, Rokhlin Vladimir, and Tygert Mark, On interpolation and integration in finite-dimensional spaces of bounded functions, Commun. Appl. Math. Comput. Sci 1 (2006), 133–142. MR2244272 [Google Scholar]
- [16].Nabors K, Korsmeyer FT, Leighton FT, and White J, Preconditioned, adaptive, multipole-accelerated iterative methods for three-dimensional first-kind integral equations of potential theory, SIAM J. Sci. Comput 15 (1994), no. 3, 713–735, Iterative methods in numerical linear algebra (Copper Mountain Resort, CO, 1992). MR1273161 [Google Scholar]
- [17].NIST Digital Library of Mathematical Functions. http://dlmf.nist.gov/, Release 1.0.22 of 2019-03-15. Olver FWJ, Olde Daalhuis AB, Lozier DW, Schneider BI, Boisvert RF, Clark CW, Miller BR and Saunders BV, eds
- [18].Rokhlin V, A fast algorithm for the discrete Laplace transformation, J. Complexity 4 (1988), no. 1, 12–32. MR939693 [Google Scholar]
- [19].Serkh Kirill, On the Solution of Elliptic Partial Differential Equations on Regions with Corners, ProQuest LLC, Ann Arbor, MI, 2016, Thesis (Ph.D.)–Yale University. MR3564124 [Google Scholar]
- [20].Swarztrauber PN, Vectorizing the FFTs, Parallel Computations (Rodrigue G, ed.), Academic Press, 1982, pp. 51–83. [Google Scholar]
- [21].Tygert M. Analogues for Bessel Functions of the Christoffel-Darboux Identity. Yale Tech. Rep (2016). [Google Scholar]
- [22].Yarvin Norman and Rokhlin Vladimir, An improved fast multipole algorithm for potential fields on the line, SIAM J. Numer. Anal 36 (1999), no. 2, 629–666. MR1675269 [Google Scholar]
- [23].Yarvin N and Rokhlin V, Generalized Gaussian quadratures and singular value decompositions of integral operators, SIAM J. Sci. Comput 20 (1998), no. 2, 699–718. MR1642612 [Google Scholar]