Skip to main content
NIST Author Manuscripts logoLink to NIST Author Manuscripts
. Author manuscript; available in PMC: 2020 Oct 20.
Published in final edited form as: J Math Phys. 2017;58(2):https://doi.org/10.1063/1.4976499.

A Fast Summation Method for Oscillatory Lattice Sums

Ryan Denlinger *, Zydrunas Gimbutas , Leslie Greengard , Vladimir Rokhlin §
PMCID: PMC7574582  NIHMSID: NIHMS1629753  PMID: 33087986

Abstract

We present a fast summation method for lattice sums of the type which arise when solving wave scattering problems with periodic boundary conditions. While there are a variety of effective algorithms in the literature for such calculations, the approach presented here is new and leads to a rigorous analysis of Wood’s anomalies. These arise when illuminating a grating at specific combinations of the angle of incidence and the frequency of the wave, for which the lattice sums diverge. They were discovered by Wood in 1902 as singularities in the spectral response. The primary tools in our approach are the Euler-Maclaurin formula and a steepest descent argument. The resulting algorithm has super-algebraic convergence and requires only milliseconds of CPU time.

1. Introduction

A variety of problems in acoustics and electromagnetics require the solution of quasiperiodic scattering problems - that is, time-harmonic wave scattering from a periodic structure with a well-defined unit cell [21, 25]. For the sake of simplicity, we will focus on the scalar (acoustic) case. In the three dimensional setting, we imagine that an acoustic plane wave of the form uin=ei(k1x+k2y+k3z) impinges on a two-dimensional array of scatterers centered at (nd1;md2) for n,m (Fig. 1, left). We denote the unit cell centered at the origin by U=[d12,d12]×[d22,d22]×(,) and the scatterer centered in the unit cell by S. The incoming wave satisfies the Helmholtz equation

Δu+k2u=0, (1)

where k=k12+k22+k32, and quasiperiodic boundary conditions on U, namely,

uin(x+d1,y,z)=eiαuin(x,y,z),uin(x,y+d2,z)=eiβuin(x,y,z),

where eiα=eik1d1 and eiβ=eik2d2 are complex (Bloch) phases. Note that for a normally incident wave, with k3 = k and k1 = k2 = 0, the boundary conditions reduce to simple periodicity. Assuming a “sound-soft” obstacle S, the scattered field uscat exterior to S in the domain (x, y, z) ∈ U must satisfy the Helmholtz equation (1) and the boundary conditions

uscat(x+d1,y,z)=eiαuscat(x,y,z),uscat(x,y+d2,z)=eiβuscat(x,y,z),uscat(x,y,z)=uin(x,y,z)|S.

Figure 1:

Figure 1:

On the left is a periodic two-dimensional array of spherical scatterers lying in the xy-plane, with an incident plane wave at frequency k=k12+k22+k32 traveling in the direction (k1, k2, k3). The unit cell is denoted by U. On the right is a periodic one-dimensional array of scatterers lying along the x-axis, with an incident plane wave at frequency k=k12+k22 traveling in the direction (k1, k2). The unit cell is again denoted by U.

In the two dimensionsal case, the incoming acoustic plane wave takes the form uin=ei(k1x+k2y)=ei(kcosψx+ksinψy), where k=k12+k22 and ψ is the angle of incidence of the incoming wave with respect to the x-axis. The one-dimensional array of scatterers is assumed to be centered at (nd, 0) for n (Fig. 1, right). The unit cell will again denoted by U=[d2,d2]×(,) and the scatterer centered at the origin will again denoted by S. The incoming wave satisfies the Helmholtz equation, while the quasiperiodic condition is now simply

uin(x+d,y)=eiαuin(x,y),

with Bloch phase eiα=eik1d. The scattered field uscat, exterior to S but within the domain U, must satisfy the Helmholtz equation (1) and the boundary conditions

uscat(x+d,y)=eiαuscat(x,y),uscat(x,y)=uin(x,y)|S.

Without entering into the details of integral equation methods or multiple scattering theory [5, 20, 24], we note that the Green’s function for the Helmholtz equation in free space is given by

G3D(x,y,z)=eikr4πr=(ik4π)h0(kr),    G2D(x,y,z)=H0(kr)4i,

in three and two dimensions, respectively, where r=x2+y2+z2 or x2+y2 depending on the dimension, h0(x) denotes the spherical Hankel function of order zero, and H0(x) denotes the standard Hankel function of order zero. For quasiperiodic scattering, the Green’s function can be expressed formally as

G3DQP(x,y,z)=ik4πn=m=h0(k(xnd1)2+(ymd2)2+z2)eink1d1eimk2d2, (2)
G2DQP(x,y)=14in=H0(k(xnd)2+y2)eik1nd. (3)

Note that both formulas (2) and (3) are simply infinite series of translated fundamental solutions.

By making use of standard addition theorems [20], it is straightforward and well-known that we can write

G3DQP(x,y,z)=ik4π[h0(kr)+l=0j=llSl,jYlj(θ,ϕ)jl(kr)],

where

Sl,j=Sl,j(d1,d2,k,k1,k2)=n,m=(n,m)(0,0)hl(k(nd1)2+(md2)2)Ylj(π2,ϕnm)eink1d1eimk2d2. (4)

Here, hn and jn denote the spherical Hankel and Bessel functions of order n and ϕnm is the angle subtended by the point (nd1, md2) with respect to the x-axis. The function Ylj denotes the spherical harmonic of degree l and order j:

Ylj(θ,ϕ)=2l+14π(l|j|)!(l+|j|)!Pl|j|(cosθ)eijϕ, (5)

where the associated Legendre functions Plj can be defined by the Rodrigues’ formula

Plj(x)=(1)j(1x2)j/2djdxjPl(x),

with Pl(x) the standard Legendre polynomial of degree l.

Similarly, in two dimensions, we have

G2DQP(x,y,z)=14i[H0(kr)+l=SlJl(kr)eilθ],

where Hn and Jn denote the usual Hankel and Bessel functions and

Sl=Sl(d,k,k1)=n\{0}Hl(|n|kd)(sgn n)leik1nd. (6)

The sums appearing in (4) and (6) are referred to as lattice sums [11, 17, 18, 23]. If the expressions in (2), (3) (4) and (6) were well-defined, it would be straightforward to verify that G3DQP and G2DQP satisfy the desired quasiperiodicity conditions.

Unfortunately, three fundamental difficulties are encountered in the use of lattice sums: they are conditionally convergent, their “direct” numerical evaluation is extremely slow by naive methods, and they diverge for certain values of the wave parameters k1, k2, k3 and lattice parameters d1 and d2 - giving rise to what are known as Wood’s anomalies [26]. The behavior of the scattered field is quite striking in the neighborhood of those parameter values, as discussed in [4, 11, 16] and in some detail below. In the two-dimensional case, it is straightforward to see from Fourier analysis that the scattered field (away from the obstacle) must take the form [11]

uscat(x,y)=n=ane2πinx/deik1xeiβny,

for y > 0 and

uscat(x,y)=n=bne2πinx/deikcosψxeiβny,

for y < 0, where βn2+(k1+2πnd)2=k2 with the root taken as positive real or positive imaginary. Real values of βn correspond to propagating modes, while values of βn on the positive imaginary axis correspond to evanescent modes. Wood’s anomalies occur when βn = 0 and the scattered wave is propagating exactly along the array in the x-direction - a very special type of physical resonance. In three dimensions, the scattered field (away from the plane of obstacles) must take the form

uscat(x,y,z)=m,nam,ne2πinx/d1eik1xe2πimy/d2eik2yeiβn,mz,

for z > 0 and

uscat(x,y,z)=m,nbm,ne2πinx/d1eik1xe2πimy/d2eik2yeiβn,mz,

for z < 0, where

βn,m2+(k1+2πnd1)2+(k2+2πmd2)2=k2. (7)

Wood’s anomalies occur when βn,m = 0 and the scattered wave is propagating in some direction along the xy-plane [15, 22, 12].

The computation of lattice sums and the resonant behavior corresponding to Wood’s anomalies have been widely studied, from both a physical and a mathematical perspective (see, for example, [4, 10, 11, 12, 13, 16, 17, 18, 19, 22, 23, 27]). Oddly enough, in the numerical literature for evaluating lattice sums, the occurence of Wood’s anomalies is often ignored, despite the fact that the series can diverge and despite the potential loss of accuracy in computational results near such singularities.

In this paper, we focus on the rigorous analysis of one-dimensional lattice sums using a novel method based on quadrature, Euler-MacLaurin corrections to the trapezoidal rule, and steepest descent arguments. The reason for concentrating on the one-dimensional case is the remarkable work of McPhedran, Nicorovici, Botten, Grubits, Enoch and Nixon [8, 17], who showed that, once the one-dimensional lattice sums along the x-axis are obtained, highlighted in blue on the left-hand side of Fig. 1, the remaining contributions in higher dimensions can be computed semi-analytically using the Poisson summation formula. This technique, sometimes referred to as lattice reduction [13], is outlined briefly in section 2.

From a physical perspective, the correct choice of the conditionally convergent lattice sums corresponds to adding an infinitesimal amount of dissipation: that is, we replace the real wave number k by k + iϵ and then let ϵ → 0+. For any fixed ϵ > 0, the relevant infinite series converges absolutely. Thus, instead of (6) and (4), we define the conditionally convergent lattice sums by

Sl=limϵ0+n\{0}Hl(|n|(k+iϵ)d)(sgnn)leik1nd (8)

and

Sl,j=limϵ0+n,m=(n,m)(0,0)hl((k+iϵ)(nd1)2+(md2)2)Ylj(π2,ϕnm)eink1d1eimk2d2. (9)

The existence of these limits will be established in our analysis, and the original physical interpretation does not play a role. We will show that the one-dimensional sums may be evaluated through techniques of complex analysis, combined with the Euler-Maclaurin formula with superalgebraic convergence. Our fast algorithm is derived in sections 3 and 6, and with proofs collected in sections 4 and 5. Section 7 presents some numerical experiments, and section 8 contains some concluding remarks.

2. Lattice Reduction

We illustrate the lattice reduction technique of [8, 17] for the case shown on the left-hand side of Fig. 1, with the added simplification that we assume the lattice to be square with unit cell of area one (d1 = d2 = 1). For the prescribed wavenumber k, the Bloch phases are then given by (eiα,eiβ)=(eik1,eik2). For the sake of brevity, we consider only the lattice sum S0,0 from (4) which now takes the form:

S0,0=(n,m)(0,0)h0(kn2+m2)eiαneiβm=S0,0++S0,0grating+S0,0, (10)

where

S0,0,+=m>0nh0(kn2+m2)eiαneiβm,    S0,0=m<0nh0(kn2+m2)eiαneiβm,

and

S0,0grating=n0h0(k|n|)eiαn. (11)

Let us consider the sum S0,0+, which we write in the form

S0,0+=m>0nh0(kn2+m2)eiαneiβm=m>0eiβmsm,

where

sm=nh0(kn2+m2)eiαn.

(S0,0 is treated in an analogous fashion.)

The important things to note about sm are (1) that it is the sum of the function h0(kx2+m2)eiαx sampled at the integers n, (2) that the function is smooth since m > 0, and (3) that the spherical Hankel function h0 has the spectral representation [5, 20, 24]

h0(kx2+y2+z2)=12πikeys2+t2k2s2+t2k2eisxeitzdsdt,

for y > 0. From this, letting x = n, y = m, and z = 0, we have

sm=12πiknems2+t2k2s2+t2k2ein(s+α)dsdt.

We now apply the Poisson summation formula [17, 23], which we write informally as

neinx=2πnδ(x+2πn).

This yields

sm=1iknemt2k2+(2πnα)2t2k2+(2πnα)2dt.

From this, we have

S0,0+=1ikm>0neiβmemt2k2+(2πnα)2t2k2+(2πnα)2dt=1ikneiβt2k2+(2πnα)2t2k2+(2πnα)211eiβt2k2+(2πnα)2dt.

The last expression is obtained by summing a geometric series in the index m. Assuming that this formal manipulation makes sense, the resulting integral is rapidly converging in t and the outer summation is rapidly converging in n. We omit further details, referring the reader to [8, 17]. Suffice it to say that, as a result of this observation, the principal obstacle in evaluating the lattice sum S0,0 is that of computing the one-dimensional sum S0,0grating in (11). This “punctured sum” cannot be evaluated through the Poisson summation formula directly, since the summand at n = 0 is undefined (although Ewald type methods could be used to overcome this [11]).

The remainder of this paper is devoted to a new approach for the punctured sum, which results in a fast algorithm and may be of mathematical interest in its own right.

3. One-dimensional lattice sums and the Euler-MacLaurin formula

To develop a unified framework that can handle one-dimensional sums such as S0,0grating in (11) or Sl in (6), let us now fix k, d > 0 and suppose that f(z) is a complex-analytic function such that, for some β > 0, there exists a representation for the pth derivative of the form

f(p)(z)=Φp(z)zβeiz, (12)

where Φp(z) has an asymptotic series Φp(z)~μ=0cp,μzμ valid for |arg z| < π/2 + δ and |z| → ∞, for sufficiently small δ > 0.We also assume that Φp is bounded on the region {|z| > δ′, |arg z| < π/2 + δ}, for each δ′ > 0.The Bessel functions hn and Hn are well-known to satisfy such estimates [1].

For ϵ > 0 and α, we define the function

Fαϵ(z)=f((k+iϵ)z)eiαz. (13)

We shall denote by Fα the corresponding expression with ϵ = 0. For reasons that will become apparent later, we shall assume that {(k±α)d/2π}=. We now consider absolutely convergent sum of the general form:

Sϵ(α)=n>0Fαϵ(nd). (14)

Clearly, for l, sums of the form

n\{0}f((k+iϵ)|nd|)(sgnn)leiαnd=Sϵ(α)+(1)lSϵ(α),

so we shall concern ourselves primarily with Sϵ(α). This framework covers the cases of physical interest in quasiperiodic scattering.

Our approach will require the use of an infinitely differentiable filter function, which we introduce here.

Definition 3.1 Let ψC([0, 1]) be a decreasing function such that, for some r with 0<r<14, ψ|[0,2r] = 1 and ψ|[1−2r,1] = 0. For b, c, we denote the scaled filter function ψb,c(x) by

ψb,c(x)={1,x<b,ψ[1c(xb)],bxb+c,0,b+c<x, (15)

See Fig. 2. Note that the parameter r determines both how flat the scaled filter function ψb,c(x) is at the points x = b and x = b + c, and how steep the transition is from 1 to 0.

Figure 2:

Figure 2:

Lattice sums and the Euler-MacLaurin formula: The top row shows the discrete values of Fα(nd) whose sum corresponds to S(α). The second row depicts an infinitely differentiable filter function ψb,c(x), which equals 1 for xb and 0 for x > b + c. The third row is a plot of the discrete values (nd)ψb,c(n)) and the bottom row is a plot of (xd)(1–ψb,c(x)). Our method exploits the relation between the desired lattice sum and the improper integral n=bFα(xd)(1ψb,c(x))dx, which involves Euler-MacLaurin corrections at the endpoints.

From the properties of ψb,c, we may write

Sϵ(α)=n=1b1Fαϵ(nd)+n=bb+cFαϵ(nd)ψb,c(n)+n=bFαϵ(nd)(1ψb,c(n)).

Note that the third term corresponds to the trapezoidal approximation for the integral

bFα(xd)(1ψb,c(x))dx.

We now show that this integral can be evaluated by contour deformation and that the difference between the integral and the desired sum can be computed with high precision using Euler-MacLaurin corrections.

Lemma 3.1 Let ϕ ∈ (−π/2,π/2), such that tan ϕ = (k + α)/ϵ. Then

bFαϵ(xd)dx=1d0Fαϵ(bd+ζτ)ζdτ

where ζ=eiϕ=ϵ+i(k+α)ϵ2+(k+α)2.

Proof Let C0, Cϕ denote the outward rays in the complex plane with arg z = 0 and arg z = ϕ. We begin by writing

bFαϵ(xd)dx=1d0Fαϵ(bd+t)dt=1dC0Fαϵ(bd+z)dz

Now let C0(R) = [0, R] and let Cϕ(R) = [0, ]. Let B(R) denote the circular arc from R to , let I = [0, ϕ] if ϕ > 0, and let I = [ϕ, 0] if ϕ < 0. It is straightforward to verify that

|B(R)Fαϵ(bd+z)dz|=|IFαϵ(bd+Reiϕ)Rieiϕdϕ|C(|k|bd)βReϵRcosϕ,

which goes to zero as R → ∞. This justifies the contour deformation from C0 to Cϕ:

bFαϵ(xd)dx=1dCϕFαϵ(bd+z)dz=1d0Fαϵ(bd+ζτ)ζdτ.

Theorem 3.1 Let ψb,c be a scaled filter function (Definition 3.1.) Then, for any fixed integer s ≥ 2, we have

Sϵ(α)=n=1b1Fαϵ(nd)+n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx+1d0Fαϵ(bd+ϵ+i(k+α)ϵ2+(k+α)2τ)ϵ+i(k+α)ϵ2+(k+α)2dτ+Rs,b,cϵ(α), (16)

where

Rs,b,cϵ(α)=(1)s1s!bBs(x[x])[dsdts(Fαϵ(td)(1ψb,c(t)))|t=x]dx, (17)

and Bs(x) denotes the Bernoulli polynomial of order s.

Proof We have

Sϵ(α)=n>0Fαϵ(nd)=n>0Fαϵ(nd)ψb,c(n)+n>0Fαϵ(nd)(1ψb,c(n)). (18)

Since ψb,c(n) = 1, for nb, and ψb,c(n) = 0, for n > b + c, we have

Sϵ(α)=n=1b1Fαϵ(nd)+n=bb+cFαϵ(nd)ψb,c(n)+n=b+1Fαϵ(nd)(1ψb,c(n)).

We now employ the Euler-Maclaurin formula (see [2], p. 619, Theorem D.2.1) to find that, for s\{0}, N > b,

n=b+1NFαϵ(nd)(1ψb,c(n))=bNFαϵ(xd)(1ψb,c(x))dx+γ=1s(1)γBγγ!dγ1dxγ1(Fαϵ(xd)(1ψb,c(x)))|x=Nγ=1s(1)γBγγ!dγ1dxγ1(Fαϵ(xd)(1ψb,c(x)))|x=b+(1)s1s!bNBs(x[x])[dsdts(Fαϵ(td)(1ψb,c(t)))|t=x]dx,

where the Bs(x) are the Bernoulli polynomials, and the Bγ are Bernoulli numbers. Observe that (1 − ψb,c(x)) is identically zero in a neighborhood of x = b and that the boundary term at x = b vanishes outright. Moreover, Fαϵ(xd) and all its derivatives tend to zero exponentially as x. Thus, only the two integral terms on the right hand side of the Euler-Maclaurin formula are preserved in the limit N → ∞.

Thus,

n=b+1Fαϵ(nd)(1ψb,c(n))=bFαϵ(xd)(1ψb,c(x))dx+Rs,b,cϵ(α)=bFαϵ(xd)dxbb+cFαϵ(xd)ψb,c(x)dx+Rs,b,cϵ(α),

and

Sϵ(α)=n=1b1Fαϵ(nd)+n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx+bFαϵ(xd)dx+Rs,b,cϵ(α).

The desired result now follows from Lemma 3.1.

Before turning to the algorithm itself, we will require two more results, whose proofs are deferred to the next sections.

Lemma 3.2 The term Rs,b,cϵ(α) from eq. (17) satisfies the estimate

limclim¯ϵ0+|Rs,b,cϵ(α)|=O(cNβ)

for any fixed N.

Lemma 3.3 Recall that Fαϵ(z)=f((k+iϵ)z)eiαz and for p{0} and q let

Gα,p,qϵ(x)=xf(p)((k+iϵ)td)eiαtde2πiqtdt. (19)

Then Gα,p,q(x)limϵ0+Gα,p,qϵ(x) exists (see formula (24)) and

limclim supϵ0+|n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dxA|=0 (20)

where

A=Fα(bd)γ=1s(1)γBγγ!dγ1dxγ1(Fα(xd))|x=b(1)s1s!Asj1=0s(sj1)dsksj1(iα)j1[q\{0}qsGα,sj1,q(b)], (21)

where As = −s!/(2πi)s (see [1],23.1.16).

4. Proof of Lemma 3.2

For Lemma 3.2, we need to estimate the remainder Rs,b,cϵ(α), defined via (17). This is precisely the error incurred by using the Euler-Maclaurin formula.

We begin by using the generalized product rule to expand the derivatives in (17):

Rs,b,cϵ(α)=(1)s1s!bBs(x[x])[dsdts(Fαϵ(td)(1ψb,c(t)))|t=x]dx=(1)s1s!j=0sj1=0sj(sj)(sjj1)dsj(k+iϵ)sjj1(iα)j1×bBs(x[x])(δj,0ψb,c(j)(x))f(sjj1)((k+iϵ)xd)eiαxddx.s

We see that it su ces to verify the smallness of finitely many terms of the following form:

Rα,b,c,p,jϵ=bBs(x[x])(δj,0ψb,c(j)(x))f(p)((k+iϵ)xd)eiαxddx. (22)

The functions Bs(x−[x]) are Bernoulli polynomials, and for s ≥ 2 we have the absolutely convergent Fourier expansion:

Bs(x[x])=Asq\{0}qse2πiqx,

with As = −s!/(2πi)s (see [1], 23.1.16). We shall assume henceforth that s ≥ 2 to guarantee convergence; then by Fubini’s theorem, we have

Rα,b,c,p,jϵ=Asq\{0}qsb(δj,0ψb,c(j)(x))f(p)((k+iϵ)xd)eiαxde2πiqxdx.

Let us define (for x > 0)

Gα,p,qϵ(x)=xf(p)((k+iϵ)td)eiαtde2πiqtdt. (23)

Integrating by parts, we have

Rα,b,c,p,jϵ=Asq\{0}qsbψb,c(j+1)(x)Gα,p,qϵ(x)dx=Asq\{0}qsbb+cψb,c(j+1)(x)Gα,p,qϵ(x)dx=Ascjq\{0}qs01ψ(j+1)(x)Gα,p,qϵ(b+cx)dx.

Let λq = (α + k)d + 2πq, and let ψ ∈ (π/2,π/2) such that tan ψq(dϵ)−1. Since we assumed that (k±α)d/2π, we have that λq ≠ 0 for any q; indeed, it holds that infq|λq|>0. Let C0, Cψ be contours in the z-plane along the outward rays arg z = 0 and arg z = ψ. We have

Gα,p,qϵ(x)=xf(p)((k+iϵ)td)eiαtde2πiqtdt=C0Φp((k+iϵ)(x+z)d)[(k+iϵ)(x+z)d]βei(λq+idϵ)(x+z)dz=ei(λq+idϵ)xC0Φp((k+iϵ)(x+z)d)[(k+iϵ)(x+z)d]βei(λq+idϵ)zdz

We now deform the contour C0 into Cψ, which will be justified momentarily.

Gα,p,qϵ(x)=ei(λq+idϵ)xCψΦp((k+iϵ)(x+z)d)[(k+iϵ)(x+z)d]βei(λq+idϵ)z dz=ei(λq+idϵ)x0Φp((k+iϵ)(x+eiψτ)d)[(k+iϵ)(x+eiψτ)d]βei(λq+idϵ)(cosψ+isinψ)τeiψ dτ=ei(λq+idϵ)x0Φp((k+iϵ)(x+eiψτ)d)[(k+iϵ)(x+eiψτ)d]βeτλq2+d2ϵ2eiψ dτ.

This implies that Gα,p,qϵ(x) is bounded on [1, ∞) uniformly as ϵ → 0+, |q| → ∞, moreover, by the dominated convergence theorem (with η = sgn λq),

limϵ0+Gα,p,qϵ(x)=eiλqxiη0Φp((x+iητ)kd)[(x+iητ)kd]βe|λq|τdτ. (24)

The contour deformation in the above calculation, which takes place at some fixed ϵ > 0, is justified by the following estimate, where B(R) and I are defined as on the proof of Lemma 3.1.

|B(R)Φp((k+iϵ)(x+z)d)[(k+iϵ)(x+z)d]βei(λq+idϵ)zdz|=|IΦp((k+iϵ)(x+Reiψ)d)[(k+iϵ)(x+Reiψ)d]βeiR(λq+idϵ)(cosψ+isinψ)Rieiψdψ|CR(|k|xd)βIe|λq|Rsin|ψ|eϵdRcosψdψC(|k|xd)βReϵdRcosψ0 as R.

Let us define Gα,p,q(x)=limϵ0+Gα,p,qϵ(x); we desire an asymptotic approximation to this function as x → ∞. Recall that we have an asymptotic approximation Φp(z)~μ=0cp,μzμ, |z| → ∞. Thus, for any fixed N ≥ 1, we have

Gα,p,q(x)=eiλqxiη0Φp((x+iητ)kd)[(x+iητ)kd]βe|λq|τdτ=eiλqxiη(μ=0Ncp,μ(kd)μ+β0e|λq|τ(x+iητ)μ+βdτ+O(xNβ)),

Now we integrate by parts iteratively, finitely many times, until the error term is O(xNβ)

Gα,p,q(x)=eiλqxiη(μ=0Nν=0Nμcp,μ(iη)ν0i1<ν(μβi1)(kd)μ+β|λq|ν+1x(μ+ν)β+O(xNβ))=eiλqxiη(ρ=0Nμ=0ρcp,μ(iη)ρμ0i1<(ρμ)(μβi1)(kd)μ+β|λq|ρμ+1xρβ+O(xNβ))=eiλqxiη(ρ=0N1[μ=0ρcp,μ(iη)ρμ0i1<(ρμ)(μβi1)(kd)μ+β|λq|ρμ+1]xρβ+O(xNβ)).

In particular, we find that Gα,p,q(x)~eiλqxρ=0Gρ,p,qxρβ; moreover, since k and d are fixed, we have an estimate |Gρ,p,q| ≤ Cρ,p·(1 + |λq|ρ−1). since {(k±α)d/2π}=, we have that is infqq| > 0, so for fixed ρ, Gρ,p,q is uniformly bounded in q. Additionally, the error estimate O(xNβ) is uniform in q, in that both the constant and the domain of validity are uniform in q, if N is held constant. (This relies on infq |λq| > 0.)

Recall that

Rα,b,c,p,jϵ=Ascjq\{0}qs01ψ(j+1)(x)Gα,p,qϵ(b+cx)dx.

Since Gα,p,qϵ(x) is uniformly bounded in the simultaneous limit ϵ → 0+, |q| → ∞, we may use the dominated convergence theorem to commute the limit ϵ → 0+ with the integral, as well as the sum over q:

limϵ0+Rα,b,c,p,jϵ=Ascjq\{0}qs01ψ(j+1)(x)Gα,p,q(b+cx)dx.

By using Gα,p,q(x)=eiλqxρ=0N1Gρ,p,qxρβ+O(xNβ) (the error estimate being uniform in q), and the fact that j0,ψ(j+1)(x) is identically zero on [0,2r], we have (in the limit c → ∞)

limϵ0+Rα,b,c,p,jϵ=Ascjρ=0N1q\{0}Gρ,p,qeiλqbqsr1rψ(j+1)(x)eiλqcx(b+cx)ρ+βdx+O(cjNβ)=Ascjρ=0N1q\{0}Gρ,p,qeiλqbcρ+βqsr1rψ(j+1)(x)xρ+βeiλqcx(1+bcx)ρβdx+O(cjNβ)

Since c is large and xr, we may expand (1 + b/cx)ρβ in powers of (b/cx).

limϵ0+Rα,b,c,p,jϵ=Asρ=0N1ω=0Nρ1[(1)ωi1=0ω1(ρ+i1+β)ω!bωcj+ρ+ω+βq\{0}Gρ,p,qeiλqbqsr1rψ(j+1)(x)xρ+ω+βeiλqcxdx]+O(cjNβ).

Note that the integral on the right hand side depends on c only through eiλqcx, and ψ(j+1)(x)/xρ+ω+β is smooth and compactly supported on [r, 1 – r]. Thus, the integral decays faster than any power of c−1 as c → ∞, and such decay is uniform in q.

Thus, as c → ∞, for any fixed N, we have

limϵ0+Rα,b,c,p,jϵ=O(cjNβ),

yielding the desired result.

5. Proof of Lemma 3.3

For Lemma 3.3 we employ the Euler-Maclaurin formula yet again.

n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx=Fαϵ(bd)+γ=1s(1)γBγγ![dγ1dxγ1(Fαϵ(xd)ψb,c(x))|x=b+cdγ1dxγ1(Fαϵ(xd)ψb,c(x))|x=b]+(1)s1s!bb+cBs(x[x])[dsdts(Fαϵ(td)ψb,c(t))|t=x]dx.

Since ψb,c(x) is identically zero in a neighborhood of b + c and identically one in a neighborhood of b, this reduces to

n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx=Fαϵ(bd)γ=1s(1)γBγγ!dγ1dxγ1(Fαϵ(xd))|x=b+(1)s1s!bb+cBs(x[x])[dsdts(Fαϵ(td)ψb,c(t))|t=x]dx.

We expand the derivatives as before and define

Eα,b,c,p,jϵ=bb+cBs(x[x])ψb,c(j)(x)f(p)((k+iϵ)xd)eiαxddx, (25)
Eα,b,cϵ=(1)s1s!j=0sj1=0sj(sj)(sjj1)dsj(k+iϵ)sjj1(iα)j1Eα,b,c,sjj1,jϵ. (26)

Then

n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx=Fαϵ(bd)γ=1s(1)γBγγ!dγ1dxγ1(Fαϵ(xd))|x=b+Eα,b,cϵ. (27)

The limit of each term as ϵ → 0+ exists (trivially for the first two terms, by the dominated convergence theorem for the third):

limϵ0+[n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx]=Fα(bd)γ=1s(1)γBγγ!dγ1dxγ1(Fα(xd))|x=b+limϵ0+Eα,b,cϵ.

Notice that the first two terms are fully independent of c; to complete the argument, it suffices to show that limclimϵ0+Eα,b,c,p,jϵ exists (we do not claim a limit of zero). Moreover, the c → ∞ convergence behavior of the desired limit is fully determined by the c → ∞ convergence behavior of limϵ0+Eα,b,c,p,jϵ.

As before, we replace Bs (x – [x]) by its absolutely convergent Fourier expansion (s ≥ 2) and use Fubini’s theorem:

Eα,b,c,p,jϵ=bb+cBs(x[x])ψb,c(j)(x)f(p)((k+iϵ)xd)eiαxddx=Asq\{0}qsbb+cψb,c(j)(x)f(p)((k+iϵ)xd)eiαxde2πiqxdx=As[δj,0q\{0}qsGα,p,qϵ(b)+q\{0}qsbb+cψb,c(j+1)(x)Gα,p,qϵ(x)dx]=Asδj,0q\{0}qsGα,p,qϵ(b)Rα,b,c,p,jϵ. (28)

Due to the uniform boundedness of Gα,p,qϵ(x) on [1,∞] in the simultaneous limit ϵ → 0+, |q| → ∞, we may commute the ϵ → 0+ limit with the sum in the first term:

lim supϵ0+|Eα,b,c,p,jϵ+Asδj,0q\{0}qsGα,p,q(b)|=lim supϵ0+|Rα,b,c,p,jϵ|. (29)

The right-hand side is O(cjN1/2) as c → ∞, for any N, as has already been established.

6. Informal description of the algorithm

Using equations (16) and Lemma 3.3, we can conclude the existence of limϵ→0+ Sϵ(α) and obtain an explicit formula for this quantity (for a fixed s ≥ 2):

limϵ0+Sϵ(α)=n=1bFα(nd)γ=1s(1)γBγγ!dγ1dxγ1(Fα(xd))|x=b+1d0Fα(bd+iτsgn(k+α))isgn(k+α)dτ(1)s1s!Asj1=0s(sj1)dsksj1(iα)j1[q\{0}qsGα,sj1,q(b)]. (30)

While this formula is mathematically interesting, it does not provide a convenient algorithm due to the presence of terms depending on Gα,p,q(b). Instead, our algorithm approximates this limit by computing the first four terms of the formula (16) at ϵ = 0, for larger and larger values of c. Specifically, the expression is evaluated for increasing c at fixed b until convergence is obtained with the desired number of digits. This procedure is formally justified due to Lemma 3.2 and Lemma 3.3.

It is, perhaps, of interest to note that the conditional convergence of the lattice sums is discussed in the literature (see, for example [11]), but not typically studied in terms of an explicit limit with vanishing dissipation. An exception is [7].

6.1. Wood anomalies

As noted in the introduction, when (k ± α)d = 2πn, where n, the lattice sums considered in this paper may actually diverge. Such singularities are known as Wood’s anomalies and correspond to a kind of physical resonance in the underlying system. Using (30) and (24), we expect to see |S(α)||δk|12 for sums involving H and |S(α)|log1|δk| for sums involving h as |δk| → 0. Here, |δk| is the distance (in wavenumber variable k) to the closest Wood anomaly. Two dimensional Wood anomalies (see (7)), are more complicated to characterize and will be considered at a later date.

7. Numerical Validation

Our fast summation algorithm for one-dimensional lattice sums, as previously indicated, is based upon the formula (16),

Sϵ(α)=n>0Fαϵ(nd)=n>0f((k+iϵ)nd)eiαnd=n=1b1Fαϵ(nd)+n=bb+cFαϵ(nd)ψb,c(n)bb+cFαϵ(xd)ψb,c(x)dx+1d0Fαϵ(bd+ϵ+i(k+α)ϵ2+(k+α)2τ)ϵ+i(k+α)ϵ2+(k+α)2dτ+Rs,b,cϵ(α).

Using Lemma 3.2 and Lemma 3.3 which have been proven in the preceding sections, if c is sufficiently large then

limϵ0+Sϵ(α)n=1b1Fα(nd)+n=bb+cFα(nd)ψb,c(n)bb+cFα(xd)ψb,c(x)dx+1d0Fα(bd+iτsgn(k+α))isgn(k+α)dτ. (31)

Moreover, for any fixed cuto function ψ as in the statement of Theorem 3.1, the error associated with this approximation vanishes more quickly than any power of c, as c → ∞.

The fast algorithm proceeds as follows: First, we fix (once and for all) a smooth cut-off function ψ as in the statement of Theorem 3.1. We also fix a positive integer value for the parameter b, which does not play an essential role in the analysis. Second, we evaluate the right-hand side of (31) for integer values of c, doubling c at each iteration until convergence is obtained.

To test the speed and accuracy of our algorithm, we consider the cases where f is the Hankel function of the first kind Hl(1) or the spherical Hankel function of the first kind hl(1). With a Fortran implementation on a 1.3GHz Intel Core M processor, we found that the first 120 lattice sums Sl in (6) were computed in 1.2 milliseconds with 7 digits of accuracy for k = 1, with a unit cell of length d = 1 and α = 0.4. For k = 10, 1.5 milliseconds were required and for k = 100, 300 lattice sums were computed in 2 milliseconds. For validation, the sums Sϵ(α) were computed directly for ϵ > 0, and the limit ϵ → 0+ was determined numerically using fourth-order Richardson extrapolation starting at ϵ = 10−3, with an estimated accuracy of twelve digits.

In Fig. 3, we scan a range of frequencies k with the phase α set to either 0 or 0.4. The blowup of S0, defined by (6) is clearly visible. To investigate the behavior of our numerical method near Wood anomalies, we set parameters to d = 1, α=π4, and k=7π4+η, where 0 < η ≤ 1, and consider the lattice sums Sl involving H and h, respectively. In Fig. 4, we plot Sl and Sl,0grating, defined as the one-dimensional grating sum component of (4), for l = 0, 1, 2, 3. In the first case, there is a clear power-law singularity with the correct exponent. In the latter case, involving h, the blowup is only logarithmic and harder to fit precisely.

Figure 3:

Figure 3:

(Top) The periodic blowup of the lattice sums S0(α), defined by (6) as a function of frequency k, with α set to either 0 or 0.4 and d = 1. (Bottom) Time required for lattice sums with α = 0.4 at eack k.

Figure 4:

Figure 4:

(l) Power-law blow-up of Sl(α) near a Wood’s anomaly when f=Hl(1) with parameters d = 1, α=π4, k=7π4+δk. (r) Blow-up of the lattice sums Sl,0grating(α) near a Wood’s anomaly when f=hl(1), with the same parameters. Note that on the left, the y-axis is on a logarithmic scale, while on the right, it is on a linear scale.

8. Conclusions

We have described a general approach for the numerical evaluation of one-dimensonal lattice sums which play an important role in diffraction and wave propagation problems in both two and three dimensions. Indeed, it is often possible to efficiently reduce higher-dimensional sums to their one-dimensional counterparts as discussed in section 2. Our algorithm achieves super-algebraic convergence rates and is able to evaluate lattice sums accurately and efficiently. Moreover, our estimates supply an interesting analytic interpretation of Wood’s anomalies - physical resonances that occur when (k ± α)d = 2πn for integer n - which cause the lattice sums to diverge [12, 11]. In particular, the formulas (24) and (30) allow us to directly estimate the type of blow-up one should expect to see.

We believe that higher dimensional Wood anomalies can be analyzed by coupling our method with lattice reduction techniques, and we will report on such work at a later date. Finally, it is worth noting that lattice sums can be avoided altogether. Quasiperiodic boundary conditions can be imposed, for example, using layer potentials [3, 4, 9], fundamental solutions [6], or spherical harmonics [14] to enforce the conditions explicitly. Which approach is more efficient will depend, we suspect, on the ambient dimension, the frequency, and on the aspect ratio of the unit cell. Both this question and a comparison with other fast algorithms, such as those discussed in [10, 11, 12, 13, 16, 17, 18, 19, 23, 27] remain to be explored.

References

  • [1].Abramowitz M and Stegun IA. Handbook of Mathematical Functions. Dover, 1972. [Google Scholar]
  • [2].Andrews GE, Askey R, and Roy R. Special Functions. Cambridge University Press, 1999. [Google Scholar]
  • [3].Barnett A and Greengard L. A new integral representation for quasi-periodic fields and its application to two-dimensional band structure calculations. Journal of Computational Physics, 229:6898–6914, 2010. [Google Scholar]
  • [4].Barnett A and Greengard L. A new integral representation for quasi-periodic scattering problems in two dimensions. BIT Numer. Math, 51(1):67–90, 2011. [Google Scholar]
  • [5].Chew WC, editor. Waves and Fields in Inhomogeneous Media. IEEE Press, New York, 1995. [Google Scholar]
  • [6].Min Hyung Cho H Barnett. Robust fast direct integral equation solver for quasi-periodic scattering problems with a large number of layers. Optics Express, 23:1775–1799, 2015. [DOI] [PubMed] [Google Scholar]
  • [7].Dienstfrey Andrew, Hang Fengbo, and Huang Jingfang. Lattice sums and the two-dimensional, periodic Green’s function for the Helmholtz equation. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 457:67–85, 2001. [Google Scholar]
  • [8].Enoch S, McPhedran RC, Nicorovici NA, Botten LC, and Nixon JN. Sums of spherical waves for lattices, layers, and lines. J. Math. Phys, 42:5859–5870, 2001. [Google Scholar]
  • [9].Gillman Adrianna and Barnett Alex H.. A fast direct solver for quasi-periodic scattering problems. Journal of Computational Physics, 248:309–322, 2013. [Google Scholar]
  • [10].Kurkcu H and Reitich F. Stable and efficient evaluation of periodized green’s functions for the Helmholtz equation at high frequencies. Journal of Computational Physics, 228:75–95, 2009. [Google Scholar]
  • [11].Linton CM. Lattice sums for the Helmholtz equation. SIAM Rev., 52(4):630–674, 2010. [Google Scholar]
  • [12].Linton CM and Thompson I. Resonant effects in scattering by periodic arrays. Wave Motion, 44(3):165–175, 2007. [Google Scholar]
  • [13].Linton CM and Thompson I. One- and two-dimensional lattice sums for the three-dimensional Helmholtz equation. J. Comp. Phys, 228(6):1815, 2009. [Google Scholar]
  • [14].Liu Yuxiang and Barnett Alex H.. Efficient numerical solution of acoustic scattering from doubly-periodic arrays of axisymmetric objects, 2015. accepted, J. Comput. Phys; arxiv:1506.05083. [Google Scholar]
  • [15].Liu Yuxiang and Barnett Alex H.. Efficient numerical solution of acoustic scattering from doubly-periodic arrays of axisymmetric objects. Journal of Computational Physics, 324:226–245, 2016. [Google Scholar]
  • [16].Maystre Daniel. Theory of Wood’s Anomalies In Enoch S and Bonod N, editors, Plasmonics, chapter 2 Springer, Berlin, 2012. [Google Scholar]
  • [17].McPhedran RC, Nicorovici NA, Botten LC, and Grubits KA. Lattice sums for gratings and arrays. J. Math. Phys, 41(11):7808–7816, 2000. [Google Scholar]
  • [18].Moroz A. Exponentially convergent lattice sums. Optics Letters, 26(15):1119–1121, 2001. [DOI] [PubMed] [Google Scholar]
  • [19].Moroz A. Quasi-periodic Green’s functions of the Helmholtz and Laplace equations. J. Phys. A: Math. Gen, 39(36):11247, 2006. [Google Scholar]
  • [20].Morse P and Feshbach H, editors. Methods of Theoretical Physics. McGraw-Hill, New York, 1953. [Google Scholar]
  • [21].Petit R, editor. Electronmgnetic Theory of Gratings. Springer, Heidelberg, 1980. [Google Scholar]
  • [22].Shipman Stephen P.. Resonant scattering by open periodic waveguides In Ehrhardt Matthias, editor, Wave Propagation in Periodic Media: Progress in Computational Physics, vol. 1, pages 7–50. Bentham Books, Berlin, 2010. [Google Scholar]
  • [23].Twersky V. Elementary function representations of Schlomilch series. Arch. Ration. Mech. Anal, 8(1):323–332, 1961. [Google Scholar]
  • [24].van de Hulst HC, editor. Light Scattering by Small Particles. Dover, New York, 1981. [Google Scholar]
  • [25].Wilcox Calvin H., editor. Scattering Theory for Diffraction Gratings. Springer, New York, 2013. [Google Scholar]
  • [26].Wood RW. On a remarkable case of uneven distribution of light in a diffraction grating spectrum. Philos. Mag, 4:396–408, 1902. [Google Scholar]
  • [27].Yasumoto K and Yoshitomi K. Efficient calculation of lattice sums for free-space periodic Green’s function. IEEE Trans. Antennas Propag, 47(6):1050, 1999. [Google Scholar]

RESOURCES