Double exponential quadrature for fractional diffusion

Alexander Rieder

doi:10.1007/s00211-022-01342-8

2023 Jan 26;153(2-3):359–410. doi: 10.1007/s00211-022-01342-8


PMCID: PMC9998606 PMID: 36915282

Abstract

We introduce a novel discretization technique for both elliptic and parabolic fractional diffusion problems based on double exponential quadrature formulas and the Riesz–Dunford functional calculus. Compared to related schemes, the new method provides faster convergence with fewer parameters that need to be adjusted to the problem. The scheme takes advantage of any additional smoothness in the problem without requiring a-priori knowledge to tune parameters appropriately. We prove rigorous convergence results both for data of finite regularity and for data in certain Gevrey-type classes. We confirm our findings with numerical tests.

Mathematics Subject Classification: 65N15, 65M12

Introduction

The study of processes governed by fractional linear operators has gathered significant interest over the last few years [8, 22, 35] with applications ranging from physics [1] to image processing [1, 15, 16], inverse problems [19] and more. See [33] for an overview of applications in different fields. The goal is to solve problems of the form

\begin{matrix} {(- Δ)}^{β} u = f \quad \text{or} \quad \partial_{t}^{α} u + {(- Δ)}^{β} u = f \end{matrix}

with parameters $β$ , $α \in (0, 1]$ . There are multiple (non-equivalent) ways of defining fractional powers of operators. We mention the integral fractional Laplacian and the spectral definition [22]. In this paper, we focus on the spectral definition which is equivalent to the functional calculus definition.

For discretization of such problems, both stationary and time dependent, multiple approaches have been presented. A summary of the most common can be found in [2, 22]. They can be broadly distinguished into three categories. The first class of methods uses the Caffarelli–Silvestre extension to reformulate the problem as a PDE posed in one additional spatial dimension. This problem is then treated by standard finite element techniques [6, 24, 25, 27–29]. The second big class of discretization schemes, and the one our new scheme is part of, was first introduced in [7] and later extended to more general operators [5] and time dependent problems [3, 4, 26]. They are based on the Riesz–Dunford calculus (sometimes also referred to as Dunford–Taylor or Riesz–Taylor) and employ a $sinc$ quadrature scheme to discretize the appearing contour integral. $sinc$ quadrature and, more generally, $sinc$ -based numerical methods are less well known than their polynomial-based counterparts, but provide rapidly converging schemes [21, 32] that are very easy to implement. The quadrature relies on appropriate coordinate transforms in order to yield analytic, rapidly decaying integrands over the real line, which are then discretized using the trapezoidal quadrature rule. In [34] it was realized that by adding an additional $sinh$ -transformation, it is possible to get even faster convergence for certain integrals. Namely, writing $N_{q}$ for the number of quadrature points, instead of convergence of the form $e^{- \sqrt{N_{q}}}$ , it is possible to get rid of the square root and obtain rates of the form $e^{- \frac{N_{q}}{ln N_{q}}}$ . Further developments in this direction are summarized in [23]. Such schemes are commonly referred to as double exponential quadrature or $sinh$ - $tanh$ quadrature. Thirdly, there is the large class of methods based on rational approximation of the functions $z^{- β}$ and the Mittag–Leffler function $e_{γ, μ} (z)$ (see (3.18) for the precise definition). As shown in [17], this class also encompasses the previous two approaches while also allowing some other methods, based on general rational approximation algorithms like Best-Uniform-Rational approximation (BURA) or the “Adaptive Antoulas-Anderson”-algorithm (AAA) from [30]. Finally, there exist some further methods based on reduced basis and rational Krylov methods [9, 10, 12, 13] which are strongly related to rational approximation.

In this paper we investigate whether the discretization of the Riesz–Dunford integral can benefit from using a double exponential quadrature scheme instead of the more established $sinc$ -quadrature. We present a scheme that retains all the advantages of [3–5] while delivering improved convergence rates. Namely, the scheme is very easy to implement if a solver for elliptic finite element problems is available. It is almost trivially parallelizable, as the main cost consists of solving a series of independent elliptic problems. In addition, it provides (compared to $sinc$ -methods) superior accuracy over a wide range of applications and does not require subtle tweaking of parameters in order to get good performance. Instead it will automatically pick up any additional smoothness of the underlying problem to give improved convergence. Since for each quadrature point an elliptic FEM problem needs to be solved, reducing the number of quadrature points greatly increases performance of the overall method.

Compared to the BURA and AAA rational approximation methods, the sinc- and double-exponential quadrature based algorithms have several advantages. Firstly, the implementation is very simple with quadrature nodes that are known explicitly. The quadrature points are also independent of the spectrum of the operator $L$ and no explicit bound on the largest eigenvalue is required. This makes them better suited for highly accurate but highly ill-conditioned discretizations like the hp-FEM scheme in [26].

Secondly, the quadrature points are independent of the function that is to be approximated. Most notably, when considering the time-dependent problem with inhomogeneous right-hand side in Sect. 3.3, all the linear systems that need to be solved are independent of the time t or the integration variable $τ$ . This makes the full time-dependent problem of the same cost (with respect to the number of systems that need solving) as the simple stationary problem. Thirdly, the quadrature based methods allow for very detailed analysis, as showcased in this article. In addition to the quadrature analysis, they also allow for detailed analysis of the error brought in by a discretization in space [26]. In practice, we also observed better numerical stability in the presence of rounding errors, as showcased in Fig. 4.

Fig. 4 — Comparison of approximation schemes for 2d elliptic problems with different parameter $β$

The paper is structured as follows. In Sect. 1, we introduce the general setting and the functional calculus. Sect. 2 introduces the quadrature scheme as well as the model problems we are interested in. We also state the main convergence results. Sect. 3 is devoted to proving these results. Sect. 3.1 presents the abstract analysis for sinc methods and collects some known properties. In addition, we provide one small convergence result which, to our knowledge, has not yet appeared in the literature; we show that the double exponential formulas at least provide comparable convergence of order $e^{- \sqrt{N_{q} / ln (N_{q})}}$ even without requiring additional analyticity compared to standard sinc methods. In Sect. 3.2, we look at the case of a purely elliptic problem without time dependence. It will showcase the techniques used and provide the building block for the more involved problems later on. In Sect. 3.3, we then consider what happens if we move into the time-dependent regime. Section 4 provides extensive numerical evidence supporting the theory. We also compare our new method to the standard $sinc$ -based methods. Finally, Appendix A collects some properties of the coordinate transform involved. The proofs and calculations are elementary but somewhat lengthy and thus have been relegated to the appendix in order to not impact readability of the article.

Throughout this work we will encounter two types of error terms. For those of the form $e^{- \frac{γ}{k}}$ we will be content with not working out the constants $γ$ explicitly. For the more important terms of the form $e^{- \frac{γ^{'}}{\sqrt{k}}}$ we will derive explicit constants $γ^{'}$ which prove sharp in several examples of Sect. 4.

We close with a remark on notation. Throughout this text, we write $A ≲ B$ to mean that there exists a constant $C > 0$ , which is independent of the main quantities of interest like number of quadrature points $N_{q}$ or step size k such that $A \leq C B$ . The detailed dependencies of C are specified in the context. We write $A \sim B$ to mean $A ≲ B$ and $B ≲ A$ .

General setting and notation

In this paper, we consider problems of applying holomorphic functions f to self-adjoint operators, for example the Laplacian. The two large classes of problems treated in this paper stem from the study of fractional diffusion problems, both in the stationary as well as in the transient version. Since it does not incur additional difficulty compared to the explicit setting of Remark 1.2, we will work in the following abstract setting:

Assumption 1.1

Let $X$ be a Hilbert space and $L$ be a positive definite, self-adjoint operator on $X$ such that there exists a sequence of eigenvalues $λ_{j} > 0$ with associated eigenfunctions $v_{j} \in X$ , $j \in N_{0}$ , such that ${(v_{j})}_{j = 0}^{\infty}$ is an orthonormal basis of $X$ .

Given the eigenvalues and eigenfunctions of $L$ , we define the spaces for $β \geq 0$

\begin{matrix} H^{β} : = \{u \in X : {∥u∥}_{H^{β}} < \infty\} \quad \text{with} \quad {∥u∥}_{H^{β}}^{2} : = \sum_{j = 0}^{\infty} λ_{j}^{β} {|{(u, v_{j})}_{X}|}^{2} . \end{matrix}

1.1

Remark 1.2

The problem we have in mind for our applications is the following: given a bounded Lipschitz domain $Ω$ , we consider the space $X : = L^{2} (Ω)$ and the self adjoint operator

\begin{matrix} L u : = - div (A \nabla u) + c u, \end{matrix}

where $A \in L^{\infty} (Ω ; R^{d \times d})$ is symmetric and uniformly positive definite and $c \in L^{\infty} (Ω)$ satisfies $c \geq 0$ almost everywhere. The domain $dom (L)$ is always taken to include homogeneous Dirichlet boundary conditions. In this case, the spaces $H^{β}$ correspond to the standard (fractional) Sobolev spaces often denoted by $H^{β} (Ω)$ or ${\tilde{H}}^{β} (Ω)$ in the literature. $□$

Remark 1.3

[5] considers an even more general class of operators, namely the class of “regular accretive operators”. We expect some of the results of this article to carry over also to such a class, but since many of our proofs rely on the decomposition using real eigenvalues, such generalizations would be non-trivial. $□$

The spaces $H^{β}$ are the natural setting for our regularity assumptions on the data. If we are interested in convergence beyond root-exponential rates, we need the following class of functions of Gevrey-type.

\begin{matrix} G^{L} (C_{f}, R_{f}, ω) : = \{f \in X : {∥f∥}_{H^{ρ}} \leq C_{f} R_{f}^{ρ} (Γ (ρ + 1))^{ω} < \infty \quad \forall ρ \geq 0\} . \end{matrix}

1.2

Compared to the standard Gevrey-class of functions, these spaces also include boundary conditions for the functions $L^{n} f$ for all $n \in N$ . If the boundary conditions are met, we can then estimate

\begin{matrix} {∥f∥}_{H^{ρ}} \leq {‖ f ‖}_{X} + {‖ L^{⌈ ρ / 2 ⌉} f ‖}_{X} . \end{matrix}

Examples for such functions are those only containing a finite number of frequencies when decomposed into the eigenbasis of $L$ , but also more complex functions such as smooth bump functions with compact support are admissible (see [31, Section 1.4]).
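The first example can be checked directly from the norm in (1.1). The following short calculation is only a sketch: it assumes that f is a finite combination of eigenfunctions, and the symbols $J$, $c_{j}$ and $Λ$ are introduced here purely for illustration.

```latex
f = \sum_{j=0}^{J} c_j v_j
\quad\Longrightarrow\quad
\|f\|_{H^{\rho}}^{2}
  = \sum_{j=0}^{J} \lambda_j^{\rho}\, |c_j|^{2}
  \le \Lambda^{\rho} \sum_{j=0}^{J} |c_j|^{2}
  = \Lambda^{\rho}\, \|f\|_{X}^{2},
\qquad \Lambda := \max_{0 \le j \le J} \lambda_j .
```

Taking square roots gives ${∥f∥}_{H^{ρ}} \leq {∥f∥}_{X} (Λ^{1 / 2})^{ρ}$ for all $ρ \geq 0$ , so under these assumptions f lies in $G^{L} ({∥f∥}_{X}, Λ^{1 / 2}, 0)$ .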

One natural way of defining a functional calculus for the operator $L$ is based on the spectral decomposition.

Definition 1.4

(Spectral calculus) Let $O \subseteq R_{+}$ such that $O$ contains the spectrum of $L$ . Let $g : O \to C$ be continuous with $|g (z)| ≲ {(1 + |z|)}^{μ}$ for $μ \in R$ . We define for $u \in H^{2 μ} (Ω)$ :

\begin{matrix} g (L) u : = \sum_{j = 0}^{\infty} g (λ_{j}) (u, v_{j})_{X} v_{j} . \end{matrix}

An alternative definition for holomorphic functions, which will prove more useful for approximation, is given in the following definition. For simplicity, we restrict our considerations to decaying functions g. In this case, it can be shown (see also [3, Section 2]) that the operators resulting from Definitions 1.4 and 1.5 coincide.

Definition 1.5

(Riesz–Dunford calculus) Fix parameters $σ = 1 / 2$ or $σ = 1$ , $θ \geq 1$ and $κ > 0$ . Let $O \subseteq C$ such that $C_{+} : = \{z \in C : Re (z) > 0\} \subseteq O$ . Let $g : O \to C$ be holomorphic with $|g (z)| ≲ {(1 + |z|)}^{μ}$ for $μ < 0$ . We define

\begin{matrix} g (L) : = \frac{1}{2 π i} \int_{C} g (z) (L - z)^{- 1} d z, \end{matrix}

1.3

where the integral is taken in the sense of Riemann, and $C$ is the smooth path

\begin{matrix} C : = \{κ (cosh (σ w) + i θ sinh (w)) \quad \text{for} \quad w \in R\} . \end{matrix}

The parameter $κ > 0$ is taken sufficiently small such that $κ < λ_{0}$ , where $λ_{0}$ is the smallest eigenvalue of $L$ . The parameters $σ$ and $θ$ can be used to tweak the discretization. We have observed the best behavior for $σ : = 1 / 2$ and $θ : = 4$ ; cf. Sect. 4.

Remark 1.6

The choice of path in Definition 1.5 is somewhat arbitrary. It is only required to encircle the spectrum of $L$ with winding number 1. Throughout this paper, we will only ever use the same path and thus make it part of our definition. $□$

Remark 1.7

One could also think to allow $σ \in (0, 1)$ . For the practical application of the scheme this does not make a big difference, but the analysis for $σ \neq 1$ in this paper makes heavy use of the half-angle formula. Therefore we restrict our view to the cases $σ = 1$ or $σ = 1 / 2$ . In numerical experiments, methods with $σ \neq 1 / 2$ work, but we decided that the small difference in performance does not warrant the much greater complexity of analysis. $□$

Model problems, discretization and results

In this section, we introduce the discretization methods and, in order to ease the reading of the article, we present the most important of the convergence results. All of the sometimes very technical proofs are relegated to Sect. 3.

The main role in our discretization schemes will be played by the following coordinate transform which parametrizes the contour in Definition 1.5:

\begin{matrix} ψ_{σ, θ} (y) : = κ [cosh (\frac{σ π}{2} sinh (y)) + i θ sinh (\frac{π}{2} sinh (y))] . \end{matrix}

2.1

We will focus on the cases $σ \in \{\frac{1}{2}, 1\}$ and $θ \geq 1$ . The parameter $κ$ is again taken sufficiently small as in Definition 1.5.
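The transformation (2.1) and its derivative are all that is needed to generate quadrature nodes and weights. The following is a minimal sketch, assuming NumPy is available; the function names `psi` and `dpsi` and the default parameter values are illustrative choices, not taken from the paper.

```python
import numpy as np

def psi(y, sigma=0.5, theta=4.0, kappa=1.0):
    """Contour parametrization from (2.1); y may be a scalar or an array."""
    s = 0.5 * np.pi * np.sinh(y)
    return kappa * (np.cosh(sigma * s) + 1j * theta * np.sinh(s))

def dpsi(y, sigma=0.5, theta=4.0, kappa=1.0):
    """Derivative of psi with respect to y, obtained by the chain rule."""
    s = 0.5 * np.pi * np.sinh(y)
    ds = 0.5 * np.pi * np.cosh(y)
    return kappa * ds * (sigma * np.sinh(sigma * s) + 1j * theta * np.cosh(s))

# A few sample points on the contour for real arguments.
y = np.linspace(-2.0, 2.0, 9)
z = psi(y)
print(z)
```

For real y the points `psi(y)` lie on the contour of Definition 1.5 with $w = \frac{π}{2} sinh (y)$ , the contour crosses the real axis at $κ$ for $y = 0$ , and `psi(-y)` is the complex conjugate of `psi(y)`.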

Using this transformation, we can introduce the double exponential quadrature approximation of the Riesz–Dunford calculus in Definition 1.5. Because the discretization by quadrature will appear repeatedly for different functions and operators, we introduce the following notation:

Definition 2.1

Let $O \subseteq C$ such that $C_{+} \subseteq O$ . For $g : O \to C$ holomorphic as in Definition 1.5, $k > 0$ and $N_{q} \in N \cup \{\infty\}$ we write for all $u \in X$

\begin{matrix} Q^{L} (g, N_{q}) u : = \frac{k}{2 π i} \sum_{j = - N_{q}}^{N_{q}} g (ψ_{σ, θ} (j k)) ψ_{σ, θ}^{'} (j k) {(L - ψ_{σ, θ} (j k))}^{- 1} u \end{matrix}

2.2

and $Q^{L} (g) : = Q^{L} (g, \infty)$ for the case where no cutoff is performed. The quadrature error will be denoted by

\begin{matrix} E^{L} (g, N_{q}) u : = g (L) u - Q^{L} (g, N_{q}) u, \quad \forall u \in dom (g (L)) \end{matrix}

2.3

where $g (L)$ is given via the Riesz–Dunford integral of Definition 1.5. Again, we write $E^{L} (g) : = E^{L} (g, \infty)$ .

Remark 2.2

In Definition 2.1, we will often work with the special case $L = λ$ . This is taken to mean the scalar multiplication operator $u \mapsto λ u$ on the vector space $X$ . $□$
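The scalar case of Remark 2.2 is convenient for experimenting with the quadrature. The sketch below applies the trapezoidal rule to the parametrized contour integral, with the step size k as trapezoidal weight (as in the sums $k \sum g (k n)$ of Proposition 3.2). The function name `de_quadrature_scalar` and the values of $λ$ , $κ$ , k and $N_{q}$ are assumptions chosen for illustration only.

```python
import numpy as np

def psi(y, sigma, theta, kappa):
    s = 0.5 * np.pi * np.sinh(y)
    return kappa * (np.cosh(sigma * s) + 1j * theta * np.sinh(s))

def dpsi(y, sigma, theta, kappa):
    s = 0.5 * np.pi * np.sinh(y)
    ds = 0.5 * np.pi * np.cosh(y)
    return kappa * ds * (sigma * np.sinh(sigma * s) + 1j * theta * np.cosh(s))

def de_quadrature_scalar(g, lam, k, n_q, sigma=0.5, theta=4.0, kappa=0.5):
    """Trapezoidal approximation of g(lam) for the scalar operator u -> lam * u."""
    y = k * np.arange(-n_q, n_q + 1)
    z = psi(y, sigma, theta, kappa)
    dz = dpsi(y, sigma, theta, kappa)
    # (lam - z)^{-1} plays the role of the resolvent (L - z)^{-1}.
    return k / (2j * np.pi) * np.sum(g(z) * dz / (lam - z))

beta, lam = 0.5, 2.0           # kappa = 0.5 is smaller than lam, as required
approx = de_quadrature_scalar(lambda z: z ** (-beta), lam, k=0.05, n_q=100)
exact = lam ** (-beta)
print(abs(approx - exact))
```

The nodes stay in the right half-plane for real y, so the principal branch of the complex power can be used for $z^{- β}$ .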

We apply the function to the following problems:

(i) $g (z) : = z^{- β}$ : This corresponds to solving an elliptic fractional diffusion problem; see Sect. 2.1 for the model problem and Sect. 3.2 for the proofs.
(ii) $g (z) : = e_{γ, μ} (- t^{α} z^{β})$ , with $e_{γ, μ}$ the Mittag–Leffler function: This corresponds to a parabolic model problem; see Sects. 2.2 and 3.3.

For both model problems, we prove two convergence results, depending on the regularity of the data. In the case of “finite regularity”, the data (f or $u_{0}$ ) are assumed to be in a space $H^{2 ρ}$ for some $ρ > 0$ . This results in bounds of root-exponential order $\sqrt{N_{q} / ln (N_{q})}$ .

The second case is the one where the data are in the Gevrey-type classes $G^{L}$ introduced in (1.2). For such functions, the double-exponential discretization leads to an improved convergence of the form $O (e^{- \frac{γ N_{q}}{ln (N_{q})}})$ .

The elliptic problem

As our first model problem, we consider the following elliptic fractional diffusion problem:

\begin{matrix} \text{Given } f \in L^{2} (Ω) \text{ and } β > 0, \text{ find } u \in dom (L^{β}) \text{ such that } L^{β} u = f . \end{matrix}

2.4

Using the Riesz–Dunford formula, this is equivalent to computing

\begin{matrix} u & = \frac{1}{2 π i} \int_{C} z^{- β} (L - z)^{- 1} f d z . \end{matrix}

In order to get a discrete scheme, we replace the integral with the quadrature formula. Given $N_{q} \in N$ and $k > 0$ , the approximation to (2.4) is then given by

\begin{matrix} u_{k} : = Q^{L} (z^{- β}, N_{q}) f . \end{matrix}

Remark 2.3

Since the solution operator ${(L - z)}^{- 1}$ is not computable in practice, one would in addition replace ${(L - z)}^{- 1}$ by a Galerkin solver in order to obtain a fully computable scheme. In the setting of Remark 1.2, this means the following: given a closed subspace $V_{h} \subseteq H_{0}^{1} (Ω)$ , the discrete resolvent $R_{h} (z) : L^{2} (Ω) \to V_{h}$ is defined via the solution of the Galerkin problem

\begin{matrix} R_{h} (z) f : = u_{h}, \quad \text{with} \quad (A \nabla u_{h}, \nabla v_{h})_{L^{2} (Ω)} + ((c - z) u_{h}, v_{h})_{L^{2} (Ω)} = (f, v_{h})_{L^{2} (Ω)} \quad \forall v_{h} \in V_{h} . \end{matrix}

Given discretization parameters $V_{h} \subseteq H_{0}^{1} (Ω)$ , $N_{q} \in N$ and $k > 0$ , the fully discrete approximation to (2.4) is then given by

\begin{matrix} u_{h, k} : = \frac{k}{2 π i} \sum_{j = - N_{q}}^{N_{q}} (ψ_{σ, θ} (j k))^{- β} ψ_{σ, θ}^{'} (j k) R_{h} (ψ_{σ, θ} (j k)) f . \end{matrix}

2.5

In order to keep presentation to a reasonable length, we focus on the spatially continuous setting. We only remark that discretization in space can be easily incorporated into the analysis. For low order finite elements one can follow [3]; for an exponentially convergent hp-FEM scheme we refer to [26]. $□$

Remark 2.4

We should point out that for the elliptic problem, there exist methods based on the Balakrishnan formula (see also Sect. 4) which do not require complex arithmetic. On the other hand, since we are only approximating real valued functions, we can exploit the symmetry of (2.2) to only solve for $j \geq 0$ , thus halving the number of linear systems. This results in (roughly) comparable computational effort for both the Balakrishnan and the double exponential schemes. Due to their better convergence the DE-schemes might therefore still be advantageous. $□$
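Remarks 2.3 and 2.4 can be combined into a short experiment. The sketch below is not the paper's finite element setting: it uses a finite difference matrix for the one-dimensional Dirichlet Laplacian as a stand-in for the Galerkin resolvent, and it assumes a real symmetric matrix and real data so that only the nodes with $j \geq 0$ need to be solved. The step size k is used as trapezoidal weight, and all names and parameter values are illustrative assumptions.

```python
import numpy as np

def psi(y, sigma, theta, kappa):
    s = 0.5 * np.pi * np.sinh(y)
    return kappa * (np.cosh(sigma * s) + 1j * theta * np.sinh(s))

def dpsi(y, sigma, theta, kappa):
    s = 0.5 * np.pi * np.sinh(y)
    ds = 0.5 * np.pi * np.cosh(y)
    return kappa * ds * (sigma * np.sinh(sigma * s) + 1j * theta * np.cosh(s))

def fractional_solve_de(A, f, beta, k, n_q, sigma=0.5, theta=4.0, kappa=1.0):
    """Approximate A^{-beta} f for real symmetric positive definite A, real f."""
    n = A.shape[0]
    identity = np.eye(n)
    acc = np.zeros(n)
    for j in range(n_q + 1):
        z = psi(j * k, sigma, theta, kappa)
        dz = dpsi(j * k, sigma, theta, kappa)
        # One independent shifted linear solve per quadrature node.
        term = z ** (-beta) * dz * np.linalg.solve(A - z * identity, f)
        # The terms for -j and +j combine to 2i * Im(term); j = 0 counts once.
        acc += (0.5 if j == 0 else 1.0) * term.imag
    return k / np.pi * acc

# Stand-in operator: finite differences for -u'' on (0, 1), Dirichlet conditions.
n = 30
h = 1.0 / (n + 1)
A = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2
f = np.ones(n)
beta = 0.5

u_de = fractional_solve_de(A, f, beta, k=0.05, n_q=100)

# Reference via the spectral calculus of Definition 1.4.
w, V = np.linalg.eigh(A)
u_ref = V @ (w ** (-beta) * (V.T @ f))
rel_err = np.linalg.norm(u_de - u_ref) / np.linalg.norm(u_ref)
print(rel_err)
```

Here $κ = 1$ lies below the smallest eigenvalue of the stand-in matrix (close to $π^{2}$ ), and the loop performs $N_{q} + 1$ solves instead of $2 N_{q} + 1$ .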

The convergence of the new method can be summarized in the following two theorems.

Theorem 2.5

Let u be the exact solution to (2.4) and assume $f \in H^{2 ρ} (Ω)$ for some $ρ \geq 0$ . Let $β \geq \bar{β}$ with $\bar{β} \in (0, 1]$ and $u_{k} : = Q^{L} (z^{- β}, N_{q}) f$ denote the approximation computed using stepsize $k > 0$ and $N_{q} \in N$ quadrature points. Then, the following estimate holds for all $ε \geq 0$ and $r \in [0, \bar{β} / 2]$ :

\begin{matrix} {∥u - u_{k}∥}_{H^{2 r}} & = {∥E^{L} (z^{- β}, N_{q}) f∥}_{H^{2 r}} \\ & ≲ e^{- \frac{[p (σ, θ) - ε] \sqrt{β + ρ - r}}{\sqrt{k}}} {∥f∥}_{H^{2 ρ}} + [exp (- \frac{γ}{k}) + exp (- γ e^{k N_{q}})] {∥f∥}_{X}, \end{matrix}

where the rate $p (σ, θ)$ is given by

\begin{matrix} p (σ, θ) : = \begin{cases} 2 \sqrt{2 π {tan}^{- 1} (θ)} & \text{for } σ = 1, \\ 2 π & \text{for } σ = 1 / 2 . \end{cases} \end{matrix}

2.6

For $ε > 0$ , the implied constant and $γ$ may depend on $ε, r$ , the smallest eigenvalue $λ_{0}$ of $L$ , $\bar{β}$ , $κ$ , $θ$ and $σ$ . But they are independent of $ρ$ , $β$ , k, and f. If $ε = 0$ , the constants may in addition depend on $ρ$ and $β$ .
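The rate (2.6) is easy to tabulate. The following sketch assumes that ${tan}^{- 1}$ denotes the arctangent; the helper name `rate_p` is hypothetical.

```python
import math

def rate_p(sigma, theta):
    """Rate p(sigma, theta) from (2.6); tan^{-1} is read as the arctangent."""
    if sigma == 1:
        return 2.0 * math.sqrt(2.0 * math.pi * math.atan(theta))
    if sigma == 0.5:
        return 2.0 * math.pi
    raise ValueError("(2.6) only covers sigma = 1 and sigma = 1/2")

for theta in (1.0, 4.0, 16.0):
    print(theta, rate_p(1, theta), rate_p(0.5, theta))
```

Under this reading, $p (1, 1) = \sqrt{2} π$ , and $p (1, θ)$ increases towards $2 π$ as $θ$ grows, which is the value attained for $σ = 1 / 2$ .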

Remark 2.6

When comparing Theorem 2.5 to the estimates of the standard $sinc$ -quadrature, one might think that the double exponential method is inferior due to the $\sqrt{k}$ vs. $k$ behavior. This misconception can be cleared up by considering the better decay properties of the double-exponential formula. It allows one to choose $k \sim ln (N_{q}) / N_{q}$ compared to the standard $sinc$ -quadrature choice of $k \sim N_{q}^{- 1 / 2}$ without the cutoff error becoming dominant. Using this choice, the exponent scales like $\sqrt{N_{q} / ln (N_{q})}$ for double exponential and $\sqrt{N_{q}}$ for standard sinc quadrature, respectively. As is shown in Sect. 4, the better constants in the exponential still often outweigh the presence of the $ln$ -term for the double-exponential quadrature. $□$
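To see how the step size choice enters the three terms of Theorem 2.5, one can substitute $k = c ln (N_{q}) / N_{q}$ . The constant $c > 0$ is a hypothetical tuning constant introduced here only for illustration.

```latex
k = \frac{c \ln N_q}{N_q}
\quad\Longrightarrow\quad
\frac{1}{\sqrt{k}} = \sqrt{\frac{N_q}{c \ln N_q}},
\qquad
\frac{1}{k} = \frac{N_q}{c \ln N_q},
\qquad
e^{k N_q} = e^{c \ln N_q} = N_q^{\,c},
```

```latex
e^{-\frac{p(\sigma,\theta)\sqrt{\beta+\rho-r}}{\sqrt{k}}}
  = \exp\Big(-\tfrac{p(\sigma,\theta)\sqrt{\beta+\rho-r}}{\sqrt{c}}
      \sqrt{\tfrac{N_q}{\ln N_q}}\Big),
\qquad
e^{-\gamma/k} = \exp\Big(-\tfrac{\gamma}{c}\,\tfrac{N_q}{\ln N_q}\Big),
\qquad
e^{-\gamma e^{k N_q}} = \exp\big(-\gamma N_q^{\,c}\big).
```

With this choice the cutoff term decays like $exp (- γ N_{q}^{c})$ , so for $c \geq 1$ it is at least exponentially small in $N_{q}$ and does not limit the first term.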

Remark 2.7

For most of the computation, the convergence rate is determined by the factor $p (σ, θ)$ in Corollary 3.8. We observe that for $θ = 1$ , picking $σ = 1 / 2$ roughly doubles the convergence rate. Similarly, it often appears beneficial to pick larger values of $θ$ . Especially for $σ = 1$ , we get an asymptotic rate for $θ \to \infty$ , which is the same as in the case of $σ = 1 / 2$ . But we need to point out that increasing $θ$ means that we have to decrease the value $d (θ)$ , which determines the rate in the higher-order terms of the form $e^{- γ / k}$ , thus leading to those terms dominating in a larger and larger preasymptotic regime. Overall, the method using $σ = 1 / 2$ and setting $θ$ moderately large is expected to give the best convergence rates; cf. Sect. 4. $□$

The previous theorem shows that in general, the convergence behaves like $O (e^{- \frac{γ}{\sqrt{k}}})$ . It also shows that, if the function f in the right-hand side has some additional smoothness, the method automatically detects this and delivers an improved convergence rate. If the additional smoothness is in the right Gevrey type classes, we can establish convergence which is beyond the root exponential behavior. The details can be found in the following theorem:

Theorem 2.8

Let u be the exact solution to (2.4) and assume that there exist constants $C_{f}, ω, R_{f} > 0$ such that $f \in G^{L} (C_{f}, R_{f}, ω)$ , i.e.,

\begin{matrix} {∥f∥}_{H^{ρ}} & \leq C_{f} R_{f}^{ρ} (Γ (ρ + 1))^{ω} < \infty \forall ρ \geq 0 . \end{matrix}

Assume that $β > \bar{β}$ with $\bar{β} \in (0, 1]$ . Let $u_{k} : = Q^{L} (z^{- β}, N_{q}) f$ denote the approximation computed using stepsize $k \in (0, 1 / 2)$ and $N_{q} \in N$ quadrature points. Then, the following estimate holds:

\begin{matrix} {∥u - u_{k}∥}_{H^{\bar{β}}} = {∥E^{L} (z^{- β}, N_{q}) f∥}_{H^{\bar{β}}} ≲ C_{f} exp (- \frac{γ}{k |ln (k)|}) + C_{f} exp (- γ e^{k N_{q}}) . \end{matrix}

The implied constant and $γ$ may depend on $ω$ , the smallest eigenvalue $λ_{0}$ of $L$ , $κ$ , $θ$ , $σ$ , $R_{f}$ and $\bar{β}$ . If $ω = 0$ , the logarithmic term may be removed.

The parabolic problem

The second model problem we consider is a time-dependent fractional diffusion problem of parabolic type. We fix $α, β \in (0, 1]$ and a final time $T > 0$ . Given an initial condition $u_{0} \in X$ and right-hand side $f \in C ([0, T], X)$ we seek $u : [0, T] \to dom (L^{β})$ satisfying

\begin{matrix} \partial_{t}^{α} u + L^{β} u = f \quad \text{in } [0, T], \quad u (t) \in dom (L^{β}) \quad \forall t > 0, \quad \text{and} \quad u (0) = u_{0}, \end{matrix}

2.7

where $\partial_{t}^{α}$ denotes the Caputo fractional derivative. Following [4], the solution u can be written using the Mittag–Leffler function $e_{α, μ}$ (see (3.18)) as

\begin{matrix} u (t) & : = e_{α, 1} (- t^{α} L^{β}) u_{0} + \int_{0}^{t} τ^{α - 1} e_{α, α} (- τ^{α} L^{β}) f (t - τ) d τ . \end{matrix}

2.8

Here we again use either the spectral or, equivalently, the Riesz–Dunford calculus to define the operators. We discretize this problem by using our double exponential formula. Namely for $k > 0$ and using $N_{q} \in N$ quadrature points,

\begin{matrix} u_{k} (t) & : = Q^{L} (e_{α, 1} (- t^{α} z^{β}), N_{q}) u_{0} + \int_{0}^{t} τ^{α - 1} Q^{L} (e_{α, α} (- τ^{α} z^{β}), N_{q}) f (t - τ) d τ . \end{matrix}

2.9
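The representation (2.8) and its discretization (2.9) require evaluating Mittag–Leffler functions. The precise definition is deferred to (3.18), which lies outside this section, so the sketch below assumes the standard two-parameter power series $\sum_{n \geq 0} x^{n} / Γ (α n + μ)$ . It is only meant for real arguments of moderate size; the function name and the truncation length are illustrative assumptions, and the evaluation at complex quadrature nodes would need a more robust algorithm.

```python
import math

def mittag_leffler_series(alpha, mu, x, n_terms=200):
    """Truncated series sum_n x^n / Gamma(alpha*n + mu) for real x.

    Assumes the standard two-parameter definition; reliable only for
    moderate |x|, since the alternating series suffers from cancellation.
    """
    if x == 0.0:
        return 1.0 / math.gamma(mu)
    log_abs_x = math.log(abs(x))
    total = 0.0
    for n in range(n_terms):
        # Work with logarithms so that Gamma never overflows.
        log_term = n * log_abs_x - math.lgamma(alpha * n + mu)
        sign = -1.0 if (x < 0.0 and n % 2 == 1) else 1.0
        total += sign * math.exp(log_term)
    return total

print(mittag_leffler_series(1.0, 1.0, -2.0), math.exp(-2.0))
print(mittag_leffler_series(2.0, 1.0, -1.0), math.cos(1.0))
```

The two printed pairs use the classical special cases in which the series reduces to the exponential function ( $α = μ = 1$ ) and to the cosine ( $α = 2$ , $μ = 1$ , argument $- x^{2}$ ).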

Remark 2.9

In practice, in order to get a fully computable discrete scheme, one would again replace the resolvent by a Galerkin solver and the convolution in time by an appropriate numerical quadrature. For example, [4] presents a low order approximation scheme. In order to retain exponential convergence, [26] uses a scheme based on hp-FEM and hp-quadrature. We summarize the construction briefly. For a given degree $p \in N_{0}$ , and interval I, we denote the Gauss quadrature points and weights on $(- 1, 1)$ by $(x_{j}^{I, p}, w_{j}^{I, p}) \in I \times R_{+}$ , $j = 0, \dots, p$ . See [11, Section 2.7] for details. We then consider a geometric mesh on (0, 1) with grading factor $σ \in (0, 1)$ and parameter $L \in N$ , $L \leq p$ given by

\begin{matrix} K_{0} : = (0, σ^{L}), K_{1} : = (σ^{L}, σ^{L - 1}), \dots, K_{L} : = (σ, 1) . \end{matrix}

On each one of these elements, we apply a Gauss quadrature, reducing the order as we approach the singularity, i.e., we get the nodes and weights as

\begin{matrix} (X, W) : = ⋃_{ℓ = 0}^{L} \{(x_{j}^{K_{ℓ}, p - L + ℓ}, w_{j}^{K_{ℓ}, p - L + ℓ}) : j = 0, \dots, p\} . \end{matrix}

The convolution in (2.8) is then replaced by

\begin{matrix} \int_{0}^{t} τ^{α - 1} e_{α, α} (- τ^{α} z^{β}) f (t - τ) d τ & \approx t \sum_{(x_{j}, w_{j}) \in (X, W)} w_{j} {(t x_{j})}^{α - 1} e_{α, α} (- {(t x_{j})}^{α} z^{β}) f (t - t x_{j}) \\ & = : h_{t} (z) . \end{matrix}

In order to get a fully discrete scheme, this function is then discretized using the double exponential quadrature scheme:

\begin{matrix} \int_{0}^{t} τ^{α - 1} e_{α, α} (- τ^{α} L^{β}) f (t - τ) d τ \approx Q^{L} (h_{t}, N_{q}) . \end{matrix}

In order to not overwhelm the presentation of the paper, we do not consider these types of discretization errors. The analysis of such errors could be taken almost verbatim from the references [3, 26]. $□$
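The composite rule described in Remark 2.9 can be sketched as follows. The code assumes that a rule of degree q on an element uses q + 1 Gauss–Legendre points, mapped affinely from $(- 1, 1)$ ; the function name, the variable `grading` (the grading factor of the mesh) and the test integrand are illustrative assumptions.

```python
import numpy as np

def geometric_gauss_rule(p, n_levels, grading=0.2):
    """Composite Gauss rule on (0, 1), geometrically refined towards 0.

    Element l uses a rule of degree p - n_levels + l, so the order is
    reduced as the elements approach the singularity at 0.
    """
    if p < n_levels:
        raise ValueError("need n_levels <= p")
    # Breakpoints 0 < grading^L < grading^(L-1) < ... < grading < 1.
    breaks = [0.0] + [grading ** (n_levels - l) for l in range(n_levels + 1)]
    nodes, weights = [], []
    for l in range(n_levels + 1):
        a, b = breaks[l], breaks[l + 1]
        degree = p - n_levels + l
        x_ref, w_ref = np.polynomial.legendre.leggauss(degree + 1)
        nodes.append(0.5 * (b - a) * x_ref + 0.5 * (a + b))
        weights.append(0.5 * (b - a) * w_ref)
    return np.concatenate(nodes), np.concatenate(weights)

# Weakly singular test integral: int_0^t tau^(alpha - 1) dtau = t^alpha / alpha.
alpha, t = 0.5, 2.0
x, w = geometric_gauss_rule(p=12, n_levels=12)
approx = t * np.sum(w * (t * x) ** (alpha - 1.0))
exact = t**alpha / alpha
print(abs(approx - exact))
```

The same nodes and weights are used for every contour point z, since the rule depends only on the mesh parameters and not on the integrand.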

The analysis of the method again comes in the form of two theorems, one for the case of finite regularity and one for regularity in the Gevrey-type classes $G^{L} (C_{f}, R_{f}, ω)$ .

Theorem 2.10

Assume that either $α + β < 2$ or $σ = 1$ (i.e., the case $α = β = 1$ and $σ = 1 / 2$ is not allowed). Let u be the solution to (2.7). Assume $u_{0} \in H^{2 ρ}$ for some $ρ > 0$ , and $f \in C^{m} ([0, T], H^{2 ρ})$ for some $m \in N$ . Let $u_{k}$ be the corresponding discretization using stepsize $k > 0$ and $N_{q} \in N$ quadrature points as defined in (2.9).

Then, the following estimate holds for all $t \in (0, T)$ , $r \in [0, β / 2]$ and any $q < 1$ :

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{2 r}} ≲ & max (t^{- m - q α}, T^{α}) ({∥u_{0}∥}_{H^{2 ρ}} + \sum_{j = 0}^{m} max_{τ \leq t} {‖ f^{(j)} (τ) ‖}_{H^{2 ρ}}) \\ & \times (e^{- \frac{min \{p (σ, θ) \sqrt{β + ρ - r}, γ_{1} \sqrt{m / α + q}\}}{\sqrt{k}}} + e^{- γ / k} + e^{- γ e^{k N_{q}}}), \end{matrix}

where $p (σ, θ)$ is as in (2.6) and $γ_{1}$ is the constant from Corollary 3.5. The implied constant and $γ$ may depend on q, r, the smallest eigenvalue $λ_{0}$ of $L$ , $β$ , $α$ , $κ$ , $θ$ , $σ$ and $ρ$ .

Theorem 2.11

Assume that either $α + β < 2$ or $σ = 1$ (i.e., the case $α = β = 1$ and $σ = 1 / 2$ is not allowed). Let u be the solution to (2.7), and assume that the data satisfy

\begin{matrix} {∥u_{0}∥}_{H^{ρ}} & \leq C_{u_{0}} R_{u_{0}}^{ρ} (Γ (ρ + 1))^{ω} < \infty \quad \forall ρ \geq 0, \\ ‖ f^{(n)} (t) ‖_{H^{ρ}} & \leq C_{f} R_{f}^{ρ + n} (Γ (ρ + 1))^{ω} (n!)^{ω} < \infty \quad \forall t \in [0, T], \forall ρ \geq 0, \forall n \in N_{0} . \end{matrix}

2.10

Let $u_{k}$ be the corresponding discretization using stepsize $k \in (0, 1 / 2)$ and $N_{q} \in N$ quadrature points as defined in (2.9). Then, the following estimate holds:

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{β}} & ≲ (1 + t) exp (- \frac{γ}{k |ln t_{⋆}| |ln k|}) + (t + t^{- γ / 2}) exp (- γ e^{k N_{q}}) \end{matrix}

with $t_{⋆} : = min (t, 1 / 2)$ . The implied constant and $γ$ may depend on the smallest eigenvalue $λ_{0}$ of $L$ , $β$ , $θ$ , $σ$ and the constants from (2.10).

Error analysis

In this section, we analyze the quadrature error when applying a double exponential formula for discretizing certain integrals.

For $θ \geq 1$ , $δ > 0$ we define the sets

\begin{matrix} D_{d (θ)} : = \{z \in C : |Im (z)| < d (θ)\}, \quad \text{and} \quad D_{δ}^{exp} : = \{z \in C : |Im (z)| < δ e^{- |Re (z)|}\}, \end{matrix}

3.1

where for each $θ$ , $d (θ)$ is a constant which is assumed sufficiently small in order for Lemmas A.3, A.4, and A.8 to hold.

Since all the proofs analyzing the properties of $ψ_{σ, θ}$ are elementary but somewhat lengthy and cumbersome, they have been relegated to Appendix A. The most important properties are that $y \mapsto ψ_{σ, θ} (y)$ for $y \in R$ traces the contour in the definition of the Riesz–Dunford calculus (see Definition 1.5), and that it is analytic in $D_{d (θ)}$ . The other important results concern the points where $ψ_{σ, θ}$ crosses the real axis, as these points correspond to (possible) poles in the integrand of Definition 1.5. The location of these points, as well as other important estimates, are collected in Lemma A.8. Roughly summarizing, the finitely many points y satisfying $ψ_{σ, θ} (y) = λ$ have a distance of order $1 / ln (λ)$ from the real axis. Away from such points $|ψ_{σ, θ} (y) - λ| ≳ λ$ holds, and for $y \to \pm \infty$ the function $ψ_{σ, θ}$ grows doubly exponentially (Lemma A.4).

Abstract analysis of $sinc$ -quadrature

In this section, we collect some results on $sinc$ -quadrature formulas.

Remark 3.1

As is common in the literature, we define the $sinc$ function as

\begin{matrix} sinc (ζ) : = \begin{cases} \frac{sin (π ζ)}{π ζ} & ζ \neq 0, \\ 1 & ζ = 0 . \end{cases} \end{matrix}

The following result is the main work-horse when analyzing $sinc$ -quadrature schemes. In order to reduce the required notation, we use a simplified version of [32, Problem 3.2.6].

Proposition 3.2

(Bialecki, see [32, Problem 3.2.6 and Theorem 3.1.9]) We make the following assumptions on g:

(i) g is a meromorphic function on the infinite strip $D_{d (θ)}$ . It is also continuous on $\partial D_{d (θ)}$ . The poles $(p_{ℓ})_{ℓ = 1}^{N_{p}}$ are all simple and located in $D_{d (θ)} \setminus R$ .
(ii) There exists a constant $C > 0$ independent of $y \in R$ such that for sufficiently large $y > 0$ ,
$\begin{matrix} \int_{- d (θ)}^{d (θ)} |g (y + i w)| d w \leq C . \end{matrix}$ (3.2)
(iii) We have
$\begin{matrix} N (g, D_{d (θ)}) : = \int_{- \infty}^{\infty} |g (y + i d (θ))| + |g (y - i d (θ))| d y < \infty . \end{matrix}$ (3.3)

Denote by $res (g ; p_{ℓ})$ the residue of g at $p_{ℓ}$ , and define $γ (k ; p_{ℓ}) : = \frac{1}{sin (π p_{ℓ} / k)} .$

Then for all $k > 0$ , using $s_{ℓ} : = sign (Im (p_{ℓ}))$ :

\begin{matrix} | \int_{R} g (t) d t - k \sum_{n = - \infty}^{\infty} g (k n) - π \sum_{ℓ = 1}^{N_{p}} e^{i \frac{s_{ℓ} π p_{ℓ}}{k}} res (g ; p_{ℓ}) γ (k ; p_{ℓ}) | \\ \leq \frac{e^{- 2 π d (θ) / k}}{1 - e^{- 2 π d (θ) / k}} N (g, D_{d (θ)}) . \end{matrix}

3.4
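The exponential factor $e^{- 2 π d / k}$ in (3.4) can be observed on a simple stand-in integrand. The sketch below is not taken from the paper: it uses $g (t) = 1 / cosh (t)$ , whose integral over the real line equals $π$ and whose poles closest to the real axis sit at $\pm i π / 2$ , so that any strip half-width $d < π / 2$ contains no poles and the residue sum in (3.4) is empty. The function names and the truncation rule are illustrative assumptions.

```python
import numpy as np

def sech(t):
    return 1.0 / np.cosh(t)

def trapezoid_on_line(g, k, n_q):
    """Truncated trapezoidal sum k * sum_{n=-n_q}^{n_q} g(k n)."""
    n = np.arange(-n_q, n_q + 1)
    return k * np.sum(g(k * n))

def trapezoid_error(k):
    # Truncate where sech is far below machine precision (|t| >= 40).
    n_q = int(np.ceil(40.0 / k))
    return abs(trapezoid_on_line(sech, k, n_q) - np.pi)

for k in (1.0, 0.5, 0.25):
    # Second column: observed error; third: leading term with d = pi / 2.
    print(k, trapezoid_error(k), 4.0 * np.pi * np.exp(-np.pi**2 / k))
```

By Poisson summation the error of the untruncated sum is $2 π \sum_{m \geq 1} 1 / cosh (π^{2} m / k)$ , whose leading term behaves like $4 π e^{- π^{2} / k} = 4 π e^{- 2 π d / k}$ with $d = π / 2$ , in line with the decay predicted by the right-hand side of (3.4).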

Proposition 3.2 requires certain decay properties for the integrand in a complex strip, and thus is not always applicable. As is shown in Appendix A, the transformation $ψ_{σ, θ}$ maps partly into the left-half plane. One can even show that the real part changes sign infinitely many times when evaluating along a line of fixed imaginary part. If we therefore consider the case when $f (z) : = e^{- z}$ is the exponential function, this means that $f \circ ψ$ is exponentially increasing in such regions. This puts showing estimates of the form required in Proposition 3.2 (iii) out of reach.

On the other hand, Lemma A.5 shows that for $σ = 1$ , restricted to the domain $D_{δ}^{exp}$ , the map $ψ_{σ, θ}$ stays in the right half-plane. Here the exponential function is decreasing. Similarly, the Mittag–Leffler function $e_{α, μ}$ is decreasing on slightly larger sectors, allowing for the choice of $σ = 1 / 2$ if $α < 1$ . This motivates the following modification of Proposition 3.2.

Lemma 3.3

Assume that $g : D_{δ}^{exp} \to C$ is holomorphic and is doubly-exponentially decreasing, i.e., there exist constants $C_{g} > 0$ , $μ_{g} > 0,$ such that g satisfies

\begin{matrix} |g (y)| \leq C_{g} exp (- μ_{g} e^{Re (y)}) \forall y \in D_{δ}^{exp} . \end{matrix}

3.5

Then, for all sufficiently small $ε > 0$ , there exists a constant $C > 0$ which is independent of k, $μ_{g}$ and g such that the following error estimate holds:

\begin{matrix} |\int_{R} g (t) d t - k \sum_{n = - \infty}^{\infty} g (k n)| & \leq C C_{g} k exp (- \sqrt{8 π δ} \frac{\sqrt{μ_{g} - 2 ε}}{\sqrt{k}}) . \end{matrix}

3.6

Proof

We closely follow the proof of [21, Theorem 2.13], but picking a different contour and later exploiting the strong decay properties of g.

For $N \in N$ , set $R_{N} : = {y \in C : |Re (y)| \leq (N + \frac{1}{2}), |Im (y)| \leq δ e^{- |Re (y)|}}$ . For fixed $t \in R$ , we fix N large enough such that $t \in R_{N}$ . By applying the residue theorem to the function

\begin{matrix} h (y) : = \frac{sin (π t / k) g (y)}{(t - y) sin (π y / k)}, \end{matrix}

one can show the equality

\begin{matrix} g (t) - \sum_{n = - N}^{N} g (n k) sinc (\frac{t - n k}{k}) & = \int_{\partial R_{N}} \frac{sin (π t / k) g (y)}{(t - y) sin (π y / k)} d y . \end{matrix}

Since asymptotically g(t) decreases doubly exponentially, while $1 / sin (π y / k)$ only grows exponentially along the path ${(ξ, δ e^{- ξ}), ξ \in R}$ , we can pass to the limit $N \to \infty$ to get the representation

\begin{matrix} g (t) - \sum_{n = - \infty}^{\infty} g (k n) sinc (\frac{t - n k}{k}) & = \int_{\partial D_{δ}^{exp}} \frac{sin (π t / k) g (y)}{(t - y) sin (π y / k)} d y . \end{matrix}

3.7

Integrating (3.7) over $R$ and exchanging the order of integration gives:

\begin{matrix} \int_{R} g (t) d t - k \sum_{n = - \infty}^{\infty} g (k n) & = \int_{\partial D_{δ}^{exp}} \frac{g (y)}{sin (π y / k)} \int_{R} \frac{sin (π t / k)}{t - y} d t d y \\ & = π \int_{\partial D_{δ}^{exp}} \frac{g (y)}{sin (π y / k)} e^{\frac{i sign (Im (y)) π y}{k}} d y, \end{matrix}

3.8

where in the last step we invoked [21, Lemma 2.19] to evaluate the inner integral explicitly. It remains to bound the integral on the right-hand side. For simplicity, we focus on the upper-right quadrant, where $Re (y) > 0$ and $Im (y) > 0$ . The other cases follow analogously. There, we can parameterize $\partial D_{δ}^{exp}$ as $y = ξ + i δ e^{- ξ}$ . We estimate

\begin{matrix} |\frac{g (y)}{sin (π y / k)} e^{\frac{i sign (Im (y)) π y}{k}}| & = | g (y) | \frac{| e^{\frac{i π y}{k}} |}{| e^{i π y / k} - e^{- i π y / k} |} \\ & = |g (y)| \frac{exp (- π δ \frac{e^{- ξ}}{k})}{exp (π δ \frac{e^{- ξ}}{k}) - exp (- π δ \frac{e^{- ξ}}{k})} \\ & = |g (y)| \frac{exp (- 2 π δ \frac{e^{- ξ}}{k})}{1 - exp (- 2 π δ \frac{e^{- ξ}}{k})} \\ & ≲ |g (y)| k e^{ξ} exp (- 2 π δ \frac{e^{- ξ}}{k}) \\ & \overset{(3.5)}{≲} C_{g} k exp (- μ_{g} e^{ξ} + ξ - \frac{2 π δ e^{- ξ}}{k}) \end{matrix}

3.9

For $ε > 0$ , we can absorb the linear $ξ$ -term into the first exponential, and estimate:

\begin{matrix} (3.9) & ≲ ε^{- 1} C_{g} k exp (- (μ_{g} - 2 ε) e^{ξ} - \frac{2 π δ e^{- ξ}}{k}) exp (- ε e^{ξ}) \end{matrix}

where the second factor will be used to regain integrability, whereas the first one determines the approximation quality. For $ξ = 0$ and $ξ \to \infty$ , we get sufficient bounds to prove (3.6). We thus have to look for maxima of the first factor with respect to $ξ$ in the interval $(0, \infty)$ . Due to monotonicity of the exponential, we focus on its argument and set $τ : = e^{ξ} .$ By setting the derivative to zero, we get that the map

\begin{matrix} τ \mapsto - (μ_{g} - 2 ε) τ - \frac{2 π δ}{τ k} is maximized for τ_{max} = \sqrt{\frac{2 δ π}{k (μ_{g} - 2 ε)}} . \end{matrix}
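The optimization step can be double-checked numerically. The following Python sketch compares the closed-form maximizer $τ_{max}$ and the resulting value $- \sqrt{8 π δ} \sqrt{(μ_{g} - 2 ε) / k}$ with a brute-force grid search. The numerical values of `mu_eff` (standing for $μ_{g} - 2 ε$), `delta` and `k` are hypothetical and chosen only for illustration.

```python
import math

def exponent(tau, mu_eff, delta, k):
    """Exponent -(mu_g - 2 eps) * tau - 2 pi delta / (tau k) as a function of tau = e^xi."""
    return -mu_eff * tau - 2.0 * math.pi * delta / (tau * k)

# Hypothetical parameter values (assumptions, not taken from the paper).
mu_eff, delta, k = 0.8, 0.3, 0.05

# Closed-form maximizer and maximal value from the text.
tau_max = math.sqrt(2.0 * math.pi * delta / (k * mu_eff))
closed_form = -math.sqrt(8.0 * math.pi * delta) * math.sqrt(mu_eff / k)

# Brute-force search over tau in (0, 50] with spacing 0.01.
grid = [0.01 * j for j in range(1, 5001)]
grid_max = max(exponent(tau, mu_eff, delta, k) for tau in grid)
```

Under these assumptions `exponent(tau_max, ...)` agrees with `closed_form` up to rounding, and `grid_max` approaches it from below.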

Inserting all this into (3.8), we get

\begin{matrix} | \int_{R} g (t) d t - k \sum_{n = - \infty}^{\infty} g (k n) | & ≲ C_{g} k exp (- \sqrt{8 π δ} \sqrt{\frac{μ_{g} - 2 ε}{k}}) \int_{0}^{\infty} exp (- ε e^{ξ}) d ξ \\ & ≲ C_{g} k exp (- \sqrt{8 π δ} \sqrt{\frac{μ_{g} - 2 ε}{k}}) . \end{matrix}

$□$

Remark 3.4

It is also possible to admit meromorphic functions with finitely many poles into Lemma 3.3, as long as additional error terms analogous to (3.4) are introduced. Since we will not need this generalization we stay in the analytic setting. $□$

Lemma 3.3 provides a reduced rate of convergence compared to the more standard $sinc$ -quadrature of Proposition 3.2 (an exponent of order $k^{- 1 / 2}$ instead of $k^{- 1}$ ), which on its own would remove the advantage we want to achieve by using the double exponential transformation. We will, however, later consider a class of functions which decay fast enough to allow us to tune the parameter $μ \sim k^{- 1}$ and thereby regain almost the full speed of convergence.

Finally, we show how the transformation $ψ_{σ, θ}$ and the operator $L$ enter the estimates. The next corollary also showcases how the cutoff error is controlled.

Corollary 3.5

Let $O \subseteq C$ contain the right half-plane, and if $σ = 1 / 2$ also a sector

\begin{matrix} S_{ω} : = {z \in C : |Arg (z)| \leq ω} for some ω > \frac{π}{2} . \end{matrix}

Assume that $g : O \to C$ is analytic and satisfies the polynomial bound

\begin{matrix} |g (z)| \leq C_{g} {(1 + |z|)}^{- μ} for μ \in R . \end{matrix}

Then, for all $ε > 0$ , $s, r \in R$ such that $μ - r + s - 2 ε > 0$ , the quadrature errors can be bounded by:

\begin{matrix} {∥E^{L} (g, N_{q})∥}_{H^{2 s} \to H^{2 r}} & \leq C C_{g} [e^{- \hat{γ} \frac{\sqrt{μ - r + s - 2 ε}}{\sqrt{k}}} + exp (- (μ - r + s) γ e^{k N_{q}})] . \end{matrix}

The constant C is independent of g, k, r, s and $β$ , but may depend on $ε$ , $σ$ , $θ$ . The rate $\hat{γ}$ depends on $θ$ and $ω$ , and $γ$ depends on $σ$ .

Proof

Let ${(λ_{j}, v_{j})}_{j = 0}^{\infty}$ denote the eigenvalues and eigenfunctions of the self-adjoint operator $L$ . Following [3], plugging the eigen-decomposition of a function u into the Riesz–Dunford calculus, we can write the exact function $g (L) u$ as

\begin{matrix} g (L) u & = \sum_{j = 0}^{\infty} (\frac{1}{2 π i} \int_{C} g (ψ_{σ, θ} (y)) {(ψ_{σ, θ} (y) - λ_{j})}^{- 1} ψ_{σ, θ}^{'} (y) (u, v_{j})_{L^{2} (Ω)} d y) v_{j} \end{matrix}

and analogously for the discrete approximation $Q^{L} (g, N_{q}) u$ . For the norm, as defined in (1.1), this means:

\begin{matrix} {∥E^{L} (g, N_{q}) u∥}_{H^{2 r}}^{2} & = \frac{1}{4 π^{2}} \sum_{j = 0}^{\infty} | (1 + λ_{j}^{r}) E^{λ_{j}} (g, N_{q}) (u, v_{j})_{L^{2} (Ω)} |^{2} \\ & ≲ sup_{λ \geq λ_{0}} | (1 + λ^{r - s}) E^{λ} (g, N_{q}) |^{2} {∥u∥}_{H^{2 s}}^{2} . \end{matrix}

We have thus reduced the problem to one of scalar quadrature, for which we aim to apply Lemma 3.3. We fix $λ > λ_{0} > κ$ . $ψ_{σ, θ}$ maps $D_{δ}^{exp}$ analytically to $O$ via Lemma A.5 ( $δ$ depends on $θ$ and $ω$ ). What remains to be shown is a pointwise bound for the function

\begin{matrix} h_{λ} (y) : = λ^{r - s} g (ψ_{σ, θ} (y)) {(ψ_{σ, θ} (y) - λ)}^{- 1} ψ_{σ, θ}^{'} (y) \forall y \in D_{δ}^{exp} . \end{matrix}

By distinguishing the cases $|ψ_{σ, θ} (y)| < λ / 2$ and $|ψ_{σ, θ} (y)| \geq λ / 2$ , we get, using either (A6) or Lemma A.5,

\begin{matrix} λ {|ψ_{σ, θ} (y) - λ|}^{- 1} |ψ_{σ, θ}^{'} (y)| ≲ |ψ_{σ, θ} (y)| cosh (Re (y)) . \end{matrix}

We conclude using Lemma A.5:

\begin{matrix} |h_{λ} (y)| & \leq C_{g} {|ψ_{σ, θ} (y)|}^{- μ} (λ {|ψ_{σ, θ} (y) - λ|}^{- 1} |ψ_{σ, θ}^{'} (y)|)^{r - s} \\ & \times ({|ψ_{σ, θ} (y) - λ|}^{- 1} |ψ_{σ, θ}^{'} (y)|)^{1 - r + s} \\ & ≲ C_{g} {|ψ_{σ, θ} (y)|}^{- μ + r - s} cosh (Re (y)) . \end{matrix}

The double exponential growth of $ψ_{σ, θ}$ (see Lemma A.4) then gives after absorbing the $cosh$ term by slightly adjusting $ε$ :

\begin{matrix} |h_{λ} (y)| & \leq C_{g} c_{1} exp (- γ_{1} (μ - r + s) e^{| Re (y) |}) cosh (Re (y)) \\ & ≲ C_{g} exp (- γ_{1} (μ - r + s - ε) e^{| Re (y) |}) . \end{matrix}

3.10

Using Lemma 3.3, with $μ_{g} : = γ_{1} (μ - r + s - ε)$ then gives, after readjusting $ε$ :

\begin{matrix} λ^{r - s} | E^{λ} (g) | & = \frac{1}{2 π} | \int_{R} h_{λ} (y) d y - k \sum_{n = - \infty}^{\infty} h_{λ} (k n) | ≲ C_{g} e^{- \hat{γ} \frac{\sqrt{μ - r + s - 2 ε}}{\sqrt{k}}} \end{matrix}

with $\hat{γ} : = \sqrt{8 π δ γ_{1}}$ . The cutoff error is handled easily, also using the estimate (3.10). We calculate

\begin{matrix} |Q^{λ} (g) - Q^{λ} (g, N_{q})| & \leq k λ^{- r + s} \sum_{|j| \geq N_{q} + 1} |h_{λ} (j k)| \\ & \leq C_{g} k λ^{- r + s} \sum_{|j| \geq N_{q} + 1} exp (- γ_{1} (μ - r + s) e^{|j| k}) \\ & ≲ C_{g} λ^{- r + s} exp (- γ_{1} (μ - r + s) e^{N_{q} k}), \end{matrix}

where the last step follows by estimating the sum by the integral and elementary estimates. $□$
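The double exponential decay of the cutoff error in $k N_{q}$ is easy to observe numerically. The Python sketch below uses the illustrative integrand $g (t) = exp (- cosh (t))$, which is an assumption made for this example and is not the function $h_{λ}$ from the proof. It decays like $exp (- e^{|t|} / 2)$, so truncating the trapezoidal sum at $|n| \leq N_{q}$ should leave a remainder of order $exp (- cosh (k N_{q}))$. The helper name `truncated_trapezoid` is hypothetical.

```python
import math

def truncated_trapezoid(g, k, n_q):
    """Trapezoidal sum k * sum_{|n| <= n_q} g(k n) with 2 n_q + 1 points."""
    return k * sum(g(k * n) for n in range(-n_q, n_q + 1))

# Illustrative doubly exponentially decaying integrand (assumption).
g = lambda t: math.exp(-math.cosh(t))

k = 0.25
# Reference with k * N_q = 50: the neglected tail underflows to zero.
reference = truncated_trapezoid(g, k, 200)

# Cutoff errors for k * N_q = 1, 2, 4.
cutoff_errors = {
    n_q: abs(reference - truncated_trapezoid(g, k, n_q)) for n_q in (4, 8, 16)
}
```

Doubling $N_{q}$ from 8 to 16 at fixed k reduces the cutoff error from roughly the percent level to the level of rounding errors, which is the behavior predicted by the second term of the corollary.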

The elliptic problem

In this section, we analyze the error when discretizing the elliptic fractional diffusion problem from Sect. 2.1. In order to analyze the quadrature error, we need to understand a specific scalar function. This is done in the next Lemma.

Lemma 3.6

Fix $λ > λ_{0} > κ$ and $β > 0$ . For $y \in R$ , define the function

\begin{matrix} g_{λ}^{β} (y) & : = (ψ_{σ, θ} (y))^{- β} (ψ_{σ, θ} (y) - λ)^{- 1} ψ_{σ, θ}^{'} (y) . \end{matrix}

Then the following statements hold:

(i)
$g_{λ}^{β}$ can be extended to a meromorphic function on $D_{d (θ)}$ . It has finitely many poles. All poles p satisfy $ψ_{σ, θ} (p) = λ$ and are all simple. For any $ν \geq 0$ , the number of poles within the strip
$\begin{matrix} ν - \frac{1}{ln (λ / κ)} \leq |Im (y)| \leq ν + \frac{1}{ln (λ / κ)} \end{matrix}$
can be bounded independently of $ν$ , $β$ and $λ$ . The imaginary part of p can be bounded away from zero and for large $λ$ , the following asymptotics hold:
$\begin{matrix} |Im (p)| & \geq \begin{cases} \frac{{tan}^{- 1} (θ)}{ln (λ / κ)} - O (\frac{1}{ln {(λ / κ)}^{2}}) & if σ = 1, \\ \frac{π}{2 ln (λ / κ)} - O (\frac{1}{ln {(λ / κ)}^{2}}) & if σ = 1 / 2 . \end{cases} \end{matrix}$ 3.11
where the implied constants depend on $θ$ , $κ$ , and $λ_{0}$ .
(ii)
There exist constants $C > 0$ , $γ > 0$ , independent of $λ$ and $β$ and a value $d_{λ} \in (d (θ) / 2, d (θ))$ such that $g_{λ}^{β}$ satisfies the bounds
$\begin{matrix} | (1 + λ^{\frac{β}{2}}) g_{λ}^{β} (a \pm i d_{λ}) | \leq C exp (- γ β e^{a}) \forall a \in R . \end{matrix}$ 3.12
(iii)
There exists a constant $C > 0$ such that for $d_{λ}$ from (ii) and $β \geq \bar{β}$ with $\bar{β} \in (0, 1]$
$\begin{matrix} \int_{R} (1 + λ^{\frac{\bar{β}}{2}}) | g_{λ}^{β} (w \pm i d_{λ}) | d w & \leq C < \infty . \end{matrix}$
The constant C may depend on $\bar{β}$ but can be chosen independently of $λ$ and $β$ .

Proof

Proof of (i): We note that by Lemma A.3, $ψ_{σ, θ}$ is non-vanishing in $D_{d (θ)}$ . Since $D_{d (θ)}$ is simply connected, we may define

\begin{matrix} h (y) : = ln (κ) + \int_{0}^{y} \frac{ψ_{σ, θ}^{'} (ζ)}{ψ_{σ, θ} (ζ)} d ζ . \end{matrix}

It is easy to check that on $R$ we have $h (y) = ln (ψ_{σ, θ} (y))$ since the derivative as well as the value at $y = 0$ coincide. Thus, defining

\begin{matrix} g_{λ}^{β} (y) : = e^{- β h (y)} (ψ_{σ, θ} (y) - λ)^{- 1} ψ_{σ, θ}^{'} (y) \end{matrix}

provides a valid meromorphic extension. The only poles are located where $ψ_{σ, θ} (z) = λ$ . By Lemma A.8 (i), the number of such poles within strips of width $ln {(λ)}^{- 1}$ is uniformly bounded. By Lemma A.3, $ψ_{σ, θ}^{'}$ has no zeros in the domain $D_{d (θ)}$ , which means all the poles are simple. The bound on the imaginary part follows from Lemma A.8 (ii).

Proof of (ii): We first note that for $y = a \pm i d_{λ}$ , if $λ < |ψ_{σ, θ} (y)| / 2$ , the trivial estimate ${|ψ_{σ, θ} (y) - λ|}^{- 1} \leq \frac{2}{|ψ_{σ, θ} (y)|}$ holds. Otherwise, we use Lemma A.8(iii) to get

\begin{matrix} {|ψ_{σ, θ} (y) - λ|}^{- 1} ≲ λ^{- 1} \leq 2 {|ψ_{σ, θ} (y)|}^{- 1} . \end{matrix}

Overall, we can estimate using Lemma A.4

\begin{matrix} | g_{λ}^{β} (y) | & ≲ {|ψ_{σ, θ} (y)|}^{- β} {|ψ_{σ, θ} (y) - λ|}^{- 1} |ψ_{σ, θ}^{'} (y)| ≲ {|ψ_{σ, θ} (y)|}^{- β - 1} |ψ_{σ, θ}^{'} (y)| \\ & ≲ exp (- γ β e^{Re (y)}), \end{matrix}

where in the last step, we used that $ψ_{σ, θ}^{'}$ has the same asymptotic behavior as $ψ_{σ, θ}$ up to single exponential terms, which we absorb into the double exponential by slightly reducing $γ$ .

Looking at $| λ^{\frac{\bar{β}}{2}} g_{λ}^{β} (y) |$ , one can calculate using two different ways to estimate $ψ_{σ, θ} (y) - λ$ :

\begin{matrix} λ^{\bar{β} / 2} | g_{λ}^{β} (y) | & ≲ {|ψ_{σ, θ} (y)|}^{- β} (\underset{≲ 1}{\underset{⏟}{λ {|ψ_{σ, θ} (y) - λ|}^{- 1}}})^{\bar{β} / 2} (\underset{≲ {|ψ_{σ, θ} (y)|}^{- 1}}{\underset{⏟}{{|ψ_{σ, θ} (y) - λ|}^{- 1}}})^{1 - \bar{β} / 2} |ψ_{σ, θ}^{'} (y)| \\ & ≲ {|ψ_{σ, θ} (y)|}^{- β} {|ψ_{σ, θ} (y)|}^{- 1 + \bar{β} / 2} |ψ_{σ, θ}^{'} (y)| \\ & \overset{Lemma A.4}{≲} exp (- \frac{γ β}{2} e^{Re (y)}) . \end{matrix}

The integral bound then follows easily from the pointwise ones. $□$

Theorem 3.7

Fix $λ_{0} > κ$ , $\bar{β} \in (0, 1]$ and $r \in [0, \bar{β} / 2]$ . Then there exist constants $C > 0$ , $γ > 0, γ_{1} > 0$ such that for $λ > λ_{0}$ , $β \geq \bar{β}$ , $k > 0$ , $N_{q} \in N$ , the following estimate holds

\begin{matrix} λ^{r} |E^{λ} (z^{- β}, N_{q})| & ≲ k^{2} max {(1, ln (λ))}^{2} λ^{- β + r} e^{- \frac{max {p (σ, θ, λ), γ_{1}}}{k max (1, ln (λ / κ))}} \\ & + e^{- \frac{γ}{k}} + exp (- γ β e^{k N_{q}}), \end{matrix}

3.13a

where the rate is given by

\begin{matrix} p (σ, θ, λ) = \begin{cases} 2 π {tan}^{- 1} (θ) - \frac{c_{2}}{ln (λ / κ)} & if σ = 1, \\ π^{2} - \frac{c_{2}}{ln (λ / κ)} & if σ = 1 / 2 . \end{cases} \end{matrix}

3.13b

Thus for $k \sim ln (N_{q}) / N_{q}$ we get (almost) exponential convergence:

\begin{matrix} λ^{r} | E^{λ} (z^{- β}, N_{q}) | & ≲ k^{2} max {(1, ln (λ / κ))}^{2} λ^{- β + r} e^{- \frac{max {p (σ, θ, λ), γ_{1}} N_{q}}{ln (λ / κ) ln (N_{q})}} + e^{- γ^{'} \frac{N_{q}}{ln (N_{q})}} . \end{matrix}

3.14

The implied constants and $γ$ may depend on $λ_{0}$ , $\bar{β}$ , $σ$ , $θ$ and $κ$ .
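The effect of the step size choice $k = ln (N_{q}) / N_{q}$ can be illustrated with a few lines of Python. With this choice $e^{k N_{q}} = N_{q}$ , so the cutoff term $exp (- γ β e^{k N_{q}})$ becomes $exp (- γ β N_{q})$ , while the term $e^{- γ / k}$ becomes $exp (- γ N_{q} / ln (N_{q}))$ . The values of `beta` and `gamma` below are hypothetical, since the constants in the theorem are not explicit.

```python
import math

def step_size(n_q):
    """Step size k = ln(N_q) / N_q, so that exp(k * N_q) = N_q."""
    return math.log(n_q) / n_q

def error_terms(n_q, beta, gamma):
    """Return (exp(-gamma/k), exp(-gamma*beta*exp(k*N_q))) for k = ln(N_q)/N_q."""
    k = step_size(n_q)
    quadrature = math.exp(-gamma / k)
    cutoff = math.exp(-gamma * beta * math.exp(k * n_q))
    return quadrature, cutoff

# Hypothetical constants (assumptions for illustration only).
beta, gamma = 0.5, 1.0
table = {n_q: error_terms(n_q, beta, gamma) for n_q in (16, 64, 256)}
```

For these sample values the cutoff term is already dominated by the quadrature term, which happens as soon as $β ln (N_{q}) \geq 1$ . This is consistent with only the rate $N_{q} / ln (N_{q})$ appearing in (3.14).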

Proof

To cut down on notation, we only consider the case $ln (λ / κ) \geq c_{1} > 1$ so that the first term in the minimum of (3.11) dominates. If $λ$ is small, the error can be absorbed into the $e^{- γ / k}$ term. The error $E^{λ} (z^{- β}, N_{q})$ corresponds to approximating $g_{λ}^{β}$ by $sinc$ quadrature. We split the error into two parts, the quadrature error and the cutoff error.

\begin{matrix} λ^{r} | \int_{- \infty}^{\infty} g_{λ}^{β} (y) d y - k \sum_{j = - N_{q}}^{N_{q}} g_{λ}^{β} (j k) | \\ \leq \underset{= E^{λ} (z^{- β})}{\underset{⏟}{λ^{r} | \int_{- \infty}^{\infty} g_{λ}^{β} (y) d y - k \sum_{j = - \infty}^{\infty} g_{λ}^{β} (j k) |}} + \underset{= : E_{c}}{\underset{⏟}{k λ^{r} \sum_{|j| \geq N_{q} + 1} |g_{λ}^{β} (j k)|}} . \end{matrix}

The term $E_{c}$ can be handled by the same argument as in Corollary 3.5. We therefore focus on the quadrature error $E^{λ} (z^{- β})$ and apply Proposition 3.2. By Lemma 3.6(iii) it holds that $N (g_{λ}^{β}, D_{d (θ)}) < \infty$ . To satisfy assumption (ii), it suffices that (for sufficiently large y) the vertical strips do not contain any poles and we can use the asymptotics of Lemma 3.6(ii).

By Lemma 3.6, there are at most finitely many simple poles. The residue of the function at these poles can be easily calculated using the well-known rule

\begin{matrix} res (f / g ; z_{0}) = \frac{f (z_{0})}{g^{'} (z_{0})}, \end{matrix}

provided that f is analytic and $g^{'} (z_{0}) \neq 0$ . In our case this means, if $ψ_{σ, θ} (y_{λ}) = λ$ :

\begin{matrix} res (g_{λ}^{β} ; y_{λ}) & = \frac{e^{- β h (y_{λ})} ψ^{'} (y_{λ})}{ψ^{'} (y_{λ})} = e^{- 2 i π β ζ} {(ψ (y_{λ}))}^{- β} = e^{- 2 i π β ζ} λ^{- β}, \end{matrix}

where $ζ \in N_{0}$ denotes the branch of the complex logarithm picked by h.
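The residue rule used above can be verified numerically on a simple example. The following Python sketch approximates the residue by a contour integral over a small circle and compares it with $f (z_{0}) / g^{'} (z_{0})$ . The functions $f (z) = e^{z}$ and $g (z) = z^{2} + 1$ with the simple zero $z_{0} = i$ are assumptions chosen for illustration and are unrelated to $g_{λ}^{β}$ . The helper name `residue_by_contour` is hypothetical.

```python
import cmath
import math

def residue_by_contour(func, z0, radius=0.5, n_points=64):
    """Approximate res(func; z0) = (1 / (2 pi i)) * integral over |z - z0| = radius.

    With z = z0 + w, w = radius * exp(i theta) and dz = i w dtheta, the
    trapezoidal rule in theta gives (1 / n_points) * sum func(z0 + w_j) * w_j.
    """
    total = 0j
    for j in range(n_points):
        w = radius * cmath.exp(2j * math.pi * j / n_points)
        total += func(z0 + w) * w
    return total / n_points

# Illustrative example (assumption): f(z) = exp(z), g(z) = z^2 + 1, z0 = i.
f = cmath.exp
g = lambda z: z * z + 1.0
g_prime = lambda z: 2.0 * z
z0 = 1j

numerical = residue_by_contour(lambda z: f(z) / g(z), z0)
by_rule = f(z0) / g_prime(z0)
```

The circle of radius 0.5 around $z_{0} = i$ excludes the second zero of g at $- i$ , so both values agree up to rounding errors.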

Thus, for a single pole $y_{λ}$ with $s_{y_{λ}} : = sign (Im (y_{λ}))$ , recalling the definition of $γ (k ; y_{λ})$ from Proposition 3.2, we can estimate

\begin{matrix} |e^{i \frac{π s_{y_{λ}} y_{λ}}{k}} res (g_{λ}^{β} ; y_{λ}) γ (k ; y_{λ})| & ≲ λ^{- β} | \frac{e^{i π s_{y_{λ}} y_{λ} / k}}{e^{i π s_{y_{λ}} y_{λ} / k} - e^{- i π s_{y_{λ}} y_{λ} / k}} | \\ & = λ^{- β} \frac{e^{- 2 π |Im (y_{λ})| / k}}{1 - e^{- 2 π |Im (y_{λ})| / k}} . \end{matrix}

By Lemma 3.6(i), we can group poles into buckets of size $\frac{1}{ln (λ / κ)}$ , denoted by

\begin{matrix} B_{ℓ} : = {y : ψ_{σ, θ} (y) = λ with \frac{\frac{p (σ, θ, λ)}{2 π} + ℓ}{ln (λ / κ)} \leq |Im (y)| \leq min (\frac{\frac{p (σ, θ, λ)}{2 π} + ℓ + 1}{ln (λ / κ)}, d (θ))} \end{matrix}

such that the number of elements in each bucket $B_{ℓ}$ is uniformly bounded (independently of $λ$ , $β$ and $ℓ$ ). This allows us to calculate for the pole contribution in Proposition 3.2:

\begin{matrix} | π \sum_{y_{λ} \in P_{λ}^{y}} e^{i \frac{s_{λ} π y_{λ}}{k}} res (g_{λ}^{β} ; y_{λ}) γ (k ; y_{λ}) | \\ \leq λ^{- β} π \sum_{ℓ = 0}^{\infty} | \sum_{y_{λ} \in B_{ℓ}} e^{i \frac{π s_{y_{λ}} y_{λ}}{k}} γ (k ; y_{λ}) | ≲ λ^{- β} \sum_{ℓ = 0}^{\infty} \frac{e^{- \frac{p (σ, θ, λ) + ℓ}{k ln (λ / κ)}}}{1 - e^{- \frac{p (σ, θ, λ) + ℓ}{k ln (λ / κ)}}} \\ ≲ \frac{λ^{- β}}{1 - e^{- \frac{p (σ, θ, λ)}{k ln (λ / κ)}}} \sum_{ℓ = 1}^{\infty} e^{- \frac{p (σ, θ, λ) + ℓ}{k ln (λ / κ)}} ≲ λ^{- β} ln {(λ)}^{2} k^{2} e^{- \frac{p (σ, θ, λ)}{k ln (λ / κ)}}, \end{matrix}

where we used the elementary estimate $1 - e^{- 2 x} ≳ min (x, 1)$ for $x \geq 0$ .
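The elementary estimate can be made explicit: the ratio $(1 - e^{- 2 x}) / min (x, 1)$ decreases from 2 to $1 - e^{- 2}$ on $(0, 1]$ and increases afterwards, so $1 - e^{- 2 x} \geq (1 - e^{- 2}) min (x, 1)$ . A short Python check on sample points, as a sketch and not as a proof:

```python
import math

# Candidate constant, attained at x = 1.
c = -math.expm1(-2.0)  # equals 1 - exp(-2)

def ratio(x):
    """(1 - exp(-2x)) / min(x, 1), evaluated stably via expm1."""
    return -math.expm1(-2.0 * x) / min(x, 1.0)

# Sample points spanning many orders of magnitude.
samples = [10.0 ** e for e in range(-8, 4)] + [0.5, 1.0, 2.0]
worst = min(ratio(x) for x in samples)
```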

Applying Proposition 3.2 and inserting this estimate for the pole-contributions gives:

\begin{matrix} λ^{r} E^{λ} (z^{- β}) & = E^{λ} (λ^{r} z^{- β}) \\ & \overset{Prop. 3.2}{≲} \frac{e^{- 2 π d_{λ} / k}}{1 - e^{- 2 π d_{λ} / k}} N (λ^{r} g_{λ}^{β}, D_{d_{λ}}) + λ^{- β + r} ln {(λ)}^{2} k^{2} e^{- \frac{p (σ, θ, λ)}{k ln (λ / κ)}} . \end{matrix}

The bound from Lemma 3.6(iii) then completes the proof. $□$

The previous estimate gives (almost) exponential convergence with respect to $N_{q}$ . But the rate of the exponential deteriorates like $1 / ln (λ)$ for large $λ$ . In the following corollary, we give a $λ$ -robust version of this estimate. We allow for an additional factor $λ^{ρ}$ which will allow us to make use of possible additional smoothness when considering function-valued integrals.

Corollary 3.8

Fix $λ_{0} > κ > 0$ , $\bar{β} \in (0, 1]$ and $r \in [0, \bar{β} / 2]$ . Then, for every $ε \geq 0$ , there exist constants $C > 0$ , $γ > 0$ such that for $λ > λ_{0}$ , $β > \bar{β}$ , $ρ \geq 0$ , $k > 0$ , $N_{q} \in N$ , the following estimate holds

\begin{matrix} λ^{r} | E^{λ} (z^{- β}, N_{q}) | ≲ & exp (- \frac{[p (σ, θ) - ε] \sqrt{β + ρ - r}}{\sqrt{k}}) λ^{ρ} + exp (- \frac{γ}{k}) \\ + exp (- γ e^{k N_{q}}) . \end{matrix}

3.15

where the rate is given by (2.6). For $ε > 0$ , the implied constant in the estimate and $γ$ may depend on $λ_{0}$ , $σ$ , $θ$ , $\bar{β}$ , $κ$ . If $ε = 0$ , the constants in addition depend on $ρ$ and $β$ .

Proof

We first show the estimate for $ε > 0$ . We note that for $ln (λ / κ) \geq k^{- 1}$ , we can bound the error in Theorem 3.7 by $exp (- γ / k)$ (for an appropriate choice of constant $γ$ ) due to the smallness of the term $λ^{- β}$ . Thus it remains to consider the case $ln (λ / κ) < k^{- 1}$ . Similarly, if $ln (λ) \leq max (\frac{c_{2}}{ε}, - ln (κ) \frac{p (σ, θ) - 2 ε}{ε}, 1) = : μ_{0}$ , the leading error term behaves like $exp (- γ \frac{μ_{0}}{k})$ . We are left to consider the remaining case. Writing $μ : = ln (λ)$ , the error term can be estimated:

\begin{matrix} k^{2} ln {(λ / κ)}^{2} λ^{- β + r - ρ} e^{- \frac{p (σ, θ, λ)}{max (1, ln (λ / κ)) k}} λ^{ρ} \\ ≲ exp (- (β + ρ - r) μ - \frac{p^{2} (σ, θ) / 4 - \frac{c_{2}}{μ}}{(μ - ln (κ)) k}) λ^{ρ} \\ ≲ exp (- (β + ρ - r) μ - \frac{p^{2} (σ, θ) - 2 ε}{4 μ k}) λ^{ρ} . \end{matrix}

3.16

We look for the maximum of the exponent, which corresponds to the worst case with respect to $λ$ . Setting the derivative of the map

\begin{matrix} μ \mapsto - (β + ρ - r) μ - \frac{p^{2} (σ, θ) - 2 ε}{4 μ k} \end{matrix}

to zero, we get that the maximizer satisfies

\begin{matrix} 0 = - (β + ρ - r) μ_{max}^{2} + \frac{p^{2} (σ, θ) - 2 ε}{4 k}, or μ_{max} : = \sqrt{\frac{1}{(β + ρ - r)} \frac{p^{2} (σ, θ) - 2 ε}{4 k}} . \end{matrix}

Inserting this value into (3.16) gives the stated result (after slightly changing $ε$ to get to the stated form).
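For the reader's convenience, the insertion step can be written out. The shorthands $a : = β + ρ - r$ and $b : = (p^{2} (σ, θ) - 2 ε) / (4 k)$ , as well as the symbol $μ_{*}$ for the critical point of $μ \mapsto - a μ - b / μ$ , are introduced here only for this computation:

```latex
\begin{aligned}
\mu_{*} = \sqrt{\frac{b}{a}}
\quad \Longrightarrow \quad
- a \mu_{*} - \frac{b}{\mu_{*}}
 = - 2 \sqrt{a b}
 = - \frac{\sqrt{\beta + \rho - r} \, \sqrt{p^{2}(\sigma, \theta) - 2 \varepsilon}}{\sqrt{k}} .
\end{aligned}
```

Since $\sqrt{1 - x} \geq 1 - x$ for $x \in [0, 1]$ , we have $\sqrt{p^{2} (σ, θ) - 2 ε} \geq p (σ, θ) - 2 ε / p (σ, θ)$ whenever $2 ε \leq p^{2} (σ, θ)$ , which is the form of the rate in (3.15) after renaming $ε$ .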

To see the case for $ε = 0$ , we note that if $ln (λ / κ) \leq \frac{γ_{1} k^{- 1 / 2}}{p (σ, θ) \sqrt{β + ρ - r}}$ , we can estimate for the leading term in Theorem 3.7:

\begin{matrix} k^{2} ln {(λ / κ)}^{2} λ^{- β + r - ρ} e^{- \frac{γ_{1}}{μ k}} λ^{ρ} & \leq k^{2} ln {(λ / κ)}^{2} λ^{- β + r - ρ} e^{- \frac{p (σ, θ) \sqrt{β + ρ - r}}{\sqrt{k}}} λ^{ρ} . \end{matrix}

In the remaining case, we can estimate the higher order term in the $ln (λ / κ)$ -asymptotics as

\begin{matrix} e^{\frac{c_{2}}{ln {(λ / κ)}^{2} k}} \leq e^{\frac{c_{2} p (σ, θ) \sqrt{β + ρ - r}}{γ_{1}}} = : C (σ, θ, β, ρ) . \end{matrix}

We can also write

\begin{matrix} λ^{- β + r - ρ} = κ^{- β + r - ρ} (\frac{λ}{κ})^{- β + r - ρ} \end{matrix}

and continue as in the proof for $ε > 0$ but using $μ : = ln (λ / κ)$ . This time we no longer have to compensate for the factors involving $c_{2} / μ$ and $- ln (κ)$ by slightly reducing the rate. The price we pay is that the constant may blow up for $ρ \to \infty$ . $□$

We can now leverage our knowledge about the function $g_{λ}^{β}$ to gain insight into the discretization error for (2.5). This allows us to prove the two main theorems of this section. First we deal with the finite regularity case.

Proof of Theorem 2.5

Let ${(λ_{j}, v_{j})}_{j = 0}^{\infty}$ denote the eigenvalues and eigenfunctions of the self-adjoint operator $L$ . Just as we did in the proof of Corollary 3.5, we plug the eigen-decomposition into the Riesz–Dunford calculus and Definition 2.1 to get for the discretization error:

\begin{matrix} {∥u - u_{k}∥}_{H^{2 r}}^{2} & = \sum_{j = 0}^{\infty} | (1 + λ_{j}^{r}) [ \frac{1}{2 π i} \int_{C} g_{λ_{j}}^{β} (y) d y - \frac{k}{2 π i} \sum_{n = - N_{q}}^{N_{q}} g_{λ_{j}}^{β} (k n) ] |^{2} {|(f, v_{j})_{X}|}^{2} . \end{matrix}

Applying Corollary 3.8 then gives for $ρ \geq 0$

\begin{matrix} {∥u - u_{k}∥}_{H^{2 r}}^{2} & ≲ e^{- \frac{2 [p (σ, θ) - ε] \sqrt{β + ρ - r}}{\sqrt{k}}} \sum_{j = 0}^{\infty} λ_{j}^{2 ρ} {|(f, v_{j})_{X}|}^{2} + [e^{- \frac{γ}{k}} + e^{- γ e^{k N_{q}}}]^{2} {∥f∥}_{X}^{2} \\ ≲ e^{- \frac{2 [p (σ, θ) - ε] \sqrt{β + ρ - r}}{\sqrt{k}}} {∥f∥}_{H^{2 ρ}}^{2} + [e^{- \frac{γ}{k}} + e^{- γ e^{k N_{q}}}]^{2} {∥f∥}_{X}^{2} . \end{matrix}

$□$

Next we prove the improved estimates for the case of $G^{L} (C_{f}, R_{f}, ω)$ -regularity.

Proof of Theorem 2.8

For simplicity of notation, we ignore the cutoff error, i.e., for now consider $N_{q} = \infty$ . The cutoff error can either be easily tracked throughout the proof or added at the end, analogously to Corollary 3.5.

We first note that, by Stirling’s formula, we can estimate the norms of f by

\begin{matrix} {∥f∥}_{H^{ρ}} & \leq {\tilde{C}}_{f} exp (ρ (ω ln (ρ) + c_{2})) . \end{matrix}

By assumption, we can apply Theorem 2.5 for any $ρ \geq 0$ . Picking $ρ = \frac{δ}{k ln {(k)}^{2}}$ for $δ$ sufficiently small and $ε : = p (σ, θ) / 2$ (because we need $ρ$ -robust error estimates) gives:

\begin{matrix} {∥u - u_{k}∥}_{H^{β}} & ≲ exp (- \frac{p (σ, θ) \sqrt{β / 2 + δ k^{- 1} {|ln (k)|}^{- 2}}}{2 \sqrt{k}}) {∥f∥}_{H^{\frac{2 δ}{k ln {(k)}^{2}}}} + e^{- \frac{γ}{k}} {∥f∥}_{X} \\ & ≲ e^{- \frac{\sqrt{δ} γ^{'}}{k |ln (k)|}} C_{f} e^{\frac{2 δ}{k {|ln (k)|}^{2}} (ω ln (\frac{2 δ}{k {|ln (k)|}^{2}}) + c_{2})} + e^{- \frac{γ}{k}} {∥f∥}_{X} \\ & ≲ e^{- \frac{\sqrt{δ}}{k |ln (k)|} (γ^{'} - \frac{2 \sqrt{δ}}{|ln (k)|} (ω ln (\frac{2 δ}{k {|ln (k)|}^{2}}) + c_{2}))} + e^{- \frac{γ}{k}} {∥f∥}_{X} . \end{matrix}

3.17

We need to show that the bracket in the exponential is positive. In order to do this, we expand the logarithmic term as

\begin{matrix} ln (\frac{2 δ}{k {|ln (k)|}^{2}}) & = ln (2 δ) - ln (k) - 2 ln (| ln (k) |) . \end{matrix}

The first term is negative for $δ < 1 / 2$ , and for the others we note that

\begin{matrix} \frac{2 ω \sqrt{δ}}{|ln (k)|} (- ln (k) - 2 ln (| ln (k) |) + c_{2}) \end{matrix}

is uniformly bounded as $| ln (| ln (k) |) |$ grows slower than $| ln (k) |$ as $k \to 0$ . Due to the leading $\sqrt{δ}$ term, we can make $δ$ small enough (independently of k) to ensure that the second term in the exponent of (3.17) is smaller than $γ^{'}$ and the statement follows. If $ω = 0$ , we don’t have to compensate the factor $e^{ω ρ ln (ρ)}$ , therefore picking $ρ \sim k^{- 1}$ is sufficient and the improved statement follows. $□$
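The positivity of the bracket in (3.17) can be illustrated numerically. The Python sketch below evaluates $γ^{'} - \frac{2 \sqrt{δ}}{|ln (k)|} (ω ln (\frac{2 δ}{k {|ln (k)|}^{2}}) + c_{2})$ for a range of step sizes. All parameter values are hypothetical, since $γ^{'}$ and $c_{2}$ are not explicit in the analysis. The function name `bracket` is introduced only for this example.

```python
import math

def bracket(k, delta, omega, c2, gamma_prime):
    """gamma' - (2 sqrt(delta) / |ln k|) * (omega * ln(2 delta / (k ln(k)^2)) + c2)."""
    log_k = abs(math.log(k))
    log_term = math.log(2.0 * delta / (k * log_k ** 2))
    return gamma_prime - 2.0 * math.sqrt(delta) / log_k * (omega * log_term + c2)

# Hypothetical constants (assumptions for illustration only).
params = dict(delta=0.01, omega=1.0, c2=1.0, gamma_prime=1.0)
values = {k: bracket(k, **params) for k in (1e-1, 1e-2, 1e-4, 1e-8, 1e-16)}
```

As $k \to 0$ the subtracted term tends to $2 ω \sqrt{δ}$ , so for these sample values the bracket stays close to $γ^{'} - 0.2$ and in particular remains positive uniformly in k.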

The parabolic problem

Now that the stationary problem is well understood, we can move on to analyzing the discretization of the time dependent problem introduced in Sect. 2.2.

The Mittag–Leffler function

The representation (2.8) hints that it is crucial to understand the Mittag–Leffler function if one wants to analyze the time dependent problem (2.7). We follow [20, Section 1.8]. For parameters $α > 0$ , $μ \in R$ , the Mittag–Leffler function is an analytic function on $C$ and given by the power series

\begin{matrix} e_{α, μ} (z) : = \sum_{n = 0}^{\infty} \frac{z^{n}}{Γ (n α + μ)} . \end{matrix}

3.18
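For small arguments, the power series (3.18) can be evaluated directly. The following Python sketch truncates the series after a fixed number of terms and is meant only as an illustration. It assumes $0 < α \leq 2$ , $μ > 0$ and moderate $|z|$ , because for large $|z|$ the series suffers from cancellation and overflow. The function name `mittag_leffler` and the default `n_terms` are hypothetical choices.

```python
import math

def mittag_leffler(alpha, mu, z, n_terms=60):
    """Truncated power series (3.18): sum_{n < n_terms} z^n / Gamma(n * alpha + mu).

    Sketch only: assumes 0 < alpha <= 2, mu > 0 and moderate |z|.
    """
    total = 0j
    power = 1 + 0j  # holds z^n, updated iteratively
    for n in range(n_terms):
        total += power / math.gamma(n * alpha + mu)
        power *= z
    return total

# Two classical special cases as a sanity check:
# e_{1,1}(z) = exp(z) and e_{2,1}(-x^2) = cos(x).
exp_check = mittag_leffler(1.0, 1.0, 1.5)
cos_check = mittag_leffler(2.0, 1.0, -4.0)
```

Here `exp_check` should reproduce $e^{1.5}$ and `cos_check` should reproduce $cos (2)$ up to rounding errors.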

We collect some important properties we will need later on. We start with the following decomposition result, also giving us asymptotic estimates.

Proposition 3.9

For $0 < α < 2, μ \in R$ and $\frac{α π}{2} < ζ < α π$ , we can decompose the Mittag–Leffler function as

\begin{matrix} e_{α, μ} (z) & = - \sum_{n = 1}^{N} \frac{1}{Γ (μ - α n)} \frac{1}{z^{n}} + R_{α, μ}^{N} (z) for ζ \leq |Arg z| \leq π . \end{matrix}

3.19

where $R_{α, μ}^{N}$ is analytic away from zero and satisfies

\begin{matrix} |R_{α, μ}^{N} (z)| \leq C Γ (α N) {|z|}^{- (N + 1)} \forall |z| \geq z_{0} > 0 \end{matrix}

3.20

for a constant $C > 0$ depending only on $z_{0}$ and $ζ$ .

Proof

The statement can be found in [20, Eqn 1.8.28] where the dependence of the remainder term on N is not made explicit. To get the explicit estimate on the remainder, we follow [14, Section 18.1]. There, it is proven that the remainder can be written as

\begin{matrix} R_{α, μ}^{N} (z) & = \frac{z^{- N - 1}}{2 π i} \int_{\tilde{C}} (1 - \frac{t^{α}}{z})^{- 1} t^{(N + 1) α - μ} e^{t} d t, \end{matrix}

where $\tilde{C}$ can be taken as two rays ${r ζ_{0} : r \geq 1}$ , ${r \bar{ζ_{0}} : r \geq 1}$ and a small circular arc connecting the two without crossing the negative real axis. $ζ_{0}$ is taken in the left half-plane such that the opening angle of $\tilde{C}$ is sufficiently large in order to avoid possible poles of the integrand and ensure that the term ${(1 - t^{α} / z)}^{- 1}$ is uniformly bounded. The stated result then follows easily by comparing the integral under consideration to the definition of the Gamma function. $□$

Setting $N = 1$ in Proposition 3.9, a simple calculation yields the following estimate:

\begin{matrix} |e_{α, μ} (z)| \leq \frac{C}{1 + {|z|}^{s}} for ζ \leq |Arg z| \leq π, s \in [0, 1] \end{matrix}

3.21
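The decay predicted by (3.19) and (3.20) can be observed in a case where the Mittag–Leffler function is available in closed form. The Python sketch below relies on the classical identity $e_{1 / 2, 1} (- x) = e^{x^{2}} erfc (x)$ for $x \geq 0$ , which is an assumption imported from the standard literature and not stated in this paper. For $α = 1 / 2$ , $μ = 1$ and $z = - x$ , the $n = 1$ term of (3.19) equals $1 / (Γ (1 / 2) x)$ .

```python
import math

def ml_half(x):
    """e_{1/2,1}(-x) for x >= 0 via the closed form exp(x^2) * erfc(x)."""
    return math.exp(x * x) * math.erfc(x)

def leading_term(x):
    """n = 1 term of (3.19) at z = -x with alpha = 1/2, mu = 1: 1 / (Gamma(1/2) * x)."""
    return 1.0 / (math.gamma(0.5) * x)

# Remainder |R^1(z)| at a few points on the negative real axis (|Arg z| = pi).
remainders = {x: abs(ml_half(x) - leading_term(x)) for x in (5.0, 10.0, 20.0)}
```

In this example $x^{2}$ times the remainder stays bounded, in line with (3.20) for $N = 1$ , and the function itself decays like $1 / x$ , in line with (3.21) for $s = 1$ .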

For $α = μ = 1$ , the Mittag–Leffler function $e_{1, 1}$ is the usual exponential function. For the decomposition result, we can skip the terms involving powers $z^{- n}$ in this case as $e^{z}$ already decays faster than any polynomial.

Finally, we need a way of computing antiderivatives of the convolution kernel in (2.8).

Proposition 3.10

For $n \in N_{0}$ , $α > 0$ , $z \in C \ {0}$ , $λ \in C$ , it holds that

\begin{matrix} z^{α - 1} e_{α, α} (λ z^{α}) & = (\frac{\partial}{\partial z})^{n} (z^{α + n - 1} e_{α, α + n} (λ z^{α})) . \end{matrix}

3.22

Proof

Follows from [20, Eqn. 1.10.7] by taking $β : = α + n$ . $□$
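The identity (3.22) can be checked numerically for $n = 1$ by comparing a central finite difference of $z^{α} e_{α, α + 1} (λ z^{α})$ with the kernel $z^{α - 1} e_{α, α} (λ z^{α})$ . The Python sketch below uses a truncated power series, which is adequate only for moderate arguments. The parameter values and the helper names are hypothetical.

```python
import math

def ml_series(alpha, mu, x, n_terms=80):
    """Truncated series (3.18) for real x; sketch for mu > 0 and moderate |x|."""
    return sum(x ** n / math.gamma(n * alpha + mu) for n in range(n_terms))

def antiderivative(alpha, lam, z):
    """z^alpha * e_{alpha, alpha+1}(lam * z^alpha): the n = 1 expression to differentiate."""
    return z ** alpha * ml_series(alpha, alpha + 1.0, lam * z ** alpha)

def kernel(alpha, lam, z):
    """Left-hand side of (3.22): z^(alpha-1) * e_{alpha, alpha}(lam * z^alpha)."""
    return z ** (alpha - 1.0) * ml_series(alpha, alpha, lam * z ** alpha)

# Hypothetical sample values (assumptions for illustration only).
alpha, lam, z, h = 0.7, -1.3, 0.9, 1e-5

# Central finite difference approximating d/dz of the antiderivative.
finite_difference = (
    antiderivative(alpha, lam, z + h) - antiderivative(alpha, lam, z - h)
) / (2.0 * h)
```

Up to the discretization error of the finite difference, `finite_difference` should agree with `kernel(alpha, lam, z)`.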

Double exponential quadrature for the parabolic problem

The case of finite regularity

In this section, we investigate the convergence of our method in the case that $u_{0}$ and f have finite $H^{2 ρ}$ -regularity for some $ρ \geq 0$ . It will showcase most of the new ingredients needed to go from the elliptic case to the time dependent one while keeping the technicalities to a minimum. The step towards Gevrey-regularity will then mainly consist of carefully retracing the argument and fine-tuning parameters. We start with the case $f = 0$ .

Lemma 3.11

Assume that either $α + β < 2$ or $σ = 1$ (i.e., the case $α = β = 1$ and $σ = 1 / 2$ is not allowed). Let $u (t) : = e_{α, μ} (- t^{α} L^{β}) u_{0}$ and assume $u_{0} \in H^{2 ρ} (Ω)$ for some $ρ > 0$ . Let $u_{k} : = Q^{L} (e_{α, μ} (- t^{α} z^{β}), N_{q}) u_{0}$ be the corresponding discretization using stepsize $k > 0$ and $N_{q} \in N$ quadrature points.

Then, the following estimate holds for all $η \geq 1$ and $r \in [0, β / 2]$ :

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{2 r}} ≲ & t^{- η α} (e^{- min {p (σ, θ) \sqrt{β + ρ - r}, \sqrt{β η} γ_{1}} \frac{1}{\sqrt{k}}} + e^{- \frac{γ}{k}}) {∥u_{0}∥}_{H^{2 ρ}} \\ + t^{- α / 2} exp (- γ e^{k N_{q}}) {∥u_{0}∥}_{H^{2 ρ}} . \end{matrix}

Here $γ_{1}$ is the constant from Corollary 3.5. The implied constant and $γ$ may depend on r, the smallest eigenvalue $λ_{0}$ of $L$ , $β$ , $α$ , $κ$ , $θ$ , $σ$ and $ρ$ .

Proof

We start with $N_{q} = \infty$ and split the Mittag–Leffler function according to (3.19). We write

\begin{matrix} E^{L} (e_{α, μ} (- t^{α} z^{β})) & = \sum_{n = 1}^{N} \frac{{(- 1)}^{n} t^{- α n}}{Γ (μ - α n)} E^{L} (z^{- β n}) + E^{L} (R_{α, μ}^{N} (- t^{α} z^{β})) . \end{matrix}

3.23

For the first terms, we apply Theorem 2.5, and for the final term we use the decay estimate (3.20) and Corollary 3.5. Note that this is where we have to exclude the case $α = β = 1$ and $σ = 1 / 2$ . If $α < 1$ the Mittag–Leffler function is contractive on a large enough sector. If $β < 1$ , the map $z \mapsto z^{β}$ maps the required sector into the right half plane. Otherwise, the exponential function only decays in the right half-plane, not any slightly bigger sector. Thus, if $σ = 1 / 2$ , Corollary 3.5 does not apply.

Overall, we get the estimate:

\begin{matrix} {∥E^{L} (e_{α, μ} (- t^{α} z^{β})) u_{0}∥}_{H^{2 r}} & ≲ \sum_{n = 1}^{N - 1} \frac{t^{- α n}}{Γ (μ - α n)} e^{- min {\frac{p (σ, θ) \sqrt{β n + ρ - r}}{\sqrt{k}}, \frac{γ}{k}}} {∥u_{0}∥}_{H^{2 ρ}} \\ & + Γ (α N) t^{- α N} e^{- γ_{1} \frac{\sqrt{β (N + 1) + ρ - r - 2 ε}}{\sqrt{k}}} {∥u_{0}∥}_{H^{2 ρ}} . \end{matrix}

To simplify the calculations, we make use of the fact that $β - r - 2 ε \geq β / 2 - 2 ε > 0$ and $ρ > 0$ . That way, the last term can be simplified to

\begin{matrix} Γ (α N) t^{- α N} exp (- γ_{1} \frac{\sqrt{β N}}{\sqrt{k}}) {∥u_{0}∥}_{H^{2 ρ}} . \end{matrix}

If $η$ is an integer, we can pick $N = η$ to get the statement for $N_{q} = \infty$ . For general $η \geq 1$ , we can interpolate between $⌊ η ⌋$ and $⌊ η ⌋ + 1$ . The treatment of the cutoff error follows as in Corollary 3.5, exploiting that $e_{α, μ} (z)$ decays like (3.21) with $s : = β / 2$ . $□$

Picking $η$ large enough, Lemma 3.11 shows that for fixed times $t > 0$ we get the same convergence rate as for the elliptic problem, though the approximation deteriorates as t gets small.

Now that we understand the homogeneous problem, we can look at the case of allowing inhomogeneous right-hand sides f by using the representation formula (2.8), and finally prove the main result Theorem 2.10. We point out that naive application of Corollary (3.16) also inside the time-convolution integral would fail to give good rates, as the error may blow up faster than $τ^{- α}$ for small times, leading to a non-integrable function. Instead, the following proof relies on integration by parts and (3.22) to split the convolution into point evaluations similar to Lemma 3.11 and an integrable remainder term.

Proof of Theorem 2.10

As we have already estimated the error of the homogeneous part, we only consider the part corresponding to the inhomogeneity, i.e., for now let $u_{0} = 0$ . We integrate by parts m times, using (3.22):

\begin{matrix} \int_{0}^{t} τ^{α - 1} e_{α, α} (- τ^{α} λ^{β}) f (t - τ) d τ = & \sum_{j = 1}^{m} t^{α + j - 1} e_{α, α + j} (- t^{α} λ^{β}) f^{(j - 1)} (0) \\ + \int_{0}^{t} τ^{α + m - 1} e_{α, α + m} (- τ^{α} λ^{β}) f^{(m)} (t - τ) d τ \end{matrix}

Transferring this identity to the operator-valued setting, this means that we can analyze the quadrature error for these terms separately.

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{2 r}} & = ‖ \int_{0}^{t} τ^{α - 1} E^{L} (e_{α, α} (- τ^{α} z^{β})) f (t - τ) d τ ‖_{H^{2 r}} \\ \leq \sum_{j = 1}^{m} t^{α + j - 1} ‖ E^{L} (e_{α, α + j} (- t^{α} z^{β})) f^{(j - 1)} (0) ‖_{H^{2 r}} \\ + \int_{0}^{t} τ^{α + m - 1} ‖ E^{L} (e_{α, α + m} (- τ^{α} z^{β})) f^{(m)} (t - τ) ‖_{H^{2 r}} d τ . \end{matrix}

3.24

All the terms appearing are of the structure in Lemma 3.11. Most notably, the first m terms are evaluated at a fixed $t > 0$ , so we do not have to analyze them further and can simply accept some t-dependence.

Investigating the remaining integral, we get by using $η : = m / α + q$ in Lemma 3.11:

\begin{matrix} \int_{0}^{t} τ^{α + m - 1} {∥E^{L} (e_{α, α + m} (- τ^{α} z^{β})) f^{(m)} (t - τ)∥}_{H^{2 r}} d τ \\ ≲ \int_{0}^{t} τ^{α + m - 1 - m - α q} [e^{- \frac{min \{p (σ, θ) \sqrt{β + ρ - r}, γ_{1} \sqrt{η}\}}{\sqrt{k}}} + e^{- \frac{γ}{k}}] ‖ f^{(m)} (t - τ) ‖_{H^{2 ρ}} d τ . \end{matrix}

For $q < 1$ , this is an integrable function (with respect to $τ$ ) and the integral grows like $t^{α (1 - q)}$ .
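The integrability claim can be checked by hand: the $τ$ -dependent factor in the bound is $τ^{α - 1 - α q}$ , and its integral over $(0, t)$ equals $t^{α (1 - q)} / (α (1 - q))$ whenever $q < 1$ . The following Python sketch compares this closed form with a plain midpoint rule. The function names and the substitution $τ = s^{p}$ (used only to remove the endpoint singularity) are illustrative choices and not part of the method.

```python
import math


def remainder_integral_closed_form(t, alpha, q):
    """Closed form of int_0^t tau**(alpha - 1 - alpha*q) dtau for q < 1."""
    a = alpha * (1.0 - q)
    if a <= 0.0:
        raise ValueError("the integrand is only integrable for q < 1")
    return t ** a / a


def remainder_integral_midpoint(t, alpha, q, n=4000):
    """Midpoint rule after the substitution tau = s**p (illustrative helper)."""
    a = alpha * (1.0 - q)
    p = math.ceil(2.0 / a)          # makes the transformed integrand p * s**(p*a - 1) bounded
    upper = t ** (1.0 / p)
    h = upper / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * h
        total += p * s ** (p * a - 1.0)
    return total * h


if __name__ == "__main__":
    for t in (0.01, 0.1, 1.0):
        print(t,
              remainder_integral_closed_form(t, 0.7, 0.3),
              remainder_integral_midpoint(t, 0.7, 0.3))
```

Doubling t multiplies the value by $2^{α (1 - q)}$ , which is the growth stated above.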

We now focus on extracting the correct t dependencies. For small times, the dominating t-dependence in the estimates above can be found in the first term of (3.24), which behaves like $t^{- m α (1 - q)}$ . If we put back the homogeneous contribution from Lemma 3.11, this term will dominate for small times like $t^{- m - q α}$ . For larger times, the initial error term in (3.24) is dominant, giving behavior $T^{α}$ . The cutoff error is treated like before, making use of the decay of $e_{α, α}$ . We just point out that the homogeneous cutoff error behaves like $t^{- α / 2}$ and the inhomogeneous part like $t^{α / 2}$ (we crudely estimated both by $max (t^{- m - q α}, T^{α})$ to simplify the statement of the theorem). $□$

Remark 3.12

Theorem 2.10 shows that, as long as we assume that f is smooth enough in time, we recover the same convergence rate $p (σ, θ) \sqrt{β + ρ - r}$ as in the homogeneous and elliptic case. $□$

The case of Gevrey-type regularity

If the data not only satisfies some finite regularity estimates but instead is even in some Gevrey-type class of functions, we can again improve the convergence rate, and almost get rid of the square root in the exponent. We go back to the homogeneous problem and assume that $k < 1 / 2$ so that the logarithmic terms can be written down succinctly.

Lemma 3.13

Assume that either $α + β < 2$ or $σ = 1$ (i.e., the case $α = β = 1$ and $σ = 1 / 2$ is not allowed). Let $u (t) : = e_{α, μ} (- t^{α} L^{β}) u_{0}$ and assume that there exist constants $C_{u_{0}}, ω, R_{u_{0}} > 0$ such that

\begin{matrix} {∥u_{0}∥}_{H^{ρ}} & \leq C_{u_{0}} R_{u_{0}}^{ρ} (Γ (ρ + 1))^{ω} < \infty \forall ρ \geq 0 . \end{matrix}

Let $u_{k} (t) : = Q^{L} (e_{α, μ} (- t^{α} z^{β}), N_{q}) u_{0}$ be the discretization of u using stepsize $k \in (0, 1 / 2)$ and $N_{q} \in N$ quadrature points. Then, the following estimate holds:

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{β}} & ≲ C_{u_{0}} exp (- \frac{γ}{k |ln (t_{⋆})| |ln k|}) + C_{u_{0}} t^{- α / 2} exp (- γ e^{k N_{q}}) . \end{matrix}

with $t_{⋆} : = min (t, 1 / 2)$ . The implied constant and $γ$ may depend on $ε$ , the smallest eigenvalue $λ_{0}$ of $L$ , $β$ , $α$ , $κ$ , $θ$ , $σ$ , $R_{u_{0}}$ and $ω$ .

Proof

We go back to (3.23), but apply Theorem 2.8 to each of the first N terms, getting:

\begin{matrix} {∥E^{L} (e_{α, μ} (- t^{α} z^{β})) u_{0}∥}_{H^{β}} & ≲ C_{u_{0}} \sum_{n = 1}^{N} \frac{t^{- α n}}{Γ (μ - α n)} exp (- \frac{γ}{k |ln (k)|}) \\ + Γ (α N) t^{- α N} exp (- γ \frac{\sqrt{N - 2 ε}}{\sqrt{k}}) {∥u_{0}∥}_{X} . \end{matrix}

3.25

We estimate the first N terms by

\begin{matrix} \frac{t^{- α n}}{Γ (μ - α n)} exp (- \frac{γ}{k |ln (k)|}) & ≲ exp (- α ln (t) n + c_{1} n log (n) - \frac{γ}{k |ln (k)|}) . \end{matrix}

For $n \leq \frac{δ}{k {|ln (t_{⋆})|}^{2} {|ln (k)|}^{2}}$ , we can estimate the exponent by

\begin{matrix} - \frac{1}{k |ln (t_{⋆})| |ln (k)|} [- \frac{α δ}{|ln (k)|} - \frac{c_{1} δ}{|ln (t_{⋆})| |ln (k)|} ln (\frac{δ}{ln {(t_{⋆})}^{2} ln {(k)}^{2} k}) + γ ln (2)] \end{matrix}

For $δ$ small enough, depending on $c_{1}$ , $α$ and $γ$ , the term in brackets is uniformly positive (i.e., independently of t and k). We can thus estimate for some $γ_{1} > 0$ :

\begin{matrix} \frac{t^{- α n}}{Γ (μ - α n)} exp (- \frac{γ}{k |ln (k)|}) & ≲ e^{- \frac{γ_{1}}{k |ln (t_{⋆})| |ln (k)|}} . \end{matrix}

The remainder term behaves like

\begin{matrix} t^{- α N} Γ (α N) exp (- γ \frac{\sqrt{N - 2 ε}}{\sqrt{k}}) & ≲ exp (- α N ln (t) - c_{2} N ln (N) - γ \frac{\sqrt{N - 2 ε}}{\sqrt{k}}) . \end{matrix}

By picking $N = ⌈ \frac{δ}{k {|ln (t_{⋆})|}^{2} {|ln (k)|}^{2}} ⌉$ , the exponent can be bounded up to a constant by

\begin{matrix} - \frac{\sqrt{δ}}{k |ln (t_{⋆})| |ln (k)|} [- \frac{γ \sqrt{δ}}{|ln (k)|} - \frac{c_{1} \sqrt{δ}}{|ln (t_{⋆}) ln (k)|} ln (\frac{δ}{k {|ln (t_{⋆})|}^{2} {|ln (k)|}^{2}}) + γ ln (2)] . \end{matrix}

By taking the factor $δ$ sufficiently small, we get that the term in brackets stays uniformly positive, which shows

\begin{matrix} {∥E^{L} (e_{α, μ} (- t^{α} z^{β}))∥}_{H^{β}} & ≲ exp (- \frac{γ}{|ln (t_{⋆})| |ln (k)| k}) . \end{matrix}

The cutoff error can easily be dealt with as in the previous results, as the Mittag–Leffler function satisfies the decay bound (3.21) for $s = 1 / 2$ . $□$

Finally, we are in a position to also include the inhomogeneity f into our treatment. This means we can prove the main result Theorem 2.11. Just as in the proof of Theorem 2.10, we use integration by parts to decompose the error into parts for positive times and a remainder integral with “nice enough” behavior with respect to $τ$ .

Proof of Theorem 2.11

We again work under the assumption $u_{0} = 0$ , focus on the error caused by the inhomogeneity f alone, and start with $N_{q} = \infty$ . For now, we also take $t \leq 1$ .

Going back to (3.24) we get for $N \in N_{0}$ to be fixed later

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{2 r}} & \leq \sum_{j = 1}^{N} t^{α + j - 1} ‖ E^{L} (e_{α, α + j} (- t^{α} z^{β})) f^{(j - 1)} (0) ‖_{H^{β}} \\ + \int_{0}^{t} τ^{α + N - 1} ‖ E^{L} (e_{α, α + N} (- τ^{α} z^{β})) f^{(N)} (t - τ) ‖_{H^{β}} d τ . \end{matrix}

3.26

For the first terms, we apply Lemma 3.13 to get exponential convergence, as long as $f^{(j)}$ is in the right Gevrey-type class. Namely, we note that we can estimate

\begin{matrix} {∥f^{(n)} (t)∥}_{H^{2 ρ}} ≲ e^{\tilde{ω} N ln (N)} e^{\tilde{ω} ρ ln (ρ)} \end{matrix}

by possibly tweaking $\tilde{ω}$ compared to $ω$ . This allows us to estimate

\begin{matrix} \sum_{j = 1}^{N} t^{α + j - 1} {∥E^{L} (e_{α, α + j} (- t^{α} z^{β})) f^{(j - 1)} (0)∥}_{H^{2 r}} & ≲ e^{\tilde{ω} N ln (N)} e^{- \frac{γ}{|ln (t_{⋆})| |ln (k)| k}} . \end{matrix}

3.27

Here, we again adjusted $\tilde{ω}$ to absorb the factor N due to the summation.

For the remainder in (3.26), we look at the pointwise error at fixed $0 < τ < t$ , shortening ${\tilde{f}}^{(N)} : = f^{(N)} (t - τ)$ . Going back to (3.25), we can use the additional powers of t to get rid of the $ln (t)$ term in the exponential:

\begin{matrix} τ^{α + N - 1} {∥E^{L} (e_{α, α + N} (- τ^{α} z^{β})) {\tilde{f}}^{(N)}∥}_{H^{β}} \\ ≲ \sum_{n = 1}^{N - 1} C_{f} e^{\tilde{ω} N ln (N)} \frac{τ^{- α (n - 1) + N - 1}}{Γ (μ - α n)} exp (- \frac{γ}{k |ln (k)|}) \\ + Γ (α N) τ^{(1 - α) (N - 1)} C_{f} e^{\tilde{ω} N ln (N)} exp (- γ \frac{\sqrt{N - 2 ε}}{\sqrt{k}}) . \end{matrix}

We then proceed as in the proof of Lemma 3.13, noting that since the $τ$ -dependent terms can be bounded independently of N we can get by without the $ln (t_{⋆})$ -term in the exponent. Overall, we get by tuning $N \sim δ / ({|ln (k)|}^{2} k)$ (also in (3.27)) appropriately:

\begin{matrix} {∥u (t) - u_{k} (t)∥}_{H^{2 r}} & ≲ exp (- \frac{γ}{k |ln (t_{⋆})| |ln (k)|}) + \int_{0}^{t} exp (- \frac{γ}{k |ln (k)|}) d τ . \end{matrix}

which easily gives the stated result. If $t > 1$ , we can skip the integration by parts step for the integration over (1, t) and directly apply Lemma 3.13. The cutoff error is treated as always. $□$

Numerical examples

In this section, we investigate whether the theoretical results obtained in Sects. 3.2 and 3.3 can also be observed in practice. We compare the following quadrature schemes:

(i) DE1: double exponential quadrature using $σ = 1 / 2$ and $θ = 4$ ,
(ii) DE2: double exponential quadrature using $σ = 1$ and $θ = 4$ ,
(iii) DE3: double exponential quadrature using $σ = 1$ and $θ = 1$ ,
(iv) sinc: standard sinc quadrature,
(v) Balakrishnan: a quadrature scheme based on the Balakrishnan formula,
(vi) BURA: best uniform rational approximation.

For the double exponential quadrature schemes, we used $k = 0.9 ln (r N_{q}) / N_{q}$ with $r : = 1$ for $β > 0.4$ and $r : = 5$ for $β < 0.4$ . This makes the cutoff error decay like $e^{- β r N_{q}^{0.9}}$ , which is sufficiently fast to not impact the overall convergence rate. The factor 0.9 was observed to have some slightly improved stability compared to 1. The damping constant r was introduced to get good behavior for small $β$ ; see Sect. 4.3.
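The step size rule can be written down in a few lines. The sketch below is only an illustration of the formula $k = 0.9 ln (r N_{q}) / N_{q}$ ; the function name is hypothetical, and since the text does not say which value of r is used for $β = 0.4$ exactly, the code treats that borderline case like $β < 0.4$ (an assumption). It also prints $e^{k N_{q}}$ , the quantity entering cutoff terms of the form $exp (- γ e^{k N_{q}})$ as in Lemma 3.13, which for this choice equals ${(r N_{q})}^{0.9}$ .

```python
import math


def de_step_size(n_q, beta, safety=0.9):
    """Return (k, r) with k = safety * ln(r * N_q) / N_q as in the text."""
    # Assumption: beta == 0.4 is not covered by the text; we use r = 5 there.
    r = 1.0 if beta > 0.4 else 5.0
    if r * n_q <= 1.0:
        raise ValueError("need r * N_q > 1 to obtain a positive step size")
    k = safety * math.log(r * n_q) / n_q
    return k, r


if __name__ == "__main__":
    for n_q in (8, 16, 32, 64):
        k, r = de_step_size(n_q, beta=0.7)
        # exp(k * N_q) coincides with (r * N_q)**0.9 for this step size
        print(n_q, k, math.exp(k * n_q), (r * n_q) ** 0.9)
```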

For the standard sinc-quadrature, the proper tuning of k and $N_{q}$ is more involved. Following [4], we picked $k = \sqrt{\frac{2 π d}{β N_{q}}}$ with $d = π / 5$ . The scheme based on the Balakrishnan formula is only applicable to the elliptic problem. It is described in detail in [5]. Following [5, Remark 3.1] we used

\begin{matrix} k : = \sqrt{\frac{π^{2}}{1.8 β N}}, \quad M : = ⌈\frac{π^{2}}{2 (1 - β) k^{2}}⌉, \end{matrix}

where M is the number of negative quadrature points. This corresponds (in their notation) to taking $s^{+} : = β / 10$ , which was taken because it yielded good results (Fig. 4).
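For reference, the parameter choices for the two comparison schemes can be evaluated as follows. This is a minimal sketch of the two formulas above only; the function names are hypothetical, N is used exactly as it appears in the displayed formula (the text does not restate its meaning), and the Balakrishnan rule is restricted to $0 < β < 1$ because the formula for M divides by $1 - β$ .

```python
import math


def sinc_step_size(n_q, beta, d=math.pi / 5):
    """Step size k = sqrt(2*pi*d / (beta * N_q)) for the standard sinc scheme."""
    return math.sqrt(2.0 * math.pi * d / (beta * n_q))


def balakrishnan_parameters(n, beta):
    """Return (k, M) with k = sqrt(pi^2 / (1.8*beta*N)), M = ceil(pi^2 / (2*(1-beta)*k^2))."""
    if not 0.0 < beta < 1.0:
        raise ValueError("the formula for M requires 0 < beta < 1")
    k = math.sqrt(math.pi ** 2 / (1.8 * beta * n))
    m = math.ceil(math.pi ** 2 / (2.0 * (1.0 - beta) * k ** 2))
    return k, m


if __name__ == "__main__":
    print(sinc_step_size(20, 0.3))
    print(balakrishnan_parameters(20, 0.3))
```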

The pure quadrature problem

In this section, we focus on a scalar quadrature problem. Namely, we investigate how well our quadrature scheme can approximately evaluate the following functions using the Riesz–Dunford calculus: (a) $z^{- β}$ and (b) $e_{α, 1} (- t^{α} z^{β})$ at different values $λ \in (4, \infty)$ . This is equivalent to solving the elliptic and parabolic problem with data consisting of a single eigenfunction corresponding to the eigenvalue $λ$ . Throughout, we used $κ : = 3$ . Theoretical investigations revealed that the quadrature error is largest at $ln (λ) \sim k^{- 1 / 2}$ (see the proof of Corollary 3.8). Therefore, we make sure that for each k under consideration, such a value of $e^{\frac{1}{\sqrt{k}}}$ is among the $λ$ -values sampled. More precisely, the sample points consist of

\begin{matrix} ⋃_{N_{q} = 1}^{N_{max}} \{5 + exp (2 \sqrt{β / k (N_{q})})\} \cup \{5 + exp (β / k (N_{q}))\}, \end{matrix}

with $k (N_{q}) = 0.9 ln (N_{q}) / N_{q}$ , and we consider the maximum error over all these samples. We used $t : = 1$ for all experiments.
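The sample set can be generated directly from this formula. The sketch below is an illustration with hypothetical function names. One deviation is an assumption on our side: the loop starts at $N_{q} = 2$ , because $k (1) = 0$ would make $β / k (N_{q})$ undefined.

```python
import math


def k_of(n_q):
    """Step size k(N_q) = 0.9 * ln(N_q) / N_q used for the sample points."""
    return 0.9 * math.log(n_q) / n_q


def sample_lambdas(beta, n_max):
    """Sorted list of the values 5 + exp(2*sqrt(beta/k)) and 5 + exp(beta/k)."""
    samples = []
    # Assumption: start at N_q = 2, since k(1) = 0 cannot be used in beta / k.
    for n_q in range(2, n_max + 1):
        k = k_of(n_q)
        samples.append(5.0 + math.exp(2.0 * math.sqrt(beta / k)))
        samples.append(5.0 + math.exp(beta / k))
    return sorted(samples)


if __name__ == "__main__":
    lambdas = sample_lambdas(beta=0.5, n_max=40)
    print(len(lambdas), lambdas[0], lambdas[-1])
```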

Fig. 1 — Comparison of quadrature schemes—scalar problem

We observe that for the most part, choosing $σ = 1 / 2$ and $θ$ moderately large gives the best result. This agrees with our theoretical findings. This method fails to converge though if $α = β = 1$ is chosen as the parameters for the Mittag–Leffler function. This also agrees with the theory, because in this case, $ψ_{σ, θ}$ fails to map into the domain where $e_{α, μ}$ is decaying (see (3.21)). This shows that the restriction on $σ$ in the theorems of Sect. 3.3 is necessary. If we only consider the elliptic problem, no such restriction is necessary, as the decay property is valid on all of the complex plane. All the other methods perform well in all of the cases. The straight-forward double exponential formula, i.e., $σ = θ = 1$ , is often outperformed by the simple sinc quadrature scheme (except in the $α = β = 1$ case of the exponential). For comparison, we’ve included the (rounded) predicted rate for the DE1 scheme in the plots. We observe that for several applications our estimates appear sharp. For $f (z) = z^{- 1}$ the scheme outperforms the prediction, but this might be due to a large preasymptotic regime. We note that for $e^{- z^{β}}$ , we expect better estimates than the ones presented in this article to be possible due to the exponential decay. This is also true for the standard sinc methods, see [3].

Second, we look at the case of a single frequency $λ$ and see how the convergence rate decays as $λ \to \infty$ . In order to better see the $λ$ -dependence of the quadrature error, we consider the relative error of the quadrature, i.e., we look at $E^{λ} (z^{- β}) / λ^{- β}$ for $β = 0.5$ . The theory from Theorem 3.7 predicts behavior of the form $e^{- \frac{γ}{ln (λ) k}}$ , i.e., the rate drops like $ln (λ)$ . In Fig. 2, we can see this behavior quite well. In comparison, using standard $sinc$ quadrature gives a $λ$ -robust asymptotic rate, but only of order $\sqrt{N_{q}}$ .

Fig. 2 — Comparison of $λ$ dependence for different quadrature schemes

A 2d example

In order to confirm our theoretical findings in a more complex setting, we now look at a 2d model problem with more realistic data than a single eigenfunction. Namely, we work in the PDE-setting of Remark 1.2 using the unit square $Ω = {(0, 1)}^{2}$ and the standard Laplacian with Dirichlet boundary conditions. We focus on two cases: first we look at what happens if the initial condition does not satisfy any compatibility condition, i.e., $u_{0} \notin H^{2 ρ}$ for $ρ \geq 1 / 4$ . The second example is then taken such that the data is (almost) in the Gevrey-type class as required by Theorem 2.8 and Theorem 2.11. The inhomogeneity in time is taken as $f (t) : = sin (t) u_{0}$ , thus possessing analogous regularity properties. We computed the function at $t = 0.1$ .

For the discretization in space and of the convolution in time of (2.8), we consider the scheme presented in [26]. It is based on hp-finite elements for the Galerkin solver and an hp-quadrature on a geometric grid in time for the convolution. As shown there, such a scheme delivers exponential convergence with respect to the polynomial degree and the number of quadrature points. Since we are not interested in these kinds of discretization errors, we fixed these discretization parameters in order to give good accuracy and only focus on the error due to discretizing the functional calculus. Namely, we used 5 layers of geometric refinement towards the boundary and vertices and a polynomial degree of $p = 12$ .

Since the exact solution is not available, we computed a reference solution with high accuracy and compared our other approximations to it. The reference solution is computed by the DE1 scheme (as it outperformed the others) by using 8 additional quadrature points to the finest approximation present in the graph. As the DE1 scheme has finished convergence at this point, we can expect this to be a good approximation.

We start with the parabolic problem. The initial condition is given by

\begin{matrix} u_{0} (x, y) : = ω^{- 1} exp (- \frac{{(x - 0.5)}^{2}}{ω}) exp (- \frac{{(y - 0.5)}^{2}}{ω}) . \end{matrix}

For $ω : = 1$ , this function does not vanish near the boundary of $Ω$ and therefore only satisfies $u_{0} \in H^{1 / 2 - ε}$ . We are in the setting of Theorem 2.10. By inserting $ρ = 1 / 4$ (up to $ε$ ) and $r = 0$ , the predicted rates for DE1 and DE2 are roughly $e^{- \frac{6.13}{\sqrt{k}}}$ and $e^{- \frac{5.62}{k}}$ respectively. Figure 3a contains our findings. We observe that all methods converge with exponential rate proportional to $\sqrt{N_{q}}$ , with the double exponential formulas outperforming the standard $sinc$ quadrature. We also observe that picking $σ \neq 1$ and $θ \neq 1$ can greatly improve the convergence, the best results being delivered by DE1, i.e., $σ = 1 / 2$ and $θ = 4$ . For DE1 and DE2, we observe that for a large part of the computation, the scheme outperforms the predicted asymptotic rate, but for DE2, the rate appears sharp for large values of $N_{q}$ .

Fig. 3 — Comparison of quadrature schemes for 2d parabolic example; $α = 1 / \sqrt{2}$ , $β = 0.7$

As a second example, we used $ω = 0.05$ . This function is almost equal to 0 in a vicinity of the boundary of $Ω$ . Thus we may hope to achieve the improved convergence rate of Theorem 2.11. Figure 3b shows that it is plausible that the exponential rate of order $N_{q}$ is achieved, and all the double exponential schemes greatly outperform the standard $sinc$ quadrature. The best results are again achieved by DE1 and DE2, which also greatly outperform the predicted rate for the non-smooth case.

Elliptic problem and behavior for small $β$

Thus far, all our estimates worked under the assumption of $β > \bar{β} > 0$ . In order to shed some light on the behavior for small $β$ , and in addition gain insight into the behavior for the elliptic problem (2.4), we look at the following model problem for different values of $β$ .

As geometry we again used the unit square. We chose $f = 1$ as the constant function. In this class, we also included the method based on the Balakrishnan formula as well as a rational approximation method, namely the one based on computing the best uniform rational approximation as described in [17]. There, we approximate $z^{1 - β}$ on [0, 1] using a rational function and then divide by $z^{- 1}$ and scale back to the interval $[λ_{min}, λ_{max}]$ . For computing the approximation we used the brasil algorithm described in [18], the implementation of which can be found in the baryrat python package [18]. To determine $λ_{max}$ , we used a simple power iteration with 10 iterations. This gave the estimate $λ_{max} \approx 6 \cdot 10^{15}$ . For $λ_{min}$ we used the constant $κ : = 3$ also used in the other algorithms.
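To illustrate the kind of estimate meant by "a simple power iteration with 10 iterations", here is a minimal pure-Python sketch. It is not the code used for the experiments: the operator is replaced by the small toy matrix tridiag(-1, 2, -1), the helper names are hypothetical, and the Rayleigh quotient is used as the eigenvalue estimate (an assumption about the variant). For a symmetric matrix this quotient never exceeds the largest eigenvalue.

```python
import math


def matvec_tridiagonal_laplacian(x):
    """Apply the toy matrix tridiag(-1, 2, -1) to the list x."""
    n = len(x)
    y = []
    for i in range(n):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < n - 1 else 0.0
        y.append(2.0 * x[i] - left - right)
    return y


def power_iteration(matvec, x0, iterations=10):
    """Return the Rayleigh quotient after the given number of power iteration steps."""
    x = list(x0)
    estimate = 0.0
    for _ in range(iterations):
        norm = math.sqrt(sum(v * v for v in x))
        x = [v / norm for v in x]                      # normalize the current iterate
        y = matvec(x)
        estimate = sum(a * b for a, b in zip(x, y))    # Rayleigh quotient x^T A x
        x = y
    return estimate


if __name__ == "__main__":
    n = 5
    exact = 2.0 - 2.0 * math.cos(n * math.pi / (n + 1))  # largest eigenvalue of the toy matrix
    print(power_iteration(matvec_tridiagonal_laplacian, [1.0] * n), exact)
```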

For small $β$ , preliminary experiments suggest a severe degrading of performance if the choice $k : = 0.9 ln (N_{q}) / N_{q}$ is made. Therefore it was necessary to introduce the constant r in our considerations. We point out that setting $r : = 1$ for $β > 0.4$ is not fully necessary and only gives small improvements for larger values of $β$ . Thus, if multiple values of $β$ are of interest, in order to be able to reuse the approximate inverses ${(L - ψ_{σ, θ} (j k))}^{- 1}$ , the choice of this damping factor should be according to the smallest value of $β$ one is interested in.

In Fig. 4, we again observe that with $θ = 4$ and $σ \in \{0.5, 1\}$ , the double exponential formulas significantly outperform the standard $sinc$ based strategies, where $σ = 0.5$ again delivers the best performance. For comparison, we included the predicted rates for the DE1 and DE2 schemes into the graphics. We observe that asymptotically our estimates appear sharp, but with a large range of values, for which the scheme outperforms the predictions. The rational approximation method performs very well for small numbers of systems, but the performance degrades severely when higher accuracy is required. This instability with respect to numerical errors is most likely due to the requirement of rewriting the rational function in the partial fraction form to apply it to a matrix as described in [17] – even though a multiprecision library is used for computing the poles and residues of the rational function. We also tried the method based on the AAA-algorithm [30], but there the numerical instability was even more problematic.

Acknowledgements

The author would like to thank J. M. Melenk for the many fruitful discussions on the topic. Financial support was provided by the Austrian Science Fund (FWF) through the special research program “Taming complexity in partial differential systems” (grant SFB F65) and the project P29197-N32.

Appendix A: Properties of the coordinate transform $ψ_{σ, θ}$

In this appendix, we study the transformation $ψ_{σ, θ}$ in detail, as it is crucial to understand the double exponential quadrature scheme. Since this transformation is itself defined in a two-part way, we introduce the following nomenclature.

Definition A.1

We call the complex plane on which $ψ_{σ, θ}$ is defined the y-plane, mainly using the parameter y for its points. The main subset of interest there is the strip $D_{d (θ)}$ .

Using the function $y \mapsto \frac{π}{2} sinh (y)$ , the y-plane is mapped to the w-plane. The most interesting set is the image of $D_{d (θ)}$ under this deformation, denoted by $H_{θ} : = {\frac{π}{2} sinh (y), y \in D_{d (θ)}}$ (named due to its hyperbola shape).

Finally, using the function $φ_{σ, θ} (w) : = κ [cosh (σ w) + i θ sinh (w)]$ , mapping $H_{θ} \to C$ , we arrive at the z-plane which corresponds to the range of $ψ_{σ, θ}$ , and the domain of the functions used for the Riesz–Dunford calculus. The situation is summarized in Fig. 5.

Fig. 5 — Illustration of the different planes involved in the mapping $ψ_{σ, θ}$

If we talk about generic complex numbers without relation to any of the specific planes, we use the letter $ζ$ instead.

We start out with some basic properties of $sinh$ .

Lemma A.2

The map $y \mapsto \frac{π}{2} sinh (y)$ has the following properties:

(i) It is a bijective mapping $D_{d (θ)} \to H_{θ}$ , see Fig. 5.
(ii) For $|Re (ζ)| \geq ζ_{0} > 0$ , there exist constants $c_{1}, c_{2} > 0$ depending only on $ζ_{0}$ such that
$\begin{matrix} c_{1} |sinh (ζ)| \leq |cosh (ζ)| \leq c_{2} |sinh (ζ)| . \end{matrix}$
(iii) For any $δ < π / 2$ , $sinh$ maps the domain $D_{δ}^{exp}$ , as defined in (3.1), to a strip of size $δ$ , i.e.,
$\begin{matrix} |Im (sinh (y))| < δ \forall y \in D_{δ}^{exp} . \end{matrix}$

A1

Proof

Proof of (i): It is well known that $sinh$ is injective for $|Im (y)| < π / 2$ . Since $H_{θ}$ is defined as the range of the map this is sufficient.

Part (ii) is an easy consequence of the fact that $sinh$ and $cosh$ have the same asymptotic behavior for $Re (ζ) \to \infty$ . To see (iii), we estimate for $y \in D_{δ}^{exp}$ :

\begin{matrix} |Im (sinh (y))| & = cosh (Re (y)) |sin (Im (y))| < δ cosh (Re (y)) e^{- |Re (y)|} \leq δ . \end{matrix}

$□$
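The bound (A1) is easy to probe numerically. The sketch below assumes that $D_{δ}^{exp}$ is the set of all y with $|Im (y)| < δ e^{- |Re (y)|}$ ; the definition (3.1) is not repeated in this appendix, so this form is inferred from the estimate in the proof above and should be read as an assumption. The function name and the sampling grid are illustrative.

```python
import cmath
import math


def max_imag_sinh_on_exp_domain(delta, re_max=6.0, n_re=121, n_im=41):
    """Largest sampled |Im(sinh(y))| over points with |Im y| < delta * exp(-|Re y|)."""
    worst = 0.0
    for i in range(n_re):
        x = -re_max + 2.0 * re_max * i / (n_re - 1)
        bound = delta * math.exp(-abs(x))
        for j in range(n_im):
            # sample strictly inside the assumed domain
            v = bound * (-1.0 + 2.0 * (j + 0.5) / n_im)
            worst = max(worst, abs(cmath.sinh(complex(x, v)).imag))
    return worst


if __name__ == "__main__":
    for delta in (0.5, 1.0, 1.5):
        print(delta, max_imag_sinh_on_exp_domain(delta))
```

In each case the sampled maximum stays below $δ$ , in line with (A1).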

Lemma A.3

$ψ_{σ, θ}$ is analytic on the infinite strip $D_{d (θ)}$ . For $d (θ)$ sufficiently small, both $ψ_{σ, θ}$ and $ψ_{σ, θ}^{'}$ are non-vanishing on $D_{d (θ)}$ .

Proof

The analyticity of $ψ_{σ, θ}$ is clear. In order to analyze the roots, we first rewrite for $w = a + i b$ , separating the real and imaginary parts:

\begin{matrix} cosh (σ w) + i θ sinh (w) & = (cosh (σ a) cos (σ b) - θ cosh (a) sin (b)) \\ + i (sinh (σ a) sin (σ b) + θ sinh (a) cos (b)) . \end{matrix}

We first focus on the case $σ = 1$ . In this case, (A2) shows that any root y of $ψ_{σ, θ}$ must satisfy for $w : = \frac{π}{2} sinh (y) = : a + b i$ :

\begin{matrix} cosh (a) (cos (b) - θ sin (b)) & = 0 and sinh (a) (θ cos (b) + sin (b)) = 0 . \end{matrix}

Since $cosh$ has no roots, we get $cos (b) = θ sin (b)$ . As $cos (b) = θ sin (b)$ and $θ cos (b) = - sin (b)$ is impossible at the same time, we get that $a = 0$ and $b = {tan}^{- 1} (1 / θ) + ℓ π$ for some $ℓ \in Z$ .

It remains to show that $\frac{π}{2} sinh (y)$ does not map to these points. Looking at the real part of $sinh (y)$ we immediately deduce that if $Im (y) \in (- π / 2, π / 2)$ , in order to produce a purely imaginary result, it must hold that $Re (y) = 0$ . For the imaginary part, we then get the equation:

\begin{matrix} sin (Im (y)) = 2 ℓ + \frac{2 {tan}^{- 1} (1 / θ)}{π} for some ℓ \in Z \end{matrix}

which is not possible for $|Im (y)| \leq d (θ) < {sin}^{- 1} (\frac{2 {tan}^{- 1} (1 / θ)}{π})$ . Next, we show that $ψ_{σ, θ}^{'}$ also does not vanish. A simple calculation shows

\begin{matrix} ψ_{1, θ}^{'} (y) = i θ ψ_{1, \frac{1}{θ}} (- y) \frac{π}{2} cosh (y) . \end{matrix}

Since the restriction $θ \geq 1$ was not crucial for the proof, $ψ_{1, \frac{1}{θ}}$ and $cosh$ have no roots in the symmetric (w.r.t. sign flip) domain $D_{d (θ)}$ . This shows that $ψ_{1, θ}^{'}$ also is non-vanishing.
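The derivative identity for $ψ_{1, θ}^{'}$ can be sanity-checked with a central finite difference. The sketch uses $ψ_{σ, θ} (y) = κ [cosh (σ w) + i θ sinh (w)]$ with $w = \frac{π}{2} sinh (y)$ as in Definition A.1 and $κ = 3$ as in the numerical section; the function names, the step size h and the test points are illustrative choices.

```python
import cmath
import math

KAPPA = 3.0  # value of kappa used in the numerical examples


def psi(y, sigma, theta, kappa=KAPPA):
    """psi_{sigma,theta}(y) = kappa * [cosh(sigma*w) + i*theta*sinh(w)], w = (pi/2) sinh(y)."""
    w = 0.5 * math.pi * cmath.sinh(y)
    return kappa * (cmath.cosh(sigma * w) + 1j * theta * cmath.sinh(w))


def psi_prime_identity(y, theta, kappa=KAPPA):
    """Right-hand side i*theta*psi_{1,1/theta}(-y)*(pi/2)*cosh(y) of the identity."""
    return 1j * theta * psi(-y, 1.0, 1.0 / theta, kappa) * 0.5 * math.pi * cmath.cosh(y)


def psi_prime_fd(y, sigma, theta, h=1e-6):
    """Central finite difference approximation of the complex derivative."""
    return (psi(y + h, sigma, theta) - psi(y - h, sigma, theta)) / (2.0 * h)


if __name__ == "__main__":
    for y in (0.3 + 0.1j, 1.0 - 0.05j):
        print(y, psi_prime_fd(y, 1.0, 4.0), psi_prime_identity(y, 4.0))
```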

The case $σ = 1 / 2$ is similar, but a little more involved. We first show that all zeros of $cosh (σ w) + i θ sinh (w)$ satisfy $Re (w) = 0$ and $|Im (w)| > w_{0} > 0$ for a constant $w_{0}$ depending only on $θ$ . If $cosh (w / 2) \neq 0$ we can use the double angle formula for $sinh$ to get

\begin{matrix} 0 = cosh (w / 2) (1 + 2 θ i sinh (w / 2)) which implies sinh (w / 2) = \frac{i}{2 θ} . \end{matrix}

Splitting into real and imaginary part, we get for $w = a + i b$ :

\begin{matrix} sinh (a / 2) cos (b / 2) = 0 and cosh (a / 2) sin (b / 2) = \frac{1}{2 θ} . \end{matrix}

If $a \neq 0$ , we get that $cos (b / 2) = 0$ and thus $sin (b / 2) = \pm 1$ . This would imply that $|cosh (a / 2)| = \frac{1}{2 θ} \leq 1 / 2$ which is a contradiction. If $a = 0$ , we get that

\begin{matrix} |b| \geq 2 | {sin}^{- 1} (\frac{1}{2 θ}) | > 0 . \end{matrix}

Similarly, one can argue if $cosh (w / 2) = 0$ , that $a = 0$ and $b = π (2 ℓ + 1)$ . We then proceed just like in the case $σ = 1$ to conclude that $\frac{π}{2} sinh$ does not map to such points.

In order to investigate its roots, we compute the derivative of $ψ_{\frac{1}{2}, θ}$ as

\begin{matrix} ψ_{\frac{1}{2}, θ}^{'} (y) & = (\frac{1}{2} sinh (π sinh (y) / 4) + i θ cosh (π sinh (y) / 2)) \frac{π}{2} cosh (y) . \end{matrix}

Our main concern is when the first bracket reaches zero. Substituting $t : = sinh (π sinh (y) / 4)$ and using the double angle formula for $cosh$ we get

\begin{matrix} 0 & = \frac{t}{2} + i θ (1 + 2 t^{2}) . \end{matrix}

Solving this equation, we get that t is purely imaginary and for $θ \geq 1$ satisfies $0 < |Im (t)| < 1$ . Again writing $w = : a + i b$ we get

\begin{matrix} sinh (a / 2) cos (b / 2) = 0 and cosh (a / 2) sin (b / 2) = Im (t) . \end{matrix}

Just like we did when showing $ψ_{\frac{1}{2}, θ} \neq 0$ we can argue that $a = 0$ . We get $sin (b / 2) = Im (t)$ . Since t only depends on $θ$ , we get that $|b| > b_{0} > 0$ with a constant only depending on $θ$ . We proceed as when showing $ψ_{\frac{1}{2}, θ} \neq 0$ to conclude that $ψ_{\frac{1}{2}, θ}^{'}$ has no root in $D_{d (θ)}$ for d sufficiently small (depending on $θ$ ). $□$
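The claim about the quadratic $0 = \frac{t}{2} + i θ (1 + 2 t^{2})$ used in the proof can be verified explicitly: substituting $t = i s$ gives the real equation $2 θ s^{2} - s / 2 - θ = 0$ , so both roots are purely imaginary. The following sketch (hypothetical function names) evaluates them and checks $0 < |Im (t)| < 1$ for a few values $θ \geq 1$ .

```python
import math


def critical_t_values(theta):
    """Roots t = i*s of 0 = t/2 + i*theta*(1 + 2*t**2), i.e. 2*theta*s**2 - s/2 - theta = 0."""
    disc = math.sqrt(0.25 + 8.0 * theta ** 2)
    s_values = ((0.5 + disc) / (4.0 * theta), (0.5 - disc) / (4.0 * theta))
    return [complex(0.0, s) for s in s_values]


def residual(t, theta):
    """Left-over of the quadratic; should vanish up to rounding at the roots."""
    return t / 2 + 1j * theta * (1 + 2 * t ** 2)


if __name__ == "__main__":
    for theta in (1.0, 2.0, 4.0, 10.0):
        for t in critical_t_values(theta):
            print(theta, t, abs(residual(t, theta)))
```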

Next, we study the growth of $ψ_{σ, θ} (y)$ as $|Re (y)| \to \infty$ .

Lemma A.4

Assume that $d (θ) < 1 / 2$ . Then there exist constants $c_{1}, c_{2}$ , $γ_{1}, γ_{2} > 0$ such that for $y \in D_{d (θ)}$ we can estimate

\begin{matrix} c_{1} exp (γ_{1} e^{|Re (y)|}) \leq |ψ_{σ, θ} (y)| \leq c_{2} exp (γ_{2} e^{|Re (y)|}), \end{matrix}

i.e., the growth of $ψ$ is double exponential. Additionally, we can bound

\begin{matrix} | ψ_{σ, θ}^{'} (y) | ≲ | ψ_{σ, θ} (y) | cosh (Re (y)) . \end{matrix}

Proof

We start with the simple case $σ = 1$ , and focus on what happens if $|Re (y)| > y_{0} > 0$ for a constant $y_{0}$ to be fixed later. The values with $0 \leq |Re (y)| \leq y_{0}$ can be covered by adjusting $c_{1}$ and $c_{2}$ , due to the compactness of the set $\{y \in D_{d (θ)} : |Re (y)| \leq y_{0}\}$ and the fact that $|ψ_{σ, θ} (y)| > 0$ by Lemma A.3. We first only consider the $y - w$ part of the transformation. We compute, since $\frac{1}{2} < cos (t) \leq 1$ for $t \in (- 1 / 2, 1 / 2)$ :

\begin{matrix} |Re (sinh (y))| & = |sinh (Re (y))| cos (Im (y)) \sim |sinh (Re (y))| \sim c_{1} e^{|Re (y)|} . \end{matrix}

An easy calculation shows that for $h_{1} (η) : = |cos (η) - θ sin (η)|$ and $h_{2} (η) : = |θ cos (η) + sin (η)|$ it holds that ${[h_{1} (η)]}^{2} + {[h_{2} (η)]}^{2} = 1 + θ^{2}$ . Thus, for $w \in H_{θ}$ with $|Re (w)| \geq 1$ , we can calculate:

\begin{matrix} {|cosh (w) + i θ sinh (w)|}^{2} = & {|cosh (Re (w))|}^{2} {[h_{1} (Im (w))]}^{2} \\ + {|sinh (Re (w))|}^{2} {[h_{2} (Im (w))]}^{2} \\ ≳ & min (cosh (Re (w)), |sinh (Re (w))|)^{2} (1 + θ^{2}) \\ ≳ & e^{2 |Re (w)|} . \end{matrix}

Overall, we see the lower bound of (A5). The upper bound is easily seen, as $|sinh (y)|$ and $|cosh (y)|$ both grow exponentially and the bound only depends on the real part of the argument. (A6) follows from (A3) and the asymptotics (A7) and (A5).

We now look at how to adapt the proof to the case $σ = 1 / 2$ . If $|Re (w)| \geq 2 ln (1 + \sqrt{5})$ , we get

\begin{matrix} |cosh (w / 2) + i θ sinh (w)| \\ \geq θ |sinh (w)| - |cosh (w / 2)| \geq \frac{1}{2} (e^{|Re (w)|} - 2 - e^{|Re (w) / 2|}) \geq \frac{1}{4} e^{|Re (w)|}, \end{matrix}

where in the last step we used the monotonicity of the expression and the fact that $\frac{e^{|Re (w)|}}{2} - e^{|Re (w) / 2|} = 2$ for $Re (w) = 2 ln (1 + \sqrt{5})$ . The argument for the $y - w$ -transformation stays the same. The upper bound also follows easily from the triangle inequality and the growth of $sinh$ and $cosh$ .
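The threshold $2 ln (1 + \sqrt{5})$ can be confirmed numerically. The sketch below (hypothetical names) checks that $\frac{e^{x}}{2} - e^{x / 2} = 2$ at this point and that the difference between $\frac{1}{2} (e^{x} - 2 - e^{x / 2})$ and $\frac{1}{4} e^{x}$ is positive beyond it, which is the last inequality of the display above with $x = |Re (w)|$ .

```python
import math

# threshold from the text: x0 = 2 * ln(1 + sqrt(5))
X0 = 2.0 * math.log(1.0 + math.sqrt(5.0))


def gap(x):
    """(1/2)(e^x - 2 - e^(x/2)) - (1/4) e^x; zero at X0 and increasing for x > 0."""
    return 0.5 * (math.exp(x) - 2.0 - math.exp(0.5 * x)) - 0.25 * math.exp(x)


if __name__ == "__main__":
    print(X0, 0.5 * math.exp(X0) - math.exp(0.5 * X0))
    for step in range(0, 6):
        x = X0 + 0.5 * step
        print(x, gap(x))
```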

To see (A6), we combine (A4) with the asymptotic estimate (A8) to get

\begin{matrix} | ψ_{σ, θ}^{'} (y) | & ≲ e^{π |Re (sinh (y))| / 2} cosh (Re (y)) ≲ | ψ_{σ, θ} (y) | cosh (Re (y)) . \end{matrix}

$□$

While on the full strip $D_{d (θ)}$ , the image of the transformation is difficult to study, the restriction to a certain subdomain is much better behaved.

Lemma A.5

For $σ = 1$ , there exists a constant $δ > 0$ , depending on $θ$ , such that restricted to the domain $D_{δ}^{exp}$ , $ψ_{1, θ}$ maps to a sector in the right-half plane,

\begin{matrix} S_{ω} : = {z \in C : |Arg (z)| \leq ω} with ω < \frac{π}{2} . \end{matrix}

For $σ = 1 / 2$ , and for all $ε > 0$ , there exists a constant $δ > 0$ , depending on $θ$ and $ε$ , such that restricted to the domain $D_{δ}^{exp}$ the transformation $ψ_{\frac{1}{2}, θ}$ maps to the sector $S_{π / 2 + ε}$ .

In both cases, there exist constants $c_{1}, c_{2} > 0$ such that $ψ_{σ, θ}$ satisfies for all $λ \geq λ_{0} > κ :$

\begin{matrix} |ψ_{σ, θ} (y) - λ| \geq c_{1} and |ψ_{σ, θ}^{'} (y) {(ψ_{σ, θ} (y) - λ)}^{- 1}| \leq c_{2} cosh (Re (y)) \forall y \in D_{δ}^{exp}, \end{matrix}

where $c_{1}, c_{2}$ only depend on $λ_{0}$ and $θ$ .

Proof

By Lemma A.2(iii) it is sufficient to consider the mapping of $φ_{σ, θ}$ restricted to small strips in the w-plane around the real axis. We start with the simpler case $σ = 1$ . Going back to (A2) and writing $w : = \frac{π}{2} sinh (y) = : a + i b$ , we note that if $|b|$ is sufficiently small, we can guarantee that $cos (b) - θ sin (b) > c > 0$ for some constant $c > 0$ depending on $θ$ .

We have

\begin{matrix} Re (φ_{σ, θ} (w)) & = κ cosh (a) (cos (b) - θ sin (b)) > c κ cosh (a), \\ |Im (φ_{σ, θ} (w))| & = κ |sinh (a)| | θ cos (b) + sin (b) | \leq (1 + θ) κ |sinh (a)| . \end{matrix}

This implies

\begin{matrix} 0 \leq \frac{|Im (φ_{σ, θ} (w))|}{Re (φ_{σ, θ} (w))} \leq (1 + θ) c^{- 1} \forall w \in C, |Im (w)| sufficiently small . \end{matrix}

Next, we show that for $σ = 1 / 2$ , sufficiently thin strips in the w-domain are mapped to sectors with opening angle $π / 2 + ε$ . Such sectors are characterized by

\begin{matrix} - Re (φ_{σ, θ} (w)) \leq \tilde{ε} |Im (φ_{σ, θ} (w))| \forall |Im (w)| < b_{0} (\tilde{ε}) \end{matrix}

for $\tilde{ε} > 0$ depending on $ε$ . The interesting case is $Re (φ_{σ, θ} (w)) < 0$ . There, we get for $|b| \leq π$ :

\begin{matrix} - Re (φ_{σ, θ} (w)) & = - κ cosh (a / 2) cos (b / 2) + κ θ cosh (a) sin (b) \leq κ θ cosh (a) |sin (b)| . \end{matrix}

For the imaginary part, the double angle-formula gives:

\begin{matrix} |Im (φ_{σ, θ} (w))| & \geq κ θ |sinh (a) cos (b)| - κ |sinh (a / 2) sin (b / 2)| \\ \geq \frac{κ θ}{2} |sinh (a) cos (b)| + κ |sinh (a / 2)| (θ cosh (a / 2) |cos (b)| - |sin (b / 2)|) . \end{matrix}

For $|b|$ sufficiently small, we get $θ cos (b) - |sin (b / 2)| > 0$ , and thus the last term is non-negative. We conclude

\begin{matrix} - Re (φ_{σ, θ} (w)) & \leq 2 |\frac{cosh (a)}{sinh (a)} \frac{sin (b)}{cos (b)}| |Im (φ_{σ, θ} (w))| . \end{matrix}

For $a > 1$ , the ratio $cosh (a) / sinh (a)$ is uniformly bounded. By making b sufficiently small, we can ensure $|sin (b)| / cos b < \tilde{ε}$ . For $a < 1$ , we note that by restricting the values of b it can be easily seen that $Re (φ_{σ, θ} (w)) \geq 0$ . Thus we can conclude that $ψ_{σ, θ}$ maps to the stated sectors.

Next, we prove the bounds on the distance to the real axis, again primarily investigating the behavior of $φ_{σ, θ}$ in thin strips. We focus on $σ = 1 / 2$ . Since $φ_{σ, θ} (0) = κ < λ_{0}$ and $φ_{σ, θ}$ is continuous, there exist constants $0 < q < 1$ and $\tilde{δ} > 0$ such that $|φ_{σ, θ} (w) - φ_{σ, θ} (0)| < q (λ_{0} - κ)$ for all $|w| < \tilde{δ}$ . This gives:

\begin{matrix} |φ_{σ, θ} (w) - λ| & \geq λ - |φ_{σ, θ} (w)| \geq λ_{0} - |φ_{σ, θ} (0)| - |φ_{σ, θ} (w) - φ_{σ, θ} (0)| \\ & > λ_{0} - κ - q (λ_{0} - κ) \geq (1 - q) (λ_{0} - κ) > 0 \forall |w| \leq \tilde{δ} . \end{matrix}

By selecting $δ < \tilde{δ} / 2$ in the definition of $D_{δ}^{exp}$ , we may then continue by only considering $|a| : = |Re (w)| > \tilde{δ} / 2$ . The imaginary part of $φ_{\frac{1}{2}, θ} (y)$ satisfies:

\begin{matrix} Im (φ_{\frac{1}{2}, θ} (y)) & = κ sinh (a / 2) sin (b / 2) + κ θ sinh (a) cos (b) \\ = κ sinh (a / 2) (sin (b / 2) + 2 θ cosh (a / 2) cos (b)) . \end{matrix}

For $|b| \leq π / 4$ we have $sin (b / 2) + 2 cos (b) > 0$ and thus can conclude that $| Im (φ_{\frac{1}{2}, θ} (y)) | \geq c > 0$ and in turn $| φ_{\frac{1}{2}, θ} (y) - λ | \geq c > 0$ . The case $σ = 1$ follows similarly, but without using the double-angle formula.

To see that $ψ_{σ, θ}^{'} (y) {(ψ_{σ, θ} (y) - λ)}^{- 1}$ can also be uniformly bounded, we only need to focus on large values of y (and therefore w). Asymptotically, we estimate for $|b| < π / 4$ :

\begin{matrix} |Im (ψ_{σ, θ} (y))| & \geq - κ |sinh (σ a)| |sin (σ b)| + κ θ |sinh (a) cos (b)| ≳ |sinh (a)| \\ & \overset{(A7) or (A8)}{≳} |ψ_{σ, θ} (y)| . \end{matrix}

Using (A6) then concludes the proof. $□$

In order to apply the double exponential formulas for the Riesz–Dunford calculus, it is important to understand where $ψ_{σ, θ} (z)$ hits the real line. We start with the w-domain.

Lemma A.6

Fix $λ \geq λ_{0} > 1$ . Then the following holds for every $w \in C$ with $Re (w) \neq 0$ and

\begin{matrix} cosh (σ w) + i θ sinh (w) = λ : \end{matrix}

A10

(i)
There exist constants $c_{1}, c_{2}, c_{3} > 0$ such that w satisfies $log (λ) - c_{1} \leq |Re (w)| \leq log (λ) + c_{1}$ and
$\begin{matrix} |Im (w)| \geq \begin{cases} {tan}^{- 1} (θ) & if σ = 1, \\ max (\frac{π}{2} - \frac{c_{2}}{θ \sqrt{λ}}, c_{3}) & if σ = 1 / 2, \end{cases} \end{matrix}$
where $c_{1}$ depends on $λ_{0}$ and $θ$ , $c_{2}$ depends on $λ_{0}$ , and $c_{3}$ depends on $θ$ .
(ii)
Given $0 < r < R$ , the number $N_{w} (λ, r, R)$ of points w satisfying (A10) with $r \leq |Im (w)| \leq R$ is bounded uniformly in $λ$ by
$\begin{matrix} N_{w} (λ, r, R) \leq C |R - r| \end{matrix}$
The constant C depends only on $θ$ .
(iii)
There exist at most four values $p_{1}, \dots, p_{4}$ depending on $λ$ , $θ$ , and $σ$ such that all points satisfying (A10) can be written as
$\begin{matrix} w = p_{j} + \frac{2 ℓ}{σ} π i for some ℓ \in Z, j \in \{1, \dots, 4\} . \end{matrix}$ A11
If w solves (A10) then $- \bar{w}$ does as well.

Proof

We start with the simpler case $σ = 1$ . By separating the real and imaginary parts as in (A2), we can observe that the critical points $w = a + i b$ with $a \neq 0$ are located at

\begin{matrix} cosh (a) & = \frac{λ}{\sqrt{1 + θ^{2}}}, b = - {tan}^{- 1} (θ) + 2 ℓ π, ℓ \in Z . \end{matrix}

A12

This implies that $|a| \sim ln (λ)$ , and we also see that for each $ℓ$ , there are at most two such points, one in each half-plane. All the statements follow easily. Note that in (iii) only two families are needed.
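As a sanity check of (A12), the critical points for $σ = 1$ can be evaluated directly. The sketch below takes $κ = 1$ , so that the defining equation is (A10); the function names `critical_points_sigma1` and `residual` and the sample values of $λ$ and $θ$ are hypothetical and chosen only for illustration.

```python
import cmath
import math

def critical_points_sigma1(lam, theta, ells=(-1, 0, 1)):
    """Points w = a + i*b from (A12): cosh(a) = lam / sqrt(1 + theta^2),
    b = -atan(theta) + 2*pi*l, taken in both half-planes."""
    a = math.acosh(lam / math.sqrt(1.0 + theta**2))  # needs lam >= sqrt(1 + theta^2)
    points = []
    for ell in ells:
        b = -math.atan(theta) + 2.0 * math.pi * ell
        points.extend([complex(a, b), complex(-a, b)])  # one point per half-plane
    return points

def residual(w, lam, theta):
    """Defect |cosh(w) + i*theta*sinh(w) - lam| in (A10) for sigma = 1."""
    return abs(cmath.cosh(w) + 1j * theta * cmath.sinh(w) - lam)

lam, theta = 50.0, 1.0
for w in critical_points_sigma1(lam, theta):
    print(w, residual(w, lam, theta))
```

Each listed point has $|Im (w)| \geq {tan}^{- 1} (θ)$ , in line with Lemma A.6(i) for $σ = 1$ .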

For the remainder of the proof we therefore focus on the case $σ = 1 / 2$ . Proof of (i): We start with the bound on the real part and write $w = a + i b$ . We note that if $|a| > max (1, 2 ln (\frac{8}{θ}))$ one can estimate using elementary considerations that $e^{|a|} / 4 \leq |sinh (w)|$ and $e^{|a| / 2} / θ \leq e^{|a|} / 8$ .

We then calculate:

\begin{matrix} \frac{e^{|a|}}{4} & \leq |sinh (w)| = \frac{1}{θ} |λ - cosh (w / 2)| \leq \frac{λ}{θ} + \frac{e^{\frac{|a|}{2}}}{θ} \leq \frac{λ}{θ} + \frac{e^{|a|}}{8} . \end{matrix}

From this, the statement readily follows. The other direction is shown similarly:

\begin{matrix} e^{| a |} & \geq |sinh (w)| = \frac{1}{θ} |λ - cosh (w / 2)| \geq \frac{λ}{θ} - \frac{e^{\frac{|a|}{2}}}{θ} \geq \frac{λ}{θ} - \frac{e^{|a|}}{8} . \end{matrix}

For $|a| \leq max (1, 2 ln (\frac{8}{θ}))$ , we use the bound $|φ_{σ, θ} (w)| ≲ e^{|a|}$ to see that $λ = φ_{σ, θ} (w) ≲ e^{|a|}$ , giving that $ln (λ)$ must be uniformly bounded. By taking $c_{1}$ large enough we can make $ln (λ) - c_{1}$ negative, thus making the first estimate in (i) trivial. Since $ln (λ) \geq ln (λ_{0}) > 0$ , we can also immediately see

The final bound on the real part of w then follows for $c_{1} : = max (ln (8 / θ), ln (9 θ / 8), \tilde{c})$ , where $\tilde{c}$ is used to compensate for the case of small a.

Looking at the imaginary part of equation (A10), we get from the double-angle formulas

\begin{matrix} 0 & = sinh (a / 2) sin (b / 2) + θ sinh (a) cos (b) \\ = sinh (a / 2) (sin (b / 2) + 2 θ cosh (a / 2) (1 - 2 {sin}^{2} (b / 2))) . \end{matrix}

A13

Since we assume $a \neq 0$ , we get by substituting $τ : = sin (b / 2)$

\begin{matrix} 0 = τ + 2 θ cosh (a / 2) (1 - 2 τ^{2}) or τ = \frac{1 \pm \sqrt{1 + 32 θ^{2} {cosh}^{2} (a / 2)}}{8 θ cosh (a / 2)} . \end{matrix}

A14

From this, using the asymptotic behavior

\begin{matrix} τ = \pm \frac{1}{\sqrt{2}} + O (\frac{1}{θ cosh (a / 2)}) and {sin}^{- 1} (\frac{1}{\sqrt{2}} + h) = \frac{π}{4} + O (h) \end{matrix}

the statement follows for large a, since $b = 2 {sin}^{- 1} (τ)$ and $cosh (a / 2) ≳ \sqrt{λ}$ by (i). For small a, we note that (A14) shows that $Im (w) \neq 0$ . By continuity, it must therefore hold that $|Im (w)| > 0$ uniformly.
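The asymptotic behavior of the roots in (A14) can be made concrete with a few sample values. This is a small sketch under the assumption that only the principal branch of ${sin}^{- 1}$ is of interest; `tau_roots` and `imag_part_candidates` are hypothetical helper names.

```python
import math

def tau_roots(a, theta):
    """Both roots of 0 = tau + 2*theta*cosh(a/2)*(1 - 2*tau^2), cf. (A14)."""
    x = theta * math.cosh(a / 2.0)
    disc = math.sqrt(1.0 + 32.0 * x * x)
    return (1.0 + disc) / (8.0 * x), (1.0 - disc) / (8.0 * x)

def imag_part_candidates(a, theta):
    """b = 2*asin(tau) for every root with |tau| <= 1 (principal branch only)."""
    return [2.0 * math.asin(t) for t in tau_roots(a, theta) if abs(t) <= 1.0]

theta = 1.0
for a in (2.0, 6.0, 12.0):
    x = theta * math.cosh(a / 2.0)
    tp, tm = tau_roots(a, theta)
    # Deviation of the roots from +-1/sqrt(2), next to the reference size 1/(8x).
    print(a, tp - 1 / math.sqrt(2), tm + 1 / math.sqrt(2), 1 / (8 * x),
          imag_part_candidates(a, theta))
```

As a grows, both roots approach $\pm 1 / \sqrt{2}$ at the rate $O (1 / (θ cosh (a / 2)))$ and the corresponding values of $|b|$ approach $π / 2$ .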

Proof of (ii) and (iii): We square the defining equation (A10), getting

\begin{matrix} λ^{2} & = {cosh}^{2} (w / 2) + 2 i θ cosh (w / 2) sinh (w) - θ^{2} {sinh}^{2} (w) \\ = {cosh}^{2} (w / 2) + 4 i θ {cosh}^{2} (w / 2) sinh (w / 2) - 4 θ^{2} {sinh}^{2} (w / 2) {cosh}^{2} (w / 2) . \end{matrix}

A15

Writing ${cosh}^{2} (w / 2) = 1 + {sinh}^{2} (w / 2)$ , we get that $t : = sinh (w / 2)$ solves the quartic equation

\begin{matrix} λ^{2} & = 1 + t^{2} + 4 i θ (1 + t^{2}) t - 4 θ^{2} t^{2} - 4 θ^{2} t^{4} . \end{matrix}

A16

This means there can be at most 4 such values $t_{1}, \dots t_{4}$ for any $λ$ and it must hold that

\begin{matrix} sinh (w / 2) & = t_{j} or w = w_{j} + 4 π ℓ i, ℓ \in Z \end{matrix}

A17

Here $w_{j}$ for $j = 1, \dots, 4$ is the solution to $sinh (w_{j} / 2) = t_{j}$ with $Re (w_{j}) > 0$ and minimal value of $|Im (w_{j})|$ .

To see (ii), we note that for each $t_{j}$ at most Inline graphic values lie in the sought after strip. Therefore we can estimate

The statement (iii) follows readily from (A17). The fact that $- \bar{w_{j}}$ also solves (A10) follows by conjugating both sides of Eq. (A16).

$□$
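The reduction to the quartic (A16) also gives a practical way to compute the reference points for $σ = 1 / 2$ . The following sketch assumes $κ = 1$ and uses `numpy.roots` on the coefficients of (A16). Since squaring (A10) loses the sign, each root $t$ yields candidates mapped to $+ λ$ or $- λ$ ; picking between `w` and `2πi - w` is our own bookkeeping and is not taken from the text. The names `phi_half` and `reference_points` are hypothetical.

```python
import numpy as np

def phi_half(w, theta):
    """Left-hand side of (A10) for sigma = 1/2: cosh(w/2) + i*theta*sinh(w)."""
    return np.cosh(w / 2.0) + 1j * theta * np.sinh(w)

def reference_points(lam, theta):
    """Solve the quartic (A16) for t = sinh(w/2); return one w per root with phi(w) = +lam."""
    # (A16) rearranged:
    # -4 th^2 t^4 + 4i th t^3 + (1 - 4 th^2) t^2 + 4i th t + (1 - lam^2) = 0
    coeffs = [-4.0 * theta**2, 4j * theta, 1.0 - 4.0 * theta**2, 4j * theta, 1.0 - lam**2]
    points = []
    for t in np.roots(coeffs):
        w = 2.0 * np.arcsinh(t)   # one solution of sinh(w/2) = t
        w_alt = 2j * np.pi - w    # the other solution modulo 4*pi*i; phi changes sign
        points.append(min((w, w_alt), key=lambda v: abs(phi_half(v, theta) - lam)))
    return points

lam, theta = 10.0, 1.0
for p in reference_points(lam, theta):
    print(p, abs(phi_half(p, theta) - lam))
```

All further solutions are obtained by adding multiples of $4 π i$ , and the reflection $w \mapsto - \bar{w}$ maps solutions to solutions, as stated in Lemma A.6(iii).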

Next, we show that points which have positive distance to the poles in the w-plane are mapped to points with distance $λ$ in the z-plane. Note that in the following Lemma we include some additional points in order to avoid distinguishing more cases. We also exclude most of the imaginary axis, as for small values of $λ$ it might contain poles which are structurally different than the ones involving large $λ$ .

Lemma A.7

Define

\begin{matrix} b_{0} : = max {b \geq 0 : |φ_{σ, θ} (i τ)| \leq (λ_{0} + κ) / 2 \forall |τ| \leq b} . \end{matrix}

Fix $λ \geq λ_{0} > κ$ and define the set

\begin{matrix} M_{λ} & : = {p_{λ} + i ℓ π, with p_{λ} \in C such that φ_{σ, θ} (p_{λ}) = λ and ℓ \in Z} \cup {i b : |b| \leq b_{0}} . \end{matrix}

For any $δ > 0$ , there exists a constant $c (δ) > 0$ , depending only on $δ$ and $θ$ , such that for all $w \in C$ with $dist (w, M_{λ}) > δ$ we can estimate

\begin{matrix} |φ_{σ, θ} (w) - λ| \geq c (δ) λ . \end{matrix}

Proof

Without loss of generality, we assume $δ < b_{0}$ . We first deal with the issues close to the imaginary axis. Since $|φ_{σ, θ} (i b)| \leq (λ_{0} + κ) / 2$ if $|b| \leq b_{0}$ , we can find $ε \in (0, δ / 2)$ such that

\begin{matrix} |φ_{σ, θ} (\tilde{w})| < 3 λ_{0} / 4 + κ / 4 for all \tilde{w} \in U_{ε} & : = {|Re (\tilde{w})| \leq ε and |Im (\tilde{w})| \leq b_{0}} . \end{matrix}

If $|Re (w)| \leq ε$ , the distance condition to $M_{λ}$ implies for all $|b| \geq b_{0}$ :

\begin{matrix} δ^{2} \leq {| w - i b |}^{2} = {| Im (w) - b |}^{2} + {| Re (w) |}^{2} \leq {| Im (w) - b |}^{2} + δ^{2} / 4, \end{matrix}

or $| Im (w) - b | \geq δ / 2$ . Since all $|b| \geq b_{0}$ are admissible, it cannot be that $| Im (w) | \geq b_{0}$ . Therefore we readily see $|Im (w)| \leq b_{0} - δ / 2$ and thus $w \in U_{ε}$ .

We can therefore calculate:

\begin{matrix} |φ_{σ, θ} (w) - λ| \geq λ - |φ_{σ, θ} (w)| \geq \frac{λ}{4} (1 - \frac{κ}{λ_{0}}) . \end{matrix}

Next, we deal with small values of $λ$ . For constants $Λ$ , $C_{w}$ to be fixed later, consider $λ \in [λ_{0}, Λ]$ and define the set

\begin{matrix} R : = {w \in C : |Re (w)| \geq ε, |w| \leq C_{w}} . \end{matrix}

We show that the stated bound holds for $w \in R$ . By Lemma A.6(iii), the points of $M_{λ}$ can be written as

\begin{matrix} M_{λ} = {p_{j} + π ℓ i, ℓ \in Z, j = 1, \dots, 4} \cup {i b : |b| \leq b_{0}} \end{matrix}

for some reference points $p_{j} \in C$ , w.l.o.g. $|Im (p_{j})| \leq π$ . We introduce the set

\begin{matrix} M_{Λ, C_{w}} : = {p_{j} + ℓ π i, |ℓ| \leq L, j = 1, \dots, 4} \end{matrix}

A18

where $L : = ⌈ (C_{w} + 1) / π ⌉$ is taken large enough that for all $λ \in [λ_{0}, Λ]$ , it holds that

\begin{matrix} M_{λ} \cap R \subseteq M_{Λ, C_{w}}, \end{matrix}

i.e., it contains all the poles of size less than or equal to $C_{w}$ uniformly in $λ$ (but possibly also some larger ones). We consider the map

\begin{matrix} Φ (λ, w) : = (φ_{σ, θ} (w) - λ)^{- 1} \prod_{p_{λ} \in M_{Λ, C_{w}}} (w - p_{λ}) . \end{matrix}

By the inverse function theorem, the points $p_{j} = φ_{σ, θ}^{- 1} (λ)$ depend continuously on $λ$ since $φ_{σ, θ}^{'} \neq 0$ away from the imaginary axis (which we avoid by construction). Thus, the other points $p_{λ}$ also depend continuously on $λ$ . Similarly, the denominator only has simple zeros for $w \in M_{λ}$ . Since the numerator also vanishes in that case, one can argue that $Φ$ has a continuous extension to $[λ_{0}, Λ] \times R$ which is bounded, i.e., it holds

\begin{matrix} |\frac{\prod_{p_{λ} \in M_{Λ, C_{w}}} (w - p_{λ})}{φ_{σ, θ} (w) - λ}| \leq C (Λ, C_{w}) \end{matrix}

\begin{matrix} |φ_{σ, θ} (w) - λ| \geq \frac{1}{C (Λ, C_{w})} \prod_{p_{λ} \in M_{Λ, C_{w}}} |w - p_{λ}| . \end{matrix}

Thus, if w is separated from $M_{λ}$ and the imaginary axis by $δ$ , we get:

\begin{matrix} |φ_{σ, θ} (w) - λ| & \geq \frac{1}{C (Λ, C_{w})} \prod_{p_{λ} \in M_{Λ, C_{w}}} \underbrace{|w - p_{λ}|}_{\geq δ} \\ & \geq \frac{δ^{\# M_{Λ, C_{w}}}}{C (Λ, C_{w})} \frac{λ}{Λ} = : C (Λ, δ, C_{w}) λ . \end{matrix}

Here in the last step we used the fact that the number of elements in $M_{Λ, C_{w}}$ is uniformly bounded, as can be seen from the definition in (A18).

If $λ \in [0, Λ]$ and $|w| > C_{w} : = max (log (2 Λ / c_{1}), 4 ln (2))$ (where $c_{1}$ is the constant in (A7) or (A8)), we get:

\begin{matrix} |φ_{σ, θ} (w) - λ| \geq |φ_{σ, θ} (w)| - λ \overset{(A7), (A8)}{\geq} c_{1} e^{|Re (w)|} - Λ \geq Λ \geq λ . \end{matrix}

We therefore may from now on assume that $λ$ is sufficiently large as we see fit. In preparation for the rest of the proof, we note that for $ζ, μ \in R$ , w.l.o.g., $|ζ| \leq |μ|$ :

\begin{matrix} |cosh (μ) - cosh (ζ)| = & cosh (|μ|) - cosh (|ζ|) = \int_{|ζ|}^{|μ|} sinh (τ) d τ \\ \geq & sinh (|ζ|) (|μ| - |ζ|) . \end{matrix}

A19
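The elementary estimate (A19) is easy to test on random samples. The snippet below is only an illustrative check; the helper name `cosh_gap_lower_bound`, the sampling range and the seed are arbitrary choices.

```python
import math
import random

def cosh_gap_lower_bound(zeta, mu):
    """Right-hand side of (A19), written symmetrically:
    sinh(min(|zeta|, |mu|)) * | |mu| - |zeta| |."""
    lo, hi = sorted((abs(zeta), abs(mu)))
    return math.sinh(lo) * (hi - lo)

random.seed(0)
samples = [(random.uniform(-8, 8), random.uniform(-8, 8)) for _ in range(1000)]
# Smallest slack |cosh(mu) - cosh(zeta)| - bound over all samples (should not be negative).
worst = min(abs(math.cosh(m) - math.cosh(z)) - cosh_gap_lower_bound(z, m) for z, m in samples)
print(worst)
```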

Because it is much simpler, we start with the case $σ = 1$ . We note that in this case $M_{λ}$ consists of the points mapped to $\pm λ$ . We distinguish three cases, depending on whether $Re (w)$ is small and if $Im (w)$ is close to a pole or not.

Case 1: $(1 + θ) κ cosh (Re (w)) < λ / 2 :$ The triangle inequality gives:

\begin{matrix} |κ [cosh (w) + i θ sinh (w)] - λ| & \geq λ - (1 + θ) κ cosh (Re (w)) \geq \frac{λ}{2} . \end{matrix}
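Case 1 relies only on $|cosh (w)| \leq cosh (Re (w))$ and $|sinh (w)| \leq cosh (Re (w))$ . A quick random test, with the hypothetical helper `case1_margin` and arbitrarily chosen sample parameters, could look as follows.

```python
import numpy as np

def case1_margin(w, lam, theta, kappa=1.0):
    """Return |phi(w) - lam| - lam/2 for sigma = 1; non-negative whenever Case 1 applies."""
    phi_w = kappa * (np.cosh(w) + 1j * theta * np.sinh(w))
    return np.abs(phi_w - lam) - lam / 2.0

rng = np.random.default_rng(0)
lam, theta, kappa = 40.0, 1.0, 1.0
w = rng.uniform(-6, 6, 5000) + 1j * rng.uniform(-10, 10, 5000)
# Select the samples that satisfy the Case 1 condition.
in_case1 = (1.0 + theta) * kappa * np.cosh(w.real) < lam / 2.0
print(int(in_case1.sum()), float(case1_margin(w[in_case1], lam, theta, kappa).min()))
```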

Case 2: $2 (1 + θ) κ cosh (Re (w)) \geq λ$ and there exists a point $p \in M_{λ}$ with $|Im w - Im p| \leq ε_{1}$ for $ε_{1}$ sufficiently small. We note that this implies that $|Re (w) - Re (p)|$ is positive. Due to the symmetry in (A11) we may in addition assume that $sign (Re (w)) = sign (Re (p))$ .

By Lemma A.2 and Lemma A.6(i) we note the following estimates:

\begin{matrix} | sinh (Re (w)) | \sim | cosh (Re (w)) | ≳ λ, and | sinh (Re (p)) | \sim | cosh (Re (p)) | ≳ λ . \end{matrix}

A20

Writing $h (η) : = cos (η) - θ sin (η)$ , we note that $|h (Im (w))| > c > 0$ by (A12) and the fact that adding $π ℓ$ might only change the sign. If $h (Im (p)) \geq 0$ , we consider the real part of $φ_{σ, θ} (w) - λ$ to get:

\begin{matrix} |cosh (Re (w)) h (Im (w)) - λ| \\ = |cosh (Re (w)) h (Im (p)) - λ + cosh (Re (w)) (h (Im (w)) - h (Im (p)))| \\ \geq |cosh (Re (w)) h (Im (p)) - λ| - cosh (Re (w)) |h (Im (w)) - h (Im (p))| \\ \overset{(A19)}{≳} min (|sinh (Re (w))|, |sinh (Re (p))|) |Re (w) - Re (p)| - (1 + θ) cosh (Re (w)) ε_{1} \\ \overset{(A20)}{≳} λ, \end{matrix}

where in the last step we chose $ε_{1}$ sufficiently small (but independent of $λ$ ). If $h (Im (p)) \leq 0$ , by continuity we can enforce $h (Im (w)) \leq 0$ as long as $ε_{1}$ is sufficiently small. The necessary calculation then is even easier because $φ_{σ, θ} (w)$ maps to the left half-plane.

Case 3: $2 (1 + θ) cosh (Re (w)) \geq λ$ and $|Im (w) - Im (p)| \geq ε_{1} > 0$ for all $p \in M_{λ}$ . We estimate the imaginary part of $φ_{σ, θ}$ . Since $Im (w)$ has positive distance from the points in (A12), we get $|sin (Im (w)) + θ cos (Im (w))| > c > 0$ , which in turn gives

\begin{matrix} |sinh (Re (w))| |sin (Im (w)) + θ cos (Im (w))| \geq c |sinh (Re (w))| ≳ λ \end{matrix}

where the last estimate only holds for sufficiently large values of $|Re (w)|$ , i.e., those not covered by Case 1.

Now we show how the proof has to be adapted for the case $σ = 1 / 2$ , again focusing on the asymptotic case of large $λ$ . By Lemma A.6(i), all the points $p \in M_{λ}$ satisfy $|Re (p)| \sim log (λ)$ . Looking at the imaginary part of the defining equation for p, see (A13), we get that

\begin{matrix} |cos (Im (p))| & = |\frac{sinh (Re (p) / 2)}{θ sinh (Re (p))} sin (Im (p) / 2)| ≲ \frac{e^{- Re (p) / 2}}{θ} ≲ λ^{- 1 / 2} . \end{matrix}

Thus, for any $ε_{2} \in (0, 1)$ , assuming $λ$ is sufficiently large, all the points $p \in M_{λ}$ satisfy $cos (Im (p)) < ε_{2}$ . We again have to distinguish three cases:

Case 1: $(1 + θ) cosh (Re (w)) < λ / 2 :$ One can argue just like in the $σ = 1$ case.

Case 2: $2 (1 + θ) cosh (Re (w)) \geq λ$ and there exists a point $p \in M_{λ}$ with $|Im w - Im p| \leq ε_{1}$ for $ε_{1}$ sufficiently small. We note that this implies that $|Re (w) - Re (p)|$ is positive. Since $|cos (w)| < ε_{2}$ , we note that $|sin (w)| > 1 - ε_{2}$ . By possibly adding $i ℓ π$ , we can write

\begin{matrix} λ = - θ cosh (Re (p)) sin (Im (p + ℓ i π)) + cosh (Re (p / 2)) sin (Im (p + ℓ i π) / 2) . \end{matrix}

A21

For large values of $λ$ , the $cosh (Re (p))$ term is dominating. Therefore, we have $sign (Re (φ_{σ, θ} (p))) = - sign (sin (Im (p)))$ . By continuity we can enforce that $sign (sin (Im (w))) = sign (sin (Im (p)))$ . Since for the case $Re (φ_{σ, θ} (w)) \leq 0$ the statement is trivial, we only have to consider the case $sign (sin (Im (w))) = - 1$ or $ℓ = 0$ in (A21). We look at the real part of $φ_{σ, θ} (w) - λ$ to get:

\begin{matrix} | - θ cosh (Re (w)) sin (Im (w)) - λ + cosh (Re (w) / 2) cos (Im (w) / 2) | \\ \geq |- θ cosh (Re (w)) sin (Im (p)) + θ cosh (Re (p)) sin (Im (p))| \\ - |cosh (Re (w)) (sin (Im (w)) - sin (Im p))| - O (cosh (Re (w) / 2)) \\ ≳ θ min (|sinh (Re (w))|, |sinh (Re (p))|) |Re (w) - Re (p)| - C cosh (Re (w)) ε_{1} \\ ≳ λ \end{matrix}

where we absorbed the term $cosh (Re (w) / 2)$ into $cosh (Re (w)) ε_{1}$ by assuming $λ$ sufficiently large and in the last step we chose $ε_{1}$ sufficiently small (but independent of $λ$ ).

Case 3: $2 (1 + θ) cosh (Re (w)) \geq λ$ and $|Im w - Im p| \geq ε_{1} > 0$ for all $p \in M_{λ}$ . Since all the points $p \in M_{λ}$ satisfy $|cos (Im (p))| < ε_{2}$ and for every zero of $cos$ , there exists a value $p \in M_{λ}$ with $Im (p)$ close to it, this means that $|cos (Im (w))| \geq δ_{2}$ for a constant $δ_{2}$ depending only on $ε_{1}$ and $ε_{2}$ . We estimate

\begin{matrix} |Im (φ_{σ, θ} (w))| \geq θ |sinh (Re (w))| |cos (Im (w))| - |cosh (w / 2)| ≳ |sinh (Re (w))| ≳ λ \end{matrix}

where the last estimate only holds for sufficiently large values of $|Re (w)|$ , i.e., those not covered by the first case. $□$

Lemma A.8

Fix $λ \geq λ_{0} > κ$ . Consider the set

\begin{matrix} P_{λ}^{y} : = {y \in D_{d (θ)} : ψ_{σ, θ} (y) = λ}, \end{matrix}

where $d (θ)$ is taken sufficiently small. Then the following holds:

(i)
For any $ν_{0} \in [0, d (θ)]$ and $δ > 0$ there are at most finitely many points $y_{1}, \dots, y_{N_{p} (λ, δ)} \in P_{λ}^{y}$ satisfying
$\begin{matrix} ν_{0} - \frac{δ}{ln (λ / κ)} < |Im (y_{ℓ})| < ν_{0} + \frac{δ}{ln (λ / κ)} \forall ℓ \in \{1, \dots, N_{p} (λ, δ)\} . \end{matrix}$
The number $N_{p} (λ, δ)$ of such points can be bounded by a constant depending only on $δ$ , $θ$ , $σ$ and $d (θ)$ , but independently of $λ$ and $ν_{0}$ .
(ii)
For $λ \in (λ_{0}, Λ)$ , one can bound $|Im (y)| \geq γ (Λ) > 0 \forall y \in P_{λ}^{y},$ with a constant $γ (Λ)$ depending on $Λ$ , $θ$ , $κ$ , $λ_{0}$ . For $λ$ sufficiently large, the following asymptotic holds:
$\begin{matrix} |Im (y)| \geq \begin{cases} \frac{{tan}^{- 1} (θ)}{ln (λ / κ)} - O (\frac{1}{{ln}^{2} (λ / κ)}) & if σ = 1, \\ \frac{π}{2 ln (λ / κ)} - O (\frac{1}{{ln}^{2} (λ / κ)}) & if σ = 1 / 2 \end{cases} \forall y \in P_{λ}^{y}, \end{matrix}$ A22
where the implied constants depend only on $θ$ , $κ$ , $λ_{0}$ .
(iii)
There exists a parameter $d_{λ} \in (d (θ) / 2, d (θ)]$ and a constant $c > 0$ such that
$\begin{matrix} |ψ_{σ, θ} (a \pm i d_{λ}) - λ| \geq c λ / κ \forall a \in R . \end{matrix}$ A23
c depends on $θ$ , $σ$ and $λ_{0}$ but is independent of $λ$ .

Proof

For now, consider the case $κ = 1$ and $λ_{0} > 1$ . Since $ψ_{σ, θ} (0) = 1$ , due to continuity, there exists a factor $d (θ) > 0$ and $ε > 0$ such that

\begin{matrix} |ψ_{σ, θ} (y)| < 1 + ε < λ_{0} < λ \forall |y| \leq d (θ) . \end{matrix}

By taking $d (θ)$ at least this small in the definition of $D_{d (θ)}$ we can exclude the poles satisfying $Re (w) = Re (sinh (y)) = 0$ .

Proof of (i): Since $sinh$ is injective by Lemma A.2(i) we only need to count the points in $H_{θ}$ which are mapped to $λ$ by $φ_{σ, θ}$ . Consider a point $w = a + i b$ in the w domain and let $\frac{π}{2} sinh (y) = w$ with $y = ξ + i ν \in D_{d (θ)}$ . Then

\begin{matrix} |a| = \frac{π}{2} |sinh (ξ)| |cos (ν)| and |b| & = \frac{π}{2} |cosh (ξ)| |sin (ν)| . \end{matrix}

Simple computations give $|a| tan (|ν|) \leq |b| \leq π / 2 sin (| ν |) + |a| tan (|ν|)$ . We only show the second inequality:

\begin{matrix} |b| & = \frac{π}{2} |cosh (ξ)| |sin (ν)| \leq \frac{π}{2} (1 + |sinh (ξ)|) |sin (ν)| = \frac{π}{2} (1 + \frac{2 | a |}{π | cos (ν) |}) |sin (ν)| \\ \leq \frac{π}{2} sin (| ν |) + |a| tan (|ν|) . \end{matrix}
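Both inequalities for $|b|$ can be verified on a grid of points $y = ξ + i ν$ . The sketch below uses the hypothetical helper `strip_image_bounds`; the ranges for $ξ$ and $ν$ are arbitrary sample values with $|ν| < π / 2$ .

```python
import numpy as np

def strip_image_bounds(xi, nu):
    """For w = (pi/2)*sinh(xi + i*nu) return
    (|a| tan|nu|, |b|, (pi/2) sin|nu| + |a| tan|nu|) with a = Re w, b = Im w."""
    w = 0.5 * np.pi * np.sinh(xi + 1j * nu)
    a, b = np.abs(w.real), np.abs(w.imag)
    lower = a * np.tan(np.abs(nu))
    upper = 0.5 * np.pi * np.sin(np.abs(nu)) + lower
    return lower, b, upper

xi, nu = np.meshgrid(np.linspace(-5, 5, 101), np.linspace(-0.5, 0.5, 101))
lower, b, upper = strip_image_bounds(xi, nu)
# Small tolerance because equality holds at xi = 0 (upper bound) and nu = 0 (both bounds).
print(bool(np.all(lower <= b + 1e-9)), bool(np.all(b <= upper + 1e-9)))
```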

By inserting Lemma A.6(i) we get

\begin{matrix} (ln (λ) - c_{1}) tan (|ν|) \leq |b| \leq \frac{π}{2} sin (|ν|) + (c_{1} + ln (λ)) tan (|ν|) . \end{matrix}

A24

For the length L of this interval, we compute for $ν_{\pm} : = ν_{0} \pm δ ln {(λ / κ)}^{- 1}$ (without changing the number of points, we may assume $0 \leq ν_{-} \leq ν_{+} \leq d (θ)$ ):

\begin{matrix} L & = ln (λ) (tan (|ν_{+}|) - tan (ν_{-})) + \frac{π}{2} sin (ν_{+}) + c_{1} tan (ν_{+}) + c_{1} tan (ν_{-}) \\ \leq ln (λ) \int_{ν_{-}}^{ν_{+}} \frac{1}{{cos}^{2} (τ)} d τ + \frac{π}{2} + 2 c_{1} \leq 8 δ \frac{ln (λ)}{ln (λ) - ln (κ)} + \frac{π}{2} + 2 c_{1}, \end{matrix}

where in the last step we used $cos (τ) > 1 / 2$ for $|τ| < 1 / 2$ and the definition of $ν_{\pm}$ . The right-hand side stays uniformly bounded for $λ \to \infty$ . Therefore, since the length of this interval is bounded uniformly in $λ$ , we can apply Lemma A.6(ii) to bound $N_{p} (λ, δ)$ .

Proof of (ii): We only show the asymptotics.

Fix $y \in D_{d (θ)}$ and write $w : = \frac{π}{2} sinh (y)$ . For $0 < |y| < 1 / 2$ it can be easily seen that $|y| + \frac{2}{3} {|y|}^{3} > tan (|y|) .$ Now, if $|Im (y)| \geq 2 ln {(λ)}^{- 1}$ there is nothing left to show. Otherwise, we can bound

\begin{matrix} |Im (y)| & \geq |tan (y)| - \frac{4}{3 ln {(λ)}^{3}} \overset{(A 24)}{\geq} \frac{|Im (w)|}{1 + c_{1} + ln (λ)} - \frac{4}{3 ln {(λ)}^{3}} \\ \geq \frac{|Im (w)|}{ln (λ)} - O (\frac{1}{ln {(λ)}^{2}}) . \end{matrix}

The result follows from Lemma A.6(i).
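For $σ = 1$ and $κ = 1$ the leading term in (A22) can be observed directly, since the poles in the w-domain are known explicitly from (A12). The following sketch maps the pole with $Re (w) > 0$ and $ℓ = 0$ back to the y-domain; the function name `pole_in_y_domain` and the sample values of $λ$ are illustrative assumptions.

```python
import cmath
import math

def pole_in_y_domain(lam, theta):
    """y with (pi/2)*sinh(y) = w, where w is the (A12) point with Re(w) > 0 and l = 0."""
    a = math.acosh(lam / math.sqrt(1.0 + theta**2))
    w = complex(a, -math.atan(theta))
    return cmath.asinh(2.0 * w / math.pi)

theta = 1.0
for lam in (1e2, 1e4, 1e8, 1e12):
    y = pole_in_y_domain(lam, theta)
    # Compare |Im y| * ln(lam) with the leading constant atan(theta) from (A22).
    print(lam, abs(y.imag) * math.log(lam), math.atan(theta))
```

The product $|Im (y)| ln (λ)$ approaches ${tan}^{- 1} (θ)$ only slowly, which is consistent with the $O (1 / {ln}^{2} (λ))$ correction in (A22).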

Proof of (iii): For $d_{λ} = d (θ)$ , we cannot guarantee that $ψ (ξ + i d_{λ})$ does not hit the value $λ$ . In this case, we have to modify $d_{λ}$ slightly to get robust estimates. For $d \in R$ , consider the hyperbolas

\begin{matrix} γ_{d} (ξ) : = \frac{π}{2} sinh (ξ + i d) ξ \in R . \end{matrix}

A25

In the light of Lemma A.7 we need to ensure that $dist (γ_{d_{λ}}, w_{p}) ≳ 1$ for all $w_{p} \in M_{λ}$ . We will be looking for $d_{λ}$ in a small strip around $d (θ)$ . To simplify notation we define the length

\begin{matrix} ω : = d (θ) \frac{ln (λ_{0})}{2 ln (λ)} such that d (θ) / 2 \leq d (θ) - ω \leq d (θ) . \end{matrix}

To make things symmetric with respect to the real axis, we consider ${\tilde{M}}_{λ} : = M_{λ} - M_{λ}$ . It will therefore be sufficient to focus on the upper right quadrant of the complex plane. All other cases follow by symmetry.

We write ${\tilde{M}}_{λ}^{y} : = {sinh}^{- 1} (\frac{2}{π} {\tilde{M}}_{λ})$ for the corresponding points in the y-domain. We start by noting that we can easily stay away from the problematic parts of the imaginary axis by making $d (θ)$ sufficiently small: if $|Re (sinh (y))| < ε$ , then $|Im (sinh (y))| < (1 + ε) sin (Im (y))$ , so for small real parts the path stays within $(- b_{0}, b_{0})$ on the imaginary axis. Consequently, we only need to consider points $w_{λ} \in {\tilde{M}}_{λ}$ with $|Re (w_{λ})| > ε > 0$ , since our path already has positive distance to all other possible poles.

By (i), the number of points $y_{λ}$ in ${\tilde{M}}_{λ}^{y}$ in the strip $d (θ) - ω \leq Im (y_{λ}) \leq d (θ)$ can be bounded by a constant N, independent of $λ$ . In order to also avoid points in ${\tilde{M}}_{λ}^{y}$ which are close to but outside the critical strip, we also avoid the boundary points $d (θ) - ω$ and $d (θ)$ . Since $N + 2$ strips of width $\frac{ω}{2 N + 4}$ cannot cover a strip of width $ω$ , there exists a value $d_{λ}$ such that

\begin{matrix} d (θ) - ω \leq d_{λ} \leq d (θ) and |Im (y_{λ}) - d_{λ}| \geq \frac{ω}{2 (N + 2)} \forall y_{λ} \in {\tilde{M}}_{λ}^{y} . \end{matrix}
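The existence argument for $d_{λ}$ is constructive. One possible realization, given here only as a sketch and not as the procedure used in the paper, takes the midpoint of the largest gap between the imaginary parts in the strip and the two boundary points. The function name `choose_d_lambda` and the sample values are hypothetical.

```python
def choose_d_lambda(imag_parts, d_theta, omega):
    """Pick d in [d_theta - omega, d_theta] away from the given imaginary parts
    and from both end points of the strip.

    Returns (d, margin). With N points strictly inside the strip, the largest of the
    N + 1 gaps has length >= omega / (N + 1), so margin >= omega / (2 * (N + 2)).
    """
    lo, hi = d_theta - omega, d_theta
    obstacles = sorted([lo, hi] + [v for v in imag_parts if lo < v < hi])
    # Largest gap between consecutive obstacles; its midpoint maximizes the distance.
    left, right = max(zip(obstacles, obstacles[1:]), key=lambda g: g[1] - g[0])
    return 0.5 * (left + right), 0.5 * (right - left)

d, margin = choose_d_lambda([0.41, 0.45, 0.47, 0.30, 0.55], d_theta=0.5, omega=0.1)
print(d, margin)
```

Points outside the strip are handled automatically: their distance to the chosen value is at least the distance to the nearer boundary point, which is itself treated as an obstacle.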

For ease of notation, we define $δ : = ln (λ_{0}) / (4 N + 8)$ and note that $|Im (y_{λ}) - d_{λ}| \geq δ ln {(λ)}^{- 1}$ . We show that

\begin{matrix} dist (γ_{d_{λ}}, w_{p}) & \geq μ > 0 \forall w_{p} \in {\tilde{M}}_{λ} \end{matrix}

for a constant $μ > 0$ independent of $λ$ . We fix $y_{p} : = ξ_{p} + i ν_{p} \in {\tilde{M}}_{λ}^{y}$ with $w_{λ} = π sinh (y_{p}) / 2$ and a point on $γ_{d_{λ}}$ denoted by $y_{γ} = ξ_{γ} + d_{λ} i$ . We write $s_{p} : = sign (ν_{p} - d_{λ})$ and distinguish two cases: $(ξ_{p} - ξ_{γ}) s_{p} \leq 0$ and $(ξ_{p} - ξ_{γ}) s_{p} > 0$ . For symmetry reasons, we only consider the case $ξ_{γ}, ξ_{p}, ν_{p} > 0$ . If $(ξ_{p} - ξ_{γ}) s_{p} \geq 0$ , we get:

\begin{matrix} Im (s_{p} (sinh (y_{p}) - sinh (y_{γ}))) \\ = s_{p} (cosh (ξ_{p}) sin (ν_{p}) - cosh (ξ_{γ}) sin (d_{λ})) \\ = cosh (ξ_{p}) s_{p} (sin (ν_{p}) - sin (d_{λ})) + \underbrace{s_{p} (cosh (ξ_{p}) - cosh (ξ_{γ})) sin (d_{λ})}_{\geq 0} \\ \geq cosh (ξ_{p}) s_{p} \int_{d_{λ}}^{ν_{p}} cos (τ) d τ ≳ cosh (ξ_{p}) \frac{δ}{ln (λ)} . \end{matrix}

For the case $(ξ_{p} - ξ_{γ}) s_{p} < 0$ , we calculate:

\begin{matrix} - s_{p} Re (sinh (y_{p}) - sinh (y_{γ})) \\ = - s_{p} sinh (ξ_{p}) cos (ν_{p}) + s_{p} sinh (ξ_{γ}) cos (d_{λ}) \\ = - s_{p} sinh (ξ_{p}) (cos (ν_{p}) - cos (d_{λ})) + \underbrace{s_{p} (sinh (ξ_{γ}) - sinh (ξ_{p})) cos (d_{λ})}_{\geq 0} \\ \geq sinh (ξ_{p}) s_{p} \int_{d_{λ}}^{ν_{p}} sin (τ) d τ ≳ sinh (ξ_{p}) sin (d (θ) / 2) \frac{δ}{ln (λ)} . \end{matrix}

A26

We have, since $sinh (ξ_{p}) \in \frac{2}{π} {\tilde{M}}_{λ}$ and using Lemma A.6(i) that

\begin{matrix} sinh (ξ_{p}) & = \frac{Re (sinh (ξ))}{cos (ν_{p})} = \frac{Re (w_{λ})}{cos (ν_{p})} ≳ \frac{ln (λ) - c_{1}}{cos (ν_{p})} . \end{matrix}

Together with the previous assumption $Re (w_{λ}) \geq ε$ , this gives $cosh (ξ_{p}) \geq sinh (ξ_{p}) ≳ max (ln (λ) - c_{1}, ε)$ . With this, we can conclude that for $ln (λ) > 2 c_{1} :$

\begin{matrix} dist (γ_{d_{λ}}, w_{λ}) ≳ \frac{ln (λ) - c_{1}}{ln (λ)} ≳ 1 - \frac{c_{1}}{2 c_{1}} \geq \frac{1}{2} . \end{matrix}

For $ln (λ) < 2 c_{1}$ we get

\begin{matrix} dist (γ_{d_{λ}}, w_{λ}) ≳ ε \frac{1}{ln (λ)} > \frac{ε}{2 c_{1}} > 0 . \end{matrix}

We can now apply Lemma A.7 to get to the final result. The general case $κ \neq 1$ follows by dividing the equation $ψ_{σ, θ} (y) = λ$ by $κ$ . We can therefore just replace $λ$ by $λ / κ$ in all statements. $□$

Funding Information

Open access funding provided by Austrian Science Fund (FWF).


References

1. Antil H, Bartels S. Spectral approximation of fractional PDEs in image processing and phase field modeling. Comput. Methods Appl. Math. 2017;17(4):661–678. doi: 10.1515/cmam-2017-0039.
2. Bonito A, Borthagaray JP, Nochetto RH, Otárola E, Salgado AJ. Numerical methods for fractional diffusion. Comput. Vis. Sci. 2018;19(5–6):19–46. doi: 10.1007/s00791-018-0289-y.
3. Bonito A, Lei W, Pasciak JE. The approximation of parabolic equations involving fractional powers of elliptic operators. J. Comput. Appl. Math. 2017;315:32–48. doi: 10.1016/j.cam.2016.10.016.
4. Bonito A, Lei W, Pasciak JE. Numerical approximation of space–time fractional parabolic equations. Comput. Methods Appl. Math. 2017;17(4):679–705. doi: 10.1515/cmam-2017-0032.
5. Bonito A, Lei W, Pasciak JE. On sinc quadrature approximations of fractional powers of regularly accretive operators. J. Numer. Math. 2019;27(2):57–68. doi: 10.1515/jnma-2017-0116.
6. Banjai L, Melenk JM, Nochetto RH, Otárola E, Salgado AJ, Schwab C. Tensor FEM for spectral fractional diffusion. Found. Comput. Math. 2019;19(4):901–962. doi: 10.1007/s10208-018-9402-3.
7. Bonito A, Pasciak JE. Numerical approximation of fractional powers of elliptic operators. Math. Comp. 2015;84(295):2083–2110. doi: 10.1090/S0025-5718-2015-02937-8.
8. Bucur, C., Valdinoci, E.: Nonlocal Diffusion and Applications, volume 20 of Lecture Notes of the Unione Matematica Italiana. Springer, Cham; Unione Matematica Italiana, Bologna (2016)
9. Danczul T, Hofreither C. On rational Krylov and reduced basis methods for fractional diffusion. J. Numer. Math. 2022;30(2):121–140. doi: 10.1515/jnma-2021-0032.
10. Danczul, T., Hofreither, C., Schöberl, J.: A unified rational Krylov method for elliptic and parabolic fractional diffusion problems (2021)
11. Davis PJ, Rabinowitz P. Methods of numerical integration, Computer Science and Applied Mathematics. 2. Orlando: Academic Press Inc; 1984.
12. Danczul T, Schöberl J. A reduced basis method for fractional diffusion operators II. J. Numer. Math. 2021;29(4):269–287. doi: 10.1515/jnma-2020-0042.
13. Danczul T, Schöberl J. A reduced basis method for fractional diffusion operators I. Numer. Math. 2022;151(2):369–404. doi: 10.1007/s00211-022-01287-y.
14. Erdélyi, A., Magnus, W., Oberhettinger, F., Tricomi, F. G.: Higher Transcendental Functions. Vol. III. Robert E. Krieger Publishing Co., Inc., Melbourne (1981). Based on notes left by Harry Bateman, Reprint of the 1955 original
15. Gatto P, Hesthaven JS. Numerical approximation of the fractional Laplacian via $hp$ -finite elements, with an application to image denoising. J. Sci. Comput. 2015;65(1):249–270. doi: 10.1007/s10915-014-9959-1.
16. Gilboa G, Osher S. Nonlocal operators with applications to image processing. Multiscale Model. Simul. 2008;7(3):1005–1028. doi: 10.1137/070698592.
17. Hofreither C. A unified view of some numerical methods for fractional diffusion. Comput. Math. Appl. 2020;80(2):332–350. doi: 10.1016/j.camwa.2019.07.025.
18. Hofreither C. An algorithm for best rational approximation based on barycentric rational interpolation. Numer. Algorithms. 2021;88(1):365–388. doi: 10.1007/s11075-020-01042-0.
19. Kaltenbacher B, Rundell W. Regularization of a backward parabolic equation by fractional operators. Inverse Probl. Imaging. 2019;13(2):401–430. doi: 10.3934/ipi.2019020.
20. Kilbas AA, Srivastava HM, Trujillo JJ. Theory and Applications of Fractional Differential Equations North-Holland Mathematics Studies. Amsterdam: Elsevier Science B.V; 2006.
21. Lund J, Bowers KL. Sinc Methods for Quadrature and Differential Equations. Philadelphia: Society for Industrial and Applied Mathematics (SIAM); 1992.
22. Lischke, A., Pang, G., Gulian, M. et al.: What is the fractional Laplacian? A comparative review with new results. J. Comput. Phys., 404:109009, 62 (2020)
23. Mori, M.: Developments in the double exponential formulas for numerical integration. In Proceedings of the International Congress of Mathematicians, Vol. I, II (Kyoto, 1990), pp. 1585–1594. Mathematical Society, Japan, Tokyo (1991)
24. Meidner, D., Pfefferer, J., Schürholz, K., Vexler, B.: $hp$ -finite elements for fractional diffusion (2018)
25. Melenk JM, Rieder A. $hp$ -FEM for the fractional heat equation. IMA J. Numer. Anal. 2021;41(1):412–454. doi: 10.1093/imanum/drz054.
26. Melenk, J. M., Rieder, A.: An exponentially convergent discretization for space-time fractional parabolic equations using $hp$ -fem. to appear in IMA J. Numer. Anal. (2022). arxiv: 2202.02067
27. Nochetto RH, Otárola E, Salgado AJ. A PDE approach to fractional diffusion in general domains: a priori error analysis. Found. Comput. Math. 2015;15(3):733–791. doi: 10.1007/s10208-014-9208-x.
28. Nochetto RH, Otárola E, Salgado AJ. A PDE approach to fractional diffusion in general domains: a priori error analysis. Found. Comput. Math. 2015;15(3):733–791. doi: 10.1007/s10208-014-9208-x.
29. Nochetto RH, Otárola E, Salgado AJ. A PDE approach to space-time fractional parabolic problems. SIAM J. Numer. Anal. 2016;54(2):848–873. doi: 10.1137/14096308X.
30. Nakatsukasa Y, Sète O, Trefethen LN. The AAA algorithm for rational approximation. SIAM J. Sci. Comput. 2018;40(3):A1494–A1522. doi: 10.1137/16M1106122.
31. Rodino L. Linear Partial Differential Operators in Gevrey Spaces. River Edge: World Scientific Publishing Co. Inc; 1993.
32. Stenger F. Numerical Methods Based on Sinc and Analytic Functions. Springer Series in Computational Mathematics. New York: Springer; 1993.
33. Sun H, Zhang Y, Baleanu D, Chen W, Chen Y. A new collection of real world applications of fractional calculus in science and engineering. Commun. Nonlinear Sci. Numer. Simul. 2018;64:213–231. doi: 10.1016/j.cnsns.2018.04.019.
34. Takahasi, H., Mori, M.: Double exponential formulas for numerical integration. Publ. Res. Inst. Math. Sci., 9:721–741 (1973/74)
35. Vázquez, J. L.: The mathematical theories of diffusion: nonlinear and fractional diffusion. In Nonlocal and Nonlinear Diffusions and Interactions: New Methods and Directions, volume 2186 of Lecture Notes in Mathematics, pp. 205–278. Springer, Cham (2017)
