An almost symmetric Strang splitting scheme for the construction of high order composition methods

Lukas Einkemmer; Alexander Ostermann

doi:10.1016/j.cam.2014.04.015

. 2014 Dec 1;271(100):307–318. doi: 10.1016/j.cam.2014.04.015

An almost symmetric Strang splitting scheme for the construction of high order composition methods^☆

Lukas Einkemmer ^1,^⁎, Alexander Ostermann ¹

PMCID: PMC4144832 PMID: 25473146

Abstract

In this paper we consider splitting methods for nonlinear ordinary differential equations in which one of the (partial) flows that results from the splitting procedure cannot be computed exactly. Instead, we insert a well-chosen state $y_{⋆}$ into the corresponding nonlinearity $B (y) y$ , which results in a linear term $B (y_{⋆}) y$ whose exact flow can be determined efficiently. Therefore, in the spirit of splitting methods, it is still possible for the numerical simulation to satisfy certain properties of the exact flow. However, Strang splitting is no longer symmetric (even though it is still a second order method) and thus high order composition methods are not easily attainable. We will show that an iterated Strang splitting scheme can be constructed which yields a method that is symmetric up to a given order. This method can then be used to attain high order composition schemes. We will illustrate our theoretical results, up to order six, by conducting numerical experiments for a charged particle in an inhomogeneous electric field, a post-Newtonian computation in celestial mechanics, and a nonlinear population model and show that the methods constructed yield superior efficiency as compared to Strang splitting. For the first example we also perform a comparison with the standard fourth order Runge–Kutta methods and find significant gains in efficiency as well better conservation properties.

Keywords: Splitting methods, Non-symmetric Strang splitting, Approximate partial flows, Nonlinear ordinary differential equations, Application to the sciences

1. Introduction

If an ordinary differential equation can be cast in the form

y^{'} = A (y) + B (y),

where the exact solutions of $y^{'} = A (y)$ , denoted by $φ_{t}^{A} (y (0))$ , and $y^{'} = B (y)$ , denoted by $φ_{t}^{B} (y (0))$ , are known, or can be computed efficiently, splitting methods often provide a viable alternative compared to more traditional integration schemes (such as Runge–Kutta methods). In addition, if the flows generated by $A$ and $B$ preserve a given property of the ordinary differential equation, so does the splitting scheme. In some instances this can be used to construct schemes which conserve certain properties of the exact flow (see, e.g. [1]). If the exact partial flows are used, the Strang splitting scheme with step size $τ$ , i.e.

S_{τ} = φ_{\frac{τ}{2}}^{A} \circ φ_{τ}^{B} \circ φ_{\frac{τ}{2}}^{A},

is a symmetric scheme of second order. It is then possible to construct schemes of arbitrary (even) order by composition (see, e.g. [2]). For certain classes of ordinary differential equations more efficient schemes can be constructed (for the example of separable Hamiltonian systems see [3]). For a review of splitting methods we refer the reader to [4].

However, even if one of the partial flows cannot be computed exactly, in some circumstances splitting methods can still be applied. The systems of interest in this paper are ordinary differential equations which can be written as

y^{'} = A (y) + B (y) y + d,

where, as before, we assume that $y^{'} = A (y)$ can be solved exactly. However, no such assumption is made about $y^{'} = B (y) y + d$ . Instead, we assume that once a fixed value, say $y_{⋆}$ , is substituted, the flow corresponding to

y^{'} = B (y_{⋆}) y + d

(1)

can be computed efficiently. Let us duly note that $B (y_{⋆})$ is still a matrix which is then applied to $y$ . We denote the corresponding flow by $φ_{t}^{B (y_{⋆})}$ , which can also be written explicitly by employing the exponential and $ϕ_{1}$ functions. This yields

φ_{t}^{B (y_{⋆})} (y (0)) = e^{t B (y_{⋆})} y (0) + t ϕ_{1} (t B (y_{⋆})) d,

where

ϕ_{1} (z) = \frac{e^{z} - 1}{z} .

Note that if we apply the Strang splitting scheme to $y^{'} = A (y) + B (y_{0}) y + d$ , a numerical method results that is only of order 1. This is intuitively clear, as $B$ is evaluated at the left endpoint only, and can be verified by a simple argument based on the Taylor expansion of the scheme. However, in the literature an alternative scheme has been proposed in the context of partial differential equations (see e.g. [5]) that is usually referred to as Strang splitting also and is given by

y_{1 / 2} = φ_{\frac{τ}{2}}^{B (y_{0})} \circ φ_{\frac{τ}{2}}^{A} (y_{0})

(2a)

y_{1} = M_{τ} (y_{0}) = φ_{\frac{τ}{2}}^{A} \circ φ_{τ}^{B (y_{1 / 2})} \circ φ_{\frac{τ}{2}}^{A} (y_{0}) .

(2b)

Open in a new tab

Note that since we use an approximation of order 1 to $B (y (τ / 2))$ , this is in fact a method of order two. Consistent with the literature we will, from now on, refer to this scheme as Strang splitting.

For a symmetric scheme it must hold that (see, e.g. [1, Chap. II.3])

M_{- τ} \circ M_{τ} = I,

where $I$ denotes the identity. Now

M_{- τ} \circ M_{τ} (y_{0}) = φ_{- \frac{τ}{2}}^{A} \circ φ_{- τ}^{B ({\tilde{y}}_{1 / 2})} \circ φ_{τ}^{B (y_{1 / 2})} \circ φ_{\frac{τ}{2}}^{A} (y_{0}),

where

{\tilde{y}}_{1 / 2} = φ_{- \frac{τ}{2}}^{B (y_{1})} \circ φ_{- \frac{τ}{2}}^{A} (y_{1}) .

Inserting (2) shows that

{\tilde{y}}_{1 / 2} = φ_{- \frac{τ}{2}}^{B (y_{1})} \circ φ_{τ}^{B (y_{1 / 2})} \circ φ_{\frac{τ}{2}}^{A} (y_{0}) .

Therefore, the Strang splitting scheme is symmetric if and only if

φ_{- \frac{τ}{2}}^{B (y_{1})} \circ φ_{τ}^{B (y_{1 / 2})} = φ_{\frac{τ}{2}}^{B (y_{0})},

which is not satisfied in general.

Due to the lost symmetry, the corresponding triple jump scheme is only of order 3 (not of order 4, as one might naively expect). From this consideration it is also clear that further composition in the same manner does not result in schemes of arbitrary order (as is the case with the classical Strang splitting method based on exact flows).

In Section 2, we will propose a modified Strang splitting scheme that, in addition of being second order accurate, can be iterated to give a scheme that is symmetric up to a predetermined order $q$ (as made precise in Definition 1). Therefore, the usual construction of composition methods of arbitrary (even) order can be accomplished in this context (this is shown in Section 3). In Section 4 we show that for certain stiff problems the schemes constructed in this paper can be employed as well. In addition, we discuss in some detail the numerical results for three examples (namely for a charged particle in an inhomogeneous electric field, a post-Newtonian computation of celestial mechanics, and a nonlinear population model) in Section 5. Finally, we conclude in Section 6.

2. An almost symmetric Strang splitting scheme

Let us start from the Lie splitting scheme

y_{1 / 2} = L_{\frac{τ}{2}} (y_{0}) = φ_{\frac{τ}{2}}^{B (y_{0})} \circ φ_{\frac{τ}{2}}^{A} (y_{0}) .

(3)

We recall that the adjoint of a scheme $L_{τ}$ , which we denote by $L_{τ}^{*}$ , is defined as $L_{τ}^{*} = L_{- τ}^{- 1}$ . Therefore, to give a representation of the adjoint scheme corresponding to (3) we interchange $y_{1 / 2}$ with $y_{0}$ and $τ$ with $- τ$ . This yields

y_{1} = φ_{\frac{τ}{2}}^{A} \circ φ_{\frac{τ}{2}}^{B (y_{1})} (y_{1 / 2}),

(4)

i.e. $y_{1} = L_{\frac{τ}{2}}^{*} (y_{1 / 2})$ . Now the (implicit) Strang splitting scheme

S_{τ} = L_{\frac{τ}{2}}^{*} \circ L_{\frac{τ}{2}}

is of second order and symmetric by construction (see, e.g. [1]). However, it would require the solution of an implicit equation in each step, which is prohibitively expensive. Since Eq. (4) has the form of a fixed-point problem, we can employ fixed-point iteration to approximate $L_{\frac{τ}{2}}^{*}$ . We will denote the resulting scheme by $S_{τ}^{(i)}$ , where $i$ is the number of iterations that are conducted. Note that during the iteration $y_{1 / 2}$ is fixed; that is, only the two evolution operators given explicitly in Eq. (4) are applied at each step in the fixed-point iteration. As an initial value for the fixed-point iteration we employ $y_{1 / 2}$ (however, any approximation of order $τ$ to $y_{1}$ would constitute a possible choice) and therefore

S_{τ}^{(1)} (y_{0}) = φ_{\frac{τ}{2}}^{A} \circ φ_{\frac{τ}{2}}^{B (y_{1 / 2})} (\underset{y_{1 / 2}}{\underset{︸}{φ_{\frac{τ}{2}}^{B (y_{0})} \circ φ_{\frac{τ}{2}}^{A} (y_{0})}}) .

Clearly we need at least two iterations such that the scheme is of second order. There is no hope that $S_{τ}^{(i)}$ is symmetric. However, we will show that it is almost symmetric as defined below.

Before proceeding, let us note that we perform the iteration in the context of the Strang splitting scheme (a method of order two) only. More specifically, the order of the method is not raised by performing the iteration (for $i \geq 2$ ). This is not the case for the iterative operator splitting (as described in [6], for example). Furthermore, the method described in this paper only requires an algorithm to efficiently compute the solution of (1); it neither makes such an assumption for the nonlinear problem nor does it introduces an additional inhomogeneous part (as is the case in [6] and references therein).

Definition 1

A one-step method $Φ_{τ}$ is symmetric of order $q$ if

$Φ_{τ}^{*} = Φ_{τ} + O (τ^{q + 1}),$ (5)

where $Φ_{τ}^{*}$ is the adjoint method of $Φ_{τ}$ .

Next let us show that the fixed-point iteration described above actually yields a scheme that is symmetric of order $i$ .

Theorem 1

Suppose that $B (\cdot)$ is Lipschitz continuous. Then the Strang splitting scheme $S_{τ}^{(i)}$ is symmetric of order $i$ .

Proof

We have to show that the fixed-point problem (4), i.e., $y = F (y)$ with

$F (y) = φ_{\frac{τ}{2}}^{A} \circ φ_{\frac{τ}{2}}^{B (y)} (y_{1 / 2})$

has a unique solution in a sufficiently small neighborhood of $y_{1 / 2}$ . First note that there exists a constant $C > 0$ such that

$‖ F (y_{1 / 2}) - y_{1 / 2} ‖ \leq C τ$

for $τ$ sufficiently small. Now, let $D = 2 C$ and denote by $Ω_{D}$ the closed ball with center $y_{1 / 2}$ and radius $D τ$ . Then, for all $u, v \in Ω_{D}$ it holds

$‖ F (u) - F (v) ‖ \leq ‖ φ_{\frac{τ}{2}}^{A} ‖ \cdot ‖ φ_{\frac{τ}{2}}^{B (u)} (y_{1 / 2}) - φ_{\frac{τ}{2}}^{B (v)} (y_{1 / 2}) ‖,$ (6)

where $‖ φ_{\frac{τ}{2}}^{A} ‖ = 1 + O (τ)$ denotes the Lipschitz constant of $φ_{\frac{τ}{2}}^{A} (\cdot)$ on the bounded set

$Ω = ⋃_{y \in Ω_{D}} {φ_{\frac{τ}{2}}^{B (y)} (y_{1 / 2})} .$

Further note that

$‖ φ_{\frac{τ}{2}}^{B (u)} (y_{1 / 2}) - φ_{\frac{τ}{2}}^{B (v)} (y_{1 / 2}) ‖ \leq C τ ‖ B (u) - B (v) ‖,$ (7)

which is a direct consequence of the variation-of-constants formula (note that the constant $C$ does depend only on the norm of $y_{1 / 2}$ and $d$ ). By combining the bounds (6), (7) with the Lipschitz continuity of $B (\cdot)$ , we obtain that $F$ is Lipschitz continuous on $Ω_{D}$ with a Lipschitz constant of order $τ$ . Moreover, using the triangle inequality we get the bound

$‖ F (y) - y_{1 / 2} ‖ \leq ‖ F (y) - F (y_{1 / 2}) ‖ + ‖ F (y_{1 / 2}) - y_{1 / 2} ‖ \leq D τ$

for all $y \in Ω_{D}$ and $τ$ sufficiently small. This shows that $F$ maps the closed ball $Ω_{D}$ onto itself. Consequently, by Banach’s fixed-point theorem, $F$ has a unique fixed-point $y_{1}$ in $Ω_{D}$ , which is the locally unique solution of (4).

Since $S_{τ}^{(1)} (y_{0}) = F (y_{1 / 2})$ and $S_{τ} (y_{0}) = F (y_{1})$ , we also obtain that

$‖ S_{τ}^{(1)} (y_{0}) - S_{τ} (y_{0}) ‖ = ‖ F (y_{1 / 2}) - F (y_{1}) ‖ \leq L τ ‖ y_{1 / 2} - y_{1} ‖ \leq L D τ^{2} .$

Moreover, as the Lipschitz constant of $F$ is of order $τ$ this implies

$S_{τ}^{(i)} (y_{0}) = S_{τ} (y_{0}) + O (τ^{i + 1}) .$ (8)

Also recall that $S_{τ}$ is a symmetric scheme by construction. Thus, (8) proves the desired result. □

Thus, we have established that we can iteratively compute a second order method that is symmetric to arbitrary order. Moreover, the computational effort is linear in the desired order of symmetry.

In the next section we will discuss how the scheme described here can be used to construct composition methods of arbitrary (even) order.

3. Composition methods

It is well-known (see, e.g. [1, Chap. II.4]) that if a symmetric one-step method $Φ_{τ}$ of even order $r$ is composed in the following manner

Φ_{γ_{3} τ} \circ Φ_{γ_{2} τ} \circ Φ_{γ_{1} τ},

(9)

where

γ_{1} = γ_{3} = \frac{1}{2 - 2^{1 / (r + 1)}}, γ_{2} = - 2^{1 / (r + 1)} γ_{1},

(10)

then a one-step method of order $r + 2$ results. Thus, we can construct methods of arbitrary even order $p$ , where the cost, in terms of a single evaluation of the corresponding second order method, is given by $3^{p / 2 - 1}$ . For $p = 4$ , for example, the corresponding method is the well-known triple jump scheme.

The justification for this procedure is given by Theorem 4.1 in [1]. We will now generalize that result for methods that are (only) symmetric of order $q$ .

Lemma 2

Suppose that the one-step method $Φ_{τ}$ is of odd order $p$ and symmetric of order $q$ with $q \geq p + 1$ , see (5). Then, the method is in fact of order $p + 1$ .

Proof

Let us denote the exact flow by $φ_{τ}$ . Since the method has order $p$ ,

$Φ_{τ} (y_{0}) - φ_{τ} (y_{0}) = C (y_{0}) τ^{p + 1} + O (τ^{p + 2})$

and further the adjoint method satisfies

$Φ_{τ}^{*} (y_{0}) - φ_{τ} (y_{0}) = {(- 1)}^{p} C (y_{0}) τ^{p + 1} + O (τ^{p + 2}) .$

Using now assumption (5), with $p$ odd, we get

${(- 1)}^{p} C (y_{0}) τ^{p + 1} = C (y_{0}) τ^{p + 1} + O (τ^{q + 1}) + O (τ^{p + 2})$

and thus $C (y_{0}) = 0$ if $q \geq p + 1$ . Therefore, we deduce that $Φ_{τ}$ is of order $p + 1$ . □

As a corollary we get the desired order for the composition methods as well as the number of iterations we have to perform. As we will see in Section 5.1 this is a worst case estimate that can be improved upon for some applications.

Corollary 3

The composition method constructed from $S_{τ}^{(i)}$ by using $ℓ$ compositions, as described in Eq. (9), results in a scheme of order $p = 2 + 2 ℓ$ if $i \geq p$ .

Proof

If $i \geq 2$ then $S_{τ}^{(i)}$ is of order 2 by construction. Thus, the composition method is at least of order 3. However, from Lemma 2 we know that if $i \geq 4$ this method is in fact of order 4. Since the composition given in (9) is symmetric, a method which is symmetric of order $q$ retains this property if composed in the manner described. Therefore, we can complete the proof by induction. □

Before we turn our attention to the applications given in the next section, let us investigate the (worst case) computational cost of the composition methods considered in this section. In Table 1 the number of computations of either $φ_{τ}^{A}$ or $φ_{τ}^{B (q_{⋆})}$ is given for the triple jump scheme as well as the composition of the triple jump scheme (which we call composite 9). In addition, Table 1 lists an abbreviation of all the schemes discussed (which we will employ heavily in the next section). The methods constructed here will be referred to as iterated.

Table 1.

The effort in number of (possibly approximated) partial flows that have to be computed is listed for a number of composition schemes. In addition, the abbreviations used for the composition methods employed in the next section are given.

Method	Abbreviation	Order	Iterations	Effort
Strang (2)	S	2	–	4
Iterated Strang	IS	2	2	6

Triple jump	TJ	3	–	12
Iterated triple jump	ITJ	4	4	30

Composite 9	C9	3	–	36
Iterated composite 9	IC9	6	6	126

Open in a new tab

Note that even though the high order methods given in Table 1 are about three times as costly as conventional composition methods (which are employed for separable Hamiltonian systems, for example), we are now able to construct methods of arbitrary (even) order. We will show in Section 5 that for realistic problems this can still result in a considerable gain in performance (as compared to the more commonly employed Strang splitting scheme, for example).

In addition, it should be duly noted that similar to conventional composition methods, the schemes introduced here conserve all invariants that are invariants of the two partial flows as well. To conclude this section, let us remark that the schemes introduced here do not require any modification in the code used to implement the numerical solution of the partial flows. That is, if for a given problem the Lie or Strang splitting scheme is already implemented, the generalization to the methods discussed here is almost immediate.

4. Extension to stiff problems

In this section we show that in certain circumstances we can extend our analysis to the stiff case. Let us consider, for example, an ordinary differential equation for which the operator $B$ , as defined in Section 1, can be written as

B (y) = b_{S} + b_{N} (y),

(11)

i.e., the nonlinear operator $B$ can be split in a stiff linear part and a non-stiff nonlinear part. In this case we can show that the speed of convergence of the fixed-point iteration in Theorem 1 (see Section 2) is independent of the stiff part. This is the content of the following corollary.

Corollary 4

Suppose that $B (\cdot)$ can be cast into the form (11) with $b_{N} (\cdot)$ Lipschitz continuous. Then the Strang splitting scheme $S_{τ}^{(i)}$ is symmetric of order $i$ and the error of the method can be estimated independently of $‖ b_{S} ‖$ .

Proof

We employ the variation-of-constants formula to get

$φ_{\frac{τ}{2}}^{B (u)} (y_{1 / 2}) - φ_{\frac{τ}{2}}^{B (v)} (y_{1 / 2}) = \int_{0}^{\frac{τ}{2}} φ_{\frac{τ}{2} - σ}^{b_{S}} (b_{N} (u) y_{1 / 2} - b_{N} (v) y_{1 / 2}) d σ$

which allows us to estimate

$‖ φ_{\frac{τ}{2}}^{A} \circ φ_{\frac{τ}{2}}^{B (u)} (y_{1 / 2}) - φ_{\frac{τ}{2}}^{A} \circ φ_{\frac{τ}{2}}^{B (v)} (y_{1 / 2}) ‖ \leq C τ ‖ b_{N} (u) - b_{N} (v) ‖ ‖ y_{1 / 2} ‖,$

where $C$ depends on $‖ φ_{\frac{τ}{2}}^{A} ‖$ and $‖ φ_{\frac{τ}{2}}^{b_{S}} ‖$ but not on $‖ b_{S} ‖$ .

The proof is completed by employing the same arguments used in the proof of Theorem 1. □

Therefore, we have shown that, in the situation described, the step size can be chosen independently of the stiff part of the problem. This is of interest in some applications, where the inclusion of $b_{S}$ in the operator $A$ would result in partial flows that are more difficult to compute, or where a conservation property of the problem under consideration is destroyed if $b_{S}$ is treated separately from the nonlinearity $b_{N}$ .

To conclude this section let us briefly discuss the Brusselator (which is described in [7, Chap. IV.1]). In this case we have a discretized diffusion–reaction equation, where the flow corresponding to $A$ can be computed very efficiently by employing fast Fourier transform techniques. The remaining stiffness in the system is then only due to the linear part of the flow corresponding to $B$ . In addition, an analytical expression of the partial flow corresponding to the nonlinearity is not easily attainable (due to the coupling of the equations involved). Therefore, the problem is of the form considered in this section. The implementation and analysis of such methods in the context of partial differential equations is the subject of further research.

5. Applications

In this section, we discuss three applications of the schemes constructed in the previous sections. First, we consider a Hamiltonian system that describes the movement of a charged particle in an inhomogeneous electromagnetic field. This system will turn out to require fewer iterations for a desired order of symmetry as compared to the worst case described in Section 3. Second, we consider a post-Newtonian approximation to the relativistic Kepler problem. Also in this case we will observe that fewer iterations are necessary, compared to the worst case, to construct schemes of order four and six. Third, a nonlinear population model is considered. This model, in fact, exhibits the worst case behavior as outlined in Section 3. Nevertheless, we can show, by conducting numerical experiments, that in all three examples the use of high order methods results in a significant performance increase.

In all the simulations conducted, we compute a reference solution by using the (classic) Strang splitting scheme and a sufficiently small (experimentally determined) step size.

5.1. A charged particle in an inhomogeneous magnetic field

The equations of motion of a charged particle in an external electromagnetic field are given by the Lorentz force law

m \ddot{x} = q (E + v \times B),

where $x, v, q$ are the particle’s position, velocity, and charge, respectively; the electric field is denoted by $E$ and the magnetic field by $B$ (both can depend on the position of the particle under consideration, i.e. on $x$ ). This differential equation can be reformulated as a Hamiltonian system with Hamiltonian

H = \frac{p^{2}}{2 m} + q ϕ,

where the electric potential $ϕ$ is related to the electric field by $E = - \nabla ϕ$ . We should note that the momentum $p = m v$ used above is not the conjugate variable to the position (as would be the case in the electrostatic limit).

The equations of motion in this framework are then given by

\dot{x} = p / m \dot{p} = F (x) + Ω (x) p,

where

F = q E, Ω = [\begin{matrix} 0 & {\tilde{B}}_{3} & - {\tilde{B}}_{2} \\ - {\tilde{B}}_{3} & 0 & {\tilde{B}}_{1} \\ {\tilde{B}}_{2} & - {\tilde{B}}_{1} & 0 \end{matrix}]

with ${\tilde{B}}_{i} = q B_{i} / m$ (see e.g. [8]). To set up the splitting, we use

A (x, p) = [\begin{matrix} 0 \\ F (x) \end{matrix}], B (x_{⋆}, p_{⋆}) = [\begin{matrix} 0 & \frac{1}{m} I \\ 0 & Ω (x_{⋆}) \end{matrix}], d = 0

and therefore

φ_{τ}^{A} (x_{0}, p_{0}) = [\begin{matrix} x_{0} \\ p_{0} + τ F (x_{0}) \end{matrix}]

whereas the second partial flow can be computed exactly once we substitute $x_{⋆}$ (and thus consider $Ω$ to be constant). The analytic expression is given by

φ_{τ}^{B (x_{⋆})} (x_{0}, p_{0}) = [\begin{matrix} \frac{1}{m} \int_{0}^{τ} exp (s Ω (x_{⋆})) p_{0} d s + x_{0} \\ exp (τ Ω (x_{⋆})) p_{0} \end{matrix}] .

(12)

For actual computations we can use

exp (τ Ω) = I + \frac{sin τ {‖ \tilde{B} ‖}_{2}}{{‖ \tilde{B} ‖}_{2}} Ω + \frac{1 - cos τ {‖ \tilde{B} ‖}_{2}}{{‖ \tilde{B} ‖}_{2}^{2}} Ω^{2}

and

\int_{0}^{τ} exp (s Ω) d s = τ I + \frac{1 - cos τ {‖ \tilde{B} ‖}_{2}}{{‖ \tilde{B} ‖}_{2}^{2}} Ω + \frac{τ {‖ \tilde{B} ‖}_{2} - sin τ {‖ \tilde{B} ‖}_{2}}{{‖ \tilde{B} ‖}_{2}^{3}} Ω^{2},

where both $Ω$ and $\tilde{B}$ depend on $x_{⋆}$ ; this dependence is, for the sake of brevity, omitted in the notation used. Thus, we have fulfilled all the requirements outlined in Section 1. Note that for a uniform magnetic field a number of symmetric second order schemes are available (see, e.g. [9]). However, for non-uniform magnetic fields such schemes cannot be employed to get higher order schemes by composition.

Let us now discuss a peculiarity of the system under consideration. As $B (\cdot)$ does only depend on the position component of the phase space and the evolution operator $φ_{τ}^{A}$ does not depend on the momentum (which is a consequence of the specific splitting conducted here), we have

\int_{0}^{τ} exp (s Ω (x_{2})) p_{0} d s - \int_{0}^{τ} exp (s Ω (x_{1})) p_{0} d s = O (τ^{2} ‖ Ω (x_{2}) - Ω (x_{1}) ‖) .

That is, the Lipschitz constant for our fixed-point iteration is of order $τ^{2}$ . However, such a result is not entirely unexpected as it is quite common that the position is integrated in time with a higher order than the momentum component (this is also true for the popular leapfrog scheme, for example). Therefore, any resulting approximation to $y_{1} = (x_{1}, p_{1})$ is of order $2 ℓ + 1$ , for some $ℓ \in N$ , in position and, as we can easily deduce from Eq. (12), the momentum is then approximated up to order $2 ℓ$ . Therefore, to use the notation from Section 2 we have a symmetric scheme of order $q = 2 ℓ - 1$ .

Thus, three iterations are sufficient to get a fourth order scheme whereas four iterations suffice to get a sixth order scheme. This is clearly below the worst case behavior discussed in Section 3.

We now turn our attention to the presentation of the numerical simulations conducted. As an example we will use an electric field configuration that corresponds to an ideal Penning trap (such as described in [10]). However, we will use a magnetic field that is not homogeneous in space. Further we will use natural units for the problem, i.e., $m$ and $q$ are set to unity. For the ideal Penning trap the electric potential is given by

ϕ (x) = \frac{1}{20} (2 x_{3}^{2} - x_{1}^{2} - x_{2}^{2}) .

In order to impose an inhomogeneous magnetic field, we use

B (x) = {[\frac{1}{10} x_{3}, \frac{1}{10} x_{2}, 100 sin x_{3} + x_{2}]}^{T} .

We consider an initial value in both position as well as momentum close to zero and evolve the system until time $T = 100$ . In Fig. 1 we show that the numerical experiments match the expected order for the splitting schemes discussed in Section 3. Note, however, that for the composite 9 scheme only three iterations are required to reach order six (instead of the four predicted above). This is a clear indication at the presence of further simplifications (in the system under consideration). Now let us turn our attention to run time considerations. In Fig. 2 the run time is plotted against the achieved accuracy. It is clear from the figure that even for moderate precision requirements, high order methods provide a significant advantage over the more commonly employed Strang splitting scheme.

Fig. 2 — Run time as a function of the achieved accuracy for a charged particle in an inhomogeneous magnetic field (the results for various splitting schemes are shown). The abbreviations for the different numerical schemes are listed in Table 1. For comparison, the standard Runge–Kutta scheme of order four (RK4) is also shown.

The system under consideration is Hamiltonian; therefore, the energy is exactly conserved. This is, in general, no longer true if a numerical scheme is considered. However, schemes can be engineered which, to machine precision, conserve the energy. It is clear that this is not true in this case as the partial flows do not conserve the energy (Fig. 3 confirms this behavior). However, the error in energy is still four orders of magnitude below the integration error made by the scheme under consideration.

Fig. 3 — Energy conservation for a charged particle in an inhomogeneous magnetic field, where the iterated triple jump scheme ( $i = 3$ ) with $τ = 0.01$ is employed. This results in an error (in the infinity norm) in the position/momentum that is approximately 3⋅10⁻⁴. For comparison, the standard Runge–Kutta method of order four is shown. There the step size $τ = 0.0015$ is chosen, which results in a comparable accuracy and twice the run time. Note, however, that the error in energy is better by an order of magnitude for the iterated triple jump scheme.

To end this section let us note that the discussion here can easily be generalized to multiple particles. This is still true if particle–particle interactions (via the electric or magnetic field, for example) are considered.

5.2. Post-Newtonian Kepler problem

As a second example we consider the post-Newtonian¹ approximation to the (general) relativistic $n$ -body problem. In this section we will limit ourselves to the relativistic Kepler problem in the Post-Newtonian approximation up to terms of order $1 / c^{4}$ , where $c$ denotes the speed of light. The equations of motions for the first body are then given by (see, e.g. [11])

{\dot{r}}_{1} = v_{1}

{\dot{v}}_{1} = - \frac{μ_{2}}{r_{12}^{2}} n_{12} + \frac{1}{c^{2}} (5 \frac{μ_{1} μ_{2}}{r_{12}^{3}} + 4 \frac{μ_{2}^{2}}{r_{12}^{3}}) n_{12} + \frac{1}{c^{2}} \frac{μ_{2}}{r_{12}^{2}} (\frac{3}{2} {(n_{12} \cdot v_{2})}^{2} - v_{1}^{2} + 4 v_{1} \cdot v_{2} - 2 v_{2}^{2}) n_{12} + \frac{1}{c^{2}} \frac{μ_{2}}{r_{12}^{2}} (4 n_{12} \cdot v_{1} - 3 n_{12} \cdot v_{2}) (v_{1} - v_{2}),

where $r_{12} = {‖ r_{1} - r_{2} ‖}_{2}$ , $n_{12} = \frac{1}{r_{12}} (r_{1} - r_{2})$ , and $μ_{i} = G m_{i}$ is the standard gravitational parameter (which can be computed from the gravitational constant $G$ and the mass of the body $m_{i}$ ). The equations of motion for the second body can then be determined by interchanging the indices corresponding to the first and the second body in the equations of motion stated above. Let us note that the Newtonian equations of motion are recovered in the limit as $c \to \infty$ (in this case only the first force term remains). The structure of the equations of motion naturally lends itself to the splitting scheme described in Section 1. To that end let us define

A (r_{1}, v_{1}, r_{2}, v_{2}) = [\begin{matrix} 0 \\ - \frac{μ_{2}}{r_{12}^{2}} n_{12} + \frac{1}{c^{2}} (5 \frac{μ_{1} μ_{2}}{r_{12}^{3}} + 4 \frac{μ_{2}^{2}}{r_{12}^{3}}) n_{12} \\ 0 \\ \frac{μ_{1}}{r_{12}^{2}} n_{12} - \frac{1}{c^{2}} (5 \frac{μ_{1} μ_{2}}{r_{12}^{3}} + 4 \frac{μ_{1}^{2}}{r_{12}^{3}}) n_{12} \end{matrix}]

and

B (r_{1 ⋆}, v_{1 ⋆}, r_{2 ⋆}, v_{2 ⋆}) = [\begin{matrix} 0 & 1 & 0 & 0 \\ K_{1} & L_{1} & - K_{1} & - L_{1} \\ 0 & 0 & 0 & 1 \\ K_{2} & L_{2} & - K_{2} & - L_{2} \end{matrix}],

where

K_{1} = \frac{1}{c^{2}} \frac{μ_{2}}{r_{12 ⋆}^{3}} (\frac{3}{2} {(n_{12 ⋆} \cdot v_{2 ⋆})}^{2} - v_{1 ⋆}^{2} + 4 v_{1 ⋆} \cdot v_{2 ⋆} - 2 v_{2 ⋆}^{2}), L_{1} = \frac{1}{c^{2}} \frac{μ_{2}}{r_{12 ⋆}^{2}} (4 n_{12 ⋆} \cdot v_{1 ⋆} - 3 n_{12 ⋆} \cdot v_{2 ⋆}) .

The corresponding quantities $K_{2}$ and $L_{2}$ can once again be obtained by reversing the indices corresponding to the first and second body. It is clear that the flows corresponding to both $A$ and $B (r_{1 ⋆}, v_{1 ⋆}, r_{2 ⋆}, v_{2 ⋆})$ , as defined above, can be computed efficiently.

In the subsequent discussion, we will employ the SI system of units (for convenience we will not state the units explicitly). Let us consider the orbit of two celestial objects with $μ_{1} = 10^{26}$ , i.e., approximately $0.75 \cdot 10^{6}$ solar masses, and $μ_{2} = 10^{20}$ . We initialize the first body with zero velocity and the second one with $v_{2} = 5.898 \cdot 10^{6}$ and place it at the perihelion of the orbit which we determine to be $r_{2} = 4.6 \cdot 10^{10}$ , i.e., a mercury like orbit. We integrate the equations of motion up to the final time $T = 10^{6}$ , which corresponds to about fifteen orbits. The order plots for a number of schemes are shown in Fig. 4.

Fig. 4 — Order plots for a post-Newtonian Kepler problem (the results for various splitting schemes are shown). The lines drawn are, from top to bottom, of slope 2, 3, 3, 4, and 6 respectively. The error is scaled to the perihelion (the point of least distance between the two bodies) of the orbit. The abbreviations for the different numerical schemes are listed in Table 1.

The number of iterations necessary for the iterated triple jump scheme (ITJ) as well as the iterated composite 9 scheme (IC9) have been determined by conducting numerical experiments. A theoretical analysis is beyond the scope of this paper. Note, however, that similar to the previous example we do not observe the worst case behavior described in Section 3.

In addition, let us investigate the run time as a function of the error. This is shown in Fig. 5. As is apparent from the figure, the fourth order iterated triple jump scheme (ITJ) is superior to both the third order triple jump scheme and the Strang splitting scheme. For medium accuracy requirement it becomes advantageous to employ the sixth order IC9 scheme.

5.3. A nonlinear population model

As a third example, we consider a nonlinear population model (the so-called May model) that is given by

x^{'} = a x (1 - \frac{x}{b}) - \frac{c x y}{x + d} y^{'} = e y - \frac{y^{2}}{f x},

where in line with [12] we use $a = 0.6$ , $b = 10.0$ , $c = 0.5$ , $d = 1.0$ , $e = 0.1$ , and $f = 2.0$ . In this context, $x$ is interpreted as a (appropriately scaled) prey population while $y$ represents the predator population. We can argue that such an equation lends itself to splitting as if interaction effects are neglected we are usually left with either an exponential growth model or a logistic equation in each variable. In fact, this is the case for the equation stated above, since the decoupled system can be written as

[\begin{matrix} x^{'} \\ y^{'} \end{matrix}] = A (x, y) = [\begin{matrix} a x (1 - \frac{x}{b}) \\ e y \end{matrix}],

of which an analytical solution can easily be found; it is given by

x (t) = \frac{b e^{a t}}{e^{a t} - 1 + \frac{b}{x (0)}} y (t) = e^{e t} y (0) .

To complete our splitting scheme, we set

B (x_{⋆}, y_{⋆}) [\begin{matrix} x \\ y \end{matrix}] = [\begin{matrix} - \frac{c y_{⋆}}{x_{⋆} + d} x \\ - \frac{y_{⋆}}{f x_{⋆}} y \end{matrix}]

which once again is exactly the situation described in Section 1.

One might rightfully object that our splitting approach is somewhat artificial as we can simply add the $A$ operator to the $B$ operator. After all, the resulting operator still has the desired form and splitting would not be necessary. The only potential advantage of using the splitting scheme is that we can solve the flow corresponding to $A$ exactly. Although numerical experiments demonstrate that this can result in a significant increase in performance, the goal of this section is to show that the number of iterations given in Section 3 constitutes a sharp bound.

To investigate that behavior let us choose the initial values $x (0) = 100$ and $y (0) = 20$ , i.e., the prey population is significantly larger than the predator population. In Fig. 6 we plot the run time as a function of the error for a number of schemes discussed so far (we integrate up to $t = 5$ ).

Fig. 6 — Run time as a function of the achieved accuracy for the May model. The abbreviations for the different numerical schemes are listed in Table 1.

It is also clear that contrary to the example discussed in the previous section, high order schemes (beyond triple jump) are only advantageous if very high precision is needed; however, this behavior is not surprising as the solution approaches a steady state quite rapidly.

Therefore, let us now turn our attention to the number of iterations necessary to obtain a given order. In Fig. 7 we can clearly see that the behavior described in Section 3 is regained. Thus, the system under consideration does not possess the simplifying property we discussed in Section 5.1 for the charged particle and in Section 5.2 for the post-Newtonian approximation. We can also conclude that the number of iterations given in Section 3 constitutes a sharp bound.

As in the previous section we note that generalizations, for example the inclusion of multiple predator species, can be easily accomplished in the context of the schemes discussed.

6. Conclusion

Besides providing a theoretical analysis, we have conducted numerical simulations that demonstrate the applicability of composition schemes to three examples of interest in the sciences. In all of these examples we have demonstrated that, depending on the accuracy requirement, the high order schemes constructed in this paper can provide significant gains in performance compared to Strang splitting. For a charged particle in an inhomogeneous field, we have also demonstrated increased efficiency as well as better conservation properties as compared to the standard fourth order Runge–Kutta method.

Footnotes

^☆

This work is supported by the Austrian Science Fund (FWF)—project id: P25346.

The equations of motion in the post-Newtonian approximation are determined by expanding the field equations of general relativity for point objects in powers of $1 / c^{2}$ .

Contributor Information

Lukas Einkemmer, Email: lukas.einkemmer@uibk.ac.at.

Alexander Ostermann, Email: alexander.ostermann@uibk.ac.at.

References

1.Hairer E., Lubich C., Wanner G. Springer-Verlag; Berlin, Heidelberg: 2006. Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations. [Google Scholar]
2.McLachlan R. On the numerical integration of ordinary differential equations by symmetric composition methods. SIAM J. Sci. Comput. 1995;16:151–168. [Google Scholar]
3.Yoshida H. Construction of higher order symplectic integrators. Phys. Lett. A. 1990;150:262–268. [Google Scholar]
4.McLachlan R., Quispel G. Splitting methods. Acta Numer. 2002;11:341–434. [Google Scholar]
5.Cheng C., Knorr G. The integration of the Vlasov equation in configuration space. J. Comput. Phys. 1976;22:330–351. [Google Scholar]
6.Faragó I. Some notes on the iterative operator splitting. J. Appl. Comput. Math. 2013;2:e129. [Google Scholar]
7.Hairer E., Wanner G. second ed. Springer-Verlag; Berlin: 1996. Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems. [Google Scholar]
8.Knapp C., Kendl A., Koskela A., Ostermann A. Splitting methods for time integration of trajectories in magnetic fields. Phys. Rev. E. 2014 doi: 10.1103/PhysRevE.92.063310. submitted for publication. [DOI] [PubMed] [Google Scholar]
9.Spreiter Q., Walter M. Classical molecular dynamics simulation with the velocity Verlet algorithm at strong external magnetic fields. J. Comput. Phys. 1999;152:102–119. [Google Scholar]
10.Kretzschmar M. Particle motion in a Penning trap. Eur. J. Phys. 2000;12:240. [Google Scholar]
11.Blanchet L. On the two-body problem in general relativity. C. R. Acad. Sci., Paris IV. 2001;22:1343–1352. [Google Scholar]
12.J. Callahan, L. Senechal, D. O’Shea, H. Polachek, K. Hoffman, Calculus in Context, http://www.math.smith.edu/Local/cicintro/book.pdf, 1993.

[br000005] 1.Hairer E., Lubich C., Wanner G. Springer-Verlag; Berlin, Heidelberg: 2006. Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations. [Google Scholar]

[br000010] 2.McLachlan R. On the numerical integration of ordinary differential equations by symmetric composition methods. SIAM J. Sci. Comput. 1995;16:151–168. [Google Scholar]

[br000015] 3.Yoshida H. Construction of higher order symplectic integrators. Phys. Lett. A. 1990;150:262–268. [Google Scholar]

[br000020] 4.McLachlan R., Quispel G. Splitting methods. Acta Numer. 2002;11:341–434. [Google Scholar]

[br000025] 5.Cheng C., Knorr G. The integration of the Vlasov equation in configuration space. J. Comput. Phys. 1976;22:330–351. [Google Scholar]

[br000030] 6.Faragó I. Some notes on the iterative operator splitting. J. Appl. Comput. Math. 2013;2:e129. [Google Scholar]

[br000035] 7.Hairer E., Wanner G. second ed. Springer-Verlag; Berlin: 1996. Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems. [Google Scholar]

[br000040] 8.Knapp C., Kendl A., Koskela A., Ostermann A. Splitting methods for time integration of trajectories in magnetic fields. Phys. Rev. E. 2014 doi: 10.1103/PhysRevE.92.063310. submitted for publication. [DOI] [PubMed] [Google Scholar]

[br000045] 9.Spreiter Q., Walter M. Classical molecular dynamics simulation with the velocity Verlet algorithm at strong external magnetic fields. J. Comput. Phys. 1999;152:102–119. [Google Scholar]

[br000050] 10.Kretzschmar M. Particle motion in a Penning trap. Eur. J. Phys. 2000;12:240. [Google Scholar]

[br000055] 11.Blanchet L. On the two-body problem in general relativity. C. R. Acad. Sci., Paris IV. 2001;22:1343–1352. [Google Scholar]

[br000060] 12.J. Callahan, L. Senechal, D. O’Shea, H. Polachek, K. Hoffman, Calculus in Context, http://www.math.smith.edu/Local/cicintro/book.pdf, 1993.

PERMALINK

An almost symmetric Strang splitting scheme for the construction of high order composition methods^☆

Lukas Einkemmer

Alexander Ostermann

Abstract

1. Introduction

2. An almost symmetric Strang splitting scheme

Definition 1

Theorem 1

Proof

3. Composition methods

Lemma 2

Proof

Corollary 3