Abstract
In this paper we consider splitting methods for nonlinear ordinary differential equations in which one of the (partial) flows that results from the splitting procedure cannot be computed exactly. Instead, we insert a well-chosen state into the corresponding nonlinearity , which results in a linear term whose exact flow can be determined efficiently. Therefore, in the spirit of splitting methods, it is still possible for the numerical simulation to satisfy certain properties of the exact flow. However, Strang splitting is no longer symmetric (even though it is still a second order method) and thus high order composition methods are not easily attainable. We will show that an iterated Strang splitting scheme can be constructed which yields a method that is symmetric up to a given order. This method can then be used to attain high order composition schemes. We will illustrate our theoretical results, up to order six, by conducting numerical experiments for a charged particle in an inhomogeneous electric field, a post-Newtonian computation in celestial mechanics, and a nonlinear population model and show that the methods constructed yield superior efficiency as compared to Strang splitting. For the first example we also perform a comparison with the standard fourth order Runge–Kutta methods and find significant gains in efficiency as well better conservation properties.
Keywords: Splitting methods, Non-symmetric Strang splitting, Approximate partial flows, Nonlinear ordinary differential equations, Application to the sciences
1. Introduction
If an ordinary differential equation can be cast in the form
where the exact solutions of , denoted by , and , denoted by , are known, or can be computed efficiently, splitting methods often provide a viable alternative compared to more traditional integration schemes (such as Runge–Kutta methods). In addition, if the flows generated by and preserve a given property of the ordinary differential equation, so does the splitting scheme. In some instances this can be used to construct schemes which conserve certain properties of the exact flow (see, e.g. [1]). If the exact partial flows are used, the Strang splitting scheme with step size , i.e.
is a symmetric scheme of second order. It is then possible to construct schemes of arbitrary (even) order by composition (see, e.g. [2]). For certain classes of ordinary differential equations more efficient schemes can be constructed (for the example of separable Hamiltonian systems see [3]). For a review of splitting methods we refer the reader to [4].
However, even if one of the partial flows cannot be computed exactly, in some circumstances splitting methods can still be applied. The systems of interest in this paper are ordinary differential equations which can be written as
where, as before, we assume that can be solved exactly. However, no such assumption is made about . Instead, we assume that once a fixed value, say , is substituted, the flow corresponding to
| (1) |
can be computed efficiently. Let us duly note that is still a matrix which is then applied to . We denote the corresponding flow by , which can also be written explicitly by employing the exponential and functions. This yields
where
Note that if we apply the Strang splitting scheme to , a numerical method results that is only of order 1. This is intuitively clear, as is evaluated at the left endpoint only, and can be verified by a simple argument based on the Taylor expansion of the scheme. However, in the literature an alternative scheme has been proposed in the context of partial differential equations (see e.g. [5]) that is usually referred to as Strang splitting also and is given by
Note that since we use an approximation of order 1 to , this is in fact a method of order two. Consistent with the literature we will, from now on, refer to this scheme as Strang splitting.
For a symmetric scheme it must hold that (see, e.g. [1, Chap. II.3])
where denotes the identity. Now
where
Inserting (2) shows that
Therefore, the Strang splitting scheme is symmetric if and only if
which is not satisfied in general.
Due to the lost symmetry, the corresponding triple jump scheme is only of order 3 (not of order 4, as one might naively expect). From this consideration it is also clear that further composition in the same manner does not result in schemes of arbitrary order (as is the case with the classical Strang splitting method based on exact flows).
In Section 2, we will propose a modified Strang splitting scheme that, in addition of being second order accurate, can be iterated to give a scheme that is symmetric up to a predetermined order (as made precise in Definition 1). Therefore, the usual construction of composition methods of arbitrary (even) order can be accomplished in this context (this is shown in Section 3). In Section 4 we show that for certain stiff problems the schemes constructed in this paper can be employed as well. In addition, we discuss in some detail the numerical results for three examples (namely for a charged particle in an inhomogeneous electric field, a post-Newtonian computation of celestial mechanics, and a nonlinear population model) in Section 5. Finally, we conclude in Section 6.
2. An almost symmetric Strang splitting scheme
Let us start from the Lie splitting scheme
| (3) |
We recall that the adjoint of a scheme , which we denote by , is defined as . Therefore, to give a representation of the adjoint scheme corresponding to (3) we interchange with and with . This yields
| (4) |
i.e. . Now the (implicit) Strang splitting scheme
is of second order and symmetric by construction (see, e.g. [1]). However, it would require the solution of an implicit equation in each step, which is prohibitively expensive. Since Eq. (4) has the form of a fixed-point problem, we can employ fixed-point iteration to approximate . We will denote the resulting scheme by , where is the number of iterations that are conducted. Note that during the iteration is fixed; that is, only the two evolution operators given explicitly in Eq. (4) are applied at each step in the fixed-point iteration. As an initial value for the fixed-point iteration we employ (however, any approximation of order to would constitute a possible choice) and therefore
Clearly we need at least two iterations such that the scheme is of second order. There is no hope that is symmetric. However, we will show that it is almost symmetric as defined below.
Before proceeding, let us note that we perform the iteration in the context of the Strang splitting scheme (a method of order two) only. More specifically, the order of the method is not raised by performing the iteration (for ). This is not the case for the iterative operator splitting (as described in [6], for example). Furthermore, the method described in this paper only requires an algorithm to efficiently compute the solution of (1); it neither makes such an assumption for the nonlinear problem nor does it introduces an additional inhomogeneous part (as is the case in [6] and references therein).
Definition 1
A one-step method is symmetric of order if
(5) where is the adjoint method of .
Next let us show that the fixed-point iteration described above actually yields a scheme that is symmetric of order .
Theorem 1
Suppose that is Lipschitz continuous. Then the Strang splitting scheme is symmetric of order .
Proof
We have to show that the fixed-point problem (4), i.e., with
has a unique solution in a sufficiently small neighborhood of . First note that there exists a constant such that
for sufficiently small. Now, let and denote by the closed ball with center and radius . Then, for all it holds
(6) where denotes the Lipschitz constant of on the bounded set
Further note that
(7) which is a direct consequence of the variation-of-constants formula (note that the constant does depend only on the norm of and ). By combining the bounds (6), (7) with the Lipschitz continuity of , we obtain that is Lipschitz continuous on with a Lipschitz constant of order . Moreover, using the triangle inequality we get the bound
for all and sufficiently small. This shows that maps the closed ball onto itself. Consequently, by Banach’s fixed-point theorem, has a unique fixed-point in , which is the locally unique solution of (4).
Since and , we also obtain that
Moreover, as the Lipschitz constant of is of order this implies
(8) Also recall that is a symmetric scheme by construction. Thus, (8) proves the desired result. □
Thus, we have established that we can iteratively compute a second order method that is symmetric to arbitrary order. Moreover, the computational effort is linear in the desired order of symmetry.
In the next section we will discuss how the scheme described here can be used to construct composition methods of arbitrary (even) order.
3. Composition methods
It is well-known (see, e.g. [1, Chap. II.4]) that if a symmetric one-step method of even order is composed in the following manner
| (9) |
where
| (10) |
then a one-step method of order results. Thus, we can construct methods of arbitrary even order , where the cost, in terms of a single evaluation of the corresponding second order method, is given by . For , for example, the corresponding method is the well-known triple jump scheme.
The justification for this procedure is given by Theorem 4.1 in [1]. We will now generalize that result for methods that are (only) symmetric of order .
Lemma 2
Suppose that the one-step method is of odd order and symmetric of order with , see (5). Then, the method is in fact of order .
Proof
Let us denote the exact flow by . Since the method has order ,
and further the adjoint method satisfies
Using now assumption (5), with odd, we get
and thus if . Therefore, we deduce that is of order . □
As a corollary we get the desired order for the composition methods as well as the number of iterations we have to perform. As we will see in Section 5.1 this is a worst case estimate that can be improved upon for some applications.
Corollary 3
The composition method constructed from by using compositions, as described in Eq. (9), results in a scheme of order if .
Proof
If then is of order 2 by construction. Thus, the composition method is at least of order 3. However, from Lemma 2 we know that if this method is in fact of order 4. Since the composition given in (9) is symmetric, a method which is symmetric of order retains this property if composed in the manner described. Therefore, we can complete the proof by induction. □
Before we turn our attention to the applications given in the next section, let us investigate the (worst case) computational cost of the composition methods considered in this section. In Table 1 the number of computations of either or is given for the triple jump scheme as well as the composition of the triple jump scheme (which we call composite 9). In addition, Table 1 lists an abbreviation of all the schemes discussed (which we will employ heavily in the next section). The methods constructed here will be referred to as iterated.
Table 1.
The effort in number of (possibly approximated) partial flows that have to be computed is listed for a number of composition schemes. In addition, the abbreviations used for the composition methods employed in the next section are given.
| Method | Abbreviation | Order | Iterations | Effort |
|---|---|---|---|---|
| Strang (2) | S | 2 | – | 4 |
| Iterated Strang | IS | 2 | 2 | 6 |
| Triple jump | TJ | 3 | – | 12 |
| Iterated triple jump | ITJ | 4 | 4 | 30 |
| Composite 9 | C9 | 3 | – | 36 |
| Iterated composite 9 | IC9 | 6 | 6 | 126 |
Note that even though the high order methods given in Table 1 are about three times as costly as conventional composition methods (which are employed for separable Hamiltonian systems, for example), we are now able to construct methods of arbitrary (even) order. We will show in Section 5 that for realistic problems this can still result in a considerable gain in performance (as compared to the more commonly employed Strang splitting scheme, for example).
In addition, it should be duly noted that similar to conventional composition methods, the schemes introduced here conserve all invariants that are invariants of the two partial flows as well. To conclude this section, let us remark that the schemes introduced here do not require any modification in the code used to implement the numerical solution of the partial flows. That is, if for a given problem the Lie or Strang splitting scheme is already implemented, the generalization to the methods discussed here is almost immediate.
4. Extension to stiff problems
In this section we show that in certain circumstances we can extend our analysis to the stiff case. Let us consider, for example, an ordinary differential equation for which the operator , as defined in Section 1, can be written as
| (11) |
i.e., the nonlinear operator can be split in a stiff linear part and a non-stiff nonlinear part. In this case we can show that the speed of convergence of the fixed-point iteration in Theorem 1 (see Section 2) is independent of the stiff part. This is the content of the following corollary.
Corollary 4
Suppose that can be cast into the form (11) with Lipschitz continuous. Then the Strang splitting scheme is symmetric of order and the error of the method can be estimated independently of .
Proof
We employ the variation-of-constants formula to get
which allows us to estimate
where depends on and but not on .
The proof is completed by employing the same arguments used in the proof of Theorem 1. □
Therefore, we have shown that, in the situation described, the step size can be chosen independently of the stiff part of the problem. This is of interest in some applications, where the inclusion of in the operator would result in partial flows that are more difficult to compute, or where a conservation property of the problem under consideration is destroyed if is treated separately from the nonlinearity .
To conclude this section let us briefly discuss the Brusselator (which is described in [7, Chap. IV.1]). In this case we have a discretized diffusion–reaction equation, where the flow corresponding to can be computed very efficiently by employing fast Fourier transform techniques. The remaining stiffness in the system is then only due to the linear part of the flow corresponding to . In addition, an analytical expression of the partial flow corresponding to the nonlinearity is not easily attainable (due to the coupling of the equations involved). Therefore, the problem is of the form considered in this section. The implementation and analysis of such methods in the context of partial differential equations is the subject of further research.
5. Applications
In this section, we discuss three applications of the schemes constructed in the previous sections. First, we consider a Hamiltonian system that describes the movement of a charged particle in an inhomogeneous electromagnetic field. This system will turn out to require fewer iterations for a desired order of symmetry as compared to the worst case described in Section 3. Second, we consider a post-Newtonian approximation to the relativistic Kepler problem. Also in this case we will observe that fewer iterations are necessary, compared to the worst case, to construct schemes of order four and six. Third, a nonlinear population model is considered. This model, in fact, exhibits the worst case behavior as outlined in Section 3. Nevertheless, we can show, by conducting numerical experiments, that in all three examples the use of high order methods results in a significant performance increase.
In all the simulations conducted, we compute a reference solution by using the (classic) Strang splitting scheme and a sufficiently small (experimentally determined) step size.
5.1. A charged particle in an inhomogeneous magnetic field
The equations of motion of a charged particle in an external electromagnetic field are given by the Lorentz force law
where are the particle’s position, velocity, and charge, respectively; the electric field is denoted by and the magnetic field by (both can depend on the position of the particle under consideration, i.e. on ). This differential equation can be reformulated as a Hamiltonian system with Hamiltonian
where the electric potential is related to the electric field by . We should note that the momentum used above is not the conjugate variable to the position (as would be the case in the electrostatic limit).
The equations of motion in this framework are then given by
where
with (see e.g. [8]). To set up the splitting, we use
and therefore
whereas the second partial flow can be computed exactly once we substitute (and thus consider to be constant). The analytic expression is given by
| (12) |
For actual computations we can use
and
where both and depend on ; this dependence is, for the sake of brevity, omitted in the notation used. Thus, we have fulfilled all the requirements outlined in Section 1. Note that for a uniform magnetic field a number of symmetric second order schemes are available (see, e.g. [9]). However, for non-uniform magnetic fields such schemes cannot be employed to get higher order schemes by composition.
Let us now discuss a peculiarity of the system under consideration. As does only depend on the position component of the phase space and the evolution operator does not depend on the momentum (which is a consequence of the specific splitting conducted here), we have
That is, the Lipschitz constant for our fixed-point iteration is of order . However, such a result is not entirely unexpected as it is quite common that the position is integrated in time with a higher order than the momentum component (this is also true for the popular leapfrog scheme, for example). Therefore, any resulting approximation to is of order , for some , in position and, as we can easily deduce from Eq. (12), the momentum is then approximated up to order . Therefore, to use the notation from Section 2 we have a symmetric scheme of order .
Thus, three iterations are sufficient to get a fourth order scheme whereas four iterations suffice to get a sixth order scheme. This is clearly below the worst case behavior discussed in Section 3.
We now turn our attention to the presentation of the numerical simulations conducted. As an example we will use an electric field configuration that corresponds to an ideal Penning trap (such as described in [10]). However, we will use a magnetic field that is not homogeneous in space. Further we will use natural units for the problem, i.e., and are set to unity. For the ideal Penning trap the electric potential is given by
In order to impose an inhomogeneous magnetic field, we use
We consider an initial value in both position as well as momentum close to zero and evolve the system until time . In Fig. 1 we show that the numerical experiments match the expected order for the splitting schemes discussed in Section 3. Note, however, that for the composite 9 scheme only three iterations are required to reach order six (instead of the four predicted above). This is a clear indication at the presence of further simplifications (in the system under consideration). Now let us turn our attention to run time considerations. In Fig. 2 the run time is plotted against the achieved accuracy. It is clear from the figure that even for moderate precision requirements, high order methods provide a significant advantage over the more commonly employed Strang splitting scheme.
Fig. 1.
Order plot for a charged particle in an inhomogeneous magnetic field (the results for various splitting schemes are shown). The lines drawn are, from top to bottom, of slope 2, 3, 3, 4, and 6 respectively. The abbreviations for the different numerical schemes are listed in Table 1.
Fig. 2.
Run time as a function of the achieved accuracy for a charged particle in an inhomogeneous magnetic field (the results for various splitting schemes are shown). The abbreviations for the different numerical schemes are listed in Table 1. For comparison, the standard Runge–Kutta scheme of order four (RK4) is also shown.
The system under consideration is Hamiltonian; therefore, the energy is exactly conserved. This is, in general, no longer true if a numerical scheme is considered. However, schemes can be engineered which, to machine precision, conserve the energy. It is clear that this is not true in this case as the partial flows do not conserve the energy (Fig. 3 confirms this behavior). However, the error in energy is still four orders of magnitude below the integration error made by the scheme under consideration.
Fig. 3.
Energy conservation for a charged particle in an inhomogeneous magnetic field, where the iterated triple jump scheme () with is employed. This results in an error (in the infinity norm) in the position/momentum that is approximately 3⋅10−4. For comparison, the standard Runge–Kutta method of order four is shown. There the step size is chosen, which results in a comparable accuracy and twice the run time. Note, however, that the error in energy is better by an order of magnitude for the iterated triple jump scheme.
To end this section let us note that the discussion here can easily be generalized to multiple particles. This is still true if particle–particle interactions (via the electric or magnetic field, for example) are considered.
5.2. Post-Newtonian Kepler problem
As a second example we consider the post-Newtonian1 approximation to the (general) relativistic -body problem. In this section we will limit ourselves to the relativistic Kepler problem in the Post-Newtonian approximation up to terms of order , where denotes the speed of light. The equations of motions for the first body are then given by (see, e.g. [11])
where , , and is the standard gravitational parameter (which can be computed from the gravitational constant and the mass of the body ). The equations of motion for the second body can then be determined by interchanging the indices corresponding to the first and the second body in the equations of motion stated above. Let us note that the Newtonian equations of motion are recovered in the limit as (in this case only the first force term remains). The structure of the equations of motion naturally lends itself to the splitting scheme described in Section 1. To that end let us define
and
where
The corresponding quantities and can once again be obtained by reversing the indices corresponding to the first and second body. It is clear that the flows corresponding to both and , as defined above, can be computed efficiently.
In the subsequent discussion, we will employ the SI system of units (for convenience we will not state the units explicitly). Let us consider the orbit of two celestial objects with , i.e., approximately solar masses, and . We initialize the first body with zero velocity and the second one with and place it at the perihelion of the orbit which we determine to be , i.e., a mercury like orbit. We integrate the equations of motion up to the final time , which corresponds to about fifteen orbits. The order plots for a number of schemes are shown in Fig. 4.
Fig. 4.
Order plots for a post-Newtonian Kepler problem (the results for various splitting schemes are shown). The lines drawn are, from top to bottom, of slope 2, 3, 3, 4, and 6 respectively. The error is scaled to the perihelion (the point of least distance between the two bodies) of the orbit. The abbreviations for the different numerical schemes are listed in Table 1.
The number of iterations necessary for the iterated triple jump scheme (ITJ) as well as the iterated composite 9 scheme (IC9) have been determined by conducting numerical experiments. A theoretical analysis is beyond the scope of this paper. Note, however, that similar to the previous example we do not observe the worst case behavior described in Section 3.
In addition, let us investigate the run time as a function of the error. This is shown in Fig. 5. As is apparent from the figure, the fourth order iterated triple jump scheme (ITJ) is superior to both the third order triple jump scheme and the Strang splitting scheme. For medium accuracy requirement it becomes advantageous to employ the sixth order IC9 scheme.
Fig. 5.
Run time as a function of the achieved accuracy for a post-Newtonian Kepler problem (the results for various splitting schemes are shown). The error is scaled to the perihelion of the orbit. The abbreviations for the different numerical schemes are listed in Table 1.
5.3. A nonlinear population model
As a third example, we consider a nonlinear population model (the so-called May model) that is given by
where in line with [12] we use , , , , , and . In this context, is interpreted as a (appropriately scaled) prey population while represents the predator population. We can argue that such an equation lends itself to splitting as if interaction effects are neglected we are usually left with either an exponential growth model or a logistic equation in each variable. In fact, this is the case for the equation stated above, since the decoupled system can be written as
of which an analytical solution can easily be found; it is given by
To complete our splitting scheme, we set
which once again is exactly the situation described in Section 1.
One might rightfully object that our splitting approach is somewhat artificial as we can simply add the operator to the operator. After all, the resulting operator still has the desired form and splitting would not be necessary. The only potential advantage of using the splitting scheme is that we can solve the flow corresponding to exactly. Although numerical experiments demonstrate that this can result in a significant increase in performance, the goal of this section is to show that the number of iterations given in Section 3 constitutes a sharp bound.
To investigate that behavior let us choose the initial values and , i.e., the prey population is significantly larger than the predator population. In Fig. 6 we plot the run time as a function of the error for a number of schemes discussed so far (we integrate up to ).
Fig. 6.
Run time as a function of the achieved accuracy for the May model. The abbreviations for the different numerical schemes are listed in Table 1.
It is also clear that contrary to the example discussed in the previous section, high order schemes (beyond triple jump) are only advantageous if very high precision is needed; however, this behavior is not surprising as the solution approaches a steady state quite rapidly.
Therefore, let us now turn our attention to the number of iterations necessary to obtain a given order. In Fig. 7 we can clearly see that the behavior described in Section 3 is regained. Thus, the system under consideration does not possess the simplifying property we discussed in Section 5.1 for the charged particle and in Section 5.2 for the post-Newtonian approximation. We can also conclude that the number of iterations given in Section 3 constitutes a sharp bound.
Fig. 7.
Order plot for the May model (the results for various splitting schemes are shown). The lines drawn are, from top to bottom and left to right, of slope 2, 3, 3, 4, 5, and 6 respectively. The abbreviations for the different numerical schemes are listed in Table 1.
As in the previous section we note that generalizations, for example the inclusion of multiple predator species, can be easily accomplished in the context of the schemes discussed.
6. Conclusion
Besides providing a theoretical analysis, we have conducted numerical simulations that demonstrate the applicability of composition schemes to three examples of interest in the sciences. In all of these examples we have demonstrated that, depending on the accuracy requirement, the high order schemes constructed in this paper can provide significant gains in performance compared to Strang splitting. For a charged particle in an inhomogeneous field, we have also demonstrated increased efficiency as well as better conservation properties as compared to the standard fourth order Runge–Kutta method.
Footnotes
This work is supported by the Austrian Science Fund (FWF)—project id: P25346.
The equations of motion in the post-Newtonian approximation are determined by expanding the field equations of general relativity for point objects in powers of .
Contributor Information
Lukas Einkemmer, Email: lukas.einkemmer@uibk.ac.at.
Alexander Ostermann, Email: alexander.ostermann@uibk.ac.at.
References
- 1.Hairer E., Lubich C., Wanner G. Springer-Verlag; Berlin, Heidelberg: 2006. Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations. [Google Scholar]
- 2.McLachlan R. On the numerical integration of ordinary differential equations by symmetric composition methods. SIAM J. Sci. Comput. 1995;16:151–168. [Google Scholar]
- 3.Yoshida H. Construction of higher order symplectic integrators. Phys. Lett. A. 1990;150:262–268. [Google Scholar]
- 4.McLachlan R., Quispel G. Splitting methods. Acta Numer. 2002;11:341–434. [Google Scholar]
- 5.Cheng C., Knorr G. The integration of the Vlasov equation in configuration space. J. Comput. Phys. 1976;22:330–351. [Google Scholar]
- 6.Faragó I. Some notes on the iterative operator splitting. J. Appl. Comput. Math. 2013;2:e129. [Google Scholar]
- 7.Hairer E., Wanner G. second ed. Springer-Verlag; Berlin: 1996. Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems. [Google Scholar]
- 8.Knapp C., Kendl A., Koskela A., Ostermann A. Splitting methods for time integration of trajectories in magnetic fields. Phys. Rev. E. 2014 doi: 10.1103/PhysRevE.92.063310. submitted for publication. [DOI] [PubMed] [Google Scholar]
- 9.Spreiter Q., Walter M. Classical molecular dynamics simulation with the velocity Verlet algorithm at strong external magnetic fields. J. Comput. Phys. 1999;152:102–119. [Google Scholar]
- 10.Kretzschmar M. Particle motion in a Penning trap. Eur. J. Phys. 2000;12:240. [Google Scholar]
- 11.Blanchet L. On the two-body problem in general relativity. C. R. Acad. Sci., Paris IV. 2001;22:1343–1352. [Google Scholar]
- 12.J. Callahan, L. Senechal, D. O’Shea, H. Polachek, K. Hoffman, Calculus in Context, http://www.math.smith.edu/Local/cicintro/book.pdf, 1993.







