Skip to main content
Springer logoLink to Springer
. 2016 Aug 8;136(1):27–73. doi: 10.1007/s00211-016-0833-y

Discontinuous Galerkin methods for nonlinear scalar hyperbolic conservation laws: divided difference estimates and accuracy enhancement

Xiong Meng 1,2, Jennifer K Ryan 1,
PMCID: PMC5445630  PMID: 28615748

Abstract

In this paper, an analysis of the accuracy-enhancement for the discontinuous Galerkin (DG) method applied to one-dimensional scalar nonlinear hyperbolic conservation laws is carried out. This requires analyzing the divided difference of the errors for the DG solution. We therefore first prove that the α-th order (1αk+1) divided difference of the DG error in the L2 norm is of order k+32-α2 when upwind fluxes are used, under the condition that |f(u)| possesses a uniform positive lower bound. By the duality argument, we then derive superconvergence results of order 2k+32-α2 in the negative-order norm, demonstrating that it is possible to extend the Smoothness-Increasing Accuracy-Conserving filter to nonlinear conservation laws to obtain at least (32k+1)th order superconvergence for post-processed solutions. As a by-product, for variable coefficient hyperbolic equations, we provide an explicit proof for optimal convergence results of order k+1 in the L2 norm for the divided differences of DG errors and thus (2k+1)th order superconvergence in negative-order norm holds. Numerical experiments are given that confirm the theoretical results.

Mathematics Subject Classification: 65M60, 65M12, 65M15

Introduction

In this paper, we study the accuracy-enhancement of semi-discrete discontinuous Galerkin (DG) methods for solving one-dimensional scalar conservation laws

ut+f(u)x=0,(x,t)(a,b)×(0,T], 1.1a
u(x,0)=u0(x),xΩ=(a,b), 1.1b

where u0(x) is a given smooth function. We assume that the nonlinear flux function f(u) is sufficiently smooth with respect to the variable u and the exact solution is also smooth. For the sake of simplicity and ease in presentation, we only consider periodic boundary conditions. We show that the α-th order (1αk+1) divided difference of the DG error in the L2 norm achieves (k+32-α2)th order when upwind fluxes are used, under the condition that |f(u)| possesses a uniform positive lower bound. By using a duality argument, we then derive superconvergence results of order 2k+32-α2 in the negative-order norm. This allows us to demonstrate that it is possible to extend the post-processing technique to nonlinear conservation laws to obtain at least (32k+1)th order of accuracy. In addition, for variable coefficient hyperbolic equations that have been discussed in [19], we provide an explicit proof for optimal error estimates of order k+1 in the L2 norm for the divided differences of the DG errors and thus 2k+1 in the negative-order norm.

Various superconvergence properties of DG methods have been studied in the past two decades, which not only provide a deeper understanding about DG solutions but are useful for practitioners. According to different measurements of the error, the superconvergence of DG methods is mainly divided into three categories. The first one is superconvergence of the DG error at Radau points, which is typically measured in the discrete L2 norm and is useful to resolve waves. The second one is superconvergence of the DG solution towards a particular projection of the exact solution (supercloseness) measured in the standard L2 norm, which lays a firm theoretical foundation for the excellent behaviour of DG methods for long-time simulations as well as adaptive computations. The last one is the superconvergence of post-processed solution by establishing negative-order norm error estimates, which enables us to obtain a higher order approximation by simply post-processing the DG solution with a specially designed kernel at the very end of the computation. In what follows, we shall review some superconvergence results for the aforementioned three properties and restrict ourselves to hyperbolic equations to save space. For superconvergence of DG methods for other types of PDEs, we refer to [21].

Let us briefly mention some superconvergence results related to the Radau points and supercloseness of DG methods for hyperbolic equations. Adjerid and Baccouch [13] studied the superconvergence property as well as the a posteriori error estimates of the DG methods for one- and two-dimensional linear steady-state hyperbolic equations, in which superconvergence of order k+2 and 2k+1 are obtained at downwind-biased Radau points and downwind points, respectively. Here and below, k is the highest polynomial degree of the discontinuous finite element space. For time-dependent linear hyperbolic equations, Cheng and Shu [9] investigated supercloseness for linear hyperbolic equations, and they obtained superconvergence of order k+32 towards a particular projection of the exact solution, by virtue of construction and analysis of the so-called generalized slopes. Later, by using a duality argument, Yang and Shu [24] proved superconvergence results of order k+2 of the DG error at downwind-biased points as well as cell averages, which imply a sharp (k+2)th order supercloseness result. By constructing a special correction function and choosing a suitable initial discretization, Cao et al. [7] established a supercloseness result towards a newly designed interpolation function. Further, based on this supercloseness result, for the DG error they proved the (2k+1)th order superconvergence at the downwind points as well as domain average, (k+2)-th order superconvergence at the downwind-biased Radau points, and superconvergent rate k+1 for the derivative at interior Radau points. We would like to remark that the work of [7, 24] somewhat indicates the possible link between supercloseness and superconvergence at Radau points. For time-dependent nonlinear hyperbolic equations, Meng et al. [18] proved a supercloseness result of order k+32. For superconvergent posteriori error estimates of spatial derivative of DG error for nonlinear hyperbolic equations, see [4].

Let us now mention in particular some superconvergence results of DG methods regarding negative-order norm estimates and post-processing for hyperbolic equations. The basic idea of post-processing is to convolve the numerical solution by a local averaging operator with the goal of obtaining a better approximation and typically of a higher order. Motivated by the work of Bramble and Schatz in the framework of continuous Galerkin methods for elliptic equations [5], Cockburn et al. [11] established the theory of post-processing techniques for DG methods for hyperbolic equations by the aid of negative-order norm estimates. The extension of this post-processing technique was later fully studied by Ryan et al. in different aspects of problems, e.g. for general boundary condition [20], for nonuniform meshes [13] and for applications in improving the visualization of streamlines [22] in which it is labeled as a Smoothness-Increasing Accuracy-Conserving (SIAC) filter. For the extension of the SIAC filter to linear convection-diffusion equations, see [15].

By the post-processing theory [5, 11], it is well known that negative-order norm estimates of divided differences of the DG error are important tools to derive superconvergent error estimates of the post-processed solution in the L2 norm. Note that for purely linear equations [11, 15], once negative-order norm estimates of the DG error itself are obtained, they trivially imply negative-order norm estimates for the divided differences of the DG error. However, the above framework is no longer valid for variable coefficient or nonlinear equations. In this case, in order to derive superconvergent estimates about the post-processed solution, both the L2 norm and negative-order norm error estimates of divided differences should be established. In particular, for variable coefficient hyperbolic equations, although negative-order norm error estimates of divided differences are given in [19], the corresponding L2 norm estimates are not provided. For nonlinear hyperbolic conservation laws, Ji et al. [16] showed negative-order norm estimates for the DG error itself, leaving the estimates of divided differences for future work.

For nonlinear hyperbolic equations under consideration in this paper, it is therefore important and interesting to address the above issues by establishing both the L2 norm and negative-order norm error estimates for the divided differences. The major part of this paper is to show L2 norm error estimates for divided differences, which are helpful for us to obtain a higher order of accuracy in the negative-order norm and thus the superconvergence of the post-processed solutions. We remark that it requires |f(u)| having a uniform positive lower bound due to the technicality of the proof. The generalization from purely linear problems [11, 15] to nonlinear hyperbolic equations in this paper involves several technical difficulties. One of these is how to establish important relations between the spatial derivatives and time derivatives of a particular projection of divided differences of DG errors. Even if the spatial derivatives of the error are switched to their time derivatives, it is still difficult to analyze the time derivatives of the error; for more details, see Sect. 3.2 and also the appendix. Another important technicality is how to construct a suitable dual problem for the divided difference of the nonlinear hyperbolic equations. However, it seems that it is not trivial for the two-dimensional extension, especially for establishing the relations between spatial derivatives and time derivatives of the errors. The main tool employed in deriving L2 norm error estimates for the divided differences is an energy analysis. To deal with the nonlinearity of the flux functions, Taylor expansion is used following a standard technique in error estimates for nonlinear problems [25]. We would like to remark that the superconvergence analysis in this paper indicates a possible link between supercloseness and negative-order norm estimates.

This paper is organized as follows. In Sect. 2, we give the DG scheme for divided differences of nonlinear hyperbolic equations, and present some preliminaries about the discontinuous finite element space. In Sect. 3, we state and discuss the L2 norm error estimates for divided differences of nonlinear hyperbolic equations, and then display the main proofs followed by discussion of variable coefficient hyperbolic equations. Section 4 is devoted to the accuracy-enhancement superconvergence analysis based on negative-order norm error estimates of divided differences. In Sect. 5, numerical experiments are shown to demonstrate the theoretical results. Concluding remarks and comments on future work are given in Sect. 6. Finally, in the appendix we provide the proofs for some of the more technical lemmas.

The DG scheme and some preliminaries

The DG scheme

In this section, we follow [11, 12] and present the DG scheme for divided differences of the problem (1.1).

Let a=x12<x32<<xN+12=b be a partition of Ω=(a,b), and set xj=(xj-12+xj+12)/2. Since we are focused on error analysis of both the L2 norm and the negative-order norm for divided differences of the DG solution and the problem under consideration is assumed to be periodic, we shall introduce two overlapping uniform (translation invariant) meshes for Ω, namely Ij=(xj-12,xj+12) and Ij+12=(xj,xj+1) with mesh size h=xj+12-xj-12. Associated with these meshes, we define the discontinuous finite element space

Vhα=v:v|IjPk(Ij),j=j+2,=αmod2,j=1,,N,

where Pk(Ij) denotes the set of polynomials of degree up to k defined on the cell Ij:=(xj-12,xj+12). Here and below, α represents the α-th order divided difference of a given function, whose definition is given in (2.6a). Clearly, Vhα is a piecewise polynomial space on mesh Ij=Ij for even α (including α=0) and a piecewise polynomial space on mesh Ij=Ij+12 for odd α of the DG scheme. For simplicity, for even α, we would like to use Vh to denote the standard finite element space of degree k defined on the cell Ij, if there is no confusion. Since functions in Vhα may have discontinuities across element interfaces, we denote by wi- and wi+ the values of w(x) at the discontinuity point xi from the left cell and the right cell, respectively. Moreover, we use [[w]]=w+-w- and {{w}}=12(w++w-) to represent the jump and the mean of w(x) at each element boundary point.

The α-th order divided difference of the nonlinear hyperbolic conservation law is

hαut+hαf(u)x=0,(x,t)Ωα×(0,T], 2.1a
hαu(x,0)=hαu0(x),xΩα, 2.1b

where Ωα=(a+2h,b+2h) with =αmod2. Clearly, (2.1) reduces to (1.1) when α=0. Then the approximation of the semi-discrete DG method for solving (2.1) becomes: find the unique function uh=uh(t)Vh (and thus hαuhVhα) such that the weak formulation

((hαuh)t,vh)j=Hj(hαf(uh),vh) 2.2

holds for all vhVhα and all j=1,,N. Note that, by (2.6a), for any xIj and t, hαuh(x,t) is a linear combination of the values of uh at α+1 equally spaced points of length h, namely x-α2h,,x+α2h. Here and in what follows, ·,·j denotes the usual inner product in L2(Ij), and Hj·,· is the DG spatial discretization operator defined on each cell Ij=(xj-12,xj+12), namely

Hjw,v=w,vxj-w^v-j+12+w^v+j-12.

We point out that in order to obtain a useful bound for the L2 norm error estimates of divided differences, the numerical flux f^j+12 is chosen to be an upwind flux, for example, the well-known Godunov flux. Moreover, the analysis requires a condition that |f(u)| has a uniform positive lower bound. Without loss of generality, throughout the paper, we only consider f(u)δ>0, and thus w^=w-. Therefore,

Hjw,v=w,vxj-w-v-|j+12+w-v+|j-12 2.3a
=-wx,vj-([[w]]v+)j-12. 2.3b

For periodic boundary conditions under consideration in this paper, we use H to denote the summation of Hj with respect to cell Ij, that is

H(w,v)=w,vx+j=1N(w-[[v]])j+12 2.4a
=-wx,v-j=1N([[w]]v+)j-12, 2.4b

where w,v=j=1Nw,vj represents the inner product in L2(Ωα). Note that we have used the summation with respect to j instead of j to distinguish two overlapping meshes, since j=j for even α and j=j+12 for odd α.

Preliminaries

We will adopt the following convention for different constants. We denote by C a positive constant independent of h but may depend on the exact solution of the Eq. (2.1), which could have a different value in each occurrence. To emphasize the nonlinearity of the flux f(u), we employ C to denote a nonnegative constant depending solely on the maximum of a high order derivative |fm| (m2). We remark that C=0 for a linear flux function f(u)=cu or a variable coefficient flux function f(u)=a(x)u, where c is a constant and a(x) is a given smooth function.

Prior to analyzing the L2 norm and the negative-order norm error estimates of divided differences, let us present some notation, definitions, properties of DG discretization operator, and basic properties about SIAC filters. These preliminary results will be used later in the proof of superconvergence property.

Sobolev spaces and norms

We adopt standard notation for Sobolev spaces. For any integer s0, we denote by Ws,p(D) the Sobolev space on subdomain DΩ equipped with the norm ·s,p,D. In particular, if p=2, we set Ws,p(D)=Hs(D), and ·s,p,D=·s,D, and further if s=0, we set ·s,D=·D. Throughout the paper, when D=Ω, we will omit the index D for convenience. Furthermore, the norms of the broken Sobolev spaces Ws,p(Ωh):={uL2(Ω):u|DWs,p(D),DΩ} with Ωh being the union of all cells can be defined analogously. The Bochner space can also be easily defined. For example, the space L1([0,T];Hs(D)) is equipped with the norm ·L1([0,T];Hs(D))=0T·s,Ddt.

Additionally, we denote by ·Γh the standard L2 norm on the cell interfaces of the mesh Ij. Specifically, for the one-dimensional case under consideration in this paper, vΓh2=j=1NvIj2 with vIj=((vj-1/2+)2+(vj+1/2-)2)12. To simplify notation in our later analysis, following [23], we would like to introduce the so-called jump seminorm [v]=(j=1N[[v]]j-122)12 for vH1(Ωh).

In the post-processing framework, it is useful to consider the negative-order norm, defined as: Given >0 and domain Ω,

v-,Ω=supΦC0(Ω)v,ΦΦ. 2.5

Properties for divided differences

For any function w and integer γ, the following standard notation of central divided difference is used

hγw(x)=1hγi=0γ(-1)iγiwx+γ2-ih. 2.6a

Note that the above notation is still valid even if w is a piecewise function with possible discontinuities at cell interfaces. In later analysis, we will repeatedly use the properties about divided differences, which are given as follows. For any functions w and v

hγw(x)v(x)=i=0γγihiwx+γ-i2hhγ-ivx-i2h, 2.6b

which is the so-called Leibniz rule for the divided difference. Moreover, for sufficiently smooth functions w(x), by using Taylor expansion with integral form of the remainder, we can easily verify that hγw is a second order approximation to xγw, namely

hγw(x)=xγw(x)+Cγh2ψγ(x), 2.6c

where Cγ is a positive constant and ψγ is a smooth function; for example, Cγ=1/8,1,3/32 for γ=1,2,3, and

ψγ(x)=1(γ+1)!01xγ+2wx+γ2hs+xγ+2wx-γ2hs(1-s)γ+1ds.

Here and below, xγ(·) denotes the γ-th order partial derivative of a function with respect to the variable x; likewise for tγ(·). The last identity is the so-called summation by parts, namely

hγw(x),v(x)=(-1)γw(x),hγv(x). 2.6d

In addition to the properties of divided differences for a single function w(x), the properties of divided differences for a composition of two or more functions are also needed. However, we would like to emphasize that properties (2.6a), (2.6b), (2.6d) are always valid whether w is a single function or w is a composition of two or more functions. As an extension from the single function case in (2.6c) to the composite function case, the following property (2.6e) is subtle, it requires a more delicate argument for composite functions. Without loss of generality, if w is the composition of two smooth functions r and u, i.e., w(x):=r(u(x)), we can prove the following identity

hγr(u(x))=xγr(u(x))+CγhΨγ(x), 2.6e

where Cγ is a positive constant and Ψγ is a smooth function. We can see that, unlike (2.6c), the divided difference of a composite function is a first order approximation to its derivative of the same order. This finding, however, is sufficient in our analysis; see Corollary 1.

It is worth pointing out that in (2.6e), xγr(u(x)) and hγr(u(x)) should be understood in the sense of the chain rule for high order derivatives and divided differences of composite functions, respectively. In what follows, we use f[x0,,xγ] to denote the standard γ-th order Newton divided difference, that is

f[xν]:=f(xν),0νγ,f[xν,,xν+μ]:=f[xν+1,,xν+μ]-f[xν,,xν+μ-1]xν+μ-xν,0νγ-μ,1μγ.

It is easy to verify that

hγr(u(x))=γ!r[x0,,xγ], 2.7

where xi=x+2i-γ2h (0iγ).

For completeness, we shall list the chain rule for the derivatives (the well-known Faà di Bruno’s Formula) and also for the divided differences [14]; it reads

xγr(u(x))=γ!b1!bγ!r()(u(x))xu(x)1!b1xγu(x)γ!bγ,r[x0,,xγ]==1γr[u0,,u]A,γu,

where ui=u(xi), and the sum is over all =1,,γ and non-negative integer solutions b1,,bγ to

b1+2b2++γbγ=γ,b1++bγ=,

and

A,γu==α0α1α=γβ=0-1u[xβ,xαβ,,xαβ+1]

with the sum being over integers α1,,α-1 such that α1α-1γ.

It follows from the mean value theorem for divided differences that

limh0r[x0,,xγ]=xγr(u(x))γ!.

Consequently, by (2.7),

limh0hγr(u(x))=xγr(u(x)).

We are now ready to prove (2.6e) for the relation between the divided difference and the derivative of composite functions. Using a similar argument as that in the proof of (2.6c) for A,γu and the relation that

r[u0,,uγ]=r(γ)(uγ2)γ!+Cγhψ(u0,u1,,uγ),

due to the smoothness of ui and the fact that ui may not necessarily be equally spaced, with uγ2=u(x) and ψ(u0,u1,,uγ) being smooth functions, we can obtain the relation (2.6e).

The inverse and projection properties

Now we list some inverse properties of the finite element space Vhα. For any pVhα, there exists a positive constant C independent of p and h, such that

(i)xpCh-1p;(ii)pΓhCh-1/2p;(iii)pCh-1/2p.

Next, we introduce the standard L2 projection of a function qL2(Ω) into the finite element space Vhk, denoted by Pkq, which is a unique function in Vhk satisfying

q-Pkq,vh=0,vhVhk. 2.8

Note that the proof of accuracy-enhancement of DG solutions for linear equations relies only on an L2 projection of the initial condition [11, 15]. However, for variable coefficient and nonlinear hyperbolic equations, a suitable choice of the initial condition and a superconvergence relation between the spatial derivative and time derivative of a particular projection of the error should be established, since both the L2 norm and negative-order norm error estimates of divided differences need to be analyzed. In what follows, we recall two kinds of Gauss–Radau projections Ph± into Vh following a standard technique in DG analysis [8, 25]. For any given function qH1(Ωh) and an arbitrary element Ij=(xj-12,xj+12), the special Gauss–Radau projection of q, denoted by Ph±q, is the unique function in Vhk satisfying, for each j,

(q-Ph+q,vh)j=0,vhPk-1(Ij),(q-Ph+q)j-12+=0; 2.9a
(q-Ph-q,vh)j=0,vhPk-1(Ij),(q-Ph-q)j+12-=0. 2.9b

We would like to remark that the exact collocation at one of the end points on each cell plus the orthogonality property for polynomials of degree up to k-1 of the Gauss–Radau projections Ph± play an important role and are used repeatedly in the proof. We denote by η=q(x)-Qhq(x)(Qh=Pk or Ph±) the projection error, then by a standard scaling argument [6, 10], it is easy to obtain, for smooth enough q(x), that,

η+hηx+h1/2ηΓhChk+1qk+1. 2.10a

Moreover,

ηChk+1qk+1,. 2.10b

The properties of the DG spatial discretization

To perform the L2 error estimates of divided differences, several properties of the DG operator H are helpful, which are used repeatedly in our proof; see Sect. 3.

Lemma 1

Suppose that r(u(xt)) ( r=f(u),tf(u) etc) is smooth with respect to each variable. Then, for any w,vVhα, there holds the following inequality

H(rw,v)Cw+wx+h-12[w]v, 2.11a

and in particular, if r=f(u)δ>0, there holds

graphic file with name 211_2016_833_Equ23_HTML.gif 2.11b
Proof

Let us first prove (2.11b), which is a straightforward consequence of the definition of H, since, after a simple integration by parts

graphic file with name 211_2016_833_Equ250_HTML.gif

We would like to emphasize that (2.11b) is still valid even if the smooth function r and wVhα depend on different x, e.g. x,x+h2 etc, as only integration by parts as well as the boundedness of r is used here.

To prove (2.11a), we consider the equivalent strong form of H (2.4b). An application of Cauchy–Schwarz inequality and inverse inequality (ii) leads to

H(rw,v)=-rxw,v-rwx,v-j=1N(r[[w]]v+)j-12C(w+wx)v+C[w]vΓhCw+wx+h-12[w]v.

This completes the proof of Lemma 1.

Corollary 1

Under the same conditions as in Lemma 1, we have, for small enough h,

H((hαr)w,v)Cw+wx+h-12[w]v,α0. 2.12
Proof

The case α=0 has been proved in Lemma 1. For general α1, let us start by using the relation (2.6e) for hαr to obtain

H((hαr)w,v)=H((xαr)w,v)+ChH(Ψαw,v)

with C a positive constant and Ψα a smooth function. Next, applying (2.11a) in Lemma 1 to H((xαr)w,v) and H(Ψαw,v), we have for small enough h

H((hαr)w,v)C(1+Ch)w+wx+h-12[w]vCw+wx+h-12[w]v.

This finishes the proof of Corollary 1.

Lemma 2

Suppose that r(u(xt)) is smooth with respect to each variable. Then, for any wHk+1(Ωh) and vVhα, there holds

H(r(w-Ph-w),v)Chk+1v. 2.13
Proof

Using the definition of the projection Ph- (2.9a), we have that (w-Ph-w)j+12-=0, and thus

H(r(w-Ph-w),v)=(r(w-Ph-w),vx).

Next, on each cell Ij, we rewrite r(u(xt)) as r(u)=r(uj)+r(u)-r(uj) with uj=u(xj,t). Clearly, on each element Ij, |r(u)-r(uj)|Ch due to the smoothness of r and u. Using the orthogonality property of Ph- again (2.9b), we arrive at

H(r(w-Ph-w),v)=(r(u)-r(uj))(w-Ph-w),vxChk+1v,

where we have used Cauchy–Schwarz inequality, inverse inequality (i) and the approximation property (2.10a) consecutively.

Corollary 2

Suppose that r(u(xt)) is smooth with respect to each variable. Then, for any wHk+1(Ωh), vVhα, there holds

H(hα(r(w-Ph-w)),v)Chk+1v,α0. 2.14
Proof

The case α=0 has been proved in Lemma 2. For α1, by the Leibniz rule (2.6b) and taking into account the fact that both the divided difference operator h and the projection operator Ph- are linear, we rewrite hα(r(w-Ph-w)) as

hα(r(w-Ph-w))==0ααhrx+α-2hhα-(w-Ph-w)x-2h=0ααrˇwˇ-Ph-wˇ

with

rˇ=hrx+α-2h,wˇ=hα-wx-2h.

Thus,

H(hα(r(w-Ph-w)),v)==0ααH(rˇwˇ-Ph-wˇ,v). 2.15

Clearly, by (2.6e), rˇ is also a smooth function with respect to each variable with leading term xrx+α-2h. To complete the proof, we need only apply the same procedure as that in the proof of Lemma 2 to each H term on the right side of (2.15).

Regularity for the variable coefficient hyperbolic equations

Since the dual problem for the nonlinear hyperbolic equation is a variable coefficient equation, we need to recall a regularity result.

Lemma 3

[16] Consider the variable coefficient hyperbolic equation with a periodic boundary condition for all t[0,T]

φt(x,t)+a(x,t)φx(x,t)=0, 2.16a
φ(x,0)=φ0(x), 2.16b

where a(xt) is a given smooth periodic function. For any 0, fix time t and a(x,t)L([0,T];W2+1,(Ω)), then the solution of (2.16) satisfies the following regularity property

φ(x,t)Cφ(x,0),

where C is a constant depending on aL([0,T];W2+1,(Ω)).

SIAC filters

The SIAC filters are used to extract the hidden accuracy of DG methods, by means of a post-processing technique, which enhances the accuracy and reduces oscillations of the DG errors. The post-processing is a convolution with a kernel function Khν,k+1 that is of compact support and is a linear combination of B-splines, scaled by the uniform mesh size,

Khν,k+1(x)=1hγZcγν,k+1ψ(k+1)xh-γ,

where ψ(k+1) is the B-spline of order k+1 obtained by convolving the characteristic function ψ(1)=χ of the interval (-1/2,1/2) with itself k times. Additionally, the kernel function Khν,k+1 should reproduce polynomials of degree ν-1 by convolution, which is used to determine the weights cγν,k+1. For more details, see [11].

The post-processing theory of SIAC filters is given in the following theorem.

Theorem 1

(Bramble and Schatz [5]) For 0<T<T, where T is the maximal time of existence of the smooth solution, let uL([0,T];Hν(Ω)) be the exact solution of (1.1). Let Ω0+2supp(Khν,k+1(x))Ω and U be any approximation to u, then

u-Khν,k+1UΩ0hνν!C1|u|ν+C1C2αk+1hα(u-U)-(k+1),Ω,

where C1 and C2 depend on Ω0,k, but is independent of h.

L2 norm error estimates for divided differences

By the post-processing theory [5, 11] (also see Theorem 1), it is essential to derive negative-order norm error estimates for divided differences, which depend heavily on their L2 norm estimates. However, for both variable coefficient equations and nonlinear equations, it is highly nontrivial to derive L2 norm error estimates for divided differences, and the technique used to prove convergence results for the DG error itself needs to be significantly changed.

The main results in L2 norm

Let us begin by denoting e=u-uh to be the error between the exact solution and numerical solution. Next, we split it into two parts; one is the projection error, denoted by η=u-Qhu, and the other is the projection of the error, denoted by ξ=Qhu-uh:=QheVhα. Here the projection Qh is defined at each time level t corresponding to the sign variation of f(u); specifically, for any t[0,T] and xΩ, if f(u(x,t))>0 we choose Qh=Ph-, and if f(u(x,t))<0, we take Qh=Ph+.

We are now ready to state the main theorem for the L2 norm error estimates.

Theorem 2

For any 0αk+1, let hαu be the exact solution of Eq. (2.1), which is assumed to be sufficiently smooth with bounded derivatives, and assume that |f(u)| is uniformly lower bounded by a positive constant. Let hαuh be the numerical solution of scheme (2.2) with initial condition hαuh(0)=Qh(hαu0) when the upwind flux is used. For a uniform mesh of Ω=(a,b), if the finite element space Vhα of piecewise polynomials with arbitrary degree k1 is used, then for small enough h and any T>0 there holds the following error estimate

graphic file with name 211_2016_833_Equ30_HTML.gif 3.1

where the positive constant C depends on the u, δ, T and f, but is independent of h.

Corollary 3

Under the same conditions as in Theorem 2, if in addition α1 we have the following error estimates:

hα(u-uh)(T)Chk+32-α2. 3.2

Proof

As shown in Corollary 2, we have that hαη=hαu-Ph-(hαu), and thus

hαηChk+1hαuk+1 3.3

by the approximation error estimate (2.10a). Now, the error estimate (3.2) follows by combining the triangle inequality and (3.1).

Remark 1

Clearly, the L2 error estimates for the divided differences in Theorem 2 and Corollary 3 also hold for the variable coefficient equation (2.1) with f(u)=a(x)u and |a(x)|δ>0. In fact, for variable coefficient equations, we can obtain optimal (k+1)th order in the L2 norm and thus (2k+1)th order in the negative-order norm; see Sect. 3.3.

Remark 2

The result with α=0 in Theorem 2 is indeed a superconvergence result towards a particular projection of the exact solution (supercloseness) that has been established in [18], which is a starting point for proving hαξ with α1. For completeness, we list the superconvergence result for ξ (zeroth order divided difference) as follows

ξ2+0T|[ξ]|2dtCh2k+3, 3.4a
ξxCh-1SC(ξt+hk+1), 3.4b
ξt2+0T|[ξt]|2dtCh2k+2, 3.4c

where, on each element Ij, we have used ξ=rj+S(x)(x-xj)/hj with rj=ξ(xj) being a constant and S(x)Pk-1(Ij). Note that the proof of such superconvergence results requires that |f(u)| is uniformly lower bounded by a positive constant; for more details, see [18].

In the proof of Theorem 2, we have also obtained a generalized version about the L2 norm estimates of ξ in terms of the divided differences, their time derivatives, and spatial derivatives. To simplify notation, for an arbitrary multi-index β=(β1,β2), we denote by Mβ(·) the mixed operator containing divided differences and time derivatives of a given function, namely

Mβ(·)=hβ1tβ2(·). 3.5

Corollary 4

Under the same conditions as in Theorem 2, for β0=0,1 and a multi-index β=(β1,β2) with |β|=β1+β2k+1, we have the following unified error estimate:

xβ0Mβξ(T)Chk+32-|β|2,

where |β|=β0+|β|.

Proof of the main results in the L2 norm

Similar to the discussion of the DG discretization operator properties in Sect. 2.2.4, without loss of generality, we will only consider the case f(u(x,t))δ>0 for all (x,t)Ω×[0,T]; the case of f(u(x,t))-δ<0 is analogous. Therefore, we take the upwind numerical flux as f^=f(uh-) on each cell interface and choose the projection as Qh=Ph- on each cell, and the initial condition is chosen as hαuh(0)=Ph-(hαu0). Since the case α=0 has already been proven in [18] (see (3.4a)), we need only to consider 1αk+1. In order to clearly display the main ideas of how to perform the L2 norm error estimates for divided differences, in the following two sections we present the detailed proof for Theorem 2 with α=1 and α=2, respectively; the general cases with 3αk+1 (k2) can be proven by induction, which are omitted to save space.

Analysis for the first order divided difference

For α=1, the DG scheme (2.2) becomes

(huh)t,vhj=Hjhf(uh),vh

with j=j+12, which holds for any vhVhα and j=1,,N. By Galerkin orthogonality and summing over all j, we have the error equation

het,vh=H(h(f(u)-f(uh)),vh) 3.6

for all vhVhα. To simplify notation, we would like to denote he:=e¯=η¯+ξ¯ with η¯=hη,ξ¯=hξ. If we now take vh=ξ¯, we get the following identity

12ddtξ¯2+η¯t,ξ¯=H(h(f(u)-f(uh)),ξ¯). 3.7

The estimate for the right side of (3.7) is complicated, since it contains some integral terms involving mixed order divided differences of ξ, namely ξ and ξ¯, which is given in the following lemma.

Lemma 4

Suppose that the conditions in Theorem 2 hold. Then we have

graphic file with name 211_2016_833_Equ39_HTML.gif 3.8

where the positive constants C and C are independent of h and uh.

Proof

Let us start by using the second order Taylor expansion with respect to the variable u to write out the nonlinear terms, namely f(u)-f(uh) and f(u)-f(uh-), as

f(u)-f(uh)=f(u)ξ+f(u)η-R1e2, 3.9a
f(u)-f(uh-)=f(u)ξ-+f(u)η--R2(e-)2, 3.9b

where R1=01(1-μ)f(u+μ(uh-u))dμ and R2=01(1-ν)f(u+ν(uh--u))dν are the integral form of the remainders of the second order Taylor expansion. We would like to emphasize that the various order spatial derivatives, time derivatives and divided differences of R1 are all bounded uniformly due to the smoothness of f and u. Thus,

H(h(f(u)-f(uh)),ξ¯)=H(h(f(u)ξ),ξ¯)+H(h(f(u)η),ξ¯)-H(h(R1e2),ξ¯)J+K-L,

which will be estimated separately below.

To estimate J, we employ the Leibniz rule (2.6b), and rewrite h(f(u)ξ) as

h(f(u)ξ)=f(u(x+h/2))ξ¯(x)+(hf(u(x)))ξ(x-h/2),

and thus,

J=H(f(u)ξ¯,ξ¯)+H((hf(u))ξ,ξ¯)J1+J2,

where we have omitted the dependence of x for convenience if there is no confusion, since the proof of (2.11b) is still valid even if f(u) and ξ¯ are evaluated at different x; see proof of (2.11b) in Sect. 2.2.4. A direct application of Lemma 1 together with the assumption that f(u)δ>0, (2.11b), leads to the estimate for J1:

graphic file with name 211_2016_833_Equ42_HTML.gif 3.10a

By Corollary 1, we arrive at the estimate for J2:

J2Cξ+ξx+h-12[ξ]ξ¯. 3.10b

Substituting (3.4a)–(3.4c) into (3.10b), and combining with (3.10a), we have, after a straightforward application of Young’s inequality, that

graphic file with name 211_2016_833_Equ44_HTML.gif 3.11

Let us now move on to the estimate of K. By Corollary 2, we have

KChk+1ξ¯. 3.12

To estimate L, let us first employ the identity (2.6b) and rewrite h(R1e2) as

h(R1e2)=R1(u(x+h/2))he2+hR1(u(x))e2(x-h/2)=R1(u(x+h/2))e¯(x)(e(x+h/2)+e(x-h/2))+hR1(u(x))e2(x-h/2)D1+D2.

Consequently,

L=H(D1,ξ¯)+H(D2,ξ¯).

It is easy to show, for the high order nonlinear term H(D1,ξ¯), that

H(D1,ξ¯)Cee¯ξ¯x+e¯Γhξ¯ΓhCh-1eξ¯+η¯+h12η¯Γhξ¯Ch-1eξ¯+hk+1ξ¯, 3.13

where in the first step we have used the Cauchy–Schwarz inequality, in the second step we have used the inverse properties (i) and (ii), and in the last step we have employed the interpolation properties (3.3). We see that in order to deal with the nonlinearity of f we still need to have a bound for e. Due to the superconvergence result (3.4a), we conclude, by combining inverse inequality (iii) and the approximation property (2.10b), that

eChk+1. 3.14

Therefore, for small enough h, we have

H(D1,ξ¯)Cξ¯2+Chk+1ξ¯. 3.15a

By using analysis similar to that in the proof of (3.13), we have, for H(D2,ξ¯), that

H(D2,ξ¯)Ch-1eξ+hk+1ξ¯.

As a consequence, by (3.14) and (3.4a)

H(D2,ξ¯)Chk+1ξ¯. 3.15b

A combination of (3.15a) and (3.15b) produces a bound for L:

LCξ¯2+Chk+1ξ¯. 3.16

To complete the proof of Lemma 4, we need only combine (3.11), (3.12), (3.16) and use Young’s inequality.

We are now ready to derive the L2 norm estimate for ξ¯. To do this, let us begin by inserting the estimate (3.8) into (3.7) and taking into account the bound for η¯ in (3.3) and thus η¯t to get, after an application of Cauchy–Schwarz inequality and Young’s inequality, that

graphic file with name 211_2016_833_Equ251_HTML.gif

Next, we integrate the above inequality with respect to time between 0 and T and note the fact that ξ¯(0)=0 due to ξ(0)=0 to obtain

graphic file with name 211_2016_833_Equ252_HTML.gif

where we have used the superconvergence result (3.4a). An application of Gronwall’s inequality leads to the desired result

graphic file with name 211_2016_833_Equ51_HTML.gif 3.17

This finishes the proof of Theorem 2 for α=1.

Remark 3

We can see that the estimates (3.17) for the L2 norm and the jump seminorm of ξ¯ are based on the corresponding results for ξ in Remark 2, which are half an order lower than that of ξ. This is mainly due to the hybrid of different order divided differences of ξ, namely ξ and ξ¯, and thus the application of inverse property (ii). It is natural that the proof for the high order divided difference of ξ, say h2ξ, should be based on the corresponding lower order divided difference results of ξ (ξ and ξ¯) that have already been established; see Sect. 3.2.2 below.

Analysis for the second order divided difference

For α=2, the DG scheme (2.2) becomes

(h2uh)t,vhj=Hjh2f(uh),vh

with j=j, which holds for any vhVhα and j=1,,N. By Galerkin orthogonality and summing over all j, we have the error equation

h2et,vh=H(h2(f(u)-f(uh)),vh) 3.18

for all vhVhα. To simplify notation, we would like to denote h2e:=e~=η~+ξ~ with η~=h2η,ξ~=h2ξ. If we now take vh=ξ~, we get the following identity

12ddtξ~2+η~t,ξ~=H(h2(f(u)-f(uh)),ξ~). 3.19

The estimate for right side of (3.19) is rather complicated, since it contains some integral terms involving mixed order divided differences of ξ, namely ξ, ξ¯ and ξ~, which is given in the following Proposition.

Proposition 1

Suppose that the conditions in Theorem 2 hold. Then we have

graphic file with name 211_2016_833_Equ54_HTML.gif 3.20

where the positive constants C and C are independent of h and uh.

Proof

By the second order Taylor expansion (3.9), we have

H(h2(f(u)-f(uh)),ξ~)=H(h2(f(u)ξ),ξ~)+H(h2(f(u)η),ξ~)-H(h2(R1e2),ξ~)P+Q-S, 3.21

which will be estimated one by one below.

To estimate P, we use the Leibniz rule (2.6b), to rewrite h2(f(u)ξ) as

h2(f(u)ξ)=f(u(x+h))ξ~(x)+2hf(u(x+h/2))ξ¯(x-h/2)+h2f(u(x))ξ(x-h),

and thus,

P=H(f(u)ξ~,ξ~)+2H((hf(u))ξ¯,ξ~)+H((h2f(u))ξ,ξ~)P1+P2+P3,

where we have omitted the dependence of x for convenience if there is no confusion. A direct application of Lemma 1 together with the assumption that f(u)δ>0, (2.11b), produces the estimate for P1:

graphic file with name 211_2016_833_Equ56_HTML.gif 3.22a

By Corollary 1, we arrive at the estimates for P2 and P3:

P2Cξ¯+ξ¯x+h-12[ξ¯]ξ~, 3.22b
P3Cξ+ξx+h-12[ξ]ξ~. 3.22c

Substituting (3.4a)–(3.4c), (3.17) into (3.22b), (3.22c), and combining with (3.22a), we have, after a straightforward application of Young’s inequality, that

graphic file with name 211_2016_833_Equ59_HTML.gif 3.23

For terms on the right side of (3.23), although we have information about Inline graphic and Inline graphic as shown in (3.4a) and (3.17), we still need a suitable bound for ξ¯x, which is given in the following lemma.

Lemma 5

Suppose that the conditions in Theorem 2 hold. Then we have

ξ¯xC(ξ¯t+hk+1), 3.24

where C depends on u and δ but is independent of h and uh.

The proof of this lemma is given in the appendix. Up to now, we see that we still need to have a bound for ξ¯t. In fact, the proof for ξ¯t would require additional bounds for (ξt)x and ξtt, whose results are shown in Lemmas 6 and 7.

Lemma 6

Suppose that the conditions in Theorem 2 hold. Then we have

(ξt)xC(ξtt+hk+1). 3.25

The proof of Lemma 6 follows along a similar argument as that in the proof of Lemma 5, so we omit the details here.

Lemma 7

Suppose that the conditions in Theorem 2 hold. Then we have

graphic file with name 211_2016_833_Equ62_HTML.gif 3.26

The proof of this lemma is deferred to the appendix. Based on the above two lemmas, we are able to prove the bound for ξ¯t in Lemma 8, whose proof is deferred to the appendix.

Lemma 8

Suppose that the conditions in Theorem 2 hold. Then we have

graphic file with name 211_2016_833_Equ63_HTML.gif 3.27

where C depends on u and δ but is independent of h and uh.

We now collect the estimates in Lemmas 5 and 8 into (3.23) to get

graphic file with name 211_2016_833_Equ64_HTML.gif 3.28

Let us now move on to the estimate of Q. By Corollary 2, we have

QChk+1ξ~. 3.29

To estimate S, let us first employ the identity (2.6b) and rewrite h2(R1e2) as

h2(R1e2)=R1(u(x+h))h2e2+2hR1(u(x+h/2))he2(x-h/2)+h2R1(u(x))e2(x-h)E1+E2+E3,

where

E1=R1(u(x+h))e(x+h)e~(x)+2e¯(x+h/2)e¯(x-h/2)+e~(x)e(x-h),E2=2hR1(u(x+h/2))e¯(x-h/2)e(x)+e(x-h),E3=h2R1(u(x))e2(x-h).

Thus,

S=H(E1,ξ~)+H(E2,ξ~)+H(E3,ξ~)S1+S2+S3.

By using analysis similar to that in the proof of (3.13), we get

S1Ch-1(e+e¯)ξ~+ξ¯+hk+1ξ~Cξ~+ξ¯+hk+1ξ~,S2Ch-1eξ¯+hk+1ξ~Cξ¯+hk+1ξ~,S3Ch-1eξ+hk+1ξ~Cξ+hk+1ξ~,

where we have used the fact that for k1 and small enough h, Ch-1(e+e¯)C; for more details, see the appendix. Consequently

SCξ~+ξ¯+ξ+hk+1ξ~. 3.30

Collecting the estimates (3.28)–(3.30) into (3.21) and taking into account (3.4a) and (3.17), we get

graphic file with name 211_2016_833_Equ253_HTML.gif

This finishes the proof of Proposition 1.

We are now ready to derive the L2 norm estimate for ξ~. To do this, we begin by combining (3.19) and (3.20) to get

graphic file with name 211_2016_833_Equ254_HTML.gif

Next, integrate the above inequality with respect to time between 0 and T and use ξ(0)=0 (and thus ξ~(0)=h2ξ(0)=0) to obtain

graphic file with name 211_2016_833_Equ255_HTML.gif

by the estimates (3.4a) and (3.17). An application of Gronwall’s inequality leads to the desired result

graphic file with name 211_2016_833_Equ67_HTML.gif 3.31

This completes the proof of Theorem 2 with α=2.

Remark 4

Through the proof of Theorem 2 with α=2, ξ~, we can see that apart from the bounds for ξ,ξx,ξt that have already been obtained for proving ξ¯, we require additional bounds for ξ¯x,ξ¯t,(ξt)x, and ξtt, as shown in Lemmas 58. The proof for the L2 norm estimates for higher order divided differences are more technical and complicated, and it would require bounds regarding lower order divided differences as well as its corresponding spatial and time derivatives. For example, when α=3, in addition to the abounds aforementioned, we need to establish the bounds for ξ~x,ξ~t,(ξ¯t)x,ξ¯tt,(ξtt)x and ξttt. Thus, Theorem 2 can be proven along the same lines for general αk+1. Finally, we would like to point out that the corresponding results on the jump seminorm for various order divided differences and time derivatives of ξ are useful, which play an important role in deriving Theorem 2.

Variable coefficient case

The main results

In this section we consider the L2 error estimates for divided differences for the variable coefficient equation (1.1) with f(u)=a(x)u. Similar to the nonlinear hyperbolic case, to obtain a suitable bound for the L2 norm the numerical flux should be chosen as an upwind flux. Moreover, the analysis requires a condition that |a(x)| is uniformly lower bounded by a positive constant. Without loss of generality, we only consider a(x)δ>0, and thus the DG scheme is

(hαuh)t,vh=H(hα(auh),vh) 3.32

for vhVhα. We will use the same notation as before.

For nonlinear hyperbolic equations, the loss of order in Theorem 2 is mainly due to the lack of control for the interface jump terms arising from (2.11a) in the superconvergence relation, for example, (3.4b), (3.24) and (3.25). Fortunately, for variable coefficient hyperbolic equations, we can establish a stronger superconvergence relation between the spatial derivative as well as interface jumps of the various order divided difference of ξ and its time derivatives; see (3.37b) below. Thus, optimal L2 error estimates of order k+1 are obtained.

Prior to stating our main theorem, we would like to present convergence results for time derivatives of ξ, which is slightly different to those for nonlinear hyperbolic equations.

Lemma 9

Let u be the exact solution of the variable coefficient hyperbolic Eq. (1.1) with f(u)=a(x)u, which is assumed to be sufficiently smooth with bounded derivatives. Let uh be the numerical solution of scheme (3.32) (α=0) with initial condition uh(0)=Qhu0, (Qh=Ph±) when the upwind flux is used. For regular triangulations of Ω=(a,b), if the finite element space Vhα of piecewise polynomials with arbitrary degree k0 is used, then for any m0 and any T>0 there holds the following error estimate

tmξ(T)Chk+1, 3.33

where the positive constant C depends on u, T and a, but is independent of h.

The proof of this lemma is postponed to the appendix.

We are now ready to state our main theorem.

Theorem 3

For any α1, let hαu be the exact solution of the problem (2.1) with f(u)=a(x)u, which is assumed to be sufficiently smooth with bounded derivatives, and assume that |a(x)| is uniformly lower bounded by a positive constant. Let hαuh be the numerical solution of scheme (3.32) with initial condition hαuh(0)=Qh(hαu0) when the upwind flux is used. For a uniform mesh of Ω=(a,b), if the finite element space Vhα of piecewise polynomials with arbitrary degree k0 is used, then for any T>0 there holds the following error estimate

hαξ(T)Chk+1, 3.34

where the positive constant C depends on u, δ, T and a, but is independent of h.

Remark 5

Based on the optimal error estimates for hαξ together with approximation error estimates (3.3) and using the duality argument in [19], we can obtain the negative-order norm estimates

hα(u-uh)(T)-(k+1),ΩCh2k+1, 3.35

and thus

u-Khν,k+1uhCh2k+1. 3.36

For more details, see [5, 19] and also Sect. 4 below.

Proof of main results

We shall prove Theorem 3 for general α1. First we claim that if we can prove the following three inequalities

hmξChk+1,0mα-1, 3.37a
(Mβξ)x+h-12[Mβξ]Chβ1tβ2+1ξ+hk+1,|β|=β1+β2α-1, 3.37b
MγξChk+1,|γ|αandγ(α,0), 3.37c

where Mβξ=hβ1tβ2ξ represents the mixed operator containing divided differences and time derivatives of ξ that has already been defined in (3.5), then hαξChk+1. In what follows, we sketch the verification of this claim. To do that, we start by taking vh=hαξ in the following error equation

hαet,vh=H(hα(aξ),vh)+H(hα(aη),vh),

which is

12ddthαξ2+hαηt,hαξ=H(hα(aξ),hαξ)+H(hα(aη),hαξ). 3.38

Next, consider the term H(hα(aξ),hαξ). Use Leibniz rule (2.6b) to rewrite hα(aξ) and employ (2.11a), (2.11b) in Lemma 1 to get the bound

H(hα(aξ),hαξ)Chαξ2+Chk+1hαξ,

where we have also used the relations (3.37a)–(3.37c). For the estimate of H(hα(aη),hαξ), we need only use Corollary 2 to get

H(hα(aη),hαξ)Chk+1hαξ.

Collecting above two estimates into (3.38) and using Cauchy–Schwarz inequality as well as Gronwall’s inequality, we finally get

hαξChk+1.

The claim is thus verified.

In what follows, we will prove (3.37) by induction.

Step 1  For α=1, ξChk+1 is well known, and thus (3.37a) is valid for α=1. Moreover, (3.37c), namely ξtChk+1 has been given in (3.4c); see [18]. To complete the proof for α=1, we need only to establish the following relation

ξx+h-12[ξ]Cξt+hk+1. 3.39
Proof

Noting the relation (3.4b), we need only to prove

h-12[ξ]Cξt+hk+1. 3.40

To do that, we consider the cell error equation

et,vhj=Hjae,vh=Hjaξ,vh+Hjaη,vh,

which holds for any vhVhα and j=1,,N. If we now take vh=1 in the above identity and use the strong form (2.3b) for Hjaξ,vh, we get

et,1j=-(aξ)x,1j-(a[[ξ]])j-12+Hjaη,1-W1-W2+W3.

It follows from the assumption |a(x)|δ>0 that

δ|[[ξ]]j-12||W2||W1|+|W3|+|et,1j|. 3.41

By Cauchy–Schwarz inequality, we have

|W1|+|et,1j|Ch12(ξIj+ξxIj+ξtIj+ηtIj).

By the definition of the projection Ph-, (2.9b)

|W3|=0.

Inserting the above two estimates into (3.41), we arrive at

|[[ξ]]j-12|Ch12(ξIj+ξxIj+ξtIj+ηtIj),

which is

graphic file with name 211_2016_833_Equ256_HTML.gif

where we have used the bound for ξ, the relation (3.4b) and approximation error estimates (2.10a), and thus (3.40) follows. Therefore, (3.37) is valid for α=1.

Step 2  Suppose that (3.37) is true for α=. That is

hmξChk+1,0m-1, 3.42a
(Mβξ)x+h-12[Mβξ]C(hβ1tβ2+1ξ+hk+1),|β|=β1+β2-1, 3.42b
MγξChk+1,|γ|andγ(,0), 3.42c

let us prove that it also holds for α=+1.

First, as shown in our claim, (3.42) implies that

hξ(T)Chk+1.

The above estimate together with (3.42a) produces

hmξChk+1,0m. 3.43

Therefore, (3.37a) is valid for α=+1.

Next, by assumption (3.42b), we can see that to show (3.37b) for α=+1, we need only to show

(Mβξ)x+h-12[Mβξ]Chβ1tβ2+1ξ+hk+1,|β|=.

Without loss of generality, let us take β=(,0) for example. To this end, we consider the following error equation

het,vh=H(h(aξ),vh)+H(h(aη),vh),

which holds for any vhVhα. We use Leibniz rule (2.6b) to write out h(aξ) as

haξ=i=0ihiax+-i2hh-iξx-i2hi=0zi.

Therefore, the error equation becomes

het,vh=i=0Zi+H(h(aη),vh), 3.44

where Zi=H(zi,vh) for i=0,,. Let us now work on Z0. By the strong form of H, (2.4b), we have

Z0=H(ahξ,vh)=-(ahξ)x,vh-j=1Na[[hξ]]vh+j-12.

Denote Lk the standard Legendre polynomials of degree k in [-1,1]. If we now let vh=(hξ)x-dLk(s) with d=(-1)k(hξ)xj-12+ being a constant and s=2(x-xj)h, we get

Z0=-a(xj)(hξ)x,vh-(a(x)-a(xj))(hξ)x,vh-axhξ,vh-Z0,0-Z0,1-Z0,2,

since (vh)j-12+=0. Substituting above expression into (3.44) and taking into account the assumption that a(x)δ>0, we have

δ(hξ)x2Z0,0=i=1Zi+H(h(aη),vh)-Z0,1-Z0,2-het,vh. 3.45

It is easy to show by Corollary 1 that

i=1ZiCi=1h-iξ+(h-iξ)x+h12[h-iξ]vhChk+1vh, 3.46a

where we have used (3.42a)–(3.42c), since -i-1 for i1. By Corollary 2, we have

H(h(aη),vh)Chk+1vh. 3.46b

By (3.43) and inverse property (i), we arrive at a bound for Z0,1 and Z0,2

|Z0,1|+|Z0,2|ChξvhChk+1vh. 3.46c

The triangle inequality and the approximation error estimate (3.3) yield

het,vhChtξ+hk+1vh. 3.46d

Collecting the estimates (3.46a)–(3.46d) into (3.45) and using the fact that vhC(hξ)x, we arrive at

(hξ)xC(htξ+hk+1). 3.47

If we take vh=1 in the cell error equation and use an analysis similar to that in the proof of (3.40), we will get the following relation

h-12[hξ]C(htξ+hk+1). 3.48

A combination of (3.47) and (3.48) gives us

(hξ)x+h-12[hξ]C(htξ+hk+1).

Therefore, (3.37b) still holds for α=+1.

Finally, let us verify that (3.37c) is valid for α=+1. Noting the assumption (3.42c), we need only consider |γ|=+1 and γ(+1,0). To do that, we start from the estimate for Mγξ with γ=(0,+1) that has already been established in (3.33). By an analysis similar to that in the proof of Lemma 8 and taking into account relations (3.37a) and (3.37b) for α=+1, we conclude that (3.37c) is valid for γ=(1,). Repeating the above procedure, we can easily verify that (3.37c) is also valid for γ=(2,-1),,(,1). Therefore, (3.37c) holds true for α=+1, and thus (3.34) in Theorem 3 is valid for general α1.

Superconvergent error estimates

For nonlinear hyperbolic equations, the negative-order norm estimate of the DG error itself has been established in [16]. However, by post-processing theory [5, 11], negative-order norm estimates of divided differences of the DG error are also needed to obtain superconvergent error estimates for the post-processed solution in the L2 norm. Using a duality argument together with L2 norm estimates established in Sect. 3, we show that for a given time T, the α-th order divided difference of the DG error in the negative-order norm achieves 2k+32-α2th order superconvergence. As a consequence, the DG solution uh(T), converges with at least 32k+1th order in the L2 norm when convolved with a particularly designed kernel.

We are now ready to state our main theorem about the negative-order norm estimates of divided differences of the DG error.

Theorem 4

For any 1αk+1, let hαu be the exact solution of the problem (2.1), which is assumed to be sufficiently smooth with bounded derivatives, and assume that |f(u)| is uniformly lower bounded by a positive constant. Let hαuh be the numerical solution of scheme (2.2) with initial condition hαuh(0)=Qh(hαu0) when the upwind flux is used. For a uniform mesh of Ω=(a,b), if the finite element space Vhα of piecewise polynomials with arbitrary degree k1 is used, then for small enough h and any T>0 there holds the following error estimate

hα(u-uh)(T)-(k+1),ΩCh2k+32-α2, 4.1

where the positive constant C depends on u, δ, T and f, but is independent of h.

Combining Theorems 4 and 1, we have

Corollary 5

Under the same conditions as in Theorem 4, if in addition Khν,k+1 is a convolution kernel consisting of ν=2k+1+ω (ω-k2) B-splines of order k+1 such that it reproduces polynomials of degree ν-1, then we have

u-uhCh32k+1, 4.2

where uh=Khν,k+1uh.

Remark 6

The (32k+1)th order superconvergence is shown for the negative k+1 norm, and thus is valid for B-splines of order k+1 (by Theorem 1). For general order of B-splines and α, using similar argument for the proof of the negative k+1 norm estimates (see Sect. 4.1), we can prove the following superconvergent error estimate

hα(u-uh)(T)-,ΩChk+32-α2+-1Chk++12.

Therefore, from the theoretical point of view, a higher order of B-splines may lead to a superconvergence result of higher order, for example =k+1 and thus (32k+1)th order in Corollary 5. However, from the practical point of view, changing the order of B-splines does not affect the order of superconvergence; see Sect. 5 below and also [17].

Proof of the main results in the negative-order norm

Similar to the proof for the L2 norm estimates of the divided differences in Sect. 3.2, we will only consider the case f(u(x,t))δ>0 for all (x,t)Ω×[0,T]. To perform the analysis for the negative-order norm, by (2.5), we need to concentrate on the estimate of

hα(u-uh)(T),Φ 4.3

for ΦC0(Ω). To do that, we use the duality argument, following [16, 19]. For the nonlinear hyperbolic Eq. (2.1), we choose the dual equation as: Find a function φ such that φ(·,t) is periodic for all t[0,T] and

hαφt+f(u)hαφx=0,(x,t)Ω×[0,T), 4.4a
φ(x,T)=Φ(x),xΩ. 4.4b

Unlike the purely linear case [11, 15] or the variable coefficient case [19], the dual equations for nonlinear problems will no longer preserve the inner product of original solution hαu and its dual solution φ, namely ddthαu,φ0. In fact, if we multiply (2.1a) by φ and (4.4a) by (-1)αu and integrate over Ω, we get, after using integration by parts and summation by parts (2.6d), that

ddthαu,φ+F(u;φ)=0, 4.5

where

F(u;φ)=(-1)αf(u)u-f(u),hαφx.

Note that F(u;φ) is the same as that in [16] when α=0. We now integrate (4.5) with respect to time between 0 and T to obtain a relation hαu,φ in different time level

hαu,φ(T)=hαu,φ(0)-0TF(u;φ)dt. 4.6

In what follows, we work on the estimate of (4.3). To do that, let us begin by using the relation (4.6) to get an equivalent form of (4.3). It reads, for any χVhα

hα(u-uh)(T),Φ=hα(u-uh)(T),φ(T)=hαu,φ(0)-0TF(u;φ)dt-hαuh,φ(0)-0Tddthαuh,φdt=hα(u-uh),φ(0)-0T(hαuh)t,φ+hαuh,φtdt-0TF(u;φ)dt=G1+G2+G3,

where

G1=hα(u-uh),φ(0),G2=-0Thαuht,φ-χ-H(hαf(uh),φ-χ)dt,G3=-0Thαuh,φt+H(hαf(uh),φ)+F(u,φ)dt

will be estimated one by one below.

Note that in our analysis for hα(u-uh)(T) in Theorem 2, we need to choose a particular initial condition, namely hαuh(0)=Ph-(hαu0) instead of hαuh(0)=Pk(hαu0) for purely linear equations [11, 15]. Thus, we arrive at a slightly different bound for G1, as shown in the following lemma. We note that using the L2 projection in the numerical examples is still sufficient to obtain superconvergence.

Lemma 10

(Projection estimate) There exists a positive constant C, independent of h, such that

|G1|Ch2k+1hαu0k+1φ(0)k+1. 4.7

Proof

Since hαuh(0)=Ph-(hαu0), we have the following identity

G1=hα(u-uh),φ(0)=hαu0-Ph-(hαu0),φ(0)-Pk-1φ(0),

where Pk-1 is the L2 projection into Vhk-1. A combination of Cauchy–Schwarz inequality and approximation error estimates (2.10a) leads to the desired result (4.7).

The bound for G2 is given in the following lemma.

Lemma 11

(Residual) There exists a positive constant C, independent of h, such that

|G2|Ch2k+32-α2φL1([0,T];Hk+1). 4.8

Proof

Denoting by G the term inside the time integral of G2, we get, by taking χ=Pkφ, the following expression for G,

G=-H(hαf(uh),φ-Pkφ),

which is equivalent to

G=-hα(f(uh)-f(u)),(φ-Pkφ)x+hαf(u)x,φ-Pkφ+j=1Nhα(f(u)-f(uh-))[[φ-Pkφ]]j-12G1+G2+G3,

where we have added and subtracted the term hαf(u),(φ-Pkφ)x and used integration by parts.

Let us now consider the estimates of G1,G2,G3. For G1, by using the second order Taylor expansion for f(u)-f(uh), (3.9), we get

G1=hαf(u)e-R1e2,(φ-Pkφ)x=hα(f(u)e),(φ-Pkφ)x-hα(R1e2),(φ-Pkφ)xG1lin-G1nlr,

where G1lin and G1nlr, respectively, represent the linear part and the nonlinear part of G1. It is easy to show, by using the Leibniz rule (2.6b) and Cauchy–Schwarz inequality, that

|G1lin|C=0αhα-e(φ-Pkφ)xCh2k+32-α2φk+1, 4.9a

where we have used the estimate for hα-e in Corollary 3 and the approximation error estimate (2.10a). Analogously, for high order nonlinear term G1nlr, we have

|G1nlr|C=0αhα-e2(φ-Pkφ)xCm=0αhmehα-me(φ-Pkφ)xCh3k+52-α2φk+1, 4.9b

where we have used the (2.6b) twice, the inverse property (iii), the L2 norm estimate (3.2), and the approximation error estimate (2.10a). A combination of above two estimates yields

|G1|Ch2k+32-α2φk+1. 4.10

To estimate G2, we use an analysis similar to that in the proof of G1 in Lemma 10 and make use of the orthogonal property of the L2 projection Pk to get

G2=hαf(u)x-Pk(hαf(u)x),φ-PkφCh2k+2hαf(u)xk+1φk+1, 4.11

where we have used the approximation error estimate (2.10a).

We proceed to estimate G3. It follows from the Taylor expansion (3.9), the Leibniz rule (2.6b), the Cauchy–Schwarz inequality and the inverse properties (ii), (iii) that

|G3|C=0αheΓhφ-PkφΓh+Cm=0αhmehα-meΓhφ-PkφΓhCh2k+32-α2φk+1+Ch3k+52-α2φk+1Ch2k+32-α2φk+1, 4.12

where we have also used (3.2) and (2.10a). Collecting the estimates (4.10)–(4.12), we get

|G|Ch2k+32-α2φk+1. 4.13

Consequently, the estimate for G2 follows by integrating the above inequality with respect to time.

We move on to the estimate of G3, which is given in the following lemma.

Lemma 12

(Consistency) There exists a positive constant C, independent of h, such that

|G3|Ch2k+3-α2φL1([0,T];Hk+1). 4.14

Proof

To do that, let us denote by G4 the term inside the integral G3 and take into account (2.6d) to obtain an equivalent form of G4

G4=(-1)αuh,hαφt+(-1)αf(uh),hαφx+(-1)αf(u)u-f(u),hαφx+j=1Nhαf(uh-)[[φ]]j+12=(-1)αf(uh)-f(u)-f(u)(uh-u),hαφx,

where we have used the dual problem (4.4) and the fact that [[φ]]=0 due to the smoothness of φ. Next, by using the second order the Taylor expansion (3.9) and (2.6d) again, we arrive at

G4=hα(R1e2),φx.

If we now use (2.6b) twice for hα(R1e2) and the Cauchy–Schwarz inequality together with the error estimate (3.2), we get

|G4|C=0αm=0hmeh-meφxCh2k+3-α2φk+1, 4.15

where we have also used the Sobolev inequality φxCφk+1, under the condition that k>1/2. The bound for G3 follows immediately by integrating the above inequality with respect to time.

We are now ready to obtain the final negative-order norm error estimates for the divided differences. By collecting the results in Lemmas 1012 and taking into account the regularity result in Lemma 3, namely φk+1CΦk+1, we get a bound for hα(u-uh)(T),Φ

hα(u-uh)(T),ΦCh2k+32-α2Φk+1.

Thus, by (2.5), we have the bound for the negative-order norm

hα(u-uh)(T)-(k+1),ΩCh2k+32-α2.

This finishes the proof of Theorem 4.

Numerical examples

For nonlinear hyperbolic equations, we proved L2 norm superconvergence results of order 32k+1 for post-processed errors, as shown in Corollary 5. The superconvergence results together with the post-processing theory by Bramble and Schatz in Theorem 1 entail us to design a more compact kernel to achieve the desired superconvergence order. We note that superconvergence of post-processed errors using the standard kernel (a kernel function composed of a linear combination of 2k+1 B-splines of order k+1) for nonlinear hyperbolic equations has been numerically studied in [11, 16]. Note that the order of B-splines does not have significant effect on the rate of convergence numerically and that it is the number of B-splines that has greater effect to the convergence order theoretically [11], we will only focus on the effect of different total numbers (denoted by ν=2k+1+ω with ω-k2) of B-splines of the kernel in our numerical experiments. For more numerical results using different orders of B-splines, we refer the readers to [17].

We consider the DG method combined with the third-order Runge–Kutta method in time. We take a small enough time step such that the spatial errors dominate. We present the results for P2 and P3 polynomials only to save space, in which a specific value of ω is chosen to match the orders given in Corollary 5. For the numerical initial condition, we take the standard L2 projection of the initial condition and we have observed little difference if the Qh projection is used instead. Uniform meshes are used in all experiments. Only one-dimensional scalar equations are tested, whose theoretical results are covered in our main theorems.

Example 1

We consider the Burgers quation on the domain Ω=(0,2π)

ut+u22x=0,u(x,0)=sin(x) 5.1

with periodic boundary conditions.

Noting that f(u) changes its sign in the computational domain, we use the Godunov flux, which is an upwind flux. The errors at T=0.3, when the solution is still smooth, are given in Table 1. From the table, we can see that one can improve the order of convergence from k+1 to at least 2k+1, which is similar to the results for Burgers equations in [11]. Moreover, superconvergence of order 2k can be observed for the compact kernel with ω=-2, as, in general, a symmetric kernel could yield one additional order. This is why instead of ω=-k2=-1, ω=-2 is chosen in our kernel. The pointwise errors are plotted in Fig. 1, which show that the post-processed errors are less oscillatory and much smaller in magnitude for a large number of elements as observed in [11], and that the errors of our more compact kernel with ω=-2 are less oscillatory than that for the standard kernel with ω=0, although the magnitude of the errors increase. This example demonstrates that the superconvergence result also holds for conservation laws with a general flux function.

Table 1.

Before post-processing (left), after post-processing (middle) and post-processing with the more compact kernel (right). T = 0.3 L2- and L errors for Example 1

Mesh Before post-processing Post-processed (ω=0) Post-processed (ω=-2)
L2 error Order L error Order L2 error Order L error Order L2 error Order L error Order
P2
   20 1.54E−04 5.70E−04 1.04E−04 3.16E−04 5.36E−04 1.40E−03
   40 2.06E−05 2.90 1.03E−04 2.47 2.28E−06 5.52 7.53E−06 5.39 3.69E−05 3.86 9.93E−05 3.81
   80 2.73E−06 2.92 1.55E−05 2.73 3.97E−08 5.84 1.38E−07 5.77 2.37E−06 3.96 6.43E−06 3.95
   160 3.56E−07 2.93 2.25E−06 2.78 1.13E−09 5.13 9.86E−09 3.81 1.49E−07 3.99 4.06E−07 3.99
P3
   20 7.68E−06 2.91E−05 5.88E−05 1.88E−04 1.59E−04 4.80E−04
   40 5.21E−07 3.88 2.36E−06 3.62 5.47E−07 6.75 1.97E−06 6.58 3.71E−06 5.42 1.21E−05 5.31
   80 3.45E−08 3.92 1.74E−07 3.76 2.87E−09 7.57 1.09E−08 7.50 6.56E−08 5.82 2.20E−07 5.78
   160 2.23E−09 3.95 1.19E−08 3.87 1.22E−11 7.88 4.70E−11 7.86 1.06E−09 5.95 3.58E−09 5.94

Fig. 1.

Fig. 1

The errors in absolute value and in logarithmic scale for P2 (top) and P3 (bottom) polynomials with N=20,40,80 and 160 elements for Example 1 where f(u)=u2/2. Before post-processing (left), after post-processing (middle) and post-processing with the more compact kernel (right). T=0.3

Example 2

In this example we consider the conservation laws with more general flux functions on the domain Ω=(0,2π)

ut+(eu)x=0,u(x,0)=sin(x) 5.2

with periodic boundary conditions.

We test the Example 2 at T=0.1 before the shock is developed. The orders of convergence with different kernels are listed in Table 2 and pointwise errors are plotted in Fig. 2. We can see that the post-processed errors are less oscillatory and much smaller in magnitude for most of elements as observed in [16], and that the errors of our more compact kernel with ω=-2 are slightly less oscillatory than that for the standard kernel with ω=0. This example demonstrates that the accuracy-enhancement technique also holds true for conservation laws with a strong nonlinearity that is not a polynomial of u.

Table 2.

Before post-processing (left), after post-processing (middle) and post-processing with the more compact kernel (right). T = 0.1 L2- and L errors for Example 2

Mesh Before post-processing Post-processed (ω=0) Post-processed (ω=-2)
L2 error Order L error Order L2 error Order L error Order L2 error Order L error Order
P2
   20 1.25E−04 5.76E−04 4.45E−05 1.61E−04 2.49E−04 7.98E−04
   40 1.61E−05 2.95 7.64E−05 2.91 1.01E−06 5.46 4.03E−06 5.32 1.68E−05 3.88 5.73E−05 3.80
   80 1.96E−06 3.04 1.02E−05 2.91 1.80E−08 5.81 7.35E−08 5.78 1.08E−06 3.97 3.72E−06 3.95
   160 2.45E−07 3.00 1.32E−06 2.95 3.02E−10 5.90 1.25E−09 5.88 6.77E−08 3.99 2.35E−07 3.99
P3
   20 3.99E−06 2.52E−05 2.50E−05 9.12E−05 6.64E−05 2.38E−04
   40 2.62E−07 3.93 1.67E−06 3.91 2.41E−07 6.70 1.00E−06 6.51 1.57E−06 5.40 6.17E−06 5.27
   80 1.68E−08 3.96 1.13E−07 3.89 1.29E−09 7.55 5.66E−09 7.47 2.79E−08 5.81 1.14E−07 5.76
   160 1.04E−09 4.01 7.38E−09 3.93 5.45E−12 7.88 2.45E−11 7.85 4.51E−10 5.95 1.86E−09 5.94

Fig. 2.

Fig. 2

The errors in absolute value and in logarithmic scale for P2 (top) and P3 (bottom) polynomials with N=20,40,80 and 160 elements for Example 2 where f(u)=eu. Before post-processing (left), after post-processing (middle) and post-processing with the more compact kernel (right). T=0.1

Concluding remarks

In this paper, the accuracy-enhancement of the DG method for nonlinear hyperbolic conservation laws is studied. We first prove that the α-th order divided difference of the DG error in the L2 norm is of order k+32-α2 when piecewise polynomials of degree k and upwind fluxes are used, provided that |f(u)| is uniformly lower bounded by a positive constant. Then, by a duality argument, the corresponding negative-order norm estimates of order 2k+32-α2 are obtained, ensuring that the SIAC filter will achieve at least (32k+1)th order superconvergence. As a by-product, we show, for variable coefficient hyperbolic equations with f(u)=a(x)u, the optimal error estimates of order k+1 for the L2 norm of divided differences of the DG error, provided that |a(x)| is uniformly lower bounded by a positive constant. Consequently, the superconvergence result of order 2k+1 is obtained for the negative-order norm. Numerical experiments are given which show that using more compact kernels are less oscillatory and that the superconvergence property holds true for nonlinear conservation laws with general flux functions, indicating that the restriction on f(u) is artificial. Based on our numerical results we can see that these estimates are not sharp. However, they indicate that a more compact kernel can be used in obtaining superconvergence results.

Future work includes the study of accuracy-enhancement of the DG method for one-dimensional nonlinear symmetric/symmetrizable systems and scalar nonlinear conservation laws in multi-dimensional cases on structured as well as unstructured meshes. Analysis of the superconvergence property of the local DG (LDG) method for nonlinear diffusion equations is also on-going work.

Appendix

The proof of Lemma 5

Let us prove the relation (3.24) in Lemma 5. Use the Taylor expansion (3.9) and the identity (2.6b) to rewrite h(f(u)-f(uh)) as

h(f(u)-f(uh))=h(f(u)ξ)+h(f(u)η)-h(R1e2)=f(u(x+h/2))ξ¯+(hf(u))ξ(x-h/2)+h(f(u)η)-R1(u(x+h/2))(he2)-(hR1)e2(x-h/2)θ1++θ5.

This allows the error Eq. (3.6) to be written as

e¯t,vh=Θ1++Θ5, 7.1

with Θi=H(θi,vh) (i=1,,5). In what follows, we will estimate each term above separately.

First consider Θ1. Begin by using the strong form of H, (2.4b), to get

Θ1=H(f(u)ξ¯,vh)=-(f(u)ξ¯)x,vh-j=1Nf(u)[[ξ¯]]vh+j.

Next, let Lk be the standard Legendre polynomial of degree k in [-1,1], so Lk(-1)=(-1)k, and Lk is orthogonal to any polynomials of degree at most k-1. If we now let vh=ξ¯x-bLk(s) with b=(-1)k(ξ¯x)j+ being a constant and s=2(x-xj+1/2)h[-1,1], we arrive at

Θ1=-xf(u)ξ¯,vh-f(u)ξ¯x,ξ¯x-bLk(s)-X-Y, 7.2

since (vh)j+=0. On each element Ij=Ij+12=(xj,xj+1), by the linearization f(u)=f(uj+12)+(f(u)-f(uj+12)) with uj+12=u(xj+12,t) and noting ξ¯x,Lkj+12=0, we arrive at an equivalent form of Y

Y=Y1+Y2, 7.3

where

Y1=j=1Nf(uj+12)ξ¯xIj+122,Y2=(f(u)-f(uj+12))ξ¯x,ξ¯x-bLk.

By the inverse property (ii), it is easy to show, for vh=ξ¯x-bLk(s), that

vhCξ¯x.

Plugging above results into (7.1) and using the assumption that f(u(x,t))δ>0, we get

δξ¯x2Y1=i=25Θi-X-Y2-e¯t,ξ¯x-bLk. 7.4

We shall estimate the terms on the right side of (7.4) one by one below.

For Θ2, by the strong form of H, (2.4b), we have

Θ2=-(hf(u)ξ)x,vh-j=1Nhf(u)[[ξ]]vh+j=-(hf(u)ξ)x,vh,

since (vh)j+=0. Thus, by Cauchy–Schwarz inequality, we arrive at a bound for Θ2

|Θ2|C(ξ+ξx)ξ¯x. 7.5a

A direct application of Corollary 2 leads to a bound for Θ3

|Θ3|Chk+1ξ¯x. 7.5b

By using analysis similar to that in the proof of (3.13), we get

|Θ4|Ch-1e(ξ¯+hk+1)ξ¯x, 7.5c
|Θ5|Ch-1e(ξ+hk+1)ξ¯x. 7.5d

By the Cauchy–Schwarz inequality, we have

|X|Cξ¯ξ¯x. 7.5e

Using the Cauchy–Schwarz inequality again together with the inverse property (i), and taking into account the fact that |f(u)-f(uj+12)|Ch on each element Ij+12, we obtain

|Y2|Cξ¯ξ¯x. 7.5f

The triangle inequality and the approximation error estimate (3.3) yield that

|e¯t,vh|C(ξ¯t+hk+1)ξ¯x. 7.5g

Finally, the error estimate (3.24) follows by collecting the estimates (7.5a)–(7.5g) into (7.4) and by using the estimates (3.4a)–(3.4c), (3.17) and (3.14). This finishes the proof of Lemma 5.

The proof of Lemma 7

To prove the error estimate (3.26), it is necessary to get a bound for the initial error ξtt(0). To do that, we start by noting that ξ(0)=0, and that ξt(0)Chk+1, which have already been proved in [18, Appendix A.2]. Next, note also that the first order time derivative of the original error equation

ett,vh=H(t(f(u)-f(uh)),vh)

still holds at t=0 for any vhVhα. If we now let vh=ξtt(0) and use a similar argument for the proof of ξt(0) in [18], we arrive at a bound for ξtt(0)

ξtt(0)Chk+1. 7.6

We then move on to the estimate of ξtt(T) for T>0. To this end, we take the second order derivative of the original error equation with respect to t and let vh=ξtt to get

ettt,ξtt=H(tt(f(u)-f(uh)),ξtt),

which is

12ddtξtt2+ηttt,ξtt=H(tt(f(u)-f(uh)),ξtt). 7.7

To estimate the right-hand side of (7.7), we use the Taylor expansion (3.9) and the Leibniz rule for partial derivatives to rewrite tt(f(u)-f(uh)) as

tt(f(u)-f(uh))=tt(f(u)ξ)+tt(f(u)η)-tt(R1e2)=(ttf(u))ξ+2(tf(u))ξt+f(u)ξtt+(ttf(u))η+2(tf(u))ηt+f(u)ηtt-(ttR1)e2-2(tR1)te2-R1(tte2)λ1++λ9.

Therefore, the right side of (7.7) can be written as

H(tt(f(u)-f(uh)),ξtt)=Λ1++Λ9 7.8

with Λi=H(λi,ξtt) (i=1,,9), which will be estimated one by one below.

By (2.11a) in Lemma 1, it is easy to show for Λ1 that

graphic file with name 211_2016_833_Equ125_HTML.gif 7.9a

where we have used the estimates (3.4a)–(3.4c) and also Young’s inequality. Analogously,

graphic file with name 211_2016_833_Equ126_HTML.gif 7.9b

where we have also used the estimate (3.4c) and the relation (3.25). A direct application of (2.11b) in Lemma 1 together with the assumption that f(u)δ>0 leads to the estimate for Λ3:

graphic file with name 211_2016_833_Equ127_HTML.gif 7.9c

Noting that ηt=ut-Ph-(ut) and ηtt=utt-Ph-(utt), we have, by Lemma 2

|Λ4|+|Λ5|+|Λ6|Chk+1ξtt. 7.9d

By an analysis similar to that in the proof of (3.13), we get

|Λ7|Ch-1e(ξ+hk+1)ξtt,|Λ8|Ch-1e(ξt+hk+1)ξtt,|Λ9|Ch-1(e+et)(ξt+ξtt+hk+1)ξtt.

Note that the result of Lemma 7 is used to prove the convergence result for the second order divided difference of the DG error, which implies that k1. Therefore, by using the inverse property (iii), the superconvergence result (3.4a), (3.4c), and the approximation error estimate (2.10b), we have for small enough h

Ch-1eCh-1(ξ+η)ChkC,Ch-1etCh-1(ξt+ηt)Chk-12C,

where C is a positive constant independent of h. Consequently,

|Λ7|C(ξ+hk+1)ξtt, 7.9e
|Λ8|C(ξt+hk+1)ξtt, 7.9f
|Λ9|C(ξt+ξtt+hk+1)ξtt. 7.9g

Collecting the estimates (7.9a)–(7.9g) into (7.7) and (7.8), we get, after a straightforward application of Cauchy–Schwarz inequality and Young’s inequality, that

graphic file with name 211_2016_833_Equ257_HTML.gif

where we have used the estimates (3.4a) and (3.4c) for the last step. Now, we integrate the above inequality with respect to time between 0 and T and combine with the initial error estimate (7.6) to obtain

graphic file with name 211_2016_833_Equ258_HTML.gif

By the estimates (3.4a) and (3.4c) again, we arrive at

graphic file with name 211_2016_833_Equ132_HTML.gif 7.10

Finally, using Gronwall’s inequality gives us

graphic file with name 211_2016_833_Equ133_HTML.gif 7.11

which completes the proof of Lemma 7.

The proof of Lemma 8

To prove the error estimate (3.27), it is necessary to get a bound for the initial error ξ¯t(0). To do that, we start by noting that ξ(0)=0, and thus ξ¯(0)=0, due to the choice of the initial data. Next, note also that the error equation (3.6) still holds at t=0 for any vhVhα. If we now let vh=ξ¯t(0) and use a similar argument for the proof of ξt(0) in [18], we arrive at a bound for ξ¯t(0)

ξ¯t(0)Chk+1. 7.12

We then move on to the estimate of ξ¯t(T) for T>0. To obtain this, take the time derivative of the error Eq. (3.6) and let vh=ξ¯t to get

e¯tt,ξ¯t=H(th(f(u)-f(uh)),ξ¯t),

which is

12ddtξ¯t2+η¯tt,ξ¯t=H(th(f(u)-f(uh)),ξ¯t). 7.13

To estimate the right-hand side of (7.13), we use the Taylor expansion (3.9) and the Leibniz rule (2.6b) to rewrite th(f(u)-f(uh)) as

th(f(u)-f(uh))=ht(f(u)ξ)+ht(f(u)η)-ht(R1e2)=h(tf(u)ξ)+h(f(u)ξt)+h(tf(u)η)+h(f(u)ηt)-h(R1te2)-h(tR1e2)=tf(u(x+h/2))ξ¯(x)+h(tf(u))ξ(x-h/2)+f(u(x+h/2))ξ¯t(x)+hf(u)ξt(x-h/2)+h(tf(u)η)+h(f(u)ηt)-R1(u(x+h/2))h(te2)-hR1te2(x-h/2)-tR1(u(x+h/2))he2-h(tR1)e2(x-h/2)π1++π10.

This allows the right side of (7.13) to be written as

H(th(f(u)-f(uh)),ξ¯t)=Π1++Π10 7.14

with Πi=H(πi,ξ¯t) for i=1,,10, which is estimated separately below.

By (2.11a) in Lemma 1, it is easy to show for Π1 that

graphic file with name 211_2016_833_Equ137_HTML.gif 7.15a

where we have used the estimate (3.17), the relation (3.24), and also the Young’s inequality. Analogously, for Π2 and Π4, we apply Corollary 1 to get

|Π2|Cξ¯t2+h-1|[ξ]|2+h2k+2, 7.15b
|Π4|Cξ¯t2+ξtt2+h-1|[ξt]|2+h2k+2, 7.15c

where we have also used the estimates (3.4a)–(3.4c), and the relation (3.25). A direct application of (2.11b) in Lemma 1 together with the assumption that f(u)δ>0 leads to the estimate for Π3:

graphic file with name 211_2016_833_Equ140_HTML.gif 7.15d

Noting that ηt=ut-Ph-(ut), we have, by Corollary 2

|Π5|+|Π6|Chk+1ξ¯t. 7.15e

By an analysis similar to that in the proof of (3.13), we get

|Π7|C(ξt+ξ¯t+hk+1)ξ¯t, 7.15f
|Π8|C(ξt+hk+1)ξ¯t, 7.15g
|Π9|C(ξ¯+hk+1)ξ¯t, 7.15h
|Π10|C(ξ+hk+1)ξ¯t. 7.15i

Collecting the estimates (7.15a)–(7.15i) into (7.13) and (7.14), we get, after a straightforward application of Cauchy–Schwarz inequality and Young’s inequality, that

graphic file with name 211_2016_833_Equ259_HTML.gif

where we have used the estimates (3.4a), (3.4c), (3.17) and (3.26) in the last step. Now, we integrate the above inequality with respect to time between 0 and T and combine with the initial error estimate (7.12) to obtain

graphic file with name 211_2016_833_Equ260_HTML.gif

By the estimates (3.4a), (3.4c) and (3.17) again, we arrive at

graphic file with name 211_2016_833_Equ261_HTML.gif

Finally, Gronwall’s inequality gives

graphic file with name 211_2016_833_Equ146_HTML.gif 7.16

This completes the proof of Lemma 8.

The proof of Lemma 9

We will only give the proof for |a(x)|0, for example a(x)>0; the general case follows by using linear linearization of a(x) at xj in each cell Ij and the fact that |a(x)-a(xj)|Ch. For a(x)>0, by Galerkin orthogonality, we have the error equation

et,vh=H(ae,vh),

which holds for any vhVhα. If we now take m-th order time derivative of the above equation and let vh=tmξ with ξ=Ph-u-uh, we arrive at

12ddttmξ2+tm+1η,tmξ=H(atmξ,tmξ)+H(atmη,tmξ). 7.17

By (2.11b) and the assumption that a(x)>0, we get

graphic file with name 211_2016_833_Equ262_HTML.gif

It follows from Lemma 2 that

H(atmη,tmξ)Chk+1tmξ.

Inserting above two estimates into (7.17), we have

12ddttmξ2Ctmξ2+Ch2k+2,

where we have used the approximation error estimates (2.10a) and Young’s inequality. For the initial error estimate, we use an analysis similar to that in the proof of (7.6) to get

tmξ(0)Chk+1.

To complete the proof of Lemma 9, we need only to combine above two estimates and use Gronwall’s inequality.

Footnotes

The research of the Xiong Meng was supported by the EU Grant FP7-PEOPLE-2013-IIF, GA Number 622845. The research of the Jennifer K. Ryan was supported by the EU Grant FP7-PEOPLE-2013-IIF, GA Number 622845 and by the AFOSR Grant FA8655-09-1-3055.

Contributor Information

Xiong Meng, Email: xiongmeng@hit.edu.cn.

Jennifer K. Ryan, Email: Jennifer.Ryan@uea.ac.uk

References

  • 1.Adjerid S, Baccouch M. Asymptotically exact a posteriori error estimates for a one-dimensional linear hyperbolic problem. Appl. Numer. Math. 2010;60:903–914. doi: 10.1016/j.apnum.2010.04.014. [DOI] [Google Scholar]
  • 2.Adjerid S, Baccouch M. The discontinuous Galerkin method for two-dimensional hyperbolic problems Part II: a posteriori error estimation. J. Sci. Comput. 2009;38:15–49. doi: 10.1007/s10915-008-9222-8. [DOI] [Google Scholar]
  • 3.Adjerid S, Baccouch M. The discontinuous Galerkin method for two-dimensional hyperbolic problems. Part I: superconvergence error analysis. J. Sci. Comput. 2007;33:75–113. doi: 10.1007/s10915-007-9144-x. [DOI] [Google Scholar]
  • 4.Baccouch M. Recovery-based error estimator for the discontinuous Galerkin method for nonlinear scalar conservation laws in one space dimension. J. Sci. Comput. 2016;66:459–476. doi: 10.1007/s10915-015-0030-7. [DOI] [Google Scholar]
  • 5.Bramble JH, Schatz AH. Higher order local accuracy by averaging in the finite element method. Math. Comput. 1977;31:94–111. doi: 10.1090/S0025-5718-1977-0431744-9. [DOI] [Google Scholar]
  • 6.Brenner, S.C., Scott, L.R.: The mathematical theory of finite element methods, vol. 15 of Texts in Applied Mathematics, 3rd edn. Springer, New York (2007)
  • 7.Cao W, Zhang Z, Zou Q. Superconvergence of discontinuous Galerkin methods for linear hyperbolic equations. SIAM J. Numer. Anal. 2014;52:2555–2573. doi: 10.1137/130946873. [DOI] [Google Scholar]
  • 8.Castillo P, Cockburn B, Schötzau D, Schwab C. Optimal a priori error estimates for the hp-version of the local discontinuous Galerkin method for convection-diffusion problems. Math. Comput. 2002;71:455–478. doi: 10.1090/S0025-5718-01-01317-5. [DOI] [Google Scholar]
  • 9.Cheng Y, Shu C-W. Superconvergence of discontinuous Galerkin and local discontinuous Galerkin schemes for linear hyperbolic and convection-diffusion equations in one space dimension. SIAM J. Numer. Anal. 2010;47:4044–4072. doi: 10.1137/090747701. [DOI] [Google Scholar]
  • 10.Ciarlet PG. The finite element method for elliptic problems. Amsterdam: North-Holland; 1978. [Google Scholar]
  • 11.Cockburn B, Luskin M, Shu C-W, Süli E. Enhanced accuracy by post-processing for finite element methods for hyperbolic equations. Math. Comput. 2003;72:577–606. doi: 10.1090/S0025-5718-02-01464-3. [DOI] [Google Scholar]
  • 12.Cockburn B, Shu C-W. TVB Runge-Kutta local projection discontinuous Galerkin finite element method for conservation laws II: General framework. Math. Comput. 1989;52:411–435. [Google Scholar]
  • 13.Curtis S, Kirby RM, Ryan JK, Shu C-W. Postprocessing for the discontinuous Galerkin method over nonuniform meshes. SIAM J. Sci. Comput. 2007;30:272–289. doi: 10.1137/070681284. [DOI] [Google Scholar]
  • 14.Floater M, Lyche T. Two chain rules for divided differences and Faà di Bruno’s formula. Math. Comput. 2007;76:867–877. doi: 10.1090/S0025-5718-06-01916-8. [DOI] [Google Scholar]
  • 15.Ji L, Xu Y, Ryan JK. Accuracy-enhancement of discontinuous Galerkin solutions for convection-diffusion equations in multiple-dimensions. Math. Comput. 2012;81:1929–1950. doi: 10.1090/S0025-5718-2012-02586-5. [DOI] [Google Scholar]
  • 16.Ji L, Xu Y, Ryan JK. Negative-order norm estimates for nonlinear hyperbolic conservation laws. J. Sci. Comput. 2013;54:531–548. doi: 10.1007/s10915-012-9668-6. [DOI] [Google Scholar]
  • 17.Li X, Ryan JK, Kirby RM, Vuik C. Smoothness-Increasing Accuracy-Conserving filters for derivative approximations of discontinuous Galerkin solutions over nonuniform meshes and near boundaries. J. Comput. Appl. Math. 2016;294:275–296. doi: 10.1016/j.cam.2015.08.011. [DOI] [Google Scholar]
  • 18.Meng X, Shu C-W, Zhang Q, Wu B. Superconvergence of discontinuous Galerkin methods for scalar nonlinear conservation laws in one space dimension. SIAM J. Numer. Anal. 2012;50:2336–2356. doi: 10.1137/110857635. [DOI] [Google Scholar]
  • 19.Mirzaee H, Ji L, Ryan JK, Kirby RM. Smoothness-Increasing Accuracy-Conserving (SIAC) postprocessing for discontinuous Galerkin solutions over structured triangular meshes. SIAM J. Numer. Anal. 2011;49:1899–1920. doi: 10.1137/110830678. [DOI] [Google Scholar]
  • 20.Ryan JK, Shu C-W. On a one-sided post-processing technique for the discontinuous Galerkin methods. Methods Appl. Anal. 2003;10:295–308. [Google Scholar]
  • 21.Shu, C.-W.: Discontinuous Galerkin method for time dependent problems: survey and recent developments. Recent Developments in Discontinuous Galerkin Finite Element Methods for Partial Differential Equations (2012 John H. Barrett Memorial Lectures). In: Feng, X., Karakashian, O., Xing, Y. (Eds.) The IMA Volumes in Mathematics and Its Applications, vol. 157, pp. 25–62. Springer, Switzerland (2014)
  • 22.Steffen M, Curtis S, Kirby RM, Ryan JK. Investigation of smoothness-increasing accuracy-conserving filters for improving streamline integration through discontinuous fields. IEEE TVCG. 2008;14:680–692. doi: 10.1109/TVCG.2008.9. [DOI] [PubMed] [Google Scholar]
  • 23.Wang H, Shu C-W, Zhang Q. Stability and error estimates of local discontinuous Galerkin methods with implicit-explicit time-marching for advection-diffusion problems. SIAM J. Numer. Anal. 2015;53:206–227. doi: 10.1137/140956750. [DOI] [Google Scholar]
  • 24.Yang Y, Shu C-W. Analysis of optimal superconvergence of discontinuous Galerkin method for linear hyperbolic equations. SIAM J. Numer. Anal. 2012;50:3110–3133. doi: 10.1137/110857647. [DOI] [Google Scholar]
  • 25.Zhang Q, Shu C-W. Stability analysis and a priori error estimates to the third order explicit Runge-Kutta discontinuous Galerkin method for scalar conservation laws. SIAM J. Numer. Anal. 2010;48:1038–1063. doi: 10.1137/090771363. [DOI] [Google Scholar]

Articles from Numerische Mathematik are provided here courtesy of Springer

RESOURCES