Error Estimates and Adaptivity of the Space-Time Discontinuous Galerkin Method for Solving the Richards Equation

Vít Dolejší; Hyun-Geun Shin; Miloslav Vlasák

doi:10.1007/s10915-024-02650-x

. 2024 Aug 20;101(1):11. doi: 10.1007/s10915-024-02650-x

Error Estimates and Adaptivity of the Space-Time Discontinuous Galerkin Method for Solving the Richards Equation

Vít Dolejší ^1,^✉, Hyun-Geun Shin ¹, Miloslav Vlasák ²

PMCID: PMC11415456 PMID: 39309293

Abstract

We present a higher-order space-time adaptive method for the numerical solution of the Richards equation that describes a flow motion through variably saturated media. The discretization is based on the space-time discontinuous Galerkin method, which provides high stability and accuracy and can naturally handle varying meshes. We derive reliable and efficient a posteriori error estimates in the residual-based norm. The estimates use well-balanced spatial and temporal flux reconstructions which are constructed locally over space-time elements or space-time patches. The accuracy of the estimates is verified by numerical experiments. Moreover, we develop the hp-adaptive method and demonstrate its efficiency and usefulness on a practically relevant example.

Keywords: Space-time discontinuous Galerkin method, Richards equation, A posteriori error estimate, hp-mesh adaptation

Introduction

Fluid flows in variably saturated porous media are usually described by the Richards equation [33], which is expressed in the form

\begin{matrix} \partial_{t} ϑ (ψ) - \nabla \cdot (K (θ (ψ)) (\nabla ψ + \nabla z)) = g, \end{matrix}

where $\partial_{t}$ denotes the derivative with respect to time, $\nabla \cdot$ and $\nabla$ are the divergence and gradient operators, respectively, $ψ$ is the sought pressure head (= normalized pressure), z is the vertical coordinate, $θ$ is the water content function, $K$ is the hydraulic conductivity tensor and g is the source term. In addition, the active pore volume $ϑ$ is related to $θ$ by the following relation

\begin{matrix} ϑ (ψ) : = θ (ψ) + \frac{S_{s}}{θ_{s}} \int_{- \infty}^{ψ} θ (s) d s, \end{matrix}

where $S_{s}, θ_{s} \geq 0$ are material parameters. The hydraulic conductivity satisfies $K (ψ) = K_{s} K_{r} (ψ)$ , where $K_{s}$ is the saturated conductivity tensor, and $K_{r} \in [0, 1]$ is the relative saturation. The functions $θ$ and $K_{r}$ are given by constitutive relations, e.g., by van Genuchten’s law [27] and by Mualem’s law [31], respectively.

The Richards equation belongs to the nonlinear parabolic problems, and it can degenerate, in particular $K \to 0$ or $\frac{d ϑ}{d ψ} \to 0$ . Due to the degeneracy, the numerical solution is challenging, and various techniques have been developed for its solution in the last decades, see [25] for a survey.

In [14], we presented the adaptive space-time discontinuous Galerkin (STDG) method for the numerical solution of (1). This technique is based on a piecewise polynomial discontinuous approximation with respect to both the spatial and temporal coordinates. The resulting scheme is sufficiently stable, provides high accuracy, and is suitable for the hp-mesh adaptation. This is an important property, since the weak solution of the Richards equation is (only) piecewise regular and exhibits singularities along the material interfaces and the unsaturated/saturated zone (when $ψ \approx 0$ ). Therefore, an adaptive method that allows different meshes at different time levels, can achieve an accurate approximation with a relatively small number of degrees of freedom.

The numerical experiments presented in [14] showed the potential of the adaptive STDG method. However, the mesh adaptation used is based on interpolation error estimates that do not guarantee an upper error bound. The aim of this work is to overcome this bottleneck, derive a posteriori error estimates, and use them in the hp-mesh adaptation framework.

A posteriori error estimates for the numerical solution of the Richards equation have been treated in many papers for different numerical methods. We mention the finite volume framework with multistep time discretization in [5], the mixed finite element method in [6], the two-point finite volume discretization in [8], the lowest-order discretization on polytopal meshes in [38], finite element techniques in [30] and the references cited therein.

Guaranteed error estimates without unknown constants are usually obtained by measuring the error in a dual norm of the residual. Introducing reconstructed fluxes from the space $H^{1} (div, Ω)$ , the upper bound can then be obtained directly. In [18], we developed this approach to the higher-order STDG method for nonlinear parabolic problems, where the temporal discontinuities were treated by temporal flux reconstructions considering the time jumps.

In this paper, we extend the approach [18] to the Richards equation (1). Although the definition of the temporal and spatial flux reconstructions as well as the derivation of the upper bounds is straightforward, the proof of the lower bound (efficiency) is rather tricky since the term $θ (ψ)$ in the time derivative is not a polynomial function for a polynomial $ψ$ . In contrary to [18], the proof of efficiency requires the additional oscillatory data terms. We construct spatial fluxes by solving local Neumann problems defined on space-time patches that generalize the approach from [22]. Moreover, we provide numerical experiments verifying derived error estimates. Compared to [18], the resulting effectivity indices are much closer to one. This is the first novelty of this paper.

Secondly, we deal with the errors arising due to iterative solution of nonlinear algebraic systems. We introduce a cheap stopping criterion for iterative solvers and justify it by numerical experiments. Thirdly, we introduce a space-time adaptive algorithm that employs the anisotropic hp-mesh adaptation technique [15]. The algorithm admits local adaptation of size and shape of mesh elements and the local adaptation of degrees of polynomial approximation with respect to space. However, the size of the time step can vary globally, and the degree of polynomial approximation with respect to time is fixed. Using the equidistribution principle, the algorithm gives an approximate solution with the error estimate under the given tolerance. The performance of the adaptive algorithm is demonstrated numerically, including a practically relevant example.

The rest of the paper is organized as follows. In Sect. 2, we introduce the problem considered, its STDG discretization is briefly described in Sect. 3. The main theoretical results are derived in Sect. 4, where the upper and lower bounds are proved. Two possible spatial reconstructions are discussed in Sect. 5 together with the stopping criteria of iterative solvers. The numerical verification of the error estimates is given in Sect. 6. Furthermore, we present the resulting hp-mesh adaptation algorithm in Sect. 7 and demonstrate its performance by numerical examples. Finally, we conclude with some remarks in Sect. 8.

Problem Formulation

Let $Ω \subset R^{d}$ ( $d = 2, 3$ ) be the domain occupied by a porous medium and $T > 0$ the physical time to be reached. For simplicity, we assume that $Ω$ is polygonal. By $Γ : = \partial Ω$ , we denote the boundary of $Ω$ which consists of two disjoint parts: the Dirichlet boundary $Γ_{D}$ and the Neumann boundary $Γ_{N}$ . We write the Richards equation (1) in a different form, which is more suitable for the analysis. We seek a function $u = u (x, t) : Ω \times (0, T) \to R$ , which represents a hydraulic head (with the physical unit $L$ ). The quantity $u$ is related to the pressure head $ψ$ by $u = ψ + z$ . The Richards equation (1) reads

\begin{matrix} \partial_{t} ϑ (u) - \nabla \cdot (K (u) \nabla u) = g in Ω \times (0, T) \\ u = u_{D} on Γ_{D} \times (0, T) \\ K (u) \nabla u \cdot n = g_{N} on Γ_{N} \times (0, T), \\ u (x, 0) = u_{0} in Ω, \end{matrix}

where $g : Ω \times (0, T) \to R$ represents a source term if g is positive or a sink term if g is negative, $ϑ : R \to R$ denotes the dimensionless active pore volume, and $K : R \to R^{d \times d}$ is the hydraulic conductivity with the physical unit $L \cdot T^{- 1}$ (L = length, T = time). Moreover, $u_{D}$ is a trace of a function $u^{*} \in L^{2} (0, T ; H^{1} (Ω))$ on $Γ_{D} \times (0, T)$ , $g_{N} \in L^{2} (0, T ; L^{2} (Γ_{N}))$ and $u_{0} \in L^{2} (Ω)$ . We note that with respect to (1), we should write $ϑ = ϑ (u - z)$ and $K = K (θ (u - z))$ , however, we skip this notation for simplicity. We assume that the function $ϑ (u)$ is non-negative, non-decreasing and Lipschitz continuous. Moreover, the tensor $K (u)$ is symmetric, positively semi-definite, and Lipschitz continuous.

In order to introduce the weak solution, we set $H (div, Ω) = {v \in L^{2} {(Ω)}^{d} : \nabla \cdot v \in L^{2} (Ω)}$ and define the spaces

\begin{matrix} X & = L^{2} (0, T, H^{1} (Ω)), & V = {v \in X : v |_{Γ_{D}} = 0}, \\ Y & = {v \in X : ϑ^{'} (v) \in L^{2} (0, T, L^{2} (Ω))}, & Y^{0} = {v \in Y : v (0) = u_{0}}, \end{matrix}

where $ϑ^{'} (u) = \partial_{t} ϑ (u) = \frac{d ϑ}{d u} \partial_{t} u$ denotes the time derivative (in the weak sense). Obviously, if $v \in Y$ then $ϑ (v) \in C ([0, T], L^{2} (Ω))$ . In order to shorten the notation, we set the physical flux

\begin{matrix} σ (u, \nabla u) : = K (u) \nabla u, u \in X . \end{matrix}

Definition 1

We say that $u \in Y$ is the weak solution of (3) if $u - u^{*} \in V$ and

\begin{matrix} \int_{0}^{T} ({(ϑ^{'} (u), v)}_{Ω} + {(σ (u, \nabla u), \nabla v)}_{Ω} - {(g, v)}_{Ω} - (g_{N}, v)_{Γ_{N}}) d t = 0 \forall v \in V, \end{matrix}

where $(u, v)_{Ω} : = \int_{Ω} u v d x$ and $(u, v)_{Γ_{N}} : = \int_{Γ_{N}} u v d S$ .

The existence and uniqueness of the Richards equation is studied in [2], see also the later works [3, 28].

Space-time discretization

We briefly describe the discretization of (6) by the space-time discontinuous Galerkin (STDG) method, for more details, see [13, 14]. Let $0 = t_{0} < t_{1} < \dots < t_{r} = T$ be a partition of the time interval (0, T) and set $I_{m} = (t_{m - 1}, t_{m})$ and $τ_{m} = t_{m} - t_{m - 1}$ . For each $m = 0, \dots, r$ , we consider a simplicial mesh $T_{h}^{m}$ covering $\bar{Ω}$ . For simplicity, we assume that $T_{h}^{m}$ , $m = 0, \dots, r$ are conforming, i.e., neighbouring elements share an entire edge or face. However, this assumption can be relaxed by the technique from [12].

For each element $K \in T_{h}^{m}$ , we denote by $\partial K$ its boundary, $n_{K}$ its unit outer normal and $h_{K} = diam (K)$ its diameter. In order to shorten the notation, we write ${\partial K}_{N} : = \partial K \cap Γ_{N}$ . By the generic symbol $γ$ , we denote an edge ( $d = 2$ ) or a face ( $d = 3$ ) of $K \in T_{h}^{m}$ and $h_{γ}$ denotes its diameter. In the following, we speak only about edges, but we mean faces for $d = 3$ . We assume that

$T_{h}^{m}$ , $m = 0, \dots, r$ are shape regular, i.e., $h_{K} / ρ_{K} \leq C$ for all $K \in T_{h}$ , where $ρ_{K}$ is the radius of the largest d-dimensional ball inscribed in K and constant C does not depend on $T_{h}^{m}$ for $h \in (0, h_{0})$ , $m = 0, \dots, r$ .
$T_{h}^{m}$ , $m = 0, \dots, r$ are locally quasi-uniform, i.e., $h_{K} \leq C h_{K^{'}}$ for any pair of two neighbouring elements K and $K^{'}$ , where the constant C does not depend on $h \in (0, h_{0})$ , $m = 0, \dots, r$ .

Let $p_{K} \geq 1$ be an integer denoting the degree of polynomial approximation on $K \in T_{h}^{m}$ , $m = 0, \dots, r$ and $P_{p_{K}} (K)$ be the corresponding space of polynomial functions on K. Let

\begin{matrix} S_{h p, m} = {v \in L^{2} (Ω) : v |_{K} \in P_{p_{K}} (K), K \in T_{h}^{m}}, m = 0, \dots, r \end{matrix}

denote the spaces of discontinuous piecewise polynomial functions on $T_{h}^{m}$ with possibly varying polynomial approximation degrees. Furthermore, we consider the space of space-time discontinuous piecewise polynomial functions

\begin{matrix} S_{hp}^{τ q} = {v \in L^{2} (Ω \times (0, T)) : v |_{I_{m}} \in P_{q} (I_{m}, S_{h p, m}), m = 1, \dots, r}, \end{matrix}

where $q \geq 0$ denotes the time polynomial approximation degree and $P_{q} (I_{m}, S_{h p, m})$ is the Bochner space, i.e., $v \in P_{q} (I_{m}, S_{h p, m})$ can be written as $v (x, t) = \sum_{j = 0}^{q} t^{j} v_{j} (x)$ , $v_{j} \in S_{h p, m}$ , $j = 0, \dots, q$ .

For $v \in S_{hp}^{τ q}$ , we define the one-sided limits and time jumps by

\begin{matrix} v_{+}^{m} = lim_{t \to t_{m}^{+}} v (t), m = 0, \dots, r - 1, v_{-}^{m} = lim_{t \to t_{m}^{-}} v (t), m = 1, \dots, r, \\ {v}_{m} = v_{+}^{m} - v_{-}^{m}, m = 1, \dots, r - 1, v_{-}^{0} = ϑ (u_{0}), {v}_{0} = v_{+}^{0} - ϑ (u_{0}), \end{matrix}

where $u_{0}$ is the initial condition. In the following, we use the notation

\begin{matrix} (u, v)_{M} & = \int_{M} u v d x, (u, v)_{M, m} = \int_{M \times I_{m}} u v d x d t, m = 1, \dots, r, \end{matrix}

where M is either element $K \in T_{h}^{m}$ or its (part of) boundary $\partial K$ . The corresponding norms are denoted by ${∥\cdot∥}_{M}$ and ${∥\cdot∥}_{M, m}$ , respectively. By $\sum_{K, m} = \sum_{m = 1}^{r} \sum_{K \in T_{h}^{m}}$ , we denote the sum over all space-time elements $K \times I_{m}$ , where $K \in T_{h}^{m}$ and $m = 1, \dots, r$ .

Moreover, we define the jumps and mean values of $v \in S_{h p, m}$ on edges $γ \subset \partial K, K \in T_{h}^{m}$ by

\begin{matrix} [v] = \{\begin{matrix} (v^{(+)} - v^{(-)}) n_{K} & for γ \in Ω, \\ (v^{(+)} - u_{D}) n_{K} & for γ \subset Γ_{D}, \\ 0 & for γ \subset Γ_{N}, \end{matrix}) 〈v〉 = \{\begin{matrix} (v^{(+)} + v^{(-)}) / 2 & for γ \in Ω, \\ v^{(+)} & for γ \subset Γ_{D}, \\ 0 & for γ \subset Γ_{N}, \end{matrix}) \end{matrix}

where $v^{(+)}$ and $v^{(-)}$ denote the traces of v on $\partial K$ from interior and exterior of K, respectively, and $u_{D}$ comes from the Dirichlet boundary condition. For vector-valued $v \in {[S_{h p, m}]}^{d}$ , we set $[v] = (v^{(+)} - v^{(-)}) \cdot n_{K}$ for $γ \in Ω$ and similarly for boundary edges.

For each space-time element $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ , we define the forms

\begin{matrix} a_{K, m} (u, v) & : = (K (u) \nabla u, \nabla v)_{K, m} - (g, v)_{K, m} - (g_{N}, v)_{{\partial K}_{N}, m}, \\ A_{K, m} (u, v) & : = (K (u) \nabla u, \nabla v)_{K, m} - (〈K (u) \nabla u〉 \cdot n_{K} - α [u] \cdot n_{K}, v)_{\partial K \ Γ_{N}, m} \\ + (β - \frac{1}{2}) (K (u) [u], \nabla v)_{\partial K \ Γ, m} + (2 β - 1) (K (u) [u], \nabla v)_{\partial K \cap Γ_{D}, m} \\ - (g, v)_{K, m} - (g_{N}, v)_{{\partial K}_{N}, m}, \end{matrix}

where $α > 0$ is a sufficiently large penalization parameter ( $α \sim p_{K}^{2} / h_{K}$ ) and $β \in {0, \frac{1}{2}, 1}$ corresponds to the choice of the variants of the interior penalty discretization (SIPG with $β = 0$ , IIPG with $β = 1 / 2$ and NIPG with $β = 1$ ), see, e.g., [13, Chapter 2].

We introduce the space-time discontinuous Galerkin discretization of (3).

Definition 2

The function $u_{h}^{τ} \in S_{hp}^{τ q}$ is called the approximate solution of (6) obtained by the space-time discontinuous Galerkin method (STDGM), if

\begin{matrix} \sum_{K, m} B_{K, m} (u_{h}^{τ}, v) = 0 \forall v \in S_{hp}^{τ q}, \end{matrix}

where

\begin{matrix} B_{K, m} (u, v) : = (ϑ^{'} (u), v)_{K, m} + A_{K, m} (u, v) + ({ϑ (u)}_{m - 1}, v_{+}^{m - 1})_{K} \end{matrix}

with form $A_{K, m}$ given by (12) and ${\cdot}$ defined by (9).

Remark 1

We note that $u_{h}^{τ}$ is discontinuous with respect to time at $t_{m}, m = 1, \dots, r - 1$ . The solution between $I_{m - 1}$ and $I_{m}$ is stuck together by the “time-penalty” term $({ϑ (u)}_{m - 1}, v_{+}^{m - 1})_{K}$ which also makes sense for u and v belonging to different finite element spaces.

Finally, we derive some identities that will be used later. Let $F_{h}^{m}$ denote the set of all interior edges $γ ⊄ Γ$ of mesh $T_{h}^{m}$ and $F_{D}^{m}$ the set of boundary edges of $T_{h}^{m}$ lying on $Γ_{D}$ . Then, the identity

\begin{matrix} \sum_{K \in T_{h}^{m}} (w, z n_{K})_{\partial K \ Γ_{N}, m} = \sum_{γ \in F_{h}^{m}} ((〈w〉, [z])_{γ, m} + ([w], 〈z〉)_{γ, m}) + \sum_{γ \in F_{D}^{m}} (w \cdot n_{K}, z)_{γ, m} \end{matrix}

holds for a piecewise smooth vector-valued function w and a piecewise smooth scalar function z.

Using identity (15) and the following obvious formulas valid for interior edges $〈〈K (u) \nabla u〉〉 = 〈K (u) \nabla u〉$ , $〈α [u]〉 = α [u]$ , $[〈K (u) \nabla u〉] = 0$ , $[α [u]] = 0$ , we gain

\begin{matrix} \sum_{K \in T_{h}^{m}} (〈K (u) \nabla u〉 \cdot n_{K}, v)_{\partial K \ Γ_{N}, m} & = \sum_{γ \in F_{h}^{m}} (〈K (u) \nabla u〉, [v])_{γ, m} + \sum_{γ \in F_{D}^{m}} (K (u) \nabla u \cdot n_{K}, v)_{γ, m}, \\ \sum_{K \in T_{h}^{m}} (α [u] \cdot n_{K}, v)_{\partial K \ Γ_{N}, m} & = \sum_{γ \in F_{h}^{m}} (α [u], [v])_{γ, m} + \sum_{γ \in F_{D}^{m}} (α [u] \cdot n_{K}, v)_{γ, m}, \\ \sum_{K \in T_{h}^{m}} (K (u) [u], \nabla v)_{\partial K \ Γ, m} & = \sum_{K \in T_{h}^{m}} ([u], K (u) \nabla v)_{\partial K \ Γ, m} = 2 \sum_{γ \in F_{h}^{m}} ([u], 〈K (u) \nabla v〉)_{γ, m}, \\ \sum_{K \in T_{h}^{m}} (K (u) [u], \nabla v)_{\partial K \cap Γ_{D}, m} & = \sum_{γ \in F_{D}^{m}} ([u], K (u) \nabla v)_{γ, m} . \end{matrix}

Consequently, from (12) and (16), we obtain the identity

\begin{matrix} \sum_{K \in T_{h}^{m}} A_{K, m} (u, v) & = \sum_{K \in T_{h}^{m}} (K (u) \nabla u, \nabla v)_{K, m} - \sum_{γ \in F_{h}^{m}} (〈K (u) \nabla u〉, [v])_{γ, m} \\ + (2 β - 1) \sum_{γ \in F_{h}^{m}} ([u], 〈K (u) \nabla v〉)_{γ, m} - \sum_{γ \in F_{D}^{m}} (K (u) \nabla u \cdot n_{K}, v)_{γ, m} \\ + (2 β - 1) \sum_{γ \in F_{D}^{m}} ([u], K (u) \nabla v)_{γ, m} + \sum_{γ \in F_{h}^{m}} (α [u], [v])_{γ, m} \\ + \sum_{γ \in F_{D}^{m}} (α [u] \cdot n_{K}, v)_{γ, m} - (g, v)_{Ω, m} - (g_{N}, v)_{Γ_{N}, m} . \end{matrix}

A Posteriori Error Analysis

Error Measures

In order to proceed to the derivation of error estimators, we define the spaces of piecewise continuous functions with respect to time by

\begin{matrix} Y^{τ} & = {v \in X : ϑ^{'} (v) |_{I_{m}} \in L^{2} (I_{m}, L^{2} (Ω))}, V^{τ} = {v \in Y^{τ} : v |_{Γ_{D} \times (0, T)} = 0} . \end{matrix}

Obviously, $Y^{0} \subset Y \subset Y^{τ} \subset X$ and $S_{hp}^{τ q} \subset Y^{τ}$ . Moreover, we have the following result.

Lemma 1

Let $u \in Y^{0}$ be the weak solution of (6). Then it satisfies

\begin{matrix} \sum_{K, m} b_{K, m} (u, v) & = 0 \forall v \in V^{τ}, \end{matrix}

where

\begin{matrix} b_{K, m} (u, v) : = (ϑ^{'} (u), v)_{K, m} + a_{K, m} (u, v) + ({ϑ (u)}_{m - 1}, v_{+}^{m - 1})_{K} \end{matrix}

with $a_{K, m}$ given by (12) and the time jump ${\cdot}_{m - 1}$ defined by (9). Moreover, there exists a unique solution $u \in Y^{τ}$ such that $u - u^{*} \in V^{τ}$ and satisfies (19).

Proof

The proof follows directly by comparing formulas (19)–(20) with (6) and the fact that $({ϑ (u)}_{m - 1}, v_{+}^{m - 1})_{K} = 0$ for $u \in Y^{0}$ . For the proof of uniqueness, we employ the fact that $C_{0}^{\infty} (Ω)$ is dense in $L^{2} (Ω)$ , i.e., there exists a sequence ${v_{ε}} \subset C_{0}^{\infty} (Ω)$ for any $v \in L^{2} (Ω)$ such that $‖ v_{ε} - v ‖ \to 0$ as $ε \to 0$ , cf. [34, Theorem 3.14]. We apply $v = v_{s, ε_{1}} (x) v_{t, ε_{2}} (t)$ in (19), where the spatial component $v_{s, ε_{1}} \in {v \in H^{1} (Ω) : v |_{Γ_{D}} = 0}$ tends to ${ϑ (u)}_{m - 1}$ as $ε_{1} \to 0$ and the time component $v_{t, ε_{2}}$ is given as 0 outside the interval $(t_{m - 1}, t_{m - 1} + ε_{2})$ and $v_{t, ε_{2}} = 1 - (t - t_{m - 1}) / ε_{2}$ on $(t_{m - 1}, t_{m - 1} + ε_{2})$ , i.e., $v_{t, ε_{2}} (t)$ tends to 0 as $ε_{2} \to 0$ . Therefore, all the terms containing time integrals in (19) tend to 0 when $ε_{2}$ tends to 0. Since $v_{+}^{m - 1} = v_{s, ε_{1}}$ , the remaining jump term tends to ${‖ {ϑ (u)}}_{m - 1} ‖^{2}$ as $ε_{1}$ tends to 0. From this it follows that ${ϑ (u)}_{m - 1} = 0$ . Then it is possible to see that any solution of (19) satisfies the original weak formulation (6). Since the weak problem (6) has a unique solution, cf. [2], the extended problem (19) has a unique solution as well. $□$

In virtue of [11, § 2.3.1], we define a parameter $d_{K, m}$ associated with the space-time element $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ . The parameter $d_{K, m}$ represents a user-dependent weight, typically with physical units ${(T L)}^{1 / 2}$ so that the error measure has the same physical unit as the energy norm. In this paper, we use two choices

\begin{matrix} d_{K, m} & : = {(h_{K}^{- 2} {∥K (u_{h})∥}_{m, \infty} + τ_{m}^{- 2} T {∥\frac{d ϑ}{d u}, (u_{h})∥}_{m, \infty})}^{- 1 / 2}, \end{matrix}

21a

\begin{matrix} d_{K, m} & : = {(h_{K}^{2} {∥K (u_{h})∥}_{m, \infty}^{- 1} + τ_{m}^{2} / T {∥\frac{d ϑ}{d u}, (u_{h})∥}_{m, \infty}^{- 1})}^{1 / 2} . \end{matrix}

21b

where ${∥\cdot∥}_{m, \infty} : = {∥\cdot∥}_{L^{\infty} (Ω \times I_{m})}$ . We note that the following error analysis is independent of the choice of $d_{K, m}$ . Moreover, we define the norm in the space $V^{τ}$ (cf. (18)) by

\begin{matrix} {∥v∥}_{V^{τ}}^{2} = \sum_{K, m} {∥v∥}_{V_{K, m}}^{2}, {∥v∥}_{V_{K, m}}^{2} = d_{K, m}^{- 2} (h_{K}^{2} {∥\nabla, v∥}_{K, m}^{2} + τ_{m}^{2} {∥v^{'}∥}_{K, m}^{2}) . \end{matrix}

In virtue of (19), we introduce the error measure as a dual norm of the residual

\begin{matrix} R (u_{h}^{τ}) = sup_{0 \neq v \in V^{τ}} \frac{\sum_{K, m} b_{K, m} (u_{h}^{τ}, v)}{{∥v∥}_{V^{τ}}}, \end{matrix}

where $b_{K, m}$ is given by (20). The residual $R (v)$ represents a natural error measure for $u - v \in V^{τ}$ , cf. [11, Remark 2.3]. In Sect. 4, we estimate $R (u_{h}^{τ})$ for $u_{h}^{τ}$ being the solution of (13).

Since the approximate solution $u_{h}^{τ}$ belongs to the space of discontinuous function $S_{hp}^{τ q} ⊄ V^{τ}$ , we introduce the second building block measuring the nonconformity of the solution in spatial variables. Therefore, similarly to [18], we define the form

\begin{matrix} J (v) = \sum_{K, m} J_{K, m} (v), J_{K, m} (v) = d_{K, m}^{2} τ_{m}^{- 1} h_{K}^{- 2} C_{K, m, K, α} {∥[v]∥}_{\partial K, m}^{2}, \end{matrix}

where $C_{K, m, K, α} = α^{2} + {∥K (u_{h}^{τ})∥}_{L^{\infty} (K \times I_{m})}^{2}$ . The scaling factors are chosen such that $J {(v)}^{1 / 2}$ has the same physical unit as $R (u_{h}^{τ})$ .

We note that $J (v)$ measures also the violation of the Dirichlet boundary condition since $J (v)$ contains the term ${∥v - u_{D}∥}_{\partial K \cap Γ_{D}, m}$ , cf. (11).

The final error measure is then defined by

\begin{matrix} E (u_{h}^{τ}) : = {(R {(u_{h}^{τ})}^{2} + J (u_{h}^{τ}))}^{1 / 2}, \end{matrix}

where $R (u_{h}^{τ})$ is given by (23) and $J (u_{h}^{τ})$ by (24).

Lemma 2

The error measure $E (u_{h}^{τ}) = 0$ if and only if $u_{h}^{τ} = u$ is the weak solution given by (6).

Proof

Obviously, if $u_{h}^{τ} = u$ , then $J (u_{h}^{τ}) = 0$ and $R (u_{h}^{τ}) = 0$ due to (19). On the other hand, if $J (u_{h}^{τ}) = 0$ , then $u_{h}^{τ} \in Y^{τ}$ and $u_{h}^{τ} - u^{*} \in V^{τ}$ . Moreover, $R (u_{h}^{τ}) = 0$ and the uniqueness of (19) imply that $u_{h}^{τ}$ is the weak solution (6). $□$

Temporal and Spatial Flux Reconstructions

Similarly as in [18], we define a temporal reconstruction $R_{h}^{τ} = R_{h}^{τ} (x, t)$ as a continuous function with respect to time that mimics $ϑ (u_{h}^{τ})$ , $u_{h}^{τ} \in S_{hp}^{τ q}$ . Let $r_{m} \in P_{q + 1} (I_{m})$ be the right Radau polynomial on $I_{m}$ , i.e., $r_{m} (t_{m - 1}) = 1$ and $r_{m} (t_{m}) = 0$ , and $r_{m}$ is orthogonal to $P_{q - 1} (I_{m})$ with respect to the $L^{2} (I_{m})$ inner product. Then we set

\begin{matrix} R_{h}^{τ} (x, t) : = ϑ (u_{h}^{τ} (x, t)) - {ϑ (u_{h}^{τ})}_{m - 1} (x) r_{m} (t), x \in Ω, t \in I_{m}, \end{matrix}

where ${\cdot}$ is given by (9). The temporal flux reconstruction $R_{h}^{τ} (x, t)$ is continuous in time, namely $R_{h}^{τ} \in H^{1} (0, T, L^{2} (Ω))$ and it satisfies the initial condition due to

\begin{matrix} R_{h}^{τ} (\cdot, 0) & = ϑ (u_{h}^{τ} (\cdot, 0)) - {ϑ (u_{h}^{τ})}_{0} (\cdot) r_{1} (0) \\ = ϑ (u_{h}^{τ} (\cdot, 0)) - (ϑ (u_{h}^{τ} (\cdot, 0)) - ϑ (u_{0} (\cdot)) = ϑ (u_{0} (\cdot)) . \end{matrix}

Moreover, by the integration by parts and the properties $r_{m} (t_{m - 1}) = 1$ , $r_{m} (t_{m}) = 0$ , we obtain

\begin{matrix} ({(R_{h}^{τ} - ϑ (u_{h}^{τ}))}^{'}, v)_{K, m} & = - (r_{m}^{'} {ϑ (u_{h}^{τ})}_{m - 1}, v)_{K, m} \\ = (r_{m} {ϑ (u_{h}^{τ})}_{m - 1}, v^{'})_{K, m} - r_{m} (t_{m}) ({ϑ (u_{h}^{τ})}_{m - 1}, v_{-}^{m})_{K} \\ + r_{m} (t_{m - 1}) ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K} \\ = (r_{m} {ϑ (u_{h}^{τ})}_{m - 1}, v^{'})_{K, m} + ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K}, v \in V^{τ}, \end{matrix}

which together with definition (26) implies

\begin{matrix} ({(R_{h}^{τ} - ϑ (u_{h}^{τ}))}^{'}, v)_{K, m} - ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K} = - (R_{h}^{τ} - ϑ (u_{h}^{τ}), v^{'})_{K, m}, v \in V^{τ} . \end{matrix}

Finally, using the orthogonality of $r_{m}$ to $P_{q - 1} (I_{m})$ , we obtain from (28), the formula

\begin{matrix} {({(R_{h}^{τ} - ϑ (u_{h}^{τ}))}^{'}, v)}_{m, K} = {({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})}_{K} \forall v \in P_{q} (I_{m}, L^{2} (K)) . \end{matrix}

Consequently, if $u_{h}^{τ}$ is the approximate solution given by (13), then it satisfies

\begin{matrix} ({(R_{h}^{τ})}^{'}, v)_{K, m} & = (ϑ^{'} (u_{h}^{τ}), v)_{K, m} + ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K} = - A_{K, m} (u_{h}^{τ}, v) \\ \forall v \in P_{q} (I_{m}, P_{p_{K}} (K)) . \end{matrix}

Obviously, the reconstruction $R_{h}^{τ}$ is local and explicit, so its computation is fast and easy to implement.

The spatial flux reconstruction needs to define a function $σ_{h}^{τ} \in L^{2} (0, T, H (div, Ω))$ which mimics the flux $σ (u_{h}^{τ}, \nabla u_{h}^{τ}) = K (u_{h}^{τ}) \nabla u_{h}^{τ}$ , cf. (5). In particular, $σ_{h}^{τ} |_{K \times I_{m}} \in P_{q} (I_{m}, {RTN}_{p} (K))$ where

\begin{matrix} {RTN}_{p} (K) = P_{p} {(K)}^{d} + x P_{p} (K), K \in T_{h}, m = 1, \dots, r \end{matrix}

is the Raviart-Thomas-Nedelec finite elements, cf. [7] for more details. We assume that the reconstructed flux $σ_{h}^{τ}$ has to be equilibrated with the temporal flux $R_{h}^{τ}$

\begin{matrix} (\nabla \cdot σ_{h}^{τ}, v)_{K, m} = ({(R_{h}^{τ})}^{'} - g, v)_{K, m} \forall v \in P_{q} (I_{m}, P_{p_{K}} (K)), K \in T_{h}^{m}, \end{matrix}

and with the Neumann boundary condition

\begin{matrix} (σ_{h}^{τ} \cdot n, v)_{γ, m} = (g_{N}, v)_{γ, m} \forall v \in P_{q} (I_{m}, P_{p_{K}} (γ)) \forall γ \subset {\partial K}_{N}, K \in T_{h}^{m} . \end{matrix}

In Sect. 5 we present two possible constructions of $σ_{h}^{τ}$ including the choice of the spatial polynomial degree p in (32).

Auxiliary Results

In the forthcoming numerical analysis, we need several technical tools. We will employ the scaled space-time Poincarè inequality, cf. [11, Lemma 2.2]: Let $φ_{K, m} \in P_{0} (K \times I_{m})$ be the $L^{2}$ -orthogonal projection of $φ \in H^{1} (K \times I_{m})$ onto a constant in each space-time element $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 0, \dots, r$ . Then,

\begin{matrix} {∥φ - φ_{K, m}∥}_{K, m} \leq C_{P} {(h_{K}^{2} {∥\nabla, φ∥}_{K, m}^{2} + τ_{m}^{2} {∥φ^{'}∥}_{K, m}^{2})}^{1 / 2} = C_{P} d_{K, m} {∥φ∥}_{V_{K, m}}, \end{matrix}

where $C_{P}$ is the Poincarè constant equal to $1 / π$ for simplicial elements and the last equality follows from (22).

Moreover, we introduce the space-time trace inequality

Lemma 3

Let $φ_{γ, m} \in P_{0} (γ \times I_{m})$ be the $L^{2}$ -orthogonal projection of $φ \in H^{1} (K \times I_{m})$ onto a constant on each $γ \times I_{m}$ , where $γ \subset \partial K$ is an edge of $K \in T_{h}^{m}$ . Then there exists a constant $C_{T} > 0$ such that

\begin{matrix} {∥φ - φ_{γ, m}∥}_{γ \times I_{m}} \leq C_{T} max (1, h_{γ}^{- 1 / 2}) d_{K, m} {∥φ∥}_{V_{K, m}}, \end{matrix}

where $C_{T} = max (c_{T}, C_{P})$ , $C_{P}$ is from (35) and $c_{T} > 0$ is the constant from the (space) trace inequality.

Proof

The proof is straightforward, we present it for completeness. Let $φ \in H^{1} (K \times I_{m})$ and, for all $t \in I_{m}$ , set $\tilde{φ} (t) : = {| γ |}^{- 1} \int_{γ} φ (x, t) d S$ . Observing that $(φ - \tilde{φ})$ and $(\tilde{φ} - φ_{γ, m})$ are $L^{2} (γ \times I_{m})$ -orthogonal, we have

\begin{matrix} {∥φ - φ_{γ, m}∥}_{γ \times I_{m}}^{2} = {∥φ - \tilde{φ}∥}_{γ \times I_{m}}^{2} + {∥\tilde{φ} - φ_{γ, m}∥}_{γ \times I_{m}}^{2} . \end{matrix}

Using the standard trace inequality (e.g., [21, Lemma 3.32]), we have

\begin{matrix} {∥φ (\cdot, t) - \tilde{φ} (t)∥}_{γ} \leq c_{T} h_{γ}^{1 / 2} {∥\nabla, φ∥}_{K} \forall t \in I_{m}, \end{matrix}

where $c_{T} > 0$ is a constant whose values can be set relatively precisely, see the discussion in [37, Section 4.6]. Hence, integrating the square of (38) over $I_{m}$ and using the fact that $h_{γ} \leq h_{K}$ , $γ \subset h_{K}$ , we estimate the first term on the right-hand side of (37) as

\begin{matrix} {∥φ - \tilde{φ}∥}_{γ \times I_{m}}^{2} \leq c_{T}^{2} h_{γ} {∥\nabla, φ∥}_{K \times I_{m}}^{2} \leq c_{T}^{2} h_{γ}^{- 1} h_{K}^{2} {∥\nabla, φ∥}_{K \times I_{m}}^{2} . \end{matrix}

Using the fact that $φ_{γ, m} = τ_{m}^{- 1} \int_{I_{m}} \tilde{φ} (t) d t$ , the one-dimensional Poincarè inequality on $I_{n}$ and the Cauchy–Schwarz inequality yield

\begin{matrix} {∥\tilde{φ} - φ_{γ, m}∥}_{γ \times I_{m}}^{2} & = | γ | \int_{I_{m}} | \tilde{φ} - φ_{γ, m} |^{2} (t) d t \leq | γ | C_{P}^{2} τ_{m}^{2} \int_{I_{m}} {| \frac{d}{d t} \tilde{φ} (t) |}^{2} d t \\ = \frac{C_{P}^{2} τ_{m}^{2}}{| γ |} \int_{I_{m}} {(\int_{γ}, \partial_{t}, φ, (x, t), d x)}^{2} d t \leq C_{P}^{2} τ_{m}^{2} \int_{I_{m}} (\int_{γ}, {| \partial_{t} φ |}^{2}, d x) d t \\ = C_{P}^{2} τ_{m}^{2} {∥\partial_{t}, φ∥}_{γ \times I_{m}}^{2} . \end{matrix}

Collecting bounds (37), (39), (40) and the definition of the norm (22) yields (36). $□$

Reliability

We presented the upper bound of $R (u_{h}^{τ})$ , cf. (23).

Theorem 1

Let $u \in Y$ be the weak solution of (6) and $u_{h}^{τ} \in S_{hp}^{τ q}$ be the approximate solution given by (13). Let $R_{h}^{τ} \in H^{1} (0, T, L^{2} (Ω))$ be the temporal reconstruction given by (26) and $σ_{h}^{τ} \in L^{2} (0, T, H (div, Ω))$ be the spatial reconstruction satisfying (33). Then

\begin{matrix} R {(u_{h}^{τ})}^{2} & \leq η^{2} : = \sum_{K, m} η_{K, m}^{2}, η_{K, m} : = C_{P} η_{R, K, m} + {(η_{S, K, m}^{2} + η_{T, K, m}^{2})}^{1 / 2} + C_{T} η_{N, K, m}, \end{matrix}

where $C_{P}$ is the constant from Poincarè inequality (35), $C_{T}$ is the constant from the trace inequality (36) and the estimators $η_{R, K, m}$ , $η_{S, K, m}$ , $η_{T, K, m}$ , and $η_{N, K, m}$ are given by

\begin{matrix} η_{R, K, m} & : = d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g∥}_{K, m}, \end{matrix}

42a

\begin{matrix} η_{S, K, m} & : = \frac{d_{K, m}}{h_{K}} {∥σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m}, \end{matrix}

42b

\begin{matrix} η_{T, K, m} & : = \frac{d_{K, m}}{τ_{m}} {∥R_{h}^{τ} - ϑ (u_{h}^{τ})∥}_{K, m}, \end{matrix}

42c

\begin{matrix} η_{N, K, m} & : = \sum_{γ \subset {\partial K}_{N}} max (1, h_{γ}^{- 1 / 2}) d_{K, m} {∥σ_{h}^{τ} \cdot n - g_{N}∥}_{{\partial K}_{N}, m} . \end{matrix}

42d

The proof of Theorem 1 can be found in [19] for the case of the homogeneous Dirichlet boundary condition. For completeness, we present its modification including mixed Dirichlet-Neumann boundary conditions.

Proof

Starting from (20), adding the terms $\pm (R_{h}^{τ}, v)_{K, m}$ and $\pm (\nabla \cdot σ_{h}^{τ}, v)_{K, m}$ , and using the integration by parts, we obtain

\begin{matrix} \sum_{K, m} b_{K, m} (u_{h}^{τ}, v) \\ = \sum_{K, m} \{(ϑ^{'} (u_{h}^{τ}) - g, v)_{K, m} - (g_{N}, v)_{{\partial K}_{N}, m} + (σ (u_{h}^{τ}, \nabla u_{h}^{τ}), \nabla v)_{K, m} + ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K}\} \\ = \sum_{K, m} ({(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g, v)_{K, m} - \sum_{K, m} (σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ}), \nabla v)_{K, m} \\ - \sum_{K, m} \{({(R_{h}^{τ} - ϑ (u_{h}^{τ}))}^{'}, v)_{K, m} - ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K}\} + \sum_{K, m} (σ_{h}^{τ} \cdot n - g_{N}, v)_{{\partial K}_{N}, m} \\ = : ξ_{1} + ξ_{2} + ξ_{3} + ξ_{4} . \end{matrix}

The terms $ξ_{i}$ , $i = 1, \dots, 4$ are estimated separately.

Let $v_{K, m} \in P_{0} (K \times I_{m})$ be the piecewise constant projection of $v \in V^{τ}$ given by the identity $(v_{K, m}, 1)_{K, m} = (v, 1)_{K, m}$ . Using the Cauchy–Schwarz inequality, assumption (33), the Poincarè inequality (35), and (22), we have

\begin{matrix} | ξ_{1} | & \leq \sum_{K, m} | ({(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g, v)_{K, m} | = \sum_{K, m} | ({(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g, v - v_{K, m})_{K, m} | \\ \leq \sum_{K, m} C_{P} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g∥}_{K, m} {(h_{K}^{2} {∥\nabla, v∥}_{K, m}^{2} + τ_{m}^{2} {∥v^{'}∥}_{K, m}^{2})}^{1 / 2} \\ = \sum_{K, m} C_{P} d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g∥}_{K, m} {∥v∥}_{V_{K, m}} = \sum_{K, m} C_{P} η_{R, K, m} {∥v∥}_{V_{K, m}} . \end{matrix}

Furthermore, by the Cauchy–Schwarz inequality and (22), we obtain

\begin{matrix} | ξ_{2} | & \leq \sum_{K, m} | (σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ}), \nabla v)_{K, m} | \\ \leq \sum_{K, m} \frac{d_{K, m}}{h_{K}} {∥σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} \frac{h_{K}}{d_{K, m}} {∥\nabla, v∥}_{K, m} = \sum_{K, m} η_{S, K, m} \frac{h_{K}}{d_{K, m}} {∥\nabla, v∥}_{K, m} . \end{matrix}

The use of (29), and a similar manipulations as in (45), give

\begin{matrix} | ξ_{3} | & \leq \sum_{K, m} | ({(R_{h}^{τ} - ϑ (u_{h}^{τ}))}^{'}, v)_{K, m} - ({ϑ (u_{h}^{τ})}_{m - 1}, v_{+}^{m - 1})_{K} | = \sum_{K, m} | (R_{h}^{τ} - ϑ (u_{h}^{τ}), v^{'})_{K, m} | \\ \leq \sum_{K, m} \frac{d_{K, m}}{τ_{m}} {∥R_{h}^{τ} - ϑ (u_{h}^{τ})∥}_{K, m} \frac{τ_{m}}{d_{K, m}} {∥v^{'}∥}_{K, m} = \sum_{K, m} η_{T, K, m} \frac{τ_{m}}{d_{K, m}} {∥v^{'}∥}_{K, m} . \end{matrix}

Hence, estimates (45)–(46), the Cauchy inequality and (22) imply

\begin{matrix} | ξ_{2} | + | ξ_{3} | & \leq \sum_{K, m} (η_{S, K, m} \frac{h_{K}}{d_{K, m}} {∥\nabla, v∥}_{K, m} + η_{T, K, m} \frac{τ_{m}}{d_{K, m}} {∥v^{'}∥}_{K, m}) \\ \leq \sum_{K, m} {(η_{S, K, m}^{2} + η_{T, K, m}^{2})}^{1 / 2} {∥v∥}_{V_{K, m}} . \end{matrix}

Furthermore, let $v_{γ, m} \in P_{0} (γ \times I_{m})$ , $γ \subset {\partial K}_{N}$ be the $L^{2}$ -orthogonal projection from Lemma 3. Then using assumption (34), the Cauchy inequality and the space-time trace inequality (36), we have

\begin{matrix} | ξ_{4} | & = \sum_{K, m} \sum_{γ \subset {\partial K}_{N}} (σ_{h}^{τ} \cdot n - g_{N}, v - v_{γ, m})_{γ, m} \leq \sum_{K, m} \sum_{γ \subset {\partial K}_{N}} {∥σ_{h}^{τ} \cdot n - g_{N}∥}_{γ, m} {∥v - v_{γ, m}∥}_{γ, m} \\ \leq C_{T} \sum_{K, m} \sum_{γ \subset {\partial K}_{N}} max (1, h_{γ}^{- 1 / 2}) d_{K, m} {∥σ_{h}^{τ} \cdot n - g_{N}∥}_{γ, m} {∥v∥}_{V_{K, m}} . \end{matrix}

The particular estimates (44), (47), and (48), together with the discrete Cauchy–Schwarz inequality, imply (41). $□$

Remark 2

Obviously, if $\partial K \cap Γ_{N} \neq \emptyset$ , then $η_{N, K, m} = 0$ .

Efficiency

The aim is to show that the local individual error estimators $η_{R, K, m}$ , $η_{S, K, m}$ and $η_{T, K, m}$ from (41)–(42) are locally efficient, i.e., they provide local lower bounds to the error measure up to a generic constant $C > 0$ which is independent of u, $u_{h}^{τ}$ , h and $τ$ , but may depend on data problems and the degrees of polynomial approximation p and q. A dependence of the estimate up to this generic constant we will denote by $≲$ .

In order to derive the local variants of the error measure, we denote by $ω_{K}$ the set of elements sharing at least a vertex with $K \in T_{h}^{m}$ , i.e.,

\begin{matrix} ω_{K} = \cup_{K^{'} \cap K \neq 0} K^{'}, K \in T_{h}^{m}, m = 0, \dots, r . \end{matrix}

Moreover, we define the functional sub-spaces $V_{D, m} = {v \in V^{τ} : supp (v) \subset \bar{D \times I_{m}}}$ for any set $D \subset Ω$ (cf. (18)) and the corresponding error measures (cf. (23))

\begin{matrix} R_{D, m} (w) = sup_{{0 \neq v \in V_{D, m}}} \frac{1}{{∥v∥}_{V^{τ}}} \sum_{K, m} b_{K, m} (w, v) . \end{matrix}

Obviously, the definition of $V_{D, m}$ and $R_{D, m} (u_{h}^{τ})$ together with the shape regularity implies

\begin{matrix} \sum_{K, m} R_{K, m} (u_{h}^{τ}) \leq \sum_{K, m} R_{ω_{K}, m} (u_{h}^{τ}) ≲ R (u_{h}^{τ}) . \end{matrix}

Moreover, for each space-time element $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ , we introduce the $L^{2} (K \times I_{m})$ -projection of the non-polynomial functions, namely

\begin{matrix} \bar{ϑ^{'} (u_{h}^{τ})} \in P_{q} (I_{m}, P_{p_{K}} (K) : (\bar{ϑ^{'} (u_{h}^{τ})}, v)_{K, m} = (ϑ^{'} (u_{h}^{τ}), v)_{K, m} \forall v \in P_{q} (I_{m}, P_{p_{K}} (K)) \\ \bar{g} \in P_{q} (I_{m}, P_{p_{K}} (K)) : (\bar{g}, v)_{K, m} = (g, v)_{K, m} \forall v \in P_{q} (I_{m}, P_{p_{K}} (K)) . \end{matrix}

Finally, for each vertex a of the mesh $T_{h}^{m}$ , we denote by $ω_{a}$ a patch of elements $K \in T_{h}^{m}$ that share this vertex. By $p_{a} = {max}_{K \in ω_{a}} p_{K}$ we denote the maximal polynomial degree on $ω_{a}$ . Then, for each a of $K \in T_{h}^{m}$ , we define a vector-valued function ${\bar{σ}}_{a} = {\bar{σ}}_{a} (u_{h}^{τ}, \nabla u_{h}^{τ}) \in P_{q} (I_{m}, {RTN}_{p_{a}} (K))$ (cf. (32)) by

\begin{matrix} ({\bar{σ}}_{a} \cdot n_{K}, v)_{γ, m} & = (ψ_{a} 〈σ (u_{h}^{τ}, \nabla u_{h}^{τ})〉 \cdot n_{K}, v)_{γ, m} \forall v \in P_{q} (I_{m}, P_{p_{a}} (γ)), γ \subset K \\ {({\bar{σ}}_{a} \cdot v)}_{K, m} & = {(ψ_{a} σ (u_{h}^{τ}, \nabla u_{h}^{τ}), v)}_{K, m} \forall v \in P_{q} (I_{m}, P_{p_{a} - 1} {(K)}^{d}), \end{matrix}

where $〈\cdot〉$ denotes the mean value on $γ \subset \partial K$ and $ψ_{a}$ is a continuous piecewise linear function such that $ψ_{a} (a) = 1$ and it vanishes at the other vertices of K. Finally, we set $\bar{σ} |_{K \times I_{m}} = \sum_{a \in K} {\bar{σ}}_{a}$ .

The proof of the local efficiency of the error estimates presented is based on the choice of a suitable test function in (23). We set

\begin{matrix} w (x, t) = \frac{d_{K, m}^{2}}{τ_{m}} P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}) (x) χ_{K} (x) Φ_{m} (t) . \end{matrix}

where $χ_{K} (x)$ is the standard bubble function on K, $Φ_{m} (t)$ is the Legendre polynomial of degree $q + 1$ on $I_{m}$ (and vanishing outside) and $P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}) \in P_{p_{K}} (K)$ is the $L^{2} (K)$ -projection weighted by $χ_{K} (x)$ given by

\begin{matrix} (P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}), χ_{K} v)_{K} = ({ϑ (u_{h}^{τ})}_{m - 1}, χ_{K} v)_{K} \forall v \in P_{p_{K}} (K) . \end{matrix}

We note that

\begin{matrix} P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}) \neq {\bar{ϑ (u_{h}^{τ})}}_{m - 1}, \end{matrix}

in general, compare with (52).

Using the inverse inequality, the polynomial function w given by (54) can be estimated as

\begin{matrix} {∥w∥}_{V_{K, m}}^{2} & = d_{K, m}^{- 2} (h_{K}^{2} {∥\nabla, w∥}_{K, m}^{2} + τ_{m}^{2} {∥w^{'}∥}_{K, m}^{2}) ≲ d_{K, m}^{- 2} {∥w∥}_{K, m}^{2} \\ \leq \frac{d_{K, m}^{2}}{τ_{m}^{2}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K}^{2} \int_{I_{m}} Φ_{m}^{2} (t) d t ≲ \frac{d_{K, m}^{2}}{τ_{m}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K}^{2} . \end{matrix}

Similarly as in [11] or [18], we introduce the oscillation terms

\begin{matrix} η_{G, K, m} & : = d_{K, m} {∥\bar{g} - g∥}_{K, m}, η_{ϑ, K, m} : = \frac{d_{K, m}}{\sqrt{τ_{m}}} {∥{ϑ (u_{h}^{τ})}_{m - 1} - P_{h} ({ϑ (u_{h}^{τ})}_{m - 1})∥}_{K}, \\ η_{ϑ^{'}, K, m} & : = d_{K, m} {∥\bar{ϑ^{'} (u_{h}^{τ})} - ϑ^{'} (u_{h}^{τ})∥}_{K, m}, \\ η_{σ, K, m} & : = \frac{d_{K, m}}{h_{K}} {∥\bar{σ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} + d_{K, m} {∥\nabla \cdot \bar{σ} - \nabla \cdot σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} . \end{matrix}

The goal is to prove the lower bounds of the proposed error estimates, namely to estimate $η_{T, K, m}$ , $η_{R, K, m}$ and $η_{S, K, m}$ by $R_{K, m} (u_{h}^{τ})$ and the oscillation terms (58), $K \in T_{h}$ , $m = 1, \dots, r$ .

Theorem 2

Let $η_{T, K, m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ be the error estimates given by (42), then

\begin{matrix} η_{T, K, m} ≲ R_{K, m} (u_{h}^{τ}) + η_{G, K, m} + η_{ϑ^{'}, K, m} + η_{ϑ, K, m} + η_{S, K, m} . \end{matrix}

where $R_{K, m}$ are the local error measures defined by (49)–(50) and the oscillation terms $η_{G, K, m}$ , $η_{ϑ, K, m}$ and $η_{ϑ^{'}, K, m}$ are given by (58).

Proof

We start the proof by the putting function w from (54) as the test function in (50), i.e.

\begin{matrix} R_{K, m} (u_{h}^{τ}) = sup_{0 \neq v \in V_{K, m}} \frac{\sum_{K, m} b_{K, m} (u_{h}^{τ}, v)}{{∥v∥}_{V^{τ}}} \geq \frac{b_{K, m} (u_{h}^{τ}, w)}{{∥w∥}_{V^{τ}}} \end{matrix}

since $supp (w) = K \times I_{m}$ , cf. (54). Then, using (20) and the fact that w vanishes on $\partial K$ , we have

\begin{matrix} R_{K, m} (u_{h}^{τ}) & \geq \frac{(ϑ^{'} (u_{h}^{τ}) - g, w)_{K, m} + (σ (u_{h}^{τ}, \nabla u_{h}^{τ}), \nabla w)_{K, m} + ({ϑ (u_{h}^{τ})}_{m - 1}, w_{+}^{m - 1})_{K}}{{∥w∥}_{V_{K, m}}} \\ = \frac{(\bar{ϑ^{'} (u_{h}^{τ})} - \bar{g}, w)_{K, m} + (σ_{h}^{τ}, \nabla w)_{K, m}}{{∥w∥}_{V_{K, m}}} + \frac{({ϑ (u_{h}^{τ})}_{m - 1}, w_{+}^{m - 1})_{K}}{{∥w∥}_{V_{K, m}}} = : ξ_{1} + ξ_{2} \\ + \frac{(\bar{g} - g, w)_{K, m} + (σ - σ_{h}^{τ}, \nabla w)_{K, m} + (ϑ^{'} (u_{h}^{τ}) - \bar{ϑ^{'} (u_{h}^{τ})}, w)_{K, m}}{{∥w∥}_{V_{K, m}}} \\ = : ξ_{3} + ξ_{4} + ξ_{5} . \end{matrix}

The functions $\bar{ϑ^{'} (u_{h}^{τ})}$ , $\bar{g}$ and $σ_{h}^{τ}$ are polynomials of degree q in time whereas w and $\nabla w$ are the (Legendre) polynomial of degree $(q + 1)$ in time, cf. (54). Due to the $L^{2} (I_{m})$ -orthogonality of the Legendre polynomials, we have $ξ_{1} = 0$ , since

\begin{matrix} (\bar{ϑ^{'} (u_{h}^{τ})} - \bar{g}, w)_{K, m} + (σ_{h}^{τ}, \nabla w)_{K, m} = 0 \end{matrix}

Moreover, using inequality (57), relations (54)-(55) and the equivalence of norms on finite dimensional spaces,

we obtain

\begin{matrix} ξ_{2} & ≳ \frac{(P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}), \frac{d_{K, m}^{2}}{τ_{m}} P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}) χ_{K})_{K}}{\frac{d_{K, m}}{\sqrt{τ_{m}}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K}} \\ ≳ \frac{d_{K, m}}{\sqrt{τ_{m}}} \frac{(P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}), P_{h} ({ϑ (u_{h}^{τ})}_{m - 1}))_{K}}{{∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K}} = \frac{d_{K, m}}{\sqrt{τ_{m}}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K} . \end{matrix}

Furthermore, let $w_{K, m} = \frac{1}{K \times I_{m}} \int_{K \times I_{m}} w d x d t$ be the mean value of w on the space-time element $K \times I_{m}$ . Due to (52), the Cauchy–Schwarz inequality and (35), we have

\begin{matrix} | ξ_{3} | & = \frac{| (\bar{g} - g, w - w_{K, m})_{K, m} |}{{∥w∥}_{V_{K, m}}} \\ \leq \frac{{∥\bar{g} - g∥}_{K, m} {∥w - w_{K, m}∥}_{K, m}}{{∥w∥}_{V_{K, m}}} ≲ d_{K, m} {∥\bar{g} - g∥}_{K, m} = η_{G, K, m}, \end{matrix}

and

\begin{matrix} | ξ_{5} | ≲ d_{K, m} {∥ϑ^{'} (u_{h}^{τ}) - \bar{ϑ^{'} (u_{h}^{τ})}∥}_{K, m} = η_{ϑ^{'}, K, m} . \end{matrix}

Similarly, the Cauchy–Schwarz inequality and (22) imply

\begin{matrix} | ξ_{4} | & \leq \frac{d_{K, m}}{h_{K}} {∥σ (u_{h}^{τ}, \nabla u_{h}^{τ}) - σ_{h}^{τ}∥}_{K, m} \frac{h_{K} {∥\nabla, w∥}_{K, m}}{d_{K, m} {∥w∥}_{V_{K, m}}} \\ \leq \frac{d_{K, m}}{h_{K}} {∥σ (u_{h}^{τ}, \nabla u_{h}^{τ}) - σ_{h}^{τ}∥}_{K, m} = η_{S, K, m} . \end{matrix}

Collecting (61)–(66), we have

\begin{matrix} R_{K, m} (u_{h}^{τ}) & ≳ \frac{d_{K, m}}{\sqrt{τ_{m}}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K} - η_{G, K, m} - η_{S, K, m} - η_{ϑ^{'}, K, m} . \end{matrix}

Moreover, using (42c), (26), integration by parts, the boundedness of the Radau polynomials, the triangle inequality and (58), we have

\begin{matrix} η_{T, K, m} & = \frac{d_{K, m}}{τ_{m}} {∥R_{h}^{τ} - ϑ (u_{h}^{τ})∥}_{K, m} = \frac{d_{K, m}}{τ_{m}} {∥{, ϑ (u_{h}^{τ}), }_{m - 1}, r_{m}∥}_{K, m} \\ = \frac{d_{K, m}}{τ_{m}} {∥{, ϑ (u_{h}^{τ}), }_{m - 1}∥}_{K} \sqrt{\int_{I_{m}} r_{m}^{2} d t} ≲ \frac{d_{K, m}}{\sqrt{τ_{m}}} {∥{, ϑ (u_{h}^{τ}), }_{m - 1}∥}_{K} \\ \leq \frac{d_{K, m}}{\sqrt{τ_{m}}} {∥P_{h}, (, {, ϑ (u_{h}^{τ}), }_{m - 1}, )∥}_{K} + η_{ϑ, K, m} . \end{matrix}

Hence, (67) and (68)

\begin{matrix} η_{T, K, m} \leq R_{K, m} (u_{h}^{τ}) + η_{ϑ, K, m} + η_{G, K, m} + η_{ϑ^{'}, K, m} + η_{S, K, m}, \end{matrix}

which proves the theorem. $□$

Theorem 3

Let $η_{S, K, m}$ and $η_{R, K, m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ be the error estimates given by (42), then

\begin{matrix} η_{R, K, m} & ≲ R_{ω_{K}, m} (u_{h}^{τ}) + η_{G, K, m} + η_{σ, K, m} + η_{S, K, m}, \end{matrix}

\begin{matrix} η_{S, K, m} & ≲ R_{ω_{K}, m} (u_{h}^{τ}) + η_{G, K, m} + \sum_{K \subset ω_{K}} η_{σ, K, m}, \end{matrix}

where $R_{ω_{K}, m}$ is the local error measures defined by (49)–(50) and the oscillation terms $η_{G, K, m}$ , $η_{ϑ, K, m}$ and $η_{ϑ^{'}, K, m}$ are given by (58).

Proof

The proof is in principle identical with the proof [18, Lemmas 7-9], we present the main step for completeness. Let $\bar{g}$ and $\bar{σ}$ be the projection given by (52) and (53). Using the triangle inequality, the inverse inequality and (58), we obtain

\begin{matrix} η_{R, K, m} & = d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot σ_{h}^{τ} - g∥}_{K, m} \\ \leq d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot \bar{σ} - \bar{g}∥}_{K, m} + d_{K, m} {∥\bar{g} - g∥}_{K, m} + d_{K, m} {∥\nabla \cdot \bar{σ} - \nabla \cdot σ_{h}^{τ}∥}_{K, m} \\ ≲ d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot \bar{σ} - \bar{g}∥}_{K, m} + η_{G, K, m} + \frac{d_{K, m}}{h_{K}} {∥\bar{σ} - σ_{h}^{τ}∥}_{K, m} . \end{matrix}

The first term on the right-hand side of (72) can be estimated as in [36, Theorem 4.10] by

\begin{matrix} d_{K, m} {∥{(R_{h}^{τ})}^{'} - \nabla \cdot \bar{σ} - \bar{g}∥}_{K, m} ≲ R e s_{ω_{K}, m} (u_{h}^{τ}) + η_{G, K, m} + η_{σ, K, m}, \end{matrix}

where the resulting oscillation terms are estimated with the aid (58). Moreover, the last term on the right-hand side of (72) together with (42b) and assumption (58), reads

\begin{matrix} \frac{d_{K, m}}{h_{K}} {∥\bar{σ} - σ_{h}^{τ}∥}_{K, m} & \leq \frac{d_{K, m}}{h_{K}} {∥\bar{σ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} + \frac{d_{K, m}}{h_{K}} {∥σ (u_{h}^{τ}, \nabla u_{h}^{τ}) - σ_{h}^{τ}∥}_{K, m} \\ \leq η_{σ, K, m} + η_{S, K, m}, \end{matrix}

which proves (70).

The proof of (71) is based on the decomposition

\begin{matrix} {∥σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} \leq {∥σ_{h}^{τ} - \bar{σ}∥}_{K, m} + {∥\bar{σ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} . \end{matrix}

While the second term on the right-hand side of (75) can be estimated by assumption (58), the estimate of the first term is somewhat more technical. It depends on the flux reconstruction used. For the flux reconstruction in Sect. 5.2, the proof is identical to the proof of [18, Lemma 9], which mimics the stationary variant [24, Theorem 3.12]. On the other hand, using the flux reconstruction from Sect. 5.1, it is possible to apply the technique from [11, Lemma 7.5], where the final relation has to be integrated over $I_{m}$ . $□$

Spatial Flux Reconstructions and Stopping Criteria

We present two ways of reconstructing the spatial flux $σ_{h}^{τ} \in L^{2} (0, T, H (div, Ω))$ that satisfies the assumptions (33)–(34). The first one, proposed in [19] for the case of homogeneous Dirichlet boundary condition, is defined by the volume and edge momenta of the Raviart-Thomas-Nedelec (RTN) elements, cf. [7], and is easy to compute. The second approach is based on the solution of local Neumann problems on patches associated with each vertex of the mesh. This idea comes from, e.g., [24], its space-time variant was proposed in [18] for nonlinear convection-diffusion equations. Finally, in Sect. 5.3, we discuss the errors arising from the solution of algebraic systems and introduce a stopping criterion for the appropriate iterative solver.

Element-Wise Variant

We denote by $p_{K, max}$ the maximum polynomial degree over the element K and its neighbours that share the entire edge with K and $p_{γ, max}$ the maximum polynomial degree on neighbouring elements having a common edge $γ$ . Let ${RTN}_{p_{K, max}} (K)$ be the space of RTN finite elements of order $p_{K, max}$ for element $K \in T_{h}^{m}$ , cf. (32), and $u_{h}^{τ} \in S_{hp}^{τ q}$ be the approximate solution. The spatial reconstruction $σ_{h}^{τ}$ is defined element-wise: for each $K \in T_{h}^{m}$ , find $σ_{h}^{τ} |_{K \times I_{m}} \in P_{q} (I_{m}, {RTN}_{p_{K, max}} (K))$ with $σ_{h}^{τ} \cdot n |_{γ \times I_{m}} \in P_{q} (I_{m}, P_{p_{γ, max}} (γ))$ such that

\begin{matrix} (σ_{h}^{τ} \cdot n, v)_{γ, m} & = \{\begin{matrix} (〈K (u_{h}^{τ}) \nabla u_{h}^{τ}〉 \cdot n - α [u_{h}^{τ}] \cdot n, v)_{γ, m} & \forall v \in P_{q} (I_{m}, P_{p_{γ, max}} (γ)), γ \subset \partial K \ Γ_{N} \\ (g_{N}, v)_{γ} & \forall v \in P_{q} (I_{m}, P_{p_{γ, max}} (γ)), γ \subset {\partial K}_{N} \end{matrix}) \\ (σ_{h}^{τ}, v)_{K, m} & = (K (u_{h}^{τ}) \nabla u_{h}^{τ}, \nabla v)_{K, m} + (β - \frac{1}{2}) (K (u_{h}^{τ}) [u_{h}^{τ}], \nabla v)_{\partial K \ Γ, m} \\ + (2 β - 1) (K (u_{h}^{τ}) [u_{h}^{τ}], \nabla v)_{\partial K \cap Γ_{D}, m} \forall v \in P_{q} (I_{m}, P_{p_{K, max} - 1} {(K)}^{d}) . \end{matrix}

The edge momenta in (76) are uniquely defined and since $p_{γ, max} \leq p_{K, max}$ , $σ_{h}^{τ}$ in (76) is well defined as well. Here, the numerical flux $〈K (u_{h}^{τ}) \nabla u_{h}^{τ}〉 \cdot n - α [u_{h}^{τ}] \cdot n$ is conservative on interior edges, which implies that $σ_{h}^{τ} \cdot n$ are the same on each interior edge $γ$ and therefore the resulting reconstruction $σ_{h}^{τ} \in L^{2} (0, T, H (div, Ω))$ globally.

Obviously, the first relation in (76) with $p_{K} \leq p_{γ, max}$ directly implies assumption (34). Moreover, using the Green theorem, (76), (12), (31) and $p_{K} \leq p_{γ, max} \leq p_{K, max}$ , we obtain

\begin{matrix} (\nabla \cdot σ_{h}^{τ}, v)_{K, m} & = - (σ_{h}^{τ}, \nabla v)_{K, m} + (σ_{h}^{τ} \cdot n_{K}, v)_{\partial K, m} \\ = - (K (u_{h}^{τ}) \nabla u_{h}^{τ}, \nabla v)_{K, m} + (〈K (u_{h}^{τ}) \nabla u_{h}^{τ}〉 \cdot n - α [u_{h}^{τ}] \cdot n, v)_{\partial K \ Γ_{N}, m} \\ - (β - \frac{1}{2}) (K (u_{h}^{τ}) [u_{h}^{τ}], \nabla v)_{\partial K \ Γ, m} - (2 β - 1) (K (u_{h}^{τ}) [u_{h}^{τ}], \nabla v)_{\partial K \cap Γ_{D}, m} \\ + (g_{N}, v)_{{\partial K}_{N}, m} \\ = - A_{K, m} (u_{h}^{τ}, v) - (g, v)_{K, m} = ({(R_{h}^{τ})}^{'} - g, v)_{K, m} \\ \forall v \in P_{q} (I_{m}, P_{p_{K}} (K)), K \in T_{h}^{m}, \end{matrix}

which justifies the assumption (33).

Patch-Wise Variant

For each vertex a of the mesh $T_{h}^{m}$ , we denote by $ω_{a}$ a patch of elements $K \in T_{h}^{m}$ sharing this vertex. By $p_{a} = {max}_{K \in ω_{a}} p_{K}$ we denote the maximal polynomial degree on $ω_{a}$ . Let $P_{p_{a}}^{*} (ω_{a})$ be the space of piecewise polynomial discontinuous functions of degree $p_{a}$ on $ω_{a}$ with mean value zero for $a \notin \partial Ω$ . We define the space

\begin{matrix} W_{RTN, p_{a}}^{N} (ω_{a}) & = {v \in H (div, ω_{a}) ; v |_{K} \in {RTN}_{p_{a}} (K), v \cdot n = 0 on \partial ω_{a}}, a \notin \partial Ω \\ W_{RTN, p_{a}}^{N} (ω_{a}) & = {v \in H (div, ω_{a}) ; v |_{K} \in {RTN}_{p_{a}} (K), v \cdot n = 0 on \partial ω_{a} \ \partial Ω, \\ & {(v \cdot n, ϕ)}_{γ, m} = {(g_{N}, ϕ)}_{γ, m} \forall ϕ \in P_{q} (I_{m}, P_{p_{a}} (γ)) on \partial ω_{a} \cap {\partial K}_{N}}, a \in \partial Ω . \end{matrix}

We set the local problems on patches $ω_{a}$ for all vertices a: find $σ_{h}^{τ} \in P_{q} (I_{m}, W_{RTN, p_{a}}^{N} (ω_{a}))$ and $r_{a}^{τ} \in P_{q} (I_{m}, P_{p_{a}}^{*} (ω_{a}))$ such that

\begin{matrix} (σ_{a}^{τ}, v)_{ω_{a}, m} - (r_{a}^{τ}, \nabla \cdot v)_{ω_{a}, m} & = (ξ_{a}^{1}, v)_{ω_{a}, m} \forall v \in P_{q} (I_{m}, W_{RTN, p_{a}}^{N} (ω_{a})) \\ (\nabla \cdot σ_{a}^{τ}, ϕ)_{ω_{a}, m} & = (ξ_{a}^{2}, ϕ)_{ω_{a}, m} \forall ϕ \in P_{q} (I_{m}, P_{p_{a}}^{*} (ω_{a})), \end{matrix}

where

\begin{matrix} ξ_{a}^{1} & = ψ_{a} σ (u_{h}^{τ}, \nabla u_{h}^{τ}) \\ ξ_{a}^{2} & = ψ_{a} {(R_{h}^{τ})}^{'} - ψ_{a} g + \nabla ψ_{a} \cdot ξ (u_{h}^{τ}, \nabla u_{h}^{τ}), \end{matrix}

with

\begin{matrix} ξ (u_{h}^{τ}, \nabla u_{h}^{τ}) = σ (u_{h}^{τ}, \nabla u_{h}^{τ}) + (2 β - 1) \sum_{γ ⊄ Γ_{N}} ℓ_{m, γ} (u_{h}^{τ}), \end{matrix}

and $ℓ_{m, γ} : S_{h p, m} \to {[S_{h 0, m}]}^{d}$ is the lifting operator defined by

\begin{matrix} \int_{Ω} ℓ_{m, γ} (u_{h}^{τ}) \cdot v d x = \int_{γ} [u_{h}^{τ}] 〈K (u_{h}^{τ}) v〉 d x \forall v \in {[S_{h 0, m}]}^{d}, γ ⊄ Γ_{N} . \end{matrix}

Then the final reconstructed flux is obtained by summing up $σ_{a}^{τ}$ on each element that contains vertex a, i.e.,

\begin{matrix} σ_{h}^{τ} {|_{K, m} = \sum_{a \in K} σ_{a}^{τ} |}_{K} . \end{matrix}

The assumption (34) follows directly from (78) and $p_{K} \leq p_{a}$ . Inserting the hat function $ψ_{a} v$ for $a \notin \partial Ω$ and $v \in P_{q} (I_{m})$ in (17), using (5), (82) and omitting the zero terms, we have

\begin{matrix} \sum_{K \in T_{h}^{m}} A_{K, m} (u_{h}^{τ}, ψ_{a} v) \\ = \sum_{K \in T_{h}^{m}} (K (u_{h}^{τ}) \nabla u_{h}^{τ}, \nabla ψ_{a} v)_{K, m} \\ + (2 β - 1) \sum_{γ ⊄ \partial Ω} ([u_{h}^{τ}], 〈K (u_{h}^{τ}) \nabla ψ_{a} v〉)_{γ, m} + (2 β - 1) \sum_{γ \subset Γ_{D}} ([u_{h}^{τ}], K (u_{h}^{τ}) \nabla ψ_{a} v)_{γ, m} \\ - (g, ψ_{a} v)_{Ω, m} = (ξ_{a}^{2}, v)_{ω_{a}, m} - (R_{h}^{τ}, ψ_{a} v)_{ω_{a}, m} \end{matrix}

Applying (13) and (31), we gain for $a \notin \partial Ω$ and $v \in P_{q} (I_{m})$

\begin{matrix} (\nabla \cdot σ_{a}^{τ}, v)_{ω_{a}, m} = \sum_{K \subset ω_{a}} (A_{K, m} (u_{h}^{τ}, ψ_{a} v) + (R_{h}^{τ}, ψ_{a} v)_{K, m}) = (ξ_{a}^{2}, v)_{ω_{a}, m} . \end{matrix}

From this it follows that the second relation in (79) holds element-wise, i.e.

\begin{matrix} (\nabla \cdot σ_{a}^{τ}, ϕ)_{K, m} = (ξ_{a}^{2}, ϕ)_{K, m}, \forall ϕ \in P_{q} (I_{m}, P_{p_{a}} (K)) . \end{matrix}

Then (33) follows from

\begin{matrix} {(\nabla \cdot σ_{h}^{τ}, ϕ)}_{K, m} & = \sum_{a \subset K} {(\nabla \cdot σ_{a}^{τ}, ϕ)}_{K, m} = \sum_{a \subset K} (ξ_{a}^{2}, ϕ)_{K, m} \\ = {({(R_{h}^{τ})}^{'} - g, ϕ)}_{K, m} \forall ϕ \in P_{q} (I_{m}, P_{p_{a}} (K)) \end{matrix}

and from $p_{K} \leq p_{a}$ .

Stopping Criteria for Iterative Solvers

The space-time discretization (13) leads to a system of nonlinear algebraic equations for each time level $m = 1, \dots, r$ . These systems have to be solved iteratively by a suitable solver, e.g., the Picard method, the Newton method or their variants. Therefore, it is necessary to set a suitable stopping criterion for the iterative solvers so that, on the one hand, the algebraic errors do not affect the quality of the approximate solution and, on the other hand, an excessive number of algebraic iterations is avoided.

However, the error estimates presented in Sect. 4 do not take into account errors arising from the inaccurate solution of these systems. Indeed, the aforementioned reconstructions fulfill assumption (33) only if the systems given by (13) are solved exactly. The full a posteriori error analysis including algebraic errors has been treated, e.g., in [8, 23, 29]. These error estimators are based on additional flux reconstructions that need to be evaluated at each iteration, and therefore, the overall computational time is increased.

To speed up the computations and control the algebraic errors, we adopt the technique of [17]. This approach offers (i) the measurement of algebraic errors by a quantity similar to the error measure (23), (ii) the setting of the stopping criterion for iterative solvers with one parameter corresponding to the relative error, and (iii) a fast evaluation of the required quantities.

For each $m = 1, \dots, r$ , we define the estimators (cf. (23))

\begin{matrix} η_{alg}^{m} (u_{h}^{τ}) = sup_{0 \neq v \in S_{hp}^{τ q}} \frac{\sum_{K \in T_{h}^{m}} b_{K, m} (u_{h}^{τ}, v)}{{∥v∥}_{V^{τ}}}, η_{spa}^{m} (u_{h}^{τ}) = sup_{0 \neq v \in S_{h p + 1}^{τ q + 1}} \frac{\sum_{K \in T_{h}^{m}} b_{K, m} (u_{h}^{τ}, v)}{{∥v∥}_{V^{τ}}}, \end{matrix}

where the norm ${∥\cdot∥}_{V^{τ}}$ is given by (22),

\begin{matrix} S_{h p + 1}^{τ q + 1} = {v \in L^{2} (Ω \times (0, T)) : v |_{I_{m}} \in P_{q + 1} (I_{m}, S_{h p + 1, m}), m = 1, \dots, r}, \\ and S_{h p + 1, m} = {v \in L^{2} (Ω) : v |_{K} \in P_{p_{K} + 1} (K), K \in T_{h}^{m}}, m = 0, \dots, r . \end{matrix}

The space $S_{h p + 1}^{τ q + 1}$ is an enrichment space of $S_{hp}^{τ q}$ by polynomials of the space degree $p_{K} + 1$ and the time degree $q + 1$ for each $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ . Finally, we define the global in time quantities

\begin{matrix} η_{alg} (u_{h}^{τ}) = {(\sum_{m = 1}^{r}, {(η_{alg}^{m} (u_{h}^{τ}))}^{2})}^{1 / 2}, η_{spa} (u_{h}^{τ}) = {(\sum_{m = 1}^{r}, {(η_{spa}^{m} (u_{h}^{τ}))}^{2})}^{1 / 2} . \end{matrix}

Obviously, if $u_{h}^{τ}$ fulfills (13) exactly, then $η_{alg}^{m} (u_{h}^{τ}) = 0$ for all $m = 0, \dots, r$ . Moreover, if $u_{h}^{τ}$ is the weak solution (6) then $η_{spa}^{m} (u_{h}^{τ}) = 0$ for all $m = 0, \dots, r$ . Comparing (88) with (23), the quantity $η_{spa} (u_{h}^{τ})$ exhibits a variant of the error measure $R (u_{h}^{τ})$ . Nevertheless, $η_{spa} (u_{h}^{τ})$ is neither lower nor upper bound of $R (u_{h}^{τ})$ since $S_{h p + 1}^{τ q + 1} ⊄ V^{τ}$ and $V^{τ} ⊄ S_{h p + 1}^{τ q + 1}$ .

The quantities (88) can be evaluated very fast since the suprema (maxima) are the sum of the suprema (maxima) for all space-time elements $K \times I_{m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ , which are computable separately, cf. [17] for details. Hence, we prescribe the stopping criterion for the corresponding iterative solver as

\begin{matrix} η_{alg}^{m} (u_{h}^{τ}) \leq c_{A} η_{spa}^{m} (u_{h}^{τ}), m = 1, \dots, r, \end{matrix}

where $c_{A} \in (0, 1)$ is the user-dependent constant. The justification of this approach and the influence of algebraic errors on the error estimates are studied numerically in Sect. 6.1.1.

Numerical Experiments

We present numerical experiments that justify the a posteriori error estimates (41)–(42). Since the error measure (23) is the dual norm of the residual, it is not possible to evaluate the error even if the exact solution is known. Therefore, similarly to [18], we approximate the error by solving the dual problem given for each time interval $I_{m}, m = 1, \dots, r$ : Find $ψ_{m} \in Y_{m}^{τ} = L^{2} (I_{m}, H^{1} (Ω))$ ,

\begin{matrix} (ψ_{m}, ϕ)_{Y_{m}^{τ}} = \sum_{K, m} b_{K, m} (u_{h}^{τ}, ϕ) \forall ϕ \in Y_{m}^{τ}, \end{matrix}

where (cf. (21a)–(22))

\begin{matrix} (u, v)_{Y_{m}^{τ}} = \sum_{K \in T_{h}^{m}} d_{K, m}^{- 2} (h_{K}^{2} (\nabla u, \nabla v)_{K, m} + τ_{m}^{2} (u^{'}, v^{'})_{K, m}), m = 1, \dots, r . \end{matrix}

Then we have $R {(u_{h}^{τ})}^{2} = \sum_{m = 1}^{r} {∥ψ∥}_{Y_{m}^{τ}}^{2}$ . We solve (92) for each $m = 1, \dots, r$ by linear conforming finite element on a global refinement of the space-time mesh $T_{h}^{m} \times I_{m}$ which is proportional to the space and time polynomial approximation degrees. We denote this quantity by $\tilde{R} (u_{h}^{τ})$ . The second error contribution $J$ given by (24) is computable, so the total error $E$ (cf. (25)) is approximated by $\tilde{E} (u_{h}^{τ}) : = {(\tilde{R} {(u_{h}^{τ})}^{2} + J (u_{h}^{τ}))}^{1 / 2}$ .

Remark 3

Sometimes, this approximate evaluation of the (exact) error is not sufficiently accurate for fine grids and high polynomial approximation degrees. In this case, very fine global refinement is required and then the resulting algebraic systems are too large to be solved in a reasonable time.

All numerical experiments were carried out using the patch-wise reconstruction from Sect. 5.2 using the in-house code ADGFEM [10]. The arising nonlinear algebraic systems are solved iteratively by a Newton-like method, we refer to [14] for details. Each Newton-line iteration leads to a linear algebraic system that is solved by GMRES method with block ILU(0) preconditioner.

Barenblatt Problems

We consider two nonlinear variants of (3) following from the Barenblatt problem [4] where the analytical solution exists. The first variant reads

\begin{matrix} \partial_{t} ϑ (u) - Δ u = 0, ϑ (u) = u^{1 / m}, m \in (0, 1), \end{matrix}

where the analytical solution is

\begin{matrix} u (x_{1}, x_{2}, t) = \frac{1}{1 + t} (⌊ [1 - \frac{m - 1}{4 m^{2}} \frac{x_{1}^{2} + x_{2}^{2}}{{(1 + t)}^{1 / m}} ⌋_{+})^{\frac{m}{m - 1}}, {⌊ z ⌋}_{+} : = max (z, 0), z \in R \end{matrix}

Using the substitution $v : = u^{1 / m}$ , we have the second variant

\begin{matrix} \partial_{t} v - \nabla \cdot {(m | v |}^{m - 1} \nabla v) = 0, m > 1, \end{matrix}

having the solution

\begin{matrix} v (x_{1}, x_{2}, t) = {\frac{1}{1 + t} (⌊ 1 - \frac{m - 1}{4 m^{2}} \frac{x_{1}^{2} + x_{2}^{2}}{{(1 + t)}^{1 / m}} ⌋_{+})^{\frac{m}{m - 1}}}^{1 / m} . \end{matrix}

For both problems ((94)–(95) and (96)–(97)), the computational domain is $Ω = {(- 6, 6)}^{2}$ and the Dirichlet boundary condition is prescribed on all boundaries by (95) or (97). The final time is $T = 1$ .

We carried out computation using a sequence of uniform triangular grids (having 288, 1152, 4608 and 18432 triangles) with several combinations of polynomial approximation degrees with respect to space (p) and time (q). The time step has been chosen constant $τ = 0.01$ . Besides the error quantities ( $\tilde{R} (u_{h}^{τ})$ and $J (u_{h}^{τ})$ ) and its estimators $η$ , $η_{R} : = \sum_{K, m} η_{R, K, m}$ , $η_{S} : = \sum_{K, m} η_{S, K, m}$ and $η_{T} : = \sum_{K, m} η_{T, K, m}$ , we evaluate the effectivity indices

\begin{matrix} i_{eff} = \frac{η}{\tilde{R} (u_{h}^{τ})}, i_{eff}^{tot} = \frac{{(η^{2} + J (u_{h}^{τ}))}^{1 / 2}}{\tilde{E} (u_{h}^{τ})} . \end{matrix}

In addition, we present the experimental order of convergence (EoC) of the errors and the estimators for each pair of successive meshes.

Tables 1–4 show the results for both Barenblatt problems ((94)–(95) with $m = 0.25$ and (96)–(97) with $m = 2$ ) with two variants of the scaling parameter $d_{K, m}$ , $K \in T_{h}^{m}$ , $m = 1, \dots, r$ given by (21a) and (21b). The quantity $# DoF$ represents the number of degrees of freedom in the space, that is, $# DoF = dim S_{h p, m}$ , $m = 1, \dots, r$ . We observe a good correspondence between $\tilde{R} (u_{h}^{τ})$ and $η$ , the effectivity index $i_{eff}$ varies between 1 and 2.5 for all tested values of p and q and both variants of $d_{K, m}$ ((21a) and (21b)).

Table 1.

Barenblatt problem (94)–(95), $m = 0.25$ , scaling parameter $d_{K, m}$ given by (21a), approximation of the error and the error estimators, EOC in parenthesis

h	$# DoF$	$\tilde{R} (u_{h}^{τ})$	$η$	$J (u_{h}^{τ})$	$η_{R}$	$η_{S}$	$η_{T}$	$i_{eff}$	$i_{eff}^{tot}$
$p = 1$ & $q = 1$
1.41	864	$8.42 \times 10^{- 4}$	$1.46 \times 10^{- 3}$	$4.01 \times 10^{- 3}$	$7.67 \times 10^{- 6}$	$1.22 \times 10^{- 3}$	$7.94 \times 10^{- 4}$	1.73	1.13
0.71	3456	$7.31 \times 10^{- 4}$	$1.29 \times 10^{- 3}$	$3.69 \times 10^{- 3}$	$7.68 \times 10^{- 6}$	$1.16 \times 10^{- 3}$	$5.38 \times 10^{- 4}$	1.76	1.13
0.71	3456	(0.20)	(0.18)	(0.12)	(0.00)	(0.07)	(0.56)	1.76	1.13
0.35	13824	$4.98 \times 10^{- 4}$	$1.04 \times 10^{- 3}$	$2.95 \times 10^{- 3}$	$8.56 \times 10^{- 6}$	$1.02 \times 10^{- 3}$	$1.55 \times 10^{- 4}$	2.09	1.16
0.35	13824	(0.55)	(0.30)	(0.33)	( $-$ 0.16)	(0.18)	(1.80)	2.09	1.16
0.18	55296	$4.40 \times 10^{- 4}$	$1.01 \times 10^{- 3}$	$2.81 \times 10^{- 3}$	$9.00 \times 10^{- 6}$	$1.01 \times 10^{- 3}$	$3.22 \times 10^{- 5}$	2.31	1.18
0.18	55296	(0.18)	(0.04)	(0.07)	( $-$ 0.07)	(0.02)	(2.26)	2.31	1.18
$p = 2$ & $q = 2$
1.41	1728	$2.06 \times 10^{- 4}$	$3.49 \times 10^{- 4}$	$1.32 \times 10^{- 3}$	$5.60 \times 10^{- 7}$	$3.17 \times 10^{- 4}$	$1.46 \times 10^{- 4}$	1.70	1.09
0.71	6912	$1.16 \times 10^{- 4}$	$1.86 \times 10^{- 4}$	$7.91 \times 10^{- 4}$	$4.59 \times 10^{- 8}$	$1.74 \times 10^{- 4}$	$6.64 \times 10^{- 5}$	1.60	1.08
0.71	6912	(0.82)	(0.91)	(0.74)	(3.61)	(0.86)	(1.14)	1.60	1.08
0.35	27648	$6.51 \times 10^{- 5}$	$8.69 \times 10^{- 5}$	$4.33 \times 10^{- 4}$	$5.46 \times 10^{- 8}$	$8.57 \times 10^{- 5}$	$1.44 \times 10^{- 5}$	1.34	1.04
0.35	27648	(0.84)	(1.10)	(0.87)	( $-$ 0.25)	(1.02)	(2.20)	1.34	1.04
0.18	110592	$3.51 \times 10^{- 5}$	$4.34 \times 10^{- 5}$	$2.28 \times 10^{- 4}$	$6.54 \times 10^{- 8}$	$4.32 \times 10^{- 5}$	$2.46 \times 10^{- 6}$	1.23	1.03
0.18	110592	(0.89)	(1.00)	(0.92)	( $-$ 0.26)	(0.99)	(2.56)	1.23	1.03
$p = 3$ & $q = 2$
1.41	2880	$6.32 \times 10^{- 5}$	$1.16 \times 10^{- 4}$	$3.74 \times 10^{- 4}$	$4.39 \times 10^{- 8}$	$9.31 \times 10^{- 5}$	$6.83 \times 10^{- 5}$	1.83	1.12
0.71	11520	$2.03 \times 10^{- 5}$	$3.02 \times 10^{- 5}$	$1.45 \times 10^{- 4}$	$3.48 \times 10^{- 8}$	$2.82 \times 10^{- 5}$	$1.08 \times 10^{- 5}$	1.49	1.06
0.71	11520	(1.64)	(1.94)	(1.37)	(0.33)	(1.73)	(2.66)	1.49	1.06
0.35	46080	$5.77 \times 10^{- 6}$	$8.16 \times 10^{- 6}$	$4.11 \times 10^{- 5}$	$5.31 \times 10^{- 8}$	$8.03 \times 10^{- 6}$	$1.20 \times 10^{- 6}$	1.41	1.05
0.35	46080	(1.81)	(1.89)	(1.81)	( $-$ 0.61)	(1.81)	(3.17)	1.41	1.05
0.18	184320	$1.35 \times 10^{- 6}$	$2.09 \times 10^{- 6}$	$9.47 \times 10^{- 6}$	$6.53 \times 10^{- 8}$	$2.04 \times 10^{- 6}$	$8.52 \times 10^{- 8}$	1.55	1.07
0.18	184320	(2.10)	(1.97)	(2.12)	( $-$ 0.30)	(1.98)	(3.82)	1.55	1.07

Open in a new tab

Table 4.

Barenblatt problem (96)–(97), $m = 2$ , scaling parameter $d_{K, m}$ given by (21b), approximation of the error and the error estimators, EOC in parenthesis

h	$# DoF$	$\tilde{R} (u_{h}^{τ})$	$η$	$J (u_{h}^{τ})$	$η_{R}$	$η_{S}$	$η_{T}$	$i_{eff}$	$i_{eff}^{tot}$
$p = 1$ & $q = 1$
1.41	864	$3.35 \times 10^{- 1}$	$5.76 \times 10^{- 1}$	1.59	$6.44 \times 10^{- 5}$	$4.58 \times 10^{- 1}$	$3.50 \times 10^{- 1}$	1.72	1.13
0.71	3456	$8.54 \times 10^{- 2}$	$1.60 \times 10^{- 1}$	$3.84 \times 10^{- 1}$	$2.95 \times 10^{- 5}$	$1.37 \times 10^{- 1}$	$8.19 \times 10^{- 2}$	1.87	1.16
0.71	3456	(1.97)	(1.85)	(2.05)	(1.13)	(1.74)	(2.09)	1.87	1.16
0.35	13824	$2.74 \times 10^{- 2}$	$5.68 \times 10^{- 2}$	$1.14 \times 10^{- 1}$	$1.20 \times 10^{- 5}$	$5.43 \times 10^{- 2}$	$1.68 \times 10^{- 2}$	2.07	1.21
0.35	13824	(1.64)	(1.49)	(1.75)	(1.29)	(1.34)	(2.29)	2.07	1.21
0.18	55296	$1.15 \times 10^{- 2}$	$2.57 \times 10^{- 2}$	$4.53 \times 10^{- 2}$	$4.56 \times 10^{- 6}$	$2.55 \times 10^{- 2}$	$3.03 \times 10^{- 3}$	2.22	1.25
0.18	55296	(1.25)	(1.15)	(1.34)	(1.40)	(1.09)	(2.47)	2.22	1.25
0.09	221184	$5.54 \times 10^{- 3}$	$1.27 \times 10^{- 2}$	$2.13 \times 10^{- 2}$	$1.64 \times 10^{- 6}$	$1.26 \times 10^{- 2}$	$6.05 \times 10^{- 4}$	2.29	1.27
0.09	221184	(1.06)	(1.02)	(1.09)	(1.48)	(1.01)	(2.32)	2.29	1.27
$p = 2$ & $q = 2$
1.41	1728	$6.33 \times 10^{- 2}$	$1.27 \times 10^{- 1}$	$4.52 \times 10^{- 1}$	$1.98 \times 10^{- 5}$	$9.68 \times 10^{- 2}$	$8.18 \times 10^{- 2}$	2.00	1.12
0.71	6912	$1.63 \times 10^{- 2}$	$2.99 \times 10^{- 2}$	$1.10 \times 10^{- 1}$	$8.16 \times 10^{- 6}$	$2.35 \times 10^{- 2}$	$1.85 \times 10^{- 2}$	1.84	1.11
0.71	6912	(1.96)	(2.08)	(2.04)	(1.28)	(2.04)	(2.14)	1.84	1.11
0.35	27648	$4.78 \times 10^{- 3}$	$7.49 \times 10^{- 3}$	$2.94 \times 10^{- 2}$	$3.10 \times 10^{- 6}$	$6.52 \times 10^{- 3}$	$3.69 \times 10^{- 3}$	1.57	1.08
0.35	27648	(1.77)	(2.00)	(1.90)	(1.40)	(1.85)	(2.33)	1.57	1.08
0.18	110592	$1.55 \times 10^{- 3}$	$2.15 \times 10^{- 3}$	$9.04 \times 10^{- 3}$	$1.13 \times 10^{- 6}$	$2.04 \times 10^{- 3}$	$6.62 \times 10^{- 4}$	1.39	1.06
0.18	110592	(1.63)	(1.80)	(1.70)	(1.46)	(1.67)	(2.48)	1.39	1.06
$p = 3$ & $q = 2$
1.41	2880	$2.57 \times 10^{- 2}$	$5.65 \times 10^{- 2}$	$2.00 \times 10^{- 1}$	$1.26 \times 10^{- 5}$	$3.69 \times 10^{- 2}$	$4.28 \times 10^{- 2}$	2.20	1.14
0.71	11520	$7.54 \times 10^{- 3}$	$1.35 \times 10^{- 2}$	$5.31 \times 10^{- 2}$	$4.33 \times 10^{- 6}$	$1.03 \times 10^{- 2}$	$8.70 \times 10^{- 3}$	1.79	1.10
0.71	11520	(1.77)	(2.06)	(1.91)	(1.54)	(1.84)	(2.30)	1.79	1.10
0.35	46080	$2.51 \times 10^{- 3}$	$3.76 \times 10^{- 3}$	$1.68 \times 10^{- 2}$	$1.53 \times 10^{- 6}$	$3.34 \times 10^{- 3}$	$1.73 \times 10^{- 3}$	1.50	1.06
0.35	46080	(1.59)	(1.85)	(1.66)	(1.50)	(1.63)	(2.33)	1.50	1.06
0.18	184320	$8.71 \times 10^{- 4}$	$1.18 \times 10^{- 3}$	$5.64 \times 10^{- 3}$	$5.42 \times 10^{- 7}$	$1.14 \times 10^{- 3}$	$3.09 \times 10^{- 4}$	1.36	1.05
0.18	184320	(1.53)	(1.67)	(1.57)	(1.50)	(1.55)	(2.48)	1.36	1.05

Open in a new tab

Finally, we note that the experimental orders of convergence EoC in Tables 1–4) of the error $\tilde{R} (u_{h}^{τ})$ and its estimate $η$ are $O (h^{p})$ for the choice (21b) of the scaling parameter $d_{K, m}$ but only $O (h^{p - 1})$ for the choice (21a). This follows from the fact that $τ_{m} ≪ h_{K}$ for the computations of the Barenblatt problem and then the dominant part of $d_{K, m}$ is $τ_{m}^{- 2} T {∥\frac{d ϑ}{d u}∥}_{K, m, \infty}$ , cf. (21a), which implies that $d_{K, m} = O (h^{0})$ (the time step is the same for all computations). The dominant part of the error estimator is $η_{S, K, m}$ , hence if ${∥σ_{h}^{τ} - σ (u_{h}^{τ}, \nabla u_{h}^{τ})∥}_{K, m} = O (h^{p})$ then $η_{S, K, m} = O (h^{p - 1})$ , cf. (42b). Nevertheless, comparing the pairs of Tables 1–2 and Tables 3–4, we found that the effectivity indexes are practically independent of the choice of $d_{K, m}$ .

Table 2.

Barenblatt problem (94)–(95), $m = 0.25$ , scaling parameter $d_{K, m}$ given by (21b), approximation of the error and the error estimators, EOC in parenthesis

h	$# DoF$	$\tilde{R} (u_{h}^{τ})$	$η$	$J (u_{h}^{τ})$	$η_{R}$	$η_{S}$	$η_{T}$	$i_{eff}$	$i_{eff}^{tot}$
$p = 1$ & $q = 1$
1.41	864	$2.34 \times 10^{- 1}$	$4.06 \times 10^{- 1}$	1.12	$2.14 \times 10^{- 3}$	$3.39 \times 10^{- 1}$	$2.21 \times 10^{- 1}$	1.73	1.13
0.71	3456	$1.02 \times 10^{- 1}$	$1.79 \times 10^{- 1}$	$5.14 \times 10^{- 1}$	$1.07 \times 10^{- 3}$	$1.62 \times 10^{- 1}$	$7.50 \times 10^{- 2}$	1.76	1.13
0.71	3456	(1.20)	(1.18)	(1.12)	(1.00)	(1.07)	(1.56)	1.76	1.13
0.35	13824	$3.47 \times 10^{- 2}$	$7.26 \times 10^{- 2}$	$2.05 \times 10^{- 1}$	$5.96 \times 10^{- 4}$	$7.14 \times 10^{- 2}$	$1.08 \times 10^{- 2}$	2.09	1.16
0.35	13824	(1.55)	(1.30)	(1.33)	(0.84)	(1.18)	(2.80)	2.09	1.16
0.18	55296	$1.53 \times 10^{- 2}$	$3.54 \times 10^{- 2}$	$9.80 \times 10^{- 2}$	$3.14 \times 10^{- 4}$	$3.51 \times 10^{- 2}$	$1.12 \times 10^{- 3}$	2.31	1.18
0.18	55296	(1.18)	(1.04)	(1.07)	(0.93)	(1.02)	(3.26)	2.31	1.18
0.09	221184	$7.63 \times 10^{- 3}$	$1.76 \times 10^{- 2}$	$4.85 \times 10^{- 2}$	$1.59 \times 10^{- 4}$	$1.75 \times 10^{- 2}$	$1.44 \times 10^{- 4}$	2.31	1.18
0.09	221184	(1.01)	(1.00)	(1.01)	(0.98)	(1.00)	(2.97)	2.31	1.18
$p = 2$ & $q = 2$
1.41	1728	$5.73 \times 10^{- 2}$	$9.73 \times 10^{- 2}$	$3.68 \times 10^{- 1}$	$1.56 \times 10^{- 4}$	$8.83 \times 10^{- 2}$	$4.08 \times 10^{- 2}$	1.70	1.09
0.71	6912	$1.62 \times 10^{- 2}$	$2.60 \times 10^{- 2}$	$1.10 \times 10^{- 1}$	$6.40 \times 10^{- 6}$	$2.43 \times 10^{- 2}$	$9.24 \times 10^{- 3}$	1.60	1.08
0.71	6912	(1.82)	(1.91)	(1.74)	(4.61)	(1.86)	(2.14)	1.60	1.08
0.35	27648	$4.53 \times 10^{- 3}$	$6.06 \times 10^{- 3}$	$3.02 \times 10^{- 2}$	$3.81 \times 10^{- 6}$	$5.97 \times 10^{- 3}$	$1.01 \times 10^{- 3}$	1.34	1.04
0.35	27648	(1.84)	(2.10)	(1.87)	(0.75)	(2.02)	(3.20)	1.34	1.04
0.18	110292	$1.22 \times 10^{- 3}$	$1.51 \times 10^{- 3}$	$7.96 \times 10^{- 3}$	$2.28 \times 10^{- 6}$	$1.51 \times 10^{- 3}$	$8.56 \times 10^{- 5}$	1.23	1.03
0.18	110292	(1.89)	(2.00)	(1.92)	(0.74)	(1.99)	(3.55)	1.23	1.03
$p = 3$ & $q = 2$
1.41	2880	$1.76 \times 10^{- 2}$	$3.22 \times 10^{- 2}$	$1.04 \times 10^{- 1}$	$1.22 \times 10^{- 5}$	$2.59 \times 10^{- 2}$	$1.90 \times 10^{- 2}$	1.83	1.12
0.71	11520	$2.83 \times 10^{- 3}$	$4.20 \times 10^{- 3}$	$2.02 \times 10^{- 2}$	$4.85 \times 10^{- 6}$	$3.92 \times 10^{- 3}$	$1.51 \times 10^{- 3}$	1.49	1.06
0.71	11520	(2.64)	(2.94)	(2.37)	(1.33)	(2.73)	(3.66)	1.49	1.06
0.35	46080	$4.42 \times 10^{- 4}$	$5.68 \times 10^{- 4}$	$2.87 \times 10^{- 3}$	$3.70 \times 10^{- 6}$	$5.59 \times 10^{- 4}$	$8.38 \times 10^{- 5}$	1.29	1.04
0.35	46080	(2.68)	(2.89)	(2.81)	(0.39)	(2.81)	(4.17)	1.29	1.04
0.18	184320	$4.71 \times 10^{- 5}$	$7.28 \times 10^{- 5}$	$3.30 \times 10^{- 4}$	$2.27 \times 10^{- 6}$	$7.10 \times 10^{- 5}$	$2.97 \times 10^{- 6}$	1.55	1.07
0.18	184320	(3.23)	(2.97)	(3.12)	(0.70)	(2.98)	(4.82)	1.55	1.07

Open in a new tab

Table 3.

Barenblatt problem (96)–(97), $m = 2$ , scaling parameter $d_{K, m}$ given by (21a), approximation of the error and the error estimators, EOC in parenthesis

h	$# DoF$	$\tilde{R} (u_{h}^{τ})$	$η$	$J (u_{h}^{τ})$	$η_{R}$	$η_{S}$	$η_{T}$	$i_{eff}$	$i_{eff}^{tot}$
$p = 1$ & $q = 1$
1.41	864	$3.35 \times 10^{- 3}$	$5.76 \times 10^{- 3}$	$1.59 \times 10^{- 2}$	$6.44 \times 10^{- 7}$	$4.58 \times 10^{- 3}$	$3.50 \times 10^{- 3}$	1.72	1.13
0.71	3456	$1.71 \times 10^{- 3}$	$3.19 \times 10^{- 3}$	$7.67 \times 10^{- 3}$	$5.90 \times 10^{- 7}$	$2.74 \times 10^{- 3}$	$1.64 \times 10^{- 3}$	1.87	1.16
0.71	3456	(0.97)	(0.85)	(1.05)	(0.13)	(0.74)	(1.09)	1.87	1.16
0.35	13824	$1.09 \times 10^{- 3}$	$2.27 \times 10^{- 3}$	$4.57 \times 10^{- 3}$	$4.81 \times 10^{- 7}$	$2.17 \times 10^{- 3}$	$6.69 \times 10^{- 4}$	2.07	1.21
0.35	13824	(0.64)	(0.49)	(0.75)	(0.29)	(0.34)	(1.29)	2.07	1.21
0.18	55296	$9.17 \times 10^{- 4}$	$2.04 \times 10^{- 3}$	$3.60 \times 10^{- 3}$	$3.62 \times 10^{- 7}$	$2.02 \times 10^{- 3}$	$2.41 \times 10^{- 4}$	2.22	1.25
0.18	55296	(0.26)	(0.15)	(0.34)	(0.41)	(0.10)	(1.48)	2.22	1.25
$p = 2$ & $q = 2$
1.41	1728	$6.33 \times 10^{- 4}$	$1.27 \times 10^{- 3}$	$4.52 \times 10^{- 3}$	$1.98 \times 10^{- 7}$	$9.68 \times 10^{- 4}$	$8.18 \times 10^{- 4}$	2.00	1.12
0.71	6912	$3.26 \times 10^{- 4}$	$5.98 \times 10^{- 4}$	$2.20 \times 10^{- 3}$	$1.63 \times 10^{- 7}$	$4.70 \times 10^{- 4}$	$3.71 \times 10^{- 4}$	1.84	1.11
0.71	6912	(0.96)	(1.08)	(1.04)	(0.28)	(1.04)	(1.14)	1.84	1.11
0.35	27648	$1.91 \times 10^{- 4}$	$2.99 \times 10^{- 4}$	$1.17 \times 10^{- 3}$	$1.24 \times 10^{- 7}$	$2.60 \times 10^{- 4}$	$1.47 \times 10^{- 4}$	1.57	1.08
0.35	27648	(0.77)	(1.00)	(0.91)	(0.40)	(0.85)	(1.33)	1.57	1.08
0.18	110592	$1.23 \times 10^{- 4}$	$1.71 \times 10^{- 4}$	$7.18 \times 10^{- 4}$	$8.96 \times 10^{- 8}$	$1.63 \times 10^{- 4}$	$5.26 \times 10^{- 5}$	1.39	1.06
0.18	110592	(0.63)	(0.81)	(0.71)	(0.47)	(0.68)	(1.48)	1.39	1.06
$p = 3$ & $q = 2$
1.41	2880	$2.57 \times 10^{- 4}$	$5.65 \times 10^{- 4}$	$2.00 \times 10^{- 3}$	$1.26 \times 10^{- 7}$	$3.69 \times 10^{- 4}$	$4.28 \times 10^{- 4}$	2.20	1.14
0.71	11520	$1.51 \times 10^{- 4}$	$2.70 \times 10^{- 4}$	$1.06 \times 10^{- 3}$	$8.66 \times 10^{- 8}$	$2.07 \times 10^{- 4}$	$1.74 \times 10^{- 4}$	1.79	1.10
0.71	11520	(0.77)	(1.06)	(0.91)	(0.54)	(0.84)	(1.30)	1.79	1.10
0.35	46080	$1.00 \times 10^{- 4}$	$1.50 \times 10^{- 4}$	$6.70 \times 10^{- 4}$	$6.12 \times 10^{- 8}$	$1.33 \times 10^{- 4}$	$6.91 \times 10^{- 5}$	1.50	1.06
0.35	46080	(0.59)	(0.85)	(0.66)	(0.50)	(0.63)	(1.33)	1.50	1.06
0.18	184320	$6.92 \times 10^{- 5}$	$9.37 \times 10^{- 5}$	$4.48 \times 10^{- 4}$	$4.30 \times 10^{- 8}$	$9.04 \times 10^{- 5}$	$2.46 \times 10^{- 5}$	1.35	1.05
0.18	184320	(0.53)	(0.68)	(0.58)	(0.51)	(0.56)	(1.49)	1.35	1.05

Open in a new tab

Justification of the Algebraic Stopping Criterion (91)

We present the numerical study of the stopping criterion (91) which is used in the iterative solution of algebraic systems given by (13). We consider again the Barenblatt problem (94)–(95) with $m = 0.25$ and (96)–(97) with $m = 2$ . The user-dependent constant $c_{A}$ in (91) has been chosen as $10^{- 1}$ , $10^{- 2}$ , $10^{- 3}$ and $10^{- 4}$ . Tables 5 and 6 show the estimators $η$ , $J (u_{h}^{τ})$ , $η_{alg}$ and $η_{alg}$ , cf. (90), for selected meshes and polynomial approximation degrees and the scaling parameter $d_{K, m}$ chosen by (21a).

Table 5.

Barenblatt problem (94)–(95), $m = 0.25$ , scaling parameter $d_{K, m}$ given by (21a), numerical study of the algebraic stopping criterion (91)

$c_{A}$	$η$	$J (u_{h}^{τ})$	$η_{alg}$	$η_{spa}$	$N_{non}$	$N_{lin}$	time(s)
$h = 0.35$ , $p = 1$ & $q = 1$ , $# DoF = 13824$
$1.0 \times 10^{- 1}$	$1.2475 \times 10^{- 3}$	$3.0760 \times 10^{- 3}$	$8.1322 \times 10^{- 4}$	$1.3804 \times 10^{- 2}$	202	14148	422.1
$1.0 \times 10^{- 2}$	$1.0559 \times 10^{- 3}$	$2.9565 \times 10^{- 3}$	$7.6470 \times 10^{- 5}$	$1.3483 \times 10^{- 2}$	362	21589	606.8
$1.0 \times 10^{- 3}$	$1.0435 \times 10^{- 3}$	$2.9468 \times 10^{- 3}$	$7.9268 \times 10^{- 6}$	$1.3458 \times 10^{- 2}$	529	26545	693.6
$1.0 \times 10^{- 4}$	$1.0423 \times 10^{- 3}$	$2.9457 \times 10^{- 3}$	$7.3279 \times 10^{- 7}$	$1.3456 \times 10^{- 2}$	579	27766	705.1
$h = 0.35$ , $p = 2$ & $q = 2$ , $# DoF = 27648$
$1.0 \times 10^{- 1}$	$1.0443 \times 10^{- 4}$	$4.3586 \times 10^{- 4}$	$5.7369 \times 10^{- 5}$	$1.1019 \times 10^{- 3}$	406	10581	1968.3
$1.0 \times 10^{- 2}$	$8.8249 \times 10^{- 5}$	$4.3375 \times 10^{- 4}$	$6.1984 \times 10^{- 6}$	$1.0961 \times 10^{- 3}$	536	12059	2119.4
$1.0 \times 10^{- 3}$	$8.7054 \times 10^{- 5}$	$4.3350 \times 10^{- 4}$	$6.0680 \times 10^{- 7}$	$1.0956 \times 10^{- 3}$	576	12541	2030.1
$1.0 \times 10^{- 4}$	$8.6948 \times 10^{- 5}$	$4.3347 \times 10^{- 4}$	$5.1172 \times 10^{- 8}$	$1.0955 \times 10^{- 3}$	618	13580	2132.1
$h = 0.35$ , $p = 3$ & $q = 2$ , $# DoF = 46080$
$1.0 \times 10^{- 1}$	$9.9098 \times 10^{- 6}$	$4.1201 \times 10^{- 5}$	$5.3825 \times 10^{- 6}$	$1.0693 \times 10^{- 4}$	534	10480	6610.2
$1.0 \times 10^{- 2}$	$8.2946 \times 10^{- 6}$	$4.1156 \times 10^{- 5}$	$6.0342 \times 10^{- 7}$	$1.0670 \times 10^{- 4}$	602	11479	6998.3
$1.0 \times 10^{- 3}$	$8.1647 \times 10^{- 6}$	$4.1150 \times 10^{- 5}$	$4.5285 \times 10^{- 8}$	$1.0669 \times 10^{- 4}$	636	12288	7181.7
$1.0 \times 10^{- 4}$	$8.1577 \times 10^{- 6}$	$4.1150 \times 10^{- 5}$	$4.5439 \times 10^{- 9}$	$1.0669 \times 10^{- 4}$	668	13178	7566.5

Open in a new tab

Table 6.

Barenblatt problem (96)–(97), $m = 2$ , scaling parameter $d_{K, m}$ given by (21a), numerical study of the algebraic stopping criterion (91)

$c_{A}$	$η$	$J (u_{h}^{τ})$	$η_{alg}$	$η_{spa}$	$N_{non}$	$N_{lin}$	time(s)
$h = 0.35$ , $p = 1$ & $q = 1$ , $# DoF = 13824$
$1.0 \times 10^{- 1}$	$2.3055 \times 10^{- 3}$	$4.6659 \times 10^{- 3}$	$4.0757 \times 10^{- 4}$	$9.3084 \times 10^{- 3}$	100	2199	224.6
$1.0 \times 10^{- 2}$	$2.2689 \times 10^{- 3}$	$4.5703 \times 10^{- 3}$	$1.3712 \times 10^{- 5}$	$9.2211 \times 10^{- 3}$	200	4957	284.5
$1.0 \times 10^{- 3}$	$2.2688 \times 10^{- 3}$	$4.5694 \times 10^{- 3}$	$3.5878 \times 10^{- 6}$	$9.2216 \times 10^{- 3}$	299	7715	352.6
$1.0 \times 10^{- 4}$	$2.2688 \times 10^{- 3}$	$4.5696 \times 10^{- 3}$	$3.9773 \times 10^{- 7}$	$9.2220 \times 10^{- 3}$	378	9790	413.7
$h = 0.35$ , $p = 2$ & $q = 2$ , $# DoF = 27648$
$1.0 \times 10^{- 1}$	$3.0332 \times 10^{- 4}$	$1.1855 \times 10^{- 3}$	$7.0279 \times 10^{- 5}$	$1.7702 \times 10^{- 3}$	201	4065	1535.3
$1.0 \times 10^{- 2}$	$2.9940 \times 10^{- 4}$	$1.1753 \times 10^{- 3}$	$7.2859 \times 10^{- 6}$	$1.7634 \times 10^{- 3}$	286	5820	1675.9
$1.0 \times 10^{- 3}$	$2.9916 \times 10^{- 4}$	$1.1747 \times 10^{- 3}$	$8.0717 \times 10^{- 7}$	$1.7619 \times 10^{- 3}$	393	7803	1759.7
$1.0 \times 10^{- 4}$	$2.9916 \times 10^{- 4}$	$1.1747 \times 10^{- 3}$	$8.9211 \times 10^{- 8}$	$1.7620 \times 10^{- 3}$	529	10172	1984.6
$h = 0.35$ , $p = 3$ & $q = 2$ , $# DoF = 46080$
$1.0 \times 10^{- 1}$	$1.5523 \times 10^{- 4}$	$6.8813 \times 10^{- 4}$	$7.1710 \times 10^{- 5}$	$1.1705 \times 10^{- 3}$	202	4222	5539.6
$1.0 \times 10^{- 2}$	$1.5037 \times 10^{- 4}$	$6.6961 \times 10^{- 4}$	$5.7493 \times 10^{- 6}$	$1.1586 \times 10^{- 3}$	316	6538	6068.2
$1.0 \times 10^{- 3}$	$1.5026 \times 10^{- 4}$	$6.6968 \times 10^{- 4}$	$6.3387 \times 10^{- 7}$	$1.1580 \times 10^{- 3}$	453	8880	6615.7
$1.0 \times 10^{- 4}$	$1.5026 \times 10^{- 4}$	$6.6967 \times 10^{- 4}$	$5.9895 \times 10^{- 8}$	$1.1580 \times 10^{- 3}$	591	11150	7101.3

Open in a new tab

Additionally, we present the total number of steps of the Newton-like solver $N_{non}$ , the total number of GMRES iterations $N_{lin}$ and the computational time in seconds. The computational time has only an informative character.

We observe that the error estimators $η$ , $J (u_{h}^{τ})$ and also $η_{spa}$ converge to the limit values for decreasing $c_{A}$ in (91) which mimic the case when the algebraic errors are negligible. Moreover, the relative differences between the actual values $η$ and $J (u_{h}^{τ})$ and their limits correspond more or less to the value of $c_{A}$ . Obviously, smaller values of $c_{A}$ cause prolongation of the computational time, due to a higher number of iterations, with a negligible effect on accuracy. Thus, the choice $c_{A} = 10^{- 2}$ seems to be optimal in order to balance accuracy and efficiency.

The presented numerical experiments indicate that the estimator $η_{spa} (u_{h}^{τ})$ gives an upper bound of $R (u_{h}^{τ})$ , however, this observation is not supported by the theory. The quantity $η_{spa} (u_{h}^{τ})$ is used only in the stopping criterion (91).

Tracy Problem

Tracy problem represents a standard benchmark, where the analytical solutions of the Richards equation are available [35]. We consider the Gardners constitutive relations [26]

\begin{matrix} K (u) = \{\begin{matrix} K_{s} exp (- α ψ) & if ψ > 0 \\ K_{s} & if ψ \leq 0 \end{matrix}), ϑ (u) = \{\begin{matrix} θ_{r} + (θ_{s} - θ_{r}) exp (- α ψ) & if ψ > 0 \\ θ_{s} & if ψ \leq 0 \end{matrix}) \end{matrix}

where $ψ = u - z$ is the pressure head, z is the vertical coordinate and the material parameters $K_{s} = 1.2 I$ , $θ_{s} = 0.5$ , $θ_{r} = 0.0$ , and $α = 0.1$ are the isotropic conductivity, saturated water content, residual water content, and the soil index parameter related to the pore-size distribution, respectively.

The computational domain is $Ω = {(0, 1)}^{2}$ , the initial condition is set $u = u_{r} : = - 10$ in $Ω$ where $u_{r}$ corresponds to the hydraulic head when the porous medium is dry. On the top part of the boundary $Γ_{1} : = {(x, z), x \in (0, 1), z = 1}$ , we prescribe the boundary condition

\begin{matrix} u (x) = \frac{1}{α} log (exp (α u_{r}) + (1 - exp (α u_{r}) sin (π x)), x \in (0, 1) \end{matrix}

100

and on the rest of boundary $Γ$ we set $u = u_{r}$ . We note that this benchmark poses an inconsistency between the initial and boundary conditions on $Γ_{1}$ . Hence, the most challenging part is the computation close to $t = 0$ . In order to avoid the singularity at $t = 0$ , we investigate the error only on the interval $t \in [1.0 \times 10^{- 5}, 1.1 \times 10^{- 4}]$ with the fixed time step $τ$ is $1.0 \times 10^{- 6}$ .

We perform a computation using a sequence of uniform triangular grids with several combinations of polynomial approximation degrees and the choice (21b), the results are shown in Table 7. We observe reasonable values of the effectivity indices except for the finest grids and the higher degrees of polynomial approximation, where the effectivity indices $i_{eff}$ are below 1. Based on the values of EoC, we suppose that $i_{eff}$ below 1 is not caused by the failure of the error estimator but due to an inaccurate approximation $\tilde{R} (u_{h}^{τ})$ of the exact error; see Remark 3.

Table 7.

Tracy problem scaling parameter $d_{K, m}$ given by (21b), approximation of the error and the error estimators, EOC in parenthesis

h	$# DoF$	$\tilde{R} (u_{h}^{τ})$	$η$	$J (u_{h}^{τ})$	$η_{R}$	$η_{S}$	$η_{T}$	$i_{eff}$	$i_{eff}^{tot}$
$p = 1$ & $q = 1$
0.18	384	$2.43 \times 10^{- 1}$	$2.95 \times 10^{- 1}$	$6.62 \times 102$	$2.25 \times 10^{- 3}$	$2.90 \times 10^{- 1}$	$4.83 \times 10^{- 2}$	1.22	1.00
0.09	1536	$8.77 \times 10^{- 2}$	$1.19 \times 10^{- 1}$	$2.47 \times 102$	$9.92 \times 10^{- 4}$	$1.10 \times 10^{- 1}$	$4.36 \times 10^{- 2}$	1.35	1.00
0.09	1536	(1.47 )	(1.32)	(1.43)	(1.18)	(1.40)	(0.15)	1.35	1.00
0.04	6144	$1.50 \times 10^{- 2}$	$2.51 \times 10^{- 2}$	$4.68 \times 101$	$1.33 \times 10^{- 4}$	$2.39 \times 10^{- 2}$	$7.31 \times 10^{- 3}$	1.67	1.00
0.04	6144	(2.55)	(2.24)	(2.40)	(2.90)	(2.20)	(2.58)	1.67	1.00
0.02	24576	$7.34 \times 10^{- 3}$	$1.22 \times 10^{- 2}$	$2.27 \times 101$	$8.11 \times 10^{- 5}$	$1.20 \times 10^{- 2}$	$1.86 \times 10^{- 3}$	1.66	1.00
0.02	24576	(1.03)	(1.04)	(1.05)	(0.71)	(0.99)	(1.98)	1.66	1.00
$p = 2$ & $q = 2$
0.18	768	$4.88 \times 10^{- 2}$	$6.18 \times 10^{- 2}$	$1.92 \times 102$	$3.35 \times 10^{- 4}$	$6.04 \times 10^{- 2}$	$1.22 \times 10^{- 2}$	1.27	1.00
0.09	3072	$1.75 \times 10^{- 2}$	$1.98 \times 10^{- 2}$	$6.03 \times 101$	$1.44 \times 10^{- 5}$	$1.95 \times 10^{- 2}$	$3.35 \times 10^{- 3}$	1.13	1.00
0.09	3072	(1.48)	(1.65)	(1.67)	(4.54)	(1.63)	(1.86)	1.13	1.00
0.04	12288	$5.59 \times 10^{- 3}$	$6.28 \times 10^{- 3}$	$1.94 \times 101$	$4.68 \times 10^{- 6}$	$6.08 \times 10^{- 3}$	$1.56 \times 10^{- 3}$	1.12	1.00
0.04	12288	(1.64)	(1.65)	(1.64)	(1.62)	(1.68)	(1.11)	1.12	1.00
0.02	49,152	$1.90 \times 10^{- 3}$	$1.33 \times 10^{- 3}$	4.37	$2.41 \times 10^{- 6}$	$1.31 \times 10^{- 3}$	$1.87 \times 10^{- 4}$	0.70	1.00
0.02	49,152	(1.56)	(2.24)	(2.15)	(0.96)	(2.21)	(3.06)	0.70	1.00
$p = 3$ & $q = 2$
0.18	1280	$2.24 \times 10^{- 2}$	$2.64 \times 10^{- 2}$	$9.60 \times 101$	$2.42 \times 10^{- 5}$	$2.61 \times 10^{- 2}$	$4.48 \times 10^{- 3}$	1.18	1.00
0.09	5120	$6.26 \times 10^{- 3}$	$7.94 \times 10^{- 3}$	$2.77 \times 101$	$1.21 \times 10^{- 5}$	$7.13 \times 10^{- 3}$	$3.48 \times 10^{- 3}$	1.27	1.00
0.09	5120	(1.84)	(1.74)	(1.79)	(1.00)	(1.87)	(0.37)	1.27	1.00
0.04	20480	$1.40 \times 10^{- 3}$	$4.60 \times 10^{- 4}$	1.63	$2.87 \times 10^{- 6}$	$4.49 \times 10^{- 4}$	$8.90 \times 10^{- 5}$	0.33	1.00
0.04	20480	(2.16)	(4.11)	(4.08)	(2.08)	(3.99)	(5.29)	0.33	1.00
0.02	81920	$1.37 \times 10^{- 3}$	$8.85 \times 10^{- 5}$	$3.08 \times 10^{- 1}$	$2.37 \times 10^{- 6}$	$8.59 \times 10^{- 5}$	$1.09 \times 10^{- 5}$	0.06	1.00
0.02	81920	(0.03)	(2.38)	(2.41)	(0.28)	(2.39)	(3.03)	0.06	1.00

Open in a new tab

Mesh Adaptive Algorithm

We introduce the mesh adaptive algorithm which is based on the a posteriori error estimates $η$ , cf. (41). Let $δ > 0$ be the given tolerance, the goal of the algorithm is to define the sequence of time steps $τ_{m}$ , meshes $T_{h}^{m}$ and spaces $S_{h p, m}$ , $m = 1, \dots, r$ such that the corresponding approximate solution $u_{h}^{τ} \in S_{hp}^{τ q}$ given by (13) satisfies the condition

\begin{matrix} η = η (u_{h}^{τ}) \leq δ . \end{matrix}

101

Another possibility is to require ${(η^{2} + J (u_{h}^{τ}))}^{1 / 2} \leq δ$ , then the following considerations have to be modified appropriately.

The mesh adaptation strategy is built on the equi-distribution principle, namely the sequences ${τ_{m}, T_{h}^{m}, S_{h p, m}}_{m = 1}^{r}$ should be generated such that

\begin{matrix} η_{m} & \leq δ_{m} : = δ \sqrt{τ_{m} / T} \forall m = 1, \dots, r, \end{matrix}

102a

\begin{matrix} η_{K, m} & \leq δ_{K, m} : = δ_{m} \sqrt{1 / # T_{h}^{m}} \forall K \in T_{h}^{m} \forall m = 1, \dots, r, \end{matrix}

102b

where $η_{m} : = {(\sum_{K \in T_{h}^{m}} η_{K, m}^{2})}^{1 / 2}$ is the error estimate corresponding to the time interval $I_{m}$ , $m = 1, \dots, r$ and $# T_{h}^{m}$ denotes the number of elements of $T_{h}^{m}$ . Obviously, if all the conditions in (102) are valid, then the criterion (101) is achieved.

Based on (101)–(102), we introduce the abstract Algorithm 1. The size of $τ_{m}$ , $m = 1, \dots, r$ (step 8 of the algorithm) are chosen to equilibrate estimates of the spatial and temporal reconstruction, $η_{S, m} : = {(\sum_{K \in T_{h}^{m}} {(η_{S, K, m})}^{2})}^{1 / 2}$ and $η_{T, m} : = {(\sum_{K \in T_{h}^{m}} {(η_{T, K, m})}^{2})}^{1 / 2}$ , cf. (42). Particularly, we set the new time step according to the formula

\begin{matrix} τ_{m + 1} = τ_{m} c_{F} {(\frac{η_{S, m}}{η_{T, m}})}^{1 / (q + 1)}, m = 1, \dots, r, \end{matrix}

103

where $c_{F} \in (0, 1)$ is the security factor and $q \geq 0$ is the polynomial degree with respect to time. Therefore, $q + 1$ corresponds to the temporal order of convergence.

The construction of the new mesh (step 11 in Algorithm 1) is based on the modification of the anisotropic hp-mesh adaptation method from [15, 20]. Having the actual mesh $T_{h}^{m}$ , for each $K \in T_{h}^{m}$ we set the new volume of K according the formula

\begin{matrix} ν_{K} = | K | Λ (δ_{K, m} / η_{K, m}), K \in T_{h}^{m}, \end{matrix}

104

where $δ_{K, m}$ is the local tolerance from (102b), |K| is the volume of |K| and $Λ : R^{+} \to R +$ is a suitable increasing function such that $Λ (1) = 1$ . For particular variants of $Λ$ , we refer to [15, 20].

When the new volume of mesh elements is established by (104), the new shape of K and a new polynomial approximation degree $p_{K}$ are optimized by minimizing the interpolation error. This optimization is done locally for each mesh element. In one adaptation level, we admit the increase or decrease of $p_{K}$ by one. Setting the new area, shape, and polynomial approximation degree for each element of the current mesh, we define the continuous mesh model [16] and carry out a remeshing using the code ANGENER [9].

The generated meshes are completely non-nested and non-matching, hence the evaluation of the time-penalty term (cf. Remark 1) is delicate. We refer to [20] where this aspect is described in detail and numerically verified. The presented numerical analysis takes into account the errors arising from the re-meshing in the temporal reconstruction $R_{h}^{τ}$ , which contains term ${ϑ (u_{h}^{τ})}_{m - 1}$ , cf. (26). The following numerical experiments show that the error estimator is under the control also after each remeshing.

Barenblatt Problem

We apply Algorithm 1 to the Barenblatt problem (96) with $m = 2$ . Table 8 shows the error estimators obtained by adaptive computation for three different tolerances $δ$ . Compared with the error estimators from Table 4, we observe that the adaptive computations achieve significantly smaller error estimates using a significantly smaller number of degrees of freedom. We note that we are not able to present the quantity $\tilde{R}$ (cf. (92)–(93)) approximating the error since the finite element code used for the evaluation of $\tilde{R}$ supports only uniform grids.

Table 8.

Barenblatt problem (96)–(97), scaling parameter $d_{K, m}$ given by (21b), the error estimators obtained by the adaptive computations using Algorithm 1

hp adaptation
$δ$	$# DoF$	$η$	$J {(u_{h}^{τ})}^{1 / 2}$	$η_{R}$	$η_{S}$	$η_{T}$
2.0E−03	4 543	$7.82 \times 10^{- 4}$	$1.69 \times 10^{- 3}$	$2.18 \times 10^{- 4}$	$5.40 \times 10^{- 4}$	$3.23 \times 10^{- 4}$
1.0E−03	6 244	$4.57 \times 10^{- 4}$	$1.17 \times 10^{- 3}$	$1.48 \times 10^{- 4}$	$3.13 \times 10^{- 4}$	$1.43 \times 10^{- 4}$
5.0E−04	9 071	$2.10 \times 10^{- 4}$	$7.02 \times 10^{- 4}$	$6.75 \times 10^{- 5}$	$1.38 \times 10^{- 4}$	$7.79 \times 10^{- 5}$

Open in a new tab

The quantity $# DoF$ is the average number of space degrees of freedom per one time step

Figure 1 shows the performance of Algorithm 1, where each dot corresponds to one time step $m = 1, \dots, r$ . We plot the values of the accumulated estimators ${\bar{η}}_{m} = \sum_{i = 1}^{m} η_{i}$ for all $m = 1, \dots, r$ . The red nodes correspond to all computed time steps, including the rejected ones (steps 11–12 of Algorithm 1) whereas the blue nodes mark only the accepted time steps. The rejected time steps indicate the re-meshing. Moreover, we plot the “accumulated” tolerance $δ {(t_{m} / T)}^{1 / 2}$ , cf. (101) and (102a). We observe that the resulting estimator $η$ at $t = T$ is below the tolerance $δ$ by a factor of approximately 2.5 since conditions (102) are stronger than (101).

Fig. 1 — Barenblatt problem, (96)–(97), $m = 2$ , performance of Algorithm 1, accumulated error estimator ${\bar{η}}_{m}$ and the “accumulated” tolerance $δ {(t_{m} / T)}^{1 / 2}$ for $m = 1, \dots, r$

Figure 2, left, shows the hp-mesh obtained by Algorithm 1 at the final time $T = 1$ , each triangle is highlighted by a color corresponding to the polynomial degree used $p_{K}$ , $K \in T_{h}^{m}$ . We observe a strong anisotropic refinement about the circular singularity of the solution when $u \to 0^{+}$ , see the analytical formula (97). Outside of this circle, large triangles with the smallest polynomial degree ( $p = 1$ ) are generated. On the other hand, due to the regularity of the solution in the interior of the circle, the polynomial degrees $p = 2$ or $p = 3$ are generated.

Moreover, Fig. 2, right, shows the error estimator $η_{K, m}$ , $K \in T_{h}^{m}$ at $T = 1$ . The elements in the exterior of the circle have small values of $η_{K, m} \approx 10^{- 17}$ – $10^{- 14}$ due to a constant solution and negligible errors. On the other hand, the values of $η_{K, m}$ for the rest of elements $K \in T_{h}^{m}$ are in the range $10^{- 13}$ – $10^{- 11}$ due to the equidistant principle used.

Single Ring Infiltration

We deal with the numerical solution of the single ring infiltration experiment, which is frequently used for the identification of saturated hydraulic conductivity, cf. [32, 39] for example. We consider the Richards equation (3) where the active pore volume $ϑ$ is given by (2), the water content function $θ$ is given by the van Genuchten’s law [27] and the conductivity $K (u) = K_{s} K_{r} (u)$ is given by the Mualem function [31], namely

\begin{matrix} θ (u) = \{\begin{matrix} \frac{θ_{s} - θ_{r}}{(1 + {({- α ψ)}^{n})}^{m}} + θ_{r} & for ψ < 0, \\ θ_{s} & for ψ \geq 0, \end{matrix}) \\ K_{r} (u) & = \{\begin{matrix} \frac{{(1 - {(- α ψ)}^{m n} {(1 + {(- α ψ)}^{n})}^{- m})}^{2}}{{(1 + {(- α ψ)}^{n})}^{m / 2}} & for ψ < 0, \\ 1 & for ψ \geq 0, \end{matrix}) \end{matrix}

105

where $ψ = u - z$ is the pressure head, z is the vertical coordinate and the material parameters $K_{s} = 0.048 I m \cdot {hours}^{- 1}$ , $θ_{s} = 0.55$ , $θ_{r} = 0.0$ , $α = 0.8 m^{- 1}$ , $n = 1.2$ , $m = 1 / 6$ and $S_{s} = 10^{- 3} m^{- 1}$ (cf. (2)).

The computational domain together with the boundary parts is sketched in Fig. 3a. On the boundary part $Γ_{D}$ we set the Dirichlet boundary condition $u = 1.05 m$ , and on $Γ_{N} = Γ \ Γ_{D}$ we consider the homogeneous Neumann boundary condition. The smaller “magenta” vertical lines starting at $Γ_{D}$ belong to $Γ_{N}$ . At $t = 0$ , a dry medium with $u = ψ + z = - 2 m$ is prescribed. We carried out the computation until the physical time $T = 2 hours$ . The inconsistency of the initial and boundary condition on $Γ_{D}$ makes the computation quite difficult for $t \approx 0$ .

Figure 3b verifies the conservativity of the adaptive method. We plot the quantities

\begin{matrix} F (t) & = \int_{0}^{t} \int_{Γ} K (u) \nabla u \cdot n d S d t, \\ Δ V (t) & = V (t) - V (0), V (t) = \int_{Ω} ϑ (u (\cdot, t)) d x, t \in [0, T], \end{matrix}

106

where F(t) is the total flux of the water through the boundary $Γ$ till time t and $Δ V (t)$ is the changes of the water content in the domain between times 0 and t. From equation (3) and the Stokes theorem, we have the conservation law $F (t) = Δ V (t)$ for all $t \in [0, T]$ . Therefore, we also show the relative difference between these quantities $| F (t) - Δ V (t) | / Δ V (t)$ for $t > 0$ in Fig. 3b the vertical label on the right. We observe that, except for the time close to zero, where the inconsistency between initial and boundary conditions is problematic, the relative difference is at the level of several percent.

Furthermore, Fig. 4 shows the accumulated estimators ${\bar{η}}_{m} = \sum_{i = 1}^{m} η_{i}$ for time levels $t_{m}$ , $m = 1, \dots, m$ . The red nodes correspond to all computed time steps, including the rejected steps whereas the blue line connects only the accepted time steps. The rejected time steps are followed by the remeshing which is carried out namely for small t. We observe that the elimination of the rejected time steps causes that the errors arising from the remeshing do not essentially affect the total error estimate $η$ .

Moreover, Fig. 5 shows the hp-meshes, the hydraulic head and the error estimator $η_{K, m}$ , $K \in T_{h}^{m}$ at selected time levels obtained from Algorithm 1 with $δ = 5.0 \times 10^{- 3}$ . We observe the mesh adaptation namely at the (not sharp) interface between the saturated and non-saturated medium and also in the vicinity of the domain singularities. The error estimators $η_{K, m}$ , $K \in T_{h}^{m}$ indicate an equi-distribution of the error.

Conclusion

We derived reliable and efficient a posteriori error estimates in the residual-based norm for the Richards equation discretized by the space-time discontinuous Galerkin method. The numerical verification indicates the effectivity indexes between 1 and 2.5 for the tested examples. Moreover, we introduced the hp-mesh adaptive method handling varying non-nested and non-matching meshes and demonstrated its efficiency for simple test benchmark and its applicability for the numerical solution of the single ring infiltration experiment.

It will be possible to generalize the presented approach to genuinely space-time hp-adaptive method, where the (local) polynomial order q in time is varied as well. However, the question is of potential benefit. Based on our experience, the setting $q = 1$ gives sufficiently accurate approximation for the majority of tested problems.

On the other hand, the choice $q = 0$ would be sufficient only in subdomains of $Ω$ where the solution is almost constant in time. Therefore, we suppose that the benefit of local varying of polynomial order in time will be low.

Although the presented numerical examples are two-dimensional, it would be possible to apply the presented error estimates and mesh adaptation to three-dimensional problems as well. We refer, e.g., to [1] and the references therein, where the anisotropic mesh adaptation techniques are developed for time-dependent 3D problems.

Funding

Open access publishing supported by the National Technical Library in Prague. This work has been supported by the Czech Science Foundation Grant No. 20-01074 S (V.D.), the Charles University grant SVV-2023-260711, and the Grant Agency of Charles University Project No. 28122 (H.S.), European Development Fund-Project “Center for Advanced Aplied Science” No. CZ.02.1.01/0.0/0.0./16 019/0000778 (M.V.). V.D. acknowledges the membership in the Nečas Center for Mathematical Modeling ncmm.karlin.mff.cuni.cz.

Data Availability

No datasets were generated or analysed during the current study.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Alauzet, F., Loseille, A., Olivier, G.: Time-accurate multi-scale anisotropic mesh adaptation for unsteady flows in CFD. J. Comput. Phys. 373, 28–63 (2018) [Google Scholar]
2.Alt, H.W., Luckhaus, S.: Quasilinear elliptic-parabolic differential equations. Math. Z. 183(3), 311–341 (1983) [Google Scholar]
3.Baradi, M., Difonzo, F.V.: Strong solutions for Richards’ equation with Cauchy conditions and constant pressure. Environ. Fluid Mech. 20, 165–174 (2020) [Google Scholar]
4.Barenblatt, G.I.: On some unsteady motions of a liquid and gas in a porous medium. Akad. Nauk SSSR. Prikl. Mat. Meh. 16, 67–78 (1952) [Google Scholar]
5.Baron, V., Coudière, Y., Sochala, P.: Adaptive multistep time discretization and linearization based on a posteriori error estimates for the Richards equation. Appl. Numer. Math. 112, 104–125 (2017) [Google Scholar]
6.Bernardi, C., El Alaoui, L., Mghazli, Z.: A posteriori analysis of a space and time discretization of a nonlinear model for the flow in partially saturated porous media. IMA J. Numer. Anal. 34(3), 1002–1036 (2014) [Google Scholar]
7.Brezzi, F., Fortin, M.: Mixed and Hybrid Finite Element Methods, Springer Series in Computational Mathematics, vol. 15. Springer, New York (1991) [Google Scholar]
8.Di Pietro, D.A., Vohralík, M., Yousef, S.: An a posteriori-based, fully adaptive algorithm with adaptive stopping criteria and mesh refinement for thermal multiphase compositional flows in porous media. Compu. Math. Appl. 68(12, Part B), 2331–2347 (2014) [Google Scholar]
9.Dolejší, V.: ANGENER – Anisotropic mesh generator, in-house code. Charles University, Prague, Faculty of Mathematics and Physics (2000). https://msekce.karlin.mff.cuni.cz/~dolejsi/angen/
10.Dolejší, V.: ADGFEM – Adaptive discontinuous Galerkin finite element method, in-house code. Charles University, Prague, Faculty of Mathematics and Physics (2020). https://msekce.karlin.mff.cuni.cz/~dolejsi/adgfem/
11.Dolejší, V., Ern, A., Vohralík, M.: A framework for robust a posteriori error control in unsteady nonlinear advection-diffusion problems. SIAM J. Numer. Anal. 51(2), 773–793 (2013) [Google Scholar]
12.Dolejší, V., Ern, A., Vohralík, M.: -adaptation driven by polynomial-degree-robust a posteriori error estimates for elliptic problems. SIAM J. Sci. Comput. 38(5), A3220–A3246 (2016) [Google Scholar]
13.Dolejší, V., Feistauer, M.: Discontinuous Galerkin Method: Analysis and Applications to Compressible Flow. Springer Series in Computational Mathematics, vol. 48. Springer, Cham (2015) [Google Scholar]
14.Dolejší, V., Kuráž, M., Solin, P.: Adaptive higher-order space-time discontinuous Galerkin method for the computer simulation of variably-saturated porous media flows. Appl. Math. Model. 72, 276–305 (2019) [Google Scholar]
15.Dolejší, V., May, G.: Anisotropic -Mesh Adaptation Methods. Birkhäuser, Cham (2022) [Google Scholar]
16.Dolejší, V., May, G., Rangarajan, A.: A continuous -mesh model for adaptive discontinuous Galerkin schemes. Appl. Numer. Math. 124, 1–21 (2018) [Google Scholar]
17.Dolejší, V., Roskovec, F., Vlasák, M.: Residual based error estimates for the space-time discontinuous Galerkin method applied to the compressible flows. Comput. Fluids 117, 304–324 (2015) [Google Scholar]
18.Dolejší, V., Roskovec, F., Vlasák, M.: A posteriori error estimates for higher order space-time Galerkin discretizations of nonlinear parabolic problems. SIAM J. Numer. Anal. 59(3), 1486–1509 (2021) [Google Scholar]
19.Dolejší, V., Shin, H.G.: A posteriori error estimate and mesh adaptation for the numerical solution of the richards equation. In: J.M. Melenk, et al. (eds.) Spectral and High Order Methods for Partial Differential Equations ICOSAHOM 2020+1, Lecture Notes in Computational Science and Engineering 137 (2023)
20.Dolejší, V., May, G.: An anisotropic -mesh adaptation method for time-dependent problems based on interpolation error control. J. Sci. Comput. 95(2) (2023)
21.Ern, A., Guermond, J.L.: Theory and Practice of Finite Elements. Springer, Berlin (2004) [Google Scholar]
22.Ern, A., Smears, I., Vohralík, M.: Guaranteed, locally space-time efficient, and polynomial-degree robust a posteriori error estimates for high-order discretizations of parabolic problems. SIAM J. Numer. Anal. 55(6), 2811–2834 (2017) [Google Scholar]
23.Ern, A., Vohralík, M.: Adaptive inexact Newton methods with a posteriori stopping criteria for nonlinear diffusion PDEs. SIAM J. Sci. Comput. 35(4), A1761–A1791 (2013) [Google Scholar]
24.Ern, A., Vohralík, M.: Polynomial-degree-robust a posteriori estimates in a unified setting for conforming, nonconforming, discontinuous Galerkin, and mixed discretizations. SIAM J. Numer. Anal. 53(2), 1058–1081 (2015) [Google Scholar]
25.Farthing, M., Ogden, F.: Numerical solution of Richards’ equation: a review of advances and challenges. Soil Sci. Soc. Am. J. 81(6), 1257–1269 (2017) [Google Scholar]
26.Gardner, W.R.: Some steady state solutions of the unsaturated moisture flow equation with application to evaporation from a water table. Soil Sci. 85, 228–232 (1958) [Google Scholar]
27.van Genuchten, M.T.: Closed-form equation for predicting the hydraulic conductivity of unsaturated soils. Soil Sci. Soc. Am. J. 44(5), 892–898 (1980) [Google Scholar]
28.Kim, I.C., Požár, N.: Nonlinear elliptic-parabolic problems. Arch. Ration. Mech. Anal. 210, 975–1020 (2013) [Google Scholar]
29.Mallik, G., Vohralík, M., Yousef, S.: Goal-oriented a posteriori error estimation for conforming and nonconforming approximations with inexact solvers. J. Comput. Appl. Math. 366 (2020)
30.Mitra, K., Vohralík, M.: A posteriori error estimates for the Richards equation. Tech. Rep. hal-03328944v2, INRIA (2022)
31.Mualem, Y.: A new model for predicting the hydraulic conductivity of unsaturated porous media. Water Resour. Res. 12(3), 513–522 (1976) [Google Scholar]
32.Nakhaei, M., Šimnek, J.: Parameter estimation of soil hydraulic and thermal property functions for unsaturated porous media using the hydrus-2d code. J. Hydrol. Hydromech. 62(1) (2014)
33.Richards, L.A.: Capillary conduction of liquids through porous mediums. J. Appl. Phys. 1(5), 318–333 (1931) [Google Scholar]
34.Rudin, W.: Real and Complex Analysis, 3rd edn. McGraw-Hill, New York (1987) [Google Scholar]
35.Tracy, F.T.: Clean two- and three-dimensional analytical solutions of Richards equation for testing numerical solvers. Water Resour Res 42(8) (2006)
36.Verfürth, R.: A Posteriori Error Estimation Techniques for Finite Element Methods. Numerical Mathematics and Scientific Computation. Oxford University Press, Oxford (2013) [Google Scholar]
37.Vohralík, M.: A posteriori error estimates for efficiency and error control in numerical simulations. Universite Pierre et Marie Curie – Paris 6 (2018). Lecture notes Course NM497
38.Vohralík, M., Yousef, S.: A simple a posteriori estimate on general polytopal meshes with applications to complex porous media flows. Comput. Methods Appl. Mech. Eng. 331, 728–760 (2018) [Google Scholar]
39.Xu, X., Lewis, C., Liu, W., Albertson, J., Kiely, G.: Analysis of single-ring infiltrometer data for soil hydraulic properties estimation: comparison of BEST and Wu methods. Agric. Water Manag. 107, 34–41 (2012) [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

No datasets were generated or analysed during the current study.

[CR1] 1.Alauzet, F., Loseille, A., Olivier, G.: Time-accurate multi-scale anisotropic mesh adaptation for unsteady flows in CFD. J. Comput. Phys. 373, 28–63 (2018) [Google Scholar]

[CR2] 2.Alt, H.W., Luckhaus, S.: Quasilinear elliptic-parabolic differential equations. Math. Z. 183(3), 311–341 (1983) [Google Scholar]

[CR3] 3.Baradi, M., Difonzo, F.V.: Strong solutions for Richards’ equation with Cauchy conditions and constant pressure. Environ. Fluid Mech. 20, 165–174 (2020) [Google Scholar]

[CR4] 4.Barenblatt, G.I.: On some unsteady motions of a liquid and gas in a porous medium. Akad. Nauk SSSR. Prikl. Mat. Meh. 16, 67–78 (1952) [Google Scholar]

[CR5] 5.Baron, V., Coudière, Y., Sochala, P.: Adaptive multistep time discretization and linearization based on a posteriori error estimates for the Richards equation. Appl. Numer. Math. 112, 104–125 (2017) [Google Scholar]

[CR6] 6.Bernardi, C., El Alaoui, L., Mghazli, Z.: A posteriori analysis of a space and time discretization of a nonlinear model for the flow in partially saturated porous media. IMA J. Numer. Anal. 34(3), 1002–1036 (2014) [Google Scholar]

[CR7] 7.Brezzi, F., Fortin, M.: Mixed and Hybrid Finite Element Methods, Springer Series in Computational Mathematics, vol. 15. Springer, New York (1991) [Google Scholar]

[CR8] 8.Di Pietro, D.A., Vohralík, M., Yousef, S.: An a posteriori-based, fully adaptive algorithm with adaptive stopping criteria and mesh refinement for thermal multiphase compositional flows in porous media. Compu. Math. Appl. 68(12, Part B), 2331–2347 (2014) [Google Scholar]

[CR9] 9.Dolejší, V.: ANGENER – Anisotropic mesh generator, in-house code. Charles University, Prague, Faculty of Mathematics and Physics (2000). https://msekce.karlin.mff.cuni.cz/~dolejsi/angen/

[CR10] 10.Dolejší, V.: ADGFEM – Adaptive discontinuous Galerkin finite element method, in-house code. Charles University, Prague, Faculty of Mathematics and Physics (2020). https://msekce.karlin.mff.cuni.cz/~dolejsi/adgfem/

[CR11] 11.Dolejší, V., Ern, A., Vohralík, M.: A framework for robust a posteriori error control in unsteady nonlinear advection-diffusion problems. SIAM J. Numer. Anal. 51(2), 773–793 (2013) [Google Scholar]

[CR12] 12.Dolejší, V., Ern, A., Vohralík, M.: -adaptation driven by polynomial-degree-robust a posteriori error estimates for elliptic problems. SIAM J. Sci. Comput. 38(5), A3220–A3246 (2016) [Google Scholar]

[CR13] 13.Dolejší, V., Feistauer, M.: Discontinuous Galerkin Method: Analysis and Applications to Compressible Flow. Springer Series in Computational Mathematics, vol. 48. Springer, Cham (2015) [Google Scholar]

[CR14] 14.Dolejší, V., Kuráž, M., Solin, P.: Adaptive higher-order space-time discontinuous Galerkin method for the computer simulation of variably-saturated porous media flows. Appl. Math. Model. 72, 276–305 (2019) [Google Scholar]

[CR15] 15.Dolejší, V., May, G.: Anisotropic -Mesh Adaptation Methods. Birkhäuser, Cham (2022) [Google Scholar]

[CR16] 16.Dolejší, V., May, G., Rangarajan, A.: A continuous -mesh model for adaptive discontinuous Galerkin schemes. Appl. Numer. Math. 124, 1–21 (2018) [Google Scholar]

[CR17] 17.Dolejší, V., Roskovec, F., Vlasák, M.: Residual based error estimates for the space-time discontinuous Galerkin method applied to the compressible flows. Comput. Fluids 117, 304–324 (2015) [Google Scholar]

[CR18] 18.Dolejší, V., Roskovec, F., Vlasák, M.: A posteriori error estimates for higher order space-time Galerkin discretizations of nonlinear parabolic problems. SIAM J. Numer. Anal. 59(3), 1486–1509 (2021) [Google Scholar]

[CR19] 19.Dolejší, V., Shin, H.G.: A posteriori error estimate and mesh adaptation for the numerical solution of the richards equation. In: J.M. Melenk, et al. (eds.) Spectral and High Order Methods for Partial Differential Equations ICOSAHOM 2020+1, Lecture Notes in Computational Science and Engineering 137 (2023)

[CR20] 20.Dolejší, V., May, G.: An anisotropic -mesh adaptation method for time-dependent problems based on interpolation error control. J. Sci. Comput. 95(2) (2023)

[CR21] 21.Ern, A., Guermond, J.L.: Theory and Practice of Finite Elements. Springer, Berlin (2004) [Google Scholar]

[CR22] 22.Ern, A., Smears, I., Vohralík, M.: Guaranteed, locally space-time efficient, and polynomial-degree robust a posteriori error estimates for high-order discretizations of parabolic problems. SIAM J. Numer. Anal. 55(6), 2811–2834 (2017) [Google Scholar]

[CR23] 23.Ern, A., Vohralík, M.: Adaptive inexact Newton methods with a posteriori stopping criteria for nonlinear diffusion PDEs. SIAM J. Sci. Comput. 35(4), A1761–A1791 (2013) [Google Scholar]

[CR24] 24.Ern, A., Vohralík, M.: Polynomial-degree-robust a posteriori estimates in a unified setting for conforming, nonconforming, discontinuous Galerkin, and mixed discretizations. SIAM J. Numer. Anal. 53(2), 1058–1081 (2015) [Google Scholar]

[CR25] 25.Farthing, M., Ogden, F.: Numerical solution of Richards’ equation: a review of advances and challenges. Soil Sci. Soc. Am. J. 81(6), 1257–1269 (2017) [Google Scholar]

[CR26] 26.Gardner, W.R.: Some steady state solutions of the unsaturated moisture flow equation with application to evaporation from a water table. Soil Sci. 85, 228–232 (1958) [Google Scholar]

[CR27] 27.van Genuchten, M.T.: Closed-form equation for predicting the hydraulic conductivity of unsaturated soils. Soil Sci. Soc. Am. J. 44(5), 892–898 (1980) [Google Scholar]

[CR28] 28.Kim, I.C., Požár, N.: Nonlinear elliptic-parabolic problems. Arch. Ration. Mech. Anal. 210, 975–1020 (2013) [Google Scholar]

[CR29] 29.Mallik, G., Vohralík, M., Yousef, S.: Goal-oriented a posteriori error estimation for conforming and nonconforming approximations with inexact solvers. J. Comput. Appl. Math. 366 (2020)

[CR30] 30.Mitra, K., Vohralík, M.: A posteriori error estimates for the Richards equation. Tech. Rep. hal-03328944v2, INRIA (2022)

[CR31] 31.Mualem, Y.: A new model for predicting the hydraulic conductivity of unsaturated porous media. Water Resour. Res. 12(3), 513–522 (1976) [Google Scholar]

[CR32] 32.Nakhaei, M., Šimnek, J.: Parameter estimation of soil hydraulic and thermal property functions for unsaturated porous media using the hydrus-2d code. J. Hydrol. Hydromech. 62(1) (2014)

[CR33] 33.Richards, L.A.: Capillary conduction of liquids through porous mediums. J. Appl. Phys. 1(5), 318–333 (1931) [Google Scholar]

[CR34] 34.Rudin, W.: Real and Complex Analysis, 3rd edn. McGraw-Hill, New York (1987) [Google Scholar]

[CR35] 35.Tracy, F.T.: Clean two- and three-dimensional analytical solutions of Richards equation for testing numerical solvers. Water Resour Res 42(8) (2006)

[CR36] 36.Verfürth, R.: A Posteriori Error Estimation Techniques for Finite Element Methods. Numerical Mathematics and Scientific Computation. Oxford University Press, Oxford (2013) [Google Scholar]

[CR37] 37.Vohralík, M.: A posteriori error estimates for efficiency and error control in numerical simulations. Universite Pierre et Marie Curie – Paris 6 (2018). Lecture notes Course NM497

[CR38] 38.Vohralík, M., Yousef, S.: A simple a posteriori estimate on general polytopal meshes with applications to complex porous media flows. Comput. Methods Appl. Mech. Eng. 331, 728–760 (2018) [Google Scholar]

[CR39] 39.Xu, X., Lewis, C., Liu, W., Albertson, J., Kiely, G.: Analysis of single-ring infiltrometer data for soil hydraulic properties estimation: comparison of BEST and Wu methods. Agric. Water Manag. 107, 34–41 (2012) [Google Scholar]

PERMALINK

Error Estimates and Adaptivity of the Space-Time Discontinuous Galerkin Method for Solving the Richards Equation

Vít Dolejší

Hyun-Geun Shin

Miloslav Vlasák

Abstract

Introduction

Problem Formulation

Definition 1

Space-time discretization

Definition 2

Remark 1

A Posteriori Error Analysis

Error Measures

Lemma 1

Proof

Lemma 2

Proof

Temporal and Spatial Flux Reconstructions

Auxiliary Results

Lemma 3

Proof

Reliability

Theorem 1

Proof

Remark 2

Efficiency

Theorem 2

Proof

Theorem 3

Proof

Spatial Flux Reconstructions and Stopping Criteria

Element-Wise Variant

Patch-Wise Variant

Stopping Criteria for Iterative Solvers

Numerical Experiments

Remark 3

Barenblatt Problems

Table 1.

Table 4.

Table 2.

Table 3.

Justification of the Algebraic Stopping Criterion (91)

Table 5.

Table 6.

Tracy Problem

Table 7.

Mesh Adaptive Algorithm

Algorithm 1.

Barenblatt Problem

Table 8.

Fig. 1.

Fig. 2.

Single Ring Infiltration

Fig. 3.

Fig. 4.

Fig. 5.

Conclusion

Funding

Data Availability

Declarations

Conflict of interest

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases