Skip to main content
Springer logoLink to Springer
. 2022 Nov 16;153(1):111–140. doi: 10.1007/s00211-022-01334-8

Goal-oriented adaptive finite element methods with optimal computational complexity

Roland Becker 1, Gregor Gantner 2, Michael Innerberger 3,, Dirk Praetorius 3
PMCID: PMC9829645  PMID: 36644212

Abstract

We consider a linear symmetric and elliptic PDE and a linear goal functional. We design and analyze a goal-oriented adaptive finite element method, which steers the adaptive mesh-refinement as well as the approximate solution of the arising linear systems by means of a contractive iterative solver like the optimally preconditioned conjugate gradient method or geometric multigrid. We prove linear convergence of the proposed adaptive algorithm with optimal algebraic rates. Unlike prior work, we do not only consider rates with respect to the number of degrees of freedom but even prove optimal complexity, i.e., optimal convergence rates with respect to the total computational cost.

Mathematics Subject Classification: 65N30, 65N50, 65Y20, 41A25, 65N22

Introduction

Let ΩRd be a bounded Lipschitz domain, d2. For given fL2(Ω) and f[L2(Ω)]d, we consider a linear symmetric and elliptic partial differential equation

-divAu+cu=f+divfinΩ,u=0onΓ:=Ω, 1

where A(x)Rsymd×d is symmetric and c(x)R. As usual, we assume that A,cL(Ω), that A is uniformly positive definite and that the weak form (see (5) below) fits into the setting of the Lax–Milgram lemma. Standard adaptivity aims to approximate the unknown solution uH01(Ω) of (1) in the energy norm at optimal rate; see [1, 6, 7, 9, 11, 19, 23] for adaptive finite element methods (AFEMs) and [5] for an overview of available results. Instead, the quantity of interest for goal-oriented adaptivity is only some functional value of the unknown solution uH01(Ω) of (1), and the present paper aims to compute the linear goal functional

G(u):=Ω(gu-g·u)dx, 2

for given gL2(Ω) and g[L2(Ω)]d. To approximate G(u) accurately, it is not necessary (and might even waste computational time) to accurately approximate the solution u on the whole computational domain. Due to this potential decrease of computational cost, goal-oriented adaptivity is of high relevance in practice as well as in mathematical research; see, e.g., [3, 4, 10, 17] for some prominent contributions.

The present work formulates a goal-oriented adaptive finite element method (GOAFEM), where the sought goal G(u) is approximated by some computable G such that

|G(u)-G|0even at optimal algebraic rate. 3

The earlier works [2, 12, 14, 20] are essentially concerned with optimal convergence rates for GOAFEM, where all arising linear FEM systems are solved exactly. While [12, 14] particularly aim to transfer ideas from the AFEM analysis of [5, 6] to GOAFEM for general elliptic PDEs, the seminal work [20] considers the Poisson model problem and additionally addresses the total computational cost by formulating realistic assumptions on a generic inexact solver (called GALSOLVE in [20, 23]).

The focus of the present work is also on the iterative (and hence inexact) solution of the arising FEM systems. However, we avoid any realistic assumptions on the solver, but rather rely on energy contraction per solver step, which is proved to hold for the preconditioned CG method with optimal multilevel additive Schwarz preconditioner [8] or the geometric multigrid method [25]. In the proposed GOAFEM algorithm, the termination of such a contractive iterative solver is then based on appropriate computable a posteriori error estimates by a similar criterion as in [20, 23]. We discuss several implementations of such termination criteria and prove that these allow to control the total computational cost of computing the approximate goal value G, where we already stress now that G(u)G=G(u)+R, where uu is a FEM approximation of u and R is a residual correction related to inexact solution of the FEM formulation. While [20] shows algebraic convergence with optimal rates (in the present setting of FEM on quasi-uniform meshes) with respect to the overall computational cost for the final iterates on every level for sufficiently small adaptivity parameters (for mesh-refinement and solver termination), our main contribution is full linear convergence, i.e., linear convergence of the estimator product independently of the algorithmic decision for either mesh-refinement or solver step and even for arbitrary adaptivity parameters. An immediate consequence is that the convergence rate of the computed solutions with respect to the number of elements will be the same as with respect to the overall computational cost (i.e., the cumulative computational time). Moreover, for sufficiently small adaptivity parameters, we show convergence with optimal rates with respect to the number of elements and, hence, with respect to the overall computational cost. This extends the results of [20] to the present setting of symmetric second-order linear elliptic PDEs. Finally, we stress that, unlike [20], our GOAFEM algorithm does not require any inner loop for data approximation and therefore does not require different (but still nested) meshes for the primal and dual problem. Overall, the present paper thus provides further mathematical understanding for bridging the gap between applied GOAFEM and theoretical optimality results.

Outline In Sect. 2, we present our GOAFEM algorithm (Algorithm 3) and the details of its individual steps. This includes the details of our finite element discretization as well as the precise assumptions for the iterative solver, the marking strategy, and the error estimators. We then state in Sect. 3 that Algorithm 3 leads to linear convergence for arbitrary stopping parameters (Theorem 6) and even achieves optimal rates with respect to the total computational cost if the adaptivity parameters are sufficiently small (Theorem 8). We emphasize that linear convergence applies to all steps of the adaptive strategy, independently of whether the algorithm decides for one solver step or one step of local mesh-refinement. This turns out to be the key argument for optimal rates with respect to the total computational cost (see Corollary 7). Section 3.2 comments on alternative termination criteria for the iterative solver. Section 4 then illustrates our theoretical findings with numerical experiments. Finally, we give a proof of our main Theorems 6 and 8 in Sects. 5 and 6, respectively.

Notation In the following text, we write ab for a,bR if there exists a constant C>0 (which is independent of the mesh width h) such that aCb. If there holds aba, we abbreviate this by ab. Furthermore, we denote by #A the cardinality of a finite set A and by |ω| the d-dimensional Lebesgue measure of a subset ωRd.

Goal-oriented adaptive finite element method

Variational formulation

Defining the symmetric bilinear form

a(u,v):=ΩAu·vdx+Ωcuvdx, 4

we suppose that a(·,·) is continuous and elliptic on H01(Ω) and thus fits into the setting of the Lax–Milgram lemma, i.e., there exist constants 0<CellCcnt< such that

CelluH01(Ω)2a(u,u)anda(u,v)CcntuH01(Ω)vH01(Ω)for allu,vH01(Ω).

In particular, a(·,·) is a scalar product that yields an equivalent norm |||v|||2:=a(v,v) on H01(Ω). The weak formulation of (1) reads

a(u,v)=F(v):=Ω(fvdx-f·v)dxfor allvH01(Ω). 5

The Lax–Milgram lemma proves existence and uniqueness of the solution uH01(Ω) of (5). The same argument applies and proves that the dual problem

a(v,z)=G(v)for allvH01(Ω) 6

admits a unique solution zH01(Ω), where the linear goal functional GH-1(Ω):=H01(Ω) is defined by (2).

Remark 1

For ease of presentation, we restrict our model problem (1) to homogeneous Dirichlet boundary conditions. We note, however, that for mixed homogeneous Dirichlet and inhomogeneous Neumann boundary conditions our main results hold true with the obvious modifications. In particular, with the partition Ω=Γ¯DΓ¯N into Dirichlet boundary ΓD with |ΓD|>0 and Neumann boundary ΓN, the space H01(Ω) (and its discretization) has to be replaced by HD1(Ω):={vH1(Ω):v|ΓD=0in the sense of traces} and the Neumann data has to be given in L2(ΓN). Furthermore, the coefficient f must vanish in a neighborhood of ΓN to go from the strong form (1) to the weak form (5) via integration by parts.

Finite element discretization and solution

For a conforming triangulation TH of Ω into compact simplices and a polynomial degree p1, let

XH:={vHH01(Ω):TTH,vH|Tis a polynomial of degreep}. 7

To obtain conforming finite element approximations uuHXH and zzHXH, we consider the Galerkin discretizations of (5)–(6). First, we note that the Lax–Milgram lemma yields the existence and uniqueness of exact discrete solutions uH,zHXH, i.e., there holds that

a(uH,vH)=F(vH)anda(vH,zH)=G(vH)for allvHXH. 8

In practice, the discrete systems (8) are rarely solved exactly (or up to machine precision). Instead, a suitable iterative solver is employed, which yields approximate discrete solutions uHm,zHnXH. We suppose that this iterative solver is contractive, i.e., for all m,nN, it holds that

|||uH-uHm|||qctr|||uH-uHm-1|||and|||zH-zHn|||qctr|||zH-zHn-1|||, 9

where 0<qctr<1 is a generic constant and, in particular, independent of XH. Assumption (9) is satisfied, e.g., for an optimally preconditioned conjugate gradient (PCG) method (see [8]) or geometric multigrid solvers (see [25]); see also the discussion in [16]. We note that these solvers are also guaranteed to satisfy the realistic assumptions from [20, 23] (which require that any initial energy error can be improved by a factor 0<τ<1 at O(|log(τ)|#TH) cost). However, while (9) is slightly less general, it allows to prove full linear convergence; see Theorem 6 below.

Discrete goal quantity

To approximate G(u), we proceed as in [17]: For any uH,zHXH, it holds that

graphic file with name 211_2022_1334_Equ128_HTML.gif

Defining the discrete quantity of interest

GH(uH,zH):=G(uH)+[F(zH)-a(uH,zH)], 10

the goal error can be controlled by means of the Cauchy–Schwarz inequality

|G(u)-GH(uH,zH)||a(u-uH,z-zH)||||u-uH||||||z-zH|||. 11

We note that the additional term in (10) is the residual of the discrete primal problem (8) evaluated at an arbitrary function zHXH and hence G(uH)=GH(uH,zH).

In the following, we design an adaptive algorithm that provides a computable upper bound to (11) which tends to zero at optimal algebraic rate with respect to the number of elements #TH as well as with respect to the total computational cost.

Mesh refinement

Let T0 be a given conforming triangulation of Ω. We suppose that the mesh-refinement is a deterministic and fixed strategy, e.g., newest vertex bisection [24]. For each conforming triangulation TH and marked elements MHTH, let Th:=refine(TH,MH) be the coarsest conforming triangulation, where all TMH have been refined, i.e., MHTH\Th. We write ThT(TH), if Th results from TH by finitely many steps of refinement. To abbreviate notation, let T:=T(T0). We note that the order on T is respected by the finite element spaces, i.e., ThT(TH) implies that XHXh.

We further suppose that each refined element has at least two sons, i.e.,

#(TH\Th)+#TH#Thfor allTHTand allThT(TH), 12

and that the refinement rule satisfies the mesh-closure estimate

#T-#T0Cclsj=0-1#Mjfor allN, 13

where Ccls>0 depends only on T0. For newest vertex bisection, this has been proved under an additional admissibility assumption on T0 in [1, 24] and for 2D even without any additional assumption in [18]. Finally, we suppose that the overlay estimate holds, i.e., for all triangulations TH,ThT, there exists a common refinement THThT(TH)T(Th) which satisfies that

#(THTh)#TH+#Th-#T0, 14

which has been proved in [6, 23] for newest vertex bisection.

Estimator properties

For THT and vHXH, let

ηH(T,vH)0andζH(T,vH)0for allTTH

be given refinement indicators. For μH{ηH,ζH}, we use the usual convention that

μH(vH):=μH(TH,vH),whereμH(UH,vH)=(TUHμH(T,vH)2)1/2 15

for all vHXH and all UHTH.

We suppose that the estimators ηH and ζH satisfy the so-called axioms of adaptivity (which are designed for, but not restricted to, weighted-residual error estimators) from [5]: There exist constants Cstab,Crel,Cdrel>0 and 0<qred<1 such that for all THT(T0) and all ThT(TH), the following assumptions are satisfied:

  1. Stability: For all vhXh, vHXH, and UHThTH, it holds that
    |ηh(UH,vh)-ηH(UH,vH)|+|ζh(UH,vh)-ζH(UH,vH)|Cstab|||vh-vH|||.
  2. Reduction: For all vHXH, it holds that
    ηh(Th\TH,vH)qredηH(TH\Th,vH)andζh(Th\TH,vH)qredζH(TH\Th,vH).
  3. Reliability: The Galerkin solutions uH,zHXH to (8) satisfy that
    |||u-uH|||CrelηH(uH)and|||z-zH|||CrelζH(zH).
  4. Discrete reliability: The Galerkin solutions uH,zHXH and uh,zhXh to (8) satisfy that
    |||uh-uH|||CdrelηH(TH\Th,uH)and|||zh-zH|||CdrelζH(TH\Th,zH).

By assumptions (A1) and (A3), we can estimate for every discrete function wHXH the errors in the energy norm of the primal and the dual problem by

|||u-wH|||C[ηH(wH)+|||uH-wH|||]and|||z-wH|||C[ζH(wH)+|||zH-wH|||],

respectively, where C=max{Crel,CrelCstab+1}>0. Together with (11), we then obtain that the goal error for approximations uHmuH and zHnzH in XH is bounded by

|G(u)-GH(uHm,zHn)|C2[ηH(uHm)+|||uH-uHm|||][ζH(zHn)+|||zH-zHn|||]. 16

In the following sections, we provide building blocks for our adaptive algorithm that allow to control the arising estimators (by a suitable marking strategy) as well as the arising norms in the upper bound of (16) (by an appropriate stopping criterion for the iterative solver).

Marking strategy

We suppose that the refinement indicators ηH(T,uHm) and ζH(T,zHn) for some m,nN are used to mark a subset MHTH of elements for refinement, which, for fixed marking parameter 0<θ1, satisfies that

2θηH(uHm)2ζH(zHn)2ηH(MH,uHm)2ζH(zHn)2+ζH(MH,zHn)2ηH(uHm)2. 17

Remark 2

Given 0<ϑ1, possible choices of marking strategies satisfying assumption (17) are the following:

  1. The strategy proposed in [2] defines the weighted estimator
    ρH(T,uHm,zHn)2:=ηH(T,uHm)2ζH(zHn)2+ηH(uHm)2ζH(T,zHn)2
    and then determines a set MHTH such that
    ϑρH(uHm,zHn)ρH(MH,uHm,zHn) 18
    which is the Dörfler marking criterion introduced in [9] and well-known in the context of AFEM analysis; see, e.g., [5]. This strategy satisfies (17) with θ=ϑ2.
  2. The strategy proposed in [20] determines sets M¯Hu,M¯HzTH such that
    ϑηH(uHm)η(M¯Hu,uHm)andϑζH(zHn)ζH(M¯Hz,zHn) 19
    and then chooses MH:=argmin{#M¯Hu,#M¯Hz}. This strategy satisfies (17) with θ=ϑ2/2.
  3. A more aggressive variant of (b) was proposed in [14]: Let M¯Hu and M¯Hz as above. Then, choose MHuM¯Hu and MHzM¯Hz with #MHu=#MHz=min{#M¯Hu,#M¯Hz}. Finally, define MH:=MHuMHz. Again, this strategy satisfies (17) with θ=ϑ2/2.

Note that our main results of Theorem 6 and 8 below hold true for all presented marking criteria (a)–(c). For our numerical experiments, we focus on criterion (a), which empirically tends to achieve slightly better performance in practice.

Adaptive algorithm

Any adaptive algorithm strives to drive down the bound in (16). However, the errors of the iterative solver, |||uH-uHm||| and |||zH-zHn|||, cannot be computed in general since the exact discrete solutions uH,zHXH to (8) are unknown and will not be computed. Thus, we note that (9) and the triangle inequality prove that

(1-qctr)|||uH-uHm-1||||||uHm-uHm-1|||(1+qctr)|||uH-uHm-1||| 20a

as well as

(1-qctr)|||zH-zHn-1||||||zHn-zHn-1|||(1+qctr)|||zH-zHn-1|||. 20b

With Cgoal=max{Crel,CrelCstab+1}(1+qctr/(1-qctr)), (16) leads to

|G(u)-GH(uHm,zHn)|Cgoal2[ηH(uHm)+|||uHm-uHm-1|||][ζH(zHn)+|||zHn-zHn-1|||], 21

which is a computable upper bound to the goal error if m,n1. Moreover, given some λctr>0, this motivates to stop the iterative solvers as soon as

|||uHm-uHm-1|||λctrηH(uHm)and|||zHn-zHn-1|||λctrζH(zHn)

to equibalance the contributions of the upper bound in (21); alternative stopping criteria are introduced and analyzed below. Overall, we thus consider the following adaptive algorithm.

Algorithm 3

Let u00,z00X0 be initial guesses. Let 0<θ1 as well as λctr>0 be arbitrary but fixed marking parameters. For all =0,1,2,, perform the following steps (i)–(vi):

  • (i)
    Employ (at least one step of) the iterative solver to compute iterates u1,,um and z1,,zn together with the corresponding refinement indicators η(T,uk) and ζ(T,zk) for all TT, until
    |||um-um-1|||λctrη(um)and|||zn-zn-1|||λctrζ(zn). 22
  • (ii)

    Define m_():=m and n_():=n.

  • (iii)

    If η(um)=0 or ζ(zm)=0, then define _:= and terminate.

  • (iv)

    Otherwise, find a set MT such that the marking criterion (17) is satisfied.

  • (v)

    Generate T+1:=refine(T,M).

  • (vi)

    Define the initial guesses u+10:=um and z+10:=zn for the iterative solver.

Remark 4

Theorem 6 below proves (linear) convergence for any choice of the marking parameters 0<θ1 and λctr>0, and for any of the marking strategies from Remark 2. Theorem 8 below proves optimal convergence rates (with respect to the number of elements and the total computational cost) if both parameters are sufficiently small (see (32) for the precise condition) and if the set M is constructed by one of the strategies from Remark 2, where the respective sets have quasi-minimal cardinality.

Remark 5

Note that Algorithm 3(i) requires to evaluate the error estimator after each solver step. Clearly, it would be favorable to replace η(um) (resp. ζ(zn)) by η(u0) (resp. ζ(z0)) in (22). Arguing as in [13, Lemma 8], this allows to prove convergence of the adaptive strategy, but full linear convergence (Theorem 6 below) and optimal convergence rates (Theorem 8 below) are exptected to fail.

For each adaptive level , Algorithm 3 performs at least one solver step to compute um as well as one solver step to compute zn. By definition, m_()1 is the solver step, for which the discrete solution um_() is accepted (to contribute to the set of marked elements M). Analogously, n_()1 is the solver step, for which the discrete solution zn_() is accepted (to contribute to M). If the iterative solver for either the primal or the dual problem fails to terminate for some level N0, i.e., (22) cannot be achieved for finite m, or n, we define m_():=, or n_():=, respectively, and _:=. With k_():=max{m_(),n_()}, we define

uk:=um_()for allkNwithm_()<kk_(),zk:=zn_()for allkNwithn_()<kk_(). 23

For ease of presentation, we omit the -dependence of the indices for final iterates m_(), n_(), and k_() in the following, if they appear as upper indices and write, e.g., um_:=um_() and um_-1:=um_()-1. If Algorithm 3 does not terminate in step (iii) for some N, then we define _:=. To formulate the convergence of Algorithm 3, we define the ordered set

Q:={(,k)N02:_and1kk_()},where|(,k)|:=k+j=0-1k_(j). 24

Note that |(,k)| is proportional to the overall number of solver steps to compute the estimator product η(uk)ζ(zk). Additionally, we sometimes require the notation

Q0:={(,k)N02:_and0kk_()}=Q{(,0)N02:_}. 25

To estimate the work necessary to compute a pair (uk,zk)X×X, we make the following assumptions which are usually satisfied in practice:

  • The iterates uk and zk are computed in parallel and each step of the solver in Algorithm 3(i) can be done in linear complexity O(#T);

  • Computation of all indicators η(T,uk) and ζ(T,zk) for TT requires O(#T) steps;

  • The marking in Algorithm 3(iv) can be performed at linear cost O(#T) (according to [23] this can be done for the strategies outlined in Remark 2 with M having almost minimal cardinality; moreover, we refer to a recent own algorithm in [21] with linear cost even for M having minimal cardinality);

  • We have linear cost O(#T) to generate the new mesh T+1.

Since a step (,k)Q of Algorithm 3 depends on the full history of preceding steps, the total work spent to compute (uk,zk)X×X is then of order

work(,k):=(,k)Q|(,k)||(,k)|#Tfor all(,k)Q. 26

Finally, we note that Algorithm 3(vi) employs nested iteration to obtain the initial guesses u+10,z+10 of the solver from the final iterates um_,zn_ for the mesh T. According to (21), this allows for a posteriori error control for all indices (,k)Q0\{(0,0)} beyond the initial step.

Main results

Linear convergence with optimal rates

Our first main result states linear convergence of the quasi-error product

Λk:=[|||u-uk|||+η(uk)][|||z-zk|||+ζ(zk)]for all(,k)Q0 27

for every choice of the stopping parameter λctr>0. Recall from (16) that the quasi-error product is an upper bound for the error |G(u)-G(uk,zk)|. Moreover, if k=k_(), then () and (22) give that Λk_η(uk_)ζ(zk_).

Theorem 6

Suppose (A1)–(A3). Suppose that 0<θ1 and λctr>0. Then, Algorithm 3 satisfies linear convergence in the sense of

ΛkClinqlin|(,k)|-|(,k)|Λkfor all(,k),(,k)Q{(0,0)}with|(,k)||(,k)|. 28

The constants Clin>0 and 0<qlin<1 depend only on Cstab, qred, Crel, qctr, and the (arbitrary) adaptivity parameters 0<θ1 and λctr>0.

Full linear convergence implies that convergence rates with respect to degrees of freedom and with respect to total computational cost are equivalent. From this point of view, full linear convergence indeed turns out to be the core argument for optimal complexity.

Corollary 7

Recall the definition of the total computational cost work(,k) from (26). Let r>0 and Cr:=sup(,k)Q(#T-#T0+1)rΛk[0,]. Then, under the assumptions of Theorem 6, it holds that

Crsup(,k)Q(#T)rΛksup(,k)Qwork(,k)rΛkCrateCr, 29

where the constant Crate>0 depends only on r, #T0, and on the constants qlin,Clin from Theorem 6.

Proof

The first two estimates in (29) are obvious. It remains to prove the last estimate in (29). To this end, note that it follows from the definition of Cr that

#T-#T0+1(Λk)-1/rCr1/rfor all(,k)Q.

Moreover, elementary algebra yields that

#T#T0(#T-#T0+1)for all(,0)Q0.

For (,k)Q, Theorem 6 and the geometric series thus show that

work(,k)=26(,k)Q|(,k)||(,k)|#T#T0(,k)Q|(,k)||(,k)|(#T-#T0+1)#T0Cr1/r(,k)Q|(,k)||(,k)|(Λk)-1/r#T0Cr1/rClin1/r11-qlin1/r(Λk)-1/r.

With Crate:=(#T0)rClin1/(1-qlin1/r)r, this gives that

work(,k)rΛkCrateCrfor all(,k)Q.

This shows the final inequality in (29) and thus concludes the proof.

If θ and λctr are small enough, we are able to show that linear convergence from Theorem 6 even guarantees optimal rates with respect to both the number of unknowns #T and the total cost work(,k). Given NN0, let T(N) be the set of all THT with #TH-#T0N. With

uAr:=supNN0(N+1)rminToptT(N)ηopt(uopt)[0,] 30a

and

zAr:=supNN0(N+1)rminToptT(N)ζopt(zopt)[0,] 30b

for all r>0, there holds the following result.

Theorem 8

Recall the definition of the total computational cost work(,k) from (26). Suppose the mesh properties (12)–(14) as well as the axioms (A1)–(A4). Define

θ:=11+Cstab2Cdrel2andλ:=1-qctrqctrCstab. 31

Let both adaptivity parameters 0<θ1 and 0<λctr<λ be sufficiently small such that

0<(2θ+λctr/λ1-λctr/λ)2<θ. 32

Let 1Cmark<. Suppose that the set of marked elements M in Algorithm 3(iv) is constructed by one of the strategies from Remark 2(a)–(c), where the sets in (18) and (19) have up to the factor Cmark minimal cardinality. Let s,t>0 with uAs+zAt<. Then, there exists a constant Copt>0 such that

sup(,k)Qwork(,k)s+tΛkCoptmax{uAszAt,Λ00}. 33

The constant Copt depends only on Ccls, Cstab, qred, Crel, Cdrel, qctr, Cmark, θ, λctr, #T0, s, and t.

Remark 9

The constraint (32) is enforced by our analysis of the marking strategy from Remark 2(a), while the marking strategies from Remark 2(b)–(c) allow to relax the condition to

0<(θ+λctr/λ1-λctr/λ)2<θ. 34

Alternative termination criteria for iterative solver

The above formulations of Algorithm 3 stops the iterative solver for um and the iterative solver for zn independently of each other as soon as the respective termination criteria in (22) are satisfied. In this section, we briefly discuss two alternative termination criteria:

Stronger termination: The current proof of linear convergence (and of the subsequent proof of optimal convergence) does only exploit that uk_ and zk_ satisfy the stopping criterion and the previous iterates do not (cf. Lemma 10(iii)). This can also be ensured by the following modification of Algorithm 3(i):

  • (i)
    Employ the iterative solver to compute iterates u1,,uk and z1,,zk together with the corresponding refinement indicators η(T,uk) and ζ(T,zk) for all TT, until
    |||uk-uk-1|||λctrη(uk)and|||zk-zk-1|||λctrζ(zk). 35

Note that this will lead to more solver steps, since now k=k_() (if it exists) is the smallest index for which the stopping criterion holds simultaneously for both uk_ and zk_.

Inspecting the proof of Lemma 10 below, we see that all results hold verbatim also for this stopping criterion. Thus, we conclude linear and optimal convergence (in the sense of Theorem 6 and Theorem 8) also in this case.

Natural termination: The following stopping criterion (which is somehow the most natural candidate) also leads to linear convergence: Let m_(),n_()N be minimal with (22). If either of them do not exist, we set again m_()=, or n_()=, respectively. Define k_():=max{m_(),n_()}. Then, employ the iterative solver k_() times for both the primal and the dual problem, i.e., the solver provides iterates uk and zk until both stopping criteria in (22) have been satisfied once (which avoids the artificial definition (23)). For instance, if m_()<n_()=k_()<, we continue to iterate for the primal problem until uk_ is obtained (or never stop the iteration if n_()=k_()=). If λctr>0 is sufficiently small such that 1-qctr1-qctrCstab(1+qctr)λctr>0, then we can define

λctrλctr:=max{1,(1+qctr)qctr(1-qctr)(1-qctr1-qctrCstab(1+qctr)λctr)}λctr<,

and we can guarantee the stopping condition (22) with the larger constant λctr, i.e.,

|||uk_-uk_-1|||λctrη(uk_)and|||zk_-zk_-1|||λctrζ(zk_); 36

see the proof below. Again, we notice that then the assumptions of Lemma 10 below are met. Hence, we conclude linear convergence (in the sense of Theorem 6) also for this stopping criterion. Moreover, optimal rates in the sense of Theorem 8 hold if λctr in (32) is replaced by λctr.

Proof of (36)

Without loss of generality, let us assume that m_()<k_()=n_()<. First, we have that

|||uk_-um_||||||u-uk_|||+|||u-um_|||(1+qctrk_()-m_())|||u-um_|||.

Then, using the fact that um_ satisfies the stopping criterion in (22) and stability (A1), we get that

graphic file with name 211_2022_1334_Equ129_HTML.gif

For λctr<(1-qctr)/[Cstabqctr(1+qctrk_()-m_())] we can absorb the last term to obtain

|||u-um_|||qctr1-qctr(1-Cstabqctr1-qctr(1+qctrk_()-m_())λctr)-1λctrη(uk_).

Finally, we observe that

|||uk_-uk_-1|||(1+qctr)|||u-uk_-1|||(1+qctr)qctrk_-m_-1|||u-um_|||.

Combining the last two estimates we obtain that

|||uk_-uk_-1|||(1+qctr)qctrk_()-m_()(1-qctr)(1-qctr1-qctrCstab(1+qctrk_()-m_())λctr)λctrη(uk_).

Hence, (36) follows with qctrk_()-m_()qctr and |||zk_-zk_-1|||λctrζ(zk_)λctrζ(zk_).

Numerical examples

In this section, we consider two numerical examples which solve the equation

-Δu=finΩ,u=0onΓD,u·n=ϕonΓN, 37

where ϕL2(ΓN) and n is the element-wise outwards facing unit normal vector. We refer the reader to Remark 1 for a comment on the applicability of our results to this model problem. We further suppose that the goal functional is a slight variant of the one proposed in [20], i.e.,

G(v)=-ωv·gdxforvHD1(Ω), 38

with a subset ωΩ and a fixed direction g(x)=g0R2. Moreover, for error estimation, we employ standard residual error estimators, which in our case, for all (,k)Q and all TT, read

η(T,uk)2:=hT2Δuk+fL2(T)2+hT[[uk·n]]L2(TΩ)2+hTuk·n-ϕL2(TΓN)2,ζ(T,zk)2:=hT2div(zk+g)L2(T)2+hT[[(zk+g)·n]]L2(TΩ)2,

where hT=|T|1/2 is the local mesh-width and [[·]] denotes the jump across interior edges. It is well-known [5, 14] that η and ζ satisfy the assumptions (A1)–(A4). The examples are chosen to showcase the performance of the proposed GOAFEM algorithm for different types of singularities.

Throughout this section, we solve (37) as well as the corresponding dual problem numerically using Algorithm 3, where we make the following choices:

  • We solve the problems on the lowest order finite element space, i.e., with polynomial degree p=1.

  • As initial values, we use u00=z00=0.

  • To solve the arising linear systems, we use a preconditioned conjugate gradient (PCG) method with an optimal additive Schwarz preconditioner. We refer to [8, 22] for details and, in particular, the proof that this iterative solver satisfies (9).

  • We use the marking criterion from Remark 2(a) and choose M such that it has minimal cardinality.

  • Unless mentioned otherwise, we use ϑ=0.5 and λctr=10-5.

Singularity in goal functional only

In our first example, the primal problem is (37) with f=2x1(1-x1)+2x2(1-x2) on the unit square Ω=(0,1)2, and ΓD=Ω (and thus, ΓN=). For this problem, the exact solution reads

u(x)=x1x2(1-x1)(1-x2).

The goal functional is (38) with ω=T1:={xΩ:x1+x23/2} and g0=(-1,0). The exact goal value can be computed analytically to be

G(u)=T1ux1dx=11/960.

The initial mesh T0 as well as a visualization of the set T1 can be seen in Fig. 1.

Fig. 1.

Fig. 1

Left: Initial mesh T0. The shaded area is the set T1 from Section (). Right: Mesh after 14 iterations of Algorithm 3 with #T14=4157

For this setting, we compare our iterative solver to a conjugate gradient method without preconditioner in Fig. 2, where we plot the computable upper bound from (21),

Ξk:=[η(uk)+|||uk-uk-1|||][ζ(zk)+|||zk-zk-1|||]for all(,k)Q,

over work(,k) for all iterates (,k)Q and the estimator product for the final iterates η(uk_)ζ(zk_) over #T. We stress that, for (,k)Q, the computable upper bound Ξk and the quasi-error product Λk from (27) are related by ΛkΞkΛk-1 so that linear convergence (28) with optimal rates (33) of Λk also yields linear convergence with optimal rates of Ξk. Since in our experiments λctr=10-5 is small, it is plausible to assume that the final estimates on every level approximate the exact solutions sufficiently well in the sense of estimator products, i.e., η(uk_)ζ(zk_)η(u)ζ(z) (cf. Lemma 13 below) for which [14] proves optimal convergence rates with respect to #T. Indeed, we see optimal rates for η(uk_)ζ(zk_) with respect to #T for both solvers in Fig. 2. However, the non-preconditioned CG method fails to satisfy uniform contraction (9) and thus Theorem 8 cannot be applied. In fact, Fig. 2 shows that this method fails to drive down Ξk with optimal rates with respect to work(,k) (cf. (26)), as opposed to the optimally preconditioned CG method.

Fig. 2.

Fig. 2

Comparison between iterative solvers for the problem from Sect. 4.1. A conjugate gradient method without preconditioner (CG) leads to optimal rates with respect to #T for the final iterates where k=k_(), but not with respect to work(,k) for every (,k)Q. Our choice of the iterative solver (ML) achieves optimal rates with respect to both measures

Furthermore, we plot in Fig. 3 different error measures over work(,k) for every iterate (,k)Q. This shows that the corrector term

a(uk,zk)-F(zk) 39

(which is the residual of uk evaluated at the dual solution zk) in the definition of the discrete goal functional (10) is indeed necessary. We see that throughout the iteration, the goal value G(uk) highly oscillates and, for large values of λctr, even shows a different rate than the Ξk over work(,k). In general, we thus cannot expect the quantity Ξk to bound the uncorrected goal-error |G(u)-G(uk)|.

Fig. 3.

Fig. 3

Comparison between Ξk, discrete goal G(uk,zk), primal residual evaluated at the dual solution zk, and direct evaluation of goal functional G(uk) for every iterate (,k)Q and different values of λctr{1,10-2,10-4,10-6}. The primal residual evaluated at the dual solution zk is the difference between goal and discrete goal; see (10)

For the discrete goal, the corrector term compensates the oscillations of the goal functional, such that their sum decreases with the same rate as Ξk, as predicted by (21). Smaller values of λctr imply that on every level the approximate solutions uk,zk are computed more accurately, such that the corrector term becomes smaller and the effect on the rate of the goal value becomes negligible.

Geometrical singularity

Our second example is the classical example of a geometric singularity on the so-called Z-shape Ω=(-1,1)2\conv{(-1,-1),(0,0),(-1,0)}, where ΓD is only the re-entrant corner (cf. Fig. 4). The primal problem is (37) with f=0 and ϕ=u·n, where the exact solution in polar coordinates r(x) and φ(x) of xR2 is prescribed as

u(x)=r(x)4/7sin(47φ(x)+3π7).

The goal functional is (38) with ω=T2:=(0.5,0.5)2Ω and g0=(-1,-1) and can be computed directly via numerical integration to be

G(u)=T2(ux1+ux2)dx0.82962247157810.

In Fig. 4, the initial triangulation T0 as well as the mesh after several iterations of Algorithm 3 can be seen. The adaptive algorithm resolves the singularity at the re-entrant corner, as well as critical points of the goal functional, which are at the corners of T2.

Fig. 4.

Fig. 4

Left: Initial mesh T0. The shaded area is the set T2 from Section () and the Dirichlet boundary at the re-entrant corner is marked in red. Right: Mesh after 13 iterations of Algorithm 3 with #T13=4534

Figure 5 shows the rate of the estimator product η(uk_)ζ(zk_) of the final iterates over #T as well as the rate of Ξk over work(,k) for all (,k)Q.

Fig. 5.

Fig. 5

Rates of the estimator product for final iterates over #T and Ξk as well as goal error over work(,k) for all (,k)Q

Proof of Theorem 6

The following core lemma extends one of the key observations of [16] to the present setting, where we stress that the nonlinear product structure of Δk leads to technical challenges which go much beyond [16].

Lemma 10

Suppose (A1)–(A3). Then, there exist constants μ,Caux>0, and 0<qaux<1, and some scalar sequence (R)N0R such that the quasi-error product

Δk:=[|||u-uk|||+μη(uk)][|||z-zk|||+μζ(zk)]for all(,k)Q0

satisfies the following statements (i)–(v):

  • (i)

    ΔkΔj    for all 0jkk_().

  • (ii)

    Δk_-1CauxΔk_    if k_()<.

  • (iii)

    ΔkqauxΔk-1    for all 0<k<k_().

  • (iv)

    Δ+10qauxΔk_-1+R    for all 0<<_.

  • (v)

    =_-1R2Caux(Δk_-1)2    for all 0<_-1.

The constants μ, Caux, and qaux depend only on Cstab, qred, Crel, and qctr as well as on the (arbitrary) adaptivity parameters 0<θ1 and λctr>0.

For the following proofs, we define

αk:=|||u-uk|||,x:=|||u+1-u|||,βk:=|||z-zk|||,y:=|||z+1-z|||,

such that the quasi-error product reads Δk=[αk+μη(uk)][βk+μζ(zk)] with a free parameter μ>0 which will be fixed below.

Proof of Lemma 10(i)

Recall from (23) that uk=um_ for all m_()<kk_(). Thus, we have that

αk+μη(uk)=αm_+μη(um_)for allm_()<kk_().

For 0<k<m_(), on the other hand, the solution uk is obtained by one step of the iterative solver. From stability (A1) and solver contraction (9), we have for all 0j<km_() that

graphic file with name 211_2022_1334_Equ130_HTML.gif

If μ is chosen small enough such that qctr+2μCstab1, together with the trivial case j=k, the last two equations show that

αk+μη(uk)αj+μη(uj)for all0jkk_().

The same argument shows that

βk+μζ(zk)βj+μζ(zj).for all0jkk_(). 40

Multiplication of the last two estimates shows the assertion.

Proof of Lemma 10(ii)

Recall that for the index k_() there holds (22). From the triangle inequality, we thus get for the primal estimator that

graphic file with name 211_2022_1334_Equ131_HTML.gif

Furthermore, stability (A1) leads to

graphic file with name 211_2022_1334_Equ132_HTML.gif

Combining the last two estimates, we see that

αk_-1+μη(uk_-1)(1+λctr(Cstab+μ-1))[αk_+μη(uk_)].

Together with the analogous estimate for βk_-1+μζ(zk_-1), we conclude the proof with Caux=(1+λctr(Cstab+μ-1))2.

Proof of Lemma 10(iii)

Without loss of generality, suppose that k_()=m_() and thus |||uk-uk-1|||>λctrη(uk). Then, this yields that

graphic file with name 211_2022_1334_Equ133_HTML.gif

With contraction of the solver (9), this leads to

αk+μη(uk)qctrαk-1+μλctr-1(1+qctr)αk-1for all0<k<k_().

From (40) for μ small enough, we see that βk+μζ(zk)βk-1+μζ(zk-1). Together with the previous estimate, this shows that

Δk(qctr+μλctr-1(1+qctr))Δk-1. 41

Up to the choice of μ, this concludes the proof.

Proof of Lemma 10(iv)

First, we note that η(uk_)ζ(zk_)0, according to Algorithm 3(iii) and the assumption that <_. From reduction of the solver (9) and nested iteration, we get that

α+10=|||u+1-uk_||||||u+1-u|||+qctr|||u-uk_-1|||=x+qctrαk_-1,β+10=|||z+1-zk_||||||z+1-z|||+qctr|||z-zk_-1|||=y+qctrβk_-1 42

and thus

α+10β+10qctr2αk_-1βk_-1+qctr(αk_-1y+βk_-1x)+xy. 43

For the estimator terms, we have with stability (A1) and reduction (A2) that

η+1(u+10)2=η+1(uk_)2=η+1(T+1T,uk_)2+η+1(T+1\T,uk_)2η(T+1T,uk_)2+qred2η(T\T+1,uk_)2=η(uk_)2-(1-qred2)η(T\T+1,uk_)2.

On the one hand, with C1:=Cstab(1+qred), this implies that

graphic file with name 211_2022_1334_Equ46_HTML.gif 44

On the other hand, with 0<qθ:=1-(1-qred2)θ<1, we get that

η+1(u+10)2η(uk_)2qθ+(1-qred2)[θ-η(T\T+1,uk_)2η(uk_)2]. 45

Using (45), the corresponding estimate for the dual estimator, and the Young inequality, we obtain that

η+1(u+10)η(uk_)ζ+1(z+10)ζ(zk_)qθ+(1-qred2)2[2θ-η(T\T+1,uk_)2η(uk_)2-ζ(T\T+1,zk_)2ζ(zk_)2].

The marking criterion (17), which is applicable due to <_, estimates the term in brackets by zero. Thus stability (A1) leads to

graphic file with name 211_2022_1334_Equ48_HTML.gif 46

For the mixed terms in Δ+10, we have with (42) and (44) that

η+1(u+10)β+10[η(uk_-1)+C1αk_-1][y+qctrβk_-1]=qctrη(uk_-1)βk_-1+η(uk_-1)y+C1αk_-1y+C1qctrαk_-1βk_-1. 47

Analogously, we see that

ζ+1(z+10)α+10qctrζ(zk_-1)αk_-1+ζ(zk_-1)x+C1βk_-1x+C1qctrαk_-1βk_-1. 48

Combining (43) and (46)–(48), we get that

Δ+10=α+10β+10+μ[η+1(u+10)β+10+ζ+1(z+10)α+10]+μ2η+1(u+10)ζ+1(z+10)qctr2αk_-1βk_-1+qctr(αk_-1y+βk_-1x)+xy+μ[qctrη(uk_-1)βk_-1+η(uk_-1)y+C1αk_-1y+C1qctrαk_-1βk_-1]+μ[qctrζ(zk_-1)αk_-1+ζ(zk_-1)x+C1βk_-1x+C1qctrαk_-1βk_-1]+μ2[qθη(uk_-1)ζ(zk_-1)+qθC1(η(uk_-1)βk_-1+ζ(zk_-1)αk_-1)+C12αk_-1βk_-1].

Rearranging the terms, we obtain that

Δ+10(qctr2+2μqctrC1+μ2C12)αk_-1βk_-1+μ(qctr+μqθC1)[η(uk_-1)βk_-1+ζ(zk_-1)αk_-1]+μ2qθη(uk_-1)ζ(zk_-1)+R, 49

where the remainder term is defined as

R:=μ[η(uk_-1)y+ζ(zk_-1)x]+(qctr+μC1)[αk_-1y+βk_-1x]+xy. 50

Up to the choice of μ, this concludes the proof.

Proof of Lemma 10 (choosing μ) For Lemma 10(i), we choose μ small enough such that qctr+2μCstab1. From (41) and (49) in the proofs of Lemma 10(iii)–(iv), we see that we additionally require

qctr+μλctr-1(1+qctr)<1,qctr2+2μqctrC1+μ2C12<1,andqctr+μqθC1<1. 51

Choosing μ small enough, we satisfy all estimates. We define qaux<1 as the maximum of all terms in (51) and qθ

Proof of Lemma 10(v)

First, we note that from stability (A1) it follows that

η(uk_-1)η(u)+αk_-1andη(u)ζ(z)Δjfor all0jk_. 52

Furthermore, Galerkin orthogonality and reliability (A3) imply that, for all nN with +n<_,

graphic file with name 211_2022_1334_Equ55_HTML.gif 53

With (52) and (53) for n=1, we can bound the remainder term from (50) by

Rη(u)y+ζ(z)x+αk_-1y+βk_-1x.

Next, let us recall from [5, Lemma 3.6] the quasi-monotonicity of the estimator, which follows from (A1)–(A3) and the Céa lemma, i.e., for all <_,

η(u)η(u)+Cstab|||u-u|||η(u)+Cstab|||u-u|||η(u). 54

For η(u)y, we get by summation for all 0jk_() and all nN with +n<_ that

graphic file with name 211_2022_1334_Equ134_HTML.gif

Analogously, we see that

=+n(x)2η(u)2as well as=+nζ(z)2(x)2(Δj)2. 55

We proceed with αk_-1y. From (42) and the Young inequality with δ>0, we see for 0<<_ that

(αk_-1)2(α0)2(1+δ-1)(x-1)2+qctr(1+δ)(α-1k_-1)2.

For δ small enough such that q2:=qctr(1+δ)<1 and all for 0<_, the geometric series proves that

(αk_-1)2(1+δ-1)j=-1(xj)2+(αk_-1)2j=0q2jη(u)2+(αk_-1)2

and thus

graphic file with name 211_2022_1334_Equ135_HTML.gif

Analogously, we see that =+n(βk_-1)2(x)2(Δk_-1)2. Combining all estimates with

R2η(u)2(y)2+ζ(z)2(x)2+(αk_-1)2(y)2+(βk_-1)2(x)2,

we conclude the proof.

With the foregoing auxiliary result, we are in the position to prove linear convergence.

Proof of Theorem 6

Let (,k)Q. We recall the quasi-error products

Λk=[|||u-uk|||+η(uk)][|||z-zk|||+ζ(zk)],Δk=[|||u-uk|||+μη(uk)][|||z-zk|||+μζ(zk)]

from Theorem 6 and Lemma 10, respectively. Note that

ΛkΔkμ2Λkifμ1,ΔkΛkμ-2Δkifμ<1,

which yields the equivalence

min{1,μ2}ΛkΔkmax{1,μ2}Λk. 56

We first show linear convergence of Δk. By Lemma 10(i), we can absorb the term Δk_Δk_-1 for all . Paying attention to the possible case k=k_(), this allows us to estimate

(,k)Q|(,k)||(,k)|(Δk)2(Δk)2+k=kk_()-1(Δk)2+=+1_k=0k_()-1(Δk)2.

Lemma 10(iii) shows uniform reduction of the quasi-error on every level. This yields that

(,k)Q|(,k)||(,k)|(Δk)2(Δk)2k=kk_()qaux2(k-k)+=+1_(Δ0)2k=0k_()-1qaux2k(Δk)2+=+1_(Δ0)2.

To estimate the sum over all levels, we use that, for the refinement step, Lemma 10(iv) shows contraction up to a remainder term. The Young inequality with δ>0 and Lemma 10(i) then prove that

(Δ0)2qaux2(1+δ)(Δ-1k_-1)2+(1+δ-1)R-12qaux2(1+δ)(Δ-10)2+(1+δ-1)R-12for all0<_.

Choosing δ small enough such that q:=qaux2(1+δ)<1, we obtain from repeatedly applying the previous estimates that

(Δ0)2q-(Δk_-1)2+(1+δ-1)n=-1q(-1)-nRn2for all0<_.

Using this estimate and a change of summation indices, the geometric series and Lemma 10(v) uniformly bound the sum over all levels by

=+1_(Δ0)2=+1_[q-(Δk_-1)2+n=-1q(-1)-nRn2](Δk_-1)2+n=_-1Rn2i=0qi(Δk_-1)2+n=_-1Rn2(v)(Δk_-1)2.

Combining the estimates above, we obtain that

(,k)Q|(,k)||(,k)|(Δk)2(Δk)2+=+1_(Δ0)2(Δk)2+(Δk_-1)2.

In the case k<k_(), Lemma 10(i) proves that

(,k)Q|(,k)||(,k)|(Δk)2C(Δk)2.

In the case k=k_(), this follows with Lemma 10(ii). In either case, the constant C>0 depends only on Caux and qaux from Lemma 10. Basic calculus then provides the existence of Clin:=(1+C)1/2>1 and 0<qlin:=(1-C-1)-1/2<1 such that

ΔkClinqlin|(,k)|-|(,k)|Δkfor all(,k),(,k)Qwith(,k)(,k);

see [5, Lemma 4.9]. Finally, the claim of Theorem 6 follows from (56) with Clin=max{μ-2,μ2}Clin.

Proof of Theorem 8 (optimal rates)

We recall the following comparison lemma from [12]. While [12] is concerned with point errors in boundary element computations, we stress that the proof of [12, Lemma 14] works on a completely abstract level and thus is applicable here as well.

Lemma 11

([12, Lemma 14]) The overlay estimate (14) and the axioms (A1)–(A2) and (A4) yield the existence of a constant C1>0 such that, given 0<κ<1, each mesh THT admits some refinement ThT(TH) such that for all s,t>0, it holds that

ηh(uh)2ζh(zh)2κ2ηH(uH)2ζH(zH)2, 57a
#Th-#TH2(C1κ-1uAszAt)1/(s+t)(ηH(uH)ζ(zH))1/(s+t). 57b

The constant C1 depends only on Cstab, qred, and Cdrel.

Note that (57a) immediately implies that

ηh(uh)2κηH(uH)2orζh(zh)2κζH(zH)2. 58

We will employ this lemma in combination with the so-called optimality of Dörfler marking from [5].

Lemma 12

([5, Proposition 4.12]) Under (A1) and (A4), for all 0<Θ<1/(1+Cstab2Cdrel2), there exists 0<κΘ<1 such that for all THT and all ThT(TH), (58) with κ=κΘ implies that

ΘηH(uH)2ηH(TH\Th,uH)2orΘζH(zH)2ζH(TH\Th,zH)2. 59

The constant κΘ depends only on Cstab, Cdrel, and Θ.

The next lemma is already implicitly found in [15]. It shows that, if λctr>0 is sufficiently small, then Dörfler marking for the exact discrete solution implicitly implies Dörfler marking for the approximate discrete solution. This will turn out to be the key observation to prove optimal convergence rates. We include the proof for the convenience of the reader.

Lemma 13

Suppose (A1)–(A3). Let 0<Θ1 and 0<λctr<λ:=(1-qctr)/(qctrCstab). Define Θ:=(Θ+λctr/λ1-λctr/λ)2. Then, as soon as the iterative solver terminates (22), there hold the following statements (i)–(iv) for all 0<_ and all UT:

  • (i)

    (1-λctr/λ)η(um_)η(u)(1+λctr/λ)η(um_).

  • (ii)

    Θη(um_)2η(U,um_)2    provided that Θη(u)2η(U,u)2.

  • (iii)

    (1-λctr/λ)ζ(zn_)ζ(z)(1+λctr/λ)ζ(zn_).

  • (iv)

    Θζ(zn_)ζ(U,zn_)    provided that Θζ(z)2ζ(U,z)2.

Proof

It holds that

η(U,u)(A1)η(U,um_)+Cstab|||u-um_|||(20)η(U,um_)+Cstabqctr1-qctr|||um_-um_-1|||(22)η(U,um_)+Cstabqctr1-qctrλctrη(um_)=η(U,um_)+λctrλη(um_).

The same argument proves that

η(U,um_)η(U,u)+λctrλη(um_).

For U=T, the latter two estimates lead to

(1-λctr/λ)η(um_)η(u)(1+λctr/λ)η(um_).

This concludes the proof of (i). To see (ii), we use the assumption

(1-λctr/λ)Θη(um_)(i)Θη(u)η(U,u)η(U,um_)+λctrλη(um_).

Noting that Θ=(1-λctr/λ)Θ-λctr/λ, this concludes the proof of (ii). The remaining claims (iii)–(iv) follow verbatim.

Proof of Theorem 8

By Corollary 7, it is sufficient to prove that

Cs+t=sup(,k)Q(#T-#T0+1)s+tΛkmax{uAszAt,Λ00}.

We prove this inequality in two steps.

Step 1: In this step, we bound the number of marked elements #M for arbitrary 0<_. Let Θ>0 and corresponding Θ from Lemma 13 such that

Θ=(Θ+λctr/λ1-λctr/λ)2<11+Cstab2Cdrel2. 60

Let Th()T(T) be the corresponding mesh as in Lemma 11. With Lemma 12, this yields that

Θη(u)2η(T\Th(),u)2orΘζ(z)2ζ(T\Th(),z)2.

Lemma 13 with U=T\Th() shows that

Θη(um_)2η(T\Th(),u)2orΘζ(zn_)2ζ(T\Th(),z)2. 61

We consider the marking strategies from Remark 2 separately.

For strategy (a), we have with Θ:=2θ and assumption (32) that (60) is satisfied. Hence, (61) implies that there holds (17), i.e.,

2θη(um_)2ζ(zn_)2η(T\Th(),um_)2ζ(zn_)2+η(um_)2ζ(T\Th(),zn_)2.

By assumption of Theorem 8, M is essentially minimal with (17). We infer that

#MCmark#(T\Th())#Th()-#T. 62

For the strategies (b)–(c), we set Θ=θ and note that assumption (32) (as well as the weaker assumption (34)) imply (60), and hence (61). Again, by assumption of Theorem 8, M is chosen essentially minimal (with an additional factor two for the strategy (c)) such that (61) holds. For all three strategies, we therefore conclude that

graphic file with name 211_2022_1334_Equ136_HTML.gif

Recall that () and (22) give that η(uk_)ζ(zk_)Λk_. This finally shows that

#M(uAszAt)1/(s+t)(Λk_)-1/(s+t).

Step 2: Let (,k)Q. First, we consider >0 and thus #T>#T0. The closure estimate and Step 1 prove that

#T-#T0+1#T-#T0=0-1#M(uAszAt)1/(s+t)=0-1(Λk_)-1/(s+t)(uAszAt)1/(s+t)(,k)Q|(,k)||(,k)|(Λk)-1/(s+t).

Linear convergence of Theorem 6, further shows that

#T-#T0+1(uAszAt)1/(s+t)Clin1/(s+t)(Λk)-1/(s+t)(,k)Q|(,k)||(,k)|(qlin1/(s+t))|(,k)|-|(,k)|(uAszAt)1/(s+t)Clin1/(s+t)1-qlin1/(s+t)Clin1/(s+t)(Λk)-1/(s+t).

Rearranging this estimate, we see that

(#T-#T0+1)s+tΛkuAszAtfor all(,k)Qwith>0.

It remains to consider =0. By Theorem 6, we have that

(#T-#T0+1)s+tΛk=Λ0kΛ00for all(,k)Qwith=0.

This concludes the proof.

Acknowledgements

The authors thankfully acknowledge support by the Austrian Science Fund (FWF) through the doctoral school Dissipation and dispersion in nonlinear PDEs (grant W1245), the SFB Taming complexity in partial differential systems (grant SFB F65), the stand-alone project Computational nonlinear PDEs (grant P33216), and the Erwin Schrödinger Fellowship Optimal adaptivity for space-time methods (grant J4379).

Funding Information

Open access funding provided by Austrian Science Fund (FWF).

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Roland Becker, Email: roland.becker@univ-pau.fr.

Gregor Gantner, Email: g.gantner@uva.nl.

Michael Innerberger, Email: michael.innerberger@asc.tuwien.ac.at.

Dirk Praetorius, Email: dirk.praetorius@asc.tuwien.ac.at.

References

  • 1.Binev P, Dahmen W, DeVore R. Adaptive finite element methods with convergence rates. Numer. Math. 2004;97(2):219–268. doi: 10.1007/s00211-003-0492-7. [DOI] [Google Scholar]
  • 2.Becker R, Estecahandy E, Trujillo D. Weighted marking for goal-oriented adaptive finite element methods. SIAM J. Numer. Anal. 2011;49(6):2451–2469. doi: 10.1137/100794298. [DOI] [Google Scholar]
  • 3.Becker R, Rannacher R. An optimal control approach to a posteriori error estimation in finite element methods. Acta Numer. 2001;10:1–102. doi: 10.1017/S0962492901000010. [DOI] [Google Scholar]
  • 4.Bangerth W, Rannacher R. Adaptive finite element methods for differential equations. Basel: Birkhäuser; 2003. [Google Scholar]
  • 5.Carstensen C, Feischl M, Page M, Praetorius D. Axioms of adaptivity. Comput. Math. Appl. 2014;67(6):1195–1253. doi: 10.1016/j.camwa.2013.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Cascon JM, Kreuzer C, Nochetto RH, Siebert KG. Quasi-optimal convergence rate for an adaptive finite element method. SIAM J. Numer. Anal. 2008;46(5):2524–2550. doi: 10.1137/07069047X. [DOI] [Google Scholar]
  • 7.Cascón JM, Nochetto RH. Quasioptimal cardinality of AFEM driven by nonresidual estimators. IMA J. Numer. Anal. 2012;32(1):1–29. doi: 10.1093/imanum/drr014. [DOI] [Google Scholar]
  • 8.Chen L, Nochetto RH, Jinchao X. Optimal multilevel methods for graded bisection grids. Numer. Math. 2012;120(1):1–34. doi: 10.1007/s00211-011-0401-4. [DOI] [Google Scholar]
  • 9.Dörfler W. A convergent adaptive algorithm for Poisson’s equation. SIAM J. Numer. Anal. 1996;33(3):1106–1124. doi: 10.1137/0733054. [DOI] [Google Scholar]
  • 10.Eriksson K, Estep D, Hansbo P, Johnson C. Introduction to adaptive methods for differential equations. Acta Numer. 1995;4:105–158. doi: 10.1017/S0962492900002531. [DOI] [Google Scholar]
  • 11.Feischl M, Führer T, Praetorius D. Adaptive FEM with optimal convergence rates for a certain class of nonsymmetric and possibly nonlinear problems. SIAM J. Numer. Anal. 2014;52(2):601–625. doi: 10.1137/120897225. [DOI] [Google Scholar]
  • 12.Feischl M, Gantner G, Haberl A, Praetorius D, Führer T. Adaptive boundary element methods for optimal convergence of point errors. Numer. Math. 2016;132(3):541–567. doi: 10.1007/s00211-015-0727-4. [DOI] [Google Scholar]
  • 13.Führer T, Praetorius D. A linear Uzawa-type FEM-BEM solver for nonlinear transmission problems. Comput. Math. Appl. 2018;75(8):2678–2697. doi: 10.1016/j.camwa.2017.12.035. [DOI] [Google Scholar]
  • 14.Feischl M, Praetorius D, van der Zee KG. An abstract analysis of optimal goal-oriented adaptivity. SIAM J. Numer. Anal. 2016;54(3):1423–1448. doi: 10.1137/15M1021982. [DOI] [Google Scholar]
  • 15.Gantner G, Haberl A, Praetorius D, Stiftner B. Rate optimal adaptive FEM with inexact solver for nonlinear operators. IMA J. Numer. Anal. 2018;38:1797–1831. doi: 10.1093/imanum/drx050. [DOI] [Google Scholar]
  • 16.Gantner G, Haberl A, Praetorius D, Schimanko S. Rate optimality of adaptive finite element methods with respect to overall computational costs. Math. Comp. 2021;90(331):2011–2040. doi: 10.1090/mcom/3654. [DOI] [Google Scholar]
  • 17.Giles, M., Süli, Endre: Adjoint methods for PDEs: a posteriori error analysis and postprocessing by duality. Acta Numer. 11, 145–236 (2002)
  • 18.Karkulik M, Pavlicek D, Praetorius D. On 2D newest vertex bisection: optimality of mesh-closure and H1-stability of L2-projection. Constr. Approx. 2013;38(2):213–234. doi: 10.1007/s00365-013-9192-4. [DOI] [Google Scholar]
  • 19.Morin P, Nochetto RH, Siebert KG. Data oscillation and convergence of adaptive FEM. SIAM J. Numer. Anal. 2000;38(2):466–488. doi: 10.1137/S0036142999360044. [DOI] [Google Scholar]
  • 20.Mommer MS, Stevenson R. A goal-oriented adaptive finite element method with convergence rates. SIAM J. Numer. Anal. 2009;47(2):861–886. doi: 10.1137/060675666. [DOI] [Google Scholar]
  • 21.Pfeiler CM, Praetorius D. Dörfler marking with minimal cardinality is a linear complexity problem. Math. Comp. 2020;89(326):2735–2752. doi: 10.1090/mcom/3553. [DOI] [Google Scholar]
  • 22.Schimanko, S.: On rate-optimal adaptive algorithms with inexact solvers. PhD thesis, TU Wien, Institute of Analysis and Scientific Computing, 2021
  • 23.Stevenson R. Optimality of a standard adaptive finite element method. Found. Comput. Math. 2007;7(2):245–269. doi: 10.1007/s10208-005-0183-0. [DOI] [Google Scholar]
  • 24.Stevenson R. The completion of locally refined simplicial partitions created by bisection. Math. Comp. 2008;77(261):227–241. doi: 10.1090/S0025-5718-07-01959-X. [DOI] [Google Scholar]
  • 25.Jinbiao W, Zheng H. Uniform convergence of multigrid methods for adaptive meshes. Appl. Numer. Math. 2017;113:109–123. doi: 10.1016/j.apnum.2016.11.005. [DOI] [Google Scholar]

Articles from Numerische Mathematik are provided here courtesy of Springer

RESOURCES