Abstract
We consider a linear symmetric and elliptic PDE and a linear goal functional. We design and analyze a goal-oriented adaptive finite element method, which steers the adaptive mesh-refinement as well as the approximate solution of the arising linear systems by means of a contractive iterative solver like the optimally preconditioned conjugate gradient method or geometric multigrid. We prove linear convergence of the proposed adaptive algorithm with optimal algebraic rates. Unlike prior work, we do not only consider rates with respect to the number of degrees of freedom but even prove optimal complexity, i.e., optimal convergence rates with respect to the total computational cost.
Mathematics Subject Classification: 65N30, 65N50, 65Y20, 41A25, 65N22
Introduction
Let be a bounded Lipschitz domain, . For given and , we consider a linear symmetric and elliptic partial differential equation
| 1 |
where is symmetric and . As usual, we assume that , that is uniformly positive definite and that the weak form (see (5) below) fits into the setting of the Lax–Milgram lemma. Standard adaptivity aims to approximate the unknown solution of (1) in the energy norm at optimal rate; see [1, 6, 7, 9, 11, 19, 23] for adaptive finite element methods (AFEMs) and [5] for an overview of available results. Instead, the quantity of interest for goal-oriented adaptivity is only some functional value of the unknown solution of (1), and the present paper aims to compute the linear goal functional
| 2 |
for given and . To approximate accurately, it is not necessary (and might even waste computational time) to accurately approximate the solution on the whole computational domain. Due to this potential decrease of computational cost, goal-oriented adaptivity is of high relevance in practice as well as in mathematical research; see, e.g., [3, 4, 10, 17] for some prominent contributions.
The present work formulates a goal-oriented adaptive finite element method (GOAFEM), where the sought goal is approximated by some computable such that
| 3 |
The earlier works [2, 12, 14, 20] are essentially concerned with optimal convergence rates for GOAFEM, where all arising linear FEM systems are solved exactly. While [12, 14] particularly aim to transfer ideas from the AFEM analysis of [5, 6] to GOAFEM for general elliptic PDEs, the seminal work [20] considers the Poisson model problem and additionally addresses the total computational cost by formulating realistic assumptions on a generic inexact solver (called GALSOLVE in [20, 23]).
The focus of the present work is also on the iterative (and hence inexact) solution of the arising FEM systems. However, we avoid any realistic assumptions on the solver, but rather rely on energy contraction per solver step, which is proved to hold for the preconditioned CG method with optimal multilevel additive Schwarz preconditioner [8] or the geometric multigrid method [25]. In the proposed GOAFEM algorithm, the termination of such a contractive iterative solver is then based on appropriate computable a posteriori error estimates by a similar criterion as in [20, 23]. We discuss several implementations of such termination criteria and prove that these allow to control the total computational cost of computing the approximate goal value , where we already stress now that , where is a FEM approximation of and is a residual correction related to inexact solution of the FEM formulation. While [20] shows algebraic convergence with optimal rates (in the present setting of FEM on quasi-uniform meshes) with respect to the overall computational cost for the final iterates on every level for sufficiently small adaptivity parameters (for mesh-refinement and solver termination), our main contribution is full linear convergence, i.e., linear convergence of the estimator product independently of the algorithmic decision for either mesh-refinement or solver step and even for arbitrary adaptivity parameters. An immediate consequence is that the convergence rate of the computed solutions with respect to the number of elements will be the same as with respect to the overall computational cost (i.e., the cumulative computational time). Moreover, for sufficiently small adaptivity parameters, we show convergence with optimal rates with respect to the number of elements and, hence, with respect to the overall computational cost. This extends the results of [20] to the present setting of symmetric second-order linear elliptic PDEs. Finally, we stress that, unlike [20], our GOAFEM algorithm does not require any inner loop for data approximation and therefore does not require different (but still nested) meshes for the primal and dual problem. Overall, the present paper thus provides further mathematical understanding for bridging the gap between applied GOAFEM and theoretical optimality results.
Outline In Sect. 2, we present our GOAFEM algorithm (Algorithm 3) and the details of its individual steps. This includes the details of our finite element discretization as well as the precise assumptions for the iterative solver, the marking strategy, and the error estimators. We then state in Sect. 3 that Algorithm 3 leads to linear convergence for arbitrary stopping parameters (Theorem 6) and even achieves optimal rates with respect to the total computational cost if the adaptivity parameters are sufficiently small (Theorem 8). We emphasize that linear convergence applies to all steps of the adaptive strategy, independently of whether the algorithm decides for one solver step or one step of local mesh-refinement. This turns out to be the key argument for optimal rates with respect to the total computational cost (see Corollary 7). Section 3.2 comments on alternative termination criteria for the iterative solver. Section 4 then illustrates our theoretical findings with numerical experiments. Finally, we give a proof of our main Theorems 6 and 8 in Sects. 5 and 6, respectively.
Notation In the following text, we write for if there exists a constant (which is independent of the mesh width h) such that . If there holds , we abbreviate this by . Furthermore, we denote by the cardinality of a finite set A and by the d-dimensional Lebesgue measure of a subset .
Goal-oriented adaptive finite element method
Variational formulation
Defining the symmetric bilinear form
| 4 |
we suppose that is continuous and elliptic on and thus fits into the setting of the Lax–Milgram lemma, i.e., there exist constants such that
In particular, is a scalar product that yields an equivalent norm on . The weak formulation of (1) reads
| 5 |
The Lax–Milgram lemma proves existence and uniqueness of the solution of (5). The same argument applies and proves that the dual problem
| 6 |
admits a unique solution , where the linear goal functional is defined by (2).
Remark 1
For ease of presentation, we restrict our model problem (1) to homogeneous Dirichlet boundary conditions. We note, however, that for mixed homogeneous Dirichlet and inhomogeneous Neumann boundary conditions our main results hold true with the obvious modifications. In particular, with the partition into Dirichlet boundary with and Neumann boundary , the space (and its discretization) has to be replaced by and the Neumann data has to be given in . Furthermore, the coefficient must vanish in a neighborhood of to go from the strong form (1) to the weak form (5) via integration by parts.
Finite element discretization and solution
For a conforming triangulation of into compact simplices and a polynomial degree , let
| 7 |
To obtain conforming finite element approximations and , we consider the Galerkin discretizations of (5)–(6). First, we note that the Lax–Milgram lemma yields the existence and uniqueness of exact discrete solutions , i.e., there holds that
| 8 |
In practice, the discrete systems (8) are rarely solved exactly (or up to machine precision). Instead, a suitable iterative solver is employed, which yields approximate discrete solutions . We suppose that this iterative solver is contractive, i.e., for all , it holds that
| 9 |
where is a generic constant and, in particular, independent of . Assumption (9) is satisfied, e.g., for an optimally preconditioned conjugate gradient (PCG) method (see [8]) or geometric multigrid solvers (see [25]); see also the discussion in [16]. We note that these solvers are also guaranteed to satisfy the realistic assumptions from [20, 23] (which require that any initial energy error can be improved by a factor at cost). However, while (9) is slightly less general, it allows to prove full linear convergence; see Theorem 6 below.
Discrete goal quantity
To approximate , we proceed as in [17]: For any , it holds that
Defining the discrete quantity of interest
| 10 |
the goal error can be controlled by means of the Cauchy–Schwarz inequality
| 11 |
We note that the additional term in (10) is the residual of the discrete primal problem (8) evaluated at an arbitrary function and hence .
In the following, we design an adaptive algorithm that provides a computable upper bound to (11) which tends to zero at optimal algebraic rate with respect to the number of elements as well as with respect to the total computational cost.
Mesh refinement
Let be a given conforming triangulation of . We suppose that the mesh-refinement is a deterministic and fixed strategy, e.g., newest vertex bisection [24]. For each conforming triangulation and marked elements , let be the coarsest conforming triangulation, where all have been refined, i.e., . We write , if results from by finitely many steps of refinement. To abbreviate notation, let . We note that the order on is respected by the finite element spaces, i.e., implies that .
We further suppose that each refined element has at least two sons, i.e.,
| 12 |
and that the refinement rule satisfies the mesh-closure estimate
| 13 |
where depends only on . For newest vertex bisection, this has been proved under an additional admissibility assumption on in [1, 24] and for 2D even without any additional assumption in [18]. Finally, we suppose that the overlay estimate holds, i.e., for all triangulations , there exists a common refinement which satisfies that
| 14 |
which has been proved in [6, 23] for newest vertex bisection.
Estimator properties
For and , let
be given refinement indicators. For , we use the usual convention that
| 15 |
for all and all .
We suppose that the estimators and satisfy the so-called axioms of adaptivity (which are designed for, but not restricted to, weighted-residual error estimators) from [5]: There exist constants and such that for all and all , the following assumptions are satisfied:
- Stability: For all , , and , it holds that
- Reduction: For all , it holds that
- Reliability: The Galerkin solutions to (8) satisfy that
- Discrete reliability: The Galerkin solutions and to (8) satisfy that
By assumptions (A1) and (A3), we can estimate for every discrete function the errors in the energy norm of the primal and the dual problem by
respectively, where . Together with (11), we then obtain that the goal error for approximations and in is bounded by
| 16 |
In the following sections, we provide building blocks for our adaptive algorithm that allow to control the arising estimators (by a suitable marking strategy) as well as the arising norms in the upper bound of (16) (by an appropriate stopping criterion for the iterative solver).
Marking strategy
We suppose that the refinement indicators and for some are used to mark a subset of elements for refinement, which, for fixed marking parameter , satisfies that
| 17 |
Remark 2
Given , possible choices of marking strategies satisfying assumption (17) are the following:
A more aggressive variant of (b) was proposed in [14]: Let and as above. Then, choose and with . Finally, define . Again, this strategy satisfies (17) with .
Note that our main results of Theorem 6 and 8 below hold true for all presented marking criteria (a)–(c). For our numerical experiments, we focus on criterion (a), which empirically tends to achieve slightly better performance in practice.
Adaptive algorithm
Any adaptive algorithm strives to drive down the bound in (16). However, the errors of the iterative solver, and , cannot be computed in general since the exact discrete solutions to (8) are unknown and will not be computed. Thus, we note that (9) and the triangle inequality prove that
| 20a |
as well as
| 20b |
With , (16) leads to
| 21 |
which is a computable upper bound to the goal error if . Moreover, given some , this motivates to stop the iterative solvers as soon as
to equibalance the contributions of the upper bound in (21); alternative stopping criteria are introduced and analyzed below. Overall, we thus consider the following adaptive algorithm.
Algorithm 3
Let be initial guesses. Let as well as be arbitrary but fixed marking parameters. For all , perform the following steps (i)–(vi):
-
(i)Employ (at least one step of) the iterative solver to compute iterates and together with the corresponding refinement indicators and for all , until
22 -
(ii)
Define and .
-
(iii)
If or , then define and terminate.
-
(iv)
Otherwise, find a set such that the marking criterion (17) is satisfied.
-
(v)
Generate .
-
(vi)
Define the initial guesses and for the iterative solver.
Remark 4
Theorem 6 below proves (linear) convergence for any choice of the marking parameters and , and for any of the marking strategies from Remark 2. Theorem 8 below proves optimal convergence rates (with respect to the number of elements and the total computational cost) if both parameters are sufficiently small (see (32) for the precise condition) and if the set is constructed by one of the strategies from Remark 2, where the respective sets have quasi-minimal cardinality.
Remark 5
Note that Algorithm 3(i) requires to evaluate the error estimator after each solver step. Clearly, it would be favorable to replace (resp. ) by (resp. ) in (22). Arguing as in [13, Lemma 8], this allows to prove convergence of the adaptive strategy, but full linear convergence (Theorem 6 below) and optimal convergence rates (Theorem 8 below) are exptected to fail.
For each adaptive level , Algorithm 3 performs at least one solver step to compute as well as one solver step to compute . By definition, is the solver step, for which the discrete solution is accepted (to contribute to the set of marked elements ). Analogously, is the solver step, for which the discrete solution is accepted (to contribute to ). If the iterative solver for either the primal or the dual problem fails to terminate for some level , i.e., (22) cannot be achieved for finite m, or n, we define , or , respectively, and . With , we define
| 23 |
For ease of presentation, we omit the -dependence of the indices for final iterates , , and in the following, if they appear as upper indices and write, e.g., and . If Algorithm 3 does not terminate in step (iii) for some , then we define . To formulate the convergence of Algorithm 3, we define the ordered set
| 24 |
Note that is proportional to the overall number of solver steps to compute the estimator product . Additionally, we sometimes require the notation
| 25 |
To estimate the work necessary to compute a pair , we make the following assumptions which are usually satisfied in practice:
The iterates and are computed in parallel and each step of the solver in Algorithm 3(i) can be done in linear complexity ;
Computation of all indicators and for requires steps;
The marking in Algorithm 3(iv) can be performed at linear cost (according to [23] this can be done for the strategies outlined in Remark 2 with having almost minimal cardinality; moreover, we refer to a recent own algorithm in [21] with linear cost even for having minimal cardinality);
We have linear cost to generate the new mesh .
Since a step of Algorithm 3 depends on the full history of preceding steps, the total work spent to compute is then of order
| 26 |
Finally, we note that Algorithm 3(vi) employs nested iteration to obtain the initial guesses of the solver from the final iterates for the mesh . According to (21), this allows for a posteriori error control for all indices beyond the initial step.
Main results
Linear convergence with optimal rates
Our first main result states linear convergence of the quasi-error product
| 27 |
for every choice of the stopping parameter . Recall from (16) that the quasi-error product is an upper bound for the error . Moreover, if , then () and (22) give that .
Theorem 6
Suppose (A1)–(A3). Suppose that and . Then, Algorithm 3 satisfies linear convergence in the sense of
| 28 |
The constants and depend only on , , , , and the (arbitrary) adaptivity parameters and .
Full linear convergence implies that convergence rates with respect to degrees of freedom and with respect to total computational cost are equivalent. From this point of view, full linear convergence indeed turns out to be the core argument for optimal complexity.
Corollary 7
Recall the definition of the total computational cost from (26). Let and . Then, under the assumptions of Theorem 6, it holds that
| 29 |
where the constant depends only on r, , and on the constants from Theorem 6.
Proof
The first two estimates in (29) are obvious. It remains to prove the last estimate in (29). To this end, note that it follows from the definition of that
Moreover, elementary algebra yields that
For , Theorem 6 and the geometric series thus show that
With , this gives that
This shows the final inequality in (29) and thus concludes the proof.
If and are small enough, we are able to show that linear convergence from Theorem 6 even guarantees optimal rates with respect to both the number of unknowns and the total cost . Given , let be the set of all with . With
| 30a |
and
| 30b |
for all , there holds the following result.
Theorem 8
Recall the definition of the total computational cost from (26). Suppose the mesh properties (12)–(14) as well as the axioms (A1)–(A4). Define
| 31 |
Let both adaptivity parameters and be sufficiently small such that
| 32 |
Let . Suppose that the set of marked elements in Algorithm 3(iv) is constructed by one of the strategies from Remark 2(a)–(c), where the sets in (18) and (19) have up to the factor minimal cardinality. Let with . Then, there exists a constant such that
| 33 |
The constant depends only on , , , , , , , , , , s, and t.
Remark 9
The constraint (32) is enforced by our analysis of the marking strategy from Remark 2(a), while the marking strategies from Remark 2(b)–(c) allow to relax the condition to
| 34 |
Alternative termination criteria for iterative solver
The above formulations of Algorithm 3 stops the iterative solver for and the iterative solver for independently of each other as soon as the respective termination criteria in (22) are satisfied. In this section, we briefly discuss two alternative termination criteria:
Stronger termination: The current proof of linear convergence (and of the subsequent proof of optimal convergence) does only exploit that and satisfy the stopping criterion and the previous iterates do not (cf. Lemma 10(iii)). This can also be ensured by the following modification of Algorithm 3(i):
-
(i)Employ the iterative solver to compute iterates and together with the corresponding refinement indicators and for all , until
35
Note that this will lead to more solver steps, since now (if it exists) is the smallest index for which the stopping criterion holds simultaneously for both and .
Inspecting the proof of Lemma 10 below, we see that all results hold verbatim also for this stopping criterion. Thus, we conclude linear and optimal convergence (in the sense of Theorem 6 and Theorem 8) also in this case.
Natural termination: The following stopping criterion (which is somehow the most natural candidate) also leads to linear convergence: Let be minimal with (22). If either of them do not exist, we set again , or , respectively. Define . Then, employ the iterative solver times for both the primal and the dual problem, i.e., the solver provides iterates and until both stopping criteria in (22) have been satisfied once (which avoids the artificial definition (23)). For instance, if , we continue to iterate for the primal problem until is obtained (or never stop the iteration if ). If is sufficiently small such that , then we can define
and we can guarantee the stopping condition (22) with the larger constant , i.e.,
| 36 |
see the proof below. Again, we notice that then the assumptions of Lemma 10 below are met. Hence, we conclude linear convergence (in the sense of Theorem 6) also for this stopping criterion. Moreover, optimal rates in the sense of Theorem 8 hold if in (32) is replaced by .
Proof of (36)
Without loss of generality, let us assume that . First, we have that
Then, using the fact that satisfies the stopping criterion in (22) and stability (A1), we get that
For we can absorb the last term to obtain
Finally, we observe that
Combining the last two estimates we obtain that
Hence, (36) follows with and .
Numerical examples
In this section, we consider two numerical examples which solve the equation
| 37 |
where and is the element-wise outwards facing unit normal vector. We refer the reader to Remark 1 for a comment on the applicability of our results to this model problem. We further suppose that the goal functional is a slight variant of the one proposed in [20], i.e.,
| 38 |
with a subset and a fixed direction . Moreover, for error estimation, we employ standard residual error estimators, which in our case, for all and all , read
where is the local mesh-width and denotes the jump across interior edges. It is well-known [5, 14] that and satisfy the assumptions (A1)–(A4). The examples are chosen to showcase the performance of the proposed GOAFEM algorithm for different types of singularities.
Throughout this section, we solve (37) as well as the corresponding dual problem numerically using Algorithm 3, where we make the following choices:
We solve the problems on the lowest order finite element space, i.e., with polynomial degree .
As initial values, we use .
To solve the arising linear systems, we use a preconditioned conjugate gradient (PCG) method with an optimal additive Schwarz preconditioner. We refer to [8, 22] for details and, in particular, the proof that this iterative solver satisfies (9).
We use the marking criterion from Remark 2(a) and choose such that it has minimal cardinality.
Unless mentioned otherwise, we use and .
Singularity in goal functional only
In our first example, the primal problem is (37) with on the unit square , and (and thus, ). For this problem, the exact solution reads
The goal functional is (38) with and . The exact goal value can be computed analytically to be
The initial mesh as well as a visualization of the set can be seen in Fig. 1.
Fig. 1.

Left: Initial mesh . The shaded area is the set from Section (). Right: Mesh after 14 iterations of Algorithm 3 with
For this setting, we compare our iterative solver to a conjugate gradient method without preconditioner in Fig. 2, where we plot the computable upper bound from (21),
over for all iterates and the estimator product for the final iterates over . We stress that, for , the computable upper bound and the quasi-error product from (27) are related by so that linear convergence (28) with optimal rates (33) of also yields linear convergence with optimal rates of . Since in our experiments is small, it is plausible to assume that the final estimates on every level approximate the exact solutions sufficiently well in the sense of estimator products, i.e., (cf. Lemma 13 below) for which [14] proves optimal convergence rates with respect to . Indeed, we see optimal rates for with respect to for both solvers in Fig. 2. However, the non-preconditioned CG method fails to satisfy uniform contraction (9) and thus Theorem 8 cannot be applied. In fact, Fig. 2 shows that this method fails to drive down with optimal rates with respect to (cf. (26)), as opposed to the optimally preconditioned CG method.
Fig. 2.

Comparison between iterative solvers for the problem from Sect. 4.1. A conjugate gradient method without preconditioner (CG) leads to optimal rates with respect to for the final iterates where , but not with respect to for every . Our choice of the iterative solver (ML) achieves optimal rates with respect to both measures
Furthermore, we plot in Fig. 3 different error measures over for every iterate . This shows that the corrector term
| 39 |
(which is the residual of evaluated at the dual solution ) in the definition of the discrete goal functional (10) is indeed necessary. We see that throughout the iteration, the goal value highly oscillates and, for large values of , even shows a different rate than the over . In general, we thus cannot expect the quantity to bound the uncorrected goal-error .
Fig. 3.
Comparison between , discrete goal , primal residual evaluated at the dual solution , and direct evaluation of goal functional for every iterate and different values of . The primal residual evaluated at the dual solution is the difference between goal and discrete goal; see (10)
For the discrete goal, the corrector term compensates the oscillations of the goal functional, such that their sum decreases with the same rate as , as predicted by (21). Smaller values of imply that on every level the approximate solutions are computed more accurately, such that the corrector term becomes smaller and the effect on the rate of the goal value becomes negligible.
Geometrical singularity
Our second example is the classical example of a geometric singularity on the so-called Z-shape , where is only the re-entrant corner (cf. Fig. 4). The primal problem is (37) with and , where the exact solution in polar coordinates r(x) and of is prescribed as
The goal functional is (38) with and and can be computed directly via numerical integration to be
In Fig. 4, the initial triangulation as well as the mesh after several iterations of Algorithm 3 can be seen. The adaptive algorithm resolves the singularity at the re-entrant corner, as well as critical points of the goal functional, which are at the corners of .
Fig. 4.

Left: Initial mesh . The shaded area is the set from Section () and the Dirichlet boundary at the re-entrant corner is marked in red. Right: Mesh after 13 iterations of Algorithm 3 with
Figure 5 shows the rate of the estimator product of the final iterates over as well as the rate of over for all .
Fig. 5.

Rates of the estimator product for final iterates over and as well as goal error over for all
Proof of Theorem 6
The following core lemma extends one of the key observations of [16] to the present setting, where we stress that the nonlinear product structure of leads to technical challenges which go much beyond [16].
Lemma 10
Suppose (A1)–(A3). Then, there exist constants , and , and some scalar sequence such that the quasi-error product
satisfies the following statements (i)–(v):
-
(i)
for all .
-
(ii)
if .
-
(iii)
for all .
-
(iv)
for all .
-
(v)
for all .
The constants , , and depend only on , , , and as well as on the (arbitrary) adaptivity parameters and .
For the following proofs, we define
such that the quasi-error product reads with a free parameter which will be fixed below.
Proof of Lemma 10(i)
Recall from (23) that for all . Thus, we have that
For , on the other hand, the solution is obtained by one step of the iterative solver. From stability (A1) and solver contraction (9), we have for all that
If is chosen small enough such that , together with the trivial case , the last two equations show that
The same argument shows that
| 40 |
Multiplication of the last two estimates shows the assertion.
Proof of Lemma 10(ii)
Recall that for the index there holds (22). From the triangle inequality, we thus get for the primal estimator that
Furthermore, stability (A1) leads to
Combining the last two estimates, we see that
Together with the analogous estimate for , we conclude the proof with .
Proof of Lemma 10(iii)
Without loss of generality, suppose that and thus . Then, this yields that
With contraction of the solver (9), this leads to
From (40) for small enough, we see that . Together with the previous estimate, this shows that
| 41 |
Up to the choice of , this concludes the proof.
Proof of Lemma 10(iv)
First, we note that , according to Algorithm 3(iii) and the assumption that . From reduction of the solver (9) and nested iteration, we get that
| 42 |
and thus
| 43 |
For the estimator terms, we have with stability (A1) and reduction (A2) that
On the one hand, with , this implies that
| 44 |
On the other hand, with , we get that
| 45 |
Using (45), the corresponding estimate for the dual estimator, and the Young inequality, we obtain that
The marking criterion (17), which is applicable due to , estimates the term in brackets by zero. Thus stability (A1) leads to
| 46 |
For the mixed terms in , we have with (42) and (44) that
| 47 |
Analogously, we see that
| 48 |
Combining (43) and (46)–(48), we get that
Rearranging the terms, we obtain that
| 49 |
where the remainder term is defined as
| 50 |
Up to the choice of , this concludes the proof.
Proof of Lemma 10 (choosing ) For Lemma 10(i), we choose small enough such that . From (41) and (49) in the proofs of Lemma 10(iii)–(iv), we see that we additionally require
| 51 |
Choosing small enough, we satisfy all estimates. We define as the maximum of all terms in (51) and .
Proof of Lemma 10(v)
First, we note that from stability (A1) it follows that
| 52 |
Furthermore, Galerkin orthogonality and reliability (A3) imply that, for all with ,
| 53 |
With (52) and (53) for , we can bound the remainder term from (50) by
Next, let us recall from [5, Lemma 3.6] the quasi-monotonicity of the estimator, which follows from (A1)–(A3) and the Céa lemma, i.e., for all ,
| 54 |
For , we get by summation for all and all with that
Analogously, we see that
| 55 |
We proceed with . From (42) and the Young inequality with , we see for that
For small enough such that and all for , the geometric series proves that
and thus
Analogously, we see that . Combining all estimates with
we conclude the proof.
With the foregoing auxiliary result, we are in the position to prove linear convergence.
Proof of Theorem 6
Let . We recall the quasi-error products
from Theorem 6 and Lemma 10, respectively. Note that
which yields the equivalence
| 56 |
We first show linear convergence of . By Lemma 10(i), we can absorb the term for all . Paying attention to the possible case , this allows us to estimate
Lemma 10(iii) shows uniform reduction of the quasi-error on every level. This yields that
To estimate the sum over all levels, we use that, for the refinement step, Lemma 10(iv) shows contraction up to a remainder term. The Young inequality with and Lemma 10(i) then prove that
Choosing small enough such that , we obtain from repeatedly applying the previous estimates that
Using this estimate and a change of summation indices, the geometric series and Lemma 10(v) uniformly bound the sum over all levels by
Combining the estimates above, we obtain that
In the case , Lemma 10(i) proves that
In the case , this follows with Lemma 10(ii). In either case, the constant depends only on and from Lemma 10. Basic calculus then provides the existence of and such that
see [5, Lemma 4.9]. Finally, the claim of Theorem 6 follows from (56) with .
Proof of Theorem 8 (optimal rates)
We recall the following comparison lemma from [12]. While [12] is concerned with point errors in boundary element computations, we stress that the proof of [12, Lemma 14] works on a completely abstract level and thus is applicable here as well.
Lemma 11
([12, Lemma 14]) The overlay estimate (14) and the axioms (A1)–(A2) and (A4) yield the existence of a constant such that, given , each mesh admits some refinement such that for all , it holds that
| 57a |
| 57b |
The constant depends only on , , and .
Note that (57a) immediately implies that
| 58 |
We will employ this lemma in combination with the so-called optimality of Dörfler marking from [5].
Lemma 12
([5, Proposition 4.12]) Under (A1) and (A4), for all , there exists such that for all and all , (58) with implies that
| 59 |
The constant depends only on , , and .
The next lemma is already implicitly found in [15]. It shows that, if is sufficiently small, then Dörfler marking for the exact discrete solution implicitly implies Dörfler marking for the approximate discrete solution. This will turn out to be the key observation to prove optimal convergence rates. We include the proof for the convenience of the reader.
Lemma 13
Suppose (A1)–(A3). Let and . Define . Then, as soon as the iterative solver terminates (22), there hold the following statements (i)–(iv) for all and all :
-
(i)
.
-
(ii)
provided that .
-
(iii)
.
-
(iv)
provided that .
Proof
It holds that
The same argument proves that
For , the latter two estimates lead to
This concludes the proof of (i). To see (ii), we use the assumption
Noting that , this concludes the proof of (ii). The remaining claims (iii)–(iv) follow verbatim.
Proof of Theorem 8
By Corollary 7, it is sufficient to prove that
We prove this inequality in two steps.
Step 1: In this step, we bound the number of marked elements for arbitrary . Let and corresponding from Lemma 13 such that
| 60 |
Let be the corresponding mesh as in Lemma 11. With Lemma 12, this yields that
Lemma 13 with shows that
| 61 |
We consider the marking strategies from Remark 2 separately.
For strategy (a), we have with and assumption (32) that (60) is satisfied. Hence, (61) implies that there holds (17), i.e.,
By assumption of Theorem 8, is essentially minimal with (17). We infer that
| 62 |
For the strategies (b)–(c), we set and note that assumption (32) (as well as the weaker assumption (34)) imply (60), and hence (61). Again, by assumption of Theorem 8, is chosen essentially minimal (with an additional factor two for the strategy (c)) such that (61) holds. For all three strategies, we therefore conclude that
Recall that () and (22) give that . This finally shows that
Step 2: Let . First, we consider and thus . The closure estimate and Step 1 prove that
Linear convergence of Theorem 6, further shows that
Rearranging this estimate, we see that
It remains to consider . By Theorem 6, we have that
This concludes the proof.
Acknowledgements
The authors thankfully acknowledge support by the Austrian Science Fund (FWF) through the doctoral school Dissipation and dispersion in nonlinear PDEs (grant W1245), the SFB Taming complexity in partial differential systems (grant SFB F65), the stand-alone project Computational nonlinear PDEs (grant P33216), and the Erwin Schrödinger Fellowship Optimal adaptivity for space-time methods (grant J4379).
Funding Information
Open access funding provided by Austrian Science Fund (FWF).
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Roland Becker, Email: roland.becker@univ-pau.fr.
Gregor Gantner, Email: g.gantner@uva.nl.
Michael Innerberger, Email: michael.innerberger@asc.tuwien.ac.at.
Dirk Praetorius, Email: dirk.praetorius@asc.tuwien.ac.at.
References
- 1.Binev P, Dahmen W, DeVore R. Adaptive finite element methods with convergence rates. Numer. Math. 2004;97(2):219–268. doi: 10.1007/s00211-003-0492-7. [DOI] [Google Scholar]
- 2.Becker R, Estecahandy E, Trujillo D. Weighted marking for goal-oriented adaptive finite element methods. SIAM J. Numer. Anal. 2011;49(6):2451–2469. doi: 10.1137/100794298. [DOI] [Google Scholar]
- 3.Becker R, Rannacher R. An optimal control approach to a posteriori error estimation in finite element methods. Acta Numer. 2001;10:1–102. doi: 10.1017/S0962492901000010. [DOI] [Google Scholar]
- 4.Bangerth W, Rannacher R. Adaptive finite element methods for differential equations. Basel: Birkhäuser; 2003. [Google Scholar]
- 5.Carstensen C, Feischl M, Page M, Praetorius D. Axioms of adaptivity. Comput. Math. Appl. 2014;67(6):1195–1253. doi: 10.1016/j.camwa.2013.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cascon JM, Kreuzer C, Nochetto RH, Siebert KG. Quasi-optimal convergence rate for an adaptive finite element method. SIAM J. Numer. Anal. 2008;46(5):2524–2550. doi: 10.1137/07069047X. [DOI] [Google Scholar]
- 7.Cascón JM, Nochetto RH. Quasioptimal cardinality of AFEM driven by nonresidual estimators. IMA J. Numer. Anal. 2012;32(1):1–29. doi: 10.1093/imanum/drr014. [DOI] [Google Scholar]
- 8.Chen L, Nochetto RH, Jinchao X. Optimal multilevel methods for graded bisection grids. Numer. Math. 2012;120(1):1–34. doi: 10.1007/s00211-011-0401-4. [DOI] [Google Scholar]
- 9.Dörfler W. A convergent adaptive algorithm for Poisson’s equation. SIAM J. Numer. Anal. 1996;33(3):1106–1124. doi: 10.1137/0733054. [DOI] [Google Scholar]
- 10.Eriksson K, Estep D, Hansbo P, Johnson C. Introduction to adaptive methods for differential equations. Acta Numer. 1995;4:105–158. doi: 10.1017/S0962492900002531. [DOI] [Google Scholar]
- 11.Feischl M, Führer T, Praetorius D. Adaptive FEM with optimal convergence rates for a certain class of nonsymmetric and possibly nonlinear problems. SIAM J. Numer. Anal. 2014;52(2):601–625. doi: 10.1137/120897225. [DOI] [Google Scholar]
- 12.Feischl M, Gantner G, Haberl A, Praetorius D, Führer T. Adaptive boundary element methods for optimal convergence of point errors. Numer. Math. 2016;132(3):541–567. doi: 10.1007/s00211-015-0727-4. [DOI] [Google Scholar]
- 13.Führer T, Praetorius D. A linear Uzawa-type FEM-BEM solver for nonlinear transmission problems. Comput. Math. Appl. 2018;75(8):2678–2697. doi: 10.1016/j.camwa.2017.12.035. [DOI] [Google Scholar]
- 14.Feischl M, Praetorius D, van der Zee KG. An abstract analysis of optimal goal-oriented adaptivity. SIAM J. Numer. Anal. 2016;54(3):1423–1448. doi: 10.1137/15M1021982. [DOI] [Google Scholar]
- 15.Gantner G, Haberl A, Praetorius D, Stiftner B. Rate optimal adaptive FEM with inexact solver for nonlinear operators. IMA J. Numer. Anal. 2018;38:1797–1831. doi: 10.1093/imanum/drx050. [DOI] [Google Scholar]
- 16.Gantner G, Haberl A, Praetorius D, Schimanko S. Rate optimality of adaptive finite element methods with respect to overall computational costs. Math. Comp. 2021;90(331):2011–2040. doi: 10.1090/mcom/3654. [DOI] [Google Scholar]
- 17.Giles, M., Süli, Endre: Adjoint methods for PDEs: a posteriori error analysis and postprocessing by duality. Acta Numer. 11, 145–236 (2002)
- 18.Karkulik M, Pavlicek D, Praetorius D. On 2D newest vertex bisection: optimality of mesh-closure and -stability of -projection. Constr. Approx. 2013;38(2):213–234. doi: 10.1007/s00365-013-9192-4. [DOI] [Google Scholar]
- 19.Morin P, Nochetto RH, Siebert KG. Data oscillation and convergence of adaptive FEM. SIAM J. Numer. Anal. 2000;38(2):466–488. doi: 10.1137/S0036142999360044. [DOI] [Google Scholar]
- 20.Mommer MS, Stevenson R. A goal-oriented adaptive finite element method with convergence rates. SIAM J. Numer. Anal. 2009;47(2):861–886. doi: 10.1137/060675666. [DOI] [Google Scholar]
- 21.Pfeiler CM, Praetorius D. Dörfler marking with minimal cardinality is a linear complexity problem. Math. Comp. 2020;89(326):2735–2752. doi: 10.1090/mcom/3553. [DOI] [Google Scholar]
- 22.Schimanko, S.: On rate-optimal adaptive algorithms with inexact solvers. PhD thesis, TU Wien, Institute of Analysis and Scientific Computing, 2021
- 23.Stevenson R. Optimality of a standard adaptive finite element method. Found. Comput. Math. 2007;7(2):245–269. doi: 10.1007/s10208-005-0183-0. [DOI] [Google Scholar]
- 24.Stevenson R. The completion of locally refined simplicial partitions created by bisection. Math. Comp. 2008;77(261):227–241. doi: 10.1090/S0025-5718-07-01959-X. [DOI] [Google Scholar]
- 25.Jinbiao W, Zheng H. Uniform convergence of multigrid methods for adaptive meshes. Appl. Numer. Math. 2017;113:109–123. doi: 10.1016/j.apnum.2016.11.005. [DOI] [Google Scholar]

