Summary
We are studying the efficient solution of the system of linear equations stemming from the mass conserving stress‐yielding (MCS) discretization of the Stokes equations. We perform static condensation to arrive at a system for the pressure and velocity unknowns. An auxiliary space preconditioner for the positive definite velocity block makes use of efficient and scalable solvers for conforming Finite Element spaces of low order and is analyzed with emphasis placed on robustness in the polynomial degree of the discretization. Numerical experiments demonstrate the potential of this approach and the efficiency of the implementation.
Keywords: auxiliary space preconditioner, exact divergence‐free velocity, high order robustness, iterative solver, Stokes equations
1. INTRODUCTION
Let be a bounded domain with or 3 with Lipschitz boundary . Let and be the velocity and the pressure, respectively. Given an external body force and the double of kinematic viscosity denoted by , the velocity‐pressure formulation of the Stokes system is given by
| (1a) |
| (1b) |
where . By introducing additional matrix valued variables for the stress and , these equations can be restated as
| (2a) |
| (2b) |
| (2c) |
| (2d) |
where denotes the deviatoric part (see Section 2), and (2a) is motivated by the fact that for the solution of (1) we have . The introduction of as a Lagrange multiplier enables the derivation of discrete methods that enforce the symmetry constraint (2c) weakly, see also References 1, 2, 3. As boundary conditions, we consider Dirichlet ones for the velocity , homogenous purely for clarity of the presentation, and two kinds of outlet conditions,
| (2e) |
| (2f) |
| (2g) |
where is the identity matrix and is the tangential part of . We assume that both and at least one of or have positive measure. As usual, when , an additional condition must be imposed on the pressure to make it unique.
In recent years, divergence‐free and pressure‐robust Finite Element discretizations, that is those whose solutions fulfill (2d) strongly, and allow for pressure‐independent a‐priori error estimates respectively, have been of great interest. 4
For the velocity‐pressure formulation (1), one class of such methods are certain Hybrid Discontinuous Galerkin (HDG) methods that take the velocity in and the pressure in , that is, they only build normal continuity into the Finite Element space while the tangential continuity of the solution is enforced via Lagrange parameters. To make the resulting system for the velocity positive definite, a consistent stabilization term has to be added, often involving either a parameter that has to be sufficiently large or a lifting of the jump, see References 5, 6.
In References 7, 8, the authors presented a novel variational formulation for the Stokes equations that still takes the velocity in and pressure in , retaining the property of yielding exactly divergence‐free and pressure‐robust solutions, but is based on (2) instead of (1). This mass conserving stress‐yielding (MCS) method features a normal‐tangential continuous stress space and requires no stabilizing term. It was already remarked in the original work 7 that static condensation can be performed to eliminate certain degrees of freedom (dofs). Later, in Reference 9, the normal‐tangential continuity of was broken and instead an additional Lagrange parameter was introduced. This technique makes it possible to eliminate altogether and to reduce the problem to one for the velocity in , the pressure in , and the newly introduced which approximates the tangential velocity trace on the mesh facets. The velocity unknowns take the place of as primal variables in the condensed saddle point system, with the pressure remaining the Lagrange parameter enforcing (2d). The condensed system involves the same variables, and has the same structure as the above HDG methods, but does not require a stabilization term. The first contribution of this work is a characterization of the velocity block of the condensed system based on its relation to the velocity block resulting from an HDG method with optimal stabilization. In particular, we prove that the velocity block, as previously claimed in Reference 9 for that of a related low order MCS method, is indeed positive definite.
We then move on to the question of how to efficiently solve the condensed system and consider preconditioned Krylov space methods. Preconditioning techniques for saddle point systems based on separate preconditioners for the primal (velocity) and Lagrange (pressure) unknowns are a well studied subject, see Reference 10, and the pressure Schur complement is easily preconditioned, see Reference 11. Therefore, our focus is on identifying and analyzing suitable preconditioners for the condensed velocity block.
The literature on preconditioners for conforming methods is vast and includes, among others, domain decomposition, see Reference 12, as well as Geometric, see Reference 13, and Algebraic, see Reference 14, Multigrid methods and an even somewhat comprehensive review would be beyond the scope of this work. We will take as given that efficient and scalable solvers for conforming methods exist and are available.
Preconditioners for HDG methods are not quite as well studied in the literature, one recurring theme is the attempt to reuse conforming preconditioners for these non‐conforming spaces. For example, a non‐nested Multigrid method with conforming coarse grid spaces was studied in Reference 15, and auxiliary space preconditioners (ASP, see Reference 16) that also feature a conforming sub‐space were considered in Reference 17.
The idea at the heart of both approaches is to decompose functions in the non‐conforming space into a conforming component plus a (small) remainder and to address them separately with some pre‐existing conforming preconditioner and a simple, computationally inexpensive method such as (Block‐)Jacobi, respectively.
The principal focus in this work is on the introduction and analysis of ASPs for the MCS method. The main improvement over the theory in Reference 17 is that the analysis of the velocity preconditioners extends techniques from Reference 18 and is explicit in the polynomial degree of the discretization. In particular, the main result, Theorem 3, states that the condition number of a particular ASP is bounded by , where is the polynomial degree of the discretization and is a constant stemming from the relation between condensed MCS and HDG norms.
We close out the discussion with numerical experiments that demonstrate the robustness and scalability of the proposed preconditioners. It is a testament to the elegance and simplicity of the ASP method that we were able to scale the computations to a relatively large scale by leveraging existing, scalable and highly Performant software.
Outline
We gather notation used throughout this work in Section 2 and introduce various Finite Element spaces and norms in Section 3 which also contains some useful technical results. Section 4 reviews the MCS method itself and contains a thorough discussion of static condensation as well as results on the obtained condensed systems. Approaches for preconditioning saddle point matrices with separate preconditioners for the primal unknowns and Lagrange multipliers as well as the method of auxiliary space preconditioning are recalled in Section 5. The main results can be found in Section 6, where different variations of ASPs for the velocity block of the Stokes system are discussed. In Section 7, we sketch the treatment of the lowest order case which is not covered by the theory developed in previous sections. Finally, numerical experiments are performed in Section 8.
2. NOTATION
With denoting the vector space of real matrices, we define the subsets of skew‐symmetric and skew‐symmetric trace‐free matrices by
where denotes the transpose and the identity matrix. To differentiate between scalar‐, vector‐ and matrix‐valued functions on some subset we include the range in the notation for the latter two while we omit it for the former one, i.e. where denotes the space of square integrable ‐valued scalar functions, the spaces and denote the analogous vector‐ and matrix‐valued spaces. Similarly, , and so forth, denote the set of scalar‐, vector‐ or matrix‐valued polynomials up to degree on . We use the notation for the ‐inner product on and set . The ‐orthogonal projection onto (the range should be clear from context) is denoted by and we will occasionally omit the subscript. Similarly, the ‐orthogonal projector onto the (restrictions to of) the rigid body modes is written as .
In the following, let , , and be smooth scalar‐, vector‐, and matrix‐valued functions, respectively. The operator is to be understood from context as resulting in either in a vector whose components are or a matrix with components . For vector‐valued functions in three dimensions the operator is defined as and in two we understand it to refer to the scalar‐valued . The divergence operator is understood as for vectors and is applied row‐wise to matrices, that is, . Besides the well known trace operator and the deviatoric part we further introduce the operator by
Based on these differential operators, we use standard notation for the Sobolev spaces and with . Further, for some , a subscript “” indicates that the corresponding natural traces vanish on , and we use only the zero subscript if .
We denote by a quasi‐uniform and shape regular triangulation of the domain into simplices. Let denote the maximum of the diameters of all elements in . The set of element interfaces and boundaries, or facets, is denoted by and the set of facets of a particular element is . By an abuse of notation, we shall also use to denote the domain formed by union of all . We assume that the mesh resolves the domain boundary parts in the sense that with such that . This splits into boundary facets , , and , and interior facets . According to this mesh we also introduce the “broken” spaces
where, as before, we include the range explicitly, for example, as in . On each we denote by and the standard jump and mean value operators and take them to be the identity on boundary facets. On each element boundary and each facet we denote by the outward unit normal vector. The scalar normal and vector‐valued tangential traces of a sufficiently smooth function are given by and . Similarly, the normal‐normal and normal‐tangential traces of a smooth matrix‐valued function are and .
We write functions in general Sobolev spaces as , and so forth, discrete functions with a subscript as , and so forth, and their via Galerkin isomorphism identified coefficient vectors w.r.t to some given Finite Element basis as , and so forth. For readability of the presentation we make no difference between row and column vectors and, for example, write for the coefficient vector of which should strictly speaking be the column vector . Similarly, operators are capital letters , and so forth, their discrete counterparts , and so forth, and the corresponding Finite Element matrices , etc. Occasionally, when it is useful to emphasize the Galerkin isomorphism we use , for example, or .
Finally, throughout this work we write when there exists a constant independent of the mesh size and the viscosity such that and . For example, due to quasi‐uniformity we have . For two elliptic operators (or symmetric and positive definite matrices ) we take to mean that the maximum eigenvalue of the generalized eigenvalue problem is bounded by a constant similarly independent of and . Note that in inequalities related to discrete functions or operators, unless explicitly stated otherwise, these constants can depend on the polynomial degree. Henceforth we assume that is a constant.
3. FINITE ELEMENTS AND NORM EQUIVALENCES
Reminding our self that the lowest order case is addressed separately in Section 7, we define the following approximation spaces for :
| (3) |
| (4) |
| (5) |
| (6) |
| (7) |
| (8) |
See Reference 19 for a detailed discussion of the ‐conforming Brezzi‐Douglas‐Marini () space appearing in the definition of . Note that, restricted to a single element , in addition to , the stress space also includes functions in with vanishing normal tangential trace (“‐bubbles”). We further define the space of divergence free velocities and the product spaces , and . Following Reference 20, for , and we write
| (9) |
where and the equivalence was shown in [Theorem 2]. 20 Where it is clear from the context which volume element is meant, we omit it from the subscript and simply write . We define Hybrid Discontinuous Galerkin (HDG) norms on and by
| (10) |
| (11) |
| (12) |
In (10), the terms for , where , weakly enforce from (2g). There holds the equivalence (see Reference 9)
| (13) |
3.1. Technical results
For readability, the technical details of this Section are moved to the Appendix.
3.1.1. Interpolation operators
A well known interpolation operator is defined by
| (14) |
where is the set of all elements that share the vertex , and is the number of such elements. Bounds for the approximation error of in ‐like norms are very standard and well known, and with a Korn inequality for broken spaces like
| (15) |
derived in Reference 21, it can easily be bounded by an like one. However, as the kernel of is controlled only by the terms, can degenerate depending on the shape of and . As it would otherwise later on enter into condition number estimates, the following Lemma 1 bounds the approximation error of independent of .
Lemma 1
There holds
(16)
See Appendix A.
A minor technical detail is our need for an interpolation operator not into but into . It can be obtained by simply interpolating into and then zeroing out degrees of freedom on via
Lemma 2
For there holds
(17)
See Appendix A.
3.1.2. Trace norms
For and an arbitrary element with we define for all discrete versions of the and the ‐norm for (scalar) HDG spaces as
In Reference 20 the authors proved the inverse estimate
| (18) |
A similar estimate can be derived for the hybrid, vector‐valued velocity space and norms involving the symmetric gradient,
| (19) |
| (20) |
The difference lies not only in the appearance of instead of but also, and more importantly, in the fact that, as , the normal trace is enforced strongly, and one has to slightly modify the strategy from Reference 20.
Corollary 1
For with there holds
(21)
See Appendix B.
4. THE MCS METHOD
The method considered in this work is based on formulation (2), where is used as a Lagrange multiplier to weakly enforce the symmetry constraint (2c), see also References 1, 2, 3. In Reference 7, a novel variational formulation of (2) without the symmetry constraint was presented where the velocity and pressure spaces were and and the stress space for the variable was defined as , where the superscript denotes the classical dual space. The variational version of (2b) then became
| (22) |
where denotes the duality pairing on . The authors showed that Finite Element approximation of in demands normal‐tangential continuity. The method described in the following is based on this variational formulation and in many ways is a variation of previous MCS methods from Reference 7, 8, 9, 22. Like in the method from Reference 9, we incorporate the normal‐tangential continuity of via a Lagrange multiplier in , similar to approaches taken in hybridized mixed methods for the Poisson problem, see Reference 23, 24, 25. For a detailed discussion on this hybridization technique see also section 7.2.2 in Reference 19. The main motivation for breaking the normal‐tangential continuity by hybridization is that it enables local, element‐wise elimination, or static condensation, of all and dofs. The resulting, condensed, system is the one we actually have to solve and are interested in preconditioning. It will therefore be discussed in great detail in Section 4.2.
The hybridized mass conserving stress‐yielding method with weakly imposed symmetry finds such that
| (23a) |
| (23b) |
| (23c) |
with the bilinear form
The first two integrals in can be interpreted as a discrete version of the duality pair given in (22) and the third weakly enforces the symmetry constraint. The last terms incorporate the normal‐tangential continuity of and the tangential part of (2f). Since
and , testing (23b) with , and all results in on all . On and the integrals vanish together with and on the remaining integrals weakly incorporate the tangential part of (2f), . For more details on boundary conditions see Remark 2.
We also define the sub problem where we leave out the divergence constraint and solve only for the velocity and stress: Find such that
| (24) |
with
Note the addition of the term in (23b) which is not present in any of the previous MCS methods. It is consistent since the solution is exactly divergence‐free (by (23c) and ) and guarantees the inf‐sup stability of by compensating for the missing trace of which can only approximate the deviatoric part of . The choice of the constant is motivated by the identity . It makes (24) a discretization of and the velocity Schur complement introduced in Section 4.2 equivalent to a linear elasticity problem, see (31) in the proof of Lemma 5. We want to stress again that preconditioners for such problems (although only for conforming methods) are standard in the literature and widely available. A different choice of constant would be possible, but changes the norm represented by the velocity Schur complement. For example, a large (penalty) parameter (i.e., with a small ) instead of would be a valid choice but would make the resulting velocity Schur complement extremely difficult to precondition as it then has the divergence constraint essentially built in.
Remark 1
The MCS method here, like the one from Reference 8 to which it is most closely related to, features insufficient inter‐element coupling for the lowest order case and is only stable for . This is discussed in detail in Reference 9, where a stable minimal order MCS method for was introduced. We treat the lowest order case separately in Section 7.
Remark 2
Reference 7 contains an exhaustive discussion of possible boundary conditions for the standard MCS method. One type of boundary condition for which it is particularly suitable are non‐homogenous viscous stress ones, where can be demanded strongly for a given tangential traction , and is incorporated as an essential boundary condition in . In the hybridized MCS method, where the normal‐tangential continuity in is no longer given, this boundary condition is instead realized by an additional right‐hand side term in (23). Since , this still strongly yields , as testing (23b) with shows.
4.1. Stability analysis
In the following we summarize the stability results for the discrete method defined above. We only prove solvability of (24), all other results follow with the same techniques and steps as in References 7, 8, 9, 22. Lemma 4, which can, just as Lemma 3, be found in the stated literature, is an inf‐sup stability result for the constraint given by the bilinear form . It is posed in the semi‐norm as, since all elements in are trace‐free, the divergence of functions in can not be controlled. Theorem 1 states that with the addition of the term in (23b) we can switch to the proper norm and (24) is solvable independently of the divergence constraint. Finally, Corollary 2, which is again already proven in the literature, provides solvability of (23) including the divergence constraint. Note that the stability analysis in this section is only robust in terms of the mesh size . Particularly the inf‐sup constant in Corollary 2 and Theorem 1 is not known to be independent of the polynomial order .
Lemma 3
There hold the continuity estimates
Lemma 4
Let be arbitrary. There exists a such that
Theorem 1
Let be arbitrary, there holds the inf‐sup stability
This follows with standard techniques, that is, using Lemma 4, Young's and Cauchy Schwarz's inequality and the norm equivalence (13).
Corollary 2
Let be arbitrary, there holds the inf‐sup stability
For a smooth exact solution we can further derive the following optimal error estimate.
Corollary 3
Let , , , and be the exact solution of the Stokes problem (2) and set for all . Let be the solution of (23) and let , then there holds the error estimate
4.2. Static condensation of local variables
We now discuss the structure of the Finite Element matrix directly obtained from the MCS method (23) and that of various Schur complements thereof. Writing and for the basis functions of and respectively and, complying with the notation for the Galerkin isomorphism introduced in Section 2, for the coefficients of with respect to the basis given by , and so forth, (23) in matrix form is
| (25) |
The right hand side vector is given by . The system matrix with its entries
is a saddle point matrix with Lagrange multipliers and enforcing (2c), (2a), the ‐continuity of , and (2d) respectively.
Static condensation of
The diagonal block for does not couple with the pressure via the incompressibility constraint and, thanks to the introduction of as additional multiplier, is block diagonal. It is also invertible since every block represents the simple projection problem of finding for some such that
| (26a) |
| (26b) |
where are the restrictions of the corresponding (discontinuous) global spaces to and is some right hand side. Standard arguments and the Brezzi theorem prove that (26) is inf‐sup stable, that is, writing
is invertible and the Schur complement is well defined and, as is block diagonal, can be computed element‐wise. Eliminating from (25) in this way leaves us with the system
| (27) |
The symmetry of is obvious and in the next Lemma 5 we show that the upper left block is also positive definite and we are now in the very standard setting of a saddle point problem with symmetric and positive definite (SPD) “‐block”. The velocity unknowns move to the position of primal variables, while the pressure remains the Lagrange parameter for the divergence constraint. After solving (27) to get , we can recover and by solving the local problems (26).
Lemma 5
The Schur complement is symmetric positive definite and with
there holds
(28)
(29)
Let be arbitrary and set , that is, the local functions are the solution of (26) with right hand side
(30) where we used an element‐wise integration by parts for . From (26b) we see and there holds
With this gives
(31) We now insert (30) into (26a) and test with . The term drops out due to (26b) and we see that
We can use (26b) again to see , as is built into and get
Thus, with (31)
It remains to prove the other direction. By Lemma 4 there exists a with such that, once again inserting (30) into (26a), we see that
and therefore there holds
Finally, for we have and the distributional terms in (30) vanish. The solutions of (26) are then simply given by and and (31) states
Remark 3
In , we have a discretization of with degrees of freedom and only. This is less reminiscent of a mixed method like MCS than of a HDG method and it is interesting to further elaborate on the relationship between the MCS method and DG and HDG methods. In general, DG and HDG methods require a stabilizing term to assure solvability. An example is the well known interior penalty method where the ‐norm of jumps, for with some sufficiently large is used. Any dependence on such a parameter is avoided here, however this is not an unique feature of the MCS method. Other DG and HDG methods that also avoid this parameter feature a lifting of the jump similar to (9) instead of its norm, see Reference 5. That lifting has to be explicitly computed and is then condense out. A final class of DG methods, for example the one in Reference 26, see also References 5, 27, features a simultaneous lifting of the jump and the fluxes. This is similar to what happens here, where both approximates the flux and automatically and canonically stabilizes the condensed system through its interaction with the tangential jumps.
Static condensation of high order velocity functions
The basis functions of can be split into two different types, see Reference 28. We write for the high order “element bubble” basis functions whose support is entirely within some element and whose normal trace on vanishes. The span of these basis functions is denoted by and we write . The remaining basis functions of have support entirely within the patch of some facet and their normal trace on all other facets in the patch vanishes. As the supports of belonging to different elements do not overlap, in
the upper left block is block diagonal and invertible. This lets us form a second, “double” Schur complement
In the bigger system (27), all degrees of freedom couple with the divergence constraint and we cannot perform this static condensation independently of the pressure variables. However, for higher order problems, implementing multiplication with via the exact factorization
| (32) |
is still advantageous. Both the left and right factors as well as are block diagonal and only instead of the larger needs to be assembled as a proper sparse matrix. We will revisit the idea of also preconditioning via this factorization in Section 6.1.
Splitting the coordinate vector of the component of into and , the norm induced by on is
| (33) |
That is, the norm induced by is just the one induced by on the energy minimal extension to dofs. The lifting operator, or (discrete) harmonic extension, maps , to the minimizer in (33):
Equivalently, writing , the operator is defined by
| (34) |
The range of is
| (35) |
and for such “discrete harmonic” or “lifted” functions there holds
| (36) |
Here we encounter a slight complication of notation: Per default, is associated with its coordinate vector , but only takes the coordinates (which determine according to (35)). The space is spanned by lifted, discrete harmonic, basis functions,
where is the set of all and the one of all basis functions. The induced Galerkin Isomorphism defines the natural operator associated with and identifies with . Where there is potential for confusion we explicitly write in contrast to .
Analogously, we define the Schur complement like norm
| (37) |
and the associated lifting operator such that
Note that both and can be defined on the entirety of but are only semi‐norms as they vanish on . Restricted to they are proper norms and equivalent.
Corollary 4
There holds
(38) for some discussed in more detail shortly in Remark 4.
Follows immediately from Lemma 5.
Remark 4
An interesting question is whether, and if so, how strongly, both the constant in the lower bound in (28) and depend on . Note that only the latter constant enters into condition number estimates in Section 6. Numerical experiments on the unit tetrahedron suggest that the former depends linearly on while there holds or possibly with some moderate . We have not further pursued a rigorous proof for this latter, admittedly crucial, fact. Such a proof would essentially require a ‐explicit version of Lemma 4 for functions in .
5. PRECONDITIONING FRAMEWORK
A final Schur complement can be formed with respect to the pressure unknowns, however this involves the inverse . With the resulting (negative) pressure Schur complement , we have the exact factorization
| (39) |
for the saddle point matrix . Solving (27) could in principle be reduced to solving separate problems for the pressure and velocity. While this is not feasible due to the appearance of in the pressure Schur complement, this line of thought still takes a prominent role in common preconditioning techniques for based on separate preconditioners for and for . See Reference 10 and the references therein for an overview of such methods. Motivated by (39), here we use
| (40) |
Note that unlike suggested by (40), the operation can be implemented such that it requires only two applications of instead of three. A rigorous analysis of for the generic saddle point case as well as a number of other, similar, preconditioners built from and can be found in Reference 29.
5.1. Pressure Schur preconditioner
From the standard Stokes‐LBB condition on using the norm , see for example in Reference 9, and the equivalence result Lemma 5, we can conclude the MCS Stokes‐LBB condition
with independent of . It is generally well known that given this LBB condition, is equivalent to the scaled mass matrix for . The bounds in that equivalence depend on a continuity constant (Lemma 3) and, more importantly, , see References 11, 30. As is completely discontinuous across elements in the MCS discretization, inverting the block‐diagonal matrix is feasible and we use . Since therefore is the primary constant determining the quality of the pressure preconditioner, it is very relevant what else it might depend on. For norms similar to the ones used here, was proven to be independent of in two dimensions in Reference 31 and numerical experiments carried out in the same work strongly suggest that the independence also holds in three dimensions. The ‐robustness is further supported by numerical experiments carried out in this work, see Section 8. There remains, however, a dependence of the domain itself, and in fact even tends to zero for certain domain shapes, for example, channels with increasingly higher aspect ratios, see Reference 32. This imposes a fundamental limitation on all methods that, like the one here, precondition the velocity problem independently of the divergence constraint.
5.2. Auxiliary space preconditioning
We give the fictitious space Lemma 6 below in the compact form it takes, for example, in Reference 14 [Theorem 6.3].
Lemma 6
Let be two real Hilbert spaces equipped with norms induced by and and let there exist a linear operator such that the continuity condition
(41) and the stability condition,
(42) hold. Then, for the preconditioner defined by , there holds the spectral estimate
(43)
The term auxiliary space method, as coined in Reference 16, refers to the case where the titular fictitious space is a product space that contains itself as a component, , so takes the form , and is a diagonal operator with and and induced norm . The stability condition (42) then demands the existence of a stable decomposition with and in the range of . The underlying idea is that the remainder in this composition is small and somehow localized and can be a computationally cheap method (or “smoother”). Often, is given by some form of additive or multiplicative Schwarz method such as (Block‐)Jacobi or (Block‐)Gauss‐Seidel. In the only relevant case here, where all involved spaces are finite dimensional and , the ASP is just
As alluded to by the subscript, is an additive preconditioner in that, given some right hand side vector and intermediate approximation with residual , one Richardson iteration with preconditioner is to perform
that is, to perform two updates additively. The multiplicative ASP is implicitly defined by performing these updates successively instead,
and then, performing another smoothing step with the adjoint smoother
yielding . Multiplication with is just performing this procedure once starting with . With symmetry ensured by the additional smoothing step, positive definiteness of follows from and which can always be achieved by scaling the component preconditioners. If is a (Block‐)Jacobi preconditioner, scaling of can be avoided by replacing it with the corresponding (Block‐)Gauss‐Seidel iteration which never over‐corrects, see Reference 33.
Lemma 7
Let an ASP with fulfill the conditions of Lemma 6, and be either self‐adjoint and positive definite with or given by (Block‐)Gauss‐Seidel iterations. Let self‐adjoint and positive definite with
(44) Then is self‐adjoint and positive definite and there holds
(45)
This is proven within the framework of space decomposition and subspace correction, see Reference 33. The analysis there rests on a strengthened Cauchy‐Schwarz type inequality and a stable decomposition. The former is implied by limited overlap of subspaces and the additional requirements posed on and and the latter is directly related to (42). See also the discussion in Reference 14 [section 6], where convergence bounds for multiplicative two‐grid Algebraic Multigrid methods are derived from the fictitious space lemma.
6. PRECONDITIONERS FOR
From the point of view of Section 5.2, a straightforward approach to preconditioning is to use the conforming low order space , where preconditioning is well understood and efficient and scalable software is widely available, as basis for an ASP. A slight complication in the analysis arises due to the non‐conformity in boundary conditions between , where tangential Dirichlet conditions on are imposed in , and , where does not feature any Dirichlet conditions. While imposing strong tangential Dirichlet conditions in would sidestep the issue and be convenient for theory, in practice this is only a simple matter when the outflow lies in an axis‐aligned plane and we can impose Dirichlet conditions in the or component. Therefore, we for now assume that and address the case separately in Lemma 9 at the end of this Section.
On , we define the bilinear form (as usual, with associated operator and Finite Element matrix ) by
| (46) |
To define the operator in (41) we need the embedding operator
| (47) |
with associated Finite Element matrix .
Corollary 5
For there holds
(48)
For and there holds from Lemma 5 and (48) follows from (29).
To establish the stable decomposition (42) we use from Lemma 2 and define
| (49) |
Corollary 6
For and there holds
(50)
Per definition of there holds and the facet terms are bounded trivially. The volume terms are bounded with Lemma 2 and the identity for the jump terms in (9).
With being of low order, robustness in the polynomial degree has to be achieved by the smoother.
Theorem 2
Let be the overlapping Block‐Jacobi preconditioner for that has one block per facet that contains all degrees of freedom associated to either or any such that . Let be an SPD preconditioner for such that . Then, conditions (41) and (42) of Lemma 6 are fulfilled for , ,
and with and . That is, for there holds
We postpone the proof of Theorem 2 to Section 6.1, where we discuss preconditioning of , as obtaining the logarithmic bound in is more natural in that context.
Remark 5
If one is satisfied with a polynomial bound in , Theorem 2 can be shown only using standard Finite Element inverse estimates and Corollaries 5 and 6.
6.1. Preconditioning via the condensed system
Using the factorization in (32) to implement multiplication with also opens up a way to precondition it. Replacing by some preconditioner yields
as a preconditioner for . From the factorization (32) it clearly follows that
and we are left with the task to precondition the “double” Schur complement . Analogues for of the preconditioners and can be constructed straightforwardly with the modified embedding operator
| (51) |
Note that the matrix is just a sub‐matrix of as does not change coefficients, see (34), with
We could modify the bilinear form in and use , which would be computable element‐wise. In that case, the exact analogue of Corollary 5 would hold. However, as we now show, this is not strictly necessary and for ease of implementation we opt to keep defined by (46).
Lemma 8
For there holds
(52)
(53)
Any restricted to is a normal bubble. At any vertex of , linearly independent components of vanish, and therefore and also vanish as a whole. This concludes the proof as only add some to the component of .
Corollary 7
For there holds
(54)
The sharp upper bound is a consequence of the energy minimization (33) and Corollary 5,
With and (52) we conclude the identity . Now, (17) and (9) show
and the rest follows form the lower bound in (38).
Corollary 8
For and either or there holds
(55)
With the readily apparent and (53) we see
This lets us insert a zero into to obtain an expression without in front,
Corollary 6 applied to shows
The proof is concluded by the energy minimization of ,
where the continuity of in the norm follows from Lemma 2. The other case works analogously.
An operator that, like , extracts a low order component out of is
| (56) |
where is the standard interpolator, see Reference 19, that is, there holds .
Corollary 9
For and there holds
(57)
Follows from the Bramble‐Hilbert Lemma, an element‐level Korn inequality and Lemma 8 with similar arguments as the previous Corollary 8.
Theorem 3
Let be the block Jacobi preconditioner for , consisting of one block per facet that contains all degrees of freedom associated to . Let be an SPD preconditioner for such that . Then, conditions (41) and (42) of Lemma 6 are fulfilled for , ,
and with and . That is, for there holds
The continuity condition (41) holds as is shown in Corollary 7 and follows from limited overlap of basis functions. For some , the choice
fulfills . The stability condition (42) is verified by showing
(58) For the second term, and Corollary 7 bound it by which is then further bounded by the norm with the continuity of in , as implied by Corollary 8, and (38) where we incur the factor . The other bound requires a more careful approach. For general , and therefore also for , the lower bound in (38) shows
(59) In the first step we bounded the facet terms in the sum, where an infimum is taken over functions with arbitrary traces on neighboring faces, by , where these traces are fixed. On the other hand, the upper inequality in (38) shows
(60) where denotes the element of that has the same coordinates as for degrees of freedom associated to and whose degrees of freedom are zero otherwise (Galerkin isomorphism ). Given the continuity of in (see Corollary 8), the crucial step is therefore to bound by , as in Corollary 1. However, Corollary 1 is only applicable if , which is not usually true for .
This does not pose a problem for low‐order functions or, crucially, their harmonic extensions, where an alternative path via an inverse inequality bypasses the trace estimate. For a low order , a standard, and necessarily ‐independent, inverse estimate is
(61) Because of the energy minimization in , the estimate holds with the same constant also for discrete harmonic extensions of these low order functions where . The right hand side can then further be bounded using Corollary 8 or Corollary 9 if takes the form of an approximation error, as required there.
Therefore, the strategy is to use the operator as defined in (56) to split the term in (58) into low and high order components. The former can then be bounded via the inverse estimate and the latter via the trace inequality, we have
For , the (low order) normal trace of the component of and the entire component are not changed by and there holds
that is . Therefore and the high order term can be simplified,
Note that we can apply Corollary 1 not only to , which is apparent from the definition of , but also to because again, as argued above, does not change the relevant traces. Therefore, (60), (21) and then (59) shows
where the continuity of used in the last estimate follows from the Bramble Hilbert Lemma as in the proof of Corollary 9. Finally, the bound
follows with (38). As for the low order term, with we see
and applying (61) to the (harmonic extension of) the low order function gives
where we write . We further split into and and get
Corollary 8 and (38) bound the former two terms,
and Corollary 9 and (38) the latter two,
Proof of Theorem 2
Similarly to the proof of Theorem 3, the continuity condition follows from Corollary 5, limited overlap of basis functions and this time also limited overlap of the Jacobi blocks themselves. Also similarly, the stability condition is proven by setting and using Corollary 6. The bound via the triangle inequality follows from
which was already shown in the proof of Theorem 3, and the estimate
The latter holds because is a normal bubble, that is all its coupling degrees of freedom are zero, and restricted to such functions is block diagonal.
Corollary 10
Let and be the multiplicative versions of and , respectively, with the Block‐Jacobi smoothers , replaced by Block‐Gauss‐Seidel sweeps and let . Then there holds
(62)
(63)
The former result (62) follows from Theorem 2 and Lemma 7, where condition (44) is fulfilled due to and Corollary 5. The latter one (63) follows along the same lines with Theorem 3 and the strict upper bound in (54) for Lemma 7.
Remark 6
Although we have only experimental evidence that the constant in Theorems 2 and 3 is benign, the proofs of these theorems show that in the and norm they hold independently of . That is, we have results for ASPs for HDG methods with optimal stabilization that are explicit and robust in .
6.2. Non‐conformity in boundary conditions
We now return to the case of . Instead of enforcing zero tangential Dirichlet conditions on in , it suffices to add a tangential penalty to and for to zero out degrees of freedom on .
Lemma 9
For some , let be defined by the modified bilinear form
and be the operator that zeros out degrees of freedom on . Then, for large enough there holds
These estimates are robust in .
With the upper bound in (28) and (9) there holds
that is for large enough we have . The lower bound similarly follows from the lower bound in (28) and the fact that, as is of low order, the high order terms in (9) vanish and there holds
with a ‐robust constant. The estimates for the ‐norm follow from the ones for the ‐norm with energy minimization as in the proof of Corollary 7.
Modifying and the embedding operators like this, one shows the proofs of Section 6 also for the case .
7. THE LOWEST ORDER CASE
The MCS method of Section 4 is, as already mentioned there, not stable in the lowest order case . Stability of the method is recovered when a simplified stress tensor is used in (1a), but we are interested in treating the full symmetric stress tensor . The five coupling degrees of freedom per facet we have for , that is three in and two enforced by , are too few to capture the six rigid body modes.
In Reference 9, this was remedied by using a vector‐valued instead of the ‐valued one here, which just means that all occurrences of have to be replaced by everywhere, and taking it as a subset of ,
providing the missing coupling degree of freedom per facet. Motivated by the fact that the divergence of vanishes for the true solution , a consistent stabilizing term was added to the bilinear form. We only briefly sketch how to adapt the preconditioners and their analysis developed here. Since has a coupling degree of freedom per facet, remains after static condensation and is a system for . The norm in is
this is justified by the discrete Korn inequality
introduced in Reference 9 [lemma 3.1]. We only need to change the "embedding" operator which now has a component and projects into the component as for the piecewise tangential trace ,
The analysis also needs to be only slightly modified using the equivalence
introduced together with the Korn inequality in Reference 9 [lemma 3.1].
8. NUMERICAL RESULTS
We now present numerical results that were achieved using the Netgen/NGSolve meshing and Finite Element software, 34 , 35 and the Algebraic Multigrid extension library NgsAMG, 36 available from References 37, 38. The computations were performed on the Vienna Scientific Cluster (VSC4).
We considered two problems, the first of which is a standard benchmark problem from the literature (see Reference 39) where we investigate the relative performance of different ASP variations and demonstrate robustness in the polynomial degree. The second problem is a flow around an airplane model and is meant to demonstrate the effectiveness of the method even in less academic situations. For both cases, the viscosity is fixed to , the preconditioner in the conforming auxiliary space was given by a single Algebraic Multigrid V‐cycle. Instead of the difficult to parallelize Block‐Gauss‐Seidel smoothers in and , we use block versions of the scalable semi‐multiplicative ‐smoothers from Reference 40.
Although is symmetric (see Equation 40) and would be suitable for MINRES, we use GMRES as the Krylov space method since in our experience it performs better, perhaps due to the explicit orthogonalization in every iteration. We chose to solve with a relative tolerance of as is used commonly in the AMG literature (e.g., References 41, 42); Figure 1 demonstrates that we generally do not suffer notable loss of convergence until the error has been reduced by more than a factor of .
FIGURE 1.

Relative errors (left) and error reduction (right) on the y‐axis versus iteration number on the x‐axis for the setup from Section 8.1.1 on the condensed system, as in Table 1 on the right. Only every other computation is shown.
We show weak scaling results and therefore aim to keep the number of elements per core constant, however are only able to ensure this approximately because of the unstructured simplicial meshes we use.
The obtained results, listed in Tables 1, 2, 3 will be discussed in detail below. For every computation we list the number of elements in the mesh and the number of cores #P. With the dofs condensed out of the system, the relevant number of dofs is that of which we list as #D. We give the number of iterations of GMRES needed as #IT and the total time to solve the problem (excluding loading of the mesh) as . The latter is further subdivided into the setup time , which includes the assembly of all finite element matrices and the setup of the preconditioner, and the time taken for GMRES iterations.
TABLE 1.
Comparison of multiplicative ASPs for the full system and the condensed system for the channel problem with .
| Full system | Condensed system | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
#D | #P | #IT |
|
|
|
#IT |
|
|
|
|||||||
| 5.5 | 1.9 | 7 | 166 | 67.2 | 9.0 | 58.2 | 76 | 29.3 | 9.5 | 19.8 | |||||||
| 1.8 | 6.3 | 19 | 119 | 58.2 | 10.2 | 48.0 | 63 | 32.4 | 11.3 | 21.1 | |||||||
| 3.1 | 1.1 | 36 | 235 | 163.1 | 9.1 | 154.0 | 92 | 59.1 | 10.2 | 48.9 | |||||||
| 7.9 | 2.7 | 81 | 128 | 110.6 | 11.8 | 98.8 | 65 | 53.0 | 12.3 | 40.7 | |||||||
| 1.5 | 5.0 | 166 | 159 | 134.9 | 10.6 | 124.3 | 73 | 54.5 | 10.8 | 43.7 | |||||||
| 3.5 | 1.2 | 408 | 172 | 171.1 | 11.3 | 159.8 | 78 | 65.3 | 11.8 | 53.5 | |||||||
| 6.3 | 2.2 | 720 | 164 | 141.7 | 11.1 | 130.6 | 74 | 55.2 | 11.9 | 43.3 | |||||||
| 1.2 | 4.0 | 1333 | 164 | 169.2 | 12.0 | 157.2 | 75 | 68.0 | 12.9 | 55.1 | |||||||
| 2.3 | 7.9 | 2667 | 193 | 430.2 | 24.0 | 406.2 | 81 | 78.6 | 14.8 | 63.8 | |||||||
TABLE 2.
Comparison of additive and multiplicative ASPs for for the channel problem with .
| Additive | Multiplicative | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
#D | #P | #IT |
|
|
|
#IT |
|
|
|
|||||||
| 5.5 | 1.9 | 5 | 191 | 60.9 | 13.9 | 47.0 | 75 | 46.0 | 14.2 | 31.8 | |||||||
| 2.1 | 7.3 | 17 | 191 | 75.8 | 12.8 | 63.0 | 73 | 45.6 | 14.3 | 31.3 | |||||||
| 4.4 | 1.5 | 35 | 206 | 154.4 | 13.4 | 141.0 | 77 | 65.7 | 14.6 | 51.1 | |||||||
| 1.5 | 5.0 | 111 | 169 | 132.8 | 14.0 | 118.7 | 73 | 73.1 | 15.2 | 57.9 | |||||||
| 6.3 | 2.2 | 480 | 179 | 176.0 | 15.7 | 160.3 | 74 | 90.9 | 17.0 | 73.9 | |||||||
| 1.4 | 4.6 | 1040 | 209 | 261.5 | 16.7 | 244.8 | 78 | 120.0 | 18.0 | 101.9 | |||||||
| 3.5 | 1.2 | 2698 | 230 | 400.2 | 18.0 | 382.3 | 88 | 159.4 | 20.3 | 139.13 | |||||||
| 5.1 | 1.7 | 3876 | 202 | 301.4 | 18.1 | 283.3 | 84 | 151.9 | 19.4 | 132.5 | |||||||
TABLE 3.
Multiplicative ASP for with two smoothing steps for the channel problem and varying polynomial order .
|
|
|
|||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
#P | #D | #IT |
|
|
|
#D | #IT |
|
|
|
|||||||
| 8.6 | 1 | 1.2 | 86 | 11.6 | 4.2 | 7.4 | 3.1 | 49 | 23.9 | 7.9 | 16.0 | |||||||
| 3.2 | 36 | 4.2 | 82 | 21.6 | 5.8 | 15.8 | 1.1 | 53 | 42.8 | 10.2 | 32.6 | |||||||
| 7.6 | 85 | 1.0 | 79 | 23.7 | 6.6 | 17.2 | 2.6 | 52 | 54.2 | 12.0 | 42.2 | |||||||
| 2.0 | 225 | 2.7 | 81 | 27.4 | 6.9 | 20.5 | 6.9 | 57 | 61.2 | 12.1 | 49.1 | |||||||
| 6.4 | 712 | 8.4 | 82 | 30.4 | 7.3 | 23.2 | 2.2 | 58 | 67.9 | 12.8 | 55.2 | |||||||
| 2.1 | 2347 | 2.8 | 85 | 32.6 | 8.2 | 24.3 | 7.2 | 62 | 74.8 | 14.7 | 60.1 | |||||||
| 4.6 | 5165 | 6.1 | 97 | 44.8 | 12.0 | 32.8 | 1.6 | 75 | 92.1 | 15.0 | 77.1 | |||||||
| 6.5 | 7168 | 8.4 | 98 | 43.1 | 10.2 | 32.9 | 2.2 | 75 | 94.6 | 16.1 | 78.5 | |||||||
| 9.7 | 10775 | 1.3 | 106 | 66.0 | 30.1 | 35.9 | ||||||||||||
|
|
||||||||||||||||||
| 8.6 | 11 | 1.2 | 63 | 40.8 | 19.4 | 21.4 | ||||||||||||
| 4.5 | 57 | 6.3 | 65 | 48.8 | 22.9 | 25.9 | ||||||||||||
| 1.8 | 227 | 2.5 | 65 | 63.7 | 28.2 | 35.5 | ||||||||||||
| 3.2 | 397 | 4.3 | 67 | 61.4 | 27.0 | 34.4 | ||||||||||||
| 7.6 | 953 | 1.0 | 65 | 65.7 | 27.4 | 38.3 | ||||||||||||
| 1.7 | 2064 | 2.2 | 66 | 61.7 | 24.8 | 36.9 | ||||||||||||
| 6.4 | 8008 | 8.7 | 64 | 68.7 | 27.7 | 41.0 | ||||||||||||
8.1. Flow around a cylinder
This first series of computations concerns the flow around a cylinder as in Reference 39. The cuboid‐shaped channel with cylindrical obstacle , is depicted on the left in Figure 2. The boundary parts are , with split into inflow boundary , where we impose a parabolic velocity inflow and wall boundary with homogenous Dirichlet conditions.
FIGURE 2.

Channel with cylindrical obstacle (left) and airplane model (right).
8.1.1. Full versus condensed system
We first discuss whether preconditioning via as described in Section 6.1 is purely convenient for theory or also advantageous in practice. For that, we compare the multiplicative ASPs over a range of problem sizes and fixed polynomial degree . As can be clearly seen in Table 1, preconditioning via the condensed system leads to considerably better performance and is the approach we take from here on out.
8.1.2. Additive versus multiplicative ASP
The second choice is between additive and multiplicative ASPs, we again fix the polynomial degree to for the comparison in Table 2. From the results it is once again clear that the multiplicative preconditioner is superior and our method of choice going forward.
8.1.3. High order robustness
Finally, we demonstrate robustness in the polynomial degree with results for .
For the lowest order case , we cannot use the MCS method (23) from Section 4 since it is simply not stable, see also Remark 1. Instead, we employ the minimal order MCS method from Reference 9, see Section 7, where, analogous to (23), we added the consistent term .
Our choice of preconditioner, informed by previous results, is the multiplicative ASP for the condensed system, this time with two smoothing steps. Due to considerably increased memory requirements, different meshes were used for than for .
8.2. Flow around an airplane model
The computational domain here is the “air” in a cuboid‐shaped box surrounding an airplane model depicted in Figure 2, we have . The airplane itself is contained in the bounding box . Boundary conditions, similar to the last case, are imposed velocity inflow conditions on the side of the box in front of the plane , and homogenous Dirichlet conditions on . The rest of the boundary is considered as outflow, that is, . The results can be found in Table 4.
TABLE 4.
Results for the model airplane problem, , multiplicative ASP for , two smoothing steps.
|
|
#D | #P | #IT |
|
|
|
||||
|---|---|---|---|---|---|---|---|---|---|---|
| 9.2 | 3.2 | 63 | 61 | 68.4 | 24.2 | 44.3 | ||||
| 1.4 | 5.0 | 99 | 41 | 55.8 | 25.1 | 30.7 | ||||
| 1.9 | 6.7 | 132 | 44 | 67.4 | 26.5 | 40.8 | ||||
| 3.7 | 1.3 | 253 | 41 | 65.7 | 27.4 | 38.3 | ||||
| 1.2 | 4.0 | 876 | 50 | 72.3 | 25.1 | 47.2 | ||||
| 1.6 | 5.3 | 1176 | 52 | 79.1 | 25.6 | 53.5 | ||||
| 3.0 | 1.0 | 2261 | 53 | 82.6 | 26.6 | 56.0 |
9. CONCLUSIONS
In this work, we introduced and analyzed a series of auxiliary space preconditioners for certain mass conserving stress‐yielding discretizations of the Stokes equations. In the norm induced by these MCS methods, the analysis is mostly explicit in the polynomial degree and even yields completely explicit results in the norm induced by certain hybrid discontinuous Galerkin methods that feature optimal stabilization. Numerical experiments demonstrate the robustness of the preconditioners in the polynomial degree.
CONFLICT OF INTEREST STATEMENT
The authors declare that they have no conflicts of interest.
ACKNOWLEDGMENTS
The authors have been partially funded by the Austrian Science Fund (FWF) through the research program “Taming complexity in partial differential systems” (F65)–project “Automated discretization in multiphysics” (P10) and the standalone project P35931‐N.
APPENDIX A. INTERPOLATION
A.1.
A standard result for , which, for example, follows from the Bramble Hilbert Lemma, the bounds for jump terms arising from nodal averaging for functions in Reference 21 [section 3], and a trace inequality is
| (A1) |
Following Reference 21 [section 3], for we define by
that is and , such that (also Reference 21 [section 3])
| (A2) |
With the element center of mass , elementary calculations show
Proof of Lemma 1
For any , define the set of element patch elements and the element patch . We write for the local interpolation operator defined only on as is on , that is, by averaging only over values from elements in . There holds and
In combination with estimate (A1) applied to on this shows
where denotes the set of interior facets of . We can further bound the volume terms by inserting and using (A2),
We see that it remains to find such that
(A3) Similar to the definition of , with a suitable is
Calculations show with , and therefore
As , there holds , i.e.
where we used and therefore, with ,
Any two elements in are connected via a path over a bounded number of other elements in , and we can bound this last sum by one over facet terms (see Figure A1):
(A4) where and denote the two elements that share the facet . Any facet is only summed up over a bounded number of times after reordering of the sum and due to the shape regularity of , (A4) holds with a single constant for all patches in . These jump terms can be bounded by an expansion of at ,
and the elementary estimate . Writing we see
Finally, an trace inequality and (A2) let us bound
in summary,
that is, (A3) holds for our specific choice of which finishes the proof.
FIGURE A1.

Example path in two dimensions for the last estimate in (A4).
Proof of Lemma 2
An inverse estimate for the piecewise linear shows
As on , and
We bound the second one with an trace inequality and a scaling argument,
where is the unique element such that . An explicit expansion of the piecewise linear at the facet center of mass shows
Finally, with on , by Lemma 1 there holds
The other volume terms in (17) are bounded analogously.
APPENDIX B. TRACE ESTIMATES
B.1.
The crucial step in the proof of (18) in Reference 20 was to construct for a given a that approximates it in and is bounded in the semi‐norm and yet . That is, implicitly, for and with , an operator
| (B1) |
was constructed. As no boundary conditions are enforced strongly in , the trace of could be extended to a function admissible for the infimum in . In addition to , of the commuting and extensions introduced in References 43, 44, 45, we need the first,
see Reference 43 [theorem 6.1], and last,
Note although that was constructed from in Reference 44 [theorem 7.1], the authors actually proved continuity in the norm.
Lemma 10
For and there exists with on and on such that
(B2)
To simplify the notation we only show a proof in three dimensions, the two‐dimensional case works analogously. It also suffices to show the estimate on a reference tetrahedron, for general tetrahedra it follows from scaling arguments.
First, we use to get a which fulfills on and on with a bounded norm,
We have no control over the tangential traces of and have to add correction terms. For all we pick two arbitrary normalized, orthogonal tangent vectors and and write the “errors” we need to compensate for on and as
We write for applied to the extension by zero to of functions that vanish on . This defines an stable extension
and construct a corrected as
where we understand to be applied to the respective trace on . The added corrections are normal bubbles because on their associated facet they are a scalar times a tangential and their trace vanishes on all others, that is, and is admissible and we need to show that if fulfills (B2).
As restricted to is just the identity, for there holds
and (B1) implies
The volume terms arising from the correction of for can be bounded with the inverse estimate for polynomials that vanish on , see Reference 45 [lemma 4.7],
and we can continue with (B1) to see
Analogously, we show these bounds for volume and trace terms for as well as instead of . In summary, we have
Proof of Lemma 1
For the minimizer in (19), a Korn inequality on shows
Choosing as in Lemma 10, finishes the proof as it is admissible for the infimum in (20) and bounds it by .
Kogler L, Lederer PL, Schöberl J. A conforming auxiliary space preconditioner for the mass conserving stress‐yielding method. Numer Linear Algebra Appl. 2023;30(5):e2503. 10.1002/nla.2503
DATA AVAILABILITY STATEMENT
The data that support the findings of this study are available from the corresponding author upon reasonable request.
REFERENCES
- 1. Stenberg R. A family of mixed finite elements for the elasticity problem. Numer Math. 1988;53(5):513–38. 10.1007/BF01397550 [DOI] [Google Scholar]
- 2. Cockburn B, Gopalakrishnan J, Guzmán J. A new elasticity element made for enforcing weak stress symmetry. Math Comp. 2010;79(271):1331–49. 10.1090/S0025-5718-10-02343-4 [DOI] [Google Scholar]
- 3. Arnold DN, Falk RS, Winther R. Mixed finite element methods for linear elasticity with weakly imposed symmetry. Math Comp. 2007;76(260):1699–723. 10.1090/S0025-5718-07-01998-9 [DOI] [Google Scholar]
- 4. John V, Linke A, Merdon C, Neilan M, Rebholz L. On the divergence constraint in mixed finite element methods for incompressible flows. SIAM Rev. 2017;59:492–544. [Google Scholar]
- 5. Stamm B, Wihler TP. hp‐optimal discontinuous Galerkin methods for linear elliptic problems. Math Comp. 2010;79(272):2117–33. 10.1090/S0025-5718-10-02335-5 [DOI] [Google Scholar]
- 6. Arnold DN, Brezzi F, Cockburn B, Marini LD. Unified analysis of discontinuous Galerkin methods for elliptic problems. SIAM J Numer Anal. 2002;39(5):1749–79. [Google Scholar]
- 7. Lederer PL. A Mass Conserving Mixed Stress Formulation for Incompressible Flows (PHD Thesis). Technical University of Vienna. Wiedner Hauptstrasse 8‐10, 1040 Vienna, Austria. 2019.
- 8. Gopalakrishnan J, Lederer PL, Schöberl J. A mass conserving mixed stress formulation for Stokes flow with weakly imposed stress symmetry. SIAM J Numer Anal. 2020;58(1):706–32. [Google Scholar]
- 9. Gopalakrishnan J, Kogler L, Lederer PL, Schöberl J. Divergence‐conforming velocity and vorticity approximations for incompressible fluids obtained with minimal inter‐element coupling. 2021. [DOI] [PMC free article] [PubMed]
- 10. Benzi M, Golub GH, Liesen J. Numerical solution of saddle point problems. Acta Numer. 2005;14:1–137. [Google Scholar]
- 11. Wathen AJ, Silvester DJ. Fast iterative solution of stabilised Stokes systems, part I: using simple diagonal preconditioners. SIAM J Numer Anal. 1993;30:630–49. [Google Scholar]
- 12. Toselli A, Widlund OB. Domain decomposition methods. 2005th ed. Springer series in computational mathematics. Berlin, Germany: Springer; 2005. [Google Scholar]
- 13. Braess D. Finite elemente–Theorie, Schnelle Löser und Anwendungen in der Elastizitätstheorie. Berlin, Heidelberg: Springer; 2013. [Google Scholar]
- 14. Xu J, Zikatanov L. Algebraic multigrid methods. Acta Numer. 2017;26:591–721. [Google Scholar]
- 15. Cockburn B, Dubois O, Gopalakrishnan J, Tan S. Multigrid for an HDG method†. IMA J Numer Anal. 2013;34(4):1386–425. 10.1093/imanum/drt024 [DOI] [Google Scholar]
- 16. Xu J. The auxiliary space method and optimal multigrid preconditioning techniques for unstructured grids. Comput. 1996;56(3):215–35. 10.1007/bf02238513 [DOI] [Google Scholar]
- 17. Fu G. Uniform auxiliary space preconditioning for HDG methods for elliptic operators with a parameter dependent low order term. SIAM J Sci Comput. 2021;43(6):A3912–37. 10.1137/20m1382325 [DOI] [Google Scholar]
- 18. Schöberl J, Lehrenfeld C. In: Apel T, Steinbach O, editors. Domain decomposition preconditioning for high order hybrid discontinuous Galerkin methods on tetrahedral meshes. Berlin, Heidelberg: Springer; 2013. p. 27–56. 10.1007/978-3-642-30316-6_2 [DOI] [Google Scholar]
- 19. Boffi D, Brezzi F, Fortin M. Mixed finite element methods and applications. Berlin, Heidelberg: Springer Science & Business Media; 2013. [Google Scholar]
- 20. Schöberl J, Lehrenfeld C. Domain decomposition preconditioning for high order hybrid discontinuous Galerkin methods on tetrahedral meshes. Advanced finite element methods and applications. Berlin Heidelberg: Springer; 2013. p. 27–56. 10.1007/978-3-642-30316-6_2 [DOI] [Google Scholar]
- 21. Brenner SC. Korn's inequalities for piecewise vector fields. Math Comp. 2004;73(247):1067–87. 10.1090/S0025-5718-03-01579-5 [DOI] [Google Scholar]
- 22. Gopalakrishnan J, Lederer PL, Schöberl J. A mass conserving mixed stress formulation for the Stokes equations. IMA J Numer Anal. 2019;40(3):1838–74. [Google Scholar]
- 23. Arnold DN, Brezzi F. Mixed and nonconforming finite element methods: implementation, postprocessing and error estimates. RAIRO Modél Math Anal Numér. 1985;19(1):7–32. [Google Scholar]
- 24. Brezzi F, Douglas J Jr, Durán R, Fortin M. Mixed finite elements for second order elliptic problems in three variables. Numer Math. 1987;51(2):237–50. 10.1007/BF01396752 [DOI] [Google Scholar]
- 25. Fraeijs de Veubeke B. Displacement and equilibrium models in the finite element method by B. Fraeijs de Veubeke, Chapter 9, pages 145–197 of Stress Analysis, Edited by O. C. Zienkiewicz and G. S. Holister, Published by John Wiley & Sons, 1965. Internat J Numer Methods Engrg. 2001;52(3):287–342. Edited by O. C. Zienkiewicz and G. S. Holister and with introductory remarks by Zienkiewicz. 10.1002/nme.339 [DOI]
- 26. Bassi F, Rebay S. A high‐order accurate discontinuous finite element method for the numerical solution of the compressible Navier–Stokes equations. J Comput Phys. 1997;131(2):267–79. 10.1006/jcph.1996.5572 [DOI] [Google Scholar]
- 27. Cockburn B, Gopalakrishnan J, Lazarov R. Unified hybridization of discontinuous Galerkin, mixed, and continuous Galerkin methods for second order elliptic problems. SIAM J Numer Anal. 2009;47(2):1319–65. [Google Scholar]
- 28. Beuchler S, Pillwein V, Zaglmayr S. Sparsity optimized high order finite element functions for H(div) on simplices. Numer Math. 2012;122(2):197–225. 10.1007/s00211-012-0461-0 [DOI] [Google Scholar]
- 29. Zulehner W. Analysis of iterative methods for saddle point problems: a unified approach. Math Comput. 2002;71:479–505. [Google Scholar]
- 30. Verfürth R. A combined conjugate gradient–multi‐grid algorithm for the numerical solution of the Stokes problem. IMA J Numer Anal. 1984;4:441–55. [Google Scholar]
- 31. Lederer P, Schöberl J. Polynomial robust stability analysis for H(div)‐conforming finite elements for the Stokes equations. arXiv preprint arXiv:161201482. 2016.
- 32. Chizhonkov EV, Olshanskii MA. On the domain geometry dependence of the LBB condition. ESAIM: Math Model Numer Anal. 2000;34(5):935–51. 10.1051/m2an:2000110 [DOI] [Google Scholar]
- 33. Xu J. Iterative methods by space decomposition and subspace correction. SIAM Rev. 1992;34(4):581–613. 10.1137/1034116 [DOI] [Google Scholar]
- 34. Schöberl J. NETGEN An advancing front 2D/3D‐mesh generator based on abstract rules. Comput Visual Sci. 1997;1(1):41–52. [Google Scholar]
- 35. Schöberl J. C++11 Implementation of Finite Elements in NGSolve. 2014. https://www.asc.tuwien.ac.at/∼schoeberl/wiki/publications/ngs‐cpp11.pdf
- 36. Kogler L, Schöberl J. An algebraic multigrid method for elasticity based on an auxiliary topology with edge matrices. Numer Linear Algebra Appl. 2021;29(1):e2408. 10.1002/nla.2408 [DOI] [Google Scholar]
- 37. Netgen/NGSolve . Software (website). https://ngsolve.org
- 38. NgsAMG . Software (website). https://github.com/LukasKogler/NgsAMG
- 39. Schäfer M, Turek S, Durst F, Krause E, Rannacher R. In: Hirschel EH, editor. Benchmark computations of laminar flow around a cylinder. Wiesbaden: Vieweg+Teubner Verlag; 1996. p. 547–66. 10.1007/978-3-322-89849-4_39 [DOI] [Google Scholar]
- 40. Baker AH, Falgout RD, Kolev TV, Yang UM. Multigrid smoothers for ultraparallel computing. SIAM J Sci Comput. 2011;33:2864–87. [Google Scholar]
- 41. Baker AH, Kolev TV, Yang UM. Improving algebraic multigrid interpolation operators for linear elasticity problems. Numer Linear Algebra Appl. 2009;17(2‐3):495–517. 10.1002/nla.688 [DOI] [Google Scholar]
- 42. Napov A, Notay Y. An algebraic multigrid method with guaranteed convergence rate. SIAM J Sci Comput. 2012;34(2):A1079–109. 10.1137/100818509 [DOI] [Google Scholar]
- 43. Demkowicz LF, Gopalakrishnan J, Schöberl J. Polynomial extension operators. Part I. SIAM J Numer Anal. 2008;46:3006–31. [Google Scholar]
- 44. Demkowicz LF, Gopalakrishnan J, Schöberl J. Polynomial extension operators. Part II. SIAM J Numer Anal. 2009;47:3293–324. [Google Scholar]
- 45. Demkowicz LF, Gopalakrishnan J, Schöberl J. Polynomial extension operators. Part III. Math Comput. 2012;81:1289–326. [Google Scholar]
- 46. Bica I. Iterative substructuring algorithms for the P‐version finite element method for elliptic problems (PHD Thesis). USA: Courant Institute of Mathematical Sciences of New York University; 1997. AAI9808273. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
