Abstract
This paper deals with the large-scale behaviour of dynamical optimal transport on -periodic graphs with general lower semicontinuous and convex energy densities. Our main contribution is a homogenisation result that describes the effective behaviour of the discrete problems in terms of a continuous optimal transport problem. The effective energy density can be explicitly expressed in terms of a cell formula, which is a finite-dimensional convex programming problem that depends non-trivially on the local geometry of the discrete graph and the discrete energy density. Our homogenisation result is derived from a -convergence result for action functionals on curves of measures, which we prove under very mild growth conditions on the energy density. We investigate the cell formula in several cases of interest, including finite-volume discretisations of the Wasserstein distance, where non-trivial limiting behaviour occurs.
Mathematics Subject Classification: Primary: 49Q22 Secondary: 49M25, 49J45, 65K10, 74Q10
Introduction
In the past decades there has been intense research activity in the field of optimal transport, both in pure mathematics and in applied areas [35, 39, 41, 42]. In continuous settings, a central result in the field is the Benamou–Brenier formula [6], which establishes the equivalence of static and dynamical optimal transport. It asserts that the classical Monge–Kantorovich problem, in which a cost functional is minimised over couplings of given probability measures and , is equivalent to a dynamical transport problem, in which an energy functional is minimised over all solutions to the continuity equation connecting and .
In discrete settings, the equivalence between static and dynamical optimal transport breaks down, and it turns out that the dynamical formulation [11, 30, 32] is essential in applications to evolution equations, discrete Ricci curvature, and functional inequalities [15–20, 33]. Therefore, it is an important problem to analyse the discrete-to-continuum limit of dynamical optimal transport in various setting.
This limit passage turns out to be highly nontrivial. In fact, seemingly natural discretisations of the Benamou–Brenier formula do not necessarily converge to the expected limit, even in one-dimensional settings [25]. The main result in [26] asserts that, for a sequence of meshes on a bounded convex domain in , an isotropy condition on the meshes is required to obtain the convergence of the discrete dynamical transport distances to . This is in sharp contrast to the scaling behaviour of the corresponding gradient flow dynamics, for which no additional symmetry on the meshes is required to ensure the convergence of discretised evolution equations to the expected continuous limit [12, 21].
The goal of this paper is to investigate the large-scale behaviour of dynamical optimal transport on graphs with a -periodic structure. Our main contribution is a homogenisation result that describes the effective behaviour of the discrete problems in terms of a continuous optimal transport problem, in which the effective energy density depends non-trivially on the geometry of the discrete graph and the discrete transport costs.
Main results
We give here an informal presentation of the main results of this paper, ignoring several technicalities for the sake of readability. Precise formulations and a more general setting can be found from Sect. 2 onwards.
Dynamical optimal transport in the continuous setting
For , let be the Wasserstein–Kantorovich–Rubinstein distance between probability measures on a metric space : for ,
where denotes the set of couplings of and , i.e., all measures with marginals and . On the torus (or more generally, on Riemannian manifolds), the Benamou–Brenier formula [3, 6] provides an equivalent dynamical formulation for , namely
| 1.1 |
where the infimum runs over all solutions to the continuity equation with boundary conditions and .
In this paper we consider general lower semicontinuous and convex energy densities under suitable (super-)linear growth conditions. (The Benamou–Brenier formula above corresponds to the special case ). For sufficiently regular curves of measures , we consider the continuous action
| 1.2 |
Here, the infimum runs over all time-dependent vector-valued measures satisfying the continuity equation in the sense of distributions.
Dynamical optimal transport in the discrete setting
A natural discrete counterpart to (1.2) can be defined on finite (undirected) graphs . For each edge we fix a lower semicontinuous and convex energy density1. For sufficiently regular curves we then consider the discrete action
| 1.3 |
Here, the infimum runs over all time-dependent “discrete vector fields”, i.e., all anti-symmetric functions satisfying the discrete continuity equation for all , where denotes the discrete divergence. Variational problems of the form (1.3) arise naturally in the formulation of jump processes as generalised gradient flows [37].
Dynamical optimal transport on -periodic graphs
In this work we fix a -periodic graph embedded in , as in Fig. 1. For sufficiently small with , we then consider the finite graph obtained by scaling by a factor , and wrapping the resulting graph around the torus, so that the resulting graph is embedded in . We are interested in the behaviour of the rescaled discrete action, defined for curves by
| 1.4 |
As above, the infimum runs over all time-dependent “discrete vector fields” satisfying the discrete continuity equation on the rescaled graph .
Fig. 1.

A fragment of a -periodic graph . The unit cube is shown in red. In blue and in orange, respectively, and (color figure online)
Convergence of the action
One of our main results (Theorem 5.1) asserts that, as , the action functionals converge to a limiting functional of the form (1.2), with an effective energy density which depends non-trivially on the geometry of the graph and the discrete energy densities . We only require a very mild linear growth condition on the energy densities :
As , the functionals -converge to in the weak (and vague) topology of .
The precise formulation of this result involves an extension of to measures on ; see Sect. 3 below.
Let us now explain the form of the effective energy density , which is given by a cell formula. For given and , is obtained by minimising the discrete energy per unit cube among all periodic mass distributions representing , and all periodic divergence-free discrete vector fields representing j in the following sense. Set and . Then is given by
| 1.5 |
where the set of representatives consists of all -periodic functions and all -periodic discrete vector fields satisfying
| 1.6 |
Boundary value problems
Our second main result deals with the corresponding boundary value problems, which arise by minimising the action functional among all curves with given boundary conditions, as in the Benamou–Brenier formula (1.1). We define
We then obtain the following result (Theorem 5.10):
As , the minimal actions -converge to
in the weak topology of .
This result is proved under a superlinear growth condition on the discrete energy densities, which holds for discretisations of the Wasserstein distance for .
A special case of interest is the case where is a Riemannian transport distance associated to a gradient flow structure for Markov chains as in [30, 32]. In this situation, we show that the discrete transport distances converge to a 2-Wasserstein distance on the torus (Corollary 5.3). Interestingly, the underlying distance is induced by a Finsler metric, which is not necessarily Riemannian.
We also investigate transport distances with nonlinear mobility [13, 29] and their finite-volume discretisations on the torus . In the spirit of [26], we give a geometric characterisation of finite-volume meshes for which the discretised transport distances converge to the expected limit.
Compactness
The results for boundary value problems are obtained by combining our first main result with a compactness result for sequence of measures with bounded action, which is of independent interest. We obtain two results of this type.
In the first compactness result (Theorem 5.4) we assume at least linear growth of the discrete energies at infinity. Under this condition we prove compactness in the space , which consists of curves of bounded variation, with respect to the Kantorovich–Rubinstein () norm on the space of measures. The convergence holds for almost every .
In the second compactness result (Theorem 5.9), which is used in the analysis of the boundary value problems, we assume a stronger condition of at least superlinear growth on the energy densities . We then obtain compactness in the space , which consists of absolutely continuous curves with respect to the -norm. The convergence is uniform for . We refer to the “Appendix” for precise definitions of these spaces.
Related works
For a classical reference to the study of flows on networks, we refer to Ford and Fulkerson [22].
Many works are devoted to discretisations of continuous energy functionals in the framework of Sobolev and BV spaces, e.g., [1, 4, 5, 36]. Cell formulas appear in various discrete and continuous variational homogenisation problems; see, e.g., [4, 7, 9, 27, 31].
The large scale behaviour of optimal transport on random point clouds has been studied by Garcia–Trillos, who proved convergence to the Wasserstein distance [23].
Organisation of the paper
Sects. 2 and 3 contain the necessary definitions as well as the assumptions we use throughout the article in the discrete and continuous settings. Section 4 deals with the definition of the homogenised action functional. In Sect. 5 we present the rigorous statements of our main results, including the -convergence of the discrete energies to the effective homogenised limit and the compactness theorems for curves of bounded discrete energies. The proof of our main results can be found in Sect. 6 (compactness and convergence of the boundary value problems) and Sects. 7 and 8 (-convergence of ). Finally, in Sect. 9, we discuss several examples and apply our results to some common finite-volume and finite-difference discretisations.
Sketch of the proof of Theorem 5.1
In the last part of this section, we sketch the proof of our main result on the convergence of to the homogenised limit (Theorem 5.1). Crucial tools to show both the lower bound and the upper bound in Theorem 5.1 are regularisation procedures for solutions to the continuity equation, both at the discrete and at the continuous level.
In this section, we use the informal notation and to mean that the corresponding inequality holds up to a small error in , e.g., means that where as .
For and (or more generally, for ), we set . For , we denote by the unique element of satisfying . Note that defines a partition of .
To compare discrete and continuous objects, we consider embeddings of probability measures and anti-symmetric functions defined by
These embeddings preserve the continuity equation in the following sense: if , then .
We also use the notation .
Sketch of the liminf inequality. For with , consider the curve and let be the corresponding measure on space-time defined by . Suppose that vaguely in as . The goal is to show the liminf inequality
| 1.7 |
Without loss of generality we assume that for every , for some sequence of vector fields such that . As we will see in (4.11), the embedded solutions to the continuity equation define curves of measures with densities with respect to on of the form
for every . Here, is a convex combination of the functions .
As we will estimate the discrete energies at any time , for simplicity we drop the time dependence and write , , , , . A crucial step is to construct, for every , a representative
| 1.8 |
which is approximately equal to the values of close to . The lower bound (1.7) would then follow by time-integration of the static estimate
| 1.9 |
together with the lower semicontinuity of . In the last inequality we used the definition of the homogenised density , which corresponds to the minimal microscopic cost with total mass and flux j(u).
To find the sought representatives in (1.8), it may seem natural to define and by taking the values of m and in the -cube at , and insert these values at every cube in , so that the result is -periodic. Precisely:
where . This would ensure that . Unfortunately, this construction produces a vector field which does not in general belong to : indeed, while has the desired effective flux (i.e., , as given in (1.6)), it is not in general divergence-free.
To deal with this complication, we introduce a corrector field associated to , i.e., an anti-symmetric and -periodic function satisfying
| 1.10 |
whose existence we prove in Lemma 7.3.
It is clear that if we set by construction we have and , thus
To carry out this program and prove a lower bound of the form (1.9), we need to quantify the error we perform passing from to . It is evident by construction and from (1.10) that spatial and time regularity of are crucial to this purpose. For example, an -bound on the time derivative of the form (or, in other words, a Lipschitz bound in time for ) together with would imply a control on and thus a control of the error in (1.10) of the form .
This is why the key first step in our proof is a regularisation procedure at the discrete level: for any given sequence of curves of (uniformly) bounded action , we can exihibit another sequence , quantitatively close as measures and in action to the first one, which enjoy good Lipschitz and properties and for which the above explained program can be carried out.
This result is the content of Proposition 7.1 and it is based on a three-fold regularisation, that is in energy, in time, and in space (see Sect. 7.1).
Sketch of the limsup inequality. Fix . The goal is to find such that weakly in and
| 1.11 |
As in the the proof of the liminf inequality, the first step is a regularisation procedure, this time at the continuous level (Proposition 8.26). Thanks to this approximation result, we can assume without loss of generality that
| 1.12 |
where are the smooth densities of with respect to on , and denotes the interior of the domain of (see “Appendix 1”). The convexity of ensures its Lipschitz-continuity on every compact set , hence the assumption (1.12) allows us to assume such regularity for the rest of the proof.
We split the proof of the upper bound into several steps. In short, we first discretise the continuous measures and identify an optimal discrete microstructure, i.e., minimisers of the cell problem described by , on each -cube , . A key difficulty at this stage is that the optimal selection does not preserve the continuity equation, hence an additional correction is needed. For this purpose, we first apply the discrete regularisation result Proposition 7.1 to obtain regular discrete curves and then find suitable small correctors that provide discrete competitors for , i.e., solutions to which are close to the optimal selection.
Let us explain these steps in more detail.
Step 1: For every , , and each cube we consider the natural discretisation of , that we denote by , given by
An important feature of this construction is that the continuity equation is preserved from to , in the sense that
for and .
Step 2: We build the associated optimal discrete microstructure for the cell problem for each cube , meaning we select such that
where denotes the set of optimal representatives in the definition of the cell-formula (1.5). Using the smoothness of and , one can in particular show that
| 1.13 |
Step 3: The next step is to glue together the microstructures defined for every via a gluing operator (Definition 8.4) to produce a global microstructure . As the gluing operators are mass preserving and , it is not hard to see that weakly in as .
Step 4: In contrast to , the latter operation produces curves which do not in general solve the discrete continuity equation . Therefore, we seek to find suitable corrector vector fields in order to obtain a discrete solution, and thus a candidate for . For this purpose we regularise using Proposition 7.1 below. This yields a regular curve which is close in the sense of measures and in energy to the original one. Note that no discrete regularity is guaranteed for , despite the smoothness assumption on , due to possible singularities of .
For the sake of the exposition, we shall discuss the last steps of the proof assuming that already enjoy the Lipschitz and –regularity properties ensured by Proposition 7.1.
Step 5: For sufficiently regular , we seek a discrete competitor for which is close to . As the latter does not necessary belong to , we find suitable correctors such that the corrected curves belong to , with small in the sense that it satisfies the bound
| 1.14 |
The proof of existence of the corrector , together with the quantitative bound relies on a localisation argument (Lemma 8.22) and a study of the divergence equation on periodic graphs (Lemma 8.16), performed at the level of each cube , for every . The regularity of is crucial in order to obtain the estimate (1.14).
Step 6: The final step consists of estimating the action of the measures defined by weakly as , and the vector fields .
Using the regularity assumption on , the smoothness (1.12) of , and the convexity of , together with the bounds (1.13) and (1.14) for the corrector, we obtain
Using this bound and the fact that , an integration in time yields
which is the sought upper bound (1.11).
Discrete dynamical optimal transport on -periodic graphs
This section contains the definition of the optimal transport problem in the discrete periodic setting. In Sect. 2.1 we introduce the basic objects: a -periodic graph and an admissible cost function F. Given a triple , we introduce a family of discrete transport actions on rescaled graphs in Sect. 2.2.
Discrete -periodic setting
Our setup consists of the following data:
Assumption 2.1
is a locally finite and -periodic connected graph of bounded degree.
More precisely, we assume that
where is a finite set. The coordinates of will be denoted by
The set of edges is symmetric and -periodic, in the sense that
Here, is the shift operator defined by
We write whenever .
Let be the maximal edge length, measured with respect to the supremum norm on . It will be convenient to use the notation
Remark 2.2
(Abstract vs. embedded graphs) Rather than working with abstract -periodic graphs, it is possible to regard as a -periodic subset of , by choosing to be a subset of and using the identification , see Fig. 2. Since the embedding plays no role in the formulation of the discrete problem, we work with the abstract setup. Note that edges between nodes that are not in adjacent cells are also allowed.
Fig. 2.

A fragment of a -periodic graph . The blue nodes represent and the orange edges represent (color figure online)
Assumption 2.3
(Admissible cost function) The function is assumed to have the following properties:
F is convex and lower semicontinuous.
- F is local in the sense that there exists such that whenever and agree within a ball of radius , i.e.,
- F is of at least linear growth, i.e., there exist and such that
for any and . Here, .2.1 - There exist a -periodic function and a -periodic and divergence-free vector field such that
2.2
Remark 2.4
As F is local, it depends on finitely many parameters. Therefore, , the topological interior of the domain of F is defined unambiguously.
Remark 2.5
In many examples, the function F takes one of the following forms, for suitable functions and :
We then say that F is vertex-based (respectively, edge-based).
Remark 2.6
Of particular interest are edge-based functions of the form
| 2.3 |
where , the constants are fixed parameters defined for , and is a suitable mean (i.e., is a jointly concave and 1-homogeneous function satisfying ). Functions of this type arise naturally in discretisations of Wasserstein gradient-flow structures [11, 30, 32].
We claim that these cost functions satisfy the growth condition (2.1). Indeed, using Young’s inequality we infer that
with constant depending on . This shows that (2.1) is satisfied.
Rescaled setting
Let be a locally finite and -periodic graph as above. Fix such that . The assumption that remains in force throughout the paper.
The rescaled graph. Let be the discrete torus of mesh size . The corresponding equivalence classes are denoted by for . To improve readability, we occasionally omit the brackets. Alternatively, we may write where .
The rescaled graph is constructed by rescaling the -periodic graph and wrapping it around the torus. More formally, we consider the finite sets
where, for ,
| 2.4 |
Throughout the paper we always assume that , to avoid that edges in “bite themselves in the tail” when wrapped around the torus. For we will write
The rescaled energies. Let be a cost function satisfying Assumption 2.3. For satisfying the conditions above, we shall define a corresponding energy functional in the rescaled periodic setting.
First we introduce some notation, which we use to transfer functions defined on to (and from to ). Let . Each function induces a -periodic function
see Fig. 3. Similarly, each function induces a -periodic function
Fig. 3.
On the left, the values of a function correspond to different colors over the nodes. On the right, the corresponding values of (color figure online)
Definition 2.7
(Discrete energy functional) The rescaled energy is defined by
Remark 2.8
We note that is well-defined as an element in . Indeed, the (at least) linear growth condition (2.1) yields
For it will be useful to consider the shift operator and defined by
Moreover, for and we define
| 2.5 |
Definition 2.9
(Discrete continuity equation) A pair is said to be a solution to the discrete continuity equation if is continuous, is Borel measurable, and
| 2.6 |
for all in the sense of distributions. We use the notation
Remark 2.10
Lemma 2.11
(Mass preservation) Let . Then we have for all .
Proof
Without loss of generality, suppose that with . Approximating the characteristic function by smooth test functions, we obtain, for all ,
Summing the above over and using the anti-symmetry of , the result follows.
We are now ready to define one of the main objects in this paper.
Definition 2.12
(Discrete action functional) For any continuous function such that and any Borel measurable function , we define
Furthermore, we set
Remark 2.13
We claim that is well-defined as an element in . Indeed, the (at least) linear growth condition (2.1) yields as in Remark 2.8
for any . Since , the claim follows.
In particular, is well-defined whenever , since is constant by Lemma 2.11.
Remark 2.14
If the time interval is clear from the context, we often simply write and .
The aim of this work is to study the asymptotic behaviour of the energies as .
Dynamical optimal transport in the continuous setting
We shall now define a corresponding class of dynamical optimal transport problems on the continuous torus . We start in Sect. 3.1 by defining the natural continuous analogues of the discrete objects from Sect. 2. In Sect. 3.2 we define generalisations of these objects that have better compactness properties.
Continuous continuity equation and action functional
First we define solutions to the continuity equation on a bounded open time interval .
Definition 3.1
(Continuity equation) A pair is said to be a solution to the continuity equation if the following conditions holds:
-
(i)
is vaguely continuous;
-
(ii)
is a Borel family satisfying ;
-
(iii)The equation
holds in the sense of distributions, i.e., for all ,3.1
We use the notation
We will consider the energy densities f with the following properties.
Assumption 3.2
Let be a lower semicontinuous and convex function, whose domain has nonempty interior. We assume that there exist constants and such that the (at least) linear growth condition
| 3.2 |
holds for all and .
The corresponding recession function is defined by
where is arbitrary. It is well known that the function is lower semicontinuous and convex, and it satisfies
| 3.3 |
We refer to [2, Section 2.6] for a proof of these facts.
Let denote the Lebesgue measure on . For and we consider the Lebesgue decompositions given by
for some and . It is always possible to introduce a measure such that
for some and . (Take, for instance, .) Using this notation we define the continuous energy as follows.
Definition 3.3
(Continuous energy functional) Let satisfy Assumption 3.2. We define the continuous energy functional by
Remark 3.4
By 1-homogeneity of , this definition does not depend on the choice of the measure .
Definition 3.5
(Action functional) For any curve with and any Borel measurable curve we define
Furthermore, we set
Remark 3.6
As by (3.2), the assumption ensures that is well-defined in .
Remark 3.7
(Dependence on time intervals) Remark 2.14 applies in the continuous setting as well. If the time interval is clear from the context, we often simply write and .
Under additional assumptions on the function f, it is possible to prove compactness for families of solutions to the continuity equation with bounded action; see [13, Corollary 4.10]. However, in our general setting, such a compactness result fails to hold, as the following example shows.
Example 3.8
(Lack of compactness) To see this, let be the position of a particle of mass m that moves from 0 to in the time interval with constant speed . At all other times in the time interval the particle is at rest:
The associated solution to the continuity equation is given by
Let be the total momentum, which satisfies Assumption 3.2. We then have , hence , independently of .
However, as , the motion converges to the discontinuous curve given by for and for . In particular, it does not satisfy the continuity equation in the sense above.
Generalised continuity equation and action functional
In view of this lack of compactness, we will extend the definition of the continuity equation and the action functional to more general objects.
Definition 3.9
(Continuity equation) A pair of measures is said to be a solution to the continuity equation
| 3.4 |
if, for all , we have
As above, we use the notation .
Clearly, this definition is consistent with Definition 3.5.
Let us now extend the action functional as well. For this purpose, let denote the Lebesgue measure on . For and we consider the Lebesgue decompositions given by
for some and . As above, it is always possible to introduce a measure such that
| 3.5 |
for some and .
Definition 3.10
(Action functional) We define the action by
Furthermore, we set
Remark 3.11
This definition does not depend on the choice of , in view of the 1-homogeneity of . As and from (3.2) and (3.3), the fact that ensures that is well-defined in .
Example 3.12
(Lack of compactness) Continuing Example 3.8, we can now describe the limiting jump process as a solution to the generalised continuity equation. Consider the measures and defined by
Then we have and weakly, respectively, in and , where represents the discontinuous curve
The measure does not admit a disintegration with respect to the Lebesgue measure on ; in other words, it is not associated to a curve of measures on . We have
Here denotes the 1-dimensional Hausdorff measure on the (shortest) line segment connecting 0 and .
Note that solves the continuity equation, as is stable under joint weak-convergence. Furthermore, we have .
The next result shows that any solution to the continuity equation induces a (not necessarily continuous) curve of measures . The measure is not always associated to a curve of measures on ; see Example 3.12. We refer to “Appendix 1” for the definition of .
Lemma 3.13
(Disintegration of solutions to ) Let . Then for some measurable curve with finite constant mass. If , then this curve belongs to and
| 3.6 |
Proof
Let be the time-marginal of , i.e., where , . We claim that is a constant multiple of the Lebesgue measure on . By the disintegration theorem (see, e.g., [3, Theorem 5.3.1]), this implies the first part of the result.
To prove the claim, note that the continuity equation yields
| 3.7 |
for all .
Write , let be arbitrary, and set . We define . Then and . Applying (3.7) we obtain , which implies the claim, and hence the first part of the result.
To prove the second part, suppose that has finite action, and let be a solution to the continuity equation (3.4). Applying (3.4) to a test function such that , we obtain
| 3.8 |
which implies the desired bound in view of (B.14).
The next lemma deals with regularity properties for curves of measures with finite action and fine properties for the functionals defined in Definition 3.10 with .
Lemma 3.14
(Properties of ) Let be a bounded open interval. The following statements hold:
-
(i)
The functionals and are convex.
-
(ii)Let . Let be a sequence of bounded open intervals such that and as . Let be such that2
as . Then:
If, additionally, and satisfy vaguely in , then we have3.9
In particular, the functionals and are lower semicontinuous with respect to (joint) vague convergence.3.10
Proof
(i): Convexity of follows from convexity of f, , and the linearity of the constraint (3.4).
(ii): First we show (3.10). Consider the convex energy density , which is nonnegative by (2.1). Let be the corresponding action functional defined using g instead of f. Using the nonnegativity of g, the fact that , and the lower semicontinuity result from [2, Theorem 2.34], we obtain
for every open interval . Taking the supremum over , we obtain
| 3.11 |
Since we have by assumption, the desired result (3.10) follows from (3.11) and the identity
Let us now show (3.9). Let be such that and vaguely in . Let be such that and
From Lemma 3.13, we infer that where is a curve of constant total mass . Moreover, , since vaguely. The growth condition (3.2) implies that
Hence, by the Banach–Alaoglu theorem, there exists a subsequence of (still indexed by n) such that vaguely in and . Another application of Lemma 3.13 ensures that where is of constant mass .
We can thus apply the first part of (ii) to obtain
which ends the proof.
The homogenised transport problem
Throughout this section we assume that safisfies Assumption 2.1 and F safisfies Assumption 2.3.
Discrete representation of continuous measures and vector fields
To define , the following definition turns out to be natural.
Definition 4.1
(Representation)
-
(i)We say that represents if m is -periodic and
-
(ii)We say that represents a vector if
- J is -periodic;
- J is divergence-free (i.e., for all );
- The effective flux of J equals j; i.e., , where
4.1
We use the (slightly abusive) notation and . We will also write .
Remark 4.2
Let us remark that in the formula for , since .
Remark 4.3
The definition of the effective flux is natural in view of Lemmas 4.9 and 4.11 below. These results show that a solution to the continuous continuity equation can be constructed starting from a solution to the discrete continuity equation, with a vector field of the form (4.1).
Clearly, for every . It is also true, though less obvious, that for every . We will show this in Lemma 4.5 using the -periodicity and the connectivity of .
To prove the result, we will first introduce a natural vector field associated to each simple directed path P on , For an edge , the corresponding reversed edge will be denoted by .
Definition 4.4
(Unit flux through a path; see Fig. 4) Let be a simple path in , thus for , and for . The unit flux through P is the discrete field given by
| 4.2 |
The periodic unit flux through P is the vector field defined by
| 4.3 |
Fig. 4.
In the first figure, in red, the (directed) path P from to , support of the vector field . In the second one, in red, the support of the vector field and its values (color figure online)
In the next lemma we collect some key properties of these vector fields. Recall the definition of the discrete divergence in (B.15).
Lemma 4.5
(Properties of ) Let be a simple path in .
-
(i)The discrete divergence of the associated unit flux is given by
4.4 -
(ii)The discrete divergence of the periodic unit flux is given by
In particular, iff .4.5 -
(iii)
The periodic unit flux satisfies .
-
(iv)
For every we have .
Proof
(i) is straightforward to check, and (ii) is a direct consequence.
To prove (iii), we use the definition of to obtain
By construction, we have
which yields the result.
For (iv), taking , we use the connectivity and nonemptyness of to find a simple path connecting some to . The resulting is divergence-free by (ii) and by (iii), so that . For a general we have .
The homogenised action
We are now in a position to define the homogenised energy density.
Definition 4.6
(Homogenised energy density) The homogenised energy density is defined by the cell formula
| 4.6 |
For , we say that is an optimal representative if . The set of optimal representatives is denoted by
In view of Lemma 4.5, the set of representatives is nonempty for every . The next result shows that is nonempty as well.
Lemma 4.7
(Properties of the cell formula) Let . If , then the set of optimal representatives is nonempty, closed, and convex.
Proof
This follows from the coercivity of F and the direct method of the calculus of variations.
Lemma 4.8
(Properties of and ) The following properties hold:
-
(i)
The functions and are lower semicontinuous and convex.
-
(ii)There exist constants and such that, for all and ,
4.7 -
(iii)The domain has nonempty interior. In particular, for any pair satisfying (2.2), the element defined by
belongs to .4.8
Proof
(i): The convexity of follows from the convexity of F and the affinity of the constraints. Let us now prove lower semicontinuity of .
Take and sequences and converging to and j respectively. Without loss of generality we may assume that . By definition of , there exist such that . From the growth condition (2.1) we deduce that, for some ,
From the Bolzano–Weierstrass theorem we infer subsequential convergence of to some -periodic pair . Therefore, by lower semicontinuity of F, it follows that
| 4.9 |
Since , we have , which yields the desired result. Convexity and lower semicontinuity of follow from the definition, see [2, Section 2.6].
(ii) Take and . If , the assertion is trivial, so we assume that . Then there exists a competitor such that . The growth condition (2.1) asserts that
Therefore, the claim follows from the fact that
where .
(iii): Let satisfy Assumption 2.3, and define by (4.8). For , let be the coordinate unit vector. Using Lemma 4.5(iv) we take . For with sufficiently small, and we define
It follows that , and therefore, . By Assumption 2.3, the right-hand side is finite for sufficiently small. This yields the result.
The homogenised action can now be defined by taking in Definition 3.10.
Embedding of solutions to the discrete continuity equation
For and (or more generally, for ) let denote the cube of side-length based at . For and we define and by
| 4.10a |
| 4.10b |
The embeddings (4.10) are chosen to ensure that solutions to the discrete continuity equation are mapped to solutions to the continuous continuity equation, as the following result shows.
Lemma 4.9
Let solve the discrete continuity equation and define and . Then solves the continuity equation (i.e., ).
Proof
Let be smooth with compact support. Then:
On the other hand, the discrete continuity equation yields
Comparing both expressions, we obtain the desired identity in the sense of distributions.
The following result provides a useful bound for the norm of the embedded flux.
Lemma 4.10
For we have
Proof
This follows immediately from (4.11), since and for .
Note that both measures in (4.10) are absolutely continuous with respect to the Lebesgue measure. The next result provides an explicit expression for the density of the momentum field. Recall the definition of the shifting operators in (2.5).
Lemma 4.11
(Density of the embedded flux) Fix . For we have where is given by
| 4.11 |
Here, is a convex combination of , i.e.,
where and . Moreover,
| 4.12 |
Proof
Fix , let and . We have
which is the desired form (4.11) with
for with . Since the family of cubes is a partition of , it follows that .
To prove the final claim, let with as above and take with . Since , the triangle inequality yields
for . Therefore, implies , hence as desired.
Main results
In this section we present the main result of this paper, which asserts that the discrete action functionals converge to a continuous action functional with the nontrivial homogenised action density function defined in Sect. 4.
Main convergence result
We are now ready to state our main result. We use the embedding defined in (4.10a). The proof of this result is given in Sects. 7 and 8.
Theorem 5.1
(-convergence) Let be a locally finite and -periodic connected graph of bounded degree (see Assumption 2.1). Let be a cost function satisfying Assumption 2.3. Then the functionals -converge to as with respect to the weak (and vague) topology. More precisely:
-
(i)(liminf inequality) Let . For any sequence of curves with such that vaguely in as , we have the lower bound
5.1 -
(ii)(limsup inequality) For any there exists a sequence of curves with such that weakly in as , and we have the upper bound
5.2
Remark 5.2
(Necessity of the interior domain condition) Assumption 2.3 is crucial in order to obtain the -convergence of the discrete energies. Too see this, let us consider the one-dimensional graph and the edge-based cost associated with
Clearly F satisfies conditions from Assumption 2.3, but (d) fails to hold. The constraint on neighbouring forces every with to be constant in space (and hence in time, by mass preservation). Therefore, the -limit of the is finite only on constant measures , with . On the other hand, we have3 that , which corresponds to the action on the line.
It is interesting to note that if the constraint “” is replaced by a weaker one of the form “ ” for some , then all the assumptions are satisfied and our theorem can be applied. Intuitively speaking, the constraint which forces admissible curves to be constant is replaced by a constraint that merely forces admissible curves to be Lipschitz; in this case the limit coincides with the action.
See also Sect. 9.2 for a general treatment of the cell formula on the integer lattice .
Scaling limits of Wasserstein transport problems
For , recall that the energy density associated to the Wasserstein metric on is given by . This function satisfies the scaling relations and for .
In discrete approximations of on a periodic graph , it is reasonable to assume analogous scaling relations for the function F, namely and . The next result shows that if such scaling relations are imposed, we always obtain convergence to with respect to some norm on . This norm does not have to be Hilbertian (even in the case ) and is characterised by the cell problem (4.6).
Corollary 5.3
Let , and suppose that F has the following scaling properties for and :
-
(i)
for all ;
-
(ii)
for all .
Then for some norm on .
Proof
Fix and . The scaling assumptions imply that
| 5.3 |
Consequently,
We claim that whenever . Indeed, it follows from (4.7) that whenever |j| is sufficiently large. By homogeneity (5.3), the same holds for every . It also follows from (5.3) that .
We can thus define . In view of the previous comments, we have and for all . The homogeneity (5.3) implies that for and .
It remains to show the triangle inequality for . Without loss of generality we assume that . For , the convexity of (see Lemma 4.8) and the homogeneity (5.3) yield
Substitution of yields the triangle inequality.
Compactness results
As we frequently need to compare measures with unequal mass in this paper, it is natural to work with the the Kantorovich–Rubinstein norm. This metric is closely related to the transport distance ; see “Appendix 1”.
The following compactness result holds for solutions to the continuity equation with bounded action. As usual, we use the notation .
Theorem 5.4
(Compactness under linear growth) Let be such that
Then there exists a curve such that, up to extracting a subsequence,
-
(i)
weakly in ;
-
(ii)
weakly in for almost every ;
-
(iii)
is constant.
The proof of this result is given in Sect. 6.
Under a superlinear growth condition on the cost function F, the following stronger compactness result holds.
Assumption 5.5
(Superlinear growth) We say that F is of superlinear growth if there exists a function with and a constant such that
| 5.4 |
for all and all , where
| 5.5 |
with as in Assumption 2.3.
Remark 5.6
The superlinear growth condition (5.4) implies the linear growth condition (2.1). To see this, suppose that F has superlinear growth. Let be such that for . If , we have
| 5.6 |
On the other hand, if , the nonnegativity of implies that
| 5.7 |
Combining (5.6) and (5.7), we have
which is of the desired form (2.1).
Example 5.7
The edge-based costs
have superlinear growth if and only if (with and ). Indeed,
Example 5.8
The functions (2.3) arising in discretisation of p-Wasserstein distances have superlinear growth if and only if (with ).
To see this, consider the function . Since G is convex, non increasing in , and positively one-homogeneous, we obtain
where depends on R, the maximum degree and the weights .
Theorem 5.9
(Compactness under superlinear growth) Suppose that Assumption 5.5 holds. Let be such that
Then there exists a curve such that, up to extracting a subsequence,
-
(i)
weakly in ;
-
(ii)
uniformly for ;
-
(iii)
is constant.
This is proven in Sect. 6.2.
Note that curve can be continuously extended to . Therefore, it is meaningful to assign boundary values to these curves.
Result with boundary conditions
Under Assumption 5.5, we are able to obtain the following result on the convergence of dynamical optimal transport problems. Fix an open interval. Define for with the minimal action as
| 5.8 |
Similarly, define the minimal homogenised action for with as
| 5.9 |
Note that in general, both and may be infinite even if the two measures have equal mass. Here, the values and are well-defined under Assumption 5.5 by Theorem 5.9. Under linear growth, and can still be defined using the trace theorem in , but we cannot prove the following statement in that case (see also Remark 6.2). We prove this in Sect. 6.3.
Theorem 5.10
(-convergence of the minimal actions) Assume that Assumption 5.5 holds. Then the minimal actions -converge to in the weak topology of . Precisely:
-
(i)For any sequences , such that weakly in as for , we have
5.10 -
(ii)For any , there exist two sequences such that weakly in as for and
5.11
Proof of compactness and convergence of minimal actions
This section is divided into three sub-parts: in the first one, we prove the general compactness result Theorem 5.4, which is valid under the linear growth assumption 2.3.
In the second and third part, we assume the stronger superlinear growth condition 5.5 and prove the improved compactness result Theorem 5.9 and the convergence results for the problems with boundary data, i.e. Theorem 5.10.
Compactness under linear growth
The only assumption here is the linear growth condition 2.3.
Proof of Theorem 5.4
For , let be a curve such that
| 6.1 |
We can find a solution to the discrete continuity equation , such that
Set , where is defined in (4.10). Lemma 4.9 implies that for every .
Using Lemma 4.10, the growth condition (2.1), and the bounds (6.1) on the masses and the action, we infer that
| 6.2 |
Up to extraction of a subsequence, the Banach–Alaoglu Theorem yields existence of a measure such that weakly in . It also follows that ; see, e.g., [8, Theorem 8.4.7].
Furthermore, (6.1) and (6.2) imply that the -seminorms of are bounded:
| 6.3 |
In particular, . Thus, by another application of the Banach–Alaoglu Theorem, there exists a measure and a subsequence (not relabeled) such that weakly in .
We claim that does not charge the boundary and that for a curve of constant total mass in time. To prove the claim, write , and note that each curve is of constant mass. Therefore, the time-marginals are constant multiples of the Lebesgue measure. Since these measures are weakly-convergent to the time-marginal , it follows that the latter is also a constant multiple of the Lebesgue measure, which implies the claim. See also the proof of Lemma 3.13 for a similar discussion.
By what we just proved, can be identified with a measure on the open set . Let be the restriction of to . Since (resp. ) converges vaguely to (resp. ), it follows that belongs to .
In view of (6.3), we can apply the -compactness theorem (see, e.g., [34, Theorem B.5.10]) to obtain a further subsequence such that for almost every , and the limiting curve belongs to . Proposition A.5 yields weakly in for almost every .
Uniform compactness under superlinear growth
In the last two sections, we shall work with the stronger growth condition from Assumption 5.5.
Remark 6.1
(Property of , superlinear case) Let us first observe that under Assumption 5.5, one has superlinear growth of :
where we recall is such that .
In addition for all we have
| 6.4 |
In particular, if , then . Indeed, fix as in (3.5) and suppose that for some . By positivity of the measures, this implies that , thus by construction
From the first condition and , we deduce that for -a.e. . From the assumption of finite energy and (6.4), writing , we infer that for -a.e. as well. It follows that , which proves the claim.
We are ready to prove Theorem 5.9.
Proof of Theorem 5.4
(Proof of Theorem 5.9) Let be a sequence of measures such that
| 6.5 |
Thanks to Remark 6.1, we have that for all solutions with . Applying Lemma 3.13 we can write and because , we also have disintegration with for almost every .
Moreover, it follows from the definition of that, for any test function we have
This shows that , with weak derivative
We are left with showing uniform convergence of in . We claim that the curves are equicontinuous with respect to the Kantorovich–Rubinstein norm .
To show the claimed equicontinuity, take and with . Since we obtain using Lemma 4.10,
| 6.6 |
To estimate the latter integral, we consider for the quantities
We fix a “velocity threshold” , and split into the low velocity region and its complement . Then:
| 6.7 |
where . For we use the growth condition (5.4) to estimate
Since (5.4) implies non-negativity of the term in brackets, we obtain
| 6.8 |
Integrating in time, we combine (6.7) and (6.8) with (6.5) to obtain
| 6.9 |
Combining (6.6) and (6.9) we conclude that
To prove the claimed equicontinuity, it suffices to show that as . But this follows from the growth properties of by picking, e.g., .
Of course the masses are uniformly bounded in and t. The Arzela-Ascoli theorem implies that every subsequence has a subsequence converging uniformly in .
The boundary value problems under superlinear growth
The last part of this section is devoted to the proof of the convergence of the minimal actions, under the assumption of superlinear growth, i.e. Theorem 5.10. The proof is a straightforward consequence of the stronger compactness result Theorem 5.9 (and the general convergence result Theorem 5.1) proved in the previous section, which ensures the stability of the boundary conditions as well. We fix .
Proof of Theorem 5.10
We shall prove the upper and the lower bound.
Liminf inequality. Pick any , weakly in , and let with the same boundary data such that
By Theorem 5.9, there exists a (non-relabeled) subsequence of such that , uniformly for . In particular, , . We can then apply the lower bound of Theorem 5.1, and conclude
Limsup inequality. Fix such that . By the definition of and the lower semicontinuity of (Lemma 3.14), there exists with and .
We can then apply Theorem 5.1 and find a recovery sequence such that weakly and
By the improved compactness result Theorem 5.9, in for every , in particular for . This allows us to conclude
for , which is the sought recovery sequence for .
Remark 6.2
It is instructive to see that under the simple linear growth condition 2.3, the above written proof cannot be carried out. Indeed, by the lack of compactness in (but rather only in by Theorem 5.4), we are not able to ensure stability at the level of the initial data, i.e. in general, (and similarly for ).
Proof of the lower bound
In this section we present the proof of the lower bound in our main result, Theorem 5.1. The proof relies on two key ingredients. The first one is a partial regularisation result for discrete measures of bounded action, which is stated in Proposition 7.1 and proved in Sect. 7.1 below. The second ingredient is a lower bound of the energy under partial regularity conditions on the involved measures (Proposition 7.4). The proof of the lower bound in Theorem 5.1, which combines both ingredients, is given right before Sect. 7.1.
First we state the regularisation result. Recall the Kantorovich–Rubinstein norm (see “Appendix 1”).
Proposition 7.1
(Discrete Regularisation) Fix and let be a solution to the discrete continuity equation satisfying
Then, for any there exists an interval with and a solution such that:
-
(i)the following approximation properties hold:
7.1a 7.1b -
(ii)the following regularity properties hold, uniformly for any and any :
7.2a 7.2b 7.2c
The constants and the compact set depend on , M and A, but not on .7.2d
Remark 7.2
The -bounds in (7.2a) are explicitly stated for the sake of clarity, although they are implied by the compactness of the set K in (7.2d).
Since , inequality (7.2b) in effect bounds .
In the next result, we start by showing how to construct -periodic solutions to the static continuity equation by superposition of unit fluxes. Additionally, we can build these solutions with vanishing effective flux and ensure good -bounds.
Lemma 7.3
[Periodic solutions to the divergence equation] Let be a -periodic function with . There exists a -periodic discrete vector field satisfying
Proof
For any , fix a simple path in connecting (0, v) and (0, w). Let be the associated periodic unit flux defined in (4.3). Since , we can pick a coupling between the negative part and the positive part of g. More precisely, we may pick a function with such that
We then define
It is straightforward to verify using Lemma 4.5 that J has the three desired properties.
The following result states the desired relation between the functionals and under suitable regularity conditions for the measures involved. These regularity conditions are consistent with the regularity properties obtained in Proposition 7.1.
Proposition 7.4
(Energy lower bound for regular measures) Let and let be a compact set. There exists a threshold and a constant such that the following implication holds for any : if and satisfy the regularity properties (7.2a)–(7.2d), then we have the energy bound
Proof
Recall from (4.11) that and , where, for and ,
where is a convex combination of , i.e.,
where , , and whenever .
Step 1. Construction of a representative. Fix and . Our first goal is to construct a representative
For this purpose we define candidates and as follows. We take the values of m and in the -cube at , and insert these values at every cube in , so that the result is -periodic. In formulae:
see Fig. 5.
Fig. 5.
On the left, using different colors for different values, the measures m and . On the right, the corresponding and , for (color figure online)
We emphasise that the right-hand side does not depend on z, hence and are -periodic. Our construction also ensures that
hence . However, the vector field does (in general) not belong to : indeed, while has the desired effective flux (i.e., ), is not (in general) divergence-free.
To remedy this issue, we introduce a corrector field , i.e., an anti-symmetric and -periodic function satisfying
| 7.3 |
The existence of such a vector field is guaranteed by Lemma 7.3. It immediately follows that satisfies and , thus
Step 2. Density comparison. We will now use the regularity assumptions (7.2a)-(7.2d) to show that the representative is not too different from the shifted density . Indeed, for with we obtain using (7.2c),
| 7.4 |
Let us now turn to the momentum field. For with , we have, using (7.2c),
Moreover, using (7.3), (7.2c), and (7.2b), we obtain
for some not depending on . Combining these bounds we obtain
| 7.5 |
Step 3. Energy comparison. Since by assumption, it follows from (7.4) and (7.5) that for sufficiently small. Here K is a compact set, possibly slightly larger than K, contained in .
Since F is convex, it is Lipschitz continuous on compact subsets in the interior of its domain. In particular, it is Lipschitz continuous on . Therefore, there exists a constant depending on and such that
with depending on , , , and , but not on . Here, the subscript in and indicates that only elements with are considered.
Integration over followed by summation over yields
which is the desired result.
We are now ready to give the proof of the lower bound in our main result, Theorem 5.1. We use the notation to denote the inequality for some constant that only depends on the geometry of the graph , on the function F (see Assumption 2.3), and on the length of the time interval .
Proof of Theorem 5.1 (lower bound)
Let and let be such that the induced measures defined by satisfy vaguely in as . Observe that
Without loss of generality, we may assume that
Step 1 (Regularisation): Fix . Let be an approximately optimal discrete vector field, i.e.,
| 7.6 |
Using Proposition 7.1 we take an interval , and an approximating pair satisfying
| 7.7 |
together with the regularity properties (7.2) for some constants and a compact set depending on , but not on . By virtue of these regularity properties, we may apply Proposition 7.4 to . This yields
| 7.8 |
with depending on , but not on .
Step 2 (Limit passage ): It follows by definition of the Kantorovich–Rubinstein norm that
It follows from the growth condition (2.1) and (7.7) that
| 7.9 |
Therefore, there exist measures and and convergent subsequences satisfying
| 7.10 |
The vague lower semicontinuity of the limiting functional (see Lemma 3.14), combined with (7.6), (7.7), and (7.8) thus yields
| 7.11 |
Step 3 (Limit passage ): Let , . For brevity, write . Since from (7.10) and weakly, and we obtain
It follows that , which together with implies vaguely as .
Furthermore, (7.9) implies that . Therefore, we may extract a subsequence so that vaguely in as . It thus follows from (7.11) and the joint vague-lower semicontinuity of (see Lemma 3.14) that
To conclude the desired estimate , it remains to show that solves the continuity equation. To show this, we first note that in view of Lemma 4.9. It then follows from the weak convergence in (7.10) that . Since , vaguely, and it holds , which completes the proof.
Proof of the discrete regularisation result
This section is devoted to the proof of main discrete regularisation result, Proposition 7.1.
The regularised approximations are constructed by a three-fold regularisation: in time, space, and energy. Let us now describe the relevant operators. Recall the definition of and as given in Assumption 2.3.
Energy regularisation
First we embed and into the graph . We thus define and by
It follows that (by continuity of , ) and
We then consider the energy regularisation operators defined by
Lemma 7.5
(Energy regularisation) Let . The following inequalities hold for any , , and :
Proof
The proof is straightforward consequence of the convexity of F and the periodicity of and .
Space regularisation
Our space regularisation is a convolution in the z-variable with the discretised heat kernel. It is of crucial importance that the regularisation is performed in the -variable only. Smoothness in the -variable is not expected.
For and , let be the heat kernel on . We consider the discrete version
where the integration ranges over the cube . Using the boundedness and Lipschitz properties of , we infer that for ,
| 7.12 |
| 7.13 |
for some non-negative constant depending only on . We then define
where is defined in (2.5).
Lemma 7.6
(Regularisation in space) Let . There exist constants and such that the following estimates hold, for any , , , and :
-
(i)
Energy bound:
-
(ii)Gain of integrability:
-
(iii)
Density lower bound:
-
(iv)Spatial regularisation:
Proof
Using the convexity of F and the identity we obtain
where in the last equality we used (7.13). This shows (i). Properties (ii), (iii), and (iv) are straightforward consequence of the uniform bounds (7.12), (7.13) for the discrete kernels .
Time regularisation
Fix an interval and a regularisation parameter . For , we define for
Note that, thanks to the linearity of the continuity equation we get .
We have the following regularisation properties for the operator .
Lemma 7.7
(Regularisation in time) Let . The following estimates hold for all and all Borel curves and :
-
(i)Energy estimate: for some depending only on (2.1) we have
-
(ii)
Mass estimate: .
-
(iii)
Momentum estimate:
-
(iv)
Time regularity: .
Proof
Set for . Then we have
| 7.14 |
as a consequence of Jensen’s inequality and Fubini’s theorem. Using that , , and the growth condition (2.1) we infer
which together with (7.14) shows (i).
Properties (ii), (iii) follow directly from the convexity of the -norms and the subadditivity of the integral.
Finally, (iv) follows from the direct computation .
Effects of the three regularisations
We start with a lemma that shows that the effect of the three regularising operators is small if the parameters are small.
Recall the definition of the Kantorovich-Rubinstein norm as given in “Appendix 1”.
Lemma 7.8
(Bounds in -norm) Let an interval and be a Borel measurable curve of constant total mass (i.e., is constant), and let be the associated measure on space-time defined by . Then there exists a constant depending on such that:
-
(i)
for any .
-
(ii)
for any .
-
(iii)
for any .
Proof
(i): For any and any Lipschitz function (and, in fact, for any temporally Lipschitz function) we have
Since we obtain the result.
(ii): In view of mass-preservation, we have
Here in the last inequality we used scaling law of the heat kernel.
(iii): Let us write for brevity. By linearity, we have
Proof of Proposition 7.1
We define
We will show that the desired inequalities hold if are chosen to be sufficiently small, depending on the desired accuracy . Set .
(i): We use the shorthand notation . Using Lemma 7.8 we obtain
| 7.15 |
Furthermore, using Lemma 7.5, Lemma 7.6(i), and Lemma 7.7(i) we obtain the action bound
| 7.16 |
The desired inequalities (7.1) follow by choosing , , and sufficiently small.
(ii): We will show that all the estimates hold with constants depending on through the parameters , , and .
Boundedness: We apply Lemma 7.5, Lemma 7.6(ii), and Lemma 7.7(ii) &(iii) and obtain the uniform bounds on the mass
| 7.17 |
as well as the uniform bounds on the momentum
| 7.18 |
Time-regularity: From Lemma 7.7(iv), together with Lemma 7.5 and Lemma 7.6(ii), we obtain the uniform bound on the time derivative
| 7.19 |
Space-regularity: For and , Lemma 7.6(iv) and Lemma 7.7(ii) yield
which shows that
| 7.20 |
Similarly, using the growth condition (2.1) we deduce
| 7.21 |
Domain-regularity: For all , reasoning as in (7.17) and (7.18), we observe that
We infer that
Since
the claim follows by an application of Lemma C.1 to the product of balls in and , taking into account that F is defined on a finite-dimensional subspace by the locality assumption.
Proof of the upper bound
In this section we present the proof of the -limsup inequality in Theorem 5.1. The first step is to introduce the notion of optimal microstructures.
The optimal discrete microstructures
Let be an open interval in . We will make use of the following canonical discretisation of measures and vector fields on the cartesian grid .
Definition 8.1
(-discretisation of measures) Let and have continuous densities and j, respectively, with respect to the Lebesgue measure. Their -discretisations and are defined by
An important feature of this discretisation is the preservation of the continuity equation, in the following sense.
Definition 8.2
(Continuity equation on ) Fix an open interval. We say that and satisfy the continuity equation on , and write , if is continuous, is Borel measurable, and the following discrete continuity equation is satisfied in the sense of distributions:
| 8.1 |
Lemma 8.3
(Discrete continuity equation on ) Let have continuous densities with respect to the space-time Lebesgue measure on . Then .
Proof
This follows readily from the Gauss divergence theorem.
The key idea of the proof of the upper bound in Theorem 5.1 is to start from a (smooth) solution to the continuous equation , and to consider the optimal discrete microstructure of the mass and the flux in each cube . The global candidate is then obtained by gluing together the optimal microstructures cube by cube.
We start defining the gluing operator. Recall the operator defined in (2.4).
Definition 8.4
(Gluing operator) Fix . For each , let
be -periodic. The gluings of and are the functions and defined by
| 8.2 |
Remark 8.5
(Well-posedness) Note that and are well-defined thanks to the -periodicity of the functions and .
Remark 8.6
(Mass preservation and KR-bounds) The gluing operation is locally mass-preserving in the following sense. Let and consider a family of measures satisfying for some . Then:
for every . Consequently,
| 8.3 |
for all weakly continuous curves and all such that for all and .
Energy estimates for Lipschitz microstructures
The next lemma shows that the energy of glued measures can be controlled under suitable regularity assumptions.
Lemma 8.7
(Energy estimates under regularity) Fix . For each , let and be -periodic functions satisfying:
-
(i)(Lipschitz dependence): For all
-
(ii)(Domain regularity): There exists a compact and convex set such that, for all ,
8.4
Then there exists depending only on K, F such that for
| 8.5 |
where depends only on L, the (finite) Lipschitz constant , and the locality radius .
Proof
Fix . As m is -periodic, (i) yields for ,
| 8.6 |
Similarly, using the -periodicity of J, (i) yields for with and ,
| 8.7 |
Hence the domain regularity assumption (ii) imply a domain regularity property for the glued measures, namely
for all and , where is a slightly bigger compact set than K.
Consequently, we can use the Lipschitzianity of F on the compact set and its locality to estimate the energy as
where and .
Combining the estimate above with (8.6) and (8.7), we conclude that
for . Summation over yields the desired estimate (8.5).
We now introduce the notion of optimal microstructure associated with a pair of measures . First, let us define regular measures.
Definition 8.8
(Regular measures) We say that is a regular pair of measures if the following properties hold:
-
(i)
(Lipschitz regularity): With respect to the Lebesgue measure on , the measures and have Lipschitz continuous densities and j respectively.
-
(ii)(Compact inclusion): There exists a compact set such that
We say that is a regular curve of measures if are regular measures for every and is measurable for every .
Definition 8.9
(Optimal microstructure) Let be a regular pair of measures.
-
(i)We say that is an admissible microstructure for if
for every . -
(ii)
If, additionally, for every , we say that is an optimal microstructure for .
Remark 8.10
(Measurable dependence) If is a measurable curve in , it is possible to select a collection of admissible (resp. optimal) microstructures that depend measurably on t. This follows from Lemma 4.7; see e.g. [38, Theorem 14.37]. In the sequel, we will always work with measurable selections.
The next proposition shows that each optimal microstructures associated with a regular pair of measures has discrete energy which can be controlled by the homogenised continuous energy .
Proposition 8.11
(Energy bound for optimal microstructures) Let be an optimal microstructure for a regular pair of measures . Then:
where depends only on and the modulus of continuity of the densities and j of and .
Proof
Let us denote the densities of and by and j respectively. Using the regularity of and , and the fact that is Lipschitz on , we obtain
which is the desired estimate.
Remark 8.12
(Lack of regularity) Suppose that and are constructed by gluing the optimal microstructure from the previous lemma. It is then tempting to seek for an estimate of the form
However, (m, J) does not have the required a priori regularity estimates to obtain such a bound. Moreover, the gluing procedure does not necessarily produce solutions to the discrete continuity equation if we start with solutions to the continuous continuity equation.
We conclude the subsection with the following and estimates.
Lemma 8.13
( and estimates) Let be a regular curve of measures satisfying
| 8.8 |
Let be corresponding optimal microstructures. Then:
-
(i)satisfies the uniform estimate
8.9 -
(ii)satisfies the uniform estimate
8.10 8.11
Proof
The first claim follows since by construction.
To prove (ii), note that
which yields (8.10).
To prove (8.11), we use the growth condition on F, the periodicity of , and (i) to obtain for and :
where in the last inequality we applied Proposition 8.11. Integrating in time and taking the supremum in space and , we obtain (8.11).
Approximation result
The goal of this subsection is to show that despite the issues outlined in Remark 8.12, we can find a solution to with almost optimal energy that is -close to a glued optimal microstructure.
In the following result, denotes the -extension of the open interval for .
Proposition 8.14
(Approximation of optimal microstructures) Let be a regular curve of measures sastisfying
Let be a measurable family of optimal microstructures associated to and consider their gluing . Then, for every , there exists such that the following holds for all : there exists a solution satisfying the bounds
| 8.12a |
| 8.12b |
where depends on M, A, , and , but not on .
Remark 8.15
It is also true that
but this information is not “useful”, as we do not expect to be able to control in terms of ; see also Remark 8.12.
The proof consists of four steps: the first one is to consider optimal microstructures associated with on every scale and glue them together to obtain a discrete curve (we omit the -dependence for simplicity). The second step is the space-time regularisation of such measures in the same spirit as done in the proof of Proposition 7.1. Subsequently, we aim at finding suitable correctors in order to obtain a solution to the continuity equation and thus a discrete competitor (in the definition of ). Finally, the energy estimates conclude the proof of Proposition 8.14.
Let us first discuss the third step, i.e. how to find small correctors for in order to obtain discrete solutions to which are close to the first ones. Suppose for a moment that are "regular", as in the outcome of Proposition 7.1. Then the idea is to consider how far they are from solving the continuity equation, i.e. to study the error in the continuity equation
and find suitable (small) correctors to in such a way that .
This is based on the next result, which is obtained on the same spirit of Lemma 7.3 in a non-periodic setting. In this case, we are able to ensure good -bounds and support properties.
Lemma 8.16
(Bounds for the divergence equation) Let with . There exists a vector field such that
| 8.13 |
Moreover, with C depending only on .
Proof
Let be the positive part of g, and let be the negative part. By assumption, these functions have the same -norm . Let be an arbitrary coupling between the discrete probability measures and .
For any : take an arbitrary path connecting these two points. Let be the unit flux field constructed in Definition 4.4. Then the vector field has the desired properties.
Remark 8.17
(Measurability) It is clear from the previous proof that one can choose the vector field in such a way that the function is a measurable map.
The plan is to apply Lemma 8.16 to a suitable localisation of , in each cube , for every . Precisely, the goal is to find for every such that
| 8.14 |
which is small on the right scale, meaning
| 8.15 |
Remark 8.18
Note that for all , since has constant mass in time and is skew-symmetric. However, an application of Lemma 8.16 without localisation would not ensure a uniform bound on the corrector field, as we are not able to control the -norm of a priori.
Remark 8.19
A seemingly natural attempt would be to define . However, this choice is not of zero-mass, due to the flow of mass across the boundary of the cubes.
Recall that we use the notation to denote solutions to the continuity equation on in the sense of Definition 8.2. We shall later apply Lemma 8.22 to the pair , thanks to Lemma 8.3.
The notion of shortest path in the next definition refers to the -distance on .
Definition 8.20
For all , we choose simultaneously a shortest path of nearest neighbors in connecting to such that for all . Then define for and the signs as
Note that since the paths are simple, each pair of nearest neighbours appears at most once in any order, so that is well-defined.
It follows readily from Definition 8.20 that
| 8.16 |
for all and .
Remark 8.21
A canonical choice of the paths is to interpolate first between and one step at a time, then between and , and so on. The precise choice of path is irrelevant to our analysis as long as paths are short and satisfy . Since the paths are invariant under translations, so are the signs, i.e.
| 8.17 |
for all , which is used in the prof of Lemma 8.22 below.
Lemma 8.22 shows that if we start from a solution to the continuity equation and consider an admissible microstructure associated to , then it is possible to localise the error in the continuity equation arising from the gluing as in (8.14).
Lemma 8.22
(Localisation of the error to ) Let and suppose that and satisfy
for every and . Consider their gluings and and define, for and ,
| 8.18 |
| 8.19 |
where is the -periodic map satisfying for all . Then the following statements hold for every :
-
(i)is a localisation of the error of from solving , i.e.,
-
(ii)Each localised error has zero mass, i.e.,
Proof
(i): For , consider the path constructed in Definition 8.20. For all we have
Summation over all neighbours of yields, for all ,
where we used the -periodicity of and the vanishing divergence of .
(ii): Fix and . Using the periodicity of , the identity (8.17), the group structure of , the relation between and J, the fact that , and the identity (8.16), we obtain
By definition of we obtain
where we used that and eventually that .
Now we are ready to prove Proposition 8.14.
Proof of Proposition 8.14
The proof consists of four steps. For simplicity: .
Step 1: Regularisation. Recall the operators , , and as defined in Sect. 7.1. We define
where , will be chosen sufficiently small, depending on the desired accuracy . Due to special linear structure of the gluing operator , it is clear that
for some . More precisely, they correspond to the regularised version of the measures with respect to the graph structure of . In particular, an application4 of Lemma 8.13, Lemma 7.6, and Lemma 7.7 yields
| 8.20 |
for any , as well as the domain regularity
| 8.21 |
for a constant C and a compact set K depending only on M, A, , , and . We can then apply Lemma 8.7 and deduce that for every , (depending on K and F),
| 8.22 |
for a depending on the same set of parameters (via C and ) and .
Step 2: Construction of a solution to . From now on, the constant C appearing in the estimates might change line by line, but it always depends on the same set of parameters as the constant C in Step 1, and possibly on the size of the time interval .
The next step is to find a quantitative small corrector in such a way that . To do so, we observe that by construction we have for every
where (by the linearity of equation (8.1)). Consider the corresponding error functions, for , , given by (8.18) and (8.19),
where is the -periodic map satisfying , for any . Thanks to Lemma 8.22, we know that
Moreover, from the regularity estimates (8.20) and the local finiteness of the graph , we infer for every
| 8.23 |
where only depends on . Hence, as an application of Lemma 8.16, we deduce the existence of corrector vector fields such that
| 8.24 |
for every , . The existence of a measurable (in and ) map follows from the measurability of and Remark 8.17.
We then define and as
and obtain a solution to the discrete continuity equation .
Step 3: Energy estimates. The locality property (8.24) of and local finiteness of the graph allow us to deduce the same uniform estimates on the global corrector as well. Indeed for every , we have
and hence from the estimate (8.24) we also deduce .
Since (8.21) implies that , it then follows that for sufficiently small, where depends on K and C. Here is a compact set, possibly slightly larger than K, contained in .
Therefore, we can estimate the energy
and hence . Together with (8.22), this yields
Finally, to control the action of the regularised microstructures , we take advantage (as in (7.16)) of Lemma 7.5, Lemma 7.6 (i), and Lemma 7.7 (i) to obtain5
![]() |
for a , where at last we used Proposition 8.11 and that is Lipschitz on .
For every given , the action bound (8.12b) then follows choosing small enough.
Step 4: Measures comparison. We have seen in (7.15) that Lemma 7.8 implies
where we also used that mass preservation of the gluing operator, see Remark 8.6. For every , the distance bound (8.12a) can be then obtained choosing , , sufficiently small.
Proof of the upper bound
This subsection is devoted to the proof of the limsup inequality in Theorem 5.1. First we formulate the existence of a recovery sequence in the smooth case.
Proposition 8.23
(Existence of a recovery sequence, smooth case) Fix , , , and set . Let be a solution to the continuity equation with smooth densities and such that
| 8.25 |
Then there exists a sequence of curves such that weakly in as and
| 8.26 |
for some .
Proof
We write . Let be smooth curves of measures satisfying the assumptions (8.25). Let be the gluing of a measurable family of optimal microstructure associated with , for every . For every , Proposition 8.14 yields the existence of , a constant , and depending on such that
for every .
Using Remark 8.6, in particular (8.3), and that , we infer
for every . Therefore, we can find a diagonal sequence as such that, if we set , we obtain
where at last we used the growth condition (3.2).
In order to apply Proposition 8.23 for the existence of the recovery sequence in Theorem 5.1 we prove that the set of solutions to the continuity equation (3.4) with smooth densities are dense-in-energy for .
Definition 8.24
(Affine change of variable in time) Fix . For any , we consider the unique bijective increasing affine map . For every interval and every vector-valued measure , , we define the changed-variable measure
| 8.27 |
Remark 8.25
(Properties of ) The scaling factor of is chosen so that if , then and we have for the equality of densities
| 8.28 |
Moreover, if then .
We are ready to state and prove the last result of this section.
Proposition 8.26
(Smooth approximation of finite action solutions to ) Fix and fix with . Then there exists a sequence such that as and measures for so that as
| 8.29 |
| 8.30 |
and such that the following action bound holds true:
| 8.31 |
Moreover, for any given we have the inclusion
| 8.32 |
Proof
Without loss of generality we can assume , if not we simply consider for as in Lemma 3.14. For simplicity, we also assume , the extension to a generic interval is straightforward.
Fix with .
Step 1: regularisation. The first step is to regularise in time and space. To do so, we consider two sequences of smooth mollifiers , for of integral 1, where , with as to be suitably chosen. We then set as .
We define space-time regular solutions to the continuity equation as
where . Note that the mollified measures are defined only We choose , so that .
Finally, for as given in (4.8), we define
| 8.33 |
for some suitable choice of .
Step 2: Properties of the regularised measures. First of all, we observe that with smooth densities for every , so that (8.30) is satisfied. Secondly, the convergence (8.29) easily follows by the properties of the mollifiers and the fact that uniformly in (0, T) as .
Moreover, we note that for , using that is constant on (0, T) one gets
| 8.34 |
and thanks to (8.28) an analogous uniform estimate holds true for too. We can then apply Lemma C.1 and find convex compact sets such that , so that (8.32) follows.
Additionally, pick such that . From (8.28), if one sets , we see that
| 8.35 |
for and , where the functions are given by
We choose such that and from (8.34) we get that
| 8.36 |
For example we can pick and , both going to zero when .
Step 3: action estimation. As the next step we show that
| 8.37 |
One can prove (8.37) using e.g. the fact [10] that for every interval the action is the relaxation of the functional
for which (8.37) follows from the convexity and nonnegativity of and the properties of the mollifiers .
We shall then estimate the action of . From (8.35) and (8.36), using the convexity of and the definition of the map , we obtain
where in the last inequality we used (8.37). Taking the limsup in
| 8.38 |
which concludes the proof of (8.31).
Now we are ready to prove the limsup inequality (5.2) in Theorem 5.1.
Proof of Theorem 5.1 (upper bound)
Fix . By definition of , it suffices to prove that for every such that and , we can find such that weakly in and .
For any such , we apply Proposition 8.26 and find a smooth sequence where , where and such that (8.31) and (8.32) hold with weakly in as . In particular
| 8.39 |
Hence we can apply Proposition 8.23 and find such that weakly in and for each ,
| 8.40 |
We conclude by extracting a subsequence such that weakly in as and from (8.39), (8.40), (8.31) we have
which concludes the proof.
Analysis of the cell problem
In the final section of this work, we discuss some properties of the limit functional and analyse examples where explicit computations can be performed. For and , recall that
where denotes the set of representatives of , i.e., all -periodic functions and satisfying
Invariance under rescaling
We start with an invariance property of the cell-problem. Fix a -periodic graph as defined in Assumption 2.1. For fixed with , we consider the rescaled -periodic graph obtained by zooming out by a factor , so that each unit cube contains copies of . Slightly abusing notation, we will identify the corresponding set with the points in .
Let be the analogue of F on , and let be the corresponding limit density. In view of our convergence result, the cell-formula must be invariant under rescaling, namely . We will verify this identity using a direct argument that crucially uses the convexity of F.
One inequality follows from the natural inclusion of representatives
| 9.1 |
which is obtained as and for every . Here we note that the inverse of is well-defined on -periodic maps. In particular we have
which implies that .
The opposite inequality is where the convexity of F comes into play. Pick . A first attempt to define a couple in would be to consider the inverse map of what we did in (9.1), but the resulting maps would not be -periodic (but only -periodic). What we can do is to consider a convex combination of such values. Precisely, we define
for all . The linearity of the constraints implies that . Moreover, using the convexity of F we obtain
which in particular proves that .
The simplest case: and nearest-neighbor interaction.
The easiest example we can consider is the one where the set V consists of only one element . In other words, we focus on the case when and thus . We then consider the graph structure defined via the nearest-neighbor interaction, meaning that consists of the elements of such that .
In this setting, consists of only one element and has cardinality 2d. In particular, for every and , the set consists of only one element given by
Consequently, the homogenised energy density is given by .
In the special case where F is edge-based (see Remark 2.5) with edge-energies for , we have
The even more special case of the discretised p-Wasserstein distance corresponds to , where the mean is a mean as in (2.3). We then obtain
for and , which corresponds to the p-Wasserstein distance induced by the -distance on the underlying space . The case corresponds to the framework studied in [24].
As we will discuss in Sect. 9.4, this result can also be cast in the more general framework of isotropic finite-volume partitions of .
Embedded graphs
In this section, we shall use an equivalent geometric definition of the effective flux. We can indeed formulate an interesting expression for in the case where is an embedded -periodic graph in , in the sense of Remark 2.2. We thus choose to be a subset of and use the identification , so that can be identified with a -periodic subset of .
Let us define
Note that we simply replaced by in the definition of . Remarkably, the following result shows that for any periodic and divergence-free vector field J. In particular, does not depend on the choice of the embedding into . As a consequence, one can equivalently define , and hence the homogenised energy density , in terms of instead of .
Proposition 9.1
For every periodic and divergence-free vector field we have .
Proof
Note first that any given point configuration can be transformed into any other configuration by successively shifting each of the points. Therefore it suffices to show that is invariant when perturbing the location of any single point.
Fix . For a positive (small enough) parameter and a vector , consider the modified embedded -periodic graph in obtained from by shifting the nodes by , i.e., we consider the shifted node instead of (and with it, the associated edges). Fix a divergence-free and -periodic discrete vector field and consider, for , the corresponding effective flux
We claim that . Indeed, by construction we have
Since J is divergence-free, this proves the claim. In particular, is constant, hence the value of does not depend on the location of the embedded points. This also implies the sought equality , since corresponds to the limiting case where all the elements of V “collapse” into a single point of .
Periodic finite-volume partitions
The next class of examples are the graph structures associated with -periodic finite-volume partitions (FVPs) of . We refer to [14] for a general treatment.
Definition 9.2
(-periodic finite-volume partition) Consider a countable, locally finite, -periodic family of points together with a family of nonempty open bounded convex polytopes for , such that for all and . We call
a -periodic finite-volume partition of if
;
whenever ;
whenever .
We define a graph structure on by declaring those pairs with to be nearest neighbors.
It is not difficult to see that the graph is connected, -periodic, and locally finite, even if . Throughout this section we use the following notation for :
In the finite-volume framework, we are interested in transport distances with a nonlinear mobility. These distances were introduced in [13] as natural generalisations of the 2-Wasserstein metric. We thus fix a concave upper-semicontinuous function and consider the energy density functional
| 9.2 |
To discretise this energy density, we fix for every edge an admissible version of , i.e., a nonnegative concave upper-semicontinuous function satisfying for all and . We always assume that for all . It is easy to check that F satisfies the superlinear growth condition 5.4. Furthermore, concavity of implies convexity of F.6 We then consider the edge-based cost defined by
| 9.3 |
Consistent with (9.2), we use the convention that
| 9.4 |
It is now natural to ask whether the discrete action functionals associated to F converge to the continuous action funtional associated to f: is it true that ?
In the linear case where , which corresponds to the 2-Wasserstein metric, this question has been extensively studied in [26] for a large class of (not necessarily periodic) meshes. The main result in [26] asserts that the limit of the discrete transport distances (in the Gromov-Hausdorff sense) as coincides with the 2-Wasserstein distance on if an asymptotic local isotropy condition is satisfied. Moreover, it is shown that this convergence fails to hold if the isotropy condition fails to hold (in a sufficiently strong sense).
For periodic finite-volume partitions we show here that these results are direct consequences of Theorem 5.1. In particular, the following result contains a necessary and condition on a periodic finite-volume partition that ensures that .
Proposition 9.3
Consider a -periodic finite-volume partition of , and let F and f be as in (9.2) and (9.3) respectively. The following assertions hold:
-
(i)
for all and .
-
(ii)Suppose that for every and there is a family of vectors such that
9.5
Then: .9.6 -
(iii)Suppose that all are differentiable in a neigbourhood of the diagonal in . Then if and only if
9.7
Remark 9.4
The condition (iii) is satisfied for a class of meshes satisfying a weighted isotropy condition. For given edge weights , this condition reads as
We refer to [26, Definition 1.4] for this notion on domains in and to [25, Definition 4.3] for the one-dimensional periodic setting. In this case, given a mobility function , the functions can be chosen to be of the form where is a mean that is compatible with in the sense that ; see [26, Definition 1.4]. In this situation the identity holds for all choices of the mobility , since . Therefore, the condition (9.7) reduces to the isotropy condition above; in particular, it does not depend on .
Before we prove Proposition 9.3, we first show an elementary identity for finite-volume partitions; see also [26, Lemma 5.4] for a similar result in a non-periodic setting.
Lemma 9.5
Let be a -periodic finite-volume partition of . Then
| 9.8 |
Proof
For and , consider the open bounded convex polytope
Note that . We claim that the family forms a partition of up to a set of Lebesgue measure zero. To see this, fix a point and consider the function defined by if . If v is not orthogonal to any of the finitely many , then X(t) is well-defined up to a countable set . By Fubini’s theorem, it follows that .
If and , , then . This shows that is nondecreasing and that z is in at most one parallelepiped.
On the other hand, we have
Then we have
Since this identity holds for almost every , (9.8) holds by polarization.
Proof of Proposition 9.3
(i): We construct a competitor to the cell problem (4.6) for and . Define
| 9.9 |
We claim that . Indeed, the periodicity of yields
which shows that . To show that , we use the divergence theorem to obtain, for ,
Moreover, using Proposition 9.1 and Lemma 9.5 we find
which proves that . Therefore, using that is an admissible version of , another application of Lemma 9.5 yields (taking (9.4) into account),
which proves (i).
(iii): Suppose first that condition (9.7) holds. We will show that is a critical point of F. Take and define, for sufficiently small,
Then:
Furthermore, using the symmetry for , we obtain
| 9.10 |
where we write , so that the condition (9.7) reads as for all , , and . Hence, if this condition holds, we obtain, since ,
Adding the identities above, we conclude that whenever (9.7) holds. Therefore, is a critical point of F in . By convexity of F, it is a minimiser. Consequently, using Lemma 9.5, we obtain
which is the desired identity.
To prove the converse, we assume that (9.7) does not hold, i.e., we have for some , , and . On the other hand, we claim that
To see this, observe first that, by definition of admissibility of and the symmetry assumption , we have
Using this identity, the periodicity of m and J, and the identity (9.8) we obtain
which proves the claim.
We thus infer that is non-constant in x. (If it were, the identity would imply that for all x. But we assume that this doesn’t hold for .) Consequently, there exists a -periodic function with such that
As before, we consider defined by (9.9). In view of (9.10), we infer that is not a critical point of F in . As is a relatively interior point of , it cannot be a minimiser, hence .
(ii): We construct an element of the subgradient with for all , .
We set
and check by a simple calculation involving the chosen supergradients that for all periodic. The isotropy condition (9.6) implies that is independent of x and thus for all .
Since F is differentiable in J, we have to choose . By the same calculation as in (3) we see that for all .
To see that (m, J) is indeed a local (and thus global) minimiser of F in , we introduce a parameter and show that
| 9.11 |
for all and .
To see this, we expand the difference
where we used that is continuous. Because F is convex, (9.11) implies that (m, J) is a minimiser of F in .
Remark 9.6
Given a concave mobility , a popular admissible version is to take , with weights . If is differentiable, this means that . As a result, for certain finite-volume partitions we have to choose the weights to satisfy (9.7).
Of particular importance is the case , which was treated in [26] and [25]. Here an admissible version is called an admissible mean. For differentiable , condition (9.7) reduces to
We note that condition (9.7) cannot be satisfied for a large class of finite-volume partitions, although the square partition fulfills it with . The condition also holds for some other partitions that are not -periodic, such as the equilateral triangular and hexagonal partitions; see [26].
If we allow ourselves to use nonsmooth admissible versions of , it makes sense to use , as this choice guarantees the largest possible supergradient along the diagonal, making it more likely that .
Example 9.7
Let us consider the triangulation given in Fig. 6, where each unit square consists of four triangles: north, south, west, and east. We now show that (9.7) cannot be satisfied here, but (9.6) is satisfied for the particular nonsmooth choice .
Fig. 6.

A -periodic finite-volume partition of . The unit cube is shown in red (color figure online)
For the smooth case we assume that and define and . Note that by the chain rule . Let
For in the north triangle and in the south triangle we obtain that
since , , and Similarly we obtain for in the west and in the east triangle
Inserting the last two equalities into (9.7) we find that for all , i.e. that
But this is a contradiction to . In particular there exists no satisfying (9.7).
For the nonsmooth case note that the supergradient for is given by
For and we set
We need to show that is independent of x. For x in the north or the south triangle we find
Similarly for x in the west or east triangle we obtain
Consequently, this is independent of x and (9.6) holds.
Acknowledgements
J.M. gratefully acknowledges support by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 716117). J.M and L.P. also acknowledge support from the Austrian Science Fund (FWF), grants No F65 and W1245. E.K. gratefully acknowledges support by the German Research Foundation through the Hausdorff Center for Mathematics and the Collaborative Research Center 1060. P.G. is partially funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—350398276. We thank the anonymous reviewer for the careful reading and for useful suggestions.
Appendix A: The Kantorovich–Rubinstein metric on signed measures
We collect some facts on the Kantorovich–Rubinstein metric that are used in the paper. We refer to [8, Section 8.10(viii)] for more details.
Let (X, d) be a metric space. Let denote the space of finite signed Borel measures on X. For , let be the positive and negative parts, respectively. Let be its variation, and be its total variation.
Definition A.1
(Weak and vague convergence) Let for .
-
(i)
We say that weakly in if for every .
-
(ii)
We say that vaguely in if for every .
If (X, d) is compact, is a Banach space endowed with the norm . By the Riesz-Markov theorem, it is the dual space of the Banach space of all continuous functions endowed with the supremum norm .
For let be its Lipschitz constant.
Definition A.2
Let (X, d) be a compact metric space. The Kantorovich–Rubinstein norm on is defined by
| A.12 |
In non-trivial situations (i.e., when X contains an infinite convergent sequence), the norms and are not equivalent. Thus, by the open mapping theorem, is not a complete space.
A closely related norm on that is often considered is
for some fixed ; see [8, Section 8.10(viii)]. The next result shows that these norms are equivalent.
Proposition A.3
Let (X, d) be a compact metric space. For we have
where depends only on .
Proof
We start with the first inequality. Let with and . Define , so that and . Then
Taking the supremum over yields the desired bound.
Let us now prove the second inequality. Set . Take with and . Then for all , so that and . We obtain
Moreover, as can be seen by taking in (A.12) It follows that
as desired.
Proposition A.4
(Relation to ) Let (X, d) be a compact metric space. If are nonnegative measures of equal total mass, we have .
Proof
This follows from the Kantorovich duality for the distance .
On the subset of nonnegative measures, the -norm induces the weak topology:
Proposition A.5
(Relation to weak-convergence) Let (X, d) be a compact metric space. For we have
Proof
See [8, Theorem 8.3.2].
Remark A.6
(Testing against smooth functions) If , the space of functions with is dense in the set of Lipschitz functions with ; see, e.g., [40, Proposition A.5]. Consequently,
| A.13 |
Remark A.7
The identity (A.13) shows that is the dual norm of the separable Banach space . The dual space of is a strict superset of the finite Borel measures.
Appendix B: Norms on curves in the space of measures
We work with curves of bounded variation taking values in the space .
Definition B.1
(Curves of bounded variation) The space consists of all curves of measures such that the -seminorm
| B.14 |
is finite.
Remark B.2
The space is a (non-closed) subset of the space , where X is the separable Banach space . We refer to [28, Section 2] for the equivalence of several definitions of .
Definition B.3
The space consists of all curves in the Banach space-valued Sobolev space such that for a.e. .
Appendix C: Domain property of convex functions
Lemma C.1
(Domain properties of convex functions) Let be convex, and let . For every and every bounded set , there exists a compact convex set such that
Proof
Let be bounded and . Since , we can pick such that . Fix and set . We claim that .
To prove the claim, it suffices to show that , since is open. Take and pick a sequence such that . Observe that with if n is large enough (indeed, and ). Since , the claim follows by convexity of f.
We now define
By construction, is convex, bounded, and closed, thus compact. Let us show that .
By convexity of f, it suffices to show that . Pick and such that . Then there exists such that . Passing to a subsequence, we may assume that for some and for . Taking the limit as we infer that . Since , it follows that .
Appendix D: Notation
For the convenience of the reader we collect some notation used in this paper.
| Topological interior of a set A. | |
| The domain of . | |
| Bounded open time interval. | |
| The space of finite -valued Radon measures on A. | |
| The space of finite (positive) Radon measures on A. | |
| Anti-symmetric vector fields on : . | |
| The set of all with . | |
| The set of all with . | |
| The set of anti-symmetric real functions on . | |
| , | The discrete torus of mesh size : . |
| The effective flux of J: . | |
| The set of representatives of , i.e, all s.t. . | |
| The set of representatives of , i.e, all divergence-free and s.t. | |
| . |
| The set of representatives of , : . | |
| The cube of size centered in : for , . | |
| Shift operator: , for . | |
| Shift operator: , for | |
| Rescaling operator: : for . | |
| . | |
| . | |
| Discrete continuity equation: iff on . | |
| Continuous continuity equation: iff on . | |
| More precisely : the space of time-dependent curves of | |
| (Positive) measures with bounded variation with respect to the norm | |
| (Kantorovich–Rubenstein) on . | |
| More precisely : the space of time-dependent curves of | |
| (Positive) measures belonging to the Banach space . | |
| Discretisation of , : for , ( | |
| , given by , . |
In the paper we use some standard terminology from graph theory. Let be a locally finite graph.
A discrete vector field is an anti-symmetric function .
Its discrete divergence is the function defined by
| B.15 |
We say that J is divergence-free if .
Funding Information
Open access funding provided by Austrian Science Fund (FWF).
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Footnotes
In the sequel we consider more general discrete energy densities F(m, J), not necessarily sums of edge-energies.
We regard measures on as measures on the bigger set by the canonical inclusion.
See also Sect. 9.2.
To be precise, this is an application of these lemmas to the case of , thus .
As before, it’s an application of these lemmas on (corresponding to ).
Concavity of is not necessary for convexity of F. If is not concave, a local version of the supergradient can be substituted into (9.6). For readability we restrict ourselves to the concave case.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Peter Gladbach, Email: gladbach@iam.uni-bonn.de.
Eva Kopfer, Email: eva.kopfer@iam.uni-bonn.de.
Jan Maas, Email: jan.maas@ist.ac.at.
Lorenzo Portinale, Email: portinale@iam.uni-bonn.de.
References
- 1.Alicandro R, Cicalese M, Gloria A. Integral representation results for energies defined on stochastic lattices and application to nonlinear elasticity. Arch. Ration. Mech. Anal. 2011;200(3):881–943. doi: 10.1007/s00205-010-0378-7. [DOI] [Google Scholar]
- 2.Ambrosio L, Fusco N, Pallara D. Functions of Bounded Variation and Free Discontinuity Problems. Oxford University Press, New York: Oxford Mathematical Monographs. The Clarendon Press; 2000. [Google Scholar]
- 3.Ambrosio L, Gigli N, Savaré G. Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics ETH Zürich. 2. Basel: Birkhäuser Verlag; 2008. [Google Scholar]
- 4.Alicandro R, Cicalese M. A general integral representation result for continuum limits of discrete energies with superlinear growth. SIAM J. Math. Anal. 2004;36(1):1–37. doi: 10.1137/S0036141003426471. [DOI] [Google Scholar]
- 5.Bach A, Braides A, Cicalese M. Discrete-to-continuum limits of multibody systems with bulk and surface long-range interactions. SIAM J. Math. Anal. 2020;52(4):3600–3665. doi: 10.1137/19M1289212. [DOI] [Google Scholar]
- 6.Benamou J-D, Brenier Y. A computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem. Numer. Math. 2000;84(3):375–393. doi: 10.1007/s002110050002. [DOI] [Google Scholar]
- 7.Braides A, Fonseca I, Leoni G. A-quasiconvexity: relaxation and homogenization. ESAIM Control Optim. Calculus Var. 2000;5:539–577. doi: 10.1051/cocv:2000121. [DOI] [Google Scholar]
- 8.Bogachev VI. Measure Theory. II. Berlin: Springer-Verlag; 2007. [Google Scholar]
- 9.Braides A, Defranceschi A. Homogenization of Multiple Integrals. Oxford: Oxford University Press; 1998. [Google Scholar]
- 10.Buttazzo G, Freddi L. Functionals defined on measures and applications to non-equi-uniformly elliptic problems. Ann. Mat. Pura Appl. 1991;4(159):133–149. doi: 10.1007/BF01766298. [DOI] [Google Scholar]
- 11.Chow S-N, Huang W, Li Y, Zhou H. Fokker-Planck equations for a free energy functional or Markov process on a graph. Arch. Ration. Mech. Anal. 2012;203(3):969–1008. doi: 10.1007/s00205-011-0471-6. [DOI] [Google Scholar]
- 12.Disser K, Liero M. On gradient structures for Markov chains and the passage to Wasserstein gradient flows. Netw. Heterog. Media. 2015;10(2):233–253. doi: 10.3934/nhm.2015.10.233. [DOI] [Google Scholar]
- 13.Dolbeault J, Nazaret B, Savaré G. A new class of transport distances between measures. Calc. Var. Partial Differ. Equ. 2009;34(2):193–231. doi: 10.1007/s00526-008-0182-5. [DOI] [Google Scholar]
- 14.Eymard, R., Gallouët, T., Herbin, R.: Finite Volume Methods. In: Handbook of Numerical Analysis, Vol. VII, Handb. Numer. Anal., VII, pp. 713–1020. North-Holland, Amsterdam (2000)
- 15.Erbar M, Henderson C, Menz G, Tetali P. Ricci curvature bounds for weakly interacting Markov chains. Electron. J. Probab. 2017;22:23. doi: 10.1214/17-EJP49. [DOI] [Google Scholar]
- 16.Erbar M, Maas J, Tetali P. Discrete Ricci curvature bounds for Bernoulli–Laplace and random transposition models. Ann. Fac. Sci. Toulouse Math. 2015;24(4):781–800. doi: 10.5802/afst.1464. [DOI] [Google Scholar]
- 17.Erbar M, Fathi M. Poincaré, modified logarithmic Sobolev and isoperimetric inequalities for Markov chains with non-negative Ricci curvature. J. Funct. Anal. 2018;274(11):3056–3089. doi: 10.1016/j.jfa.2018.03.011. [DOI] [Google Scholar]
- 18.Erbar M, Maas J. Ricci curvature of finite Markov chains via convexity of the entropy. Arch. Ration. Mech. Anal. 2012;206(3):997–1038. doi: 10.1007/s00205-012-0554-z. [DOI] [Google Scholar]
- 19.Erbar M, Maas J. Gradient flow structures for discrete porous medium equations. Discrete Contin. Dyn. Syst. 2014;34(4):1355–1374. doi: 10.3934/dcds.2014.34.1355. [DOI] [Google Scholar]
- 20.Fathi M, Maas J. Entropic Ricci curvature bounds for discrete interacting systems. Ann. Appl. Probab. 2016;26(3):1774–1806. doi: 10.1214/15-AAP1133. [DOI] [Google Scholar]
- 21.Forkert D, Maas J, Portinale L. Evolutionary -convergence of entropic gradient flow structures for Fokker-Planck equations in multiple dimensions. SIAM J. Math. Anal. 2022;54(4):4297–4333. doi: 10.1137/21M1410968. [DOI] [Google Scholar]
- 22.Ford LR, Jr, Fulkerson DR. Flows in Networks. Princeton: Princeton University Press; 1962. [Google Scholar]
- 23.García Trillos, N.: Gromov–Hausdorff limit of Wasserstein spaces on point clouds. Calc. Var. Partial Differ. Equ. 59(2), 43 (2020)
- 24.Gigli N, Maas J. Gromov-Hausdorff convergence of discrete transportation metrics. SIAM J. Math. Anal. 2013;45(2):879–899. doi: 10.1137/120886315. [DOI] [Google Scholar]
- 25.Gladbach P, Kopfer E, Maas J, Portinale L. Homogenisation of one-dimensional discrete optimal transport. J. Math. Pures Appl. 2020;9(139):204–234. doi: 10.1016/j.matpur.2020.02.008. [DOI] [Google Scholar]
- 26.Gladbach P, Kopfer E, Maas J. Scaling limits of discrete optimal transport. SIAM J. Math. Anal. 2020;52(3):2759–2802. doi: 10.1137/19M1243440. [DOI] [Google Scholar]
- 27.Gladbach, P., Kopfer, E.: Limits of density-constrained optimal transport. Calc. Var. Partial Differential Equations. 61(2), (2022). 10.1007/s00526-021-02163-7 [DOI] [PMC free article] [PubMed]
- 28.Heida M, Patterson RIA, Renger DRM. Topologies and measures on the space of functions of bounded variation taking values in a Banach or metric space. J. Evol. Equ. 2019;19(1):111–152. doi: 10.1007/s00028-018-0471-1. [DOI] [Google Scholar]
- 29.Lisini S, Marigonda A. On a class of modified Wasserstein distances induced by concave mobility functions defined on bounded intervals. Manuscr. Math. 2010;133(1–2):197–224. doi: 10.1007/s00229-010-0371-3. [DOI] [Google Scholar]
- 30.Maas J. Gradient flows of the entropy for finite Markov chains. J. Funct. Anal. 2011;261(8):2250–2292. doi: 10.1016/j.jfa.2011.06.009. [DOI] [Google Scholar]
- 31.Marcellini P. Periodic solutions and homogenization of non linear variational problems. Annali di Matematica. 1978;117(1):139–152. doi: 10.1007/BF02417888. [DOI] [Google Scholar]
- 32.Mielke A. A gradient structure for reaction–diffusion systems and for energy-drift-diffusion systems. Nonlinearity. 2011;24(4):1329–1346. doi: 10.1088/0951-7715/24/4/016. [DOI] [Google Scholar]
- 33.Mielke A. Geodesic convexity of the relative entropy in reversible Markov chains. Calc. Var. Partial Differ. Equ. 2013;48(1–2):1–31. doi: 10.1007/s00526-012-0538-8. [DOI] [Google Scholar]
- 34.Mielke, A., Roubíček, T.: Rate-Independent Systems. Applied Mathematical Sciences, vol. 193. Springer, New York (2015)
- 35.Peyré, G., Cuturi, M.: Computational optimal transport. Found. Trends® Mach. Learn. 11(5-6), 355–607 (2019)
- 36.Piatnitski A, Remy E. Homogenization of elliptic difference operators. SIAM J. Math. Anal. 2001;33(1):53–83. doi: 10.1137/S003614100033808X. [DOI] [Google Scholar]
- 37.Peletier, M., Rossi, R., Savaré, G., Tse, O.: Jump processes as generalized gradient flows. Calc. Var. Partial Differential Equations. 61(1), (2022). 10.1007/s00526-021-02130-2
- 38.Rockafellar, R.T., Wets, R.J.-B.: Variational analysis. Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 317. Springer-Verlag, Berlin (1998)
- 39.Santambrogio, F.: Optimal transport for applied mathematicians. Progress in Nonlinear Differential Equations and their Applications, vol. 87. Birkhäuser/Springer, Cham (2015)
- 40.Schmitzer B , Wirth B. Dynamic models of Wasserstein-1-type unbalanced transport. ESAIM Control Optim. Calc. Var. 2019;25:54. doi: 10.1051/cocv/2018017. [DOI] [Google Scholar]
- 41.Villani, C.: Topics in Optimal Transportation. Graduate Studies in Mathematics, vol. 58. American Mathematical Society, Providence, RI (2003)
- 42.Villani, C.: Optimal Transport, volume 338 of Grundlehren der Mathematischen Wissenschaften. Springer-Verlag, Berlin (2009)
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.




