Skip to main content
Entropy logoLink to Entropy
. 2020 Apr 25;22(5):498. doi: 10.3390/e22050498

Lie Group Cohomology and (Multi)Symplectic Integrators: New Geometric Tools for Lie Group Machine Learning Based on Souriau Geometric Statistical Mechanics

Frédéric Barbaresco 1, François Gay-Balmaz 2,*
PMCID: PMC7516986  PMID: 33286271

Abstract

In this paper, we describe and exploit a geometric framework for Gibbs probability densities and the associated concepts in statistical mechanics, which unifies several earlier works on the subject, including Souriau’s symplectic model of statistical mechanics, its polysymplectic extension, Koszul model, and approaches developed in quantum information geometry. We emphasize the role of equivariance with respect to Lie group actions and the role of several concepts from geometric mechanics, such as momentum maps, Casimir functions, coadjoint orbits, and Lie-Poisson brackets with cocycles, as unifying structures appearing in various applications of this framework to information geometry and machine learning. For instance, we discuss the expression of the Fisher metric in presence of equivariance and we exploit the property of the entropy of the Souriau model as a Casimir function to apply a geometric model for energy preserving entropy production. We illustrate this framework with several examples including multivariate Gaussian probability densities, and the Bogoliubov-Kubo-Mori metric as a quantum version of the Fisher metric for quantum information on coadjoint orbits. We exploit this geometric setting and Lie group equivariance to present symplectic and multisymplectic variational Lie group integration schemes for some of the equations associated with Souriau symplectic and polysymplectic models, such as the Lie-Poisson equation with cocycle.

Keywords: momentum maps, cocycles, Lie group actions, coadjoint orbits, variational integrators, (multi)symplectic integrators, fisher metric, Gibbs probability density, entropy, Lie group machine learning, Casimir functions

1. Introduction

A geometric theory of statistical mechanics was developed by Souriau [1], motivated by the observation that Gibbs equilibrium states do not satisfy the usual physical covariance assumptions. This geometric theory, called by him Lie Groups Thermodynamics, is based on a Hamiltonian action of a Lie group on a symplectic manifold, to which are associated generalized Gibbs states, indexed by a Lie algebra parameter β playing the role of a geometric (Planck) temperature. Usual Gibbs states defined from a Hamiltonian appear as special cases in which the Lie group is a one-parameter group. The generalized Gibbs states become compatible with Galileo relativity in classical mechanics and with Poincaré relativity in relativistic mechanics, and the maximum entropy principle is preserved. See [2] for an exposition of Souriau’s approach.

A natural equilibrium state is characterized by an element β of the Lie algebra of the Lie group, determining the equilibrium temperature. In this geometric setting, the logarithm of the partition function, identified with the Massieu potential Φ(β), is defined on this Lie algebra. Its derivative, called the thermodynamic heat Q(β), gives the mean value of the energy and is an element of the dual of the Lie algebra. From this, two important quantities are defined. First the geometric heat capacity, given by minus the derivative of Q and giving the Fisher metric of the generalized Gibbs probability densities, second the entropy defined on the dual of the Lie algebra as the Legendre transform of the Massieu potential.

This geometric setting of Souriau was exploited and developed in [3,4,5,6,7,8,9,10] towards applications in information geometry and Lie group machine learning. Different tools developed based on Souriau Lie groups thermodynamics are explored in artificial intelligence for “Supervised Machine Learning” and “Non-Supervised Machine Learning” approaches. For “Supervised Machine Learning”, neural network natural gradient from information geometry could be extended on Lie algebra based on Fisher extension with respect to Souriau covariant maximum entropy Gibbs density on coadjoint orbits. For “Non-Supervised Machine Learning”, Souriau-Fisher metric transforms problems of learning on Lie groups to more classical problems of learning on metric spaces: extension of mean/median barycenter on Lie groups by Fréchet definition of geodesic barycenter, solved by Hermann Karcher flow and by exponential map (based on Souriau algorithm for matrix characteristic polynomial computation). For “Non-Supervised Machine Learning”, extension of “mean-shift” for homogeneous symplectic manifolds and Souriau-Fisher metric space. We can also make reference to GEOMSTATS libraries [11] developing codes for machine learning on Riemannian manifolds and Lie groups.

This paper introduces basic tools to extend supervised classification, using tools from geometric statistical mechanics and information geometry, which are applied to an extension of statistical learning theory for data as elements of Lie Groups. Classically, statistical machine learning is based on convex analysis on the set of posterior probability measures with respect to Gibbs posterior measures. Lie Groups thermodynamics introduces a generalized fully covariant Gibbs density on symplectic manifolds endowed with a symplectic Lie group action admitting a momentum map. An important example is the case of coadjoint orbits endowed with the Kirilov-Kostant-Souriau symplectic forms, and its extension with non-zero cohomology. We illustrate these statistical and geometrical tools for general Lie groups, which include affine groups such as Galileo group in mechanics, and the special Euclidean groups SE(2) and SE(3) in robotics. Classically we can associate to any posterior distribution an effective generalized geometric temperature, given by an element of the dual space of the Lie algebra, relating it to the Gibbs prior distribution. Classification rules could be introduced by Gibbs measures defined on parameter sets and depending on the observed sample value. A Gibbs measure is a special kind of probability measure used in statistical mechanics to describe the state of a particle system driven by a given energy function at some given temperature. Gibbs measures will be realized as minimizers of the average loss value under entropy constraints. In this extension for Lie Groups, an important tool is the log-Laplace transform related to the Massieu Characteristic Function in Thermodynamics (a re-parameterization of the free energy by Planck temperature preserving Legendre transform with respect to Entropy). As we want to deal with Lie group data for machine learning, we will consider tools very similar to those used in statistical mechanics to describe particle systems with many degrees of freedom. Classification rules could be described by Gibbs measures defined on parameter sets and depending on the observed sample value. Comparing any posterior distribution with a Gibbs prior distribution make it possible to provide a way to build an estimator which can be proved to reach adaptively at the best possible asymptotic error rate (by temperature selection of a Gibbs posterior distribution built within a single parametric model). Estimators derived from Gibbs posteriors show excellent performance in diverse tasks, such as classification, regression and ranking. The usual recommendation is to sample from a Gibbs posterior using MCMC (Markov chain Monte Carlo). With covariant Souriau Gibbs density, it is possible to extend MCMC and Gibbs sampler approach for Lie Groups Machine Learning.

More recently, the use of perturbation techniques was proposed as an alternative to MCMC techniques for sampling. These results were extended in conditional random fields loss, proving that the maximum in expectation with low-rank perturbations, provides an upper bound on the log partition (what we call Massieu characteristic function). New lower bounds on the partition function and new unbiased sequential sampler for the Gibbs distribution based on low-rank perturbations were introduced. All these methods are based on sampling from the Gibbs distribution, upper bounding the log partition function. All these results are synthetized in [12], where a new general method is also proposed, with connections to the recently proposed Fenchel-Young losses [13], using doubly stochastic scheme for minimization of these losses, for unsupervised and supervised learning. This is a generalization to the Gibbs distribution.

Methods for learning parameters of a Gibbs disribution on data (yi)i=1, , n are based on maximization of the likelihood

^n(θ)=1ni=1nlogpGibbs,θ(yi)=1ni=1nyi,θlogψ(θ)

that is optimized by gradients methods using the empirical log-likelyhood, given by

θ^n=y^nEGibbs,θ(y).

For this method of moment-matching, the expectation of the Gibbs distribution is a challenge in some cases. This approach was replaced by a method called “perturb-and-MAP” to learn the parameters in this model as a proxy for log-likelihood. This minimization is equivalent to maximizing previous equation by substituting the log-partition logψ(θ) with

Fϵ(θ)=EF(θ+ϵV)=EmaxyCy,θ+ϵV

with a random noise vector ϵV, ϵ>0. This approach could be linked with the use of Fenchel-Young losses [13]. In the perturbed model, the Fenchel-Young loss is given by:

Lϵ(θ;y)=Fϵ(θ)+ϵΩ(y)θ,y=DϵΩ(y,y^ϵ(θ))

with loss gradient θLϵ(θ;y)=θFϵ(θ)y=yϵ(θ)y, where

yϵ(θ)=Epθ(y)y=EargmaxyCy,θ+ϵV

and DϵΩ(y,y^ϵ(θ)) is the Bregman divergence associated with ϵΩ. As Fϵ generalizes the log-sum-exp function on the simplex, its dual Ω is a generalization of the negative entropy (which is the Fenchel dual of log-sum-exp). These connections were studied in [14].

In this paper, we describe a geometric framework for the study of Gibbs probability densities in statistical mechanics and information geometry, as well as the associated concepts of thermodynamic heat, entropy, and Fisher metric, inspired by Souriau’s symplectic model of statistical mechanics. This geometric framework unifies several earlier works on the subject, including Souriau’s symplectic model of statistical mechanics, its polysymplectic extension, Koszul model, and approaches developed in quantum information geometry. This approach helps to identify the common geometric structures appearing in various examples and provides a body of geometric tools for information geometry and Lie group machine learning. The emphasis is put on the role of the equivariance with respect to Lie group actions. For instance, we discuss the expression of the Fisher metric in presence of equivariance, we consider the associated Lie-Poisson equations with cocycle (also called affine Lie-Poisson equations) as well as their field theoretic versions, and we exploit the property of the entropy of the Souriau model as a Casimir function, to apply a geometric model for energy preserving entropy production on Lie algebras. In our developments, we make heavily use of several concepts from geometric mechanics, such as momentum maps, Casimir functions, coadjoint orbits, and Lie-Poisson brackets, as unifying concepts appearing in various applications of this framework to information geometry and machine learning. We consider in detail the Koszul model, the polysymplectic extension of the Souriau model, the case of the multivariate Gaussian probability densities, models of information geometry for quantum systems. We exploit the geometric framework to build geometric numerical integrator schemes for some of the equations associated with Souriau’s model and its polysymplectic extension. This is achieved by identifying the variational principles underlying these equations and by discretizing these principles, following the techniques of variational discretization, which result in schemes that preserve coadjoint orbits, (multi)symplectic structures, and discrete versions of Noether theorems.

The content of the paper is as follows. In Section 2.1 we present the general geometric framework for Gibbs probability densities that will be used in the paper. In particular, we review the definition of the Massieu potential, the thermodynamic heat, the entropy, the identification of the Fisher metric with the Hessian of the Massieu potential, and the maximum entropy principle. These results are independent of the existence of Lie group symmetries of the theory. The implications of such symmetries are studied in detail Section 2.2 where we present a Lie group equivariant setting that includes as special cases the Souriau model, its polysymplectic extension, and the case of multivariate Gaussian probability densities. The Souriau model is reviewed in Section 2.3 where we show that the associated entropy is a Casimir function for the Lie-Poisson bracket with cocycle and, motivated by an approach developed in quantum information geometry, we take advantage of this property to formulate a geometric model for entropy production. We also present the stochastic Hamiltonian equations associated with the Lie-Poisson bracket with cocycle. The polysymplectic model is reviewed in Section 2.4, where we show that the entropy also satisfies a natural extension of the Casimir property and we formulate a polysymplectic extension of the Lie-Poisson equations with cocycle. Finally, in Section 2.5 we give a general expression of the Fisher metric on orbits when equivariance is assumed. In Section 3 we apply the framework considered in Section 2 to various examples and identify common underlying geometric structures. We start in Section 3.1 with the case of multivariate Gaussian probability densities as an illustration of the general framework for which a cocycle is needed and which does not fall into the setting of the Souriau model. We apply Noether theorem to derive invariant quantities for geodesics of the Fisher metric. We then enlighten in Section 3.2 the strong analogies with quantum information geometry by considering Lie algebras with unitary representation and show that the Fisher metric as defined from the generalized heat capacity in Section 2.1, coincides with the Bogoliubov-Kubo-Mori metric. In this particular case the equation with Casimir dissipation/production reproduces a dissipative model used in quantum information geometry. Finally, in Section 3.3 we consider in detail the case of the Euclidean group of the plane SE(2), the associated Fisher metric, Lie-Poisson equations with cocycle and entropy production equations. In Section 4, we make use of this geometric setting to propose geometric integrators for some of the equations associated with the Souriau model and its polysymplectic extension. We first review some facts on variational integrators on Lie groups in Section 4.1 and about central extensions of Lie groups and the associated Euler-Poincaré equations in Section 4.2. This allows obtaining a variational formulation for the Lie-Poisson equations with cocycle. Based on this, we present a symplectic integrator for the Lie-Poisson equation with cocycle in Section 4.3 and a multisymplectic integrator for the Lie-Poisson field equations with cocycle in Section 4.4.

2. A General Framework for Lie Group Statistical Mechanics and Symmetries

2.1. A Class of Generalized Gibbs Probability Densities, Its Associated Entropy and Fisher Metric

In this section, we present a general framework for Gibbs probability densities in statistical mechanics and information geometry, that includes the classes considered for instance in the Koszul and Souriau models, as well as multivariate exponential families. In particular, we review the importance of the logarithm of the characteristic function, identified as the Massieu potential, from which the entropy arises as its Legendre transform and the Fisher information metric as its Hessian. We also discuss the relation of these Gibbs sates with the maximum entropy principle. While the concepts manipulated here are standard, our aim is to organize them in a general setting that is appropriate for the developments made in this paper.

The results described in this paragraph are independent of possible Lie group symmetries of the theory whose implications will be discussed in Section 2.2.

Let E be a vector space, whose elements will be denoted β since they are generalisations of the inverse temperature. The duality pairing between elements ν of the dual space E and elements βE is denoted as ν,β. Besides the vector space E, the setting also involves a manifold M, endowed with a volume form dμ.

Let U:ME be a smooth function defined on M with values in E. Denote by ΩE the largest open set such that for all βΩ the two integrals

MeU(m),βdμRandMU(m)eU(m),βdμE (1)

converge. We denote by ψ:ΩR the partition function (or characteristic function), given by

ψ(β)=MeU(m),βdμ. (2)

For all βΩ, we consider the generalized Gibbs probability densities

pβ(m)=1ψ(β)eU(m),β. (3)

For application in information geometry it is required that βpβ is injective. It is important to note that the Gibbs densities are not defined on the whole vector space E but only on the open subset Ω. An element βΩ is called a geometric temperature. From now on we assume that Ω is not empty.

The Massieu potential is the function Φ:ΩR defined by

Φ(β)=log(ψ(β)) (4)

from which we can write the generalized Gibbs probability densities as

pβ(m)=eΦ(β)U(m),β,βΩ.

The thermodynamic heat Q:ΩE is the first derivative of the Massieu potential, i.e.,

Q(β):=DΦ(β)=MU(m)pβ(m)dμ=Eβ[U]E, (5)

where Eβ denotes the expectation with respect to pβ.

We denote by Ω the image of the function Ω by Q and assume that Q=DΦ:ΩΩ is a diffeomorphism. In this case, we can define the entropys:ΩR as the Legendre transform of the Massieu potential Φ:ΩR, namely

s(ν):=ν,βΦ(β), (6)

where β=Q1(ν). In other words, βΩ in (6) is such that

DΦ(β)=ν.

The name entropy for this Legendre transform is justified by the following result.

Lemma 1.

For every βΩ , we have the equality

s(Q(β))=S(pβ),

where Q(β) is the thermodynamic heat and

S(p)=Mplogpdμ

is the entropy of the probability density p.

Proof. 

On one hand, using the definition of s in Equation (6) and Φ in Equation (4), we have

s(Q(β))=Q(β),βΦ(β)=MU(m),βpβ(m)dμ+log(ψ(β)).

On the other hand, we compute

S(pβ)=Mpβ(m)log(pβ(m))dμ=Mpβ(m)log(ψ(β))U(m),βdμ=log(ψ(β))Mpβ(m)dμ+MU(m),βpβ(m)dμ=log(ψ(β))+MU(m),βpβ(m)dμ.

These expressions are equal. ☐

Equation (6) is referred to as the Clairaut equation, see [15].

The generalized heat capacity is the symmetric tensor field K:Ωsym(E), defined as minus the Hessian matrix of the Massieu potential, i.e.,

K(β):=D2Φ(β)=D2logψ(β):E×ER.

A direct computation gives, for all vectors δβ1,δβ2E,

K(β)(δβ1,δβ2)=D2Φ(β)(δβ1,δβ2)=ddεε=0DΦ(β+εδβ1)·δβ2=ddεε=0MU(m),δβ2ψ(β+εδβ1)eU(m),β+εδβ1dμ=EβU,δβ1U,δβ2EβU,δβ1EβU,δβ2

hence the generalized heat capacity is

K(β)=Eβ(UEβ(U))(UEβ(U))=Eβ(UQ(β))(UQ(β)).

As a consequence, K(β) is positive semidefinite for all βΩ. Being the derivative of Q:ΩΩ, it is positive definite if Q is a diffeomorphism.

Recall that in information geometry, the Fisher metric associated with the family pβ, βΩ, of probability densities is the symmetric tensor field K:Ωsym(E) defined by

I(β)=Eβ[D2logpβ].

In our setting we have the following identification.

Proposition 2.

The generalized heat capacity of pβ cocincides with the Fisher metric of pβ :

I(β)=K(β).

In other words, the Fisher metric is the Hessian of the characteristic function logarithm K(β)=D2logψ(β) .

Proof. 

From Equation (3), we have

log(pβ(m))=logψ(β)U(m),β

hence, taking the second derivative with respect to β we get

D2logpβ=D2logψ(β)=D2Φ(β).

Please note that this equality does not depend on m, which proves that

I(β)=Eβ[D2logpβ]=Eβ(D2Φ(β))=D2Φ(β)=K(β).

Hence the result is proved. ☐

Proposition 3.

Let us assume thatQ:ΩΩis a diffeomorphism. The inverse of the Fisher metric, i.e., the cometric onΩinduced from the Fisher metric on Ω, is given by minus the Hessian of the entropy:

D2s(ν):E×ER,νΩ.

Proof. 

From the definition of the thermodynamic heat, we have DΦ(Q1(ν))·δβ=ν,δβ, for every νΩ and δβΩ. Taking the derivative with respect to ν, we get

D2Φ(Q1(ν))DQ1(ν1)·δν,δβ=δν,δβ. (7)

Taking now the derivative of Equation (6), we get Ds(ν)=Q1(ν) hence D2s(ν)=DQ1(ν). This can be used in Equation (7) and shows that D2Φ(β)·D2s(ν)=idE, where ν=Q(β). The result follows then from Proposition 2. ☐

The following result shows that the generalized Gibbs probability densities satisfy the maximum entropy principle [16].

Proposition 4

(Maximum entropy principle). Let U:ME be a smooth function and νΩE a given element. The generalized Gibbs probability density pβ in Equation (3) with β=Q1(ν) is a solution of the maximum entropy principle:

maxqMqlogqdμsuchthatMqdμ=1MUqdμ=ν.

Proof. 

Given a probability density q, we have

M(qlogqqlogpβ)dμ=MqlogqpβdμMq1pβqdμ=M(qpβ)dμ=0.

Hence, if q satisfies the constraints we get

MqlogqdμMqlogpβdμ=Mqlogψ(β)+U(m),βdμ=logψ(β)+MUqdμ,β=Φ(β)+ν,β=ν,Q1(β)Φ(Q1(ν))=S(ν)=s(pβ).

In the fourth equality we used β=Q1(ν), in the fifth equality we used definition Equation (6), and in the last equality we used Lemma 1. ☐

Koszul-Vinberg characteristic function. We now quickly describe a particular case of the above setting, which is related to Hessian geometry and in which the characteristic function Equation (2) recovers the Koszul-Vinberg characteirstic function, see [17,18,19,20,21] and the references in [3]. In this case, the Fisher information metric of information geometry coincides with the canonical Koszul Hessian metric given by Koszul forms. Analogies between Koszul-Vinberg model and Souriau symplectic model of statistical mechanics were enlightened in [3]. Here we will show how these two models precisely arise as special cases of the general setting presented in Section 2.1.

Let E be a vector space and ΩE an open convex cone in E. The cone Ω is assumed to be regular, i.e., Ω contains no straight line, which is equivalent to the condition Ω¯(Ω¯)={0}. We chose M=ΩE as the dual cone defined by

Ω:=ξEξ,β>0,βΩ¯{0},

and we choose the function U:M=ΩE as the identity function on Ω. We take the volume form as the Lebesgue measure dξ. The generalized Gibbs probability densities defined in Equation (3) are

pβ(ξ)=1ψ(β)eξ,β, (8)

with characteristic function Equation (2) given by

ψ(β)=Ωeξ,βdξ. (9)

This expression recovers the Koszul-Vinberg characteristic function of the cone Ω, see [17,18,19,20,21,22,23]. We call Equation (8) the Koszul density of the cone Ω.

The Koszul 1-form, [18], defined as the differential of logψ(β) coincides with the thermodynamic heat Q:ΩΩ of the general setting above. It reads

Q(β)=Ωξpβ(ξ)dξ=Eβ(ξ).

The Koszul metric defined as the second derivative of logψ(β) coincides with the Fisher metric of information geometry from Proposition 2. It reads

I(β)(δβ1,δβ2)=Ωξ,δβ1ξ,δβ2pβ(ξ)dξΩξ,δβ1pβ(ξ)dξΩξ,δβ2pβ(ξ)dξ.

From Proposition 4, given νΩ, the Koszul density of the cone Ω with β=Q1(ν), satisfies the maximum entropy principle

maxqΩqlogqdξsuchthatΩqdξ=1Ωξqdξ=ν,

see [3] for a direct proof.

An important example is Ω:=sym+(n)E=sym(n), the cone of symmetric positive definite n×n matrices. The dual space is chosen as E=sym(n) with duality pairing ν,β=Tr(νTβ). In this case, it is well-known that Ω=Ω. The generalized Gibbs probability densities are

pβ:sym+(n)R,pβ(ξ)=1ψ(β)eξ,β,

where the Koszul-Vinberg characteristic function can be explicitly computed as

ψ(β)=sym+(n)eξ,βdξ=det(β)n+12ψ(In).

The Massieu potential is deduced as

Φ(β)=log(ψ(β))=n+12log(det(β))log(ψ(In)) (10)

and the thermodynamic heat and entropy are

Q(β)=DΦ(β)=n+12β1=Eβ(ξ)
s(ν)=n+12log(det(ν))+n(n+1)21logn+12+log(ψ(In)).

We can thus write the generalized Gibbs probability densities as

pβ(ξ)=1ψ(β)eξ,β=det(β)n+121ψ(In)eξ,β.

Finally, the expression of the Fisher metric on Ω is found by using Equation (10) as

I(β)(δβ1,δβ2)=D2Φ(β)(δβ1,δβ2)=n+12Tr(β1δβ1β1δβ2), (11)

for every δβ1,δβ2E.

2.2. Equivariance with Respect to Lie Group Actions

In this section, we study the consequences of the equivariance of the function U appearing in the generalized Gibbs probability densities. More precisely, given a Lie group G, we assume that U:ME is G-equivariant with respect to an action of the Lie group on M and an affine action of the Lie group on E. This setting includes as special cases the Souriau symplectic model of statistical mechanics [24], its polysymplectic extension [5], the case of multivariate Gaussian densities, as treated for instance in [4], and approaches developed in quantum information geometry [25], for which the Fisher metric will be shown to coincide with the Bogoliubov-Kubo-Mori metric in Section 3.2.

Let G be a Lie group, and let

ϕ:G×MM,(g,m)ϕg(m)

be a left action of G on M, i.e., ϕ satisfies

ϕe=idMandϕgϕh=ϕgh,

for every g,hG, with idM the identity on M. We denote by g the Lie algebra of G. The infinitesimal generator of the action corresponding to ξg is the vector field ξM on M defined by

ξM(m)=ddεε=0ϕexp(εξ)(m), (12)

for every mM, where exp:gG is the Lie group exponential map.

We also consider a left linear action

ρ:G×EE,(g,β)ρg(β)

of G on the vector space E, ρgL(E,E). We denote by ρ:G×EE the linear right action of G induced on the dual space E

ρg(ν),β=ν,ρg(β),βE,νE,gG. (13)

We recall that a group one-cocycle with respect to ρ is a map θC(G,E) such that

θ(gh)=θ(g)+ρg1(θ(h)), (14)

for every g,hG. Equivalently, a group one-cocycle θC(G,E) with respect to ρ is such that A:G×EE defined by

Ag(ν)=ρg1(ν)+θ(g) (15)

is an affine left action of G on E.

Finally, we recall that the Jacobian of the action ϕg:MM relative to the volume form dμ is the function Jϕg:MR defined by ϕgdμ=Jϕgdμ, where ϕg denotes the pull-back of the n-form dμ by the diffeomorphism ϕg. We will be interested in actions which satisfy Jϕg=c(g) is a constant on M. Please note that c(gh)=c(g)c(h), for every g,hG. The particular case c(g)=1 corresponds to volume preserving diffeomorphisms.

Proposition 5.

Assume that the action ϕ of G on M satisfies ϕgμ=c(g)μ and the function U is G-equivariant:

U(ϕg(m))=ρg1(U(m))+θ(g), (16)

for all gG and mM , where θC(G,E) is a group one-cocycle. Then the open subset ΩE is invariant under the action of G on E, the partition function ψ satisfies

ψ(ρg(β))=ψ(β)c(g)eθ(g1),β

for every gG , and the probability density pβ satisfies

ϕgpβ=pρg1(β),

for every gG , where ϕgpβ=(pβϕg)Jϕg=(pβϕg)c(g) is the pull-back of a density.

As a consequence, the Massieu potential Φ(β) , the thermodynamic heat Q(β) , the entropy s(ν) , and the heat capacity K(β) satisfy the following equivariance properties

Φ(ρg(β))=Φ(β)log(c(g))θ(g1),β (17)
Q(ρg(β))=ρg1(Q(β))+θ(g) (18)
s(ρg1(ν)+θ(g))=s(ν)+log(c(g)) (19)
K(ρg(β))ρg(δβ1),ρg(δβ2)=K(β)δβ1,δβ2, (20)

for every gG .

Proof. 

Using Equation (16) and a change of variables, we have

ψ(ρg(β))=MeU(m),ρg(β)dμ=Meρg(U(m)),βdμ=MeU(ϕg1(m))θ(g1),βdμ=MeU(ϕg1(m)),βdμeθ(g1),β=MeU(m),βJϕgdμeθ(g1),β=ψ(β)c(g)eθ(g1),β.

The other statement are checked in a similar way, by using Equations (13)–(16). ☐

This proposition unifies in a single statement, several Lie group equivariance properties observed in several models for information geometry and Lie group machine learning, see, e.g., [3,4,5,7,8]. Before discussing the symplectic and polysymplectic models we illustrate below these equivariance properties for the Koszul model recalled above.

Equivariance in the Koszul model. For the Koszul model recalled above, see [3] and references therein, G=Aut(Ω) is the group of linear isomorphism that preserves ΩE. Given gAut(Ω), we have ρg:ΩΩ and it is clear that the dual action ρg preserves the dual cone Ω. In this very special case, M=Ω and the G action on M is chosen as ϕg:=ρg1. Since U:ΩE is the identity, there is no cocycle. However, we have c(g)=Jϕg which is not equal to one in general and, for instance, the transformation Equation (17) of the Massieu potential reads

Φ(ρg(β))=Φ(β)log(c(g)).

Let us consider as special case the cone of symmetric positive definite matrices Ω=sym+(n)E=sym(n). The dual space is chosen as E=sym(n) with duality pairing ν,β=Tr(νTβ) and we have Ω=Ω.

We consider the left action of GL(n) on E=sym(n) given by

ρA(β)=ATβA1. (21)

Therefore, we have

ρA(ν)=A1νATandϕA(ξ)=ρA1(ξ)=AξAT. (22)

Proposition 5 directly yields the following equivariance properties

ψ(ATβA1)=ψ(β)c(A)pβ(AξAT)c(A)=pATβA(ξ)Φ(ATβA1)=Φ(β)logc(A)Q(ATβA1)=AQ(β)ATs(AνAT)=s(ν)+logc(A),

for all AGL(n), where c(A)=(detA)n+1.

2.3. Souriau Symplectic Model of Statistical Mechanics

In this section, we show that the Souriau symplectic model of statistical mechanics [24] arises as a special case of the preceding setting, by considering (M,ω) a symplectic manifold and dμ the Liouville form associated with ω.

We then exploit this setting to show that the entropy in the Souriau model is a Casimir function of the Lie-Poisson bracket with Lie algebra cocycle associated with the nonequivariance cocycle of the momentum map, i.e., it Poisson commutes with every functions. Based on this we formulate a dynamical geometric model for dissipation/production of this Casimir, following the Lie algebraic setting proposed in [26,27]. This allows us to clarify the link between the geometry underlying Souriau symplectic models and that underlying models proposed in [25] in the framework of quantum physics by information geometry for some Lie algebras, see also [28]. Details will be given in Section 3.2. Finally, we present a stochastic perturbation of the Lie-Poisson equations with cocycle within the setting of stochastic Hamiltonian dynamics.

To present the Souriau model, we first quickly recall below the notion of momentum map and nonequivariance cocycle for symplectic manifolds, see, e.g., [29,30,31]. Consider a symplectic manifold (M,ω), i.e., a manifold M endowed with a closed non-degenerate two form ω. The associated Liouville form is dμ=(1)n(n1)/2n!ωω (n times), where 2n=dimM. Given a function h:MR, the Hamiltonian vector field associated with H is the vector field Xh defined by

iXhω=dh. (23)

Recall that the symplectic form ω defines the Poisson bracket (see Remark 7)

{f,g}=ω(Xf,Xg) (24)

on functions f,gC(M).

A Lie group action ϕ:G×MM of G on M is symplectic, if it preserves the symplectic form, i.e., ϕgω=ω, for every gG. Taking the derivative of this identity with respect to g at g=e, we get £ξMω=0, for every ξg, where ξM is the infinitesimal generator associated with the Lie algebra element ξg, see Equation (12), and £ is the Lie derivative. Equivalently, we have

d(iξMω)=0,

for every ξg, i.e., the one-form iξMω is locally exact. If it is globally exact, i.e., if ξM is a Hamiltonian vector field for every ξg, then the action is called Hamiltonian and admits a momentum mapJ:Mg, which satisfies

iξMω=dJξ,

where Jξ:MR is defined by Jξ(m):=J(m),ξ, for every ξg.

When M is connected, there is a well-defined group one-cocycle θ:Gg, called the nonequivariance cocycle, given by

θ(g)=J(Φg(m))Adg1(J(m)),

where mM can be arbitrarily chosen. It characterizes the nonequivariance of the momentum map with respect to the action of G on M and the coadjoint action of G on g. The group one-cocycle property is

θ(gh)=θ(g)+Adg1(θ(h)),

for every g,h,G. We consider its differential Θ:=Teθ seen as a map Θ:g×gR, i.e.,

Θ(ξ,η)=Teθ(ξ),η=ddεε=0θ(exp(εξ)),η. (25)

Taking the derivative of the relation above, we get

Θ(ξ,η)=J[ξ,η]{Jξ,Jη}, (26)

where the last term uses the Poisson bracket Equation (24). The map Θ:g×gR is bilinear, skew-symmetric, and, as can be readily verified, satisfies the Lie algebra two-cocycle identity

Θ([ξ,η],ζ)+Θ([η,ζ],ξ)+Θ([ζ,ξ],η)=0. (27)

We refer to [29,30,31] for detailed introductions to these concepts.

Remark 6

(Lie group and Lie algebra cohomology). A group one-cocycle θC(G,g) is called a group one-coboundary if there is a λg such that

θ(g)=λAdg1λ

for every gG. The quotient space of one-cocycles modulo one-coboundaries is called the first group cohomology of G and is denoted by H1(G,g). These definitions extend to arbitrary representation of G on a vector space, as in Equation (14).

A Lie algebra two-cocycle Θ is called a Lie algebra two-coboundary if there is λg such that

Θ(ξ,η)=λ,[ξ,η],

for all ξ,ηg. The quotient space of Lie algebra two-cocycles by Lie algebra two-coboundaries is called the second Lie algebra cohomology of g and is denoted by H2(g,R).

2.3.1. Souriau Symplectic Model of Satistical Mechanics

The Souriau symplectic model of statistical mechanics is obtained by considering the following specific situation in the setting described in Section 2.2:

M:asymplecticmanifolddμ:theLiouvillevolumeϕg:aHamiltonianactionE=g:theLiealgebraofGρg=Adg:theadjointactionofGongU=J:Mg:amomentummap.

In particular, the thermodynamic heat becomes Q(β)=Eβ(J) and the Fisher metric on Ωg is

I(β)=Eβ(JEβ(J))(JEβ(J))sym(g).

Proposition 5 directly yields the following equivariance properties

AdgΩ=Ω,ψ(Adgβ)=ψ(β)eθ(g1),β,pβϕg=pAdg1β

and

Φ(Adgβ)=Φ(β)θ(g1),β (28)
Q(Adgβ)=Adg1(Q(β))+θ(g) (29)
s(Adg1ν+θ(g))=s(ν) (30)
K(Adgβ)Adgδβ1,Adgδβ2=K(β)δβ1,δβ2, (31)

for every gG. Note also that Ω is invariant under the affine action νAdg1ν+θ(g).

From Proposition 4, given νΩg, the generalized Gibbs probability density

pβ(m)=1ψ(β)eJ(m),β,

with β=Q1(ν), satisfies the maximum entropy principle

maxqMqlogqdμsuchthatMqdμ=1MJqdμ=ν.

We refer to [2] for a detailed presentation of Souriau’s model. We refer to [32] for recent developments exploiting Souriau’s model. As mentioned earlier in the general case, it is important to note that the generalized Gibbs densities are not defined on the whole Lie algebra g but only on the open subset Ωg of geometric temperatures. As already observed by Souriau the set Ω can be empty in some examples, such as the case of the action of the Galilean group. In this case, Souriau’s method considers Gibbs densities associated with one-parameter subgroups of the acting Lie group.

2.3.2. Lie-Poisson Equations with Cocycle and Property of the Entropy in Souriau’s Model

From Equation (30), we note that the entropy s is constant on the affine coadjoint orbits defined by

O={Adg1μ0+θ(g)gG}, (32)

for μ0g. It is well-known that affine coadjoint orbits are symplectic manifolds, with symplectic form given by

ωO(μ)adξμΘ(ξ,·),adημΘ(η,·)=μ,[ξ,η]Θ(ξ,η), (33)

for μO, ξ,ηg. This is an extension to the affine case of the well-known Kirilov-Kostant-Souriau symplectic form on coadjoint orbits. The connected components of the affine coadjoint orbits Equation (32) are the symplectic leaves in the Poisson manifold (g,{,}Θ), where {,}Θ is the Lie-Poisson bracket with cocycle (or affine Lie-Poisson bracket)

{f,g}Θ(μ)=μ,δfδμ,δgδμΘδfδμ,δgδμ,f,g:gR, (34)

see, e.g., [29].

The Hamiltonian system (see Remark 7) associated with the Lie-Poisson bracket with cocycle Equation (34) and to a given Hamiltonian function h:gR is given by the Lie-Poisson equations with cocycle (or affine Lie-Poisson equations)

ddtf={f,h}Θ,f:gR, (35)

which yield the dynamical system

ddtμ+adδhδμμ=Θδhδμ,·, (36)

for a curve μ(t)g. This dynamical system preserves each affine coadjoint orbit Equation (32) and defines on each of them a Hamiltonian system with respect to the symplectic form Equation (33). The Lie-Poisson equations with cocycle have important applications, in particular they appear in the geometric formulation of complex fluids, see [33,34,35], and geometrically exact (Cosserat) rods, see [36,37]. See [38] for another point of view on Lie-Poisson equations with cocycle. These equations are also referred to as affine Lie-Poisson equations or Lie-Poisson equations with non-zero cohomology.

Remark 7

(Poisson brackets and reduction, see [31]). Recall that a Poisson bracket on a manifold M is a Lie algebra structure {·,·} on C(M) which is a derivation in each factor: {fg,h}=f{g,h}+{f,h}g. For instance, a symplectic structure ω on M defines the Poisson bracket {f,g}=ω(Xf,Xg) for f,gC(M). Another example is the Lie-Poisson bracket

{f,g}(μ)=±μ,δfδμ,δgδμf,gC(g) (37)

on the dual of any Lie algebra g, as well as its affine modified version Equation (34) by a two-cocycle Θ. The Hamiltonian system associated with a Poisson bracket and a given Hamiltonian hC(M) is the dynamical system characterized by the condition

ddtf={f,h}

for every functions fC(M), see for instance Equations (35) and (36).

An important point for applications in mechanics is the understanding of such Poisson structures as being induced from a canonical symplectic form (or, equivalently, from the associated canonical Poisson bracket) on a cotangent bundle, via reduction by symmetry relative to a Lie group action. This is the case for the Lie-Poisson bracket Equation (37) which is induced by the canonical symplectic form on TG and the action of G on TG given by the cotangent lifted action of right or left translation. The Lie-Poisson bracket with cocycle Equation (34) is induced by the canonical symplectic form on TG and an affine modified cotangent lifted action of right or left translation ([33]).

Corollary 8.

The entropy s of the Souriau model is a Casimir function for the Lie-Poisson bracket with cocycle Equation (36), i.e., it satisfies

{s,f}Θ=0,

for every smooth functions f:gR .

Proof. 

From Equation (30), we have

δsδμ,adξμ+Θ(ξ,·)=0

for all ξg. This is equivalent to adδsδμμΘδsδμ,·=0, which shows that {s,f}Θ=0, for all f. ☐

As a consequence of the above, the information manifold foliates into level sets of the entropy, containing a family of coadjoint orbits, that could be interpreted in Thermodynamics: motion remaining on theses level sets is non-dissipative, whereas motion transversal to these level sets is dissipative. The affine Kirillov-Kostant-Souriau form makes each orbit into a homogeneous symplectic manifold. Hamiltonian motion on these affine coadjoint orbits is given by the solutions of the Lie-Poisson equations with cocycle Equation (36). We shall present below a geometric way to introduce dissipation and hence, motion through affine coadjoint orbits.

Elementary examples. A particularly simple case of Souriau symplectic model is when the symplectic manifold is a cotangent bundle M=TQ endowed with the canonical symplectic form. Let G be a Lie group acting on the left on Q. Then its cotangent lifted action on TQ is symplectic and admits the momentum map J:TQg given by

J(αq),ξ=αq,ξQ(q). (38)

In this case, there is no cocycle, which yields obvious simplifications in the properties Equations (28)–(30).

Another case without cocycle is when M is an affine coadjoint orbit M=O={Adg1μ+θ(g)gG} endowed with the symplectic form Equation (33). In this case, the momentum map is simply the inclusion J:Og of the affine coadjoint orbit in the dual of the Lie algebra g, [29]. While this example is simple, it plays an important role in the applications, e.g., [8,39]. An example with nonequivariance cocycle will be treated in detail in Section 3.3 for the special Euclidean group of the plane.

2.3.3. Dynamics with Casimir Dissipation/Production

We take advantage of the Casimir function s associated with the Souriau model, to formulate a dynamical geometric model for dissipation/production of this Casimir. This allows us to clarify the link between Souriau symplectic models and models proposed in [25] in the framework of quantum physics by information geometry for some Lie algebras, see also [28].

We follow the general Lie algebraic approach developed in [26,27] for Casimir dissipation, slightly extended here to take into account of a cocycle, and to a wider class of dissipation.

Given a symmetric positive bilinear form γ:g×gR, a Hamiltonian h:gR, a parameter Λ0, and a function k:gR such that

δhδμ,δkδμ=0, (39)

we consider the modification of the Lie-Poisson equations with cocycle Equation (35) given by

ddtf={f,h}ΘΛγδfδμ,δkδμ,δsδμ,δkδμ (40)

for every f. We denote by :gg the flat operator associated with γ. That is, the linear form ξg is given by ξ(η)=γ(ξ,η), for all ξ,ηg. Please note that the flat operator need not be either injective or surjective. Using the equality

γδfδμ,δkδμ,δsδμ,δkδμ=δsδμ,δkδμ,δfδμ,δkδμ=adδkδμδsδμ,δkδμ,δfδμ,

Equation (40) yields the dynamical system

ddtμ+adδhδμμ=Θδhδμ+Λadδkδμδsδμ,δkδμ. (41)

For Θ=0 and h=k, this is the model proposed in [26,27] and applied there in the infinite dimensional setting, with applications to geophysical fluids and magnetohydrodynamics.

The main properties of system Equation (41) are the following.

  • (i)
    Energy conservation: taking f=h in Equation (40), we obtain
    ddth={h,h}ΘΛγδhδμ,δkδμ,δsδμ,δkδμ=0
    because of Equation (39) and since {h,h}Θ=0. Hence the total energy h is preserved.
  • (ii)
    Casimir dissipation (Λ>0) or production (Λ<0): taking f=s in Equation (40), and using {s,f}Θ=0, we obtain
    ddts={s,h}ΘΛγδsδμ,δhδμ,δsδμ,δkδμ=Λδsδμ,δkδμ20/0,
    where ξ2=γ(ξ,ξ).

We will explain in Section 3.2 how system Equation (41) recovers the model proposed in [25] in the context of information geometry for quantum systems for Lie algebras with unitary representation.

2.3.4. Stochastic Hamiltonian Dynamics

We shall briefly discuss here a stochastic perturbation of the Lie-Poisson equation with cocycle Equation (35) within the setting of stochastic Hamiltonian dynamics, see [40,41,42], which preserves the affine coadjoint orbits. This theory was recently extended for stochastic geometric modeling in fluid dynamics via variational principles in [43], see also [44,45].

In the context of this paper, this stochastic extension is motivated in geometric statistical mechanics to model Gibbs density in the case of centrifuge with random vibration along the axis (that is an open problem for industrial centrifuge, because for large equipment it is difficult to reduce vibration of this axis). In statistical machine learning, the problem is motivated for small data analytics, where the Gibbs density as maximum entropy of first order is an approximation. In this case, there is some fluctuations in estimation of mean momentum map due to the fact that the true Gibbs density is a density of higher order. This approximation could be modeled by an additional noise on the moment map.

In the setting of the Lie-Poisson equations with cocycle given in Equation (35), we consider the stochastic Hamiltonian dynamics given by

df={f,h}Θdt+i=1N{f,hi}ΘdWi(t), (42)

where hi:gR, i=1,,N are given Hamiltonians and Wi(t), i=1,,N are independent Brownian motions introduced in the Stratonovich sense, as indicated by the symbol ∘. Please note that the contribution of each Hamiltonian is inserted via the Lie-Poisson bracket with cocycle {·,·}Θ given in Equation (34). This results in the following Stratonovich differential equation for the stochastic process μ(t)g

dμ+adδhδμμΘδhδμ,·dt+i=1NadδhiδμμΘδhiδμ,·dWi(t)=0. (43)

The Itô form of Equation (43) can be obtained by the usual conversion formula. It can be expressed in a concise and general way as

df={f,h}Θ12{hi,{hi,f}Θ}Θdt+i=1N{f,hi}ΘdWi(t).

In a similar way with its deterministic version in Equation (36) the system Equation (43) restricts to a stochastic Hamiltonian system on each affine coadjoint orbits Equation (32) with respect to the Kirillov-Kostant-Souriau symplectic structure with cocycle Equation (33). From Corollary 8 it follows that the entropy s of the Souriau model is preserved by the stochastic dynamics Equation (43).

In absence of the cocycle, Equation (43) can be formally obtained from the variational principle

δ0Tμ,dgg1h(μ)dti=1Nhi(μ)dWi(t)=0, (44)

for variations δg and δμ of (g,μ)G×g. More precisely, the variations δg and δμ give the two conditions

dμ+addgg1μ=0anddgg1=δhδμdt+i=1NδhiδμdWi(t)

which yield Equation (43) in the special case Θ=0. Such variational principles play an essential role in stochastic geometric modelling [43,44,45], where the emphasis is made on the Lagrangian side. For instance in [44] the Lagrangian version of Equation (44) given by

δ0T(ξ)dt+μ,dgg1ξdti=1Nhi(μ)dWi(t)=0, (45)

is used, for variations δg, δξ, δμ of (g,ξ,μ)G×g×g, where :gR is the Lagrangian associated with h:gR in the hyperregular case. Variations δg, δξ, and δμ give the three conditions

dμ+addgg1μ=0,δδξ=μ,anddgg1=ξdt+i=1NδhiδμdWi(t)

which yield equivalent equations to Equation (43) with Θ=0.

To extend Equation (44) to cover the case of a Lie algebra two-cocycle Θ0 we shall formulate the variational principle on the central extension G^=G×R of the Lie group G with respect to a group two-cocyce B:G×GR that integrates Θ:g×gR. We refer to Section 4.2 below for a quick review of the main formulas for central extensions and their use in connection with the Lie-Poisson equations with cocycle. For the application to Equation (43) here, we just need to recall the expression of the group multiplication (g,α)(h,β)=(gh,α+β+B(g,h)) for (g,α),(h,β)G^ where B:G×GR is the group two-cocycle, and the relation

Θ(ξ,η)=ddss=0ddtt=0Bexp(tξ)1,exp(sη)Bexp(sη),exp(tξ)1 (46)

between Θ and B. Based on this, we consider the variational principle

δ0T(μ,a),(dg,dα)(g,α)1h(μ)dti=1Nhi(μ)dWi(t)=0,

for variations δg, δα, δμ, and δa of (g,α)G^=G×R and (μ,a)g^=g×R. In the first term, the operations are associated with the tangent lift of the multiplication on the central extension G^ and the pairing is (μ,a),(ξ,u)=μ,ξ+au, so that we have (μ,a),(dg,dα)(g,α)1=μ,dgg1+a(dα+D1B(g,g1)·dg). A computation, using the fact that B integrates Θ, i.e., Equation (46), shows that the variations δg, δα, δμ, and δa give the four conditions

dμ+addgg1μaΘdgg1,·=0,da=0,dgg1=δhδμdt+i=1NδhiδμdWi(t),dα+D1B(g,g1)·dg=0.

One notes that taking the initial condition a=1, we get the stochastic Lie-Poisson system with cocycle Equation (43) as desired. This approach using central extension can be easily used on the Lagrangian side too and yields the appropriate extension of Equation (45) to handle a cocycle Θ0.

We refer to [43,44,45,46], for various finite and infinite dimensional applications of the stochastic Lie-Poisson Hamiltonian system Equation (42) in the case Θ=0.

2.4. Polysymplectic Model of Statistical Mechanics

Polysymplectic geometry, as developed in [47], arises as a special case of multisymplectic geometry which is the natural geometric setting of classical field theories, see, e.g., [48]. When used in conjunction with the general setting developed in Section 2.1 and Section 2.2, the polysymplectic setting furnishes a natural generalisation of the Souriau symplectic model, to which many properties extend. This extension was proposed in [5]. Here we emphazise this model as a specific case of the general framework described in Section 2.1 and Section 2.2. This allows transposing immediately all the properties of this framework to the polysymplectic model. In particular, we will see that the entropy of the polysymplectic model enjoys a natural extension of the Casimir property observed in Section 2.3.2. The relevant equation is here an Lie-Poisson field equation with cocycle that we will describe in detail below.

This model is motivated by higher-order model of statistical physics. For instance, for small data analytics (rarified gases, sparse statistical surveys, …), the density of maximum entropy should consider higher order moments constraints, so that the Gibbs density is not only defined by first moment but fluctuations request 2nd order and higher moments, as introduced in [49,50,51,52,53,54].

Polysymplectic manifolds. We only need a restricted amount of notions from polysymplectic geometry which are straighforward extensions of those recalled above in the symplectic context. We refer to [47] for more information. A polysymplectic manifold (M,ω) is a manifold M endowed with a closed nondegenerate Rn-valued 2-form. We can identify ω with a collection (ω1,,ωn), of closed 2-forms with i=1nkerωi={0}.

A Lie group action ϕ:G×MM of G on M is polysymplectic, if Φgωi=ωi, for every gG and i=1,,n. Similarly as before, this implies that iξMω is a closed Rn-valued one-form on M. If this form is exact, then the action is called Hamiltonian and admits a polysymplectic momentum map J:ML(g,Rn), which satisfies

iξMω=dJξ,

where Jξ:MRn is defined by Jξ(m)=J(m)·ξ. In a similar way with the symplectic case, if M is connected, there is group one-cocycle θC(G,L(g,Rn)), θ=(θ1,,θn), defined by

θi(g)=Ji(Φg(m))Adg1(Ji(m)).

and one defines the map Θ:g×gRn by

Θ(ξ,η):=ddεε=0θ(exp(εξ))(η)Rn. (47)

Taking the derivative of the relation above, we get

Θ(ξ,η)i=J[ξ,η]iωi(ξM,ηM). (48)

As a consequence Θ is skew-symmetric, and satisfies the two-cocycle identity

Θ([ξ,η],ζ)+Θ([η,ζ],ξ)+Θ([ζ,ξ],η)=0, (49)

see [47].

Polysymplectic model. The polysymplectic model of statistical mechanics is obtained by considering the following specific situation in the equivariant setting described in Section 2.2:

M:apolysymplecticmanifolddμ:avolumeformϕg:avolumepreservingHamiltonianactionE=L(Rn,g):thelinearmapsfromRntotheLiealgebraofGρg=(Adg)n:theactioninducedonL(Rn,g)bytheadjointactionofGongU=J:ME=L(g,Rn):apolysymplecticmomentummap.

Here the space E=L(Rn,g) of linear maps is identified with the Cartesian product gn=g××g and (Adg)n acts on βE as

(Adg)n(β1,,βn)=(Adgβ1,,Adbβn).

The thermodynamic heat becomes a map Q:ΩL(Rn,g)ΩL(g,Rn) with Q(β)=Eβ(J)ΩL(g,Rn) and the Fisher metric on ΩL(Rn,g) is

I(β)=Eβ(JEβ(J))(JEβ(J))symL(Rn,g).

Proposition 5 directly yields the following equivariance properties

(Adg)nΩ=Ω,ψ((Adg)nβ)=ψ(β)eθ(g1),β,pβϕg=p(Adg1)nβ

and

Φ((Adg)nβ)=Φ(β)θ(g1),β (50)
Q((Adg)nβ)=(Adg1)n(Q(β))+θ(g) (51)
s((Adg1)nν+θ(g))=s(ν) (52)
K((Adg)nβ)(Adg)nδβ1,(Adg)nδβ2=K(β)δβ1,δβ2, (53)

for every gG. Note also that Ω is invariant under the affine action νL(g,Rn)(Adg1)nν+θ(g)L(g,Rn).

From Proposition 4, given νΩL(g,Rn), the generalized Gibbs probability density

pβ(m)=1ψ(β)eJ(m),β,

with β=Q1(ν), satisfies the maximum entropy principle

maxqMqlogqdμsuchthatMqdμ=1MJqdμ=ν.

Particular cases. A particularly simple case of polysymplectic Souriau model is given by the manifold M=TQTQ (Whitney sum with n factors) endowed with the polysymplectic form Ω=(Ω1,,Ωn), with Ωk=(πk)Ωcan. Here πk:TQTQTQ is the projection onto the kth factor of the sum and Ωcan is the canonical symplectic form on TQ. Let G be a Lie group acting on the left on Q. Then its naturally induced action on TQTQ is polysymplectic and admits the polysymplectic momentum map J:TQTQL(g,Rn) given by

J(αq1,,αqn)=(J(αq1),,J(αqn)),

where J:TQg is the momentum map associated with the cotangent lifted action of G on TQ given in Equation (38). In this case, there is no cocycle.

Another case without cocycle in the polysymplectic momentum map is when M is chosen as an orbit M=O={(Adg1)nμ+θ(g)gG}, μL(g,Rn), of the affine left action of G on L(g,Rn) given by μ(Adg1)nμ+θ(g), with θC(G,L(g,Rn)) a group one-cocycle. This orbit M is endowed with a natural polysymplectic form ω=(ω1,,ωn) with ωi defined by

ωi(μ)adξμΘ(ξ,·),adημΘ(η,·)=μi,[ξ,η]Θi(ξ,η)

where Θ is given in Equation (47), which is the polysymplectic version of Equation (33). In this case, the polysymplectic momentum map is simply the inclusion J:OL(g,Rn) of the orbit in L(g,Rn).

Property of the entropy and polysymplectic Lie-Poisson equations with cocycle. In the context of the polysymplectic model, a natural generalisation of the Lie-Poisson equations with cocycle Equation (36) are

k=1nxkμk+k=1nadδhδμkμk=k=1nΘkδhδμk,·, (54)

for a map μ:x=(x1,,xn)URnμ(x)=(μ1(x),,μn(x))L(g,Rn), with h:L(g,Rn)R a given Hamiltonian. In absence of the cocycle, such a field theoretic version of the Lie-Poisson equation appears, for instance, for the spacetime Lagrangian and Hamiltonian theoretic description of Cosserat rods and molecular strands, see [36,37].

From the invariance property Equation (52), we have

k=1nδsδμk,adξμk+Θk(ξ,·)=0

for all ξg. This is equivalent to

k=1nadδsδμkμkk=1nΘiδsδμk,·=0.

For h=s, Equation (54) thus reduce to k=1nxkμk=0. This is the natural extension of the Casimir property of s observed in the Souriau model in Section 2.3.2, given there by the condition adδsδμμΘδsδμ,·=0, giving ddtμ=0.

2.5. The Fisher Metric on Orbits and Equivariance

We give here a general expression of the Fisher metric on orbits of the action ρ:G×EE, in the general setting described in Section 2.1 and Section 2.2. This clarifies the link between the Fisher metric and the metric on adjoint orbits considered by Souriau, as enlightened in [4].

As in Section 2.1 we consider a manifold M, a vector space E, a function U:ME, and the class of generalized Gibbs probability densities

pβ(m)=1ψ(β)eU(m),β,βΩ.

As in Section 2.2 given a Lie group G we consider an action ϕ:G×MM and a representation ρ:G×EE. We denote by

ξE(β):=ddεε=0ρexp(ϵξ)(β)andξE(ν):=ddεε=0ρexp(εξ)(ν),

βE, νE the infinitesimal generators of the representations ρg and ρg associated with ξg. We will use the equality ξE(ν),β=ν,ξE(β). Given the group one-cocycle θC(G,E) associated with the function U, see Equation (16), we define ΘC(g,E) by

Θ(ξ),β=ddεε=0θ(exp(εξ)),β, (55)

for ξg and βE. Recall that the Fisher metric is I(β)=Eβ[D2logpβ] and coincides with the generalized heat capacity, see Proposition 2.

Proposition 9.

On the G-orbit throughβΩ, the Fisher metric is written in terms of Θ and Q as follows

I(β)(ξE(β),ζE(β))=Θ(ξ),ζE(β)+ξE(Q(β)),ζE(β). (56)

Proof. 

Taking the derivative with respect to g at e of the equality Equation (18) given by

Q(ρg(β)),γ=ρg1(Q(β)),γ+θ(g),γ

for every γE, we get

DQ(β)·ξE(β),γ=Θ(ξ),γξE(Q(β)),γ,

for every ξg. For γ=ζE(β)TβO, we get

DQ(β)·ξE(β),ζE(β)=Θ(ξ),ζE(β)ξE(Q(β)),ζE(β).

Therefore, from Proposition 2, we can write

I(β)(ξE(β),ζE(β))=D2Φ(β)(ξE(β),ζE(β))=Θ(ξ),ζE(β)+ξE(Q(β)),ζE(β),

which proves the result. ☐

We illustrate this result for the Souriau model, its polysymplectic extension, and the Koszul model.

Souriau Lie group statistical model. In this case, M is endowed with a symplectic structure ω, we take E=g and U=J:Mg with nonequivariance cocycle θC(G,g), i.e., θ(g)=J(ϕg(m))Adg1J(m)g. The map Θ defined in Equation (55) becomes here a two cocycle Θ:g×gR, see Equations (25)–(27), via the relation Θ(ξ),η=Θ(ξ,η). Proposition 9 immediately yields the following result as a corollary, which is obtained by noting that ξE(β)=adξβ and ξE(ν)=adξν and is a consequence of Equation (29).

Corollary 10.

On the adjoint orbit through β in g , the Fisher metric is written as follows

I(β)(adξβ,adζβ)=Θ(ξ,adζβ)+adξQ(β),adζβ. (57)

Please note that Equation (57) can be written as

I(β)(adξβ,adζβ)=Θβ(ξ,adζβ),

where Θβ(ξ,η):=Θ(ξ,η)adξQ(β),η=Θ(ξ,η)Q(β),[ξ,η] is a two-cocycle. In particular, the last term is a coboundary. We refer to [2] for more information.

Polysymplectic Lie group statistical model. In this case, M is endowed with a polysymplectic structure ω=(ω1,,ωn), we take E=L(Rn,g) and U=J:ML(g,Rn) with nonequivariance cocycle θC(G,L(g,Rn)):

θ(g)=J(ϕg(m))(Adg1)nJ(m)L(g,Rn).

The map ΘC(g,L(g,Rn)) defined in Equation (55) is identified here with the map Θ:g×gRn defined in Equation (47), with the properties Equations (48) and (49). The identification being Θ(ξ)(η)=Θ(ξ,η), where Θ(ξ)L(g,Rn) is applied to ηg. We note the equality Θ(ξ),(η1,,ηn)=i=1nΘi(ξ,ηi), for (η1,,ηn)gn identified with L(Rn,g), where , on the left hand side is the duality pairing between L(Rn,g) and L(g,Rn).

We now apply Proposition 9, which follows here from Equation (51). We have the infinitesimal generators ξE(β)=(adξβ1,,adξβn) and ξE(ν)=(adξν1,,adξνn), and we get

Θ(ξ),ζE(β1,,βn)=Θ(ξ),(adζβ1,,adζβn)=Θi(ξ,adζβi)

and

ξEQ(β),ζE(β)=iadξQ(β)i,adζβi.

Therefore, the following result is obtained.

Corollary 11.

On the adjoint orbit through β in L(Rn,g) , the Fisher metric is written as follows

I(β)(adξβ1,,adξβn),(adζβ1,,adζβn)=iΘi(ξ,adζβi)+iadξQ(β)i,adζβi. (58)

Koszul model. For the Koszul model with Ω=sym(n)+ the cone of positive definite matrices and the Lie group G=GL(n), the actions Equations (21) and (22) have the associated infinitesimal generators

ξE(β)=ξTββξandξE(ν)=ξννξT.

In this case, Θ=0 and Proposition 9 is satisfied by noting the equalities

I(β)(ξE(β),ζE(β))=(n+1)Trξζ+β1ξTβζ=ξE(Q(β)),ζE(β).

3. Applications

In this section, we show how the framework considered in Section 2 applies to various examples and helps identifying common underlying geometric structures. We start with the case of multivariate Gaussian probability densities as an illustration of the general framework, for which a cocycle is needed and which does not fall into the setting of the Souriau model. We then enlighten the strong analogies with quantum information geometry by considering Lie algebras with unitary representation and show that the Fisher metric as defined from the generalized heat capacity in Section 2.1, coincides with the Bogoliubov-Kubo-Mori metric. In this particular case the equation with Casimir dissipation/production considered in Section 2.3.3 reproduces a dissipative model of [25]. Finally, we consider in detail the case of the Euclidean group of the plane SE(2) since it allows explicit and relatively easy computations while exhibiting the interesting feature of having cocycle. This example fits into the setting of the Souriau symplectic model.

3.1. Multivariate Gaussian Probability Densities

In this paragraph we study in detail the case of multivariate Gaussian densities, following the approach developed in Section 2.1 and Section 2.2. A first treatment in this spirit was given in [4], Section 8. Here we clarify several steps in this approach by following systematically the general setting presented in Section 2.1 and Section 2.2, while we note that this example is not a particular case of the Souriau model. We present explicitly the cocycle, which is here defined on the general affine group, with values in the Cartesian product of symmetric matrices and the Euclidean space.

Gaussian probability densities in generalized Gibbs form. Consider a multivariate Gaussian density with symmetric and positive definite covariance matrix Rsym+(n) and mean mRn. The Gaussian probability density is written in the generalized Gibbs form pβ discussed above in Section 2.1 as follows:

p(R,m)(z)=1(2π)n/2det(R)1/2e12(zm)TR1(zm)=1(2π)n/2det(R)1/2e12mTR1me12zTR1zmTR1z=1(2π)n/2det(R)1/2e12mTR1mezzT,z,12R1,R1m=:1ψ(β)eU(z),β=:pβ(z),

for every zRn. In the last equality above, we have defined the energy function

U:Rnsym(n)×Rn,U(z)=(zzT,z),

the vector βsym+(n)×Rn in terms of (R,m) as

β=(β1,β2):=(12R1,R1m)R=12β11,m=12β11β2, (59)

and the partition function

ψ(β)=(2π)n/2det(R)1/2e12mTR1m=πn/2det(β1)1/2e14Tr(β2Tβ11β2).

The general theory of Section 2.1 will be applied here with the manifold M=Rn, the vector space E=sym(n)×Rn, and the open subset Ω=sym+(n)×Rn. It is important to note that the element β of the general theory is not given by the couple (R,m), but related to (R,m) via Equation (59). This plays a main role in the understanding of the equivariance properties below.

Characteristic function, thermodynamic heat, and entropy. The Massieu potential is computed in terms of βΩ as

Φ(β)=log(ψ(β))=n2log(2π)12log(det(R))12mTR1m=K+12log(det(β1))14β2Tβ11β2,

where we defined the constant K=n2log(π). To compute the derivative, we consider the dual space E=sym(n)×Rn, with duality pairing

(ν1,ν2),(β1,β2)=Tr(ν1β1)+ν2·β2,

(ν1,ν2)E, (β1,β2)E. With respect to this duality pairing we have

δΦδβ1=12β11+14β11β2(β11β2)T=R+mmT,δΦδβ2=12β11β2=m,

so we get the thermodynamic heat Q:ΩEΩE as

β=(β1,β2)Q(β1,β2)=12β11+14β11β2(β11β2)T,12β11β2=(ν1,ν2).

In terms of the covariance matrix R and the mean m, this is written as

(12R1,R1m)ΩE(R+mmT,m)ΩE.

The entropy in terms of β=(β1,β2) and (R,m) is computed by taking the Legendre transform of Φ as

s(β1,β2)=n2(1+logπ)12log(det(β1))s(R,m)=n2(1+log(2π))+12log(det(R)).

Its expression s:ΩR in terms of (ν1,ν2) is found by using

Q1(ν1,ν2)=12(ν1ν2ν2T)1,(ν1ν2ν2T)1ν2.

Fisher information metric. We compute the generalized heat capacity K(β):=D2Φ(β) as follows, see Section 2.1:

K(β)=D2Φ(β1,β2)((δβ1,δβ2),(Δβ1,Δβ2))=ddεε=0DΦ(β1+εΔβ1,β2+εΔβ2)(δβ1,δβ2)=ddεε=012Tr(β11δβ1)14Tr(β11β2β2Tβ11δβ1)+12Tr(β2Tβ11δβ2)=12Tr(β11Δβ1β11δβ1)+12Tr(β11Δβ1β11β2β2Tβ11δβ1)12Tr(β11Δβ2β2Tβ11δβ1)12Tr(β11δβ2β2Tβ11Δβ1)+12Tr(Δβ2Tβ11δβ2).

From Proposition 2 this coincides with the Fisher metric. Let us verify that this is the case by rewriting these five terms in terms of the mean and covariance matrix (m,R). The above expression equals

=12Tr(ΔRR1δRR1)+Tr(R1δRR1mmTR1ΔR)Tr(R1δRR1mmTR1ΔR)+Tr(ΔmmTR1δRR1)Tr(R1ΔRR1mmTR1δR)+Tr(δmmTR1ΔRR1)+Tr(R1ΔRR1mmTR1δR)Tr(δmmTR1ΔRR1)Tr(ΔmTR1δRR1m)+Tr(ΔmTR1δm)=12Tr(ΔRR1δRR1)+Tr(ΔmTR1δm),

which gives the Fisher metric I(R,m) for multivariate Gaussian densities.

Equivariance with respect to the general affine group. We consider the general affine group

GA(n)=GL(n)Rn

defined as the semidirect product of the general linear group and Rn. The group multiplication is

(A,a)(B,b)=(AB,Ab+a)

and the inverse of an element is (A,a)1=(A1,A1a). The Lie algebra is the semidirect product Lie algebra ga(n)=gl(n)Rn with Lie brackets [(U,u),(V,v)]=(UVVU,UvVu).

The group GA(n) acts on the left on the covariance matrix and the mean (R,m)sym+(n)×Rn as follows:

Ψ(A,a)(R,m)=(ARAT,Am+a). (60)

We consider the left action of GA(n) on Rn given by

ϕ(A,a)(z)=Az+a.

We note that Jϕ(A,a)=det(A), a constant function on Rn, hence ϕ satisfies the hypothesis of Lemma 5.

It is instructive to observe that the expression

Φ(Ψ(A,a)(R,m))Φ(R,m)

is not linear in (R,m), compare with Equation (17). However, such a statement is true when it is expressed in terms of the variables (β1,β2). We first need the expression of the action of GL(n) on (β1,β2). This is done in the next lemma.

Lemma 12.

The left action ofGA(n)induced on(β1,β2)sym(n)×Rnby the action Ψ bin Equation (60) is given by

ρ(A,a)(β1,β2)=ATβ1A1,ATβ22ATβ1A1a.

Its dual left action is

ρ(A,a)1(ν1,ν2)=Aν1AT+[2Aν2aT]sym,Aν2

Proof. 

This is a direct computation using Equation (59). ☐

The situation is illustrated in the following commuting diagram.

(R,m)sym(n)×RnΨ(A,a)sym(n)×Rn(β1,β2)sym(n)×Rnρ(A,a)sym(n)×Rn

The following result shows that the equivariant setting developed in Section 2.2 applies here with the action ϕ(A,a) and the representation ρ(A,a) (not Ψ(A,a)).

Lemma 13.

The energy function U(z)=(zzT,z) satisfies the relation

U(ϕ(A,a)(z))=ρ(A,a)1(U(z))+θ(A,a) (61)

for the group one-cocycle θ:GA(n)sym(n)×Rn given by

θ(A,a)=(aaT,a).

The Massieu potential, the thermodynamic heat, and the entropy satisfy the equivariance properties

Φ(ρ(A,a)(β1,β2))Φ(β1,β2)=log(det(A))+θ((A,a)1),(β1,β2)Q(ρ(A,a)(β1,β2))=ρ(A,a)1(Q(β1,β2))+θ(A,a)sρ(A,a)1(ν1,ν2)+θ(A,a)=s(ν1,ν2)+log(det(A)).

Proof. 

To prove Equation (61) we note that

U(ϕ(A,a)(z))ρ(A,a)1(U(z))=U(Az+a)ρ(A,a)1(zzT,z)=((Az+a)(Az+a)T,Az+a)(AzzTAT+[2AzaT]sym,Az)=(aaT,a).

The other results follow from Proposition 5 and from Jϕ(A,a)=det(A). Alternatively, we can compute explicitly

Φ(ψ(A,a)(β1,β2))=Φ(ATβ1A1,ATβ22ATβ1A1a)=K+12log(det(ATβ1A1))14Tr(ATβ1A1)1(ATβ22ATβ1A1a)(ATβ22ATβ1A1a)T=K+12log(det(A)2β1)14Trβ11(β22β1A1a)(β2T2aTATβ1)=Klog(det(A))+12log(det(β1))14Trβ11β2β2T+12Trβ2aTAT+12TrA1aβ2TTrA1aaTATβ1=Φ(β1,β2)log(det(A))+(A1aaTAT,A1a),(β1,β2)

which shows the result since (A1aaTAT,A1a)=θ((A,a)1). ☐

The identity relating the Fisher information metric, the cocycle, and the thermodynamic heat follows from the general Equation (56) as

I(β)(ξE(β),ζE(β))=Θ(ξ),ζE(β)+ξE(Q(β)),ζE(β),

where (β1,β2)sym+(n)×Rn, ξ=(ξ1,ξ2),ζ=(ζ1,ζ2)ga(n), Θ(ξ1,ξ2)=(0,ξ2) and the infinitesimal generators are

ξE(β)=ξ1Tβ1β1ξ1,ξ1Tβ22β1ξ2ξE(ν)=ξ1ν1ν1ξ1T2[ν2ξ2T]sym,ξ1ν2.

Geodesics on multivariate Gaussian densities and Noether theorem. Let us consider the Lagrangian L:TΩ=Ω×ER given by the kinetic energy of the Fisher metric

L(R,R˙,m,m˙)=12Tr((R1R˙)2)+m˙TR1m˙. (62)

The associated Euler-Lagrange equations are

R¨+m˙m˙TR˙R1R˙=0m¨R˙R1m˙=0. (63)

In accordance with Proposition 5, see Equation (20), the Fisher metric is invariant with respect to the action of GA(n) on (R,m)Ω given in Equation (60). As a consequence, the Lagrangian is invariant under the tangent lifted action of GA(n) on TΩ given by

Φ(A,a)T(R,R˙,m,m˙)=(ARAT,AR˙AT,Am+a,Am˙).

From Noether theorem, the corresponding momentum map is conserved. The momentum map JL:TΩga(n) associated with this Lagrangian and this action is given by

JL(R,R˙,m,m˙)=JR,LR˙,m,Lm˙=JR,R1R˙R1,m,2R1m˙

with J:TΩga(n) the momentum map of the cotangent lifted action of GA(n) relative to the canonical symplectic form, see Equation (38). Using the expression of the infinitesimal generator of Ψ given by

(U,u)Ω(R,m)=R,UR+RUT,m,Um+u,

for (U,u)ga(n), we get J(R,m,pR,pm)=2pRR+pmmT,pm, so that

JL(R,R˙,m,m˙)=2R1R˙+2R1m˙mT,2R1m˙.

From Noether theorem, we have the conservation laws

R1R˙+R1m˙mT=csteR1m˙=cste.

We also refer to [55,56].

3.2. Unitary Representations and Quantum Fisher Metric

In this paragraph, we highlight the strong analogies between the equivariant setting considered in this paper, and techniques in quantum information geometry, as developed in [25], see also [28]. In particular, when this setting is considered in the quantum context, the Fisher metric, as defined from the derivative of the generalized heat capacity, coincides with the Bogoliubov-Kubo-Mori metric. We also illustrate how the general equations with Casimir dissipation/production considered above reproduce the dissipative model proposed in [25].

In [25] information geometry was studied for some Lie algebras where for certain unitary representations, the statistical manifold of states was defined as convex cone for which the partition function is finite, making reference to Bogoliubov-Kubo-Mori metric. Please note that only the case with zero cohomology for the Lie algebras g=so(3) and g=sl(2,R) was studied.

Let G be a Lie group, acting on a complex Hilbert space by a unitary left representation, Ug:HH. We denote by βH the associated infinitesimal generator, giving the Lie algebra representation, and consider the self-adjoint operator iβH. We assume dimH<. The following class of density matrices is considered

ρβ=1ψ(β)exp(iβH), (64)

for βg, with partition function ψ(β)=Tr(exp(iβH)). We adopted in Equation (64) a general form for the class of density matrices, which includes the class considered in [25] and reference therein. Please note that Equation (64) is a model of faithful quantum states. Note also that the map βρβ is not necessarily injective in general. We do not assume this hypothesis for the development below, but it is required in quantum information geometry.

As in Section 2.1, we adopt the following definitions

Φ(β)=log(ψ(β)),Q(β)=DΦ(β),K(β)=D2Φ(β)

corresponding to the Massieu potential, the thermodynamic heat, and the generalized heat capacity. We note that

Q(β),δβ=Tr(ρβiδβH)=iδβHρβ,

for all δβg, which gives the expectation value of the observable iδβH in the quantum state ρβ. A result analogue to Equation (5) in the classical case. The generalized heat capacity is computed as

K(β)(δβ1,δβ2)=D2Φ(β)(δβ1,δβ2)=Trρβi(δβ1)Hi(δβ2)HTrρβi(δβ1)HTrρβi(δβ2)H,

thereby giving the covariance of the observables i(δβ1)H and i(δβ2)H in the quantum state ρβ. In [25], K is called the Bogoliubov-Kubo-Mori metric and chosen as the quantum version to the Fisher metric. Such a choice is geometrically natural in view of the result of Proposition 2 which identifies K with the Fisher metric in the classical case.

The von Neumann entropy of the density matrix can be expressed in terms of Φ and Q as Tr(ρβlogρβ)=Tr(ρβiβH)+logψ(β)=Q(β),βΦ(β)=s(ν), for ν=Q(β)=i(·)Hβg. This is analogue to the result of Lemma 1 giving the entropy as the Legendre transform of Φ(β), thus giving a quantum version of the Clairaut equation. See also [57] for a link with the Fisher metric.

Using (Adgβ)H=UgβHUg1, we have the following equivariance properties, which are obtained as in Proposition 5,

ψ(Adgβ)=ψ(β)Φ(Adgβ)=Φ(β)ρAdgβ=UgρβUg1Q(Adgβ)=Adg1(Q(β))s(Adg1ν)=s(ν)K(Adgβ)(Adgδβ1,Adgδβ2)=K(β)(δβ1,δβ2),

for every gG. In particular, ψ and Φ are constant on adjoint orbits and s is a Casimir for the Lie-Poisson bracket on g

{f,g}(μ)=μ,δfδμ,δgδμ, (65)

where we identify g with g using the duality pairing ν,β=Tr(νβ) and view g as a Lie subalgebra of u(H). Please note that with this pairing, we have Adg1=Adg and adβμ=[μ,β], so adjoint and coadjoint orbits are identified, and the Kirillov-Kostant-Souriau symplectic form on coadjoint orbits becomes

ωO(μ)(adξμ,adημ)=μ,[ξ,η].

Relation Equation (56) and adξμ=[μ,ξ] gives here the following expression of the Bogoliubov-Kubo-Mori metric on (co)adjoint orbits

K(μ)adξμ,adημ=adξQ(μ),adημ=Q(μ),[ξ,[η,μ]]=μ,[[Q(μ),ξ],η].

Casimir dissipation/production. The general equations for Casimir dissipation/production Equation (40) applied here with gu(H), g=g, and γ(ν,β)=ν,β=Tr(νβ), become

ddtf={f,h}Λδfδμ,δkδμ,δsδμ,δkδμ (66)

for every f, with {f,g} the Lie-Poisson bracket Equation (65). Since adξμ=[μ,ξ], Equation (41) yield

ddtμ+μ,δhδμ=Λδsδμ,δkδμ,δkδμ. (67)

Such equations where proposed in [25] with h(μ)=iH,μ, k(μ)=iT,μ, s(μ)=12μ,μ, with T and H two commuting self-adjoint operator [T,H]=0, thereby yielding the system

ddtμ+iμ,H=Λμ,T,T, (68)

with energy conservation and entropy production (Λ<0)

ddth=0andddts=Λ[μ,T]2.

3.3. Souriau Symplectic Model for SE(2), Lie-Poisson Equations with Cocycle, and Casimir Dissipation

In this paragraph, we illustrate many aspects of the geometric setting by considering the special Euclidean group of the plane, as it allows explicit and relatively easy computations while having a nonequivariant momentum map. We present the Lie-Poisson equations with cocycle (affine Lie-Poisson equations) with Casimir dissipation/production associated with the entropy of the Souriau symplectic model.

Momentum map and cocycle. Consider the special Euclidean group of the plane SE(2)=SO(2)R2 with semidirect product group multiplication

(Rφ,a)(Rψ,b)=(RφRψ,Rφb+a),

where Rφ is a rotation of angle φ. It acts on the plane R2 as

ϕ(θ,a)(x)=Rθx+a (69)

with infinitesimal generator

(λ,u)R2(x)=λJx+u

for (λ,u)se(2)=so(2)R2, where we identify so(2) with R and with

J=0110.

We consider on R2 the symplectic form ω(x,y)=x·Jy. It is easy to see that the action (69) is symplectic and admits the momentum map

J(x)=12|x|2,Jx.

This momentum map is not equivariant, with nonequivariance cocycle given by

θ(Rφ,a)=12|a|2,Ja. (70)

Gibbs densities, entropy, and Fisher metric. The generalized Gibbs probability densities are here given on M=R2 by

pβ(x)=1ψ(β)eJ(x),β=1ψ(β)e12λ|x|2u·Jx,

where β=(λ,u)Ωse(2), with Ω=(,0)×R2 and the partition function and Massieu potential are computed to be

ψ(β)=2πλe12λ|u|2,Φ(β)=log(2π)+log(λ)+12λ|u|2,β=(λ,u)Ω.

From this, we get the thermodynamic heat Q:Ωse(2)Ωse(2) as

Q(λ,u)=DΦ(λ,u)=1λ|u|22λ2,1λu

and we note that Ω={(μ,m)se(2)μ+|m|22<0}. The entropy s:ΩR is obtained as the Legendre transform of Φ:ΩR as

s(μ,m)=1+log(2π)+logμ|m|22. (71)

We note the relation

δsδm=δsδμm,δsδμ=μ+|m|221,

between the partial derivatives of s. From Proposition 2, the Fisher metric is found as

I(β)(δβ1,δβ2)=D2Φ(β)(δβ1,δβ2)=1λ21|u|2λδλ1δλ2+1λ2(u·δu1δλ2+u·δu2δλ1)1λδu1·δu2,

for every β=(λ,u)Ωse(2), i.e.,

I(β)=1λ21|u|2λuTuλI2.

Affine Lie-Poisson equations and Casimir dissipation. The affine coadjoint action associated with Equation (70) is found as

Ad(φ,a)1(μ,m)+θ(φ,a)=μRφm·Ja12|a|2,Rφm+Ja,

from which we directly observe that the entropy Equation (71) is constant on affine coadjoint orbits OΘse(2) and hence is a Casimir of the Lie-Poisson bracket with cocycle on se(2).

Using the expression Θ((λ,u),(γ,v))=ω(u,v) of the associated two cocycle, we get the Lie-Poisson bracket with cocycle

{f,g}Θ(μ,m)=(μ,m),δfδμ,δfδm,δgδμ,δgδmΘδfδμ,δfδm,δgδμ,δgδm=δgδμm·Jδfδmδfδμm·Jδgδm+δfδm·Jδgδm=δgδμωm,δfδmδfδμωm,δgδm+ωδfδm,δgδm.

Given a Hamiltonian h:se(2)R, one gets the Lie-Poisson equations with cocycle as the following system of ODEs

f˙={f,h}Θddtμ+m·Jδhδm=0ddtm+δhδμJm=Jδhδm. (72)

These equations determine Hamiltonian dynamics on affine coadjoint orbits that are the level sets of the entropy. From the point of view of thermodynamics, motion remaining on these surfaces is non-dissipative, whereas motion transversal to these surfaces is dissipative. We apply below the geometric approach to include dissipation and hence, motion through affine coadjoint orbits, as considered in general in Section 2.3.3.

Given the entropy Equation (71) and a function k:se(2)R which commutes with the Hamiltonian, i.e.,

δkδμJδhδm=δhδμJδkδmδkδμδhδm=δhδμδkδm

(for instance k=h), the Casimir dissipative/production Equation (40) gives here

ddtf={f,h}ΘΛδkδμJδfδmδfδμJδkδm·δkδμJδsδmδsδμJδkδm={f,h}ΘΛδkδμδfδmδfδμδkδm·δkδμδsδmδsδμδkδm

for every f, therefore, the following equations emerge

ddtμ+m·Jδhδm=Λδsδμ(δkδμmδkδm)·δkδmddtm+δhδμJm=JδhδmΛδsδμ(δkδμmδkδm)δkδμ, (73)

which have the property of preserving the Hamiltonian while dissipating/producing entropy as

ddts=Λ|δkδμδsδmδsδμδkδm|2=Λδsδμ2|δkδμmδkδm|2=Λ(μ+|m|22)2|δkδμmδkδm|2.

They are the SE(2) version of the Equation (67) proposed in the quantum context.

4. Variational Principles and (Multi)Symplectic Integrators

In this section, we make use of the geometric setting presented above to propose geometric integrators for some of the equations described earlier. Geometric integrators are numerical schemes designed with the aim to preserve as much possible the geometric structures underlying the equations they discretize [58]. It turns out that the preservation of geometric structures not only produces an improved qualitative behaviour, but also allows for a more accurate long-time integration. One efficient way to derive geometric integrators is to exploit the variational formulation of the continuous equations and to mimic this formulation at the spatial and/or temporal discrete level. For instance, for the ODEs of classical mechanics, a time discretization of the Lagrangian variational formulation permits the derivation of numerical schemes, called variational integrators, that are symplectic, exhibit good energy behavior, and inherit a discrete version of Noether’s theorem which guarantees the exact preservation of momenta arising from symmetries, see [59]. These methods are especially well-suited for systems on Lie group [60].

Variational integrators were extended to PDEs in various ways, one way being given by multisymplectic variational integrators ([61,62,63,64]) in which the starting point is a spacetime discretization of the Hamilton principle. Here also, a discrete version of Noether’s theorem for field theories is available in presence of symmetries. We refer to [64,65,66] for recent applications of multisymplectic variational discretizations.

In this section, we will present a geometric discretization of the Lie-Poisson equations with cocycle, see Section 2.3, that is symplectic and preserves the affine coadjoint orbits. We will then extend this approach to treat the case of the polysymplectic version of these Lie-Poisson equations with cocycle, see Section 2.4, by constructing a multisymplectic integrator. To achieve these goals, we will first present the variational principles attached to these equations, by looking at them from the Lagrangian side. Then these variational principles will be discretized in time or in space and time.

4.1. Preliminaries on Variational Lie Group Integrators

We very briefly recall the broad idea of variational integrators and refer to [59] for the detailed description. They are based on a discrete version of the Hamilton principle given, for a Lagrangian L:TQR, as

δ0TL(q(t),q˙(t))dt=0, (74)

for arbitrary variations of the curve q(t) with fixed extremities at t=0,T.

Euler-Poincaré and Lie-Poisson equations. We will be especially interested in the case where the configuration manifold is a Lie group, Q=G, and the Lagrangian L:TGR is right G-invariant. In this case, L induces a reduced Lagrangian on the quotient space (TG)/G identified with the Lie algebra g, i.e., we get :gR defined by the relation L(g,g˙)=(g˙g1). The Euler-Lagrange equations for L are equivalent to equations on g written in terms of the reduced Lagrangian :gR, called the Euler-Poincaré equations. They are obtained by computing the variational principle for induced by the Hamilton principle Equation (74). It is given by

δ0T(ξ(t))dt=0,forδξ=tη+[η,ξ] (75)

and yields the Euler-Poincaré equations

ddtδδξ+adξδδξ=0 (76)

for the curve ξ(t)g. In Equation (75), η(t) is an arbitrary curve in g vanishing at the extremities. If the Lagrangian is hyperregular, one can rewrite the Euler-Lagrange equations and the Euler-Poincaré equations in terms of the Hamiltonian associated with L or . In terms of the Hamiltonian h:gR obtained by the Legendre transform of , Equation (76) become the Lie-Poisson equations

ddtμ+adδhδμμ=0, (77)

see [31].

Variational integrators. Let Q be a configuration manifold and let L:TQR be a Lagrangian. Suppose that a time step Δt was fixed, denote by {tk=kΔtk=0,,N} the sequence of time, and by qd:{tk}k=0NQ, qd(tk)=qk a discrete curve. A discrete Lagrangian is a map Ld:Q×QR, Ld=Ld(qk,qk+1) that approximates the action integral of L along the curve segment between qk and qk+1, that is, we have

Ld(qk,qk+1)tktk+1L(q(t),q˙(t))dt,

where q(tk)=qk and q(tk+1)=qk+1. Usually this approximation is related to some numerical quadrature rule of the integral above. The discrete analogue of Hamilton’s principle Equation (74) reads

δk=0N1Ld(qk,qk+1)=0 (78)

for all variations δqd of qd with vanishing endpoints. After taking variations and applying a discrete integration by parts formula (change of indices), we obtain the discrete Euler-Lagrange equations:

D2Ld(qk1,qk)+D1Ld(qk,qk+1)=0,k{1,,N1}. (79)

These equations define, under appropriate conditions, an algorithm which solves for qk+1 knowing the two previous configuration variables qk and qk1.

To define the discrete momentum maps, one first needs to consider the discrete Legendre transforms defined by

F+Ld(qk,qk+1):=D2Ld(qk,qk+1)Tqk+1QFLd(qk,qk+1):=D1Ld(qk,qk+1)TqkQ. (80)

Then, given a Lie group action Φ:G×QQ, the discrete Lagrangian momentum maps JLd+,JLd:Q×Qg are defined by

JLd+(qk,qk+1),ξ=D2Ld(qk,qk+1),ξQ(qk+1)JLd(qk,qk+1),ξ=D1Ld(qk,qk+1),ξQ(qk). (81)

If the discrete curve {qj}j=0N satisfies the discrete Euler-Lagrange equations then we have the equality

JLd+(qk1,qk)=JLd(qk,qk+1),forallj=1,,N1. (82)

If the discrete Lagrangian Ld is G-invariant under the diagonal action of G induced by Φ on Q×Q, then the two discrete momentum maps coincide, JLd=JLd+=:JLd, therefore from Equation (82), we obtain that JLd is a conserved quantity along the discrete curve solution of Equation (79), that is,

JLd(qk,qk+1)=JLd(qk1,qk),forallj=1,,N1. (83)

This result is referred to as the discrete Noether’s theorem.

The symplectic character of the integrator is obtained by showing that the scheme (qk1,qk)(qk,qk+1) preserves the discrete symplectic two-forms ΩLd±:=(F±Ld)Ωcan on Q×Q, where Ωcan is the canonical symplectic two-form on TQ, see [59].

Discrete Euler-Poincaré equations. For Lie groups, variational discretization and the associated discrete Lagrangian reductions, was started in [60,67], and referred to as Lie group variational integrators. The essential idea behind such integrators is to discretize Hamilton’s principle and to update group elements using group operations. For the case of invariant systems on Lie group, one chooses a discrete Lagrangian that inherits the invariance of the continuous Lagrangian, i.e., Ld:G×GR satisfies Ld(gkh,gk+1h)=Ld(gk,gk+1), for all hG.

From this invariance, one defines the reduced discrete Lagrangian Ld on the associated quotient space (G×G)/G identified with G with quotient map (gk,gk+1)G×Ggk+1gk1G, i.e., the two discrete Lagrangians are related as Ld(gk,gk+1)=Ld(gk+1gk1), this is the point of view developed in [60]. The discrete Hamilton principle Equation (78) for Ld induces a discrete Euler-Poincaré variational principle for Ld that yields the discrete Euler-Poincaré equations on G. Numerically speaking it is desirable to obtain the algorithm on a vector space rather than on a Lie group. For this aim, a local diffeomorphism τ:gG with τ(0)=e is introduced to express small discrete changes in the group configuration through unique Lie algebra elements. Such a map is referred to as a retraction map ([68,69]). The discrete reduced Lagrangian is transported into a discrete Lagrangian d defined on a neighborhood of 0 in g via the relation

g(ξk)=Ld(gk+1gk1),withτ(Δtξk)=gk+1gk1. (84)

The relation on the right in Equation (84) is thought of as a discrete version of ξ=g˙g1.

The discrete Euler-Poincaré equations for d are obtained by computing the discrete variational principle induced on the discrete action k=0N1d(ξk) from the discrete Hamilton principle δk=0N1L(gk,gk+1)=0 recalled above in Equation (78). The main step in this process is to compute the variations δξk of ξk=1Δtτ1(gk+1gk1) induced by arbitrary variations δgk. One finds the expression

δξk=1ΔtdLτ1(Δtξk)·Adτ(Δtξk)1ηk+1ηk, (85)

where ηk=δgkgk1 and dLτ1(ξ):gg is the inverse to the left trivialized derivative of τ, dLτ(ξ):gg defined by

dLτ(ξ)·η=τ(ξ)1Dτ(ξ)·η. (86)

The discrete Euler-Poincaré variational principle thus reads

δk=0N1d(ξk)=0, (87)

with respect to variations δξk of the form Equation (85) with ηk vanishing at the endpoints. It yields the discrete Euler-Poincaré equations.

Adτ(Δtξk1)1μk1μk=0,μk:=dLτ1(Δtξk)δδξk. (88)

Here dLτ1(ξ):gg denotes the dual map to dLτ1(ξ):gg. We refer to [60,67,69,70] for the discrete Euler-Poincaré equations.

Being equivalent to the discrete Euler-Lagrange equations on the Lie group, this scheme is equivalent to a symplectic scheme (gk1,gk)(gk,gk+1) on G×G. From the discrete Noether theorem, the scheme also preserves the discrete momentum map and the coadjoint orbits Og. Moreover, the scheme μk1OμkO is symplectic on coadjoint orbits with respect to the Kirillov-Kostant-Souriau symplectic form, see [60]. Please note that the discrete momentum map is computed as

JLd(gk,gk+1)=1ΔtAdgkdLτ1(Δtξk)δδξk=1ΔtAdgkμk,

which is readily seen to be preserved, JLd(gk1,gk)=JLd(gk,gk+1), along the solutions of Equation (88)

4.2. Central Extensions and Variational Principle for the Lie-Poisson Equations with Cocycle

We considered in Equation (36) the Lie-Poisson equations with cocycle given by

ddtμ+adδhδμμ=Θδhδμ,·, (89)

associated with the Souriau symplectic model. Our aim is to derive a geometric integrator for this system that is symplectic and preserves the affine coadjoint orbits for general Hamiltonian. One systematic step is to look at Equation (89) from the Lagrangian side, as it was done for the ordinary Lie-Poisson equations above. Assuming that h is hyperregular, we can take the associated Lagrangian :gR and rewrite the equations as

ddtδδξ+adξδδξ=Θξ,·, (90)

for a curve ξ(t)g. However, in general (i.e., for arbitrary , arbitrary g, and arbitrary Θ) there is no natural variational principle for these equations, in the sense of a variational principle induced from the ordinary Hamilton principle for a Lagrangian L:TGR.

Nevertheless, there is a way to interpret the system Equation (90) as being induced by an ordinary Euler-Poincaré equations on a central extension of the Lie group G, integrating the Lie algebra cocycle Θ. This is related to a well-known fact that affine coadjoint orbits can be seen as ordinary coadjoint orbits of a central extension. We recall this fact below.

Lie group operations on central extensions. We shall focus on topologically trivial central extensions of finite dimensional Lie groups by R. The central extended group is thus of the form G^=G×R with group multiplication

(g,α)(h,β)=(gh,α+β+B(g,h))

where B:G×GR is a group two-cocycle, i.e., it satisfies

B(f,g)+B(fg,h)=B(f,gh)+B(g,h)

for all f,g,hG. It can always been chosen such that B(e,g)=B(g,e)=0, in which case we have B(g,g1)=B(g1,g) and (g,α)1=(g1,αB(g1,g)). One obtains from this the expression of the adjoint and coadjoint actions as

Ad(g,α)(η,v)=(Adgη,v+θ(g1),η) (91)
Ad(g,α)(μ,a)=(Adgμ+aθ(g1),a) (92)

where the group one-cocycle θC(G,g) is defined by

θ(g),η=D2B(g1,g)·ηgD1B(g,g1)·ηg. (93)

Equation (92) shows that the ordinary coadjoint orbits of G^ through (μ,1) are affine coadjoint orbits of G. We have the corresponding formulas

ad(ξ,u)(η,v)=[ξ,η],Θ(ξ,η) (94)
ad(ξ,u)(μ,a)=(adξμaΘ(ξ,·),0). (95)

Euler-Poincaré and Lie-Poisson equations on central extensions. From Equation (95), the Euler-Poincaré equations for a reduced Lagrangian ^:g^=g×RR take the form

ddtδ^δξ+adξδ^δξ=δ^δuΘ(ξ,·)ddtδ^δu=0. (96)

They are the critical conditions for the Euler-Poincaré variational principle

δ0T^(ξ(t),u(t))dt=0,forδξ=tη+[η,ξ],δu=tvΘ(η,ξ), (97)

which is just a special instance of Equation (75) applied to central extensions. In Equation (97) η(t)g and v(t)R are arbitrary curves vanishing at the extremities.

Given a Lagrangian :gR, one can then define the Lagrangian

^(ξ,u)=(ξ)+12u2 (98)

on g^ for which Equation (96) does reduce to Equation (90) if the initial condition for the curve u(t) is u(0)=1. This means that Equation (90) have a natural Euler-Poincaré variational formulation, if one interprets them as an invariant subsystem of an Euler-Poincaré equation on a central extension of G via a group two-cocycle B that integrates the one-cocycle θ as in Equation (93).

The same reasoning also directly applies on the Hamiltonian side, in which case the Lie-Poisson equation with cocycle Equation (89) is an invariant subsystem of an ordinary Lie-Poisson equation associated with a central extension of G.

All these considerations are standard, see, e.g., [29,31].

4.3. Variational Symplectic Integrators for the Lie-Poisson Equations with Cocycle

Here we shall present a geometric symplectic Lie group integrator for Lie-Poisson equations with cocycle Equation (36) that preserves the affine coadjoint orbits for general Hamiltonian. In particular, the scheme preserves the affine Kirillov-Kostant-Souriau symplectic form on these affine coadjoint orbits. We shall use the Euler-Poincaré variational formulation on central extensions presented in Section 4.2.

Some useful identities. Given a central extension G^=G×R, we shall consider the retraction map τ:g^G^ defined by

τ(ξ,u)=(τ¯(ξ),u) (99)

where τ¯:gG is a retraction map for G. To derive the discrete Euler-Poincaré equations we shall need several identities involving dLτ and dLτ¯, see Equation (86), that are shown in the next Lemma.

Lemma 14.

For a local diffeomorphism of the form Equation (99) on central extension, we have the following identities

  • (a)

    dLτ(ξ,u)·(η,v)=dLτ¯(ξ)·η,vD2B(τ¯(ξ),e)·(dLτ¯(ξ)·η)

  • (b)

    dLτ(ξ,u)·(μ,a)=dLτ¯(ξ)(μaD2B(τ¯(ξ),e)),a

  • (c)

    dLτ1(ξ,u)·(ζ,w)=dLτ¯1(ξ)·ζ,w+D2B(τ¯(ξ),e)·ζ

  • (d)

    dLτ1(ξ,u)·(μ,a)=dLτ¯1(ξ)·μ+aD2B(τ¯(ξ),e),a ,

where B:G×GR is the group two-cocycle.

Proof. 

These identities are proven as follows.

  • (a)
    Using the definition of dLτ, we compute
    dLτ(ξ,u)·(η,v)=τ(ξ,u)1(Dτ(ξ,u)·(η,v))=(τ¯(ξ),u)1(τ¯(ξ),Dτ¯(ξ)·η,u,v)=(τ¯(ξ)1Dτ¯(ξ)·η,v+D2B(τ¯(ξ)1,τ¯(ξ))·(Dτ¯(ξ)·η))=(dLτ¯(ξ)·η,v+D2B(τ¯(ξ)1,τ¯(ξ))·(τ¯(ξ)dLτ¯(ξ)·η))
    where in the third equality, we used the formula for the tangent lift of left translation on G^. Using the properties of the group two-cocycle B, we get the identity
    D2B(g1,g)·(gη)=D2B(g,e)·η,
    for all gG and ηg. Hence we get the result.
  • (b)
    Taking the dual map and using (a), we get
    dLτ(ξ,u)·(μ,a),(η,v)=(μ,a),dLτ(ξ,u)·(η,v)=μ,dLτ¯(ξ)·η+avD2B(τ¯(ξ),e)·(dLτ¯(ξ)·η)=dLτ¯(ξ)·μadLτ¯(ξ)D2B(τ¯(ξ),e),η+av,
    which proves the result.
  • (c)
    It follows by (a) and by inverting the relation (ζ,w)=dLτ(ξ,u)·(η,v)
    (ζ,w)=dLτ(ξ,u)·(η,v)=dLτ¯(ξ)·η,v+D2B(τ¯(ξ)1,τ¯(ξ))·(τ¯(ξ)dLτ¯(ξ)·η)
    is equivalent to
    (η,v)=dLτ¯1(ξ)·ζ,wD2B(τ¯(ξ)1,τ¯(ξ))·(τ¯(ξ)ζ)
  • (d)

    This follows by taking the dual map and using (c) as earlier.

 ☐

Variational discretization of the Lie-Poisson equations with cocycle. With the previous result, we first give below a symplectic integrator for the Euler-Poincaré equations on central extensions. Then we will show how this provides a symplectic integrator for the Lie-Poisson equations with cocycle.

Proposition 15

(Discrete Euler-Poincaré equations on central extensions). The following are equivalent:

  • (a)
    The discrete curve(ξk,uk)is critical for the discrete Euler-Poincaré variational principle
    δk^(ξk,uk)=0,
    with respect to variations
    δξk=1ΔtdLτ¯1(Δtξk)·(Adτ(Δtξk)1ηk+1ηk)δuk=1Δtvk+1vkD2B(τ¯(Δtξk),e)·ηk+D1B(e,τ¯(Δtξk))·ηk+1
    whereηkg and vkR are arbitrary discrete curves vanishing at the endpoints.
  • (b)
    The discrete curve (ξk,uk) is a solution of the discrete Euler-Poincaré equations
    Adτ(Δtξk1)1μk1+ak1θ(τ¯(Δtξk1))μk=0ak1ak=0 (100)
    with
    μk=dLτ¯1(Δtξk)δ^δξk+δ^δukD2B(τ¯(Δtξk),e)ak=δ^δuk. (101)

Proof. 

We use the discrete Euler-Poincaré formulation Equations (87)–(118). For (a), we use Equation (85) and Lemma 14, and we compute

δ(ξk,uk)=1ΔtdLτ1(Δtξk,Δtuk)·Adτ(Δtξk,Δtuk)1(ηk+1,vk+1)(ηk,vk)=1ΔtdLτ1(Δtξk,Δtuk)·Adτ¯(Δtξk)1ηk+1ηk,vk+1vk+θ(τ¯(Δtξk)),ηk+1=1Δt(dLτ¯1(Δtξk)·(Adτ(Δtξk)1ηk+1ηk),vk+1vk+θ(τ¯(Δtξk)),ηk+1+D2B(τ¯(Δtξk),e)·Adτ(Δtξk)1ηk+1ηk).

Using the identity θ(g),ξ+D2B(g,e)·Adg1ξ=D1B(e,g)·η, we get the desired result.

For (b), we use the formula for the coadjoint action on central extension to get

Adτ(Δtξk1,Δtuk1)1(μk1,ak1)(μk,ak)=(Adτ(Δtξk1)1μk1+ak1θ(τ¯(Δtξk1)),ak1)(μk,ak)

which proves Equation (100). Then, to get Equation (101), we note that

(μk,ak):=dLτ1(Δtξk,Δtuk)δδξk,δδuk=dLτ¯1(Δtξk)δδξk+δδukD2B(τ¯(Δtξk),e),δδuk

by Lemma 14. ☐

We note that the relation with the solution (gk,αk) of the discrete Euler-Lagrange on the Lie group G^ is given as

(ξk,uk)=1Δtτ1(gk+1,αk+1)(gk,αk)1

which is explicitly given by the relations

ξk=1Δtτ1(gk+1gk1)uk=1Δtαk+1αkB(gk,gk1)+B(gk+1,gk1). (102)

Similarly, the variations ηk,vk used in discrete Euler-Poincaré variational principle are related to the variations δgk, δαk used in the discrete Hamilton principle via the equality (ηk,vk)=(δgk,δαk)(gk,αk)1=(δgkgk1,δαk+D1B(gk,gk1)·δgk).

The discrete momentum map JLd:G^×G^g^ is computed as

JLd((gk,αk),(gk+1,αk+1)=1Δt(Adgkμk+akθ(gk1),ak)

where (μk,ak) are given in Equation (101) and relation Equation (102) are assumed. It is readily seen that JLd is preserved along the solutions of Equation (100).

The symplectic integrator for the Lie-Poisson equations with cocycle is deduced as follows.

Proposition 16. 

(Symplectic integrator for Lie-Poisson equations with cocycle) Let h:gR be a Hamiltonian assumed to be hyperregular, with associated Lagrangian :gR. Then the numerical scheme

Adτ(Δtξk1)1μk1+θ(τ¯(Δtξk1))μk=0 (103)

with

μk=dLτ¯1(Δtξk)δδξk+D2B(τ¯(Δtξk),e) (104)

is a symplectic scheme for the Lie-Poisson equations with cocycle

ddtμ+adδhδμμ=Θδhδμ,·. (105)

It preserves the affine coadjoint orbits

O={Adg1μ+θ(g)GG}

and μk1μk is symplectic relative to the affine Kirillov-Kostant-Souriau symplectic form

ωO(μ)adξμΘ(ξ,·),adημΘ(η,·)=μ,[ξ,η]Θ(ξ,η).

Proof. 

It is a direct consequence of Proposition 15, by choosing the reduced Lagrangian Equation (98), taking the initial condition a0=1 and noting that ak+1=ak=1. ☐

It is possible to rewrite the scheme in a way that is more advantageous from the point of view of implementation. By inserting Equation (104) in Equation (103) and using the identity

Adg1D2B(g,e)+θ(g)=D1B(e,g)

we get the scheme in terms of ξk as

Adτ(Δtξk1)1dLτ¯1(Δtξk1)δδξk1dLτ¯1(Δtξk)δδξk+D1B(e,τ¯(Δtξk1))D2B(τ¯(Δtξk),e)=0. (106)

It is also often assumed that the retraction map τ satisfies τ(ξ)τ(ξ)=e. In this case, we have the identity Adτ(ξ)dLτ1(ξ)=dLτ1(ξ), see [69], and the scheme Equation (106) takes the form

dLτ¯1(Δtξk1)δδξk1dLτ¯1(Δtξk)δδξk+D1B(e,τ¯(Δtξk1))D2B(τ¯(Δtξk),e)=0. (107)

In absence of the last two terms, we recover the most practically used form of the ordinary discrete Euler-Poincaré equations, e.g., [69]. The last two terms correspond to a discretization of the cocycle which ensures that the resulting scheme is symplectic on each affine coadjoint orbit. It is clear that such a form is not likely to be guessed from the continuous equations without having at hands the discrete variational principle.

Remark 17

(Choice of retraction map). For an exposition of retraction maps, such as canonical coordinates of the first and second kind, and their applications to Lie group methods, the reader is referred to [68]. A possible choice is the exponential map exp:gG. In this case, dLexp(ξ)·η and dLexp1(ξ)·η are given as series which are truncated in order to achieve a desired order of accuracy [58]. A standard choice is the Cayley map cay:gG defined by cay(ξ)=(eξ/2)1(e+ξ/2) which is valid for a general class of quadratic matrix groups (which include the groups SO(3), SE(2), and SE(3)). Based on this simple form, the derivative maps become

dLcay(ξ)·η=e+ξ/21ηeξ/21dLcay1(ξ)·η=e+ξ/2ηeξ/2,

for each ξ,ηg.

Example 17.

Consider the Lie-Poisson equations with cocycle for SE(2) derived in Section 3.3. The central extension integrating the group one-cocycle Equation (70) is SE(2)^=SE(2)×R with group two-cocyle B:SE(2)×SE(2)R given by

B((φ,a),(ψ,b))=12a·JRφb.

This group is referred to as the oscillator group. To apply the scheme Equation (107) to this case, we use the identities

D1B((I,0),(φ,a))=0,12JaandD2B((φ,a),(I,0))=0,12JRφ1a

as well as the Cayley map for SE(2) given by

cay(λ,u1,u2)=R(λ),24+λ2(λu2+2u1,λu1+2u2),R(λ)=14+λ2λ244λ4λλ24

and the expression dLcay1(λ,u1,u2):se(2)se(2) given in matrix representation as

I3+12000u20λu1λ0+14λ200λu100λu200

see [70].

4.4. Multisymplectic Lie Group Variational Integrators

In this paragraph, we briefly indicate how the discrete variational setting of the previous section can be extended to variational discretization in several independent variables, i.e., when the unknown is a field rather than a curve. At the continuous setting, the underlying geometric variational setting is the multisymplectic framework of field theories, see, e.g., [48]. Discrete multisymplectic variational versions of this setting were developed and applied in [61,62]. Multisymplectic variational discretization on Lie groups and the discrete Euler-Poincaré field equations were carried out in [63,64].

We will focus on the special case of fields defined on an open subset U of Rn with smooth boundary, with values in a configuration manifold Q. We also assume that the Lagrangian only depends on the values of the fields and their first derivatives, not on the parameter xRn, so it is a map L:TQTQR. Hamilton’s principle for a field q:URnQ is

δULq(x),1q(x),,nq(x)dx=0,

for arbitrary variations of the field q that vanish on the boundary of U, from which the Euler-Lagrange equations for the field q(x) are obtained.

We shall focus on the case Q=G a Lie group and for right-invariant Lagrangians, i.e.,

Lgh,v1h,,vnh=Lg,v1,,vn,

for every v1,,vnTgG and every hG. In this case, L induces the reduced Lagrangian :ggR defined by (v1g1,,vng1)=Lg,v1,,vn. As in the ordinary Euler-Poincaré case recalled above, Hamilton’s principle yields the reduced variational principle

δU(ξ1,,ξn)dx=0,δξk=kη+[η,ξk], (108)

for an arbitrary field η:Ug vanishing on the boundary, which results in the Euler-Poincaré field equations

k=1nkδδξk+k=1nadξkδδξk=0. (109)

To guarantee the existence of a field g:UG such that ξk=kgg1, k=1,,n, the fields ξi in Equation (109) must satisfy the relation kξiiξk=[ξk,ξi]. In terms of the associated Hamiltonian h:ggR, these equations give Equation (54) without cocycle, i.e., with Θk=0.

To include the case with cocycle in a variational setting, we shall proceed exactly as in Section 4.2, by passing to a central extension of G. This is here done in the context of the Euler-Poincaré field equations, rather than for the ordinary Euler-Poincaré equations. This is the content of the next paragraph.

Variational principle for the Lie-Poisson field equations with cocycle. The goal of this paragraph is to obtain a variational principle for the Lie-Poisson field equations with cocycle Equation (54) associated with Souriau’s polysymplectic model. By considering the Euler-Poincaré field equations Equation (109) on a central extension, we get the system

kkδ^δξk+kadξkδ^δξk=kδ^δukΘ(ξk,·)kkδ^δuk=0. (110)

They are the critical conditions for the variational principle

δ0T^((ξ1,u1),,(ξn,un))dt=0,forvariationsδξk=kη+[η,ξk],δuk=kvΘ(η,ξk), (111)

which is just a special instance of Equation (109) applied to central extensions. The existence of a field (g,α):UG^ imposes the conditions kξiiξk=[ξk,ξi] and kuiiuk=Θ(ξk,ξi).

Given a Lagrangian :ggR, one can define the Lagrangian

^((ξ1,u1),,(ξn,un))=(ξ1,,ξn)+kuk (112)

on g^g^ for which Equation (110) does reduce to a Lagrangian version of the Lie-Poisson field equation with cocycle Equation (54), as desired, where it is assumed that Θk=Θ, for all k.

The same reasoning also directly applies on the Hamiltonian side, in which case the Lie-Poisson field equation with cocycle Equation (89) is an invariant subsystem of an ordinary Lie-Poisson field equation associated with a central extension of G.

Remark 18

(Polysymplectic vs multisymplectic setting). The field theories that are used in this paper can be described within the restricted setting of polysymplectic geometry. For such particular field theories the configuration bundle of the theory is trivial, the base is Euclidean, and the Lagrangian does not depend on the variables in the base. General classical field theories cannot be described by polysymplectic geometry and fit into the more general setting of multisymplectic geometry. This mainly comes from the fact that the field theoretic analogue to the cotangent bundle (endowed with the canonical symplecic form) of mechanics is the dual jet bundle of the configuration bundle (endowed with a canonical multisymplectic form). We shall apply below multisymplectic variational integrators to field equations belonging to the setting of polysysmplectic geometry. In this case, they could logically be called polysymplectic variational integrators. Please note that we kept the original naming multisymplectic variational integrators, since the theory applies to general multisymplectic field theories not just polysymplectic ones. See, e.g., [62,64] for applications of multisymplectic integrators to situations that are not covered by the polysymplectic formalism.

Multisymplectic Lie group integrators. To present multisymplectic integrators, we shall focus on the two dimensional case and assume that the fields are defined on a rectangle U=[0,A]×[0,B]R2. We shall write (x1,x2)=(x,y). Let Q be a configuration manifold and let L:TQTQR be a Lagrangian. We shall consider the very special case of a discrete grid determined by {(xk,ya)=(kΔx,aΔy)k=0,,N1,a=1,,N2} with given Δx and Δy. We shall denote by qd:{(xk,ya)}k=0NQ, qd(xk,ya)=qka a discrete field. A discrete Lagrangian is a map Ld:Q×Q×QR, Ld=Ld(qka,qk+1a,qa+1k) that approximates the action integral of L on the rectangle [xk,xk+1]×[ya,ya+1] for a field interpolating the values qka,qk+1a,qa+1k. The discrete Hamilton principle reads

δk=0N11a=0N21Ld(qka,qk+1a,qa+1k)=0, (113)

for all variations δqd of qd with vanishing boundary values. The discrete Euler-Lagrange equations are obtained as the critical point condition for a discrete field qd.

Given a Lie group action Φ:G×QQ, the discrete Lagrangian field momentum maps JLdi,:Q×Q×Qg, i=1,2,3 are defined by

JLd1(qka,qk+1a,qka+1),ξ=D1Ld(qka,qk+1a,qka+1),ξQ(qka)JLd2(qka,qk+1a,qka+1),ξ=D2Ld(qka,qk+1a,qka+1),ξQ(qk+1a)JLd3(qka,qk+1a,qka+1),ξ=D3Ld(qka,qk+1a,qka+1),ξQ(qka+1) (114)

which satisfies JLd1+JLd2+JLd3=0.

We refer to [61,62] for an introduction to multisymplectic variational integrators, including the notion of discrete multisymplecticity, discrete Cartan forms, and discrete field momentum maps, see also [71]. These integrators, also satisfy a discrete Noether theorem in presence of symmetries, as we shall see below in the special case of Lie groups.

Multisymplectic variational integrators on Lie groups were developed in [63,71], for application to geometrically exact (Cosserat) rods. As above, we shall focus on the two dimensional case and U=[0,A]×[0,B]R2. For Q=G a Lie group, the discrete Lagrangian is a map Ld:G×G×GR. We assume that the continuous Lagrangian is G invariant and that the discrete Lagrangian Ld inherits this invariance, i.e.,

L(gkah,gk+1ah,ga+1kh)=L(gka,gk+1a,ga+1k),

for every hG. Hence, by passing to the quotient associated with this action we get a reduced Lagrangian Ld:G×GR, Ld(gk+1a(gka)1,gka+1(gka)1)=L(gka,gk+1a,ga+1k). As mentioned earlier, it is advantageous to introduce a retraction map τ:gG, τ(0)=e, from which the discrete reduced Lagrangian can be defined on a neighborhood of (0,0) in g×g via the relation

g(ξka,ζka)=Ld(gk+1a(gka)1,gka+1(gka)1)withτ(Δxξka)=gk+1a(gka)1andτ(Δyζka)=gka+1(gka)1. (115)

The last two relations are thought of as discrete versions of ξ=1gg1, ζ=2gg1.

The discrete Euler-Poincaré field equations for d are obtained by computing the discrete variational principle induced on the discrete action k=0N11a=0N21d(ξka,ζka) from the discrete Hamilton principle δk=0N11a=0N21L(gka,gk+1a,gka+1)=0 recalled above in Equation (113). The main step in this process is to compute the variations δξkaδζka induced by arbitrary variations δgka. One finds the expression

δξka=1ΔxdLτ1(Δxξka)·Adτ(Δxξka)1ηk+1aηkaδζka=1ΔydLτ1(Δyζka)·Adτ(Δyζka)1ηka+1ηka. (116)

The discrete field Euler-Poincaré variational principle thus reads

δk=0N11a=0N21d(ξka,ζka)=0, (117)

with respect to variations δξka, δζka of the form Equation (116) with ηka vanishing at the boundary. It yields the discrete Euler-Poincaré field equations.

1ΔxAdτ(Δxξk1a)1μk1aμka+1ΔyAdτ(Δyζka1)1νka1νka=0,μka:=dLτ1(Δxξka)δδξka,νka:=dLτ1(Δyζka)δδζka. (118)

We refer to [63,71] for details, including the treatment of boundary conditions, the description of the associated discrete Cartan forms, the discrete field momentum maps, as well as the symplectic and multisymplectic characters of the scheme.

We just recall below the expression of the field momentum maps Equation (114) which take the following form:

JLd1(gka,gk+1a,gka+1)=1ΔxAdgkμka1ΔyAdgkνkaJLd2(gka,gk+1a,gka+1)=1ΔxAdgkμkaJLd3(gka,gk+1a,gka+1)=1ΔyAdgkνka.

The discrete Noether theorem, then asserts that a certain g-valued discrete integral of JLdi along the boundary of any subgrid domain is zero, see [71].

Multisymplectic variational discretization for Lie-Poisson field equations with cocycle. Based on the previous result, we first give below a multisymplectic integrator for the Euler-Poincaré field equations on central extensions. Then we deduce a multisymplectic integrator for the Lie-Poisson field equations with cocycle appearing in the polysymplectic Souriau model. The next proposition is the multisymplectic version of Proposition 15.

Proposition 19. 

(Discrete Euler-Poincaré field equations on central extensions) The following are equivalent:

  • (a)
    The discrete curve(ξk,uk)is critical for the discrete Euler-Poincaré field variational principle
    δa,k^((ξka,uka),(ζka,wka))=0,
    with respect to variations
    δξka=1ΔxdLτ¯1(Δxξka)·(Adτ(Δxξka)1ηk+1aηka)δζka=1ΔydLτ1(Δyζka)·Adτ(Δyζka)1ηka+1ηkaδuka=1Δxvk+1avkaD2B(τ¯(Δxξka),e)·ηka+D1B(e,τ¯(Δxξka))·ηk+1aδwka=1Δyvka+1vkaD2B(τ¯(Δyζka),e)·ηka+D1B(e,τ¯(Δyζka))·ηka+1
    whereηkag and vkaR are arbitrary discrete fields vanishing at the boundary.
  • (b)
    The discrete curve (ξka,uka,ζka,wka) is a solution of the discrete Euler-Poincaré field equations
    1ΔxAdτ(Δxξk1a)1μk1a+ak1aθ(τ¯(Δxξk1a))μka+1ΔyAdτ(Δyζka1)1νka1+bka1θ(τ¯(Δyζka1))νka=01Δx(ak1aaka)+1Δy(bka1bka)=0 (119)
    with
    μka=dLτ¯1(Δxξka)δ^δξka+δ^δukaD2B(τ¯(Δxξka),e)νka=dLτ¯1(Δyζka)δ^δζka+δ^δwkaD2B(τ¯(Δyζka),e)aka=δ^δuka,bka=δ^δwka. (120)

Proof. 

The proof can be obtained by appropriate extension of the proof of Proposition 15, by using the multisymplectic variational setting recalled in the previous paragraph. ☐

We note that the relation between the solution of the discrete Euler-Poincaré equations and the solution (gka,αka) of the discrete Euler-Lagrange field equations on the Lie group G^ is given as

(ξka,uka)=1Δxτ1(gk+1a,αk+1a)(gka,αka)1(ζka,wka)=1Δyτ1(gka+1,αka+1)(gka,αka)1

which is explicitly given by the relations

ξka=1Δxτ1(gk+1a(gka)1),ζk=1Δyτ1(gka+1(gka)1)uka=1Δxαk+1aαkaB(gka,(gka)1)+B(gk+1a,(gka)1)wka=1Δyαka+1αkaB(gka,(gka)1)+B(gka+1,(gka)1). (121)

Similarly, the variations ηka,vka used in discrete Euler-Poincaré variational principle are related to the variations δgka, δαka used in the discrete Hamilton principle via the equality (ηka,vka)=(δgka,δαka)(gka,αka)1=(δgka(gka)1,δαka+D1B(gka,(gka)1)·δgka).

The discrete field momentum maps are computed as

JLd1((gka,αka),(gk+1a,αk+1a),(gka+1,αka+1))=1ΔxAdgkaμka+akaθ((gka)1),aka1ΔyAdgkaνka+bkaθ((gka)1),bkaJLd2((gka,αka),(gk+1a,αk+1a),(gka+1,αka+1))=1ΔxAdgkaμka+akaθ((gka)1),akaJLd3((gka,αka),(gk+1a,αk+1a),(gka+1,αka+1))=1ΔyAdgkAdgkaνka+bkaθ((gka)1),bka

from which the discrete field Noether theorem can be stated for the solutions of Equation (119).

The multisymplectic integrator for the Lie-Poisson field equations with cocycle is obtained in the next Proposition, which is the multisymplectic analogue to Proposition 16.

Proposition 20. 

(Multisymplectic integrator for Lie-Poisson field equations with cocycle) Let h:g×gR be a Hamiltonian assumed to be hyperregular, with associated Lagrangian :g×gR. Then the numerical scheme

1ΔxAdτ(Δxξk1a)1μk1a+θ(τ¯(Δxξk1a))μka+1ΔyAdτ(Δyζka1)1νka1+θ(τ¯(Δyζka1))νka=0 (122)

with

μka=dLτ¯1(Δxξka)δδξka+D2B(τ¯(Δxξka),e)νka=dLτ¯1(Δyζka)δδζka+D2B(τ¯(Δyζka),e) (123)

is a multisymplectic scheme for the Lie-Poisson field equations with cocycle

xμ+yν+adδhδμμ+adδhδνν=Θδhδμ,·+Θδhδν,·. (124)

Proof. 

This follows from Proposition 19 and the choice Equation (112). ☐

If the retraction map τ satisfies τ(ξ)τ(ξ)=e, the scheme can be rewritten in a simpler way, as done in Equation (107) in the symplectic case.

The benefit of the structure preserving properties of the proposed numerical schemes will be exploited in a future work.

5. Conclusions

In the context of artificial intelligence, machine learning algorithms use more and more methodological tools coming from physics or statistical mechanics. The laws and principles that underpin this physics can shed new light on the conceptual basis of artificial intelligence. Thus, the principles of maximum entropy and François Massieu’s notions of characteristic functions enrich the variational formalism of machine learning. Conversely, the pitfalls encountered by artificial intelligence to extend its application domains, question the foundations of statistical physics, such as the generalization of the notions of Gibbs densities in spaces of more elaborate representation such as data on homogeneous symplectic manifolds and Lie groups. The porosity between the two disciplines has been established since the birth of artificial intelligence with the use of Boltzmann machines and the problem of robust methods for calculating partition function. More recently, gradient algorithms for neural network learning use large-scale robust extensions of the natural gradient of Fisher-based information geometry (to ensure reparameterization invariance), and stochastic gradient based on the Langevin equation (to ensure regularization), or their coupling called “Natural Langevin Dynamics”. Concomitantly, during the last fifty years, statistical physics has been the object of new geometrical formalizations (contact, Dirac, or symplectic geometry, variational principles, etc.) to try to give a new covariant formalization to the thermodynamics of dynamical systems, as Lie Groups thermodynamics. Finally, the study of geometric integrators as symplectic integrators with good properties of covariances and stability (use of symmetries, preservation of invariants and momentum maps) will open the door to new generation of numerical schemes. Machine learning inference processes are just beginning to adapt these new integration schemes and their remarkable stability properties to increasingly abstract data representation spaces. Artificial intelligence currently uses only a very limited portion of the conceptual and methodological tools of statistical physics. The purpose of this paper was to encourage constructive dialogue around a common foundation, to allow the establishment of new principles and laws governing the two disciplines in a unified approach.

Author Contributions

Writing—original draft preparation, F.B., F.G.-B.; The authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Souriau J.-M. Structure des Systèmes Dynamiques. Dunod; Paris, France: 1969. [Google Scholar]
  • 2.Marle C.-M. From tools in symplectic and Poisson geometry to J.-M. Souriau’s theories of statistical mechanics and thermodynamics. Entropy. 2016;18:370. doi: 10.3390/e18100370. [DOI] [Google Scholar]
  • 3.Barbaresco F. Koszul information geometry and Souriau geometric temperature/capacity of Lie Group Thermodynamics. Entropy. 2014;16:4521–4565. doi: 10.3390/e16084521. [DOI] [Google Scholar]
  • 4.Barbaresco F. Geometric theory of heat from Souriau Lie groups thermodynamics and Koszul Hessian geometry: Applications in information geometry for exponential families. Entropy. 2016;18:386. doi: 10.3390/e18110386. [DOI] [Google Scholar]
  • 5.Barbaresco F. Higher order geometric theory of information and heat based on polysymplectic geometry of Souriau Lie groups thermodynamics and their contextures: The bedrock for Lie Group machine learning. Entropy. 2018;20:840. doi: 10.3390/e20110840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Barbaresco F. Geometric Structures of Information. Springer; Berlin, Germany: 2018. Jean-Louis Koszul and the Elementary Structures of Information Geometry; pp. 333–392. [Google Scholar]
  • 7.Barbaresco F. Souriau exponential map algorithm for machine learning on matrix Lie groups. In: Nielsen F., Barbaresco F., editors. GSI 2019. LNCS. Volume 11712 Springer; Berlin/Heidelberg, Germany: 2019. [Google Scholar]
  • 8.Barbaresco F. Lie group machine learning and Gibbs density on Poincaré unit disk from Souriau Lie groups thermodynamics and SU(1,1) coadjoint orbits. In: Nielsen F., Barbaresco F., editors. GSI 2019. LNCS. Volume 11712 Springer; Cham, Switzerland: 2019. [Google Scholar]
  • 9.Barbaresco F. Application Exponentielle de Matrice par l’extension de l’algorithme de Jean-Marie Souriau, Utilisable pour le tir Géodésique et l’apprentissage Machine pour les Groupes de Lie. Colloque GRETSI 2019. [(accessed on 20 April 2020)];2019 Available online: http://gretsi.fr/colloque2019/
  • 10.Barbaresco F. Les Structures Géométriques de l’information de Jean-Louis Koszul. Colloque GRETSI. [(accessed on 20 April 2020)];2019 Available online: http://gretsi.fr/colloque2019/
  • 11.Miolane N., Le Brigant A., Cabanes Y. Geomstats: A Python Package for Riemannian Geometry in Machine Learning. [(accessed on 20 April 2020)];2020 Available online: https://hal.inria.fr/hal-02536154/file/main.pdf.
  • 12.Berthet Q., Blondel M., Teboul O., Cuturi M., Vert J.-P., Bach F. Learning with Differentiable Perturbed Optimizers. arXiv. 20202002.08676 [Google Scholar]
  • 13.Blondel M., Martins A.F.T., Niculae V. Learning with Fenchel-Young Losses. J. Mach. Learn. Res. 2020;21:1–69. [Google Scholar]
  • 14.Wainwright M.J., Jordan M.I. Graphical models, exponential families, and variational inference. Found. Trends Mach. Learn. 2008;1:1–305. doi: 10.1561/2200000001. [DOI] [Google Scholar]
  • 15.Fréchet M.R. Sur l’extension de certaines évaluations statistiques au cas de petits échantillons. Rev. Inst. Int. Stat. 1943;11:182–205. doi: 10.2307/1401114. (In French) [DOI] [Google Scholar]
  • 16.Jaynes E.T. Information theory and statistical mechanics I, II. Phys. Rev. 1957;106:620. doi: 10.1103/PhysRev.106.620. [DOI] [Google Scholar]
  • 17.Koszul J.L. Variétés localement plates et convexité. Osaka J. Math. 1965;2:285–290. (In French) [Google Scholar]
  • 18.Koszul J.L. Exposés sur les Espaces Homogènes Symétriques. Publicação da Sociedade de Matematica de São Paulo; São Paulo, Brazil: 1959. (In French) [Google Scholar]
  • 19.Koszul J.L. Déformations des variétés localement plates. Ann. Inst. Fourier. 1968;18:103–114. doi: 10.5802/aif.279. (In French) [DOI] [Google Scholar]
  • 20.Vinberg E.B. Homogeneous convex cones. Trans. Mosc. Math. Soc. 1963;12:340–363. [Google Scholar]
  • 21.Vinberg E.B. The Theory of Homogeneous Convex Cones. Tr. Mosk. Mat. Obs. 1963;12:303–358. [Google Scholar]
  • 22.Koszul J.L. Introduction to Symplectic Geometry. Science Press; Beijing, China: 1986. (In Chinese) [Google Scholar]
  • 23.Koszul J.L. Series in Pure Mathematics. Volume 17 World Scientific Publishing; Singapore: 1994. Selected Papers. [Google Scholar]
  • 24.Souriau J.-M. Mécanique Statistique, Groupes de Lie et Cosmologie. [(accessed on 20 April 2020)];2020 Available online: https://www.academia.edu/42630654/Statistical_Mechanics_Lie_Group_and_Cosmology_1_st_part_Symplectic_Model_of_Statistical_Mechanics.
  • 25.Nencka H., Streater R.F. Information geometry for some Lie algebras. Infin. Dimens. Anal. Quantum Probab. Relat. Top. 1999;2:441–460. doi: 10.1142/S0219025799000254. [DOI] [Google Scholar]
  • 26.Gay-Balmaz F., Holm D.D. Selective decay by Casimir dissipation in inviscid fluids. Nonlinearity. 2013;26:495–524. doi: 10.1088/0951-7715/26/2/495. [DOI] [Google Scholar]
  • 27.Gay-Balmaz F., Holm D.D. A geometric theory of selective decay with applications in MHD. Nonlinearity. 2014;27:1747–1777. doi: 10.1088/0951-7715/27/8/1747. [DOI] [Google Scholar]
  • 28.Balian R., Alhassid Y., Reinhardt H. Dissipation in many-body systems: A geometric approach based on information theory. Phys. Rep. 1986;131:1–146. doi: 10.1016/0370-1573(86)90005-0. [DOI] [Google Scholar]
  • 29.Libermann P., Marle C.-M. Symplectic Geometry and Analytical Mechanics. Reidel; Kufstein, Austria: 1987. [Google Scholar]
  • 30.Abraham R., Marsden J.E. Foundations of Mechanics. Benjamin-Cummings Publ. Co.; San Francisco, CA, USA: 1978. [Google Scholar]
  • 31.Marsden J.E., Ratiu T.S. Introduction to Mechanics and Symmetry. Springer; Berlin, Germany: 2003. [Google Scholar]
  • 32.Chirco G., Laudato M., Mele F.M. Covariant momentum map thermodynamics for parametrized field theories. arXiv. 2019abs/1911.06224 [Google Scholar]
  • 33.Gay-Balmaz F., Ratiu T.S. Affine Lie-Poisson reduction, Yang-Mills magnetohydrodynamics, and superfluids. J. Phys. A Math. Theor. 2008;41:344007. doi: 10.1088/1751-8113/41/34/344007. [DOI] [Google Scholar]
  • 34.Gay-Balmaz F., Ratiu T.S. The geometric structure of complex fluids. Adv. Appl. Math. 2009;42:176–275. doi: 10.1016/j.aam.2008.06.002. [DOI] [Google Scholar]
  • 35.Gay-Balmaz F., Ratiu T.S., Tronci C. Equivalent theories of liquid crystal dynamics. Arch. Ration. Mech. Anal. 2013;210:773–811. doi: 10.1007/s00205-013-0673-1. [DOI] [Google Scholar]
  • 36.Ellis D.C.P., Gay-Balmaz F., Holm D.D., Putkaradze V., Ratiu T.S. Symmetry reduced dynamics of charged molecular strands. Arch. Ration. Mech. Anal. 2009;197:811–902. doi: 10.1007/s00205-010-0305-y. [DOI] [Google Scholar]
  • 37.Gay-Balmaz F., Holm D.D., Ratiu T.S. Variational principles for spin systems and the Kirchhoff rod. J. Geom. Mech. 2009;1:417–444. doi: 10.3934/jgm.2009.1.417. [DOI] [Google Scholar]
  • 38.de Saxcé G. Euler-Poincaré equation for Lie groups with non null symplectic cohomology. Application to the mechanics. In: Nielsen F., Barbaresco F., editors. GSI 2019. LNCS. Volume 11712 Springer; Berlin, Germany: 2019. [Google Scholar]
  • 39.Marle C.-M. Projection Stéréographique et Moments. [(accessed on 20 April 2020)];2019 Available online: https://hal.archives-ouvertes.fr/hal-02157930/
  • 40.Bismut J.-M. Mécanique Aléatoire. Volume 866 Springer; Berlin/Heidelberg, Germany: New York, NY, USA: 1981. Lecture Notes in Math. [Google Scholar]
  • 41.Lazaro-Cami J.A., Ortega J.-P. Stochastic Hamiltonian dynamical systems. Rep. Math. Phys. 2008;61:65–122. doi: 10.1016/S0034-4877(08)80003-1. [DOI] [Google Scholar]
  • 42.Bou-Rabee N., Owhadi H. Stochastic variational integrators. IMA J. Numer. Anal. 2009;29:421–443. doi: 10.1093/imanum/drn018. [DOI] [Google Scholar]
  • 43.Holm D.D. Variational principles for stochastic fluid dynamics. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 2015;471:2176. doi: 10.1098/rspa.2014.0963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gay-Balmaz F., Holm D.D. Stochastic geometric models with non-stationary spatial correlations in Lagrangian fluid flows. J. Nonlin. Sci. 2018;28:873–904. doi: 10.1007/s00332-017-9431-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gay-Balmaz F., Holm D.D. Predicting uncertainty in geometric fluid mechanics. Disc. Cont. Dyn. Syst. Ser. S. 2020;13:1229–1242. doi: 10.3934/dcdss.2020071. [DOI] [Google Scholar]
  • 46.Arnaudon A., De Castro A.L., Holm D.D. Noise and Dissipation on Coadjoint Orbits. J. Nonlinear Sci. 2018;28:91–145. doi: 10.1007/s00332-017-9404-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Günther C. The polysymplectic Hamiltonian formalism in field theory and calculus of variations I: The local case. J. Diff. Geom. 1987;25:23–53. doi: 10.4310/jdg/1214440723. [DOI] [Google Scholar]
  • 48.Gotay M.J., Isenberg J., Marsden J.E., Montgomery R., Sniatycki J., Yasskin P.B. Momentum maps and classical fields. Part I: Covariant field theory. arXiv. 1997physics/9801019v2 [Google Scholar]
  • 49.Ingarden R.S., Nakagomi T. The second order extension of the Gibbs state. Open Syst. Inf. Dyn. 1992;1:259–268. doi: 10.1007/BF02228947. [DOI] [Google Scholar]
  • 50.Ingarden R.S., Kossakowski A., Ohya M. Information Dynamics and Open Systems. Volume 86 Springer; Berlin, Germany: 1997. Classical and Quantum Approach, Fundamental Theories of Physics. [Google Scholar]
  • 51.Ingarden R.S., Meller J. Temperatures in linguistics as a model of thermodynamics. Open Syst. Inf. Dyn. 1994;2:211–230. doi: 10.1007/BF02228965. [DOI] [Google Scholar]
  • 52.Jaworski W., Lngarden R.S. On the partition function in information thermodynamics with higher order temperatures. Bull. Acad. Pol. Sci. Sér. Phys. Astron. 1980;1:28–119. [Google Scholar]
  • 53.Jaworski W. Ph.D. Thesis. Institute of Physics, Nicolaus Copernicus University; Torun, Poland: 1983. Investigation of the Thermodynamic Limit for the States Maximizing Entropy under Auxiliary Conditions for Higher-Order Statistical Moments. (In Polish) [Google Scholar]
  • 54.Jaworski W. On the thermodynamic limit in information thermodynamics with higher-order temperatures. Acta Phys. Pol. 1983;A63:3–19. [Google Scholar]
  • 55.Eriksen P.S. Geodesics Connected with the Fisher Metric on the Multivariate Normal Manifold. Institute of Electronic Systems, Aalborg University; Aalborg, Denmark: 1986. Technical Report 86-13. [Google Scholar]
  • 56.Eriksen P.S. Proceedings of the GST Workshop. University of Lancaster; Lancaster, UK: 1987. Geodesics connected with the Fisher metric on the multivariate normal manifold. [Google Scholar]
  • 57.Balian R. The entropy-based quantum metric. Entropy. 2014;16:3878–3888. doi: 10.3390/e16073878. [DOI] [Google Scholar]
  • 58.Hairer E., Lubich C., Wanner G. Geometric Numerical Integration, Structure-Preserving Algorithms for Ordinary Differential Equations. Volume 31 Springer; Berlin, Germany: 2010. (Springer Series in Computational Mathematics). [Google Scholar]
  • 59.Marsden J.E., West M. Discrete mechanics and variational integrators. Acta Numer. 2001;10:357–514. doi: 10.1017/S096249290100006X. [DOI] [Google Scholar]
  • 60.Marsden J.E., Pekarsky S., Shkoller S. Discrete Euler-Poincaré and Lie-Poisson equations. Nonlinearity. 1998;12:1647–1662. doi: 10.1088/0951-7715/12/6/314. [DOI] [Google Scholar]
  • 61.Marsden J.E., Patrick G.W., Shkoller S. Multisymplectic geometry, variational integrators, and nonlinear PDEs. Commun. Math. Phys. 1998;199:351–395. doi: 10.1007/s002200050505. [DOI] [Google Scholar]
  • 62.Lew A., Marsden J.E., Ortiz M., West M. Asynchronous variational integrators. Arch. Ration. Mech. Anal. 2003;167:85–146. doi: 10.1007/s00205-002-0212-y. [DOI] [Google Scholar]
  • 63.Demoures F., Gay-Balmaz F., Kobilarov M., Ratiu T.S. Multisymplectic Lie group variational integrators for a geometrically exact beam in R3. Commun. Nonlinear Sci. Numer. Simul. 2014;19:3492–3512. doi: 10.1016/j.cnsns.2014.02.032. [DOI] [Google Scholar]
  • 64.Demoures F., Gay-Balmaz F., Ratiu T.S. Forum of Mathematics, Sigma. Volume 4. Cambridge University Press; Cambridge, UK: 2016. Multisymplectic variational integrators for nonsmooth Lagrangian continuum mechanics.54p [Google Scholar]
  • 65.Demoures F., Gay-Balmaz F., Desbrun M., Ratiu T.S., Aragón A. A multisymplectic integrator for elastodynamic frictionless impact problems. Comput. Methods Appl. Mech. Eng. 2017;315:1025–1052. doi: 10.1016/j.cma.2016.11.011. [DOI] [Google Scholar]
  • 66.Gay-Bamaz F., Putkaradze V. Variational discretizations for the dynamics of fluid-conveying flexible tubes. Comptes Rendus Mécanique. 2016;344:769–775. doi: 10.1016/j.crme.2016.08.004. [DOI] [Google Scholar]
  • 67.Bobenko A.I., Suris Y.B. Discrete Lagrangian reduction, discrete Euler-Poincaré equations, and semi-direct products. Lett. Math. Phys. 1999;49:79–93. doi: 10.1023/A:1007654605901. [DOI] [Google Scholar]
  • 68.Iserles A., Munthe-Kaas H.Z., Nørsett S.P., Zanna A. Lie-group methods. Acta Numer. 2000;9:215–365. doi: 10.1017/S0962492900002154. [DOI] [Google Scholar]
  • 69.Bou-Rabee N., Marsden J.E. Hamilton-Pontryagin integrators on Lie groups. Found. Comput. Math. 2009;9:197–219. doi: 10.1007/s10208-008-9030-4. [DOI] [Google Scholar]
  • 70.Kobilarov M.B., Marsden J.E. Discrete geometric optimal control on Lie groups. IEEE Trans. Robot. 2011;27:641–655. doi: 10.1109/TRO.2011.2139130. [DOI] [Google Scholar]
  • 71.Demoures F., Gay-Balmaz F., Ratiu T.S. Multisymplectic variational integrators and space/time symplecticity. Anal. Appl. 2016;14:341–391. doi: 10.1142/S0219530515500025. [DOI] [Google Scholar]

Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES